College of Public Health

Costa-Rica 2019 Conference

International Conference on Mathematical Multiscale Modeling in Biology

Guanacaste, Costa Rica, October 21-24, 2019

Conference Pic


CHAIRS: 

Andrzej Kloczkowski, Ph.D., Nationwide Children’s Hospital & The Ohio State University

Grzegorz Rempala, Ph.D., The Ohio State University

Marek Cieplak Ph.D., Institute of Physics Polish Academy of Sciences


COORDINATOR:

Karen Blosser


NOTE:  All times listed are Central Standard Time (local time)

 

Monday, Oct 21- Plaza Bar (In front of the show stage)


6:00pm

●   Registration

●   Welcome

●   Reception

 


Tuesday, Oct 22 [Young Investigators Session I, Chairs: G. Rempala, M. Pietrzak, R. Jernigan]

9:00-9:30am

Colin Klaus, The Ohio State University “Multiscale Approaches in Visual Transduction:  A Novel Homogenized Diffusion Model for Cone Photoreceptors”

Chair: Rempala

9:30-10:00am

Maciej Pietrzak, The Ohio State University  “Application of information-theoretical approaches for the analysis of human immunome”


10:00-10:30am

Kevin DeMarco, University of California-Davis “A Multiscale Model for in silico Prediction of Drug-induced Arrhythmogenic Risk”

 


10:30-11:00am

Maksim Kouza, Nationwide Children’s Hospital “Assessing the Rate of Fibril Formation using Steered Molecular Dynamics Simulations


COFFEE BREAK



11:30-12:00pm

Tamara Bidone, University of Utah Multiscale Models of Integrin-based Mechanosensing”

Chair: Pietrzak

12:00-12:30pm

Qin Ma, The Ohio State University “Construction of Cell Specific Gene Co-Regulations Signatures Based on Single-Cell Multi-omics Data Analysis”


OPEN DISCUSSIONS/FREE TIME



6:00-6:20pm

John Dawson, University of California-Davis “Are Two Blockers Better Than One: Investigating Drug-induced hERG and Beta-Adrenergic Receptor Block”

 

Chair: Jernigan

6:20-6:40pm

Aiyana Emigh, University of Califormia- Davis  Predicting Arrhythmogenicity: Structural Modeling of Safe and Unsafe hERG Blockers


6:40-7:00pm

Esteban Vargas Bernal, The Ohio State University “Relating Eulerian and Lagrangian Spatial Models for Vector-host Disease Dynamics Through a Fundamental Matrix”

 


7:00-7:20pm

Divya Kernik, University of California-Davis “A computational framework to predict mechanisms of phenotypic variability and disease severity in iPSC-CMs”


7:20-7:40pm

Pranav Khade, Iowa State University Using alpha shapes to characterize protein packing and capture the multiscale aspects of allostery


7:40-8:00pm

Xianli Jiang, University of Texas- Dallas Unraveling and designing signal-response connections using direct couplings from a global coevolutionary model




Wednesday, Oct 23 [Young Investigators Session II, Regular Session I,  Chairs: A. Kloczkowski, K. Fidelis]

9:00-9:30am

Guido Espana, University of Notre Dame Using an Agent-based Model of Dengue Virus Transmission to Estimate the Impact of Vaccination Strategies in Different Settings”

 

Chair: Kloczkowski

9:30-10:00am

Grzegorz Rempala, The Ohio State University  Multiscale Stochastic  Models of Transcription/Translation


10:00-10:20am

Komla Gnona, Natiowide Children's Hospital  Developing genetic biomarkers for polygenic conditions: An Application to Neonatal Complications in Preterm Infants


10:20-10:40am

Daniel Kool, Iowa State University “Investigating the Changes in Amino Acid Properties in the Evolutionary and Multi-scale Context”


COFFEE BREAK



11:00-11:30pm

Jayajit Das, Nationwide Children’s Hospital and The Ohio State University Modeling formation of biofilms by a bacterial pathogen in vitro and in vivo


11:30-12:00pm

Leonor Saiz, University of California-Davis “Multilevel modeling of cellular networks in biomedicine”


OPEN DISCUSSIONS/FREE TIME



6:00-6:30pm

Mateusz Sikora, Max Planck Institute of Biophysics “Truss-like arrangement of cadherins is responsible for desmosome strength”

Chair: Fidelis

6:30-7:00pm

Sergei Grudinin, Inria/CNRS “Coarse-graining protein dynamics and representation for integrative structural  bioinformatics


7:00-7:30pm

Olivier Lichtarge, Baylor College of MedicineThe Skinny From All of Us to Each of Us:  A Calculus of Fitness Suggests Simple Rules for Complex Diseases


7:30-8:00pm

Marek Cieplak, Institute of Physics Polish Academy of Sciences “The Networks of the Inter-basin Traffic and Emergence of Knots in Intrinsically”


8pm GALA DINNER

Ocotal Steak House



Thursday, Oct 24 [Regular Session II, Chairs: A. Joachimiak, M. Cieplak]

9:00-9:30am

Remo Rohs, University of Southern CaliforniaMultiscale Modeling of Protein-DNA Binding Specificity”

 

Chair: Joachimiak

9:30-10:00am

Krzysztof Fidelis, University of California- Davis “Sequence-structure relationship in proteins: An amino acid structural neighborhood perspective”


10:00-10:30am

William Stewart, Nationwide Children’s Hospital and The Ohio State University “Multi-scale Modeling of Serial Measurements in Survival Data”


10:30-11:00am

Cezary Czaplewski, University of GdanskUsing coarse-grained UNRES model for simulations of protein structure and dynamics"


COFFEE BREAK



11:15-11:45pm

Ilya Vakser, University of Kansas “Docking of Protein Models”

Chair: Cieplak

11:45-12:15pm

Robert Jernigan, Iowa State University “Using High Order Coevolution Correlations to Identify Sites for Compensating Mutations to Rescue Function”


12:15-12:45pm

Andrzej Joachimiak, University of Chicago Addressing Ligand Binding Promiscuity


12:45-1:15pm

Yinglong Miao, University of Kansas “Gaussian Accelerated Molecular Dynamics (GaMD): Bridging gaps in multiscale simulations”


1:15-1:45pm

Andrzej Kloczkowski, Nationwide Children’s Hospital and The Ohio State University Modeling structure, stability and dynamics of proteins and protein aggregates”


1:45pm

Closing Remarks





 

 

Abstracts

SPEAKER: Bidone, Tamara (University of Utah)
TITLE: Multiscale Models of Cell Adhesions Mechanobiology
ABSTRACT:  Cell-substrate adhesions are specialized regions of the plasma membrane that couple the cytoskeleton network to the extracellular environment. They are critical in several cell activities, including tissue morphogenesis, homeostasis, cell migration and wound healing. Central elements of cell-substrate adhesions are transmembrane receptors, that form physical links between cell cytoskeleton and external binding partners. Integrins are transmembrane receptors that sense, resist and transmit cytoskeletal contractility to the substrate and respond to substrate rigidity via changes in conformation and ligand binding affinity. Integrin resists to contractility and this makes the receptor a cell mechanosensors, responding to mechanical cues with specific kinetic activities. Unfortunately, the biophysical and conformational mechanisms by which adhesions reinforce in response to stress are not fully understood. In this study, we developed a multiscale model of adhesions formation based on coupling Molecular Dynamics, coarse-graining techniques and Brownian Dynamics approaches in order to study mechanisms of adhesion reinforcement. Our model shows thaTITLE:  integrin activating mutants extend more easily under tension; conformational rearrangements  in proximity of the ligand binding domain of integrin underlie integrin extension; catch-bond kinetics is responsible for integrin-based mechanosensing, leading to enhanced cell spreading and traction stress. These results provide important insights into the biophysical principles and mechanisms of adhesion mechanosensing, with functional consequences on both cell and tissue physiology.


SPEAKER: Cieplak, Marek (Institute of Physics Polish Academy of Sciences)

TITLE: The networks of the inter-basin traffic and emergence of knots in intrinsically disordered proteins

ABSTRACT: The equilibrium dynamics of the intrinsically disordered proteins is thought to consist of transitions between many basins in the free energy landscape whereas structured proteins stay in the vicinity of one native basin. We demonstrate this picture explicitly by studying networks defined on the discretized plane: conformational end-to-end distances vs. radii of gyration. The bin sizes are defined by time scales that span orders of magnitude. The networks, derived from all-atom and coarse-grained molecular dynamics simulations, are nearly scale invariant. The bin representation also provides insights into the folding process

of the structured proteins and identifies regions of hindrance to folding.

Transient knotted structures are expected to arise during the volatile evolution of intrinsically disordered eptide chains. We show that this is indeed the case for sufficiently long polyglutamine tracts and α-synuclein.  The polyglutamine tracts are fused within huntingtin protein that is associated with the Huntington neurodegenerative disease. We show that the presence of knots in the tracts hinders and sometimes even jams translocation, especially when the knots are deep.  The knots in polyglutamine may form in tracts exceeding about 40 residues. This fact explains the existence of a similarly sized length treshold above which there is an experimentally observed toxicity at the monomeric level. We also discuss emergence of knots in α-synuclein. We show that these knots are either shallow or deep and last for about 3 – 5 µs, as inferred from an all-atom explicit-solvent 20 and 30 µs trajectories. We discuss conformational biasses that take place in α-synuclein and contact formation during aggregation of two chains of this protein. We then discuss several aspects of dynamics of knotted structured proteins as assessed within a Go-like model. In particular, we argue that folding under the nascent conditions is essential to fold to a structure that is deeply knotted.

In collaboration with: M. Chwastyk, Ł. Mioduszewski, B. de Aquino, A. Gomez-Sicilia, M. Carrion-Vazquez, P. Robustelli, and Y. Zhao.



SPEAKER: Cezary Czaplewski (University of Gdansk)

TITLE: Using coarse-grained UNRES model for simulations of protein structure and dynamics

ABSTRACT:  Computer modeling is routinely used for the prediction of unknown protein structures and simulation of protein dynamics. Coarse-grained models of proteins allow for an extension of the simulation time- and size-scale by orders of magnitude compared to all-atom models. The coarse-grained UNRES model is a highly reduced model of polypeptide chains, with two interaction sites per residue: united side chains and united peptide groups. The UNRES effective energy function is defined as a potential of mean force of polypeptide chains in water, which has been subsequently expanded into Kubo cluster cumulant functions, identified with the respective energy terms. UNRES simulations have been used with success in protein-structure prediction, studying protein-folding kinetics and free-energy landscapes as well as to solve biological problems. Recently, we have implemented a web server for coarse-grained simulations using the UNRES force field. Local energy minimization, canonical molecular dynamics simulations, replica exchange (REMD) and multiplexed replica exchange molecular dynamics (MREMD) simulations can be run with the current UNRES server. The user-supplied input includes protein sequence and, optionally, restraints from secondary-structure prediction or small x-ray scattering (SAXS) data, and simulation type and parameters which are selected or typed in. Oligomeric proteins, as well as those containing D-amino-acid residues and disulfide links, can be treated. The REMD/MREMD simulations are followed by the weighted histogram analysis method (WHAM) to compute the probabilities of conformations, thermodynamic quantities and ensemble averages. Cluster analysis of the ensemble at the desired temperature is carried out to construct the final models which are converted into all-atom representation. The UNRES server is available at http://unres-server.chem.ug.edu.pl. This website is free and open to all users and there is no login requirement.


SPEAKER: Jayajit Das (Nationwide Children’s Hospital and The Ohio State University)

TITLE: Modeling formation of biofilms by a bacterial pathogen in vitro and in vivo

ABSTRACT: Biofilms formed by nontypeable Haemophilus influenzae (NTHI) bacteria play an important role in multiple respiratory tract diseases. However, the mechanisms that underlie the emergence of specific NTHI biofilm structures in vitro and in vivo are unclear. We combined computational analysis tools and in silico modeling rooted in statistical physics with confocal imaging of NTHI biofilms formed in vitro during static culture and in the middle ears of chinchillas in order to identify mechanisms that give rise to distinguishing morphological features. 


SPEAKER: Dawson, John (University of California-Davis)
TITLE: Are Two Blockers Better Than One: Investigating Whether Beta Block Ameliorates hERG Blockade.
ABSTRACT:  Beta-blocking drugs are a standard therapy for the prevention of cardiac arrythmias. They bind to beta-adrenergic receptors and attenuate downstream signaling cascades, suppressing the sympathetic regulation of cardiac output. Some prescribed beta-blockers also block hERG potassium channel current (IKr), which can prolong the repolarization phase of the cardiac action potential, eliciting either beneficial anti-arrhythmic or potentially deadly arrhythmogenic effects. Such is the case of dl-sotalol, an antiarrhythmic medication representing a racemic mixture of d- and l-sotalol, which have similar hERG binding affinities. However, the l-stereoisomer has substantially stronger non-selective beta-blocking properties. In the infamous “Survival with Oral D-sotalol" (SWORD) trial, researchers found that patients taking d-sotalol exhibited a higher mortality rate than placebo due to acquired arrhythmogenicity. It can be presumed that a concomitant beta-blockade by l-sotalol in the racemic mixture conveys a protective effect in dl-sotalol administration, although this drug still possesses significant pro-arrhythmia risks. The widely used beta-blocking drug propranolol also has hERG binding affinity, but a better cardiac safety profile. The precise molecular mechanisms of the combined effect of hERG and beta-blocking activities of these and other drugs remain undetermined. Elucidating them and devising computational methods to predict their effects on cardiac rhythm can provide a new paradigm for drug cardiac safety testing and perhaps new avenues for rehabilitating drugs otherwise deemed unsafe. Our laboratory employed atomistic-scale molecular modeling and simulation techniques to assess hERG- and beta-blockade for several drugs, including dl-sotalol and propranolol. Here we present the development of human beta-adrenergic receptor and hERG channel models and their subsequent drug docking and molecular dynamics simulations to identify key energetic and kinetic parameters for their protein target interactions. We will implement these parameters into a multi-scale functional model to predict a combined effect of hERG and beta block on cardiac rhythm for different drug doses and combinations.


SPEAKER: DeMarco, Kevin (University of California-Davis)
TITLE:  A Multiscale Model for in silico Prediction of Drug-induced Arrhythmogenic Risk
ABSTRACT:  We present a multi-scale computational platform for the prediction of drug-induced arrhythmogenesis resulting from blockade of the voltage gated potassium channel encoded by the human Ether-à-go-go-related gene (hERG) in ventricular cardiomyocytes.  The hERG potassium channel gives rise to a major repolarization current (IKr) in the heart and is infamous for interacting with a diverse set of drugs that can lead to acquired long QT syndrome (LQTS) in humans, which is a surrogate indicator for potentially deadly arrhythmias. However, not all hERG blockers are proarrhythmic, and the underlying mechanisms relating hERG blockade and LQTS to emergent arrhythmogenicity are not well understood. Our multi-scale simulation framework utilizes predictions from atomistic hERG-drug simulations to generate functional kinetic model parameters that capture the dynamic electrophysiological effects of IKr block. We focused on a prototypical hERG blocking drug, dofetilide, which has a high proarrhythmic risk. Umbrella sampling molecular dynamics simulations of dofetilide in different ionization states with our open hERG model were used to compute potentials of mean force and diffusion coefficient profiles, from which we estimated drug binding affinities (Kd), as well as entry and egress rates. These data were then integrated into higher-order functional kinetic models at the channel, cell and tissue scales to identify the arrhythmia vulnerability mechanisms underlying emergent arrhythmogenic phenotypes, arising from molecular level hERG-drug interactions. We demonstrated this proof-of-concept multi-scale model by predicting differential arrhythmia markers, such as Triangulation, Reverse use dependence, beat-to-beat Instability of action potential duration, and temporal and spatial action potential duration Dispersion (TRIaD), upon application of dofetilide.


SPEAKER: Emigh, Aiyana (University of California-Davis)
TITLE:  Predicting Arrhythmogenicity: Structural Modeling of Safe and Unsafe hERG Blockers
ABSTRACT:  Human Ether-a-go-go-Related Gene (hERG) encodes a potassium-selective voltage-gated ion channel, KV11.1, essential for normal electrical activity in the heart. hERG mutations and blockage of the channel pore by drugs can cause long QT syndrome (LQTS) that predisposes individuals to arrhythmia and puts them at risk for stroke or sudden cardiac arrest. A major problem in antiarrhythmic drug therapies as well as drug development in general is the proclivity for many drugs and drug candidates to promote fatal arrhythmias through hERG blockade. However, not all hERG blocking drugs are pro-arrhythmic, and their differential affinities to discrete channel states and/or their state stability modulations have been suggested to contribute to arrhythmogenicity. In this study, we used Rosetta electron density refinement, loop rebuilding and homology modeling approaches to build complete structural models of wild-type and mutant hERG channels in open and closed states based on cryo-electron microscopy structures of hERG (pdb id: 5VA2) and the homologous EAG1 channel(pdb id: 5K7L), respectively. Here we present RosettaLigand molecular docking results for drug interactions with hERG in open and closed states. We observed that drugs with different pro-arrhythmia risks like dofetilide and nifekalant have distinct binding poses and affinities in closed and open hERG cavities, which might affect channel gating.  Also, Tyr652 residue in hERG pore-lining S6 helix was found to be crucial for stabilizing drug - channel binding in agreement with experiment. Our results provide structural insights into molecular mechanisms of state-dependent drug interactions with hERG that play a key role in differentiating safe and unsafe hERG blockers.


SPEAKER: Espana, Guido (University of Notre Dame)
TITLE:  Using an agent-based model of dengue virus transmission to estimate the impact of vaccination strategies in different settings
ABSTRACT:  Dengue is the fastest spreading vector-borne disease worldwide causing around 100 mil- lion cases every year. Dengue virus transmission is highly heterogeneous and its limited to temperate climates where its main vector of transmission, the Aedes aegypti mosquitoes, can survive. Even though a vaccine (CYD-TDV) has been develop against dengue, its efficacy is heterogeneous in the population. Hence, the World Health Organization has recommended to use serological testing prior to vaccination to administer the vaccine only to individuals that would benefit from it. To estimate the public health impact of the dengue vaccine, mathematical models should include sources of heterogeneity that could affect indirect protection from the vaccine. We used a stochastic, agent-based model to realistically account for the various factors of heterogeneity involved in dengue virus trans- mission. The model parameters were calibrated using more than a decade of empirical studies in the city of Iquitos, Peru. We used this model in various settings to simulate 40 years of dengue virus transmission, followed by 10 more years of routine vaccination. We calculated the potential benefits from the vaccine assuming different levels of the accu- racy of serological screening to detect the target population. In addition, we estimated the cost-effectiveness of these measures for three specific countries: the Philippines, Brazil, and Puerto Rico. Our results suggest that this vaccination strategy could benefit some coun- tries, particularly those with high transmission intensity. From a public payer perspective, pre-vaccination screening was cost-effective in scenarios with high screening sensitivity. In conclusion, vaccination with CYD-TDV following serological screening could have a posi- tive impact in certain epidemiological settings, provided that screening is highly specific (to reduce individual harm), at least moderately sensitive (to increase public health benefits), and inexpensive.


SPEAKER:  Fidelis, Krzysztof (University of California-David)

TITLE:  “Sequence-structure relationship in proteins: An amino acid structural neighborhood perspective”

ABSTRACT: Recent CASP13 assessments point to dramatic advancements in the protein residue-residue contact prediction and in modeling of protein structure itself. The progress is especially noteworthy for proteins with no available structural templates, where until now the most reliable technique, comparative modeling, cannot be used. But how do the recent advances relate to the original Anfinsenian stating of the protein folding problem over 50 years ago, where the amino acid sequence itself purportedly contains all the necessary information to answer the folding question? In the current methods, what is the role of the co-evolution based contact prediction and how to explain that of the artificial intelligence /deep learning? We take initial steps to answering these questions by looking at levels of the amino acid sequence similarity in local structures of proteins that are not necessarily related by homology. Using our previously developed formalism of structure descriptors, we examine levels of local structure ubiquity among protein folds. Then, for each local structure type, we assess whether sequence signal can be seen between representatives taken from different protein folds. We also examine whether local structure libraries can be used to model any new protein, i.e. if the structural coverage by library members is sufficient to assemble a complete model. Finally, we present an analysis of the CASP13 template-free and template-based modeling targets in terms of library coverage and the levels of sequence signal measured to the nearest correctly aligned library member.


SPEAKER: Gnona, Komla (Nationwide Children’s Hospital)
TITLE: Developing genetic biomarkers for polygenic conditions: An Application to Neonatal Complications in Preterm Infants
ABSTRACT:  Preterm birth is the leading cause of mortality in young children with more than a million deaths per year worldwide arising from neonatal complications (NC). Our most recent work (and the work of others) suggests that NC is heritable and the genetic burden of as many as 5,000 genes may be influencing NC. However, to date, no one has identified a single gene for this condition. This suggests that most of the genes involved in NC have small effects.  As such, we hypothesize that which genes get selected for biomarker construction don’t really matter. To test this hypothesis, we (1) examined three different prediction approaches: random forest, lasso, and polygenic risk scores, applied to the whole-exome data of 131 preterm infants; (2) looked at the different types of variants (e.g. exonic, intronic, regulatory) and the average burden in two disjoint gene sets: all genes with p<0.2 (ie, weakly associated) and all genes with p>0.2 (ie, non-associated).
We found that most of the associated genes tend to be intronic, suggesting that their biological effects are likely small, and confirming that NC is polygenetic. We also found that all the AUCs from the various prediction algorithms look the same, even though vastly different computational approaches were used. In conclusion, genetic biomarkers for polygenic traits can be improved without knowing exactly the genes involved, and replicating an association before building a genetic biomarker for such traits may not be prudent.


SPEAKER: Jernigan, Robert (Iowa State University)
TITLE: Using High Order Coevolution Correlations to Identify Sites for Compensating Mutations to Rescue Function

ABSTRACT:  Dense packing within globular proteins manifests itself in strong interdependencies for amino acid substitutions leading to strong correlations among them. The available sequence data is growing rapidly, and it can provide substantial important information regarding function and design. We use pair correlations in the multiple sequence alignments to improve protein sequence matching to bring sequence matches into congruence with sequence matches. We use mutual information to obtain the pairwise correlation and apply a method called symmetrized differential interaction information to measure the triplet correlation from the available large protein sequence information. Combining the results from these two methods, we are able to extract the top correlated amino acids acting as hubs for allosteric communication pathways. This method is applied to a variant of Bruton’s tyrosine kinase (BTK) linked to a disease, severe XLA. We previously showed that this deleterious mutant adversely affects the functional dynamics of the BTK kinase domain, and thus presumably the function. In another approach we utilize comparative cross-species comparisons in sequence alignments to identify the occurrence of a specific disease mutant in the native sequence of another organism, and uncover what other substitutions are likely to compensate for this change. We suggest this may be a way to identify candidate substitutions in the sites most strongly correlated with the disease mutant to restore the native dynamics in the presence of the deleterious mutation.


SPEAKER: Jiang, Xianli (University of Texas-Dallas)
TITLE: Unraveling and designing signal-response connections using direct couplings from a global coevolutionary model
ABSTRACT:  Coevolution plays a fundamental role in determining folding, structure, interactions and functionality of proteins. Structurally or functionally related residues coevolve during the evolutionary history due to selective pressure. Direct coupling analysis (DCA) model for coevolutionary analysis has demonstrated outstanding performances in predicting coupled residues key to structural or functional dynamics.We have used coevolutionary traits among LacI homologs to develop a model for predicting compatibility between an environmental sensing module (ESM) and a DNA recognition module (DRM). The inter-modular coevolutionary signature arises from the constraints on allosteric dynamics, which involve specific residue-residue associations. Engineering allosteric transcriptional repressors containing distinct but compatible ESM and DRM has the potential to flexibly construct new connections for environmental signals and biological responses. Our predictions accurately agree with experiments for a collection of engineered repressors. The engineered repressors are further explored in synthetic biology by developing a system of genetic circuits called multiple toggle switches with a master OFF signal. Coevolutionary model also has unraveled signal-response connections in two- component system where direct couplings have been used to evaluate binding specificity of protein-protein interactions. The mode accurately predicts partnerships not only between cognate pairs but also cross-talk, leading to a better understanding of specific phenotypes in Enterococcus Faecalis in response to antibiotic treatments.


SPEAKER: Kool, Daniel (Iowa State University)
TITLE:  Investigating the Changes in Amino Acid Properties in the Evolutionary and Multi-scale Context
ABSTRACT:  Understanding protein mutations is critical for comprehending evolution, drug development, personalized medicine, genomics and much more throughout biology. Classification here is performed by using features, which can be generated/extracted from data, or chosen by an investigator. One kind of feature set showing initial promise not only for classification but also finding out what the basis is for each classification utilizes the physical properties of amino acids. It has been shown that there are periodicities and patterns in protein sequences when they are encoded as these physical property values. An encoding of a sequence where each residue becomes a vector of the amino acid’s physical property values provides much more information for sequence similarity measures and can serve as feature components for classifiers. Once a classifier is constructed and trained, it can then be analyzed for feature importance. One of the best methods for this is the permutation importance algorithm. For each of the features, the values are randomly permuted, and then the prediction error is recalculated. The change in the error determines the feature’s importance for any specific model, and this technique can be used on any trained classification or regression model. Classification of protein mutants based on physical properties combined with permutation importance can reveal information about why or how each mutation leads to changes in structure, dynamics or function. Extensions include investigating the correlations between properties at residues seen to be correlated in the sequence alignments.  This method can be used on any scale, with any set of features, making it a potentially powerful tool for protein analysis.


SPEAKER: Grudinin, Sergei (Inria-CNRS)

TITLE: Coarse-graining protein dynamics and representation for integrative structural bioinformatics

ABSTRACT:  Our team develops novel algorithms for integrative structural bioinformatics, which are based on a combination of physics-based and knowledge-based approaches. I will briefly present computational pipelines developed in our team. I will also describe in more detail 3 coarse-graining approaches. The first one is a set of novel knowledge-based potentials for protein interactions. The novelty of our approach is the coarse-grained backbone-only representation of the protein. The second one is a method to compactly represent protein essential dynamics based on the nonlinear normal mode analysis. Our method, called NOLB, allows very CPU and memory-efficient computations that can be run in real time on a laptop, and produces rather large and realistic structural deformations.Finally, we developed a novel method to construct a multi-level representation of protein flexibility. This method is based on a binary tree representation of rigid domains, which are computed by a comparison of the dynamics of rigid bodies with the dynamics of individual protein's atoms.


SPEAKER: Kernik, Divya (University of California-Davis)

TITLE: A computational framework to predict mechanisms of phenotypic variability and disease severity in iPSC-CMs
ABSTRACT: There is a profound need to develop a strategy to predict patient-to-patient vulnerability in the emergence of cardiac arrhythmia. A promising in vitro method to address patient- specific proclivity to cardiac disease utilizes induced pluripotent stem cell-derived cardiomyocytes (iPSC-CMs). A major strength of this approach is that iPSC-CMs contain donor genetic information and therefore capture patient-specific genotype-phenotype relationships. A cited detriment of iPSC-CMs is the cell-to-cell variability observed in electrical activity. We postulated, however, that cell-to-cell variability may constitute a strength when appropriately utilized in a computational framework to build in silico cell populations that can be employed to identify phenotypic mechanisms and pinpoint key sensitive parameters. Thus, we have exploited variation in experimental data across multiple laboratories to develop a computational framework to investigate subcellular phenotypic mechanisms in healthy iPSC-CMs. The resulting population of cellular models predicts robust inter-subject variability in iPSC-CMs and recapitulates the experimentally observed range of whole-cell behaviors. We used this population-based framework to analyze a panel of genetic mutations related to Long QT Syndrome 1 (LQT1). Furthermore, we used our iPSC-CM model population to predict the severity of LQT1 mutations and explore patient-specific susceptibility to LQT1. LQT mutations are known to have vastly different cardiac effects in different patients, and these phenotypic differences are recapitulated in our model population.


SPEAKER: Khade, Pranav (Iowa State University)
TITLE: Using alpha shapes to characterize protein packing and capture the multiscale aspects of allostery
ABSTRACT: There are only limited methods available to study the global motions of the protein such as their hinge motions and shear motions. These motions take place over a broad range of time scales, from microseconds to seconds; however, molecular dynamics methods can only model easily the motions occurring on the time scale from picoseconds to microseconds, and in addition such simulations require that replicas be run. Thus, extracting the meaningful slow motions is difficult. Hence, there is a need to explore other ways to utilize the protein structures to model/ predict the global/large scale motions of the protein. The important motions depend on the protein packing as a multiscale phenomenon which can influence the global or local motions in the proteins. In order to model the protein packing, an efficient, robust and simple mathematical method is needed. In this study, we have explored alpha shapes (a subset of Delaunay tessellations) for the protein backbone coordinates as a model of protein packing. We demonstrate that the method can predict the protein hinges which are responsible for the global motions of the proteins. The method is named PACKMAN (PACking and Motion ANalyses). From a literature survey for randomly selected protein structures and another 367 protein structure pairs having known open and closed conformations, PACKMAN is able to predict hinge locations accurately on the proteins for both the open and closed conformations outperforming existing hinge predicting methods that usually require either the open form or both conformations to predict the protein hinges. The successful implementation of the mathematical method to model a multiscale phenomenon such as protein packing to predict the hotspots of the global/large scale motions in proteins shows promise for the further exploration of other types of protein and supramolecular dynamics.


SPEAKER: Klaus, Colin (The Ohio State University)
TITLE:  Multiscale Approaches in Visual Transduction: A Novel Homogenized Diffusion Model for Cone Photoreceptors in Dim and Bright Light
ABSTRACT:  Mammals have two types of photoreceptors, rods and cones. While rods are exceptionally sensitive and mediate vision at very low illumination levels, cones operate in daylight and are responsible for the bulk of visual perception. Through the mathematical techniques of homogenization and concentrated capacity, a novel diffusion model for cone visual transduction is presented along with preliminary sensitivity analysis of model parameters. This work also adapts a recently published dim light model of cone transduction to the case of high intensity light. This study is an interdisciplinary undertaking by mathematicians and pharmacologists at The Ohio State University, CNR IT, and Vanderbilt University.


SPEAKER:  Kloczkowski, Andrzej (Nationwide Children’s Hospital & The Ohio State University

TITLE:  Modeling structure, stability and dynamics of proteins and protein aggregates

ABSTRACT:  Recent progress in modeling structure and dynamics of proteins and protein aggregates will be reviewed. Recent advancements in structure prediction such as development of better potentials and force-fields (including multibody potentials), and improved modeling of free energies will be presented. We significantly improved protein structure evaluations by considering the effects of amino acid variants on protein stability, and we have shown that the outliers in stability are typically aberrant proteins.  Recently, we have made significant progress in understanding protein stability and dynamics by computing protein free energies extracted from structures to account for the high packing densities in proteins, including important novel evaluations of protein entropies. Preliminary results show large improvements over previous potentials for assessing protein stabilities. Our results demonstrate that there are substantial gains in specificity from combining the sequence with structural and protein dynamic data. These developments may significantly impact the advancement of precision/personalized medicine.


SPEAKER: Kouza, Maksim (Nationwide Children’s Hospital)
TITLE:  Assessing the Rate of Fibril Formation using Steered Molecular Dynamics Simulations
ABSTRACT:  Fibril formation resulting from protein misfolding and aggregation is a hallmark of several neurodegenerative diseases such as Alzheimer’s and Parkinson’s diseases. Despite much progress in the understanding of the protein aggregation process, the factors governing fibril formation rates and fibril stability have not been fully understood. Using all-atom explicit solvent molecular dynamics simulations with the GROMOS43a1 force field for full-length amyloid beta peptides Aβ40 and Aβ42 and truncated peptides, we demonstrated that kinetic stability can be accessed via mechanical stability in such a way that the higher the mechanical stability or the kinetic stability, the faster the fibril formation1,2. This result opens up a new way for predicting fibril formation rates based on mechanical stability that may be easily estimated by steered molecular dynamics.


SPEAKER: Joachimiak, Andrzej (University of Chicago)
TITLE: 
Addressing Ligand Binding Promiscuity

ABSTRACT: Enzymes are typically described as molecular machines selectively recognizing a ligand and cofactors and performing very specific chemical transformations. But in reality, this is not even close to being accurate. Recent studies showed that proteins are capable of recognizing multiple substrates and enzymes can perform different, sometimes unrelated chemical reactions. These activities can provide the starting point for evolution of new functions and pathways. Promiscuous activities available in nature appear to be very common and extensive. This has important practical implications as we are facing the rise of antibiotic resistance and enzymes rapidly evolving to recognize and neutralize new “synthetic” derivatives of antibiotic. Promiscuous enzymes and evolution of secondary metabolisms is important for synthetic biology efforts to construct novel pathways using catalysts derived from promiscuous enzymes via directed evolution. Structure-guided mutagenesis of active sites at the substrate-binding pocket is responsible for altering the specificity and promiscuity toward substrates and the diversity of products.

High-resolution structures obtained through x-ray crystallography can reveal specific protein-ligand interactions and when combined with functional assays and numerical simulation can construct a model of the enzyme catalytic pathway and account for recognition structurally unrelated ligands.  This can have pronounced impact on designing new, better enzyme inhibitors and eventually drugs.

Several examples will be provided including metallo-β-lactamases, inosine monophosphate dehyrogenases, solute-binding proteins of ABC transporters and transcription factors. Structures of b-lactamases belonging to four classes were obtained with the same antibiotic and revealed differences in binding to their respective active sites. The structures with hydrolyzed antibiotics showed differences in the product binding to the same enzyme. Inosine monophosphate dehydrogenase binds multiple compounds and was discovered to catalyze new chemical transformation. Solute-binding proteins of ABC transporters can recognize multiple ligands and transcription factors respond to different compounds.

This project has been funded in whole or in part with Federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contracts No. HHSN272201700060C.


SPEAKER:  Lichtarge, Olivier (Baylor College of Medicine)

TITLE:The Skinny From All of Us to Each of Us:  A Calculus of Fitness Suggests Simple Rules for Complex Diseases

ABSTRACT:  The genotype to phenotype relationship shapes evolution in the long run and human health in all of us. Rooted in random genetic errors and cryptic past selection forces, it seems intractably complex.  Nevertheless, a shift in perspective to the proper scale and resolution suffices for basic laws of calculus and statistical mechanics to describe genetic moves in fitness landscapes with simple yet general rules. From cancer to Alzheimer’s disease, this shift in perspective not only yields a synthesis between molecular and population genetics but also reveals new disease genes and plausible strategies for risk assessment and clinical decisions tailored to each of us. 


SPEAKER: Ma, Qin (The Ohio State University)
TITLE:  Analyzing Co-regulated Gene Modules
ABSTRACT:  A gene’s expression is regulated by a set of TRSs, including transcription factors (TFs), microRNAs, long non-coding RNA, and epigenomic regulators. Genes co-regulated by the same TRSs are collectively referred to as a co-regulated gene module (CRGM). Accurately identifying CRGMs is a key to understanding gene regulation and the underlying TRSs and the prerequisite to discover the genetic and epigenetic machinery that leading to different cell types. Single-cell multi-omics sequencing technology can detect both epigenetic and genetic information from an individual cell. Intuitively, the joint analysis of these data provides an unprecedented opportunity to predict all the encoded CRGM at the single-cell level, which in turn, can substantially improve the state-of-the-art performance in elucidating cellular heterogeneity. The successful identification of underlying cell-type-specific CRGM can substantially improve the elucidation of heterogeneous gene regulatory mechanisms, and determine the identity of cell types in immunotherapy. It will also benefit the design and assessment of therapeutics in complex diseases.


SPEAKER: Miao, Yinglong (University of Kansas)

TITLE:  Gaussian accelerated molecular dynamics (GaMD): Bridging gaps in multiscale simulations

ABSTRACT:  Gaussian accelerated molecular dynamics (GaMD) is a robust computational technique that provides simultaneous unconstrained enhanced sampling and free energy calculations of large biomolecules. GaMD works by adding a harmonic boost potential to smooth biomolecular potential energy surface and reduce the energy barriers. Without the need to set predefined collective variables, GaMD is advantageous for studying complex biological processes with no constraints applied in the simulations. Furthermore, because the boost potential follows a Gaussian distribution, the original free energy profiles of biomolecules can be recovered through cumulant expansion to the second order (“Gaussian approximation”). GaMD allows us to more accurately characterize biomolecular dynamics. Compared with conventional molecular dynamics, GaMD accelerates biomolecular simulations by orders of magnitude. This allows us to bridge gaps in multiscale simulations of biomolecular dynamics. Here, new method developments of GaMD and our recent applications of GaMD in accelerated simulations of ligand binding, peptide-protein binding and protein-protein/membrane interactions will be presented.


SPEAKER: Pietrzak, Maciej (The Ohio State University)
TITLE:  Application of information-theoretical approaches for the analysis of human immunome.
ABSTRACT:  The cellular immunome (the cellular components of the immune system) is a vastly complex and highly regulated structure that protects against infection and preserves good health. An ever-increasing number of cell types comprise the immunome, and these are being defined through increasingly complex patterns of antigen expression. Recent technological advances over the last decade now permit high-dimensional examination of the immunome cellular components, making possible in-depth analysis of the cellular immune system. The analytical frameworks for comprehensive assessment of immunome profiles are not well established yet. The complexity of data produced by the flow cytometric analysis of the human immunome requires computational approach that would allow to detect not only the large-scale changes in the dominant components of the immunome, but, what seems more important, consistent differences in the non-abundant components and relations between them. In this talk, I will present the preliminary results of a recent work on development of information-based feature selection algorithm to detect alternations of human immunome profiles.


 

SPEAKER: Rempala, Grzegorz (The Ohio State University)
TITLE:  Multiscale Stochastic  Models of Transcription/Translation
ABSTRACT:  The formalism of stochastic reaction networks (SRNs) provides building blocks for many models in modern mathematical biology both at molecular and population levels (e.g., gene transcription or epidemic outbreak). In particular, the SRNs allow to naturally incorporate both delay and multi-scale phenomena into computational models. In this talk I will provide a brief overview of the applications of SRNs to modeling basic molecular mechanisms emphasizing some recent collaborative work on multi-scaling analysis for simple stochastic gene transcription.


SPEAKER:  Rohs, Remo (University of Southern California)

TITLE: Multiscale Modeling of Protein-DNA Binding Specificity

ABSTRACT: Many structures of protein-DNA complexes have been solved and high-throughput binding assays were developed because structural biology and genomics researchers were equally puzzled by the question of how proteins bind DNA with high specificity. However, there was little communication between these two fields of research. High-throughput DNA shape prediction established a cross talk between both fields. The primary goal of DNA shape analysis remains the quest for mechanistic insights into protein-DNA readout modes based on sequencing data without the need of structure determination. A plethora of high-throughput sequencing data is available from a variety of experimental approaches. In contrast, structural biology, albeit being an atomic-resolution approach, often reports the binding of a single protein to only a single DNA target. A number of studies incorporated DNA shape features in the quantitative modeling of binding specificities. These studies emphasized the importance of interactions between nucleotide positions within a binding site and its flanks, although the definitions of DNA sequence versus shape still differ in the structural biology and genomics fields. Next steps discussed in the talk focus on understanding biological phenomena such as purifying selection, additional layers of binding specificity determinants such as DNA methylation and histone modifications, and new computational approaches including artificial intelligence and quantum computing.



SPEAKER: Saiz, Leonor (University of California-Davis)

TITLE:  Multilevel modeling of cellular networks in biomedicine

ABSTRACT: One of the main challenges of current biology is to integrate the available genetic, biochemical, molecular, and structural information into a physiologically relevant description of cellular and supracellular processes. Computational modeling has emerged as a promising tool for transforming molecular detail into a more integrated form of understanding complex behavior. In this talk, I will draw examples from the recent work of my laboratory to discuss the
state-of-the-art on modeling of biological processes at different temporal and spatial scales, going from molecular interactions to the assembly of macromolecular complexes on DNA and the stochastic dynamics of the resulting gene regulation networks. I will discuss our recent
results on the study of genetic and signal transduction networks including gene regulation by the RXR nuclear receptor and signalprocessing in the TGF-beta pathway.


SPEAKER: Mateusz Sikora (Max Planck Institute of Biophysics)

TITLE:  Truss-like arrangement of cadherins is responsible for desmosome strength

ABSTRACT:  Desmosomes are long-lasting cell-cell junctions that endow mature tissues with mechanical stability. The core of the desmosomal adhesion is formed by cadherins, specialised calciumdependent transmembrane proteins. Together with other adaptor proteins, the cadherins connect cortices of neighbouring cells. Robustness to external stress comes from a particularly dense arrangement of cadherins, which form characteristic electron-dense structures visible in EM micrographs. However, the structural details of desmosomal cadherin assemblies remain controversial despite their relevance for various diseases. To address and resolve these controversies, we performed large-scale molecular dynamics simulations of a different 3D cadherin arrangements in the desmosome. We found that only an antiparallel, truss-like arrangement of cadherins can explain both the mechanical robustness and the spacing observed between plasma membranes in the desmosome. We validated our predictions by cryo-electron tomography of the desmosomes from mouse liver.


SPEAKER: Stewart, William (Nationwide Children’s Hospital and The Ohio State University)

TITLE:  Multi-scale Modeling of Serial Measurements in Survival Data
ABSTRACT: An extremely important quantity that is difficult to predict accurately is 1-year post-transplant survival of candidates on the national waiting list for organ transplantation. Donor organs are a precious human resource, and the demand remains higher than the supply, especially for lungs. The current system for allocating donor lungs is based on a measure (denoted LAS) that balances post-transplant survival against medical need. In general, the highest priority is given to the candidate with the highest LAS. However, because donor organs become available at random times in random geographical locations, each candidate’s LAS is calculated frequently. The calculations do not occur at scheduled times, and the intervals between calculations are not regular. As such, these serial measurements (ie, LAS trajectories) are inherently noisy, but they may also contain new and relevant information that is not necessarily captured by LAS at a single time point. To test this hypothesis, we propose a mathematical concept that seeks to improve predictions of post-transplant survival using each candidate's LAS trajectory. Ideally, one would like to integrate both the cross-sectional and longitudinal data simultaneously.


SPEAKER: Vakser, Ilya (University of Kansas)

TITLE:  Docking of protein models
ABSTRACT: 
Modeling is important for structural characterization of macromolecular interactions. The accuracy of modeling, in general, is still less than that of the experimental approaches. To test the ability of docking procedures to predict the structure of protein model-model complexes, we built docking benchmark sets of protein models, with accuracy assessed by the modeling rank, for protein complexes from the Dockground docking benchmark and GWIDD database. The sets are different from previously generated ones, in which arrays of models were generated according to RMSD from the native structure. The current sets rather reflect the real case modeling/docking scenario where the accuracy of the models is assessed by the modeling procedure, with no reference to the native structure, which would be unknown in practical applications. The sets are available at http://dockground.compbio.ku.edu. The docking success rate was correlated with the modeling parameters for individual proteins. The exhaustive benchmarking was also done for protein pairs in which proteins are modeled at different levels of accuracy (e.g. low accuracy "receptor" and high accuracy "ligand"), again reflecting the real-case scenario of practical applications. The results show that docking techniques are applicable to protein models in a broad range of modeling accuracy. The study provides guidelines for practical applications of docking to protein models.


SPEAKER: Vargas Bernal, Esteban (The Ohio State University)

TITLE:  Relating Eulerian and Lagrangian spatial models for vector-host disease dynamics through a fundamental matrix
ABSTRACT: We explore the relationship between Eulerian and Lagrangian approaches for modeling movement in vector-borne diseases for discrete space. In the Eulerian approach we account for the movement of hosts explicitly through movement rates captured by a Laplacian matrix L. In the Lagrangian approach we only account for the proportion of time that individuals spend in foreign patches through a residence matrix P. We establish a relationship between an Eulerian model and a Lagrangian model for the hosts in terms of the matrices L and P. We explore the consistency of this relationship and in each scenario (inconsistent and consistent) and we look at the comparison of quantities like final outbreak size and basic reproduction number for reasonable parameters. In the case of two-patches model, we give an insight on how to relate the residence times and the flow rates according to how big the removal rates are with respect to the movement rates. We also observe how even in the inconsistent case, we could get similar values of the basic reproduction number and final outbreak size.