International conference on bioinformatics of genome regulation
Download 3.91 Kb. Pdf ko'rish
|
- Bu sahifa navigatsiya:
- P( model M )
- P( Model M ) = ∏P i , where P i
- P( Model M, E E t )
Key words: hippocampal slices, epileptiform activity, NMDA receptor, lambertianic acid amide, picrotoxin, magnesium-free solution Motivation and Aim: An imbalance between excitatory and inhibitory mediator systems in CNS leads to the development of a number of neurodegenerative diseases. Both hy- peractivity of glutamatergic system and decreasing the activity of GABAergic leads to a moderate neuron membrane depolarization, which relieves the magnesium blockade of the NMDA receptor and results in excitotoxicity. Currently, drugs with glutamatergic mechanisms of action are being developed for the treatment of cognitive disorders and neurodegenerative processes. The aim of the present study is to test the antiepileptic effect of lambertianic acid amide (AmLA), which has been shown to be a promising compound that could be used in the synthesis of new pharmaceutical reagents [1,2]. Methods and Algorithms: The experiments were carried out on the hippocampal slices of ICR male mice using standard electrophysiological techniques. Stimulation of Schaf- fer collaterals and registration of induced population spikes of pyramidal neurons in the CA1 field were made using the glass microelectrodes, filled with saline. The epilepti- form activity in the pyramidal neurons was induced by treatment the slices by picrotoxin or by magnesium-free medium. Results: The application of AmLA in concentration of 170 μМ significantly decreased the epileptiform activity or fully terminated it. The preincubation of slices with AmLA for an hour before applying epileptiform conditions prevented the development of epi- leptiform activity. Also incubation of the slices in normal solution with AmLA did not affect the initiation of NMDA-dependent synaptic potentiation. Conclusion: Thus, AmLA (produced from Siberian cedar) helps to normalize the activ- ity of hippocampal neurons both in glutamatergic system hyperactivation and the lack of GABAergic inhibition. Acknowledgements: The work was supported by VI.35.1.5 basic project of fundamental researches of RAS and RFBR grant 15-29-04875. References: 1. T.G. Tolstikova et al. (2001) Nootropic activity of lambertianic acid derivatives. Doklady Biological Sciences, 376: 8-9. 2. T.G. Tolstikova et al. (2004) Neurotropic activity of lambertianic acid adducts with N-substituted ma- leinimides. Pharm. Chem. J., 38: 532-534. 331 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY ARGO_CUDA: A FULL-EXHAUSTIVE GPU BASED APPROACH FOR A MOTIF DISCOVERY IN THE LARGE DNA DATASETS O.V. Vishnevsky 1, 2 *, A.V. Bocharnikov 2 , N.A. Kolchanov 1, 2 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia * Corresponding author: oleg@bionet.nsc.ru Key words: degenerated oligonucleotide motif, transcription regulation, CUDA, GPU Motivation and Aim: A motif discovery in ChIP-Seq datasets remains a challenging issue. A low effectiveness of classic heuristic motif discovery approaches on a whole ChIP-Seq datasets forces the researchers to take into analysis only a fraction of top “peak” segments. Methods and Algorithms: Argo_CUDA web service is designed to process the massive DNA data. This program for detection of degenerate oligonucleotide motifs of fixed length is based on the full-exhaustive approach and uses high-performance GPU tech- nologies. Results: We compared an effectiveness of Argo_CUDA and Info-gibbs [1]. Info-gibbs is a Gibbs sampling algorithm that compares well with existing heuristic methods like MEME, BioProspector, Gibbs or GAME on both synthetic and biological data sets. The sets of random sequences of 128bp in length and of 100, 1000, and 10000 sequences in size were generated. The sample motifs of different degeneracy level were placed in a 60 percent of the sequences. The similarity between a motifs obtained by the programs and the sample motifs were measured with the average Kullback-Leiber distance (KLD). Conclusion: An effective web service for motif discovery in ChIP-Seq datasets is devel- oped. It is not as fast as a classic heuristic approaches, but it considerably reduces the restrictions on the size of the sample under analysis. Availability: wwwmgs.bionet.nsc.ru/mgs/programs/Argo_CUDA. Acknowledgements: The work was supported by the budget project 0324-2015-0003. References: 1. Defrance M, van Helden J. (2009) Info-gibbs: a motif discovery algorithm that directly optimizes in- formation content during sampling. Bioinfo rmatics 25(20):2715-22. 332 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY THE IMPACT OF HUMAN GENETIC VARIABILITY ON LIGAND-PROTEIN INTERACTIONS AND INDIVIDUAL DRUG RESPONSE P.K. Vlasov*, O. Pich I Rosello, A.V. Vlasova, F.A. Kondrashov Centre for Genomic Regulation, Universitat Pompeu Fabra, Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain * Corresponding author: peter.vlasov@crg.eu Key words: ligand-protein interactions, drugs, adverse drugs events, human genetic variability The mechanism of action for majority of modern therapeutics is directly related to the interaction of the drug molecule with its target, typically a protein. Adverse drug events (ADEs), instances when a medication causes an unintended response, contribute sub- stantially to morbidity, the cost of treatment and often appear unpredictably [1, 2]. Ge- netic variability is thought to account for a substantial fraction of the individual drug response in humans [3, 4, 5] - meanwhile our understanding of the contribution of such variability to the causes of individual drug response remains fragmented. In our project we combined genome-wide data on human single nucleotide polymorphisms (SNPs) with structural data on drug-protein complexes. Using data from 1000genome project and The Cancer Genome Atlas (TCGA) consortium, at the genome-wide scale we iden- tify all SNPs potentially affecting the proteins binding affinity for drugs, drug-like com- pounds and metabolites. Our results suggest that SNPs with a serious impact on ADE are present in most individuals, however, most of such polymorphisms are rare requiring a personalized approach to their identification. So, the genetic component for many ADEs may be highly personalized with each individual carrying a unique set of relevant SNPs. The reduction of ADEs may, therefore, primarily rely on the application of computa- tional genome analysis in the clinic rather than the experimental study of common SNPs. References: 1. Rodríguez-Monguió R, Otero MJ, Rovira J. Assessing the economic impact of adverse drug effects. Pharmacoeconomics. 2003;21(9):623-50. 2. Boeker EB, de Boer M, Kiewiet JJ, Lie-A-Huen L, Dijkgraaf MG, Boermeester MA. Occurrence and preventability of adverse drug events in surgical patients: a systematic review of literature. BMC Health Serv Res. 2013 Sep 28;13:364. 3. Evans WE, McLeod HL. Pharmacogenomics-drug disposition, drug targets, and side effects. N Engl J Med. 2003 Feb 6;348(6):538-49. 4. Wang L, McLeod HL, Weinshilboum RM. Genomics and drug response. N Engl J Med. 2011 Mar 24;364(12):1144-53. 5. Relling MV, Evans WE. Pharmacogenomics in the clinic. Nature. 2015 Oct 15;526(7573):343-50. 333 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY CHARACTERISTICS OF ACDS-GENE OF BACTERIA PSEUDOMONAS PUTIDA B-37 RESPONSIBLE FOR ACC-DEAMINASE SYNTHESIS D.S. Volkava*, S.I. Leanovich, A.A. Melnikava, E.A. Khramtsova Belarusian State University, Minsk, Republic of Belarus * Corresponding author: 95volkovads@gmail.com Key words: ACC-deaminase, acdS-gene, Pseudomonas putida, ethylene, transgenic plants Nowadays one of the main problems in agriculture to be solved is plants’ resistance to the numerous environmental factors expanding every year due to the active anthropogenic intervention. Plant’s natural reaction on stress is the production of stress hormone ethyl- ene that inhibits growth and development of plant organism during unfavorable periods of time. This process attends to decrease of the biomass production which is very un- profitable for agriculture. One of the most perspective ways to decline the level of stress ethylene is creating transgenic plants with acdS-gene coding for 1-aminocyclopropane- 1-carboxylate (ACC-deaminase). ACC-deaminase is required for increasing the concen- tration of ethylene’s precursor, ACC. This process promotes roots elongation, tuber form- ing, and biomass accumulation. The aim of current work was analysis of the primary nucleotide sequence of acdS-gene of Pseudomonas putida strain B-37 for further devel- opment of the recombinant plant cells Nicotiana benthamiana and Nicotiana tabacum. Primary nucleotide sequence of ACC-deaminase gene from P. putida B-37 was analyzed using programs available on on-line (NCBI, ExPASy, NPS@) and off-line resources. Search for homologs was conducted using BLASTn, and analysis of conserved domains – in Conserved Domains, available on-line on NCBI resource. To estimate ACC-deami- nase gene its amino acid sequence was constructed using Translator on ExPASy resource (CDS size – 1017 b.p.). Molecular weight, amino acid composition, estimated half-life, theoretical pI were valued using ProtParam on ExPASy resource. Secondary structures were forecasted by the consensus prediction from the multiple alignments using SOPMA on NPS@. Phylogenetic tree was constructed using Neighbor-joining method imple- mented in MEGA 6.0. It was shown that analyzing sequence has high homology with ACC-deaminase genes from different species of Pseudomonas genus. In the protein cod- ing for open reading frame of this gene was detected Aminocyclopropane-1-carboxylate deaminase (ACCD) domain refers to tryptophan synthase beta superfamily (fold type II). Molecular weight is above 36718.9 D, mean theoretical pI = 5.6, estimated half-life in Escherichia coli is above 10 hours. Among secondary structures were mostly predicted alpha helixes (31.95%) and random coils (34.91%). Assumptive 3D structure of ana- lyzing protein was modeled using SWISS-MODEL on ExPASy. To estimate divergence and homology of analyzing protein and ACC-deaminase proteins of different species of Pseudomonas phylogenetic tree was constructed; the highest homology with analyzing protein was obtained for ACC-deaminases from Pseudomonas putida strains. Analysis of the primary nucleotide sequence from P. putida B-37 showed the presence of ACC- deaminase gene that has high homology with the sequences coding for acdS-genes from other species and strains of Pseudomonas genus. This gene was isolated and cloned in vector with broad host range pBI121 which was used to create the recombinant plant cells N. benthamiana and N. tabacum. Such plants are assumed to be more resistive to the un- favorable environmental factors due to the ACC-deaminase synthesis. 334 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY AMPLISEQ ™: AMPLIFICATION AND SEQUENCING I.A. Volkov Department of scientific and methodological support of “Khimexpert Agency”, Moscow, Russia * Corresponding author: ilyavolkov83@gmail.com Key words: AmpliSeq, new generation sequencing (NGS), genome libraries, Ion Torrent ™, Ion S5 ™ High-performance sequencing in a relatively short period of time allows obtaining large tracts of genetic information. During the period of accumulation of knowledge it is im- portant to focus research on a limited number of targets. Ultramultiplex technology PCR followed by sequencing AmpliSeq ™ - this is the best method for mass screening of a large number of targets. AmpliSeq ™ panels are useful for sequencing indels (50 bp) and SNP, groups of genes, RNA molecules; there is a panel of sequencing exome person. The online resource Ion AmpliSeq ™ Designer (ampliseq.com) allows you to quickly create a research panel, which is a multiplex primer pool. The number of primers in a single pool may be from 12 to 6144 pairs allowing one panel genome sequence block size from 1 kb 5 mln.b.p. The primers in the same pool did not overlap, and overlap- ping primers are carried in different pools. Typically, the panel consists of two pools of primers. The panels for the analysis of point mutations and polymorphisms by the script HotSpot Designs, consist of 1 primer pool. Panels are generated automatically accord- ing to the Pipeline with maximum coverage of amplicons. Ion AmpliSeq ™ Designer automatically selects primers for amplicon of about 200 bp and 400 bp (Length of the readings on the Ion Torrent ™ platforms). There is enough only 10 ng DNA / RNA on 1 reaction to search for genetic variants and evaluation of gene expression. There is a considerable amount of ready-branded and custom AmpliSeq ™ panels that are available for use: more than 20 panels designed for the study and diagnosis of different groups of hereditary diseases. Panels include from one hundred to four hundred or more genes associated with the development of diseases. Automatic analysis of the results includes data processing software Torrent Suite ™, integrated in devices Ion Torrent ™. Evaluation of the quality of the results obtained by the coating is carried out using coverage Analysis plugin. Ion Reporter ™ software helps to interpret and use data annotations. Cloud resource Ion Reporter ™ appeals to a large number of databases, annotates the data and the results of the report in the analysis of the results. Thus, the establishment of libraries AmpliSeq ™ method, sequencing at Ion Torrent ™ platforms and analysis Torrent Suite ™ and Ion Reporter ™ makes the targeted method of sequencing a simple, convenient and fast to use in dealing with routine and research applications. 335 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY COUPLED MOLECULAR DYNAMIC AND CONTINUUM ELECTROSTATIC METHOD TO COMPUTE IONIZATION OF PROTEINS AS A FUNCTION OF PH Yu.N. Vorobjev Institute of Chemical Biology SB RAS, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia * Corresponding author: ynvorob@niboch.nsc.ru Key words: proteins, ionization constants Motivation: Accurate prediction of ionization constant of amino acids in proteins is a long standing problem. A computational method is developed to calculate the pKa val- ues of ionizable residues Asp, Glu, His, Tyr and Lys of proteins. Methods: Calculations of electrostatic energy of proteins is based on an effective ver- sion of continuum dielectric electrostatic model developed by us. A conformational flexibility is modeled by the method of molecular dynamics of 10 ns of length in an implicit water solvent. Results: The accuracy of proposed method of calculation of pKa values is estimated for a test set of proteins with experimental pKa data for 297 ionizable residues of 34 proteins. The pKa prediction test shows that 57%, 86% and 95% of all predictions have an error lower than 0.5, 1.0 and 1.5 pK units, respectively. In total, our method of pKa prediction demonstrates a good accuracy, which it treed protein flexibility by natural way as protein molecular dynamics in water solvent. Computer program is available by request from professor Yu.N. Vorobjev. The work is supported by grant RFFI #15-04-00387 and by program of RAS MCB (6.11), R SF #16-14-10038 336 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY METHODS TO CALCULATE P-VALUE OF RNA OF A DEFINITE SHAPE D.G. Vorobyev*, V.V. Solovyev Softberry Inc., Novosibirsk, Russia * Corresponding author: denis.g.vorobiev@gmail.com Key words: RNA, secondary structure, pattern, P-value, partition function A model of RNA structure (‘Model M’) is considered that is given by a tree topology, size ranges of elements and size ranges of total or subpattern length. Our program (‘Matcher’) is able to rapidly search model M in long sequences. In this work, three methods of cal- culating the frequency P(model M) of such pattern in a random sequence are suggested. Two of these methods use the idea of partition function. Energy is also taken into ac- count. Our Matcher is similar to the thermodynamic matcher from [1], except that it uses a more specifically defined shape. For each occurrence, the program outputs a best fold, its energy and the total number of possible folds corresponding to Model M. How fre- quently Model M occurs in a random sequence? In a model with fixed-sized elements, P(Model M) = ∏P i , where P i is the i-th stem probability, which values can be preliminary tabulated by sampling. When elements vary in length, the sum of probabilities of all theo- retically possible folds of Model M, the partition function S, can be calculated. The calcu- lation is performed with a dynamic programming procedure. It is clear that different folds occur interdependently and tend to form big clusters. Intuitively, dividing S by the average number C of folds in a single cluster, we get the desired P(Model M). It can be shown that the form of the distribution of C makes no difference. Thus, P(Model M) = S / M(C), where M(C) is the expectation of C. Because we are rather interested in stable structures, we should calculate the probability P(Model M, E<E t ) to meet model M with energy E under given threshold E t . Obviously, P(Model M, E<E t ) = P(Model M) ∙ P(E<E t | Model M). To obtain P(E<E t | Model M), we need a set of random occurrences of model M. Since, in general, we can’t get random occurrences by a lengthy simulation, we have to imitate them. We generate them by allowing pairs that are usually forbidden (“non-pairs”). We assign them energies with a big positive value, 50 kcal/mol, making them extremely unfavorable. This way, Matcher is able to fold any random fragment into a structure given by Model M. Matcher minimizes the number of non-pairs. In the calculated structure, we substitute non-pairs with GC, AU or GU pairs and, thus, get a sequence fragment that is able to fold according to Model M with normal complementarity rules. Having a big number of ‘ran- dom’ occurrences, we obtain both the estimate of M(C) and the energy distribution, which turns out to be near Gaussian. Thus, we can estimate P(E<E t | Model M). The 2-nd way to estimate P(Model M) is to obtain the distribution P(X) of number X of non-pairs in random sequences. The distribution turns out to be Gaussian-like as well. We need its value at X=0 meaning ‘no non-pairs’. The value of P(X=0) will be our estimate of P(Model M). The 3-rd way to estimate P(Model M) is to obtain the spectral decomposition of S. In such a way the value of S(E t ) can be calculated. At high E t values, C(E t )→1. Therefore, S(E t ) is our upper estimate of P(E<E t , Model M) made without sampling of ‘random’ occurrences. We have suggested 3 methods of calculating P-value of RNA pattern. They can be used along with context measures to search unknown ncRNAs by clustering similar structures. The software will be available at corporate web-site soon. References: 1. S. Janssen, R. Giegerich (2015) The RNA shapes studio, Bioinformatics, 31: 423-425. 337 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY HOCOMOCO COMPREHENSIVE MODEL COLLECTION AS A PRACTICAL GATEWAY TO REGULATORY MOTIF-OME OF HUMAN AND MOUSE TRANSCRIPTION FACTORS I.E. Vorontsov, Y.A. Medvedeva, V.J. Makeev, I.V. Kulakovskiy* Vavilov Institute of General Genetics, Moscow, Russia Engelhardt Institute of Molecular Biology, Moscow, Russia * Corresponding author: ivan.kulakovskiy@gmail.com Key words: transcription factor binding motifs, human, mouse, ChIP-Seq, HT-SELEX Motivation and Aim: Knowledge of sequence motifs resembling transcription factor binding sites is beneficial for a vast array of studies in regulatory genomics. Methods and Algorithms: Using ChIPMunk [1] motif discovery tools we performed de novo motif discovery in more than two thousands data sets for human and mouse transcription factors studied by ChIP-Seq (in vivo, obtained from GTRD [2]) and HT- SELEX (in vitro [3]). The newly created binding models were benchmarked against known binding patterns for mammalian transcription factors. Results: We present the latest release of the HOCOMOCO COmprehensive MOdel COl- lection [4] that provides binding models for 6 hundreds of human and almost 4 hundreds of mouse transcription factors. The primary collection provides classic mononucleotide position weight matrices (PWMs) which are linked with the hierarchical classification of transcription factors [5]. In addition, new release of HOCOMOCO includes dinucleotide position weight matrices based on ChIP-Seq data and a set of command-line java tools to facilitate motif finding with HOCOMOCO models. Conclusion: We present a complete workflow used to build HOCOMOCO and discuss practical applications of the HOCOMOCO motif-ome in regulatory genomics. Availability: HOCOMOCO and all the supporting tools are freely available online: http://hocomoco.autosome.ru and http://opera.autosome.ru. Acknowledgements: This study was supported by RFBR grants 15-34-20423 and, partly, 14-04-01838. References: 1. http://autosome.ru/ChIPMunk/ 2. http://gtrd.biouml.org 3. A. Jolma A., J. Yan, T. Whitington, J. Toivonen, K.R. Nitta, P. Rastas, E. Morgunova, M. Enge, M. Taipale, G. Wei, et al. (2013) Cell, 152:327-339.4. 4. I.V. Kulakovskiy, I.E. Vorontsov, I.S. Yevshin, A.V. Soboleva, A.S. Kasianov, H. Ashoor, W. Ba-Alawi, V.B, Bajic, Y.A. Medvedeva, F.A. Kolpakov, V.J. Makeev (2016) Nucleic Acids Res, 44(D1):D116-25. 5. E. Wingender, T. Schoeps, J. Dönitz (2013) Nucleic Acids Res, 41:D165-D170. 338 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY THE FREQUENCY, SPECTRUM AND FUNCTIONAL SIGNI- FICANCE OF MUTATIONS IN CODING SEQUENCE OF TP53 GENE IN RUSSIAN PATIENTS WITH DLBCL E.N. Voropaeva 1 *, T.I. Pospelova 2 , M.I. Voevoda 1 , V.N. Maximov 1, 2 1 Institute of Therapy and Preventive Medicine, Novosibirsk, Russia 2 State Medical University, Novosibirsk, Russia * Corresponding author: vena.81@mail.ru Download 3.91 Kb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling