International conference on bioinformatics of genome regulation
Download 3.91 Kb. Pdf ko'rish
|
- Bu sahifa navigatsiya:
- Key words
Key words: DNA damage, DNA repair, BER, MMR, epigenetic demethylation Motivation and Aim: Active DNA demethylation (ADDM) in mammals occurs via hy- droxylation of 5-methylcytosine (5mC) by TET and/or deamination by AID/APOBEC family enzymes. The resulting 5mC derivatives are removed through the base excision repair (BER) pathway (1). At present, it is unclear how the cell manages to eliminate closely spaced 5mC residues while avoiding generation of double-strand breaks and whether alternative DNA repair pathways participate in ADDM (2). Recently, was pro- posed, that cytidine deaminase induced damage can lead to indirect ADDM through the BER or non-canonical DNA mismatch repair (ncMMR) (3). Methods and Algorithms: We used a phagemid DNA containing oxidative base lesions and methylated sites are used to examine the involvement of various DNA repair path- ways in ADDM in murine and human cell-free extracts. Results: We demonstrate that, in addition to short-patch BER, 5-hydroxymethyluracil and uracil mispaired with guanine can be processed by ncMMR and long-patch BER with concomitant removal of distant 5mC residues. Furthermore, the presence of mul- tiple mispairs in the same MMR nick/mismatch recognition region together with BER- mediated nick formation promotes proficient ncMMR resulting in the reactivation of an epigenetically silenced reporter gene in murine cells. Conclusion: These findings suggest cooperation between BER and ncMMR in the re- moval of multiple mismatches that might occur in mammalian cells during ADDM. This work was supported by Russian Science Foundation 14-24-00093; Fondation ARC PDF20101202141. References: 1. S.Moréra, I.Grin, A.Vigouroux, S.Couvé, V.Henriot, M.Saparbaev, A.A.Ishchenko (2012) Biochemical and structural characterization of the glycosylase domain of MBD4 bound to thymine and 5-hydroxy- methyuracil-containing DNA, Nucleic Acids Res., 40(19): 9917-9926. 2. D.O.Zharkov, I.R.Grin (2012) Mechanisms of somatic mutagenesis and active DNA demethylation in epigenetic regulation, In: Epigenetics, S.M.Zakian, V.V.Vlasov, E.V.Dementieva, 31-52 (Novosibirsk: Publishing House of SB RAS). 3. R.D.Kasymov, I.R.Grin, A.V.Endutkin, S.L.Smirnov, A.A.Ishchenko, M.K.Saparbaev, D.O.Zharkov (2013) Excision of 8-oxoguanine from methylated CpG dinucleotides by human 8-oxoguanine DNA glycosylase, FEBS Lett., 587(18): 3129-3134. 93 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY CONSERVATION LEVEL OF THE KEY MEIOTIC PROTEINS REFLECTS THEIR FUNCTION AND INDEPENDENT EVO- LUTION IN DIFFERENT LINEAGES OF EUKARYOTES T.M. Grishaeva*, Yu.F. Bogdanov Vavilov Institute of General Genetics RAS, Moscow, Russia * Corresponding author: grishaeva@vigg.ru Key words: proteins, amino acid sequence, functional domains, recombination, cohesins, synaptonemal complex, conservation, evolution Motivation and Aim: Meiosis is conserved in terms of its cytological appearance in the evolutionarily row of Eukaryota. Meiosis requires proteins participating in (1) recombi- nation and (2) chromosome remodeling and synapsis. Among them are meiosis-specific proteins and proteins shared with mitosis. According to a widespread opinion, group 1 proteins are more conserved than group 2 proteins, mainly due to similarity of their functional domains. Published estimates are rarely presented in a quantitative form. We analyzed the amino acid sequences of some key meiotic proteins from both of the groups in model eukaryotic species and quantitatively estimated the level of their evolutionary conservation. Methods and Algorithms: Protein Blast was used to perform pair-wise amino acid se- quence comparisons for orthologs of seven proteins in eight eukaryotic species repre- senting fungi, plants, invertebrates and vertebrates. Because the similarity index Score depends on the protein size, percent similarity was estimated for each protein, taking its similarity for itself as 100%. Results: The recombination mediators RAD51 and DMC1 and the mismatch repair pro- tein MLH1 were shown to be moderately conserved (from 13% to 76% similarity be- tween proteins from different phyla), while meiosis-specific endonuclease SPO11 had only a low similarity in the tested proteomes (5-32%). Among structural chromosomal proteins, cohesins RAD21 and REC8 demonstrated low conservation (0-25% and 0-5%, respectively). Synaptonemal complex components possessing the HORMA domain (Hop1 and its orthologs) displayed only a minor similarity between the taxa (2-14%). In a separate study, the level of conservation was estimated for functional domains of some proteins because they are thought to be more conserved than the full-size protein molecules. Unexpected results were obtained. The longer (TOPRIM_topoIIB_SPO) and short (TPGA_N) domains of SPO11 showed the same or even lower conservation as compared with the full-size SPO11 (14-38% and 1-36%, respectively). The HORMA domain was only two to three times more conserved than the full-size protein. Conclusion: Proteins responsible for the fidelity of meiotic recombination (RAD51, DMC1, and MLH1) are the most conserved among key meiotic proteins, while cohesins, synaptonemal complex proteins, and endonuclease SPO11 are less conserved. Meiotic proteins evolved independently in different phylogenetic lineages of Eukaryota. Acknowledgements: This work was supported by the Russian Foundation for Basic Re- search (Grant no. 16-04-01447-a). 94 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY A COMPUTATION SYSTEM FOR RANDOMIZATION-BASED ENRICHMENT ANALYSIS USING GPU: PERFORMANCE INVESTIGATION M. Grishenko 1 , A. Yakimenko 1, 2 , M. Khairetdinov 1, 2 , K. Gunbin 3 1 Novosibirsk State Technical University, Novosibirsk, Russia 2 Institute Computational Mathematics and Mathematical Geophysics SB RAS, Novosibirsk, Russia 3 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia * Corresponding author: mikhail.grishch@gmail.com Key words: Randomization, GPU Motivation and Aim: Processing of genetic data for the analysis genetic determination of traits is very important problem for modern biology. Resampling methods are wide- ly used to solve such problems. However, they require much computational resources. Common trend of increasing the simultaneously tested set of genetic characteristics, reflecting the simultaneous contribution of many factors in the formation of a single biological trait is a challenge for computer science. The aim of this paper is improving of existing algorithm as well as it usability and performance. Methods and Algorithms: We used graphic processor units (GPUs) for increasing the performance of permutation test. The whole algorithm workflow can be divided into three main stages: 1) reading the input file and forming the input data array according to convenient format for further calculations; 2) the sum calculation of the measured values of the gene functioning in the context of the experiment under analysis for various gene characteristics, the cycle composed of mixing of the array elements and after that col- lecting the quantities necessary for the statistics; 3) the exact calculation of p-values and the formation of output file. The most consuming and parallelizable is the second stage. It performed by multiplying two matrices: matrix containing numerical values obtained by the experiments and a matrix of gene characteristics. The cuBLAS library of matrix- matrix multiplication, which allowed for a transition of this algorithm to the architecture of GPUs, was used. Results: A software for the permutation test aimed at finding statistically significant overrepresented gene characteristics under different external and/or internal conditions for computing devices: PC with NVIDIA GPU. We improved multiple testing of ge- netic characteristics overrepresentation by replacing matrix-vector multiplication with matrix-matrix multiplication. The resulting matrix contains a set of arrays with the sums reflecting various gene characteristics. We investigated performance of this software depending on the: (1) number of genes; (2) number of genetic characteristics. The execu- tion time of program using the matrix-matrix multiplication for the simultaneous testing of characteristics overrepresentation increases slightly with the number of testable char- acteristics and experiments. Conclusion: This software allows user to find statistically significant overrepresented characteristics of genes simultaneously. Availability: Upon the requests to the authors. Acknowledgements: This work was supported by the Russian foundation for basic re- search (Grant No. 16-37-00240) 95 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY ACTIVE MAINTENANCE OF PHYLOTRANSCRIPTOMIC HOURGLASS PATTERNS IN PLANT AND ANIMAL EMBRYOGENESIS H.-G. Drost 1 , A. Gabel 1 , I. Grosse 1, 2 , M. Quint 3, 4 1 Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle (Saale), Germany 2 German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany 3 Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, Halle (Saale), Germany 4 Institute of Agricultural and Nutritional Sciences, Martin Luther University Halle-Wittenberg, Halle (Saale), Germany * Corresponding author: grosse@informatik.uni-halle.de Key words: evolution, gene expression, hourglass model The developmental hourglass model has been used to describe the morphological transitions of related species throughout embryogenesis. Recently, quantifiable approaches combining transcriptomic and evolutionary information provided novel evidence for the presence of a phylotranscriptomic hourglass pattern across kingdoms. As its biological function is unknown it remains speculative whether this pattern is functional or merely represents a nonfunctional evolutionary relic. The latter would seriously hamper future experimental approaches designed to test hypotheses regarding its function. Here, we address this question by generating transcriptome divergence index profiles across embryogenesis of Danio rerio, Drosophila melanogaster, and Arabidopsis thaliana. To enable meaningful evaluation of the resulting patterns, we develop a statistical test that specifically assesses potential hourglass patterns. Based on this objective measure we find that two of these profiles follow a statistically significant hourglass pattern with the most conserved transcriptomes in the phylotypic periods. As the transcriptome divergence index considers only recent evolutionary signals, this indicates that the phylotranscriptomic hourglass pattern is not a rudiment but possibly actively maintained, implicating the existence of some linked biological function associated with embryogenesis in extant species. 96 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY THE EVOLUTION OF LANGUAGE-READINESS IN THE HOM- ININ LINEAGE: AN ANALYSIS OF OPEN CHROMATIN RE- GIONS IMPLICATED IN GENE REGULATION K. Gunbin 1,* , A. Benítez-Burraco 2 , F. Gusev 1 , E. Rogaev 1,3 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Department of Philology, University of Huelva, Huelva, Spain 3 University of Massachusetts Medical School, Worcester, USA e-mail: genkvg@gmail.com * Corresponding author Key words: DNAseI Hypersensitive Sites (DHSs), language-readiness, extinct vs. extant hominins, self- domestication hypothesis. Motivation and Aim: Language evolution studies rely on indirect evidence, mostly related to speech organs and human behaviour. The successful retrieval of ancient genomes is expected to contribute significantly to our knowledge of how cognition and communication evolved. Comparative genomic studies have demonstrated >99.9% of identity between modern human (M) and Neanderthal (N) and Denisovan (D) protein-coding gene sequences, which is in sharp contrast with the significant (~1 MYA) evolutionary distance between these three hominin species and with their behavioral and physical distinctive features [1,2]. Therefore, it is reasonable to anticipate an elevated rate of changes in the regulatory regions of the genes. Methods and Algorithms: First, we aligned SNPs and indels from D [1], N [2], and UI [3] that are not found in M. Second, we mapped common changes in the D-N and D-N- UI clades onto DHSs from M with an annotated effect (activation or repression) on gene expression [4]. Third, we checked if the positions evolved in D-N and D-N-UI overlapped with transcription factor binding sites showing signatures of negative or positive selection in M lineage (using ENCODE RegTfbs data track and INSIGHT approach [5]). This allowed us to put forth a list of genes affected by changes in DHSs. In our past research we have put forth a list of candidate genes that could be associated with language-readiness [6-10]. Some of these genes show differences with the N and D homologs in regulatory (including methylation patterns) or coding regions [6-8]. Finally, we analyzed the nearest interactions between these two lists using GeneMania data (http://genemania.org/). Results: Our results are suggestive of potential differences between M and N, D and UI regarding the regulation pattern of genes involved in osteogenesis, brain function, and immunity. This is interesting in view of the link between the emergence of language-readiness and skull-brain cross-talk [6], but also in the brain-immune system cross-talk [11]. We have also found that the regulation of FOXP1 (in D), and CNTNAP2 (in N), two well-known genes related to language function could have been affected by severe changes in DHSs of these hominin species. Finally, we found that several of our candidates for language-readiness and self-domestication are predicted to interact with genes affected by changes in DHSs. In the whole, our findings give support to the view that differences in expression and/or interactions of known candidates for cognitive development and language evolution may account for the presumed differences in cognitive and linguistic abilities between extinct and extant hominins. Availability: Upon the requests to the authors. Acknowledgements: The study is supported by the grants 14.B25.31.0033 [Resolution No.220], FFI2014-61888-EXP, FFI2013-43823-P). References: 1. M. Meyer et al. (2012) Science 338:222-226.; 2. G. Renaud et al. (2014) Nature 505:43-49.; 3. Q. Fu et al. (2014) Nature 514:445-449.; 4. N.C. Sheffield et al. (2013) Genome Research 23:777-788.; 5. L. Arbiza et al. (2013) Na- ture Genetics 45:723-729.; 6. C. Boeckx, and A. Benítez-Burraco (2014). Front. Psychol. 5:282.; 7. C. Boeckx and A. Benítez-Burraco, (2014). Front. Psychol. 5:1324.; 8. A. Benítez-Burraco and C. Boeckx (2015). Front. Psychol. 6:794.; 9. A. S. Wilkins et al. (2014). Genetics 197:795-808.; 10. B. Hare, and M. Tomasello (2005). Trends Cogn. Sci. 9:439-444.; 11. A. Benítez-Burraco and J. Uriagereka, (2016) Front. Mol. Neurosci. 8:84. 97 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY SEARCHING FOR CYTOLYTIC GENETIC MARKERS OF NEWCASTLE DISEASE VIRUS USING COMPUTER ASSISTED ANALYSIS K.V. Gunbin 1, 4 , M.R. Kabilov 2 , K.S. Yurchenko 3 , A.V. Glushchenko 3 , A.M. Shestopalov 3 , N.V. Gubanova 1, 4 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Institute of Chemical Biology and Fundamental Medicine SB RAS, Novosibirsk, Russia 3 Research Center of Clinical and Experimental Medicine SB RAMS, Novosibirsk, Russia 4 Novosibirsk State University, Novosibirsk, Russia * Corresponding author: genkvg@gmail.com Key words: Newcastle Disease Virus, whole genome sequencing, 3D protein structure Motivation and Aim: Newcastle disease virus (NDV) is one of the potential candidate viruses for therapy of human tumors. This is enveloped virus containing a non-segmented, single- stranded, negative-sense RNA genome. The replication of many viruses results in lysis of the tumor cells and inherent cytotoxicity. However, profound understanding of the mechanisms of virus replication is still illusive. Methods and Algorithms: In order to distinguish the cytolytic genetic markers, 15 NDV class II [1] strains (s.) were tested for their ability to lyse the human tumor cell cultures. Whole genome sequencing of 8 new NDV s. was carried out. NDV genomes were aligned by SaTe 2.2.7; NDV proteins were aligned using Promals3D. In searching for the genetic basis un- derlying cytotoxicity, various RNA and protein characteristics were analyzed: (1) common SNPs and indels; (2) the localization and number of long inverted, tandem, and palindrome RNA repeats; (3) local RNA similarity and (4) the values of various dinucleotide RNA prop- erties obtained by various window size; (5) codon adaptation indexes as described in [1]; (6) the 3D protein structure assisted comparison of conservative and variable positions. For this analysis M, HN, and F protein structures were taken directly from PDB, the 3D protein struc- tures of L, NP, and V proteins were modelled using I-TASSER 4.4 and target structures with DELTA-BLAST E-value<1E-40. The protein structural flexibility was solved by CABS-flex. Results: In their ability to lyse tumor cells, the strains were divided into two groups: cytolytic (8 s.) and non-cytolytic (7 s.). Only tiny differences between these groups were identified in two protein coding regions (<200nt length each) using various dinucleotide RNA properties. The projection of variable sites on 3D protein structures shown that non-cytolytic strains ac- cumulate mutations in NP, L, and M proteins in sterically interacted regions with high level of structural flexibility and high total number of intramolecular contacts significantly more frequently than cytolytic strains. The same is observed for V proteins in cytolytic strains. Taking into consideration that these three proteins are critically important to NDV RNA rep- lication and virion synthesis, it is reasonable to speculate that the cause of NDV cytotoxicity is the fast kinetic of proteins folding. The structure of NDV V protein resembles Simian virus 5V protein, which is known to modulate the activity of DDB1-CUL4-ROC1 E3 apparatus in favor of viral infection [3]. Therefore, the development of the same molecular mechanism promoting the viral infection is probable in cytopathic NDV strains. Availability: Data available upon the requests to the authors. Acknowledgements: This work was supported by the grants 14-04-01196 and 2013–2020 (VI.53.1.4, VI.62.1.3, 0309-2014-0007). References: 1. D.G. Diel et al. (2012) Infect Genet Evol. 12:1770-1779. 2. Y.Y. Waldman et al. (2010) Nucleic Acids Res. 38:2964-2974; 3. S. Angers et al. (2006) Nature. 443:590-593. 98 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY TRANSCRIPTOMIC ANALYSIS OF WHEAT ROOT IN RESPONSE TO ESSENTIAL NUTRIENT DEFICIENCY: A GEMOME-WIDE COMPARATIVE STUDY S. Gupta 1 , B.S. Yadav 2 , S. Freilich 3 , P.K. Varadwaj 1 * 1 Department of Bioinformatics, Indian Institute of Information Technology-Allahabad, India 2 Deparment of Molecular Biology and Ecology of Plants, Tel Aviv University, Israel 3 Systems Biology and Ecology ARO-Volcani Center-Bet-Dagan, Israel * Corresponding author: Pritish@iiita.ac.in Key words: genomics, wheat, nutrient deficiency Plant roots are essential for providing anchorage and uptake of nutrient and water, required for growth and development. The deficiency of necessary macro-nutrients, i.e. Potassium (K), Magnesium (Mg), Nitrogen (N), Phosphorus (P), and Sulfate (S) in soil directs to a reduction in crop yield and growth as a resulted of change in its gene expression profile. Aim of this study is to identify and annotate such deficiency related responsible genes and proteins as well as high- lights the cellular and metabolic pathways response which is essential for survival of the plant. To differentiate the gene expression levels, various nutrient deficiencies (ND) treated, microar- ray data of wheat root tissue sample were analyzed. To annotate the Gene Ontology (GO) term for each nutrient deficiency response genes (NDRGs), Singular Enrichment Analysis (SEA) was performed.Further, the functional enrichment analysis was performed using MapMan an- notations scheme. Macro-nutrients regulated genes were assigned into a functional category within the hierarchy of MapMan pathway scheme. Pathway enrichment in each experiment was determined by calculating the cumulative hypergeometric p value for the probability that a gene group is over-represented within a functional bin at a rate higher than chance expecta- tion. Multiple tests were performed for all gene group functional categories at all hierarchical levels and Heatmap for each gene group was generated using “gpplaot” package in R. To revalidate the identified results a meta-analysis was performed using root specific earlier iden- tified sample data of wheat through Genevestigator tool. A set of 435 statistically significant NDRGs was identified which got differentially expressed in response to at least one among all ND under study. Total, 58 NGRs were reported to be expressed in response to minimum two ND. P deficiency was seen to be affecting large numbers of NRGs in- comparison to other ND. The SEA predicts individual GO classification of NRGs in different biological process, cel- lular component and molecular function. MapMan analysis showed the carbohydrate and lipid pathways are up-regulated during potassium deficiency, hormonal (jasomonate and ethylene) as well amino acids synthesis (Asp) in Mg deficiency. In P deficiency, lipid degradation, MYB transcription factors, transportation and poly amines pathway were highly expressed but the cell-wall, fatty acid synthesis, protein translation modification, DNA synthesis and repairs and abiotic stress pathways were down-regulated. Amino acid and carbohydrate degraded and photosynthesis process down during nitrogen deficiency. Polyamine metabolism and protein synthesis pathways enriched in S deficiency in plant. Moreover, meta-analysis performed the significance analysis of NRGs, which is based on a large-scale systematic combination of nor- malized and quality-controlled expression data of root specific 109 samples of wheat. Study reveals several uncharacterized proteins in response to ND and other stress with provide a set of known genes, mRNA, proteins. Through functional annotation of these uncharacter- ized genes, components involved in different metabolic pathways and reaction mechanism of proteins/enzymes encoded by 155 transcripts were expressed during ND condition. The study demonstrated the deeper understanding of nutrient utilization in wheat root. 99 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY TRANSCRIPTION BY ALTERNATIVE SIGMA FACTORS: REVISING THE RIGIDNESS PARADIGM J. Guzina 1* , M. Djordjevic 1 1 Faculty of Biology, University of Belgrade, Belgrade, Serbia * Corresponding author: jelenag@bio.bg.ac.rs 200nt> Download 3.91 Kb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling