International conference on bioinformatics of genome regulation
Download 3.91 Kb. Pdf ko'rish
|
Key words: oesophageal cancer, transcriptome, RNA-seq, next-generation sequencing Motivation and Aim: Oesophageal cancer is the eighth most common cancer in the world and the highest in Eastern Asia. The incidence rate in Kazakhstan is 10.1: 100 000. Oe- sophageal squamous cell carcinoma (ESCC) is the dominant histological type (>90%) of esophageal cancer cases. Transcriptomic profiling of cancer specimens with next- generation sequencing technologies has provided a comprehensive opportunity for in- depth investigation of gene expression and affected molecular pathways. In our study we aimed to perform whole-transcriptome sequencing to identify affected molecular path- ways and extract meaningful molecular signals from oesophageal cancer specimens of Kazakhstani patients. Methods and Algorithms: Twenty three patients with esophageal cancer that underwent surgery at Oncology center (Astana, Kazakhstan). Fresh frozen cancer and its adjacent normal tissue specimens were obtained from each patient (in total 23 tumor center sam- ples and 23 normal tissue samples). Whole-transcriptome sequencing was performed on Illumina HiSeq2000 platform at the Center for Life Sciences, Nazarbayev University. mRNA libraries were prepared using TruSeq RNA library prep kit according to standard protocol. Raw *.bcl files were converted and demultiplexed using bcltofastq. STAR and HTSeq have been used for alignment and mapping of sequencing reads. Differentially expressed genes have been identified using DeSeq. KEGG and Reactome databases were processed for analysis of signaling networks. Results: Grouped analysis of cancer and normal samples has identified 1072 down-reg- ulated and 1963 up-regulated genes. Functional analysis of up-regulated genes revealed the most significant enrichment for genes encoding products in the category of ‘cell cycle’ (p-value = 2.3×10 -6 ), ‘DNA replication’ (p-value = 1.8×10 -4 ) and ‘lysosome’(p- value = 2.31×10 -5 ), whereas down-regulated genes in the category ‘metabolism of lip- ids and lipoproteins’ (p-value=8.62×10 -4 ), ‘valine, leucine and isoleucine degradation’ (p-value = 1×10 -6 ) and ‘propanoate metabolism’ (p-value = 6.4×10 -6 ). Conclusion: Here, we report functional analysis of transcriptomic profiles from oesoph- ageal cancer and matched adjacent normal specimens from twenty three Kazakhstani patients. 115 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY DIVERGENCE OF PARALOGOUS GROWTH HORMONE GENES IN SALMONIDS D.N. Kamenskaya 1 *, M.V. Pankova 2 , D.M. Atopkin 2, 3 , V.A. Brykov 1, 2 1 A.V. Zhirmunsky Institute of Marine Biology FEB RAS, Vladivostok, Russia 2 Far Eastern Federal University, School of Natural Sciences, Vladivostok, Russia 3 Institute of Biology and Soil Science FEB RAS, Vladivostok, Russia * Corresponding author: kamenskaya.daria@mail.ru Key words: growth hormone genes, nucleotide diversity, gene duplication, Salvelinus, Salmonidae Motivation and Aim: Duplication of DNA sequences, especially genes, is one of the main evolutionary factors. The stage of whole-genome duplication is believed to take place in the history of all organisms, and many duplicated genes could originate from it. Subsequently, full or partial diploidization stages occurred in various phylogenetic branches of eukaryotes, and these duplications can be manifested in the form of mul- tiple gene families [1]. Salmonids are a unique group, which developed in the past, after the last autotetraploidization event. Many genes in this taxonomic group were multiple, including the growth hormone (GH) gene [2]. The GH gene is represented by two unre- lated paralogous genes, gh1 and gh2, in salmonids’ genome. Both genes exist through- out the time of divergence of species in this group. Therefore, salmonids are a suitable model system for investigating the origin, evolution, and functions of duplicated genes. The aim of the current research is to compare paralogous GH genes of three genera in the family Salmonidae–Salvelinus, Salmo and Oncorhynchus–in order to determine their potential differences and possible functions. Methods and Algorithms: To obtain nucleotide sequences of the GH genes, the following conventional techniques of molecular genetics were used: PCR, electrophoresis, molec- ular cloning, and sequencing. Nucleotide sequences were analyzed using the MEGA-6.0 and DnaSP-5.10.01 software packages. Results: A comparison of the complete paralogous GH genes gh1 and gh2 of salmo- nids has shown that the conserved regions are associated with exons, and the peaks of variable regions correspond to intron sequences. It should be noted that not all intron sequences are variable; conserved regions can also be found. The presumable regulatory elements, localized in some introns (Pit-1 motifs, CRE, ERE), are also conserved. We found that gh1 gene in charrs is more conserved than gh2 gene; the amount of variability in gh2 gene is 2–3 times as large as that in gh1 gene, but it is not so obvious when com- paring genes of the all investigated species. Conclusion: A high conservation of coding sequences (exons) in paralogous GH genes of salmonid fish can be determined by the fact that both genes are functional or probably subfunctional. Both exons and regulatory regions are under the influence of negative selection. But the different rate of changes can be explained by the different selection intensity in paralogous genes. References: 1. J. Zhang (2003) Evolution by gene duplication: an update, Trends Ecol. Evol., 18: 292–298. 2. F.W. Allendorf, G.H. Thorgaard (1984) Tetraploidy and the evolution of salmonid fishes, In: Evolu- tionary Biology of Fishes, B.J.Turner (Eds.), 1-Klimenko A.I., Matushkin Yu.G., Kolchanov N.A., Lashin S.A. Bacteriophages affect evolution of bacterial communities in spatially distributed habitats: a simulation study // BMC Microbiology, 2016, Vol. 16 (Suppl. 1), No.1, p. 31. 116 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY DINAMIC METBOLIC REGULATION BY A CHROMOSOME SEGMENT FROM A WILD SPECIES DURING FRUIT DEVELOPMENT IN A TOMATO INTROGRESSION LINE Y. Kanayama School of Agricultural Science, Tohoku University, Sendai, Japan * Corresponding author: yoshinori.kanayama.a7@tohoku.ac.jp Key words: metabolome, Solanum lycopersicum, Solanum pennellii, transcriptome Motivation and Aim: Tomato (Solanum lycopersicum) is one of the most important fruit crops, and useful as a fruit crop model. The genetic variation of cultivated tomato has been narrowed by domestication, and tomato wild relatives with useful genes that have been lost from cultivated tomatoes are useful for breeding. Tomato introgression lines have been developed from a cross between S. lycopersicum and wild relative S. pennellii for the efficient evaluation and use of useful quantitative trait loci from S. pennellii. Each line carries a single S. pennellii chromosome fragment in the genetic background of cul- tivated tomato cv. M82, and a line, IL8-3, was used as a near-isogenic line for analyzing traits associated with fruit quality. Comparative metabolome and transcriptome analyses were performed during fruit development using M82 and IL8-3, with interesting and useful traits such as high sugar content. Results and Conclusion: Marked differences between M82 and IL8-3 were found in ripe fruit and young fruit at 20 days after flowering (DAF) in the hierarchical clustering analysis of the metabolome, whereas patterns were similar between the two genotypes at 10 and 30 DAF. The metabolome analysis showed that 20 DAF is an important stage for fruit metabolism and that the S. pennellii introgressed region in IL8-3 plays a key role in metabolic changes at this stage. Carbohydrate and amino acid metabolism were promoted in IL8-3 at 20 DAF and ripening stage, respectively, whereas transcriptome pattern showed no marked differences between the two genotypes, indicating that dy- namic metabolic regulation at 20 DAF and ripening stage was controlled by relatively few genes. The expression of the cell wall invertase and sucrose synthase genes in starch and sucrose metabolic pathway and that of the glutamate synthase gene in the amino acid metabolic pathway in IL8-3 fruit were higher than those in M82. Our results suggested that sugar metabolism activated by the invertase and sucrose synthase in IL8-3 fruit at 20 DAF affects amino acid metabolism and accumulation by higher sugar concentration at the late stage of fruit development. References: 1. H. Ikeda, T. Shibuya, Y. Kanayama et al. (2016) Dynamic metabolic regulation by a chromosome seg- ment from a wild relative during fruit development in a tomato introgression line, IL8-3. Plant and Cell Physiology, doi:10.1093/pcp/pcw075 2. H. Ikeda, Y. Kanayama et al. (2013) Analysis of a tomato introgression line, IL8-3, with increased Brix content. Scientia Horticulturae, 153:103-108. 117 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY BASED ON THE LOCAL SEQUENCE SIMILARITY METHOD FOR PREDICTION OF AMINO ACID POSITIONS RELATED TO THE PROTEIN-LIGAND SPECIFICITY D.A. Karasev 1, 2 *, A.V. Veselovsky 1 , N.Yu. Oparina 3, 4 , A.V. Rudik 1 , D.A. Filimonov 1 , B.N. Sobolev 1 1 Institute of Biomedical Chemistry, Moscow, Russia 2 Russian National Research Medical University, Moscow, Russia 3 Engelhardt Institute of Molecular Biology, Moscow, Russia 4 Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden * Corresponding author: w.dmitrykarasev@gmail.com Key words: local sequence similarity, ligand-specific positions, protein kinase inhibitors Motivation and aim: Detection of the residues responsible for ligand specificity is ap- plied in protein engineering and discovery of new drug targets. However, many existing methods require the precise superposition of functionally specific residues in aligned sequences. This is not always possible in the case of diverged protein family. We applied original method SPrOS (Specificity Projection On Sequence) to detect amino acid posi- tions associated with ligand specificity of proteins belonging to the same family. Methods and algorithms: The method SPrOS allows detecting the amino acid residues specific to user-defined groups. SPrOS compares the sequence segments from the stud- ied protein and proteins from the training set. Contrary to other segment-comparison approaches extracting the string motifs, SPrOS calculates the scores for single positions by the similarity of their surroundings. We tested our method on the sequences of protein kinases classified by interaction with small molecular compounds known as the promis- ing leads for drug development. Results: The prediction of ligand-specific positions was performed on kinases with known 3D structure. The significant specificity estimates were obtained for residues lo- cated in ATP-binding cleft, which is a known binding-site for kinase inhibitors. The impact of several found residues is confirmed by the published experimental studies. Filtering out the close homologues of the test protein at the sequence comparisons, we were able to locate specific residues with the more precision. Conclusions: We showed the applicability of our method for recognition of the amino acid residues associated with ligand specificity. The method was successfully applied to complicatedly partitioned protein family when functional classification differs from phylogeny. Based on inspecting the 3D structures, we suggest that predicted positions determine specific interaction by directly contacts with the ligand molecule. Available: http://www.way2drug.com/spros/ Acknowledgments: This work is carried out with the financial support of Russian Foun- dation for Basic Research (grant no. 16-04-00491). References 1. Karasev D.A. et al. (2016) Prediction of amino acid positions specific for functional groups in a pro- tein family based on local sequence similarity, J Mol. Recognit., 29(4),159-169. 118 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY SEQUENCING FROM ROCHE: WHAT THE FUTURE WILL BRING FOR YOU? I.Y. Karpova* LCC “Roche Diagnostics Rus”, Moscow Russia * Corresponding author: irina.karpova@roche.com Key words: NGS, high-throughput, SMRT sequencing, targeted sequencing. DNA sequencing is used widely to solve of a large set of studies: sequencing de novo of different organisms; targeted gene sequencing and search of mutations; analysis of transcriptomes and methylomes and etc. High-throughput sequencing has become a routine method and is applied both in science and in medicine. DNA tests are utilized actively at all steps of diagnostics: from search of patients and diagnosis to monitoring of drug action. This approach is used more and more and it will become the same essential part in diagnostics as Real-Time PCR. The Company Roche based on its experience and knowledge in the field of diagnostic and pharmaceutical industry develops an integrated approach that would allow using NGS in medicine quickly and efficiently. It will include all basic steps: biological sam- pling, DNA extraction, preparation of libraries, high-throughput sequencing, data analy- sis and annotation. Today targeted sequencing of different genes and regions of the human genome is pre- ferred for medical research, because it is much cheaper and more effective than whole genome sequencing. Germline mutations associated with define inherited disease is de- termined by targeted sequencing. In addition, there are somatic mutations in genome, they can also cause diseases. Not all of targeted sequencing methods are able to identify somatic mutations with high accuracy. Roche offers a new enrichment technology based on amplification. This method overcomes previous limitations of the targeted approach to enrichment by the use of molecular inversion probes. These probes improve capture performance, the detection of alleles as a function of allele frequency in somatic refer- ence samples. Besides, Roche in collaboration with Pacific Bioscience introduces a new high-through- put sequencer, based on single molecular sequencing in real-time (SMRT). The new sequencer allows to obtain long reads to a few thousand base pairs, as well as to analyze the modified bases without additional manipulation with DNA during sample prepara- tion. SMRT technology is already used widely in science to sequencing small genome de novo, to improve the assembly of large genomes, searching for new splice variants and isoforms of transcripts, the targeted sequencing, etc. New SMRT sequencer from Roche will be the first step to the active use of NGS testing in the medicine diagnostics. 119 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY BIOMOLECULAR SYSTEMS MODELS SEMI-AUTOMATIC RECONSTRUCTION BASED ON STRUCTURAL AND QUANTITATIVE INFORMATION F.V. Kazantsev 1, 2 *, I.R. Akberdin 1, 5 , S.A. Lashin 1, 2 , N. Ree 1 , V. Timonov 2 , A.V. Ratushny 3, 4 , T.M. Khlebodarova 1 , V.A. Likhoshvai 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia 3 Center for Infectious Disease Research, Seattle, USA 4 Institute for Systems Biology, Seattle, USA 5 San Diego State University, San Diego, USA Keywords: Mathematical models; generalized Hill functions; respiration; de novo nucleotide synthesis; gene network, gene expression, database Motivation: Living systems have a complex hierarchical organization that can be viewed as a set of dynamically interacting subsystems. Thus, to simulate the internal nature and dynamics of the whole biological system we should use the iterative way for a model reconstruction, which is a consistent composition and combination of its elementary subsystems. In accordance with this bottom-up approach, we have developed MAM- MOTh (MAthematical Models of bioMOlecular sysTems) database that allows integrat- ing manually curated mathematical models of biomolecular systems, which are fit to the experimental data. The database entries are organized as building blocks in a way that the model parts can be used in different combinations to describe systems with higher organizational level (metabolic pathways and/or transcription regulatory networks). Results: The database supports export of single model or their combinations in SBML or Mathematica standards. The database currently contains more than 100 mathemati- cal models for Escherichia coli elementary subsystems (enzymatic reactions and gene expression regulatory processes) that can be combined in at least 5100 complex/sophisti- cated models concerning such biological processes as: de novo nucleotide biosynthesis, aerobic/anaerobic respiration, and nitrate/nitrite utilization in E. coli. Conclusions: The database provide REST API that can be used for programmatic data access and the integration with external software tools. Currently, we have done inte- gration with the biomedb.ru resource. And now, one can create the structural model on biomedb.ru resource and obtain the mathematical model with subsystems extracted from MAMMOTh in a semi-automatic way. Availability: http://mammoth.biomodelsgroup.ru Acknowledgements: This work is partially supported by RFBR grants № 15-07-05889 and № 15-07-03879. 120 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY REGULATORY ROLE OF SINGLE CPG METHYLATION A. Khamis 1 , A.V. Artemov 2 , A.V. Lioznova 2 , V.B. Bajic 1 , Y.A. Medvedeva 2 * 1 King Abdullah University of Science and Technology 2 Research Center of Biotechnology RAS Key words: epigenetics, methylation, CpG, CAGE DNA methylation is a well studied epigenetic process. Several methods have been de- veloped to determine genome methylation at various scales, including those capturing DNA methylation with a single base resolution, such as bisulfite sequencing. Still, for a downstream analysis the most common strategy is to average methylation levels along regulatory regions, based on the assumption of the homogeneous distribution of DNA methylation within genomic regions. Despite the well known observations of unmethyl- ated CpGs co-localized within CpG islands (CGIs) and methylated CpGs co-localized within repetitive elements, the role of single CpG methylation has also been reported. Our previous study demonstrated that a share of gene-proximal CpGs exhibited a sig- nificant negative correlation of their methylation profiles with the expression profiles of neighboring genes across various cell types. We called such CpG positions CpG traffic lights. Although they can be co-localized in short CpG-clusters, they are quite often single CpGs. Previously we have demonstrated that CpG traffic lights are unlikely to be widely involved in regulation of transcription factor binding. Now we show that CpG traffic lights are over-represented within enhancers and transcriptional start sites deter- mined by CAGE (Cap Analysis of Gene Expresion). They are as well co-located with regions of histone modifications, supporting their regulatory potential. We also show depletion of SNP in such positions, suggesting the presence of natural selection. We conclude that thought the regulatory role of CpG traffic lights in not completely clear, they can represent regulatory regions and their methylation levels at very least can serve as markers for gene expression. 121 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY GENETIC DIVERSITY IN NATIVE SIBERIAN POPULATIONS: CORRELATION WITH CLIMATIC AND GEOGRAPHICAL PARAMETERS V.N. Kharkov 1, 2 *, A.V. Markov 1, 2 , I.Yu. Khitrinskaya 1 , V.A. Stepanov 1, 2 1 Research Institute for Medical Genetics, Tomsk, Russia 2 Tomsk State University, Tomsk, Russia * Corresponding author: vladimir.kharkov@medgenetics.ru Key words: human populations, genetic diversity, gene pool, adaptation, natural selection, human evolu- tionary genetics Motivation and Aim: Adaptive evolution to adverse or extreme climatic and geograph- ical conditions mediated by natural selection, probably played the substantial role in shaping the genetic structure of modern human populations. We have investigated the distribution of genome-wide SNPs, correlated with climatic and geographic parameters, in native Siberian populations comparing to worldwide human populations in order to detect the natural selection. Methods and Algorithms: Our data on genome-wide SNPs frequencies in 5 native Siberi- an populations (Buriat, Yakut, Tuva, Khants, Kets) were pooled with data on worldwide populations and analyzed by means of positional search of association of allele frequen- cies with climatic and geographic parameters and search for signals of natural selection. Results: For a considerable number of SNPs allele frequency demonstrate an increase of heterozygosity from tropical to northern populations. The level of genetic diversity and genetic differentiation of these SNPs is significantly different from the average for the genome. Among the genes demonstrating significant selection signals are: EPHA8, GRB2, LINGO2, LINC00669, YES1, CSMD1, DAAM1, DLGAP1, DRD3, KAZN, FRMD4B. Haplotypic tests show traces of natural selection in genomic regions of this genes. Conclusion: We suppose that genetic diversity in the substantial part of the human ge- nome in native Siberian populations were shaped by adaptation to cold climate. Acknowledgements This work was supported by Grant of President of Russian Federa- tion (grant № MD-8886.2016.4). 122 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY THE INFLUENCE OF RARE MUTATIONS IN THE APOB GENE TO THE LEVEL OF OXIDIZED LDL E.Yu. Khlebus*, N.V. Shcherbakova, I.S. Zhanin, A.A. Zharikova, A.I. Ershova, A.V. Kiseleva, S.A. Boytsov, A.N. Meshkov National Research Center for Preventive Medicine? Moscow, Russia * Corresponding author: elkhlebus@gmail.com Download 3.91 Kb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling