International conference on bioinformatics of genome regulation
Download 3.91 Kb. Pdf ko'rish
|
Key words: UniPro UGENE, Sanger sequencing, Aligning, Raw data, pipeline, Workflow Designer Motivation and Aim: Sanger [1] sequencing has been used for several decades. Although more modern sequencing technologies have already appeared (e.g. NGS [2]), the Sanger technology is still widely used as a supplementary method for NGS for more precision or as an independent method for sequencing of small genomes. It makes developement of a high quality algorithm for Sanger data processing an actual objective. Methods and Algorithms: UGENE [3] is a universal toolkit for biological data visualiza- tion and analysis. One of UGENE’s features is Workflow Designer that allows anyone with no programming skills to design and run complex computational pipelines. Many algorithmic blocks are implemented in Workflow Designer to perform different kinds of tasks. These blocks have been used to build a new element that aligns Sanger reads to a reference sequence. The algorithm is based on the ВLAST [4] tool that first localizes an appropriate region of the read on the reference. Then its exact location is found by the Smith-Waterman algorithm [5] that can be launched on GPU to increase the overall pipeline performance. In the end, all aligned reads are merged into a single multiple alignment data structure via UGENE’s own merging algorithm. Results and Conclusion: The algorithm has proved itself as a fast and memory efficient solution for aligning Sanger reads to a reference. The Workflow Designer tool provides pipelined data processing for several independent blocks of Sanger data. In the nearest future the algorithm will be available in the UGENE toolkit along with a new viewer for raw Sanger data. Availability: http://ugene.net/download.html References: 1. Sanger F., Niclein S., Coulson A.R. (1977) DNA sequencing with chain-terminating inhibitors, Pro- ceedings of the National Academy of Sciences, USA, 74: 5463-5467. 2. de Magalhães JP, Finch CE, Janssens G (2010) Next-generation sequencing in aging research: emerg- ing applications, problems, pitfalls and possible solutions, Ageing Research Reviews 9 (3): 315–323. 3. Okonechnikov K., Golosova, O., Fursov, M., the UGENE team (2012) Unipro UGENE: a unified bioinformatics toolkit, Bioinformatics. 28 (8): 1166-1167. 4. Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) Basic local alignment search tool, Journal of Molecular Biology 215:403-410. 5. Smith, Temple F.; and Waterman, Michael S. (1981) Identification of Common Molecular Subse- quences, Journal of Molecular Biology 147: 195–197. 317 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY GENETIC BASIS OF AGGRESSION: CLUSTERIZATION OF EXPRESSION PROFILES E.S. Tiys 1, 2 *, A.O. Bragin 1, 2 , I.V. Medvedeva 1, 2 , I.V. Chadaeva 1 , A.L. Markel 1 , Y.L. Orlov 1, 2 1 Institute of Cytology of Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia * Corresponding author: tiys@bionet.nsc.ru Key words: aggression, RNA-Seq, differential expression, functional annotation, MeV, GOexpress Motivation and Aim: Aggression is a complex phenomenon of behavior with physiologi- cal and genetics roots. It is important to find traces of aggression on genetic level. RNA- seq experiment is invaluable source of information on genetic component of aggressive behavior. Methods and Algorithms: We used RNA-Seq data from brain areas of aggressive and tame rat lines obtained at ICG SB RAS [1]. MirVana™ miRNA Isolation Kit(Ambion) was used to extract mRNA. NEBNext® mRNA Library Prep Reagent Set for Illumina (NEB) was used to prepare libraries. Sequencing was conducted according to Illumina protocol. Reference genome RGSC Rnor_5.0\rn5 and TopHat2 tool were used for the alignment. The sequence reads were analyzed using Cufflink workflow [2] using FPKM to quantify expression. Clustering of samples by gene expression was done with MeV [3] using average linkage clustering. Correlation coefficient was used as a measure of similarity. Differential expression was calculated with MeV using Wilcoxon test with FDR multiple tests correction. Ranking of processes was made by GOexpress [4]. Results: Expression profiles were clustered by tissues and the tissue clusters were ac- curately clustered into aggressive and tame samples. Differential expression was not statistically significant. Applying GOexpress ranking allow us to prioritize the following Gene Ontology biological processes: “negative regulation of transporter activity”, “ex- trinsic apoptotic signaling pathway in absence of ligand”, “salivary gland cavitation”, “regulation of response to DNA damage stimulus, establishment of cell polarity”. Conclusion: On the one hand accurate clusterization of the expression profiles confirms the presence of molecular genetic traces of aggression behavior in the RNA-seq data, on the other hand differential expression and prioritizing were not show statistical signifi- cance. Further analysis is needed to uncover molecular basis of aggression. Acknowledgements: The work was supported by RSF grant 14-14-00269. References: 1. Y.L. Orlov et al. (2012) ICGenomics: program complex for symbol sequence analysis in genomics, Vavilov journal of genetics and breeding, 16(4/1): 732-741. (In Russian). 2. http://cole-trapnell-lab.github.io/cufflinks/ 3. A.I. Saeed (2003) TM4: a free, open-source system for microarray data management and analysis, Biotechniques, 34(2): 374. 4. K. Rue-Albrecht (2016) GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data, BMC bioinfor- matics, 17(1): 1. 318 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY GENEONTOLOGY BIOLOGICAL PROCESSES SENSITIVE TO SALT DIET CHANGES IN AN EXPREIMENT WITH 105-DAY ISOLATION: STATISTICAL ANALYSIS OF URINE PROTEOME E.S. Tiys 1, 2 *, E.D. Petrovskiy 1 , L.Kh. Pastushkova 3 , D.N. Kashirina 3 , I.M. Larina 3 , V.A. Ivanisenko 1 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk state university, Novosibirsk, Russia 3 Institute for Biomedical Problems RAS, Moscow, Russia * Corresponding author tiys@bionet.nsc.ru Key words: salt intake, Gene Ontology, Random forest, GOexpress, Bioconductor Motivation and Aim: Conducting mass-spectrometry experiments allows to solve impor- tant problem of protein expression measurement. Promising approach to interpret mass spectrometric studies is based on the identification of biological processes, protein repre- sentation of which is the most contrast between case and control samples. The aim of the study was a computer search for biological processes, changing of functional activity of which was related with salt intake regime, based on urine proteome analysis in 105-day volunteer isolation experiment. Methods and Algorithms: Weekly during the 105-day isolation urine samples from six volunteers were collected and chromato-mass spectrometry was conducted. Collected spectra were analysed with Mascot tool. The results consist of lists of identified proteins for each volunteer and data on salt content in diet (6, 9 or 12 g/day). GOexpress package [1] of Bioconductor was used to search for relation between salt intake and Gene Ontol- ogy (GO) processes through corresponding protein sets. Built-in package estimation of statistical significance was used with 250000 permutations of protein ranks. Processes with corrected p<0.05 were considered as significant. The Benjamini-Hochberg correc- tion was used (R language, function p.adjust). Results: Using GOexpress we identified 131 GO processes, demonstrating statistically significant relation with salt consumption. Among these, we can emphasize metabolic processes of connective tissue, such as chondroitin sulfate metabolic process, collagen catabolic process, hyaluronan metabolic process. This fact consistent with earlier hy- pothesis of extracellular matrix participation in sodium deposition [2, 3]. Conclusion: Mass-spectrometry data analysis of volunteer urine proteomes gave po- tential link between changes in salt intake and functioning of biological processes, in- volving in metabolism of connective tissue, that in agreement with hypothesis about its protein participation in sodium deposition. Acknowledgements: Project № 0324-2015-0003 and RSF grant 14-24-00123 References: 1. K. Rue-Albrecht (2016) GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data, BMC bioinfor- matics, 17(1): 1. 2. I.M. Larina et al., (2012) Reconstruction of associative protein networks connected with processes of sodium exchange regulation and sodium deposition in healthy volunteers based on urine proteome analysis, Human Physiology, 38(3): 316–323. 3. L.Kh. Pastushkova, (2015), Proteome profile of urine of healthy human under normal conditions and under factors of space flight, thesis [In russian]. 319 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY PHSYOLOGICAL AND TRANSCRIPTIONAL CHANGES IN A BLOSSAM-END ROT RESISTANT TOMATO INTROGRESSION LINE IL8-3 FRUIT S. Tomoki, H. Ikeda, Y. Kanayama* School of Agricultural Science, Tohoku University, Sendai, Japan * Corresponding author: yoshinori.kanayama.a7@tohoku.ac.jp Key words: Blossom-end rot, Calcium, Solanum lycopersicum, Solanum pennellii, tomato, transcriptome Motivation and Aim: Tomato (Solanum lycopersicum) is one of the most important veg- etables due to its high economic and nutrition values and is also important as an ex- perimental model species of the Solanaceae family and fleshy-fruited plants. Tomato is sensitive to Calcium (Ca) deficiency, which causes several physiological disorders in fruits such as cracking and blossom-end rot (BER). Ca concentration, early fruit growth, and expression of Ca-movement-related genes were analyzed during early fruit devel- opment in tomato, which is the most important stage in the incidence of BER, to in- vestigate physiological mechanisms affecting the occurrence of BER. We used tomato introgression line IL8-3 having a chromosome segment from the wild relative (Solanum pennellii) because the line shows lower incidence of BER compared with the parent cul- tivar ‘M82’ (S. lycopersicum), as described previously. In addition, we tried to analyze transcriptomic character of early fruit development in IL8-3 on microarray data and characterize tomato Ca-movement-related genes by phylogenetic analysis. Results and Conclusion: Total Ca concentration in fruit and leaves was higher in IL8- 3 than in ‘M82’, whereas no significant differences were observed between total Ca concentration in roots and stems of ‘M82’ and IL8-3. Therefore, the lower incidence of BER in IL8-3 could be not related to whole-plant uptake but to transport within the plant. IL8-3 fruit showed a lower growth rate than ‘M82’, which could result in prevent- ing the occurrence of BER. The expression of genes encoding Ca 2+ /H + exchanger, auto- inhibited Ca 2+ -ATPase, Ca 2+ channel, and Na + /Ca 2+ exchanger-like protein, was higher in IL8-3 fruit than in ‘M82’ fruit, suggesting active Ca movement in IL8-3. In contrast, the expression of the gene encoding Group 3 Cation/Ca 2+ exchanger was lower in IL8-3 fruit than in ‘M82’. Additionally, the expression level of boron-movement-related genes is different between IL8-3 and ‘M82’. It is reported that boron deficiency may also affect the incidence of BER and Ca 2+ transportation. All our results could be related to physi- ological mechanisms accounting for the lower incidence of BER in IL8-3 and one of these factors may play a primary role. 320 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY DIFFERENTIAL EXPRESSION OF SHAGGY, A DROSOPHILA MELANOGASTER GENE ENCODING GSK-3 BETA, AFFECTS LIFESPAN M.V. Trostnikov, N.V. Roshina, E.G. Pasyukova Institute of Molecular Genetics, RAS, Russia * Corresponding author: egpas@rambler.ru Key words: Life span, GSK-3 beta, the nervous system, Drosophila Motivation and Aim: Protein kinases are important proteins involved in multiple phos- phorylation cascades, and their role in systemic regulation of various metabolic pro- cesses is well recognized. A serine-threonine kinase GSK-3 beta (glycogen synthase kinase-3 beta) is an important protein involved in various signaling pathways and meta- bolic processes, neurogenesis and neuronal function. Earlier, we have shown that sev- eral genes involved in asymmetric neuroblast division, including shaggy (sgg) encoding Drosophila GSK-3 beta, affect Drosophila lifespan. Several mutations in sgg increased male and female mean lifespan by 5% to 50% together with slightly reducing the total amount of GSK-3 beta protein. In order to further assess the role of GSK-3 beta in the control of lifespan, aging and neuronal function, we used transgenic lines with additional copies of the four sgg variants under inducible UAS promoters, to provide sgg overex- pression; transgenic lines with hairpins, to provide sgg RNA-i knockdown; and special driver lines, to induce overexpression and knock-down in various tissues and cell types. Results: We demonstrate that i) the proper expression of the main functional GSK-3 beta transcript, RB, in the nervous system and fat body is essential for survival; ii) sgg transcripts whose functionality has not previously been demonstrated, RA, RG and RO, appeared to be functional: alterations in their expression in the nervous system slightly decrease male lifespan; in muscles, only overexpression of RA affects lifespan; iii) sgg misexpression in different neurons affects lifespan in a neuron-specific manner: mod- erate increase and decrease in the amount of the main isoform of GSK-3 beta, PB, in peptidergic and cholinergic neurons has no effect on lifespan, while in glutaminergic and motor neurons it increases female and decreases male lifespan; iv) dopaminergic neu- rons are the most sensitive to sgg function: strong sgg RB overexpression is lethal, while moderate decrease in PB amount improves female lifespan. Analyses of the molecular and cellular mechanisms underlying positive effects of sgg misexpression on lifespan are under way. Conclusion: Differential expression of GSK-3 beta is one of the mechanisms involved in a complex regulation of lifespan and aging. 321 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY ADDITIVITY AND NON-ADDITIVITY OF GENETIC CONTROL OF HUMAN METABOLOME Y.A. Tsepilov 1, 2 *, S. Shin 3, 4 , N. Soranzo 3 , T.D. Spector 5 , J. Adamski 6, 7, 8 , G. Kastenmüller 9 , K. Strauch 10, 11 , R. Wang-Sattler 12 , C. Gieger 10 , Y.S. Aulchenko 1, 2 , J.S. Ried 10 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia 3 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, United Kingdom 4 MRC Integrative Epidemiology Unit (IEU), University of Bristol Integrative Epidemiology, Bristol, United Kingdom 5 Department of Twin Research and Genetic Epidemiology, King’s College London , London, United Kingdom 6 Institute of Experimental Genetics, Genome Analysis Center, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany 7 Institute of Experimental Genetics, Life and Food Science Center Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany 8 German Center for Diabetes Research, Neuherberg, Germany 9 Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München – German Research Center for Environmental Health, Neuherberg, Germany 10 Institute of Genetic Epidemiology, Helmholtz Zentrum München – German Research Center for Environ- mental Health, Neuherberg, Germany 11 Institute of Medical Informatics, Biometry and Epidemiology, Chair of Genetic Epidemiology, Ludwig- Maximilians-Universität, Munich, Germany 12 Research Unit of Molecular Epidemiology, Helmholtz Zentrum München – German Research Center for Environmental Health, Neuherberg, Germany * Corresponding author: tsepilov@bionet.nsc.ru Key words: genome-wide association studies, non-additive models, KORA, metabolomics, genotypic model Motivation and Aim: Genome-wide association studies (GWAS) are widely applied to analyze the genetic effects on phenotypes. With the availability of high-throughput technologies for metabolite measurements, GWAS successfully identified loci that affect metabolite concentrations and underlying pathways. In most GWAS the effect of each SNP on the phenotype is assumed to be additive. Other genetic models such as reces- sive, dominant or over-dominant were considered only by very few studies. In contrast to that, there are theories that emphasize the relevance of non-additive effects as a conse- quence of physiological mechanisms. This might be especially important for metabolites as these traits are closer to the underlying pathways than other traits or diseases. Methods and Algorithms: In this study we analyzed systematically non-additive effects on a large panel of serum metabolites and all possible ratios (22,801 in total) in a popula- tion based study (KORA F4, N = 1,785). We applied four different 1 df tests correspond- ing to an additive, dominant, recessive and over-dominant trait model and additionally a genotypic model with 2 df that allows a more general formulation of genetic effects. Results: Twenty three loci were found to be genome-wide significantly associated (Bon- ferroni corrected p-value ≤2.19x10 -12 ) with at least one metabolite or ratio. For five of them we show evidence of non-additive effects. We replicated seventeen loci (including three loci with non-additive effects) in an independent study (TwinsUK, N = 846). Conclusion: We proposed efficient methodology of identification of non-additive loci. We showed that genetic effects on metabolite concentrations and ratios were mostly ad- ditive; at the same time we also found significant and replicable non-additive loci. 322 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY IDENTIFICATION OF BREED-SPECIFIC SNP-MARKERS FOR SUS SCROFA DOMESTICUS USING SRA-DATA OF NGS PROJECTS I.S. Tsybovsky*, V.N. Kipen, S.A. Kotova Scientific and Practical Centre of the State Committee of Forensic Expertises, Minsk, Belarus * Corresponding author: tsybovsky@yahoo.com Key words: SNP, Sequence Read Archive, Next-Generation Sequencing, Sus scrofa domesticus Motivation and Goal: The following domestic pig breeds are widely spread in the Re- public of Belarus: Large White, Duroc, Landrace, Pietrain and others. Large White pig, and based on it generated factory types dominate in the breeding livestock. The per- centage of these four swine breeds represents at least 98% of the total population of Sus scrofa domesticus in Belarus. The breeders are oftenly faced with the objective to estimate breeding animals’ breed-purity to correct inbreeding values in order to manage breed productivity indexes. The goal of this study is to find breed-specific SNPs for domestic pigs, which are culti- vated in Belarus: Large White, Landrace, Pietrain, Duroc and Meishan.(for comparison) SRA-data were analyzed by whole genome sequencing (NGS), located in the public do- main of the DNAnexus cloud service (http://sra.dnanexus.com/) and NCBI-SRA server (http: //www.ncbi.nlm.nih .gov / sra). Search Basics were provided by the previously published results of Ramos A. et al. [1], produced using SNP-chip «PorcineSNP60» (Il- lumina). Methods and Algorithms: Analysis was performed using SRA Nucleotide BLAST al- gorithm and program BioEdit v.7.2.5. The amount included in analysis SNP – 193; the number of reads for whole genome Large White pigs – 19, for other breeds – 71 (Land- race – 22, Pietrain – 6, Duroc – 28, Meishan – 15). The total number of analyzed se- quences is 32 754 738 518. Results: Finally, 55 breed-specific markers were identified among 193 SNPs, i.e. for those for which one allele is unique to only one breed. 6 specific SNPs were identified for the Large White breed (for which minor allele prevalence frequency was in the range of 7,9-23,7%); for Landrace – 5 (9,1-20,5%); for Pietrain – 7 (16,7-66,7%); for Duroc – 32 (7,1-100%); and for Meishan – 5 (23,3-100%). Conclusion: Basing on these results it is expected in the future to develop and test SNP- markers genetic panel intended to provide the differentiation of domestic pig breeds, such as Large White, Landrace, Duroc, Pietrain and Meishan. Availability: http://sra.dnanexus.com/, http://www.ncbi.nlm.nih.gov/sra References: 1. A.M. Ramos et al. (2011) Identification of high utility SNPs for population assignment and traceability purposes in the pig using high-throughput sequencing, Animal Genetics, 42: 613-620 323 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY IDENTIFICATION OF THE TAXA OF THE ORDER ARTIODACTYLA FOR CRIMINAL INVESTIGATION CASES OF ILLEGAL HUNTING I.S. Tsybovsky*, S.A. Kotova, V.I. Rybakova, A.A. Rabcava, E.A. Spivak Scientific and Practical Centre of the State Committee of Forensic Expertises, Minsk, Belarus * Corresponding author: tsybovsky@ yahoo.com Download 3.91 Kb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling