International conference on bioinformatics of genome regulation
Download 3.91 Kb. Pdf ko'rish
|
- Bu sahifa navigatsiya:
- Key words
Key words: rheumatoid arthritis, SNP, VEGF A, rs3025039, rs699947 Motivation and Aim: Genetic factors are involved in the developing of rheumatoid ar- thritis (RA). In RA pathogenesis a pannus formation play a crucial role. One of the fac- tors contributing to this process is the vascular endothelial growth factor (VEGF). The existence of an association between the VEGF SNP and RA progress is supposed. The aim of this study was to investigate the involvement of VEGF+C936T (rs3025039) and VEGF-С2578А (rs699947) SNPs in developing of RA. Methods and Algorithms: 229 Europeoid women with RA were included in our study. Patients had American College of Rheumatology (ACR)-defined RA (1987 classification criteria). The genotyping was performed by restriction fragment length polymorphism analysis of PCR-amplified fragments (PCR-RFLP), furthermore the age of disease onset was determined. Description analysis and Mann—Whitney U-test were employed for statistical processing the results. Results: No differences were found between groups with VEGF-С2578А (rs699947) SNPs. The age of disease onset was significantly different between group with vari- ous genotypes for VEGF+C936T SNPs: 42,5 years for VEGF+936СС compare with 47,8 years for VEGF+936СT (p-value=0,015). 168 (73,36%) and 59 (25,76%) patients had VEGF+936СС and VEGF+936СT genotype, respectively. The last mentioned are consistent with other investigator’s results. Two patients (0,88%) had VEGF+936TT genotype and this small group wasn’t included in analysis. Conclusion: Analysis of vascular endothelial growth factor polymorphisms (rs3025039) may be useful in clinical practice for evaluation of predisposition to earlier RA. 214 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY THEORETICAL MODEL OF MITOTIC SPINDLE MICROTUBULE GROWTH FOR FRAP CURVE INTERPRETATION L.V. Omelyanchuk 1, 2 *, A.F. Munzarova 1, 2 , T.Y. Mikhailova 2 1 Institute of Molecular and Cellular Biology, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia Key words: mitosis, modeling Spindle FRAP recovery curve depends upon kinetic parameters of polymerization at microtubule plus ends. Empirical FRAP recovery curve of Salmon et al., 1984 permits to determine only one such dynamic parameter, commonly called as “tubulin turnover”. The aim of our study was to build FRAP recovery curve based upon already known ki- netic model of microtubule growth. Analytical expression describing the distribution of polymerizing and degrading microtubule ends as a function of four kinetic parameters was found. Theoretical FRAP recovery curve for spindle was constructed. Fitting the theoretical curve to experimental data gives the values of four parameters, describing spindle microtubule behavior. This gives the opportunity to study how mutation in mitot- ic proteins affects microtubule growth and shrinking. The alterations in kinetic constants of the transition between growth and shrinking and between shrinking and growth could also be determined for mutant proteins. 215 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY SEQUENCING OF CONIFER GENOMES USING NGS N.V. Oreshkova 1, 2 *, Yu.A. Putintseva 1, 2 , D.A. Kuzmin 1 , V.V. Sharov 1 , V.V. Biryukov 1 , S.V. Makolov 1 , K.V. Krutovsky 1, 3, 4, 5 1 Siberian Federal University, Krasnoyarsk, Russia 2 V.N. Sukachev Institute of Forest SB RAS, Krasnoyarsk, Russia 3 Georg-August University of Göttingen, Göttingen, Germany 4 N.I. Vavilov Institute of General Genetics, RAS, Moscow, Russia 5 Texas A&M University, College Station, USA * Corresponding author: oreshkova@ksc.krasn.ru Key words: genome sequencing, genome assembly, organelle genomes, Pinus sibirica, Larix sibirica Motivation and Aim: Pinus sibirica and Larix sibirica are the key forest-forming species of great ecological and economical importance. To investigate specific features of these species, interspecific interactions, genetic diversity and for other important studies it is necessary to assemble and annotate reference genomes. Methods and Algorithms: The Siberian larch (Larix sibirica Ledeb.) and Siberian pine (Pinus sibirica Du Tour.) nuclear and organelle genomes are being de novo sequenced in the Laboratory of Forest Genomics at the Siberian Federal University using Illumina HiSeq 2000 and MiSeq, and their first draft genome assemblies were generated. Esti- mated genome size was 12.03 Gbp for Siberian larch and 28.90 Gbp for Siberian pine. DNAs isolated from needles, single megagametophytes and a haploid tissue culture of a reference larch tree and from needles and single megagametophytes of a reference pine tree were used to generate multiple PE libraries with 250, 400 and 500 bp long inserts and MPE libraries representing 3 and 5 Kbp long fragments. We tested CLC Assembly Cell, ABySS and MaSuRCA assemblers that were used in the similar Picea abies, Picea glauca and Pinus taeda conifer genome sequencing projects, respectively. Results: The best Siberian larch genome assembly was ~5.5 Gbp long (that is 46% of the expected complete genome length) with N50 for contigs equaled 1947 bp. Almost all Si- berian pine short reads were successfully mapped to the draft genome assembly v1.0 of closely related sugar pine (Pinus lambertiana Dougl.) generated in the PineRefSeq proj- ect (http://pinegenome.org/pinerefseq) covering more than 80% of the assembly (~21.26 Gbp). Thus, the reference-based together with de novo assembly approaches resulted in a draft genome assembly of Siberian pine with a total length of ~22.9 Gbp (79% of the expected complete genome length) with N50 for contigs equaled 2352 bp. About 80% of Siberian larch and pine nuclear genomes consisted of highly repetitive DNA. Conclusion: Thus, we obtained 46 and 79% of the length of Siberian larch and Siberian stone pine genomes, respectively. Currently, work is underway to improve the quality of assembly and annotation. Acknowledgements: This study was supported by Research Grant No. 14.Y26.31.0004 from the Government of the Russian Federation. 216 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY CLUSTER ANALYSIS OF STRESS-INDUCED DUPLEX DESTABILIZATION (SIDD) PROFILES FOR E. COLI PROMOTERS M.A. Orlov*, A.A. Ryasik, E.A. Temlyakova, A.A. Sorokin Institute of Cell Biophysics RAS, Pushchino, Russia * Corresponding author: orlovmikhailanat@gmail.com Key words: DNA physical properties, machine learning, cluster analysis, bacterial promoters Motivation and Aim: The physical properties of promoter DNA play a key role in all steps of transcription initiation. The propensity to undergo stress-induced duplex desta- bilization (SIDD) [1] could contribute significantly to the formation of open promoter complex. It was shown that SIDD can be used successfully for promoters identification and when used in combination, it is shown to significantly reduce the level of false posi- tive predictions [1]. Methods and Algorithms: Sequences of experimentally found E. coli promoters were obtained from RegulonDB (version 8.5). SIDD profiles were calculated for each se- quence in the set. Following clusterization of the profiles was performed with Ward method; its results consistency was assessed using consensus clustering technique. Results: Analysis of the SIDD data shows that approximately half of promoters have no significant maxima corresponding to the opening probability over 50% for .... According to clusterization rest of promoters are grouped into three stable compact clusters with opening probability maxima nearly -175, -45, and +20 positions from transcription start site (TST) accordingly. We suppose that regions with high probability to open located outside of a region, which interact with polymerase directly, serve as a ‘safety valves’ that keep the superhelicity of promoter DNA stable. Conclusion: Analysis of SIDD profiles in the promoter area of E. coli genome shows that regions of high melting probability do not contribute directly to polymerase-pro- moter recognition, but help to maintain the promoter in stable superhelical conditions. Acknowledgements: This work was supported by RFBR grant r_centr_a 14-44-03679. References: 1. Wang, H.Q. and Benham, C.J. (2006) Promoter prediction and annotation of micro- bial genomes based on DNA sequence and structural responses to superhelical stress, BMC Bioinformatics, 7: 248-262. 217 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY COMPUTER ANALYIS OF DISTAL GENE REGULATION USING CHROMOSOME CONTACTS DATA Y.L. Orlov 1 *, E.V. Kulakova 2 , A.G. Bogomolov 1 , V.N. Babenko 1 , G. Li 3 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia 3 Huazhong Agricultural University, Wuhan, China * Corresponding author: orlov@bionet.nsc.ru Key words: gene expression regulation, chromosome contacts, CTCF sites, ChIA-PET Motivation and Aim: Transcription regulation research is important problem of molecu- lar biology challenging sequencing technologies and bioinformatics data analysis. ChIP- Seq detects interactions between DNA and proteins; ChIA-PET (Chromatin Interaction Analysis with Paired-End-Tag) technology allows detect interactions between pairs of DNA sites affecting gene regulation. Fullwood et al. [1] used ChIA-PET technology to construct chromatin interaction network bound by estrogen receptor alpha from hu- man breast cancer cell line (MCF-7) and found long-range ER binding sites are mostly located at promoter regions. CTCF-mediated interactions found in mouse embryonic pluripotent stem cells and human cell lines [2]. Li et al. [2] detected promoter-centered distant interactions bound by RNA Polymerase II in cancer cells. Methods and Algorithms: We developed computer programs for statistical data analysis and test it on CTCF binding sites, genes and spatial topological domains. These data have been obtained experimentally by using methods ChIP-seq, Hi-C, ChIA-PET. Five distinct chromatin domains revealed by CTCF ChIA-PET raised a new model of CTCF function for chromosome structure organization and linking enhancers to promoters for gene transcription regulation. Results: We used data on the spatial domains in the genome of the mouse embryonic stem cells and in the human genome, data on the location of CTCF binding sites clusters obtained by ChIA-PET. Gene annotation was obtained from UCSC Genome Browser (http://genome.ucsc.edu). The result of the analysis is the distribution of CTCF tran- scription factor binding sites on domains on the human chromosomes and relative gene locations. The distributions of human genes relative CTCF binding sites and a randomly generated list of such sites as the program output were used to estimate statistical signifi- cance of the associations found. Conclusion: Chromatin interaction network is organized into “community”, and genes within community perform related functions and respond to external stimuli in a coordi- nated manner. In all the promoter-nonpromoter interactions, more than 40% of the non- promoter regulatory elements didn’t interact with their nearest promoters. References: 1. M.J. Fullwood et al. (2009) An oestrogen-receptor-alpha-bound human chromatin interactome, Nature, 462(7269):58-64. 2. G. Li et al. (2014) Chromatin Interaction Analysis with Paired-End Tag (ChIA-PET) sequencing tech- nology and application, BMC Genomics, 15(Suppl 12):S11. 218 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY DIOXIN-MEDIATED UPREGULATION OF ONCOSTATIN M IN U937 MACROPHAGES D.Y. Oshchepkov 1 *, E.V. Kashina 1 , V.A. Mordvinov 1 , D.P. Furman 1, 2 1 Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia * Corresponding author: diman@bionet.nsc.ru Key words: Oncostatin M, macrophage, dioxin, AhR, cytokines Motivation and Aim: The environmental pollutant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD, dioxin) is the most toxic among the dioxin xenobiotics and induces a broad spectrum of biological effects, including immunotoxicity and cancer [1]. Macrophages are key regulators of the innate immune response, as well as one of the first types of cells responding to stress, so the study of dioxin action in these cells is important. It is known that TCDD exposure effects some cytokines expression [2] but our analysis [3] showed that the list of such cytokines is not yet completed. We have investigated an effect of TCDD on Oncostatin M (OSM) expression in U937 macrophages. Methods and Algorithms: Real-time PCR experiments were performed to investigate OSM mRNA expression dynamics at 6 and 24 hours after TCDD exposure in U937 macrophage-like cells. Results: The data obtained demonstrate that OSM is upregulated after 6h of TCDD ex- posure, and maintains its overexpression after 24 hours. Transcription factor AP-1 is known to be the activator of OSM expression in macrophages [4]. We have shown that FOSB and FOSL2 genes, coding AP-1 subunits, are upregulated simultaneously with OSM in U937. Conclusion: Oncostatin M is supposed to play fundamental roles in mechanisms of in- flammation in pathology [5]. Predicted activation of Oncostatin M expression in mono- cytes/macrophages, which are a primary source of OSM, can explain a spectrum of biological effects of AhR ligands exposure, including immunotoxicity and cancer. Acknowledgements: This work was supported by state projects nos 0324-2015-0003 and 0324-2015-0004. References: 1. Mandal P. (2005) J Comp Physiol B. 175: 221-30. 2. Kerkvliet N. (2009) Biochem Pharmacol. 77(4): 746-60 3. Furman D., et al. (2009) Comput Biol Chem. 33(6):465-8. 4. Kastl SP, et al.,.Blood. 2009 Sep 24;114(13):2812-8. 5. Richards C.D. ISRN Inflamm. 2013 Dec 8;2013:512103. 219 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY IN SILICO MOUSE CHROMOCENTERS CONTENT D.I. Ostromyshenskii*, A.S. Komissarov, I.S. Kuznetsova, O.I. Podgornaya Institute of Cytology RAS, St. Petersburg, Russia * Corresponding author: necroforus@gmail.com Key words: sequencing, chromocenter Motivation and Aim: Chromocenters are interphase nuclei landmark structures of constitu- tive heterochromatin. Heterochromatin enriched in tandem repeat (TR). There is progress in recent years in revealing chromocenters protein content, though it is not clear what DNA underlay constitutive heterochromatin apart from TR. The aim of the current work was to find out what DNA sequences involved in chromocenters formation. Methods and Algorithms: Chromocenters was biochemically isolated and chromocenters’ DNA library prepared by Nextera kit. The library were sequenced using Illumina MiSeq with paired-end reads of 35 bp. Results: Bioinformatics comparison of chromocenters MiSeq (chcMiSeq) with whole ge- nome sequencing on Illumina HiSeq (gnHiSeq) revealed NNN content: Family\Source chcMiSeq gnHiSeq Tandem repeats (TR) 70 % 10 % LINE 7 % 4 % ERV 1.5 % 1 % Among chcMiSeq TR the most abundant is MaSat (61%) and MiSat (4%). The rest of TR (5%) represents the TRs families previously described [1]. The rest 20% of chcMiSeq dataset is mostly unannotated sequences, but some of them have been identified when part of chcMi-Seq dataset have been assembled into contigs by IDBA_UD program. In the contigs assembled there are many fragments of heterochromatic Y chromosome, rRNA and six other pseudo-genes and ncRNA gene. Full scale gene sfi1 is found in contigs and it is localized to the chromosome 11 pericentromeric region. The ERV based fragments from chcMiSeq assembled contigs went to all the possible locations being mapped to different ERV consensuses from Repbase. This indicate that the whole ERV could be built in TR ar- rays. In contrast, there is very few full-length LINEs in chcMiSeq or in its’ part assembled. Most of the LINE fragments collected in the same 2 kb region at the end of the 2 nd ORF and its’ flanking region. The same region of LINE is the origin for the L1-based TR [1]. Full-length LINEs enrich facultative heterochromatin, but it is nearly absent in constitutive heterochromatin [2, 3] Conclusion: The sequencing of chromocenters’ DNA (chcMiSeq) reveal full length ERVs and precise LINE’ fragment of 2 kb as the substantial mouse constitutive heterochromatin components together with TR of different families. Acknowledgements: This work was supported by the granting program ‘Molecular and cell biology’ of the Russian Academy of Sciences. References: 1. A.S. Komissarov, E.V. Gavrilova, S. Demin et al. (2011). Tandemly repeated DNA families in the mouse genome. BMC genomics, 12(1):531-552. 2. A.T. Chinwalla et al (2002). Initial sequencing and comparative analysis of the mouse genome. Nature, 420(6915): 520-562. 3. I. Solovei, M. Kreysing, C. Lanctot et al. (2009). Nuclear architecture of rod photoreceptor cells adapts to vision in mammalian evolution. Cell 137: 356-368. 220 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY THE GENOME WIDE ANALYSIS OF THE LARGE TANDEM REPEATS IN THE CLOSELY RELATED GENOMES D.I. Ostromyshenskii*, O.I. Podgornaya Institute of Cytology RAS, St. Petersburg, Russia * Corresponding author: necroforus@gmail.com Key words: genomes, tandem repeats Motivation and Aim: Large tandemly repeated sequences (TR, or satellite DNA) are neces- sary part of higher eukaryotes genomes and can comprise up to tens percent of the genomes. Much of TRs’ functional nature in any genome remains enigmatic because there are only few tools available for dissecting and elucidating the TR functions. Material and methods: The modified pipeline of the one used previously in our Lab [1] ap- plied to the several databases. Four mammalian genera was used: (1) mice Mus: M.musculus, M. caroli (unassembled genome); (2) guinea pigs Cavia: C. porcellus, C. apperea; (3) bats Myotis: M. brandii, M. davidii, M. lucifugus; (4) cows Bos: B. taurus, B. mutus, B. indicus. Results: We tried to find all the 62 M.musculus TR families [1] in raw reads of M. caroli ge- nome (Caroli Genome Project, PRJEB2188). There are only few TR of M. musculus in M. caroli genome. M. musculus major satellite (MaSat or GSAT-MM) occupied nearly 0,7% of M. caroli genome, while in M. musculus genome - ~ 11 %. In M. caroli genome we found 5 other M. musculus’s TR’s families. Genus Cavia. C. porcellus genome possesses 25 TR and C. apperea – only 10 TR. 9 out of 10 C. apperea TR’s family exist also in C. porcellus genome except the major TR for this species – Capp-1518. In C. porcellus genome there are two major TR – Cpor-783 is absent in the 2 nd genome and Cpor-123 exists in C. apperea genome as the minor one. Genus Myotis. There is no any TR of Myotis in Repbase, but 133 TR’s families are found in M. brandtii genome, 105 - in M. davidii genome and 26 - in M. lucifugus genome. Only 5 TR families exist in three genome but most of TR families are species-specific. Major TR for M. davidii and M. lucifugus is common in sequence though differ in monomer length, but the same TR is minor one in M. brandtii. The major for M. brandtii is not identified in both other genomes at all. Genus Bos. There are three TR known for Bos in Repbase and all of them are found in all Bos assemblies. Still the major TR in all Bos assemblies differ: in B. taurus genome BT- SAT4/BTSAT5 is a major TR while BTSAT6 major TR family in B. indicus genome. It is visible that most of the top TR families in genus Bos exist only in two genomes or even in one, i.e. is species-specific. Conclusion: The most exhausting analysis of major TR (one for each species) of ~300 animals and plants display no readily apparent conserved characteristics [2]. We compared the TR sets. Our data evidenced that there are species-specific top TR, which are absent in genome of closely related species. In all genera examined major TRs are species-specific and hardly exist in other species of genera even as a minor ones. Acknowledgements: This work was supported by the granting program ‘Molecular and cell biology’ of the Russian Academy of Sciences. References: 1. A.S. Komissarov, E.V. Gavrilova, S. Demin et al. (2011). Tandemly repeated DNA families in the mouse genome. BMC genomics, 12(1):531-552. 2. D.P. Melters, K.R. Bradnam, H.A. Young, et al (2013). Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. Genome Biol, 14(1), R10. 221 THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY ELECTROSTATICS: A NEW OLD GENOME SELECTION FACTOR A.A. Osypov* Institute of Higher Nervous Activity and Neurophysiology RAS, Moscow, Russia, Institute of Cell Biophysics of RAS, Pushchino, Russia * Corresponding author: aosypov@gmail.com Download 3.91 Kb. Do'stlaringiz bilan baham: |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling