International conference on bioinformatics of genome regulation

bet	43/49
Sana	29.01.2018
Hajmi	3.91 Kb.
	#25584

1 ... 39 40 41 42 43 44 45 46 ... 49

Key words: UniPro UGENE, Sanger sequencing, Aligning, Raw data, pipeline, Workflow Designer
Motivation and Aim: Sanger [1] sequencing has been used for several decades. Although
more modern sequencing technologies have already appeared (e.g. NGS [2]), the Sanger
technology is still widely used as a supplementary method for NGS for more precision
or as an independent method for sequencing of small genomes. It makes developement
of a high quality algorithm for Sanger data processing an actual objective.
Methods and Algorithms: UGENE [3] is a universal toolkit for biological data visualiza-
tion and analysis. One of UGENE’s features is Workflow Designer that allows anyone
with no programming skills to design and run complex computational pipelines. Many
algorithmic blocks are implemented in Workflow Designer to perform different kinds of
tasks. These blocks have been used to build a new element that aligns Sanger reads to
a reference sequence. The algorithm is based on the ВLAST [4] tool that first localizes
an appropriate region of the read on the reference. Then its exact location is found by
the Smith-Waterman algorithm [5] that can be launched on GPU to increase the overall
pipeline performance. In the end, all aligned reads are merged into a single multiple
alignment data structure via UGENE’s own merging algorithm.
Results and Conclusion: The algorithm has proved itself as a fast and memory efficient
solution for aligning Sanger reads to a reference. The Workflow Designer tool provides
pipelined data processing for several independent blocks of Sanger data. In the nearest
future the algorithm will be available in the UGENE toolkit along with a new viewer
for raw Sanger data.
Availability: http://ugene.net/download.html
References:
1.  Sanger F., Niclein S., Coulson A.R. (1977) DNA sequencing with chain-terminating inhibitors, Pro-
ceedings of the National Academy of Sciences, USA, 74: 5463-5467.
2.  de Magalhães JP, Finch CE, Janssens G (2010) Next-generation sequencing in aging research: emerg-
ing applications, problems, pitfalls and possible solutions, Ageing Research Reviews 9 (3): 315–323.
3.  Okonechnikov K., Golosova, O., Fursov, M., the UGENE team (2012) Unipro UGENE: a unified
bioinformatics toolkit, Bioinformatics. 28 (8): 1166-1167.
4.  Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) Basic local alignment search
tool, Journal of Molecular Biology 215:403-410.
5.  Smith, Temple F.; and Waterman, Michael S. (1981) Identification of Common Molecular Subse-
quences, Journal of Molecular Biology 147: 195–197.

317
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
GENETIC BASIS OF AGGRESSION: CLUSTERIZATION
OF EXPRESSION PROFILES
E.S. Tiys
1, 2
*, A.O. Bragin
1, 2
, I.V. Medvedeva
1, 2
, I.V. Chadaeva
1
, A.L. Markel
1
,
Y.L. Orlov
1, 2
1
Institute of Cytology of Genetics SB RAS, Novosibirsk, Russia
2
Novosibirsk State University, Novosibirsk, Russia
* Corresponding author: tiys@bionet.nsc.ru
Key words: aggression, RNA-Seq, differential expression, functional annotation, MeV, GOexpress
Motivation and Aim: Aggression is a complex phenomenon of behavior with physiologi-
cal and genetics roots. It is important to find traces of aggression on genetic level. RNA-
seq experiment is invaluable source of information on genetic component of aggressive
behavior.
Methods and Algorithms: We used RNA-Seq data from brain areas of aggressive and
tame rat lines obtained at ICG SB RAS [1]. MirVana™ miRNA Isolation Kit(Ambion)
was used to extract mRNA. NEBNext® mRNA Library Prep Reagent Set for Illumina
(NEB) was used to prepare libraries. Sequencing was conducted according to Illumina
protocol. Reference genome RGSC Rnor_5.0\rn5 and TopHat2 tool were used for the
alignment. The sequence reads were analyzed using Cufflink workflow [2] using FPKM
to quantify expression. Clustering of samples by gene expression was done with MeV
[3] using average linkage clustering. Correlation coefficient was used as a measure of
similarity. Differential expression was calculated with MeV using Wilcoxon test with
FDR multiple tests correction. Ranking of processes was made by GOexpress [4].
Results: Expression profiles were clustered by tissues and the tissue clusters were ac-
curately  clustered  into  aggressive  and  tame  samples.  Differential  expression  was  not
statistically significant. Applying GOexpress ranking allow us to prioritize the following
Gene Ontology biological processes: “negative regulation of transporter activity”, “ex-
trinsic apoptotic signaling pathway in absence of ligand”, “salivary gland cavitation”,
“regulation of response to DNA damage stimulus, establishment of cell polarity”.
Conclusion: On the one hand accurate clusterization of the expression profiles confirms
the presence of molecular genetic traces of aggression behavior in the RNA-seq data, on
the other hand differential expression and prioritizing were not show statistical signifi-
cance. Further analysis is needed to uncover molecular basis of aggression.
Acknowledgements: The work was supported by RSF grant 14-14-00269.
References:
1.  Y.L. Orlov et al. (2012) ICGenomics: program complex for symbol sequence analysis in genomics,
Vavilov journal of genetics and breeding, 16(4/1): 732-741. (In Russian).
2.  http://cole-trapnell-lab.github.io/cufflinks/
3.  A.I. Saeed (2003) TM4: a free, open-source system for microarray data management and analysis,
Biotechniques, 34(2): 374.
4.  K. Rue-Albrecht (2016) GOexpress: an R/Bioconductor package for the identification and visualisation
of robust gene ontology signatures through supervised learning of gene expression data, BMC bioinfor-
matics, 17(1): 1.

318
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
GENEONTOLOGY BIOLOGICAL PROCESSES SENSITIVE
TO SALT DIET CHANGES IN AN EXPREIMENT WITH
105-DAY ISOLATION: STATISTICAL ANALYSIS OF URINE
PROTEOME
E.S. Tiys
1, 2
*, E.D. Petrovskiy
1
, L.Kh. Pastushkova
3
, D.N. Kashirina
3
, I.M. Larina
3
,
V.A. Ivanisenko
1
1
Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
2
Novosibirsk state university, Novosibirsk, Russia
3
Institute for Biomedical Problems RAS, Moscow, Russia
* Corresponding author tiys@bionet.nsc.ru
Key words: salt intake, Gene Ontology, Random forest, GOexpress, Bioconductor
Motivation and Aim: Conducting mass-spectrometry experiments allows to solve impor-
tant problem of protein expression measurement. Promising approach to interpret mass
spectrometric studies is based on the identification of biological processes, protein repre-
sentation of which is the most contrast between case and control samples. The aim of the
study was a computer search for biological processes, changing of functional activity of
which was related with salt intake regime, based on urine proteome analysis in 105-day
volunteer isolation experiment.
Methods and Algorithms: Weekly during the 105-day isolation urine samples from six
volunteers were collected and chromato-mass spectrometry was conducted. Collected
spectra were analysed with Mascot tool. The results consist of lists of identified proteins
for each volunteer and data on salt content in diet (6, 9 or 12 g/day). GOexpress package
[1] of Bioconductor was used to search for relation between salt intake and Gene Ontol-
ogy (GO) processes through corresponding protein sets. Built-in package estimation of
statistical significance was used with 250000 permutations of protein ranks. Processes
with corrected p<0.05 were considered as significant. The Benjamini-Hochberg correc-
tion was used (R language, function p.adjust).
Results: Using GOexpress we identified 131 GO processes, demonstrating statistically
significant relation with salt consumption. Among these, we can emphasize metabolic
processes of connective tissue, such as chondroitin sulfate metabolic process, collagen
catabolic process, hyaluronan metabolic process. This fact consistent with earlier hy-
pothesis of extracellular matrix participation in sodium deposition [2, 3].
Conclusion:  Mass-spectrometry  data  analysis  of  volunteer  urine  proteomes  gave  po-
tential link between changes in salt intake and functioning of biological processes, in-
volving in metabolism of connective tissue, that in agreement with hypothesis about its
protein participation in sodium deposition.
Acknowledgements: Project № 0324-2015-0003 and RSF grant 14-24-00123
References:
1.  K. Rue-Albrecht (2016) GOexpress: an R/Bioconductor package for the identification and visualisation
of robust gene ontology signatures through supervised learning of gene expression data, BMC bioinfor-
matics, 17(1): 1.
2.  I.M. Larina et al., (2012) Reconstruction of associative protein networks connected with processes
of sodium exchange regulation and sodium deposition in healthy volunteers based on urine proteome
analysis, Human Physiology, 38(3): 316–323.
3.  L.Kh. Pastushkova, (2015), Proteome profile of urine of healthy human under normal conditions and
under factors of space flight, thesis [In russian].

319
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
PHSYOLOGICAL AND TRANSCRIPTIONAL CHANGES
IN A BLOSSAM-END ROT RESISTANT TOMATO
INTROGRESSION LINE IL8-3 FRUIT
S. Tomoki, H. Ikeda, Y. Kanayama*
School of Agricultural Science, Tohoku University, Sendai, Japan
* Corresponding author: yoshinori.kanayama.a7@tohoku.ac.jp
Key words: Blossom-end rot, Calcium, Solanum lycopersicum, Solanum pennellii, tomato, transcriptome
Motivation and Aim: Tomato (Solanum lycopersicum) is one of the most important veg-
etables due to its high economic and nutrition values and is also important as an ex-
perimental model species of the Solanaceae family and fleshy-fruited plants. Tomato is
sensitive to Calcium (Ca) deficiency, which causes several physiological disorders in
fruits such as cracking and blossom-end rot (BER). Ca concentration, early fruit growth,
and expression of Ca-movement-related genes were analyzed during early fruit devel-
opment in tomato, which is the most important stage in the incidence of BER, to in-
vestigate physiological mechanisms affecting the occurrence of BER. We used tomato
introgression line IL8-3 having a chromosome segment from the wild relative (Solanum
pennellii) because the line shows lower incidence of BER compared with the parent cul-
tivar ‘M82’ (S. lycopersicum), as described previously. In addition, we tried to analyze
transcriptomic  character  of  early  fruit  development  in  IL8-3  on  microarray  data  and
characterize tomato Ca-movement-related genes by phylogenetic analysis.
Results and Conclusion: Total Ca concentration in fruit and leaves was higher in IL8-
3  than  in  ‘M82’,  whereas  no  significant  differences  were  observed  between  total  Ca
concentration in roots and stems of ‘M82’ and IL8-3. Therefore, the lower incidence
of BER in IL8-3 could be not related to whole-plant uptake but to transport within the
plant. IL8-3 fruit showed a lower growth rate than ‘M82’, which could result in prevent-
ing the occurrence of BER. The expression of genes encoding Ca
2+
/H
+
exchanger, auto-
inhibited Ca
2+
-ATPase, Ca
2+
channel, and Na
+
/Ca
2+
exchanger-like protein, was higher
in IL8-3 fruit than in ‘M82’ fruit, suggesting active Ca movement in IL8-3. In contrast,
the expression of the gene encoding Group 3 Cation/Ca
2+
exchanger was lower in IL8-3
fruit than in ‘M82’. Additionally, the expression level of boron-movement-related genes
is different between IL8-3 and ‘M82’. It is reported that boron deficiency may also affect
the incidence of BER and Ca
2+
transportation. All our results could be related to physi-
ological mechanisms accounting for the lower incidence of BER in IL8-3 and one of
these factors may play a primary role.

320
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
DIFFERENTIAL EXPRESSION OF SHAGGY, A DROSOPHILA
MELANOGASTER GENE ENCODING GSK-3 BETA,
AFFECTS LIFESPAN
M.V. Trostnikov, N.V. Roshina, E.G. Pasyukova
Institute of Molecular Genetics, RAS, Russia
* Corresponding author: egpas@rambler.ru
Key words: Life span, GSK-3 beta, the nervous system, Drosophila
Motivation and Aim: Protein kinases are important proteins involved in multiple phos-
phorylation cascades, and their role in systemic regulation of various metabolic pro-
cesses is well recognized. A serine-threonine kinase GSK-3 beta (glycogen synthase
kinase-3 beta) is an important protein involved in various signaling pathways and meta-
bolic processes, neurogenesis and neuronal function. Earlier, we have shown that sev-
eral genes involved in asymmetric neuroblast division, including shaggy (sgg) encoding
Drosophila GSK-3 beta, affect Drosophila lifespan. Several mutations in sgg increased
male and female mean lifespan by 5% to 50% together with slightly reducing the total
amount of GSK-3 beta protein. In order to further assess the role of GSK-3 beta in the
control of lifespan, aging and neuronal function, we used transgenic lines with additional
copies of the four sgg variants under inducible UAS promoters, to provide sgg overex-
pression; transgenic lines with hairpins, to provide sgg RNA-i knockdown; and special
driver lines, to induce overexpression and knock-down in various tissues and cell types.
Results: We demonstrate that i) the proper expression of the main functional GSK-3
beta transcript, RB, in the nervous system and fat body is essential for survival; ii) sgg
transcripts whose functionality has not previously been demonstrated, RA, RG and RO,
appeared to be functional: alterations in their expression in the nervous system slightly
decrease male lifespan; in muscles, only overexpression of RA affects lifespan; iii) sgg
misexpression in different neurons affects lifespan in a neuron-specific manner: mod-
erate increase and decrease in the amount of the main isoform of GSK-3 beta, PB, in
peptidergic and cholinergic neurons has no effect on lifespan, while in glutaminergic and
motor neurons it increases female and decreases male lifespan; iv) dopaminergic neu-
rons are the most sensitive to sgg function: strong sgg RB overexpression is lethal, while
moderate decrease in PB amount improves female lifespan. Analyses of the molecular
and cellular mechanisms underlying positive effects of sgg misexpression on lifespan
are under way.
Conclusion: Differential expression of GSK-3 beta is one of the mechanisms involved
in a complex regulation of lifespan and aging.

321
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
ADDITIVITY AND NON-ADDITIVITY OF GENETIC
CONTROL OF HUMAN METABOLOME
Y.A. Tsepilov
1, 2
*, S. Shin
3, 4
, N. Soranzo
3
, T.D. Spector
5
, J. Adamski
6, 7, 8
,
G. Kastenmüller
9
, K. Strauch
10, 11
, R. Wang-Sattler
12
, C. Gieger
10
, Y.S. Aulchenko
1, 2
,
J.S. Ried
10
1
Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
2
Novosibirsk State University, Novosibirsk, Russia
3
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, United Kingdom
4
MRC Integrative Epidemiology Unit (IEU), University of Bristol Integrative Epidemiology, Bristol,
United Kingdom
5
Department of Twin Research and Genetic Epidemiology, King’s College London , London, United Kingdom
6
Institute of Experimental Genetics, Genome Analysis Center, Helmholtz Zentrum München - German
Research Center for Environmental Health, Neuherberg, Germany
7
Institute of Experimental Genetics, Life and Food Science Center Weihenstephan, Technische Universität
München, Freising-Weihenstephan, Germany
8
German Center for Diabetes Research, Neuherberg, Germany
9
Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München – German Research Center
for Environmental Health, Neuherberg, Germany
10
Institute of Genetic Epidemiology, Helmholtz Zentrum München – German Research Center for Environ-
mental Health, Neuherberg, Germany
11
Institute of Medical Informatics, Biometry and Epidemiology, Chair of Genetic Epidemiology, Ludwig-
Maximilians-Universität, Munich, Germany
12
Research Unit of Molecular Epidemiology, Helmholtz Zentrum München – German Research Center for
Environmental Health, Neuherberg, Germany
* Corresponding author: tsepilov@bionet.nsc.ru
Key words: genome-wide association studies, non-additive models, KORA, metabolomics, genotypic model
Motivation and Aim:
Genome-wide association studies (GWAS) are widely applied
to analyze the genetic effects on phenotypes. With the availability of high-throughput
technologies for metabolite measurements, GWAS successfully identified loci that affect
metabolite concentrations and underlying pathways. In most GWAS the effect of each
SNP on the phenotype is assumed to be additive. Other genetic models such as reces-
sive, dominant or over-dominant were considered only by very few studies. In contrast
to that, there are theories that emphasize the relevance of non-additive effects as a conse-
quence of physiological mechanisms. This might be especially important for metabolites
as these traits are closer to the underlying pathways than other traits or diseases.
Methods and Algorithms:
In this study we analyzed systematically non-additive effects
on a large panel of serum metabolites and all possible ratios (22,801 in total) in a popula-
tion based study (KORA F4, N = 1,785). We applied four different 1 df tests correspond-
ing to an additive, dominant, recessive and over-dominant trait model and additionally
a genotypic model with 2 df that allows a more general formulation of genetic effects.
Results:
Twenty three loci were found to be genome-wide significantly associated (Bon-
ferroni corrected p-value ≤2.19x10
-12
) with at least one metabolite or ratio. For five of
them we show evidence of non-additive effects. We replicated seventeen loci (including
three loci with non-additive effects) in an independent study (TwinsUK, N = 846).
Conclusion:
We proposed efficient methodology of identification of non-additive loci.
We showed that genetic effects on metabolite concentrations and ratios were mostly ad-
ditive; at the same time we also found significant and replicable non-additive loci.

322
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
IDENTIFICATION OF BREED-SPECIFIC SNP-MARKERS
FOR SUS SCROFA DOMESTICUS USING SRA-DATA
OF NGS PROJECTS
I.S. Tsybovsky*, V.N. Kipen, S.A. Kotova
Scientific and Practical Centre of the State Committee of Forensic Expertises, Minsk, Belarus
* Corresponding author: tsybovsky@yahoo.com
Key words: SNP, Sequence Read Archive, Next-Generation Sequencing, Sus scrofa domesticus
Motivation and Goal: The following domestic pig breeds are widely spread in the Re-
public of Belarus: Large White, Duroc, Landrace, Pietrain and others. Large White pig,
and based on it generated factory types dominate in the breeding livestock. The per-
centage of these four swine breeds represents at least 98% of the total population of
Sus scrofa domesticus in Belarus. The breeders are oftenly faced with the objective to
estimate breeding animals’ breed-purity to correct inbreeding values in order to manage
breed productivity indexes.
The goal of this study is to find breed-specific SNPs for domestic pigs, which are culti-
vated in Belarus: Large White, Landrace, Pietrain, Duroc and Meishan.(for comparison)
SRA-data were analyzed by whole genome sequencing (NGS), located in the public do-
main of the DNAnexus cloud service (http://sra.dnanexus.com/) and NCBI-SRA server
(http: //www.ncbi.nlm.nih .gov / sra). Search Basics were provided by the previously
published results of Ramos A. et al. [1], produced using SNP-chip «PorcineSNP60» (Il-
lumina).
Methods and Algorithms: Analysis was performed using SRA Nucleotide BLAST al-
gorithm and program BioEdit v.7.2.5. The amount included in analysis SNP – 193; the
number of reads for whole genome Large White pigs – 19, for other breeds – 71 (Land-
race – 22, Pietrain – 6, Duroc – 28, Meishan – 15). The total number of analyzed se-
quences is 32 754 738 518.
Results: Finally, 55 breed-specific markers were identified among 193 SNPs, i.e. for
those for which one allele is unique to only one breed. 6 specific SNPs were identified
for the Large White breed (for which minor allele prevalence frequency was in the range
of 7,9-23,7%); for Landrace – 5 (9,1-20,5%); for Pietrain – 7 (16,7-66,7%); for Duroc –
32 (7,1-100%); and for Meishan – 5 (23,3-100%).
Conclusion: Basing on these results it is expected in the future to develop and test SNP-
markers genetic panel intended to provide the differentiation of domestic pig breeds,
such as Large White, Landrace, Duroc, Pietrain and Meishan.
Availability: http://sra.dnanexus.com/, http://www.ncbi.nlm.nih.gov/sra
References:
1.  A.M. Ramos et al. (2011) Identification of high utility SNPs for population assignment and traceability
purposes in the pig using high-throughput sequencing, Animal Genetics, 42: 613-620

323
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
IDENTIFICATION OF THE TAXA OF THE ORDER
ARTIODACTYLA FOR CRIMINAL INVESTIGATION CASES
OF ILLEGAL HUNTING
I.S. Tsybovsky*, S.A. Kotova, V.I. Rybakova, A.A. Rabcava, E.A. Spivak
Scientific and Practical Centre of the State Committee of Forensic Expertises, Minsk, Belarus
* Corresponding author: tsybovsky@ yahoo.com

Download 3.91 Kb.

Do'stlaringiz bilan baham:

1 ... 39 40 41 42 43 44 45 46 ... 49