International conference on bioinformatics of genome regulation

bet	13/49
Sana	29.01.2018
Hajmi	3,91 Kb.
	#25584

1 ... 9 10 11 12 13 14 15 16 ... 49

Key words

Key words: DNA damage, DNA repair, BER, MMR, epigenetic demethylation
Motivation and Aim: Active DNA demethylation (ADDM) in mammals occurs via hy-
droxylation of 5-methylcytosine (5mC) by TET and/or deamination by AID/APOBEC
family enzymes. The resulting 5mC derivatives are removed through the base excision
repair (BER) pathway (1). At present, it is unclear how the cell manages to eliminate
closely  spaced  5mC  residues  while  avoiding  generation  of  double-strand  breaks  and
whether alternative DNA repair pathways participate in ADDM (2). Recently, was pro-
posed, that cytidine deaminase induced damage can lead to indirect ADDM through the
BER or non-canonical DNA mismatch repair (ncMMR) (3).
Methods and Algorithms: We used a phagemid DNA containing oxidative base lesions
and methylated sites are used to examine the involvement of various DNA repair path-
ways in ADDM in murine and human cell-free extracts.
Results: We demonstrate that, in addition to short-patch BER, 5-hydroxymethyluracil
and uracil mispaired with guanine can be processed by ncMMR and long-patch BER
with concomitant removal of distant 5mC residues. Furthermore, the presence of mul-
tiple mispairs in the same MMR nick/mismatch recognition region together with BER-
mediated nick formation promotes proficient ncMMR resulting in the reactivation of an
epigenetically silenced reporter gene in murine cells.
Conclusion: These findings suggest cooperation between BER and ncMMR in the re-
moval of multiple mismatches that might occur in mammalian cells during ADDM.
This work was supported by Russian Science Foundation 14-24-00093; Fondation ARC
PDF20101202141.
References:
1.  S.Moréra, I.Grin, A.Vigouroux, S.Couvé, V.Henriot, M.Saparbaev, A.A.Ishchenko (2012) Biochemical
and structural characterization of the glycosylase domain of MBD4 bound to thymine and 5-hydroxy-
methyuracil-containing DNA, Nucleic Acids Res., 40(19): 9917-9926.
2.  D.O.Zharkov, I.R.Grin (2012) Mechanisms of somatic mutagenesis and active DNA demethylation in
epigenetic regulation, In: Epigenetics, S.M.Zakian, V.V.Vlasov, E.V.Dementieva, 31-52 (Novosibirsk:
Publishing House of SB RAS).
3.  R.D.Kasymov,  I.R.Grin, A.V.Endutkin,  S.L.Smirnov, A.A.Ishchenko,  M.K.Saparbaev,  D.O.Zharkov
(2013) Excision of 8-oxoguanine from methylated CpG dinucleotides by human 8-oxoguanine DNA
glycosylase, FEBS Lett., 587(18): 3129-3134.

93
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
CONSERVATION LEVEL OF THE KEY MEIOTIC PROTEINS
REFLECTS THEIR FUNCTION AND INDEPENDENT EVO-
LUTION IN DIFFERENT LINEAGES OF EUKARYOTES
T.M. Grishaeva*, Yu.F. Bogdanov
Vavilov Institute of General Genetics RAS, Moscow, Russia
* Corresponding author: grishaeva@vigg.ru
Key words: proteins, amino acid sequence, functional domains, recombination, cohesins, synaptonemal
complex, conservation, evolution
Motivation and Aim: Meiosis is conserved in terms of its cytological appearance in the
evolutionarily row of Eukaryota. Meiosis requires proteins participating in (1) recombi-
nation and (2) chromosome remodeling and synapsis. Among them are meiosis-specific
proteins and proteins shared with mitosis. According to a widespread opinion, group
1 proteins are more conserved than group 2 proteins, mainly due to similarity of their
functional domains. Published estimates are rarely presented in a quantitative form. We
analyzed the amino acid sequences of some key meiotic proteins from both of the groups
in model eukaryotic species and quantitatively estimated the level of their evolutionary
conservation.
Methods and Algorithms: Protein Blast was used to perform pair-wise amino acid se-
quence comparisons for orthologs of seven proteins in eight eukaryotic species repre-
senting fungi, plants, invertebrates and vertebrates. Because the similarity index Score
depends on the protein size, percent similarity was estimated for each protein, taking its
similarity for itself as 100%.
Results: The recombination mediators RAD51 and DMC1 and the mismatch repair pro-
tein MLH1 were shown to be moderately conserved (from 13% to 76% similarity be-
tween proteins from different phyla), while meiosis-specific endonuclease SPO11 had
only a low similarity in the tested proteomes (5-32%). Among structural chromosomal
proteins, cohesins RAD21 and REC8 demonstrated low conservation (0-25% and 0-5%,
respectively). Synaptonemal complex components possessing the HORMA domain
(Hop1 and its orthologs) displayed only a minor similarity between the taxa (2-14%).
In a separate study, the level of conservation was estimated for functional domains of
some proteins because they are thought to be more conserved than the full-size protein
molecules. Unexpected results were obtained. The longer (TOPRIM_topoIIB_SPO) and
short (TPGA_N) domains of SPO11 showed the same or even lower conservation as
compared with the full-size SPO11 (14-38% and 1-36%, respectively). The HORMA
domain was only two to three times more conserved than the full-size protein.
Conclusion: Proteins responsible for the fidelity of meiotic recombination (RAD51,
DMC1, and MLH1) are the most conserved among key meiotic proteins, while cohesins,
synaptonemal complex proteins, and endonuclease SPO11 are less conserved. Meiotic
proteins evolved independently in different phylogenetic lineages of Eukaryota.
Acknowledgements: This work was supported by the Russian Foundation for Basic Re-
search (Grant no. 16-04-01447-a).

94
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
A COMPUTATION SYSTEM FOR RANDOMIZATION-BASED
ENRICHMENT ANALYSIS USING GPU:
PERFORMANCE INVESTIGATION
M. Grishenko
1
, A. Yakimenko
1, 2
, M. Khairetdinov
1, 2
, K. Gunbin
3
1
Novosibirsk State Technical University, Novosibirsk, Russia
2
Institute Computational Mathematics and Mathematical Geophysics SB RAS, Novosibirsk, Russia
3
Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
* Corresponding author: mikhail.grishch@gmail.com
Key words: Randomization, GPU
Motivation and Aim: Processing of genetic data for the analysis genetic determination
of traits is very important problem for modern biology. Resampling methods are wide-
ly used to solve such problems. However, they require much computational resources.
Common trend of increasing the simultaneously tested set of genetic characteristics,
reflecting the simultaneous contribution of many factors in the formation of a single
biological trait is a challenge for computer science. The aim of this paper is improving
of existing algorithm as well as it usability and performance.
Methods and Algorithms: We used graphic processor units (GPUs) for increasing the
performance of permutation test. The whole algorithm workflow can be divided into
three main stages: 1) reading the input file and forming the input data array according to
convenient format for further calculations; 2) the sum calculation of the measured values
of the gene functioning in the context of the experiment under analysis for various gene
characteristics, the cycle composed of mixing of the array elements and after that col-
lecting the quantities necessary for the statistics; 3) the exact calculation of p-values and
the formation of output file. The most consuming and parallelizable is the second stage.
It performed by multiplying two matrices: matrix containing numerical values obtained
by the experiments and a matrix of gene characteristics. The cuBLAS library of matrix-
matrix multiplication, which allowed for a transition of this algorithm to the architecture
of GPUs, was used.
Results:  A  software  for  the  permutation  test  aimed  at  finding  statistically  significant
overrepresented gene characteristics under different external and/or internal conditions
for computing devices: PC with NVIDIA GPU. We improved multiple testing of ge-
netic characteristics overrepresentation by replacing matrix-vector multiplication with
matrix-matrix multiplication. The resulting matrix contains a set of arrays with the sums
reflecting  various  gene  characteristics. We  investigated  performance  of  this  software
depending on the: (1) number of genes; (2) number of genetic characteristics. The execu-
tion time of program using the matrix-matrix multiplication for the simultaneous testing
of characteristics overrepresentation increases slightly with the number of testable char-
acteristics and experiments.
Conclusion: This software allows user to find statistically significant overrepresented
characteristics of genes simultaneously.
Availability: Upon the requests to the authors.
Acknowledgements: This work was supported by the Russian foundation for basic re-
search (Grant No. 16-37-00240)

95
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
ACTIVE MAINTENANCE OF PHYLOTRANSCRIPTOMIC
HOURGLASS PATTERNS IN PLANT AND ANIMAL
EMBRYOGENESIS
H.-G. Drost
1
, A. Gabel
1
, I. Grosse
1, 2
, M. Quint
3, 4
1
Institute of Computer Science, Martin Luther University Halle-Wittenberg, Halle (Saale), Germany
2
German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
3
Department of Molecular Signal Processing, Leibniz Institute of Plant Biochemistry, Halle (Saale),
Germany
4
Institute of Agricultural and Nutritional Sciences, Martin Luther University Halle-Wittenberg, Halle
(Saale), Germany
* Corresponding author: grosse@informatik.uni-halle.de
Key words:
evolution, gene expression, hourglass model
The  developmental  hourglass  model  has  been  used  to  describe  the  morphological
transitions of related species throughout embryogenesis.
Recently,  quantifiable  approaches  combining  transcriptomic  and  evolutionary
information  provided  novel  evidence  for  the  presence  of  a  phylotranscriptomic
hourglass  pattern  across  kingdoms. As  its  biological  function  is  unknown  it  remains
speculative  whether  this  pattern  is  functional  or  merely  represents  a  nonfunctional
evolutionary relic. The latter would seriously hamper future experimental approaches
designed to test hypotheses regarding its function. Here, we address this question by
generating transcriptome divergence index profiles across embryogenesis of Danio rerio,
Drosophila  melanogaster,  and Arabidopsis  thaliana. To  enable  meaningful  evaluation
of the resulting patterns, we develop a statistical test that specifically assesses potential
hourglass patterns. Based on this objective measure we find that two of these profiles
follow a statistically significant hourglass pattern with the most conserved transcriptomes
in the phylotypic periods. As the transcriptome divergence index considers only recent
evolutionary signals, this indicates that the phylotranscriptomic hourglass pattern is not
a rudiment but possibly actively maintained, implicating the existence of some linked
biological function associated with embryogenesis in extant species.

96
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
THE EVOLUTION OF LANGUAGE-READINESS IN THE HOM-
ININ LINEAGE: AN ANALYSIS OF OPEN CHROMATIN RE-
GIONS IMPLICATED IN GENE REGULATION
K. Gunbin
1,*
, A. Benítez-Burraco
2
, F. Gusev
1
, E. Rogaev
1,3
1
Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
2
Department of Philology, University of Huelva, Huelva, Spain
3
University of Massachusetts Medical School, Worcester, USA
e-mail: genkvg@gmail.com
*
Corresponding author
Key words: DNAseI Hypersensitive Sites (DHSs), language-readiness, extinct vs. extant hominins, self-
domestication hypothesis.
Motivation and Aim: Language evolution studies rely on indirect evidence, mostly related
to speech organs and human behaviour. The successful retrieval of ancient genomes is expected
to contribute significantly to our knowledge of how cognition and communication evolved.
Comparative genomic studies have demonstrated >99.9% of identity between modern human
(M) and Neanderthal (N) and Denisovan (D) protein-coding gene sequences, which is in sharp
contrast with the significant (~1 MYA) evolutionary distance between these three hominin
species and with their behavioral and physical distinctive features [1,2]. Therefore, it is
reasonable to anticipate an elevated rate of changes in the regulatory regions of the genes.
Methods and Algorithms: First, we aligned SNPs and indels from D [1], N [2], and UI
[3] that are not found in M. Second, we mapped common changes in the D-N and D-N-
UI clades onto DHSs from M with an annotated effect (activation or repression) on gene
expression [4]. Third, we checked if the positions evolved in D-N and D-N-UI overlapped
with transcription factor binding sites showing signatures of negative or positive selection in
M lineage (using ENCODE RegTfbs data track and INSIGHT approach [5]). This allowed
us to put forth a list of genes affected by changes in DHSs. In our past research we have
put forth a list of candidate genes that could be associated with language-readiness [6-10].
Some of these genes show differences with the N and D homologs in regulatory (including
methylation patterns) or coding regions [6-8]. Finally, we analyzed the nearest interactions
between these two lists using GeneMania data (http://genemania.org/).
Results: Our results are suggestive of potential differences between M and N, D and
UI regarding the regulation pattern of genes involved in osteogenesis, brain function, and
immunity. This is interesting in view of the link between the emergence of language-readiness
and skull-brain cross-talk [6], but also in the brain-immune system cross-talk [11]. We have
also found that the regulation of FOXP1 (in D), and CNTNAP2 (in N), two well-known genes
related to language function could have been affected by severe changes in DHSs of these
hominin species. Finally, we found that several of our candidates for language-readiness
and self-domestication are predicted to interact with genes affected by changes in DHSs.
In the whole, our findings give support to the view that differences in expression and/or
interactions of known candidates for cognitive development and language evolution may
account for the presumed differences in cognitive and linguistic abilities between extinct and
extant hominins.
Availability: Upon the requests to the authors. Acknowledgements: The study is supported
by the grants 14.B25.31.0033 [Resolution No.220], FFI2014-61888-EXP, FFI2013-43823-P).
References:
1. M. Meyer et al. (2012) Science 338:222-226.; 2. G. Renaud et al. (2014) Nature 505:43-49.; 3. Q. Fu et al. (2014)
Nature 514:445-449.; 4. N.C. Sheffield et al. (2013) Genome Research 23:777-788.; 5. L. Arbiza et al. (2013) Na-
ture Genetics 45:723-729.; 6. C. Boeckx, and A. Benítez-Burraco (2014). Front. Psychol. 5:282.; 7. C. Boeckx and
A. Benítez-Burraco, (2014). Front. Psychol. 5:1324.; 8. A. Benítez-Burraco and C. Boeckx (2015). Front. Psychol.
6:794.; 9. A. S. Wilkins et al. (2014). Genetics 197:795-808.; 10. B. Hare, and M. Tomasello (2005). Trends Cogn.
Sci. 9:439-444.; 11. A. Benítez-Burraco and J. Uriagereka, (2016) Front. Mol. Neurosci. 8:84.

97
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
SEARCHING FOR CYTOLYTIC GENETIC MARKERS
OF NEWCASTLE DISEASE VIRUS USING COMPUTER
ASSISTED ANALYSIS
K.V. Gunbin
1, 4
, M.R. Kabilov
2
, K.S. Yurchenko
3
, A.V. Glushchenko
3
, A.M. Shestopalov
3
,
N.V. Gubanova
1, 4
1
Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
2
Institute of Chemical Biology and Fundamental Medicine SB RAS, Novosibirsk, Russia
3
Research Center of Clinical and Experimental Medicine SB RAMS, Novosibirsk, Russia
4
Novosibirsk State University, Novosibirsk, Russia
* Corresponding author: genkvg@gmail.com
Key words: Newcastle Disease Virus, whole genome sequencing, 3D protein structure
Motivation and Aim: Newcastle disease virus (NDV) is one of the potential candidate viruses
for therapy of human tumors. This is enveloped virus containing a non-segmented, single-
stranded, negative-sense RNA genome. The replication of many viruses results in lysis of the
tumor cells and inherent cytotoxicity. However, profound understanding of the mechanisms
of virus replication is still illusive.
Methods and Algorithms: In order to distinguish the cytolytic genetic markers, 15 NDV class
II [1] strains (s.) were tested for their ability to lyse the human tumor cell cultures. Whole
genome sequencing of 8 new NDV s. was carried out. NDV genomes were aligned by SaTe
2.2.7; NDV proteins were aligned using Promals3D. In searching for the genetic basis un-
derlying cytotoxicity, various RNA and protein characteristics were analyzed: (1) common
SNPs and indels; (2) the localization and number of long inverted, tandem, and palindrome
RNA repeats; (3) local RNA similarity and (4) the values of various dinucleotide RNA prop-
erties obtained by various window size; (5) codon adaptation indexes as described in [1]; (6)
the 3D protein structure assisted comparison of conservative and variable positions. For this
analysis M, HN, and F protein structures were taken directly from PDB, the 3D protein struc-
tures of L, NP, and V proteins were modelled using I-TASSER 4.4 and target structures with
DELTA-BLAST E-value<1E-40. The protein structural flexibility was solved by CABS-flex.
Results: In their ability to lyse tumor cells, the strains were divided into two groups: cytolytic
(8 s.) and non-cytolytic (7 s.). Only tiny differences between these groups were identified in
two protein coding regions (<200nt length each) using various dinucleotide RNA properties.
The projection of variable sites on 3D protein structures shown that non-cytolytic strains ac-
cumulate mutations in NP, L, and M proteins in sterically interacted regions with high level
of structural flexibility and high total number of intramolecular contacts significantly more
frequently than cytolytic strains. The same is observed for V proteins in cytolytic strains.
Taking into consideration that these three proteins are critically important to NDV RNA rep-
lication and virion synthesis, it is reasonable to speculate that the cause of NDV cytotoxicity
is the fast kinetic of proteins folding. The structure of NDV V protein resembles Simian virus
5V protein, which is known to modulate the activity of DDB1-CUL4-ROC1 E3 apparatus in
favor of viral infection [3]. Therefore, the development of the same molecular mechanism
promoting the viral infection is probable in cytopathic NDV strains.
Availability: Data available upon the requests to the authors. Acknowledgements: This work was
supported by the grants 14-04-01196 and 2013–2020 (VI.53.1.4, VI.62.1.3, 0309-2014-0007).
References:
1.  D.G. Diel et al. (2012) Infect Genet Evol. 12:1770-1779.
2.  Y.Y. Waldman et al. (2010) Nucleic Acids Res. 38:2964-2974;
3.  S. Angers et al. (2006) Nature. 443:590-593.

98
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
TRANSCRIPTOMIC ANALYSIS OF WHEAT ROOT
IN RESPONSE TO ESSENTIAL NUTRIENT DEFICIENCY:
A GEMOME-WIDE COMPARATIVE STUDY
S. Gupta
1
, B.S. Yadav
2
, S. Freilich
3
, P.K. Varadwaj
1
*
1
Department of Bioinformatics, Indian Institute of Information Technology-Allahabad, India
2
Deparment of Molecular Biology and Ecology of Plants, Tel Aviv University, Israel
3
Systems Biology and Ecology ARO-Volcani Center-Bet-Dagan, Israel
* Corresponding author: Pritish@iiita.ac.in
Key words: genomics, wheat, nutrient deficiency
Plant roots are essential for providing anchorage and uptake of nutrient and water, required
for growth and development. The deficiency of necessary macro-nutrients, i.e. Potassium (K),
Magnesium (Mg), Nitrogen (N), Phosphorus (P), and Sulfate (S) in soil directs to a reduction in
crop yield and growth as a resulted of change in its gene expression profile. Aim of this study is
to identify and annotate such deficiency related responsible genes and proteins as well as high-
lights the cellular and metabolic pathways response which is essential for survival of the plant.
To differentiate the gene expression levels, various nutrient deficiencies (ND) treated, microar-
ray data of wheat root tissue sample were analyzed. To annotate the Gene Ontology (GO) term
for each nutrient deficiency response genes (NDRGs), Singular Enrichment Analysis (SEA)
was performed.Further, the functional enrichment analysis was performed using MapMan an-
notations scheme. Macro-nutrients regulated genes were assigned into a functional category
within the hierarchy of MapMan pathway scheme. Pathway enrichment in each experiment
was determined by calculating the cumulative hypergeometric p value for the probability that
a gene group is over-represented within a functional bin at a rate higher than chance expecta-
tion. Multiple tests were performed for all gene group functional categories at all hierarchical
levels and Heatmap for each gene group was generated using “gpplaot” package in R. To
revalidate the identified results a meta-analysis was performed using root specific earlier iden-
tified sample data of wheat through Genevestigator tool. A set of 435 statistically significant
NDRGs was identified which got differentially expressed in response to at least one among all
ND under study. Total, 58 NGRs were reported to be expressed in response to minimum two
ND. P deficiency was seen to be affecting large numbers of NRGs in- comparison to other ND.
The SEA predicts individual GO classification of NRGs in different biological process, cel-
lular component and molecular function. MapMan analysis showed the carbohydrate and lipid
pathways are up-regulated during potassium deficiency, hormonal (jasomonate and ethylene)
as well amino acids synthesis (Asp) in Mg deficiency. In P deficiency, lipid degradation, MYB
transcription factors, transportation and poly amines pathway were highly expressed but the
cell-wall,  fatty  acid  synthesis,  protein  translation  modification,  DNA  synthesis  and  repairs
and abiotic stress pathways were down-regulated. Amino acid and carbohydrate degraded and
photosynthesis process down during nitrogen deficiency. Polyamine metabolism and protein
synthesis pathways enriched in S deficiency in plant. Moreover, meta-analysis performed the
significance analysis of NRGs, which is based on a large-scale systematic combination of nor-
malized and quality-controlled expression data of root specific 109 samples of wheat. Study
reveals several uncharacterized proteins in response to ND and other stress with provide a
set of known genes, mRNA, proteins. Through functional annotation of these uncharacter-
ized genes, components involved in different metabolic pathways and reaction mechanism of
proteins/enzymes encoded by 155 transcripts were expressed during ND condition. The study
demonstrated the deeper understanding of nutrient utilization in wheat root.

99
THE TENTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE\SYSTEMS BIOLOGY
TRANSCRIPTION BY ALTERNATIVE SIGMA FACTORS:
REVISING THE RIGIDNESS PARADIGM
J. Guzina
1*
, M. Djordjevic
1
1
Faculty of Biology, University of Belgrade, Belgrade, Serbia
* Corresponding author: jelenag@bio.bg.ac.rs

Download 3,91 Kb.

Do'stlaringiz bilan baham:

1 ... 9 10 11 12 13 14 15 16 ... 49