This is the second data submission of "Genomics research on the drought tolerance mechanism of typical desert plants in the Heihe basin", a key project of the major research program "Integrated research on eco-hydrological processes in the Heihe basin". The project uses sand holly, a typical desert plant, as its material. Its main goal is to decode the whole-genome sequence and the transcriptome of sand holly with next-generation sequencing, to identify genes and gene groups related to drought resistance, and to verify their drought-resistance function by transgenic experiments in model plants such as Arabidopsis and rice.
II. Data content
1. Sequencing of the genome and transcriptome of sand holly
The genome of Mongolian sand holly was estimated at about 926 Mb, with a GC content of 36.88%, a repeat-sequence proportion of 66% and a heterozygosity rate of 0.56%. These values indicate a complex genome with many repeat sequences and high heterozygosity. Based on this estimate, we carried out deep sequencing of the sand holly genome. Assembly of the resulting data produced a 937 Mb genome sequence (table 1), close to the predicted genome size. Transcriptome sequencing and assembly (table 2) yielded more than 77,000 gene coding sequences (unigenes). Annotation showed that most of these sequences are highly similar to sequences from legumes such as soybean, chickpea and bean (figure 1), consistent with the fact that sand holly is a leguminous plant.
2. Discovery of simple sequence repeat (SSR) molecular markers in sand holly
A transcriptome dataset of sand holly already exists in a public online database; its samples were collected in Zhongwei city, Ningxia. The samples of this project were collected in Minqin county, Gansu province. To study whether sand holly from the two areas shows sequence polymorphism, we first identified simple sequence repeat (SSR) markers in the genome of the Minqin plant samples (table 3). We then compared them with the transcriptome sequences of the Zhongwei plant samples and found that some of the SSR markers are polymorphic (table 4). These molecular markers can be used for genetic map construction, QTL mapping and genetic diversity analysis in this species.
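The SSR identification step can be illustrated with a minimal sketch. The record does not say which tool or thresholds the project used, so the function name `find_ssrs` and the minimum repeat counts below are assumptions chosen only for illustration. The sketch reports perfect di-, tri- and tetranucleotide repeats in a DNA string.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of repeats.
# The dataset record does not state the values the project used.
DEFAULT_MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) for perfect SSRs in seq."""
    if min_repeats is None:
        min_repeats = DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit
            # (for example AAAA, or ATAT, which is really an AT repeat).
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


example = "GGC" + "AT" * 7 + "CCG" + "GAA" * 5 + "T"
print(find_ssrs(example))
```

Comparing the repeat count of the same SSR locus between two samples, such as the Minqin and Zhongwei samples, is one way a polymorphic marker could be flagged. Real pipelines usually also handle imperfect repeats and flanking-sequence matching, which this sketch omits.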
III. Data processing instructions
Sample collection site: Minqin county, Gansu province; latitude and longitude: N38°34'25.93", E103°08'36.77". Genome sequencing: a total of 8 genomic DNA libraries of different sizes were constructed and sequenced on an Illumina HiSeq 2500 instrument. Transcriptome sequencing: 24 mRNA transcriptome libraries were constructed and sequenced on an Illumina HiSeq 4000 instrument.
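For mapping or GIS use, the sampling coordinates above are often needed in decimal degrees. The following is a small sketch of that conversion; the helper name `dms_to_decimal` is hypothetical and not part of the dataset.

```python
def dms_to_decimal(degrees, minutes, seconds, hemisphere):
    """Convert degrees/minutes/seconds to signed decimal degrees."""
    value = degrees + minutes / 60 + seconds / 3600
    # South and west are negative by convention.
    return -value if hemisphere in ("S", "W") else value


# Sampling site in Minqin county, as given in this record.
lat = dms_to_decimal(38, 34, 25.93, "N")
lon = dms_to_decimal(103, 8, 36.77, "E")
print(round(lat, 4), round(lon, 4))  # approximately 38.5739 103.1435
```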
IV. Use and significance of the data
We selected a typical desert plant, sand holly, as the research object. From the perspective of genomics, we analyzed its genome and transcriptome sequences to uncover its valuable drought-resistance gene resources and to study its drought-resistance mechanism. The results support the use of this ancient and important plant resource, as well as the genetic breeding of drought-resistant plants, ecological restoration and sustainable development in the Heihe river basin.
File naming and required software
File name: the data are stored in Word document format, and the file is named "genomics research on drought tolerance mechanism of typical desert plants in heihe basin -- data set ii.docx". Reading the data: the file can be opened directly with Microsoft Office, WPS Office or other office software.
He, J., Feng, L. (2016). Genomic studies of drought tolerance mechanisms of a typical plant in Heihe basin - dataset II (2014-2015). A Big Earth Data Platform for Three Poles,
DOI: 10.3972/heihe.0034.2016.db.
CSTR: 18406.11.heihe.0034.2016.db.
When using this data, cite the data citation above; citing the related literature is also recommended.
Copyright & License
To respect intellectual property rights, protect the rights of data authors, expand the services of the data center, and evaluate the application potential of the data, data users should clearly indicate the source and the author of the data in any research results produced with it (including published papers, articles and data products, as well as unpublished research reports, data products and other results). When data is redistributed (released a second time or more), the author must also indicate the source of the original data.
This dataset belongs to the offline data-sharing group, so access requires approval from the data author. Logged-in users can apply for the data online.
Data carries intellectual property rights because it is the result not only of physical labor but also of intellectual activity.
Data sharing must protect the intellectual property rights of data, so that the rights and interests of data producers are protected and data sharing can develop sustainably.
The core intellectual property rights of data include the right of signature, the right of publication and the right of recompilation, among which the right of signature is the most basic.
In academia, traditional literature citation is the best way to reflect the right of signature. It is widely recognized by scientists, so the intellectual property rights of data can be expressed through data citation.
Required Article Citation:
No article citation is listed for this dataset.