Meotjorongbak-ttakjeongbeolle (Damaster mirabilissimus mirabilissimus)

▣ Korean name: Meotjorongbak-ttakjeongbeolle (멋조롱박딱정벌레)
▣ Scientific name: Damaster mirabilissimus mirabilissimus Ishikawa & Deuve
▣ Designation: Endangered Wildlife, Class II
▣ Classification: Arthropoda > Insecta > Coleoptera > Carabidae > Damaster
◎ Morphology: Body length is 23–25 mm in males and 25–28 mm in females. The coloration is very striking: a metallic luster of dark blue, dark green, blue-green, or dark purple shines over a black ground color. The head is conspicuously large and the mouthparts are robustly developed. The neck is notably thick. The elytra are elongate-oval with a net-like sculptured surface. The species cannot fly because its hind wings are degenerate.
◎ Ecology: Endemic to Korea; occurs rarely in primeval mountain forests. Both larvae and adults are nocturnal and prey mainly on snails, earthworms, and lepidopteran larvae.
◎ Distribution: Extremely restricted areas of primeval forest in the central region of Korea.
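① Collection information for Damaster mirabilissimus mirabilissimus
- Collector: Jeong Heon-cheon, Bitgoeul Ecology Research Institute
- Collection site: San, Dongsan-ri, Jinbu-myeon, Pyeongchang-gun, Gangwon-do (fir forest trail in front of Sangwonsa Temple)
◆ Two captured adults of Damaster mirabilissimus mirabilissimus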
② Collection status and sample processing
Collection status of Damaster mirabilissimus mirabilissimus | Sample processing
Collection date | Number of individuals | DNAseq | RNAseq | Specimen
2016.06.19 | 2 EA | 1 EA | 1 EA | -
③ Existing genetic information in NCBI Taxonomy
Type of genetic information | Genus Damaster genetic resources | Damaster mirabilissimus mirabilissimus genetic resources
Nucleotide | 402 | 2
Protein | 395 | 26
Genome | 1 | 1
Gene | 13 | 13
④ Sequencing results
Damaster mirabilissimus mirabilissimus
Type | Total Reads | Total Bases | Total Bases (Gb) | GC Rate (%) | Remarks
DNA seq | 265,230,228 | 40,049,764,428 | 40.0 | 34.8 |
RNA seq | 66,553,052 | 10,049,510,852 | 10.0 | 43.6 |
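As a cross-check on the sequencing table, the mean read length and the yield in gigabases follow directly from the read and base counts. The sketch below is illustrative only: it assumes that Gb denotes 10^9 bases, and the names `RUNS` and `summarize` are hypothetical, not part of the original analysis.

```python
# Read and base counts copied from the sequencing results table.
RUNS = {
    "DNA seq": {"reads": 265_230_228, "bases": 40_049_764_428},
    "RNA seq": {"reads": 66_553_052, "bases": 10_049_510_852},
}


def summarize(reads: int, bases: int) -> dict:
    """Return mean read length (bp) and yield in gigabases (assumed 1 Gb = 1e9 bases)."""
    return {
        "mean_read_length": bases / reads,
        "gigabases": round(bases / 1e9, 1),
    }


for name, run in RUNS.items():
    print(name, summarize(run["reads"], run["bases"]))
```

Both libraries work out to 151 bases per read, and the rounded yields agree with the Total Bases (Gb) column.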
- Illumina DNA sequencing results
SOAP de novo | DM-DNA hiseq
Input information | | Region1 | Region2 | Total
raw data | Number Of Reads | 132,615,114 | 132,615,114 | 265,230,228
raw data | Number Of Bases | 20,024,882,214 | 20,024,882,214 | 40,049,764,428
error correction | Number Of Reads | 131,511,817 | 128,015,022 | 259,526,839
error correction | Number Of Bases | 19,521,040,843 | 18,441,036,998 | 37,962,077,841
pair reads | Number Of Reads | | 255,643,244 | 255,643,244
pair reads | Number Of Bases | | 37,414,558,156 | 37,414,558,156
single reads | Number Of Reads | | 3,883,595 | 3,883,595
single reads | Number Of Bases | | 547,519,685 | 547,519,685
Assembly Results
Scaffold Metrics | > 1 bp* | > 100 bp | > 500 bp** | > 1000 bp | > 2000 bp
Number Of Scaffolds | | 182,729 | 27,526 | 16,059 | 10,931
Number Of Bases | | 178,261,025 | 152,086,295 | 144,148,167 | 137,055,031
Avg. Scaffold Size | | 975 | 5,525 | 8,976 | 12,538
N50 Scaffold Size | | 14,949 | 19,439 | 20,900 | 22,227
N80 Scaffold Size | | 1,156 | 5,554 | 7,183 | 8,820
N90 Scaffold Size | | 240 | 2,037 | 3,555 | 5,075
Largest Scaffold Size | | 175,593 | 175,593 | 175,593 | 175,593
Contig Metrics | > 1 bp* | > 100 bp | > 500 bp** | > 1000 bp | > 2000 bp
Number Of Contigs | 7,027,349 | 363,618 | 74,054 | 39,915 | 16,690
Number Of Bases | 450,397,865 | 173,451,879 | 120,684,540 | 96,530,594 | 64,006,388
Avg. Contig Size | 64 | 477 | 1,629 | 2,418 | 3,835
N50 Contig Size | 56 | 1,251 | 2,148 | 2,725 | 3,874
N80 Contig Size | 35 | 257 | 999 | 1,525 | 2,579
N90 Contig Size | 34 | 147 | 728 | 1,246 | 2,271
Largest Contig Size | 82,858 | 82,858 | 82,858 | 82,858 | 82,858
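The N50, N80, and N90 rows in the scaffold and contig tables are length-weighted statistics. The report does not say how they were computed. The sketch below uses the common definition (the length at which the cumulative sum of sequences, sorted from longest to shortest, first reaches the given share of all bases), which is an assumption here. The function name `nx` and the toy lengths are hypothetical.

```python
def nx(lengths, percent, min_length=0):
    """Return the Nx statistic (N50 for percent=50) of a list of sequence lengths.

    Only sequences longer than `min_length` are considered, mirroring the
    "> 100 bp", "> 500 bp", ... columns of the assembly tables.
    """
    kept = sorted((n for n in lengths if n > min_length), reverse=True)
    if not kept:
        raise ValueError("no sequences pass the length filter")
    total = sum(kept)
    running = 0
    for length in kept:
        running += length
        # Integer comparison avoids floating-point rounding at the threshold.
        if running * 100 >= total * percent:
            return length
    return kept[-1]


# Hypothetical toy assembly, not taken from the report.
toy = [10, 8, 6, 4, 2]
print(nx(toy, 50), nx(toy, 80), nx(toy, 90))
```

Applying the length filter before computing the statistic explains why N50 rises from column to column in the tables: short sequences are removed from both the total and the running sum.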
- Illumina RNA sequencing results
Total number of raw reads
- Number of sequences | 66,553,052
- Number of bases | 10,049,510,852
Contig information
- Total number of contigs | 188,927
- Number of bases | 206,636,027
- Mean length of contig (bp) | 1,093.7
- N50 length of contig (bp) | 2,244
- GC % of contig | 39.59
- Largest contig (bp) | 27,961
- No. of large contigs (≥500 bp) | 92,509
Unigene information
- Total number of unigenes | 48,407
- Number of bases | 87,917,588
- Mean length of unigene (bp) | 1,816.2
- N50 length of unigene (bp) | 3,093
- GC % of unigene | 39.65
- Length ranges (bp) | 224 – 29,278
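2) Discussion of the current status
B) Genome survey of native animal resources
(1) Collection of high-quality genomic DNA and securing of specimens
- The DNA submitted for inspection was satisfactory for most samples. For the target species that are endangered, however, the limited number of individuals collected meant that no specimen suitable for submission as a voucher could be secured, so photographs are to be substituted.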
DNA concentration and total amount of the inspection sample, and voucher specimen status
Sample name | Concentration (ng/ul) | Volume (ul) | Total DNA (ug) | Storage | Voucher specimen
Damaster mirabilissimus mirabilissimus | 130.554 | 66 | 8.617 | 1X TE pH 8.0 | Replaced by photograph
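The total DNA figure in the table is consistent with concentration multiplied by volume. A minimal sketch of that arithmetic follows; the function name `total_dna_ug` is hypothetical.

```python
def total_dna_ug(concentration_ng_per_ul: float, volume_ul: float) -> float:
    """Total DNA in micrograms from concentration (ng/ul) and volume (ul)."""
    return concentration_ng_per_ul * volume_ul / 1000.0  # 1 ug = 1000 ng


# Values from the table: 130.554 ng/ul in 66 ul.
print(round(total_dna_ug(130.554, 66), 3))
```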
(2) Other analyses
A. Genome Size estimation – Jellyfish program
The genome size of each target species was estimated with the Jellyfish program; the results are shown below. Each analysis was run with k-mer sizes of 17, 19, and 21. The genome size estimated from each k-mer size is listed under "Estimation genome size", and the coverage of the sequence data produced in this project, derived from that estimate, is listed under "Genome coverage depth (Coverage Depth)". The graph below each species' table plots the k-mer depth distribution for each k-mer size, from which the peak value can be read.
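The report does not state the formula behind the estimate. A common approach, assumed here, divides the total number of k-mers by the depth at the main peak of the k-mer histogram, after discarding low-depth k-mers that are mostly sequencing errors. The sketch below takes (depth, count) pairs such as the two columns of a k-mer histogram file. The function name, the `min_depth` cutoff, and the toy histogram are all hypothetical.

```python
def estimate_genome_size(histogram, min_depth=3):
    """Estimate genome size from a k-mer histogram.

    histogram: iterable of (depth, number_of_distinct_kmers) pairs.
    min_depth: depths below this are treated as error k-mers and ignored
               (an assumed cutoff, not taken from the report).
    Returns (peak_depth, estimated_genome_size_in_bp).
    """
    usable = [(depth, count) for depth, count in histogram if depth >= min_depth]
    if not usable:
        raise ValueError("histogram has no entries at or above min_depth")
    # The main peak is the depth with the most distinct k-mers.
    peak_depth = max(usable, key=lambda pair: pair[1])[0]
    # Total k-mer occurrences = sum over depths of depth * distinct k-mers.
    total_kmers = sum(depth * count for depth, count in usable)
    return peak_depth, total_kmers // peak_depth


# Hypothetical toy histogram, not data from this project.
toy_histogram = [(1, 5000), (2, 800), (3, 50), (8, 100), (9, 300),
                 (10, 500), (11, 300), (12, 100)]
print(estimate_genome_size(toy_histogram))
```

Real histograms from heterozygous or repeat-rich genomes can show several peaks, so choosing the peak by maximum count alone is a simplification.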

B. Repeat sequences
- Repeats were searched in contigs of 1 kb or longer among the contigs assembled from the sequenced sample DNA (Repeat masking results of DNA contigs)
Damaster mirabilissimus mirabilissimus
For Damaster mirabilissimus mirabilissimus, the contigs of 1 kb or longer comprise 3,915 sequences (96,530,594 bp) with a GC level of 36.77%. Repeat masking identified 56,841 bp of repeats (0.06% of the total). Small RNAs account for 153 elements (7,308 bp).

sequences: 3,915
total length: 96,530,594 bp (96,530,594 bp excl N/X-runs)
GC level: 36.77%
bases masked: 56,841 bp (0.06%)

Class | Number of elements | Length occupied | Percentage of sequence
SINEs: | 12 | 767 bp | 0.00%
  ALUs | 0 | 0 bp | 0.00%
  MIRs | 1 | 63 bp | 0.00%
LINEs: | 56 | 3,428 bp | 0.00%
  LINE1 | 14 | 796 bp | 0.00%
  LINE2 | 11 | 556 bp | 0.00%
  L3/CR1 | 14 | 1,151 bp | 0.00%
LTR elements: | 15 | 1,189 bp | 0.00%
  ERVL | 6 | 401 bp | 0.00%
  ERVL-MaLRs | 0 | 0 bp | 0.00%
  ERV_classI | 6 | 353 bp | 0.00%
  ERV_classII | 0 | 0 bp | 0.00%
DNA elements: | 205 | 43,636 bp | 0.05%
  hAT-Charlie | 89 | 17,788 bp | 0.02%
  TcMar-Tigger | 46 | 9,027 bp | 0.01%
Unclassified: | 0 | 0 bp | 0.00%
Total interspersed repeats: | | 49,020 bp | 0.05%
Small RNA: | 153 | 7,308 bp | 0.01%
Satellites: | 9 | 513 bp | 0.00%
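The figures in the repeat summary can be checked against each other: the top-level interspersed classes sum to the reported interspersed total, and adding the small RNA and satellite rows reproduces the masked total. The sketch below performs that check. It describes only the rows listed here, not how the masking tool computes its totals in general, and the variable names are hypothetical.

```python
# Lengths (bp) of the top-level classes, copied from the repeat summary.
interspersed = {
    "SINEs": 767,
    "LINEs": 3_428,
    "LTR elements": 1_189,
    "DNA elements": 43_636,
    "Unclassified": 0,
}
other = {"Small RNA": 7_308, "Satellites": 513}
total_length_bp = 96_530_594

interspersed_bp = sum(interspersed.values())
masked_bp = interspersed_bp + sum(other.values())
masked_percent = round(100.0 * masked_bp / total_length_bp, 2)

print(interspersed_bp, masked_bp, masked_percent)
```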

For Damaster mirabilissimus mirabilissimus, reads were mapped using the 16,823 bp Damaster mirabilissimus mitochondrial genome as the reference. Gaps totaled 2,550 bp, and coverage was confirmed to be 84.8%.
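The reported coverage is consistent with the share of the reference not falling in gaps. The sketch below shows that arithmetic, assuming coverage is defined as (reference length minus gap length) divided by reference length; the report does not state the definition, and the function name is hypothetical.

```python
def reference_coverage(reference_bp: int, gap_bp: int) -> float:
    """Percent of the reference covered, assuming coverage = (reference - gaps) / reference."""
    if reference_bp <= 0:
        raise ValueError("reference length must be positive")
    return 100.0 * (reference_bp - gap_bp) / reference_bp


# Values from the text: 16,823 bp reference with 2,550 bp of gaps.
print(round(reference_coverage(16_823, 2_550), 1))
```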
D. Transcriptome data analysis
- Damaster mirabilissimus mirabilissimus

In the KOG results for Damaster mirabilissimus mirabilissimus, within “Information Storage and Processing”, RNA processing and modification (3.3%) and transcription (3.1%) are the most highly represented categories. Within “Cellular Processes and Signaling”, signal transduction mechanisms (10.9%), posttranslational modification (4.7%), and cytoskeleton (4.0%) have the most matches. Within Metabolism, carbohydrate transport and metabolism (2.5%) has the most matches, and Multiple Function accounts for 23.6%.
In the GO results for Damaster mirabilissimus mirabilissimus, within Biological Process the most matches are to metabolic process (5,260), followed by cellular process (5,099). Within Molecular Function, binding (5,065) is followed by catalytic activity (4,898). Within Cellular Component, membrane (3,329) is followed by cell (3,099) and cell part (3,069).
(4) Microsatellite candidates
A. SSR statistics
Damaster mirabilissimus mirabilissimus
Repeats | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14
Di | 0 | 0 | 4651 | 3116 | 1917 | 1040 | 781 | 760 | 1206 | 1472 | 428
Tri | 0 | 2136 | 867 | 315 | 90 | 46 | 24 | 3 | 0 | 0 | 0
Tetra | 1680 | 368 | 98 | 35 | 5 | 0 | 0 | 0 | 0 | 0 | 0
Penta | 144 | 23 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Hexa | 3 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
Total | 1827 | 2536 | 5623 | 3466 | 2012 | 1086 | 805 | 763 | 1206 | 1472 | 428
→ The SSR search identified a total of 21,224 SSR sequences, of which Di motifs were the most abundant at 15,371. Based on these, 1,828 primers were designed for SSR marker candidates.
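The SSR search tool and its settings are not named in the report. As a rough illustration of how such a search works, the sketch below scans a sequence for perfect tandem repeats with a regular expression. The minimum repeat counts (Di 6, Tri 5, Tetra/Penta/Hexa 4) are an assumption inferred from the first non-zero column of each row in the SSR statistics table, and `find_ssrs` is a hypothetical name.

```python
import re

# Assumed minimum repeat counts per motif length, inferred from the table.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}
NAMES = {2: "Di", 3: "Tri", 4: "Tetra", 5: "Penta", 6: "Hexa"}


def find_ssrs(sequence):
    """Return (class, motif, repeat_count, start) for each perfect SSR found."""
    sequence = sequence.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(sequence):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "ACAC" is really the Di motif "AC".
            if any(motif == motif[:k] * (size // k)
                   for k in range(1, size) if size % k == 0):
                continue
            hits.append((NAMES[size], motif, len(match.group(0)) // size, match.start()))
    return hits


# Hypothetical test sequence: (AC)7 and (ACAT)4 separated by spacer bases.
example = "GG" + "AC" * 7 + "TTG" + "ACAT" * 4 + "CC"
print(find_ssrs(example))
```

This finds only perfect repeats; dedicated SSR tools also handle compound and interrupted repeats.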
B. Primer sequence
→ Primers that can distinguish each species are designed by analyzing its genes. Suitable primer conditions are roughly 4 to 6 motif repeats, forward and reverse primer lengths of about 18 to 22 bases, and a Tm of about 54.5 to 55.5.
- Damaster mirabilissimus mirabilissimus
Sequence | Motif | Forward Primer | Tm | Reverse Primer | Tm
13974520 | (ACAT)4 | TTCACCGACTCTATACCTTTG | 54.59 | ATGCACAGGTTACTCATAGCA | 55.96
13975538 | (CATA)6 | AACTACAAGGACCACACACAC | 54.97 | TGAAAAACTCTCCACTTACCA | 55.03
13986408 | (GCAAC)4 | GCTCGATAAAAATTACACGAA | 54.86 | ACATTCTTTCACACACACACA | 54.82
13986432 | (ATGT)4 | GGATTCAGTGATGAGATTGAA | 55.09 | GTGAACTAGTGCGCTTATGTC | 55.24
13997532 | (TTTA)6 | TGAAAAGGAAAGTTGAGATGA | 55.08 | CAATACACAACCCAGATAAGC | 54.86
13997724 | (GTAC)4 | ACGACAAGTGAAAAAGCATTA | 55.23 | ACACACACACACACACACTTC | 55.1
14008308 | (ACAT)4 | CCATGACGTAAAAACAAGGT | 55.17 | GAACGAAATAAAAAGGTGGAT | 55.02
14008602 | (TGTT)4 | GTATCGCCTAGTTAATGTTGC | 54.33 | AATGTGTGATGTGAAAGGAAC | 54.95
14018664 | (CATA)4 | TTTCTCAAATCAGTAGCTTGC | 55.02 | TCGTTTATTCATTGCTATGGT | 54.96
14019002 | (GAGC)6 | ATGTGACGTCATCGTTAAGAG | 55.34 | ACACTGGCACTTCTTGTCTC | 55.34
14028586 | (GTTT)4 | CCTACAACCATACGATAGTGC | 54.9 | TTGACCGGTACTGTAGAAGAC | 54.46
14028684 | (GCAA)6 | ACGATGGAGATATTCGTTGTA | 54.78 | CTCTCTTCGTGTGTGTGTGTA | 54.73
14038135 | (TACA)4 | TCGTAACAAAACAGAAACCAT | 54.91 | AACAACCTACTCTTGCATTGA | 55.05
14048286 | (TATC)4 | CTTATGTTGTGCCCATAATCT | 54.4 | AAACATGGATGACATGAGATG | 55.78
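The stated primer conditions (length of about 18 to 22 bases, Tm of about 54.5 to 55.5) can be expressed as a simple filter. The sketch below is an assumption-laden illustration: it takes the Tm values as reported, because the method used to calculate them is not given, and it adds a hypothetical `tolerance` parameter to reflect the "about" in the stated range. The name `check_primer` is also hypothetical.

```python
def check_primer(sequence, tm, min_len=18, max_len=22,
                 tm_min=54.5, tm_max=55.5, tolerance=0.0):
    """Check one primer against the length and Tm conditions.

    tm is the reported melting temperature; it is not recalculated here.
    tolerance widens the Tm window on both sides (hypothetical parameter).
    """
    return {
        "length_ok": min_len <= len(sequence) <= max_len,
        "tm_ok": (tm_min - tolerance) <= tm <= (tm_max + tolerance),
    }


# First row of the primer table: forward and reverse primer with reported Tm.
forward = check_primer("TTCACCGACTCTATACCTTTG", 54.59)
reverse = check_primer("ATGCACAGGTTACTCATAGCA", 55.96)
reverse_relaxed = check_primer("ATGCACAGGTTACTCATAGCA", 55.96, tolerance=0.5)
print(forward, reverse, reverse_relaxed)
```

With a strict window, a few reported Tm values in the table fall slightly outside 54.5 to 55.5, which suggests the range was applied as an approximate target rather than a hard cutoff.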
C. Barcode sequences
Genes of the 11 species collected in 2016 were analyzed to produce barcode sequences capable of distinguishing each species, and these sequences were checked using BLAST (Basic Local Alignment Search Tool) on the NCBI (National Center for Biotechnology Information) site. Because the BLAST results for each species' sequences matched COI sequences, which are commonly used as markers, the sequences are considered usable as barcodes.

(5) QC results
DNA Library QC
Name | Concentration (ng/ul) | Volume (ul) | Quantity (ug) | Main peak Size (bp) | Result
Damaster mirabilissimus mirabilissimus | 26.15 | 3.27 | 85.6 | 470 | Pass
RNA Library QC
Name | Concentration (ng/ul) | Volume (ul) | Quantity (ng) | Main peak Size (bp) | Result
Damaster mirabilissimus mirabilissimus | 71.53 | 5.15 | 368.05 | 299 | Pass
(6) Gene prediction
Sample species | Number of Contigs (>1 kb) | Number of Bases (>1 kb) | N50 Contig Size | Number of predicted genes
Damaster mirabilissimus mirabilissimus | 37,784 | 52,940,062 | 1,337 | 27,379
For each species, gene prediction was performed on transcriptome sequences of 1 kb or longer. The numbers of matched contigs and bases are given in the table above, and the N50 contig length was generally about 1 to 3 kb. The number of predicted genes varied widely across species, from about 2,000 to more than 100,000.
※ Example figure of gene prediction results for long contigs