µÎµå·°Á¶°³ (Lamprotula coreana)

|
¢Ã ±¹ ¸í : µÎµå·°Á¶°³
¢Ã ÇÐ ¸í : Lamprotula coreana (Martens, 1886)
¢Ã ÁöÁ¤¹øÈ£ : ¸êÁ¾À§±â ¾ß»ý»ý¹° ¥°±Þ
¢Ã °è Åë : ¿¬Ã¼µ¿¹°¹® > ÀÌ¸ÅÆÐ° > ¼®ÆÐ¸ñ > ¼®ÆÐ°ú > µÎµå·°Á¶°³¼Ó
|
¡Ý Çü Å : °¢Àå 45 mm, °¢°í 40 mm. ÆÐ°¢Àº µÕ±Û°í ´ã¼ö ÀÌ¸ÅÆÐ·ù Áß °¡Àå µÎ²®°í ´Ü´ÜÇÑ ÆÐ°¢À» °®´Â´Ù. °¢ÇǴ Ȳ»ö ¹ÙÅÁ¿¡ Èæ°¥»öÀ» ¶ì°í °ú¸³»óÀÇ ±½Àº µ¹±â°¡ ²®ÁúÀÇ µÚÂÊ¿¡ ƯÈ÷ ¸¹ÀÌ ³ªÅ¸³´Ù. °¢Á¤Àº ¾Õ ÂÊÀ¸·Î Ä¡¿ìÃÄ ÀÖ°í ¾ÕÂÊ µî¼± ¾Æ·¡´Â Á÷¼±»óÀ¸·Î ¹è¼±¿¡ ¿¬°áµÈ´Ù. ÆÐ°¢ ³»¸éÀº ¹é»öÀÇ ¿¬ÇÑ ÀÚÁÖ»öÀÌ´Ù. ÁÖÄ¡¿Í ÈÄÃøÄ¡´Â ¿ÞÂÊ ²®Áú¿¡ 2°³, ¿À¸¥ÂÊ ²®Áú¿¡ ÁÖÄ¡°¡ 3°³, ÃøÄ¡°¡ 1°³¾¿ ÀÖ´Ù.
¡Ý »ý Å : ¼ö½ÉÀÌ ±í°í À¯¼ÓÀÌ ³ôÀ¸¸ç ÇÏ»ó¿¡ ¸ð·¡¿Í ÀÚ°¥ÀÌ È¥ÇÕµÈ °÷¿¡ ¼½ÄÇÑ´Ù. ÀÚ¿õÀÌüÀ̸ç 10¿ù¿¡¼ ´ÙÀ½ÇØ 4¿ù¿¡ ±Û·Î۵ð¿òÀ» ¹æÃâÇÏ´Â µ¿°è »ê¶õÇüÀÌ´Ù. À¯»ýÀº glochidiumÀ¸·Î ¹æÃâµÇ¾î ¾î·ù µîÀÇ ¼÷ÁÖ¿¡ ±â»ý »ýȰÀ» °ÅÄ£ ÈÄ À¯ÆÐ·Î º¯ÅÂÇÑ´Ù.
¡Ý ºÐ Æ÷ : ±Ý° Áö¿ª
¨ç äÁýÁ¤º¸
- ä Áý ÀÚ: °¿ø´ëÇб³ ÀÌÁØ»ó
- äÁýÁö¿ª: ÃæÃ»³²µµ ±Ý»ê±º Á¦¿ø¸é õ³»¸® ÀÏ´ë
¡ß Æ÷ȹÀ§Ä¡ : ±Ý»ê±º õ³»¸® ÀÏ´ë
¡ß ¼½ÄÁö ȯ°æ
¡ß Æ÷ȹÇÑ µÎµå·°Á¶°³ 3°³Ã¼
¨è äÁýÇöȲ ¹× »ùÇÃó¸®
µÎµå·°Á¶°³ äÁýÇöȲ
|
»ùÇÃó¸®
|
äÁýÀÏ
|
°³Ã¼ ¼ö
|
DNAseq
|
RNAseq
|
Ç¥º»
|
2016.07.29
|
3 EA
|
2 EA
|
1 EA
|
-
|
¨é ±âÁ¸ NCBI Taxonomy À¯ÀüÁ¤º¸ ÇöȲ
À¯ÀüÁ¤º¸ Á¾·ù
|
Lamprotula ¼Ó À¯ÀüÀÚ¿ø
|
Lamprotula coreana À¯ÀüÀÚ¿ø
|
Nucleotide
|
210
|
30
|
Protein
|
280
|
29
|
Genome
|
6
|
1
|
Gene
|
127
|
13
|
¨ê Sequencing °á°ú
µÎµå·°Á¶°³
|
Type
|
Total Reads
|
Total Bases
|
Total Bases (Gb)
|
GC Rate
|
ºñ°í
|
DNA seq
|
277,629,844
|
41,922,106,444
|
41.9
|
35.6
|
|
RNA seq
|
77,373,658
|
7,814,739,458
|
7.8
|
42.5
|
|
- Illumina DNA Sequncing °á°ú
SOAP de novo
|
LC-DNA hiseq
|
Input information
|
¡¡
|
¡¡
|
¡¡
|
Region1
|
Region2
|
Total
|
raw data
|
Number Of Reads
|
|
|
138,814,922
|
138,814,922
|
277,629,844
|
Number Of Bases
|
|
|
20,961,053,222
|
20,961,053,222
|
41,922,106,444
|
error correction
|
Number Of Reads
|
|
|
138,306,819
|
135,904,178
|
274,210,997
|
Number Of Bases
|
|
|
20,585,461,708
|
19,678,470,444
|
40,263,932,152
|
pair reads
|
Number Of Reads
|
|
|
|
271,385,608
|
271,385,608
|
Number Of Bases
|
|
|
|
39,882,892,577
|
39,882,892,577
|
single reads
|
Number Of Reads
|
|
|
|
2,825,389
|
2,825,389
|
Number Of Bases
|
|
|
|
381,039,575
|
381,039,575
|
Assembly Results
|
Scaffold Metrics
|
> 1 bp*
|
> 100 bp
|
> 500 bp**
|
> 1000 bp
|
> 2000 bp
|
Number Of Scaffolds
|
|
3,623,201
|
643,092
|
306,974
|
97,622
|
Number Of Bases
|
|
1,367,530,675
|
825,732,596
|
588,962,850
|
298,123,292
|
Avg. Scaffold Size
|
|
377
|
1,284
|
1,918
|
3,053
|
N50 Scaffold Size
|
|
776
|
1,523
|
2,017
|
3,014
|
N80 Scaffold Size
|
|
193
|
827
|
1,329
|
2,323
|
N90 Scaffold Size
|
|
131
|
652
|
1,156
|
2,150
|
largest Scaffold Size
|
|
22,696
|
22,696
|
22,696
|
22,696
|
Contig Metrics
|
> 1 bp*
|
> 100 bp
|
> 500 bp**
|
> 1000 bp
|
> 2000 bp
|
Number Of Contigs
|
31,682,529
|
3,623,201
|
643,092
|
306,974
|
97,622
|
Number Of Bases
|
2,633,147,009
|
1,367,530,675
|
825,732,596
|
588,962,850
|
298,123,292
|
Avg. Contig Size
|
83
|
377
|
1,284
|
1,918
|
3,053
|
N50 Contig Size
|
109
|
776
|
1,523
|
2,017
|
3,014
|
N80 Contig Size
|
39
|
193
|
827
|
1,329
|
2,323
|
N90 Contig Size
|
35
|
131
|
652
|
1,156
|
2,150
|
Largest Contig Size
|
22,696
|
22,696
|
22,696
|
22,696
|
22,696
|
- Illumina RNA Sequncing °á°ú
Total number of raw reads
|
- Number of sequences
|
77,373,658
|
- Number of bases
|
7,814,739,458
|
Contig information
|
- Total number of contig
|
50,607
|
- Number of bases
|
52,650,133
|
- Mean length of contig (bp)
|
1,040.3
|
- N50 length of contig (bp)
|
1,477
|
- GC % of contig
|
37.34
|
- Largest contig (bp)
|
31,416
|
- No. of large contigs (¡Ã500bp)
|
126,231
|
Unigene information
|
- Total number of unigenes
|
71,798
|
- Number of bases
|
98,731,274
|
- Mean length of unigene (bp)
|
1,375.1
|
- N50 length of unigene (bp)
|
2,243
|
- GC % of unigene
|
36.7
|
- Length ranges (bp)
|
224 – 25,876
|
2) ÇöÀç »óȲ¿¡ ´ëÇÑ °íÂû
³ª) ÀÚ»ý µ¿¹°ÀÚ¿ø Genome Survey
(1) °íǰÁú genomic DNA äÃë ¹× Ç¥º»È®º¸
- °Ë¼ö¿ë DNA´Â ´ëºÎºÐ ¸¸Á·ÇÏ¿´°í ¸êÁ¾À§±âÁ¾ÀÎ ´ë»óÁ¾µéÀº äÁý·®ÀÇ Á¦ÇÑÀ¸·Î ÀÎÇØ È®ÁõÇ¥º»À¸·Î Á¦Ãâ°¡´ÉÇÑ Ç¥º»À» È®º¸ÇÏÁö ¸øÇÏ¿© »çÁøÀ¸·Î ´ëóÇϰíÀÚ ÇÔ.
°Ë¼ö¿ë »ùÇà DNA ³óµµ ¹× ÃÑ ¾ç, È®ÁõÇ¥º» ÇöȲǥ
|
Sample ¸í
|
³óµµ(ng/ul)
|
Volume(ul)
|
DNA ÃÑ·®(ug)
|
ÀúÀåÀå¼Ò
|
È®ÁõÇ¥º»
|
µÎµå·°Á¶°³
|
126.527
|
28
|
3.543
|
1XTE pH8.0
|
»çÁøÀ¸·Î´ëü
|
(2) ±âŸºÐ¼®
A. Genome Size estimation – Jellyfish ÇÁ·Î±×·¥
°¢ ´ë»óÁ¾À» ´ë»óÀ¸·Î Jellyfish ÇÁ·Î±×·¥À¸·Î Genome size¸¦ ¿¹ÃøÇÑ °á°ú´Â ¾Æ·¡¿Í °°´Ù. °¢ ºÐ¼®Àº k-mer °ª 17mer, 19mer, 21mer¸¦ ´ë»óÀ¸·Î ÇÏ¿´°í ±×°ÍÀ» ÅëÇØ ¿¹ÃøµÈ genomeÀÇ Å©±â´Â Estimation genome size ¿¡ Ç¥½ÃµÇ¾ú°í, À̸¦ ÅëÇØ À̹ø »ç¾÷¿¡¼ ºÐ¼®ÇÑ ¼¿ÀÇ ¾çÀÇ coverage´Â Genome coverage depth(Coverage Depth)¿¡ Ç¥±âµÇ¾ú´Ù. ±×¸®°í °¢ Á¾ÀÇ Ç¥ ¾Æ·¡ ±×·¡ÇÁ´Â kmer°ª º°·Î ³ªÅ¸³ depthÀÇ ¿¹Ãø ±×·¡ÇÁ·Î peak°ªÀ» È®ÀÎÇÒ ¼ö ÀÖ´Ù.

B. ¹Ýº¹¼¿
- ½ÃÄö½ÌÀÌ ¿Ï·áµÈ »ùÇà DNA ÀÇ contigs Áß 1kb ÀÌ»óÀÇ contigsÀÇ ¹Ýº¹¼¿À» Ž»ö (Repeat masking results of DNA contigs)
µÎµå·°Á¶°³(Lamprotula coreana)
|
¡¡
|
µÎµå·°Á¶°³(Lamprotula coreana) ´Â 1kb ÀÌ»óÀÇ contig ÀÇ ¹Ýº¹¼¿Àº 306,974°³ÀÇ read (588,962,850 bp) À̰í, GC level Àº 34.16% ÀÌ´Ù. Repeat Masking À¸·Î ³ª¿Â ¹Ýº¹¼¿Àº 813,957 bp (ÀüüÀÇ 0.14%) ¸¸Å ÀÖ´Ù. Small RNA Àº 1,368°³ read (83,423 bp) ÀÌ ÀÖ´Ù.
|
sequences: 306,974
|
¡¡
|
total length: 588,962,850 bp (588,962,850 bp excl N/X-runs)
|
¡¡
|
GC level : 34.16%
|
¡¡
|
bases masked: 813,957 bp ( 0.14%)
|
¡¡
|
¡¡
|
number of
elements
|
length
occupied
|
percentage
of sequence
|
|
SINEs:
|
284
|
19,183 bp
|
0.00%
|
|
¡¡
|
ALUs
|
0
|
0bp
|
0.00%
|
|
¡¡
|
MIRs
|
199
|
14,049 bp
|
0.00%
|
|
LINEs:
|
2,421
|
513,969 bp
|
0.09%
|
|
¡¡
|
LINE1
|
169
|
10,686 bp
|
0.00%
|
|
¡¡
|
LINE2
|
229
|
20,145 bp
|
0.00%
|
|
¡¡
|
L3/CR1
|
246
|
16,365 bp
|
0.00%
|
|
LTRelements:
|
165
|
12,903 bp
|
0.00%
|
|
¡¡
|
ERVL
|
11
|
592 bp
|
0.00%
|
|
¡¡
|
ERVL-MaLRs
|
6
|
328 bp
|
0.00%
|
|
¡¡
|
ERV_classI
|
85
|
6,260 bp
|
0.00%
|
|
¡¡
|
ERV_classII
|
26
|
1,446 bp
|
0.00%
|
|
DNAelements:
|
1,830
|
168,386 bp
|
0.03%
|
|
¡¡
|
hAT-Charlie
|
545
|
58,611 bp
|
0.01%
|
|
¡¡
|
TcMar-Tigger
|
299
|
31,545 bp
|
0.01%
|
|
Unclassified:
|
20
|
1,093 bp
|
0.00%
|
|
Total interspersed repeats:
|
¡¡
|
715,534 bp
|
0.12%
|
|
Small RNA:
|
1,368
|
83,423 bp
|
0.01%
|
|
Satellites:
|
235
|
15,373 bp
|
0.00%
|
|
|

µÎµå·°Á¶°³´Â Reference·Î 15,697 bp Å©±âÀÇ Lamprotula coreana ¹ÌÅäÄܵ帮¾Æ genomeÀ» ÀÌ¿ëÇÏ¿© ¸ÊÇÎÇÏ¿´´Ù. °¸Àº 1,708 bp °¡ ³ª¿Ô°í, Coverage ´Â 89.1%·Î È®ÀεǾú´Ù.
D. Transcriptome data ºÐ¼®
- µÎµå·°Á¶°³

µÎµå·°Á¶°³ÀÇ KOG °á°ú, ¡°Infomation Storage and Processing¡± ºÎºÐ¿¡¼ transcription (2.3%), RNA processing and modification (2%) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù. ¡°Cellular Processes and Signaling¡± ¿¡¼ signal transduction mechanisms (15.2%) °¡ °¡Àå ¸¹ÀÌ ¸ÅĪµÈ´Ù. ¶ÇÇÑ Metabolism ¿¡¼´Â 1% ´ë·Î ³·°Ô ¸ÅĪµÇ°í, Multiple Function Àº 30 % ÀÌ´Ù.
µÎµå·°Á¶°³ÀÇ GO °á°ú, Biological Process¿¡¼ cellular process (4,562°³), metabolic process (4,334°³) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù. Molecular Function ¿¡¼´Â binding (7,948°³), catalytic activity (4,313°³) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù. ¶ÇÇÑ, Cellular Component ºÎºÐ¿¡¼´Â membrane (2,217°³), cell (2,136°³), cell part (2,133°³) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù.
(4) Microsatellite È帱º
A. SSR statistics
µÎµå·°Á¶°³
|
Repeats
|
4
|
5
|
6
|
7
|
8
|
9
|
10
|
11
|
12
|
13
|
14
|
Di
|
0
|
0
|
12562
|
8662
|
6387
|
5515
|
5651
|
9422
|
23778
|
16020
|
651
|
Tri
|
0
|
10314
|
5638
|
4680
|
9417
|
4239
|
157
|
0
|
0
|
0
|
0
|
Tetra
|
9757
|
4381
|
6856
|
3063
|
30
|
0
|
0
|
0
|
0
|
0
|
0
|
Penta
|
722
|
403
|
270
|
0
|
0
|
0
|
0
|
0
|
0
|
0
|
0
|
Hexa
|
179
|
19
|
0
|
0
|
0
|
0
|
0
|
0
|
0
|
0
|
0
|
Total
|
10658
|
15117
|
25326
|
16405
|
15834
|
9754
|
5808
|
9422
|
23778
|
16020
|
651
|
¡æ SSR °Ë»ö°á°ú ÃÑ 148,773°³ÀÇ SSR ¼¿À» È®ÀÎÇÒ ¼ö ÀÖ¾úÀ¸¸ç, Di motif °¡ 88,648°³·Î °¡Àå ¸¹ÀÌ Á¸ÀçÇÏ´Â °ÍÀ» È®ÀÎ ÇÒ ¼ö ÀÖ¾úÀ¸¸ç, À̸¦ ±â¹ÝÀ¸·Î ÇÏ¿© SSR marker È帱º¿¡ ´ëÇÑ primer ¸¦ 1,049°³ µðÀÚÀÎ ÇÏ¿´´Ù.
B. Primer sequence
¡æ °¢ Á¾ÀÇ À¯ÀüÀÚ¸¦ ºÐ¼®ÇÏ¿© ±× Á¾À» ±¸º°ÇÒ ¼ö ÀÖ´Â primer¸¦ ¸¸µç´Ù. primer ÀÇ Á¶°ÇÀº Motif ¾à 4~6°³À̸ç, Forward ¿Í Reverse primer °¡ 18~22°³ Á¤µµÀ̰í Tm°ªÀÌ 54.5~55.5 Á¤µµ°¡ Àû´çÇÏ´Ù.
- µÎµå·°Á¶°³
Sequence
|
Motif
|
Forward Primer
|
Tm
|
Reverse Primer
|
62752037
|
(CCACGA)4
|
TCGTTAATCTCTCCAATGAAA
|
55.05
|
CACTACTGGTAGGCAAAGAAA
|
54.84
|
62789653
|
(TGCTAT)4
|
ATCTTCCTTCCTTCTCATTTG
|
55.12
|
GGCCAACATCTCTTACATAAA
|
54.52
|
62831174
|
(TTGG)7
|
CAAAGGTTTTATCCACTTTGA
|
54.59
|
GATCCAATTTCCTCAGCAT
|
55
|
62871592
|
(AGGGGG)4
|
GTAAAAGTTCCCCTGTCCTTA
|
55.09
|
GTGAATAAAATGGTTCCATCA
|
55
|
62905158
|
(GCCACC)4
|
CAGAGAGAAAACGTAAAATGC
|
54.48
|
TATTTACGCCTGACTAAAACG
|
54.99
|
62943753
|
(TAAT)6
|
CTAAACATGAAGGGCATACTG
|
55.04
|
CATTTAAGAAGCTGTCGTCAT
|
54.74
|
62984802
|
(AATCTC)4
|
TCTCAATTCCAATCTGACACT
|
54.74
|
TTGAGGTTGATACTGAGAGTGA
|
55.02
|
63018464
|
(GTAG)4
|
TGATGGATCGTAGATGAGTTT
|
54.72
|
CACATCGTTCATCTGACAATA
|
54.52
|
63057244
|
(ATTT)4
|
GTACTTGCACCCATTTTGTAG
|
54.95
|
ATGCTGCTTGAGCTTTATGTA
|
55.5
|
63094730
|
(TTAT)4
|
TGGGACTATTTTATGAGCAAA
|
55.1
|
TGTAAGCCAACTGGATAAAAC
|
54.61
|
63132429
|
(ACCCTG)4
|
ACCCTAAGAAAACAGACCTTG
|
55.15
|
CAGACTCAGGGTTAGGGATAG
|
55.51
|
63169447
|
(ATTGTC)4
|
CACGGATATAAAGTGGGTTACT
|
55.03
|
TAACATGTTTACGGAAGAAGC
|
54.77
|
63202817
|
(TTTA)4
|
GAAAGATCTGTGCACACTAACA
|
55.56
|
TATTGAATCAGGGTGTACTGG
|
55.11
|
63239029
|
(TACC)7
|
TCTATCGATTTCCTTAACGTG
|
54.7
|
TGAAAAGAGAAAACGAACAAG
|
54.92
|
C. ¹ÙÄÚµå ¼¿
¿ä¹ø 16³âµµ¿¡ äÁýÇÑ 11Á¾ÀÇ À¯ÀüÀÚ¸¦ ºÐ¼®ÇÏ¿© ±× Á¾À» ±¸º°ÇÒ ¼ö ÀÖ´Â Barcode ¼¿À» ¸¸µé¾ú°í, ±× Barcode ¼¿À» °¡Áö°í NCBI (National Center for Biotechnology Information) »çÀÌÆ®¿¡ ÀÖ´Â BLAST (Basic Local Alignment Search Tool)À» ÀÌ¿ëÇÏ¿© °á°ú¸¦ È®ÀÎÇÏ¿´´Ù. °¢ Á¾ÀÇ ¼¿µéÀÇ BLAST °á°ú ¸¶Ä¿·Î¼ ÁÖ·Î »ç¿ëµÇ´Â COI ¼¿°ú ¸ÅÄ¡µÇ´Â °ÍÀ¸·Î º¸¾Æ ¹ÙÄÚµå ¼¿·Î »ç¿ë°¡´ÉÇÒ °ÍÀ¸·Î »ç·áµÈ´Ù.

(5) QC °á°ú
DNA Library QC
|
Name
|
Concentration
(ng/ul)
|
Volume
(ul)
|
Quantity
(ug)
|
Main peak Size (bp)
|
Result
|
µÎµå·°Á¶°³
|
73.87
|
3.27
|
241.81
|
470
|
pass
|
RNA Library QC
|
Name
|
Concentration
(ng/ul)
|
Volume
(ul)
|
Quantity
(ng)
|
Main peak Size (bp)
|
Result
|
µÎµå·°Á¶°³
|
71.31
|
5.34
|
380.92
|
288
|
Pass
|
(6) Gene prediction
»ùÇÃÁ¾
|
Number of Contig (>1kb)
|
Number of Bases (>1kb)
|
N50 Contig Size
|
Number of gene prediction
|
µÎµå·°Á¶°³
|
306,974
|
588,962,850
|
2,017
|
91,223
|
°¢Á¾º°·Î transcriptome ¼¿À» ´ë»óÀ¸·Î 1kb ÀÌ»ó¸¸À» ¼±ÅÃÇÏ¿© Gene predcition ÇÑ °á°ú À§ÀÇ ¸ÅĪµÈ contig, base ¼ö´Â À§ÀÇ Ç¥¿Í °°°í N50µÇ´Â ContigÀÇ ±æÀ̵µ º¸Åë 1~3kb Á¤µµ·Î ³ªÅ¸³µ´Ù. ±×¸®°í À̵éÀ» ÅëÇØ ¿¹ÃøµÈ À¯ÀüÀÚÀÇ ¼ö´Â 2õ°³¿¡¼ 10¸¸°³ ÀÌ»óÀ¸·Î ´Ù¾çÇÏ°Ô ¿¹ÃøµÇ¾ú´Ù.
¡Ø Long contigÀÇ Gene prediction °á°ú ±×¸² ¿¹½Ã

|