Ulleungdo snail (Karaftohelix adamsi)

▣ Korean name: Ulleungdo-dalpaengi (Ulleungdo snail)
▣ Scientific name: Karaftohelix adamsi Kuroda & Hukuda, 1944
▣ Designation: Endangered Wildlife, Class II
▣ Classification: Mollusca > Gastropoda > Eupulmonata > Bradybaenidae > Karaftohelix
◎ Morphology: Shell height 9 mm, shell diameter 14 mm. The spire has 4.5 whorls and the body is reddish brown. The periphery of the body whorl carries a blunt angle, along which a single dark reddish-brown band encircles the body whorl. Coarse growth lines cover the entire shell. The umbilicus is narrow and deep. The aperture is oblique, ovate with a flattened basal lip, and slightly expanded.
◎ Ecology: Hermaphroditic. Fertilized eggs are laid in a single batch and development is direct. The species lives in forests, beneath humid shrub thickets or on trees.
◎ Distribution: Endemic to Korea. Ulleungdo is the type locality.
① Collection information
- Collector: Lee Jun-sang, Kangwon National University
- Collection site: Seonginbong area, Ulleung-gun, Gyeongsangbuk-do

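② Collection status and sample processing
| Collection status: date | Collection status: no. of individuals | Sample processing: DNAseq | Sample processing: RNAseq | Sample processing: specimen |
| 2016.08.25 | 2 EA | 1 EA | 1 EA | - |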
③ Existing genetic information in NCBI Taxonomy
| Type of genetic information | Genus Karaftohelix genetic resources | Karaftohelix adamsi genetic resources |
| Nucleotide | 61 | - |
| Protein | 10 | - |
④ Sequencing results
Ulleungdo snail
| Type | Total Reads | Total Bases | Total Bases (Gb) | GC Rate | Remarks |
| DNA seq | 301,367,732 | 37,972,334,232 | 38.0 | - | |
| RNA seq | 72,798,798 | 9,172,648,548 | 9.2 | - | |
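The Gb column and the implied read length can be rechecked from the read and base totals. The sketch below is only an arithmetic check on the figures in the table; the names `RUNS`, `summarize`, and `SUMMARY` are illustrative and not part of the original report.

```python
# Totals copied from the sequencing summary table.
RUNS = {
    "DNA seq": {"reads": 301_367_732, "bases": 37_972_334_232},
    "RNA seq": {"reads": 72_798_798, "bases": 9_172_648_548},
}


def summarize(reads: int, bases: int) -> dict:
    """Return mean read length (bp) and yield in gigabases (1 Gb = 1e9 bases)."""
    return {
        "mean_read_length": bases / reads,
        "gigabases": round(bases / 1e9, 1),
    }


SUMMARY = {name: summarize(**run) for name, run in RUNS.items()}

for name, stats in SUMMARY.items():
    print(name, stats)
```

Both libraries work out to a mean read length of 126 bp, and the rounded yields match the 38.0 Gb and 9.2 Gb reported.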
- Illumina DNA sequencing results
| SOAP de novo | KA-DNA hiseq |
Input information
| Stage | Metric | Region1 | Region2 | Total |
| raw data | Number Of Reads | 150,683,866 | 150,683,866 | 301,367,732 |
| raw data | Number Of Bases | 18,986,167,116 | 18,986,167,116 | 37,972,334,232 |
| error correction | Number Of Reads | 145,215,767 | 143,526,128 | 288,741,895 |
| error correction | Number Of Bases | 17,165,231,480 | 16,821,247,935 | 33,986,479,415 |
| pair reads | Number Of Reads | | 280,695,668 | 280,695,668 |
| pair reads | Number Of Bases | | 33,171,264,113 | 33,171,264,113 |
| single reads | Number Of Reads | | 8,046,227 | 8,046,227 |
| single reads | Number Of Bases | | 815,215,302 | 815,215,302 |
Assembly Results
| Scaffold Metrics | > 1 bp* | > 100 bp | > 500 bp** | > 1000 bp | > 2000 bp |
| Number Of Scaffolds | 2,864,610 | 2,864,610 | 702,012 | 154,765 | 10,959 |
| Number Of Bases | 1,080,542,080 | 1,080,542,080 | 586,172,615 | 212,714,185 | 27,364,546 |
| Avg. Scaffold Size | 377 | 377 | 834 | 1,374 | 2,496 |
| N50 Scaffold Size | 540 | 540 | 844 | 1,331 | 2,364 |
| N80 Scaffold Size | 287 | 287 | 610 | 1,106 | 2,115 |
| N90 Scaffold Size | 153 | 153 | 552 | 1,050 | 2,055 |
| Largest Scaffold Size | 26,600 | 26,600 | 26,600 | 26,600 | 26,600 |
| Contig Metrics | > 1 bp* | > 100 bp | > 500 bp** | > 1000 bp | > 2000 bp |
| Number Of Contigs | 53,121,597 | 4,575,747 | 134,014 | 5,603 | 180 |
| Number Of Bases | 2,958,625,374 | 878,101,771 | 87,975,412 | 7,444,288 | 1,003,710 |
| Avg. Contig Size | 55 | 191 | 656 | 1,328 | 5,576 |
| N50 Contig Size | 51 | 202 | 630 | 1,190 | 8,088 |
| N80 Contig Size | 35 | 127 | 541 | 1,056 | 2,895 |
| N90 Contig Size | 34 | 112 | 519 | 1,024 | 2,345 |
| Largest Contig Size | 26,600 | 26,600 | 26,600 | 26,600 | 26,600 |
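For readers unfamiliar with the N50, N80, and N90 rows, the sketch below shows the usual definition: sort sequences from longest to shortest and report the length at which the running total first reaches the given share of all bases. It assumes this standard definition; the report does not say how its assembler computed these values. The toy lengths are hypothetical, and `nxx` and `mean_length` are illustrative names.

```python
def nxx(lengths, percent=50):
    """Return the Nxx length for a list of sequence lengths."""
    if not lengths:
        raise ValueError("no sequence lengths given")
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Integer comparison avoids floating-point rounding at the threshold.
        if running * 100 >= total * percent:
            return length


def mean_length(total_bases, n_sequences):
    """Average sequence size, truncated to whole bases."""
    return total_bases // n_sequences


toy_lengths = [100, 80, 60, 40, 20]  # hypothetical lengths, 300 bases in total
print(nxx(toy_lengths, 50), nxx(toy_lengths, 80), nxx(toy_lengths, 90))

# Scaffold totals for the "> 1 bp" column of the assembly table.
print(mean_length(1_080_542_080, 2_864_610))
```

With the scaffold totals from the table, the truncated mean reproduces the reported average scaffold size of 377 bp.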
- Illumina RNA sequencing results
Total number of raw reads
| - Number of sequences | 72,798,798 |
| - Number of bases | 9,172,648,548 |
Contig information
| - Total number of contigs | 247,039 |
| - Number of bases | 113,912,872 |
| - Mean length of contig (bp) | 461 |
| - N50 length of contig (bp) | 479 |
| - GC % of contig | 42 |
| - Largest contig (bp) | 12,388 |
| - No. of large contigs (≥500 bp) | 60,078 |
Unigene information
| - Total number of unigenes | 51,683 |
| - Number of bases | 36,127,974 |
| - Mean length of unigene (bp) | 699 |
| - N50 length of unigene (bp) | 887 |
| - GC % of unigene | 42 |
| - Length ranges (bp) | 224 – 13,102 |
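2) Discussion of the current status
B) Genome survey of native animal resources
(1) High-quality genomic DNA extraction and specimen acquisition
- DNA submitted for inspection met the requirements in most cases. For the target species that are endangered, limited collection quantities meant that no specimen suitable for submission as a voucher could be secured, so photographs will be substituted.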
DNA concentration and total amount of inspection samples, and voucher specimen status
| Sample name | Concentration (ng/ul) | Volume (ul) | Total DNA (ug) | Storage | Voucher specimen |
| Ulleungdo snail | 171.4 | 79.91 | 13.7 | 1XTE pH8.0 | Replaced by photograph |
(2) Other analyses
A. Genome size estimation – Jellyfish program
Genome size was estimated for each target species with the Jellyfish program; the results are shown below. Each analysis was run at k-mer sizes of 17, 19, and 21. The genome size estimated from each run is reported under "Estimation genome size", and the coverage provided by the sequence data generated in this project is reported under "Genome coverage depth (Coverage Depth)". The graph below each species' table plots the depth distribution for each k-mer size, from which the peak value can be read.
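The report does not state which formula was applied to the Jellyfish output. A common approach, sketched here as an assumption, divides the total number of k-mers by the depth at the main peak of the histogram, after discarding the low-depth k-mers that mostly come from sequencing errors. The input is assumed to be two-column "depth count" text, the layout printed by `jellyfish histo`. The toy histogram, the valley-based error cutoff, and the function names are hypothetical.

```python
def parse_histo(text):
    """Parse two-column 'depth count' lines into a {depth: count} dict."""
    histo = {}
    for line in text.strip().splitlines():
        depth, count = line.split()
        histo[int(depth)] = int(count)
    return histo


def estimate_genome_size(histo, min_depth=None):
    """Return (estimated genome size, peak depth) as total k-mers / peak depth.

    If min_depth is not given, the first local minimum of the histogram is
    used as the cutoff below which k-mers are treated as errors.
    """
    depths = sorted(histo)
    if min_depth is None:
        min_depth = depths[0]
        for prev, cur in zip(depths, depths[1:]):
            if histo[cur] > histo[prev]:
                min_depth = prev  # counts start rising again: prev is the valley
                break
    usable = {d: c for d, c in histo.items() if d >= min_depth}
    peak = max(usable, key=usable.get)
    total_kmers = sum(d * c for d, c in usable.items())
    return total_kmers // peak, peak


# Hypothetical histogram: an error spike at depth 1 and a main peak at depth 6.
TOY_HISTO = """
1 1000
2 200
3 50
4 80
5 150
6 200
7 150
8 80
9 30
"""

size, peak = estimate_genome_size(parse_histo(TOY_HISTO))
print(size, peak)
```

Running the same calculation at k = 17, 19, and 21 and comparing the estimates is a simple consistency check, which matches the three k-mer sizes used in the report.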

B. Repeat sequences
- Repeat sequences were searched in the contigs of 1 kb or longer from the sequenced sample DNA (Repeat masking results of DNA contigs)
Ulleungdo snail (Karaftohelix adamsi)
For the Ulleungdo snail (Karaftohelix adamsi), the contigs of 1 kb or longer comprise 4,605 sequences (6,217,408 bp) with a GC level of 39.06%. Repeat masking identified 5,127 bp of repeats (0.08% of the total). Small RNA accounts for 9 elements (624 bp).
sequences: 4,605
total length: 6,217,408 bp (6,217,408 bp excl N/X-runs)
GC level: 39.06%
bases masked: 5,127 bp (0.08%)
| Class | Family | Number of elements | Length occupied | Percentage of sequence |
| SINEs: | | 1 | 50 bp | 0.00% |
| | ALUs | 0 | 0 bp | 0.00% |
| | MIRs | 0 | 0 bp | 0.00% |
| LINEs: | | 13 | 1,732 bp | 0.03% |
| | LINE1 | 1 | 55 bp | 0.00% |
| | LINE2 | 1 | 51 bp | 0.00% |
| | L3/CR1 | 4 | 195 bp | 0.00% |
| LTR elements: | | 3 | 181 bp | 0.00% |
| | ERVL | 0 | 0 bp | 0.00% |
| | ERVL-MaLRs | 0 | 0 bp | 0.00% |
| | ERV_classI | 1 | 62 bp | 0.00% |
| | ERV_classII | 1 | 44 bp | 0.00% |
| DNA elements: | | 22 | 2,436 bp | 0.04% |
| | hAT-Charlie | 7 | 755 bp | 0.01% |
| | TcMar-Tigger | 9 | 979 bp | 0.02% |
| Unclassified: | | 0 | 0 bp | 0.00% |
| Total interspersed repeats: | | | 4,399 bp | 0.07% |
| Small RNA: | | 9 | 624 bp | 0.01% |
| Satellites: | | 2 | 104 bp | 0.00% |

For the Ulleungdo snail, reads were mapped against the 14,039 bp mitochondrial genome of Aegista diversifamilia as the reference. The gaps totaled 633 bp and the coverage was confirmed to be 95.5%.
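The reported coverage follows directly from the reference length and the gap total, assuming coverage is defined as the fraction of reference bases not in gaps. The function name below is illustrative.

```python
def reference_coverage(reference_length, gap_bases):
    """Return (covered bases, coverage in percent) for a mapped reference."""
    covered = reference_length - gap_bases
    return covered, 100 * covered / reference_length


# Figures from the mitochondrial mapping described above.
covered, pct = reference_coverage(14_039, 633)
print(covered, round(pct, 1))
```

This gives 13,406 covered bases, which rounds to the stated 95.5%.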
D. Transcriptome data analysis
- Ulleungdo snail

In the KOG results for the Ulleungdo snail, within "Information Storage and Processing" the most frequently matched categories were transcription (4.9%) followed by replication/recombination/repair (4.5%). Within "Cellular Processes and Signaling", signal transduction mechanisms and posttranslational modification were the most frequently matched at 5.1%. Within Metabolism, amino acid transport and metabolism (2.3%) was frequently matched, and Multiple Function accounted for 29.4%.
In the GO results for the Ulleungdo snail, within Biological Process the most frequently matched terms were metabolic process (1,049) followed by cellular process (824). Within Molecular Function they were binding (1,708) followed by catalytic activity (1,162). Within Cellular Component they were membrane (474), cell (462), and cell part (458), in that order.
(4) Microsatellite candidates
A. SSR statistics
Ulleungdo snail
| Repeats | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
| Di | 0 | 0 | 104 | 49 | 33 | 20 | 12 | 26 | 15 | 5 |
| Tri | 0 | 124 | 51 | 27 | 21 | 4 | 0 | 0 | 0 | 0 |
| Tetra | 70 | 26 | 17 | 3 | 0 | 0 | 0 | 0 | 0 | 0 |
| Penta | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Hexa | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Total | 82 | 151 | 172 | 79 | 54 | 24 | 12 | 26 | 15 | 5 |
→ The SSR search identified 620 SSR sequences in total, of which di-nucleotide motifs were the most abundant (264). On this basis, 343 primers were designed for SSR marker candidates.
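The report does not name the SSR search tool or its settings. As a rough illustration of how such a search works, the sketch below scans a sequence with regular expressions. The minimum repeat counts (di 6, tri 5, tetra/penta/hexa 4) are an assumption inferred from the first non-zero column of each row in the SSR statistics table, and the test sequence is hypothetical.

```python
import re

# Assumed minimum repeat counts per motif length (inferred, not stated in the report).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A motif of unit_len bases followed by at least min_rep - 1 copies of itself.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(2)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. AAAA, or TATA which is counted as the di motif TA instead).
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(1)) // unit_len))
    return sorted(hits)


# Hypothetical sequence: (AC)7 and (TTCACT)4 separated by filler bases.
toy_seq = "GCA" + "AC" * 7 + "GGTCA" + "TTCACT" * 4 + "GG"
print(find_ssrs(toy_seq))
```

One design choice differs from the report: this sketch folds motifs such as TATA into their shortest unit, whereas the primer table lists (TATA)4 and (CACA)4 as tetra motifs, so the original search evidently did not apply that filter.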
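B. Primer sequence
→ Primers that can distinguish each species are designed by analyzing its genes. Suitable primer conditions are a motif value of about 4 to 6 (the listed motifs are 4 to 6 bases long and repeated 4 to 6 times), forward and reverse primers of about 18 to 22 bases each, and a Tm of roughly 54.5 to 55.5.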
- Ulleungdo snail
| Sequence | Motif | Forward Primer | Forward Tm | Reverse Primer | Reverse Tm |
| 107317884 | (TTCACT)6 | GCAACCTCATTAACTCATCTG | 54.97 | CAAGTTTAGTCAGAAGCTTGG | 54.55 |
| 107309687 | (TCACC)5 | TCTCACCTCACCTCATCATAC | 55.03 | GTGCATTGAGATGCATTGT | 55.38 |
| 107310883 | (TTGTT)5 | ACACGCAATAACAACAAAGTT | 55.02 | TTGTGCTAAGTGAGAGCAAGT | 55.45 |
| 107310979 | (CTTGT)5 | AGAAGACAAGCGAAATAAGGT | 54.99 | CAGAGTAATTTCAGACGTCCA | 55.48 |
| 107314189 | (GCGAT)5 | GATACGATGCGATGATGTG | 55.81 | CCTCTACCCTCTACCGTAAAA | 55.28 |
| 107316943 | (TTGGT)5 | GGAAAGGTTTAAGGGTTGTAA | 55.1 | ACCAGGATGTATCAAAAGCTC | 55.9 |
| 107309333 | (CTTT)4 | GAGTTTGGAATCACAACAAGT | 54.24 | GCTCGTGCACCATATAGATAA | 55.55 |
| 107309827 | (CCTG)4 | GTCCTGTTATTTGTGTGTGTG | 53.88 | CAATAAATCATGCCCTAATGA | 55.3 |
| 107309893 | (TATA)4 | TACATTTCTTTGCATTCTTGG | 55.57 | ACCTCCTCGTTCTTACAAATC | 55.06 |
| 107310321 | (CACA)4 | TCTAGTGTTGTGAGGAAGCAT | 55.04 | CAAGGTAAGTTGAAACCATTG | 54.92 |
| 107310329 | (TGTT)4 | TGTTTGTCAAAACAACTGTGA | 55.18 | ACATGGTGGTTGTCAATGTAG | 55.81 |
| 107310451 | (GTAC)4 | ACAATGAAGCACTACATCCAG | 55.34 | GTTTTGTACGTGAACCTGTGT | 55.16 |
| 107310695 | (CTAC)4 | CAGCGACAATAACCAAGTTAC | 55.11 | CAGGTAGGTAGATCGGTAGGT | 54.98 |
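The stated primer conditions (about 18 to 22 bases, Tm of roughly 54.5 to 55.5) can be applied as a simple screen. The sketch below checks three rows copied from the primer table against those ranges. The Tm values are taken as reported rather than recomputed, since the report does not say which Tm formula was used; `check_primer` and `REPORT` are illustrative names.

```python
LENGTH_RANGE = (18, 22)  # primer length in bases, from the stated conditions
TM_RANGE = (54.5, 55.5)  # approximate Tm window, from the stated conditions

# (sequence ID, direction, primer, reported Tm) copied from three table rows.
PRIMERS = [
    ("107317884", "F", "GCAACCTCATTAACTCATCTG", 54.97),
    ("107317884", "R", "CAAGTTTAGTCAGAAGCTTGG", 54.55),
    ("107309687", "F", "TCTCACCTCACCTCATCATAC", 55.03),
    ("107309687", "R", "GTGCATTGAGATGCATTGT", 55.38),
    ("107309827", "F", "GTCCTGTTATTTGTGTGTGTG", 53.88),
    ("107309827", "R", "CAATAAATCATGCCCTAATGA", 55.30),
]


def check_primer(primer, tm):
    """Return (length_ok, tm_ok) for one primer against the stated ranges."""
    length_ok = LENGTH_RANGE[0] <= len(primer) <= LENGTH_RANGE[1]
    tm_ok = TM_RANGE[0] <= tm <= TM_RANGE[1]
    return length_ok, tm_ok


REPORT = {
    (seq_id, direction): check_primer(primer, tm)
    for seq_id, direction, primer, tm in PRIMERS
}

for key, (length_ok, tm_ok) in REPORT.items():
    print(key, "length ok" if length_ok else "length out of range",
          "Tm ok" if tm_ok else "Tm out of range")
```

All six primers fall within the length range, while the forward primer of 107309827 (Tm 53.88) lies below the stated window, which is consistent with the conditions being described as approximate.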
C. Barcode sequences
Genes from the 11 species collected in 2016 were analyzed to produce barcode sequences that can distinguish each species, and the results were checked with BLAST (Basic Local Alignment Search Tool) on the NCBI (National Center for Biotechnology Information) site. Because the BLAST results for each species' sequences matched COI sequences, which are commonly used as markers, the sequences are considered usable as barcodes.

(5) QC results
DNA Library QC
| Name | Concentration (ng/ul) | Volume (ul) | Quantity (ug) | Main peak Size (bp) | Result |
| Ulleungdo snail | 171.43 | 79.91 | 13.7 | 500 | pass |
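The DNA quantity in the table is consistent with concentration multiplied by volume. The sketch below is only a unit-conversion check on the reported DNA figures; the function name is illustrative.

```python
def total_ug(concentration_ng_per_ul, volume_ul):
    """Total mass in micrograms from concentration (ng/ul) and volume (ul)."""
    return concentration_ng_per_ul * volume_ul / 1000  # 1 ug = 1000 ng


# DNA Library QC values for the Ulleungdo snail.
dna_quantity = total_ug(171.43, 79.91)
print(round(dna_quantity, 1))
```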
RNA Library QC
| Name | Concentration (ng/ul) | Volume (ul) | Quantity (ng) | Main peak Size (bp) | Result |
| Ulleungdo snail | 180 | 6.2 | 29 | 318 | Pass |
(6) Gene prediction
| Sample species | Number of Contigs (>1kb) | Number of Bases (>1kb) | N50 Contig Size | Number of predicted genes |
| Ulleungdo snail | 4,605 | 6,217,408 | 1,195 | 2,291 |
For each species, only transcriptome sequences of 1 kb or longer were selected for gene prediction. The numbers of matched contigs and bases are given in the table above, and the N50 contig length was typically about 1 to 3 kb. The number of genes predicted from them varied widely, from about 2,000 to more than 100,000.
※ Example figure of gene prediction results for a long contig