°¹°Ô (Chasmagnathus convexus)

|
¢Ã ±¹ ¸í: °¹°Ô
¢Ã ¿µ ¸í: mud-flat crab
¢Ã ÇÐ ¸í: Chasmagnathus convexus
¢Ã ÁöÁ¤¹øÈ£: ¸êÁ¾À§±â¾ß»ýµ¿¹°¥±±Þ
¢Ã °è Åë: °©°¢°-½Ê°¢¸ñ-Âü°Ô°ú
|
¡Ý »ý±è»õ : °©°¢ÀÇ À±°ûÀº ¾ç ¿·°¡ÀåÀÚ¸®°¡ º¼·ÏÇÑ »ç°¢ÇüÀε¥ ±× ±æÀÌ´Â ³ªºñÀÇ 10/13Á¤µµÀÌ´Ù. À̸¶´Â ªÀº Çô¸ð¾çÀÌ¾î¼ ±× °¡ÀåÀÚ¸®°¡ µÕ±×½º¸§ÇÏ°í ¾Æ·¡ÂÊÀ¸·Î ¸Å¿ì ±â¿î °ÍÀÌ Æ¯Â¡ÀÌ´Ù. µî¸é¿¡´Â ±íÀº ¼¼·ÎȨÀÌ ÀÖ°í ÀÌ È¨Àº °¡¿îµ¥ À§±¸¿ª¿¡±îÁö »¸´Â´Ù. ´«±¸¸Û µÞ °¡ÀåÀÚ¸®´Â ¶Ñ·ÇÇÑ µÎµÏÀ» ÀÌ·é´Ù. °©°¢ÀÇ ¿·°¡ÀåÀÚ¸®´Â ¸Å¿ì º¼·ÏÇÏ°í ´« µÞ´Ï¸¦ Æ÷ÇÔÇÏ¿© 4°³ÀÇ ³ÐÀº À̰¡ Àִµ¥ ¸Ç µÚÀÇ °ÍÀº ±¸ºÐÀÌ ¶Ñ·ÇÇÏÁö ¾Ê´Ù.
¡Ý ¼½ÄÁö : Çϱ¸ °¡±îÀÌÀÇ ¹Î¹°ÀÌ È帣´Â µµ¶û ¶Ç´Â °¥´ë¹ç¿¡ ±¸¸ÛÀ» ÆÄ°í »ì¸ç »óÁ¶¼± ±Ùó¿¡¼ ±íÀº ±¸¸ÛÀ» ÆÄ°í »ç´Â °Íµµ ÀÖ´Ù. ¼½ÄÁö°¡ µµ·Î °Ç¼³µî¿¡ ÀÇÇÏ¿© ÆÄ±«µÇ¾î ¸êÁ¶À§±â¿¡ óÇÑ Á¾ÀÌ´Ù.
¡Ý ºÐÆ÷ : Çѱ¹?ÀϺ»?Áö¿¡ ºÐÆ÷Çϸç Çѱ¹¿¡¼´Â ÀÎõ °Èµµ¿Í ³²ÇصµÀÇ °¥´ë¹ç¿¡ ¼½ÄÇÑ´Ù. ÃÖ±Ù Á¦ÁÖµµ ¸ð½½Æ÷¿Í ¿©¼ö ¿¬¾È¿¡¼µµ ¼½ÄÀÌ È®Àεǰí ÀÖ´Ù.
1) äÁýÁ¤º¸
- äÁýÇã°¡: 2014.07.10 / 5¸¶¸®
- ä Áý ÀÚ: ½Å¶ó´ëÇб³ °íÇö¼÷
2) äÁýÇöȲ ¹× »ùÇÃó¸®
°¹°Ô äÁýÇöȲ
|
»ùÇÃó¸®
|
äÁýÀÏ
|
°³Ã¼ ¼ö
|
DNAseq
|
RNAseq
|
Ç¥º»
|
2014.08.27
|
2 EA
|
2 EA (¸öÅë)
|
2 EA (³»Àå)
|
-
|
3) QC Data
»ùÇÃÁ¾
|
DNA QC
|
RNA QC
|
Result
|
Concentration (ug/ul)
|
Total Conc. (ug)
|
Concentration (ug/ul)
|
RIN (>7)
|
°¹°Ô
|
45.7
|
3.6
|
606.6
|
6.7
|
pass
|
4) ±âÁ¸ NCBI Taxonomy À¯ÀüÁ¤º¸ ÇöȲ
À¯ÀüÁ¤º¸ Á¾·ù
|
Chasmagnathus ¼Ó À¯ÀüÀÚ¿ø
|
Chasmagnathus convexus À¯ÀüÀÚ¿ø
|
Nucleotide
|
18
|
18
|
Protein
|
3
|
3
|
5) Sequencing ºÐ¼® °á°ú
- Illumina DNA Sequncing °á°ú
Input information
|
¡¡
|
¡¡
|
¡¡
|
Region1
|
Region2
|
Total
|
raw data
|
Number Of Reads
|
|
|
153,623,443
|
153,623,443
|
307,246,886
|
Number Of Bases
|
|
|
19,356,553,818
|
19,356,553,818
|
38,713,107,636
|
error correction
|
Number Of Reads
|
|
|
150,790,723
|
149,270,752
|
300,061,475
|
Number Of Bases
|
|
|
18,299,853,139
|
18,132,103,317
|
36,431,956,456
|
pair reads
|
Number Of Reads
|
|
|
|
|
295,475,870
|
Number Of Bases
|
|
|
|
|
35,919,383,637
|
single reads
|
Number Of Reads
|
|
|
|
|
4,585,605
|
Number Of Bases
|
|
|
|
|
512,572,819
|
Assembly Results
|
Scaffold Metrics
|
> 1 bp*
|
> 100 bp
|
> 500 bp**
|
> 1000 bp
|
> 2000 bp
|
Number Of Scaffolds
|
1,861,541
|
1,861,541
|
459,931
|
225,587
|
80,411
|
Number Of Bases
|
917,173,752
|
917,173,752
|
642,936,252
|
477,829,625
|
275,878,606
|
Avg. Scaffold Size
|
492
|
492
|
1,397
|
2,118
|
3,430
|
N50 Scaffold Size
|
1,071
|
1,071
|
1,711
|
2,279
|
3,412
|
N80 Scaffold Size
|
301
|
301
|
870
|
1,396
|
2,435
|
N90 Scaffold Size
|
160
|
160
|
671
|
1,185
|
2,203
|
largest Scaffold Size
|
34,968
|
34,968
|
34,968
|
34,968
|
34,968
|
Contig Metrics
|
> 1 bp*
|
> 100 bp
|
> 500 bp**
|
> 1000 bp
|
> 2000 bp
|
Number Of Contigs
|
20,933,404
|
3,549,408
|
410,643
|
118,957
|
21,539
|
Number Of Bases
|
1,988,515,949
|
978,693,506
|
391,546,899
|
192,032,899
|
62,192,297
|
Avg. Contig Size
|
94
|
275
|
953
|
1,614
|
2,887
|
N50 Contig Size
|
96
|
365
|
985
|
1,580
|
2,768
|
N80 Contig Size
|
50
|
153
|
645
|
1,176
|
2,231
|
N90 Contig Size
|
46
|
124
|
567
|
1,082
|
2,110
|
Largest Contig Size
|
15,357
|
15,357
|
15,357
|
15,357
|
15,357
|
- Illumina RNA Sequncing °á°ú
Input information
|
¡¡
|
¡¡
|
¡¡
|
Region1
|
Region2
|
Total
|
raw data
|
Number Of Reads
|
|
|
162,451,399
|
162,451,399
|
324,902,798
|
Number Of Bases
|
|
|
20,468,876,274
|
20,468,876,274
|
40,937,752,548
|
error correction
|
Number Of Reads
|
|
|
160,563,707
|
159,222,631
|
319,786,338
|
Number Of Bases
|
|
|
19,866,633,265
|
19,690,735,691
|
39,557,368,956
|
pair reads
|
Number Of Reads
|
|
|
|
|
317,265,048
|
Number Of Bases
|
|
|
|
|
39,282,129,074
|
single reads
|
Number Of Reads
|
|
|
|
|
2,521,290
|
Number Of Bases
|
|
|
|
|
275,239,882
|
Assembly Results
|
Scaffold Metrics
|
> 1 bp*
|
> 100 bp
|
> 500 bp**
|
> 1000 bp
|
> 2000 bp
|
Number Of Scaffolds
|
193,895
|
193,895
|
33,702
|
13,647
|
4,339
|
Number Of Bases
|
73,081,363
|
73,081,363
|
40,849,327
|
26,962,339
|
14,173,580
|
Avg. Scaffold Size
|
376
|
376
|
1,212
|
1,975
|
3,266
|
N50 Scaffold Size
|
619
|
619
|
1,406
|
2,087
|
3,252
|
N80 Scaffold Size
|
209
|
209
|
753
|
1,324
|
2,392
|
N90 Scaffold Size
|
145
|
145
|
612
|
1,148
|
2,185
|
largest Scaffold Size
|
21,944
|
21,944
|
21,944
|
21,944
|
21,944
|
Contig Metrics
|
> 1 bp*
|
> 100 bp
|
> 500 bp**
|
> 1000 bp
|
> 2000 bp
|
Number Of Contigs
|
4,023,890
|
255,107
|
30,210
|
9,990
|
2,178
|
Number Of Bases
|
236,793,481
|
73,420,982
|
30,690,612
|
16,822,864
|
6,239,197
|
Avg. Contig Size
|
58
|
287
|
1,015
|
1,683
|
2,864
|
N50 Contig Size
|
62
|
382
|
1,085
|
1,696
|
2,775
|
N80 Contig Size
|
34
|
161
|
668
|
1,209
|
2,246
|
N90 Contig Size
|
32
|
125
|
577
|
1,097
|
2,109
|
Largest Contig Size
|
11,371
|
11,371
|
11,371
|
11,371
|
11,371
|
³ª) ¸êÁ¾À§±âÁ¾ µ¿¹° Genome Survey
°Ë¼ö¿ë »ùÇà DNA ³óµµ ¹× ÃÑ ¾ç, È®ÁõÇ¥º» ÇöȲǥ
|
¡¡Sample¸í
|
³óµµ(ng/ul)
|
Volume(ul)
|
DNA ÃÑ·®(ug)
|
O.D A260/280
|
È®ÁõÇ¥º»
|
äÁýÇã°¡ÀÏ
|
äÁýÀÏ
|
°¹°Ô (¼öÄÆ)
|
110
|
996
|
109
|
1.56
|
X
|
2014.7.10
|
2014.8.27
|
(2) À¯Àüü ¼¿È®º¸ : ÇöÀç 20Á¾ È®º¸
Sample
|
±¸ºÐ
|
raw data size
|
trim data size
|
ºñ°í
|
°¹°Ô
|
DNA
|
38,713,107,636
|
36,431,956,456
|
¡¡
|
RNA
|
40,937,752,548
|
39,557,368,956
|
¡¡
|
(3) ±âŸºÐ¼®
A. Genome Size estimation
»ù Çà ¸í
|
Kmer size
|
Kmer total
|
Pk depth
|
Genome_size
|
Coverage(X)
|
°¹°Ô
|
23
|
23,965,257,108
|
9
|
2,662,806,345
|
11.5385
|
B. ¹Ýº¹¼¿
- ½ÃÄý½ÌÀÌ ¿Ï·áµÈ °¹°Ô DNA ¹× RNAÀÇ contigs Áß 1kb ÀÌ»óÀÇ contigsÀÇ ¹Ýº¹¼¿À» Ž»ö(Left; Repeat masking results of DNA contigs, Right; Repeat masking results of RNA contigs).
sequences: 118,957
|
|
sequences: 9,990
|
total length: 192,032,899 bp (192,032,899 bp excl N/X-runs)
|
|
total length: 16,822,864bp (16,822,864bp excl N/X-runs)
|
GC level: 42.55%
|
|
GC level: 47.17%
|
bases masked: 564,853 bp (0.29%)
|
|
bases masked: 21,554 bp (0.13%)
|
|
number of
elements
|
length
occupied
|
percentage
of sequence
|
|
|
number of
elements
|
length
occupied
|
percentage
of sequence
|
SINEs:
|
79
|
4,590 bp
|
0.00%
|
|
SINEs:
|
3
|
178 bp
|
0.00%
|
|
ALUs
|
0
|
0 bp
|
0.00%
|
|
|
ALUs
|
0
|
0 bp
|
0.00%
|
|
MIRs
|
65
|
3,731 bp
|
0.00%
|
|
|
MIRs
|
2
|
124 bp
|
0.00%
|
LINEs:
|
2,195
|
358,387 bp
|
0.19%
|
|
LINEs:
|
43
|
4,931 bp
|
0.03%
|
|
LINE1
|
54
|
3,300 bp
|
0.00%
|
|
|
LINE1
|
2
|
135 bp
|
0.00%
|
|
LINE2
|
26
|
1,508 bp
|
0.00%
|
|
|
LINE2
|
2
|
193 bp
|
0.00%
|
|
L3/CR1
|
1,949
|
330,647 bp
|
0.17%
|
|
|
L3/CR1
|
35
|
4,387 bp
|
0.03%
|
LTR elements:
|
303
|
22,297 bp
|
0.01%
|
|
LTR elements:
|
8
|
936 bp
|
0.01%
|
|
ERVL
|
19
|
1,246 bp
|
0.00%
|
|
|
ERVL
|
3
|
150 bp
|
0.00%
|
|
ERVL-MaLRs
|
5
|
305 bp
|
0.00%
|
|
|
ERVL-MaLRs
|
1
|
61 bp
|
0.00%
|
|
ERV_classI
|
229
|
16,722 bp
|
0.01%
|
|
|
ERV_classI
|
1
|
87 bp
|
0.00%
|
|
ERV_classII
|
33
|
2,034 bp
|
0.00%
|
|
|
ERV_classII
|
1
|
71 bp
|
0.00%
|
DNA elements:
|
1,273
|
151,813 bp
|
0.08%
|
|
DNA elements:
|
58
|
14,499 bp
|
0.09%
|
|
hAT-Charlie
|
305
|
41,456 bp
|
0.02%
|
|
|
hAT-Charlie
|
8
|
3,595 bp
|
0.02%
|
|
TcMar-Tigger
|
743
|
88,342 bp
|
0.05%
|
|
|
TcMar-Tigger
|
43
|
9,434 bp
|
0.06%
|
Unclassified:
|
3
|
164 bp
|
0.00%
|
|
Unclassified:
|
0
|
0 bp
|
0.00%
|
Total interspersed repeats:
|
|
537,251 bp
|
0.28%
|
|
Total interspersed repeats:
|
|
20,544 bp
|
0.12%
|
Small RNA:
|
414
|
27,650 bp
|
0.01%
|
|
Small RNA:
|
16
|
1,010 bp
|
0.01%
|
Satellites:
|
1
|
46 bp
|
0.00%
|
|
Satellites:
|
0
|
0 bp
|
0.00%
|
Simple repeats:
|
0
|
0 bp
|
0.00%
|
|
Simple repeats:
|
0
|
0 bp
|
0.00%
|
Low complexity:
|
0
|
0 bp
|
0.00%
|
|
Low complexity:
|
0
|
0 bp
|
0.00%
|
D. Mito-genome Map

(4) Microsatellite È帱º
A. SSR statistics
Motif(-mer)
|
2
|
3
|
4
|
5
|
6
|
7
|
8
|
9
|
10
|
Total
|
°Ù°Ô
|
89,764
|
45,550
|
7,655
|
846
|
60
|
1
|
-
|
2
|
1
|
143,879
|
B. ¹ÙÄڵ弿
>°¹°Ô(Chasmagnathus convexus)
TCGAAAATGGATGTATTTAAATTACGGTCTGTAAGTAGTATAGTAATAGCTCCAGCTAGA
ACTGGCAAGGATAAAAGAAGTAGGATGGCAGTAATAAATACAGCTCAAACAAAAAGGGGT
ATTTGGTCTATCATTATCCCATATGATCGTATATTGATAACGGTTGTTATGAAATTTACA
GCTCCTAGAATTGAGGATACTCCTGCAAGATGAGGAGAAAAAATTCCAAGATCTACGGAG
GCCCCAGCATGGGTGATAGCAGCTGCAAGGGGTGGATAAACAGTTCATCCTGTGCCCACT
CCTCTTTCTACTATTCTTCTTGTTAGGAGAGAAAGAGATGGAGGAAGAAGTCAAAATCTT
ATATTATTTATACGAGGAAAGGCTTTATCTGGTGCTCCTAGTATAAGGGGAACTAGTCAG
TTACCGAATCCTCCAGTTATAATGGGTATAACTATAAAGAAAATTATGACAAAAGCGTGG
GCTGTCACAACCACATTGTAAATTTGGTCATTACCGATTAATCTTCCTGGTTGACTTAGT
TCTGCTCGAATAATTAATCTTACGGATGTACCTACTATTCCGGCTCCAGCGCCGAAAATA
AAATATAAGGTTCCAATGTCTT
°¥°Ô(Helice tientsinensis) : family ±îÁö °°À½
Helice latimera : family ±îÁö °°À½

|