¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ (Damaster mirabilissimus mirabilissimus)

±×¸²ÀÔ´Ï´Ù.

¢Ã ±¹    ¸í : ¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

¢Ã ÇР   ¸í : Damaster mirabilissimus mirabilissimus
Ishikawa & Deuve

¢Ã ÁöÁ¤¹øÈ£ : ¸êÁ¾À§±â ¾ß»ý»ý¹° ¥±±Þ

¢Ã °è    Åë : ÀýÁöµ¿¹°¹® > °ïÃæ°­ > µüÁ¤¹ú·¹¸ñ > µüÁ¤¹ú·¹°ú
> Á¶·Õ¹ÚµüÁ¤¹ú·¹¼Ó


¡Ý Çü Å : ¸ö±æÀÌ ¼öÄÆ 23¡­25 mm, ¾ÏÄÆ 25¡­28 mmÀÌ´Ù. ¸öºû ±òÀº ¸Å¿ì È­·ÁÇѵ¥, ¾îµÎ¿î Ǫ¸¥»ö, ¾îµÎ¿î ³ì»ö, û·Ï»ö, ¾îµÎ¿î ÀÚÁÖ»ö µîÀÇ ±Ý¼Ó¼º ±¤ÅÃÀÌ °ËÀº»ö ¹ÙÅÁ À§¿¡¼­ ºû³­´Ù. ¸Ó¸®°¡ ´«¿¡ ¶ç°Ô Å©°í, ±¸±â(ÀÔÆ²)°¡ ưưÇÏ°Ô ¹ß´ÞÇØ ÀÖ´Ù. ¸ñÀÌ À¯³­È÷ ±½´Ù. µüÁö³¯°³´Â ±ä Ÿ¿øÇüÀÌ°í ±×¹°´«Ã³·³ Á¶°¢ ±¸Á¶·Î µÅ ÀÖ´Ù. ³¯Áö ¸øÇÏ´Â °ÍÀº µÞ³¯°³°¡ ÅðÈ­Ç߱⠶§¹®ÀÌ´Ù.


¡Ý »ý Å : Çѱ¹ °íÀ¯Á¾À¸·Î »êÁöÀÇ ¿ø½Ã¸²¿¡ µå¹°°Ô ¼­½ÄÇÑ´Ù. À¯Ãæ°ú ¼ºÃæÀº ¸ðµÎ ¾ßÇ༺À̸ç, ´ÞÆØÀ̳ª Áö··ÀÌ, ³ªºñ¸ñÀÇ À¯Ãæ µîÀ» ÁÖ·Î Àâ¾Æ¸Ô´Â´Ù.


¡Ý ºÐ Æ÷ : ÁߺÎÁö¹æ ¿ø½Ã¸²Áö´ë ±ØÈ÷ Á¦ÇÑµÈ Áö¿ª

¨ç ¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ äÁýÁ¤º¸

- ä Áý ÀÚ: ºû°íÀ»»ýÅ¿¬±¸¼Ò Á¤Çåõ

- äÁýÁö¿ª: °­¿øµµ Æòⱺ ÁøºÎ¸é µ¿»ê¸® »ê (»ó¿ø»ç¾Õ Àü³ª¹« ½£±æ)

   ¹­À½ °³Ã¼ÀÔ´Ï´Ù.

¡ßÆ÷ȹÇÑ ¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ ¼ºÃæ 2°³Ã¼


¨è äÁýÇöȲ ¹× »ùÇÃó¸®

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ äÁýÇöȲ

»ùÇÃó¸®

äÁýÀÏ

°³Ã¼ ¼ö

DNAseq

RNAseq

Ç¥º»

2016.06.19

2 EA

1 EA

1 EA

-


¨é ±âÁ¸ NCBI Taxonomy À¯ÀüÁ¤º¸ ÇöȲ

`À¯ÀüÁ¤º¸ Á¾·ù

Genus Damaster À¯ÀüÀÚ¿ø

Damaster mirabilissimus mirabilissimus À¯ÀüÀÚ¿ø

Nucleotide

402

2

Protein

395

26

Genome

1

1

Gene

13

13


¨ê Sequencing °á°ú

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

Type

Total Reads

Total Bases

Total Bases (Gb)

GC Rate

ºñ°í

DNA seq

265,230,228 

40,049,764,428 

40.0 

34.8 

 

RNA seq

66,553,052 

10,049,510,852 

10.0 

43.6 

 


 - Illumina DNA Sequncing °á°ú

SOAP de novo

DM-DNA hiseq

Input information

¡¡

¡¡

¡¡

Region1

Region2

Total

raw data

Number Of Reads

 

 

132,615,114

132,615,114

265,230,228

Number Of Bases

 

 

20,024,882,214

20,024,882,214

40,049,764,428

error correction

Number Of Reads

 

 

131,511,817

128,015,022

259,526,839

Number Of Bases

 

 

19,521,040,843

18,441,036,998

37,962,077,841

pair reads

Number Of Reads

 

 

 

255,643,244

255,643,244

Number Of Bases

 

 

 

37,414,558,156

37,414,558,156

single reads

Number Of Reads

 

 

 

3,883,595

3,883,595

Number Of Bases

 

 

 

547,519,685

547,519,685

Assembly Results

Scaffold Metrics

> 1 bp*

> 100 bp

> 500 bp**

> 1000 bp

> 2000 bp

Number Of Scaffolds

 

182,729

27,526

16,059

10,931

Number Of Bases

 

178,261,025

152,086,295

144,148,167

137,055,031

Avg. Scaffold Size

 

975

5,525

8,976

12,538

N50 Scaffold Size

 

14,949

19,439

20,900

22,227

N80 Scaffold Size

 

1,156

5,554

7,183

8,820

N90 Scaffold Size

 

240

2,037

3,555

5,075

largest Scaffold Size

 

175,593

175,593

175,593

175,593

Contig Metrics

> 1 bp*

> 100 bp

> 500 bp**

> 1000 bp

> 2000 bp

Number Of Contigs

7,027,349

363,618

74,054

39,915

16,690

Number Of Bases

450,397,865

173,451,879

120,684,540

96,530,594

64,006,388

Avg. Contig Size

64

477

1,629

2,418

3,835

N50 Contig Size

56

1,251

2,148

2,725

3,874

N80 Contig Size

35

257

999

1,525

2,579

N90 Contig Size

34

147

728

1,246

2,271

Largest Contig Size

82,858

82,858

82,858

82,858

82,858

 

 - Illumina RNA Sequncing °á°ú

Total number of raw reads

- Number of sequences

66,553,052

- Number of bases

10,049,510,852

Contig information

- Total number of  contig

188,927

- Number of bases

206,636,027

- Mean length of contig (bp)

1,093.7

- N50 length of contig (bp)

2,244

- GC % of contig

39.59

- Largest contig (bp)

27,961

- No. of large contigs (¡Ã500bp)

92,509

Unigene information

- Total number of  unigenes

48,407

- Number of bases

87,917,588

- Mean length of unigene (bp)

1,816.2

- N50 length of unigene (bp)

3,093

- GC % of unigene

39.65

- Length ranges (bp)

224 – 29,278

 

2) ÇöÀç »óȲ¿¡ ´ëÇÑ °íÂû

³ª) ÀÚ»ý µ¿¹°ÀÚ¿ø Genome Survey

(1) °íǰÁú genomic DNA äÃë ¹× Ç¥º»È®º¸

- °Ë¼ö¿ë DNA´Â ´ëºÎºÐ ¸¸Á·ÇÏ¿´°í ¸êÁ¾À§±âÁ¾ÀÎ ´ë»óÁ¾µéÀº äÁý·®ÀÇ Á¦ÇÑÀ¸·Î ÀÎÇØ È®ÁõÇ¥º»À¸·Î Á¦Ãâ°¡´ÉÇÑ Ç¥º»À» È®º¸ÇÏÁö ¸øÇÏ¿© »çÁøÀ¸·Î ´ëóÇϰíÀÚ ÇÔ.


°Ë¼ö¿ë  »ùÇà DNA ³óµµ ¹× ÃÑ ¾ç, È®ÁõÇ¥º» ÇöȲǥ

Sample ¸í

³óµµ(ng/ul)

Volume(ul)

DNA ÃÑ·®(ug)

ÀúÀåÀå¼Ò

È®ÁõÇ¥º»

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

130.554

66

8.617

1XTE pH8.0

»çÁøÀ¸·Î´ëü


(2) ±âŸºÐ¼®

A. Genome Size estimation – Jellyfish ÇÁ·Î±×·¥


°¢ ´ë»óÁ¾À» ´ë»óÀ¸·Î Jellyfish ÇÁ·Î±×·¥À¸·Î Genome size¸¦ ¿¹ÃøÇÑ °á°ú´Â ¾Æ·¡¿Í °°´Ù. °¢ ºÐ¼®Àº k-mer °ª 17mer, 19mer, 21mer¸¦ ´ë»óÀ¸·Î ÇÏ¿´°í ±×°ÍÀ» ÅëÇØ ¿¹ÃøµÈ genomeÀÇ Å©±â´Â Estimation genome size ¿¡ Ç¥½ÃµÇ¾ú°í, À̸¦ ÅëÇØ À̹ø »ç¾÷¿¡¼­ ºÐ¼®ÇÑ ¼­¿­ÀÇ ¾çÀÇ coverage´Â Genome coverage depth(Coverage Depth)¿¡ Ç¥±âµÇ¾ú´Ù. ±×¸®°í °¢ Á¾ÀÇ Ç¥ ¾Æ·¡ ±×·¡ÇÁ´Â kmer°ª º°·Î ³ªÅ¸³­ depthÀÇ ¿¹Ãø ±×·¡ÇÁ·Î peak°ªÀ» È®ÀÎÇÒ ¼ö ÀÖ´Ù.


±×¸²ÀÔ´Ï´Ù.
¿øº» ±×¸²ÀÇ À̸§: CLP000012080012.bmp
¿øº» ±×¸²ÀÇ Å©±â: °¡·Î 908pixel, ¼¼·Î 596pixel


B. ¹Ýº¹¼­¿­

-  ½ÃÄö½ÌÀÌ ¿Ï·áµÈ »ùÇà DNA ÀÇ contigs Áß 1kb ÀÌ»óÀÇ contigsÀÇ ¹Ýº¹¼­¿­À» Ž»ö (Repeat masking results of DNA contigs)


¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

(Damaster mirabilissimus mirabilissimus)

¡¡

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ (Damaster mirabilissimus mirabilissimus)  ´Â 1kb ÀÌ»óÀÇ contig ÀÇ ¹Ýº¹¼­¿­Àº 3,915°³ read (96,530,594 bp) À̰í, GC level Àº 36.77% ÀÌ´Ù. Repeat Masking À¸·Î ³ª¿Â ¹Ýº¹¼­¿­Àº 56,841 bp (ÀüüÀÇ 0.06%) ¸¸Å­ ÀÖ´Ù. Small RNA Àº 153°³ read (7,308 bp) ÀÌ ÀÖ´Ù.

sequences: 3,915

¡¡

total length: 96,530,594 bp (96,530,594 bp excl N/X-runs)

¡¡

GC level : 36.77%

¡¡

bases masked: 56,841 bp ( 0.06%)

¡¡

¡¡

number of

elements

length

occupied

percentage

of sequence

 

SINEs: 

12

767 bp

0.00%

 

¡¡

ALUs 

0

 0 bp

0.00%

 

¡¡

MIRs 

1

63 bp

0.00%

 

LINEs: 

56

3,428 bp

0.00%

 

¡¡

LINE1 

14

796 bp

0.00%

 

¡¡

LINE2 

11

556 bp

0.00%

 

¡¡

L3/CR1 

14

1,151 bp

0.00%

 

LTRelements: 

15

1,189 bp

0.00%

 

¡¡

ERVL 

6

401 bp

0.00%

 

¡¡

ERVL-MaLRs

0

0 bp

0.00%

 

¡¡

ERV_classI 

6

353 bp

0.00%

 

¡¡

ERV_classII

0

 0 bp

0.00%

 

DNAelements: 

205

43,636 bp

0.05%

 

¡¡

hAT-Charlie 

89

17,788 bp

0.02%

 

¡¡

TcMar-Tigger

46

9,027 bp

0.01%

 

Unclassified: 

0

0 bp

0.00%

 

Total interspersed repeats:

¡¡

49,020 bp

0.05%

 

Small RNA:

153

7,308 bp

0.01%

 

Satellites:

9

513 bp

0.00%

 

 



C. Mito-genome Map

±×¸²ÀÔ´Ï´Ù.
¿øº» ±×¸²ÀÇ À̸§: 2016_Mito_mapping-¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹.jpg
¿øº» ±×¸²ÀÇ Å©±â: °¡·Î 3824pixel, ¼¼·Î 1212pixel
»çÁø ÂïÀº ³¯Â¥: 2016³â 11¿ù 27ÀÏ ¿ÀÈÄ 2:09
ÇÁ·Î±×·¥ À̸§ : Adobe Photoshop CC (Windows)
»ö ´ëÇ¥ : sRGB

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹´Â Reference·Î 16,823 bp Å©±âÀÇ Damaster mirabilissimus ¹ÌÅäÄܵ帮¾Æ genomeÀ» ÀÌ¿ëÇÏ¿© ¸ÊÇÎÇÏ¿´´Ù. °¸Àº 2,550 bp °¡ ³ª¿Ô°í, Coverage ´Â 84.8%·Î È®ÀεǾú´Ù. 

D. Transcriptome data ºÐ¼®

-  ¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

±×¸²ÀÔ´Ï´Ù.
¿øº» ±×¸²ÀÇ À̸§: CLP00001208001f.bmp
¿øº» ±×¸²ÀÇ Å©±â: °¡·Î 912pixel, ¼¼·Î 578pixel


¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ÀÇ KOG °á°ú, ¡°Infomation Storage and Processing¡± ºÎºÐ¿¡¼­ RNA processing and modification (3.3%), transcription (3.1%) µîÀÌ ¸¹ÀÌ ¹ßÇöµÈ´Ù. ¡°Cellular Processes and Signaling¡± ¿¡¼­´Â signal transduction mechanisms (10.9%), post translational modification (4.7%), cytoseleton (4.0%) µîÀÌ ¸¹ÀÌ ¸ÅĪµÈ´Ù. ¶ÇÇÑ Metabolism ¿¡¼­´Â carbohydrate transport and metabolism (2.5%) ÀÌ ¸¹ÀÌ ¸ÅĪµÇ°í, Multiple Function Àº 23.6% ÀÌ´Ù.

±×¸²ÀÔ´Ï´Ù.
¿øº» ±×¸²ÀÇ À̸§: GO_Á¤¸®º»-¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹.jpg
¿øº» ±×¸²ÀÇ Å©±â: °¡·Î 4413pixel, ¼¼·Î 2931pixel
»çÁø ÂïÀº ³¯Â¥: 2016³â 11¿ù 27ÀÏ ¿ÀÈÄ 10:26
ÇÁ·Î±×·¥ À̸§ : Adobe Photoshop CC (Windows)
»ö ´ëÇ¥ : sRGB


¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹ÀÇ GO °á°ú, Biological Process¿¡¼­ metabolic process (5,260°³), cellular process (5,099°³) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù. Molecular Function ¿¡¼­´Â binding (5,065°³), catalytic activity (4,898°³) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù. ¶ÇÇÑ, Cellular Component ºÎºÐ¿¡¼­´Â membrane (3,329°³), cell (3,099°³), cell part (3,069°³) ¼øÀ¸·Î ¸¹ÀÌ ¸ÅĪµÈ´Ù.

(4) Microsatellite È帱º

A. SSR statistics


¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

Repeats

4

5

6

7

8

9

10

11

12

13

14

Di

0

0

4651

3116

1917

1040

781

760

1206

1472

428

Tri

0

2136

867

315

90

46

24

3

0

0

0

Tetra

1680

368

98

35

5

0

0

0

0

0

0

Penta

144

23

7

0

0

0

0

0

0

0

0

Hexa

3

9

0

0

0

0

0

0

0

0

0

Total

1827

2536

5623

3466

2012

1086

805

763

1206

1472

428


¡æ SSR °Ë»ö°á°ú ÃÑ 21,224°³ÀÇ SSR ¼­¿­À» È®ÀÎÇÒ ¼ö ÀÖ¾úÀ¸¸ç, Di motif °¡ 15,371°³·Î °¡Àå ¸¹ÀÌ Á¸ÀçÇÏ´Â °ÍÀ» È®ÀÎ ÇÒ ¼ö ÀÖ¾úÀ¸¸ç, À̸¦ ±â¹ÝÀ¸·Î ÇÏ¿© SSR marker È帱º¿¡ ´ëÇÑ primer ¸¦ 1,828°³ µðÀÚÀÎ ÇÏ¿´´Ù.

B. Primer sequence

¡æ °¢ Á¾ÀÇ À¯ÀüÀÚ¸¦ ºÐ¼®ÇÏ¿© ±× Á¾À» ±¸º°ÇÒ ¼ö ÀÖ´Â primer¸¦ ¸¸µç´Ù. primer ÀÇ Á¶°ÇÀº Motif ¾à 4~6°³À̸ç, Forward ¿Í Reverse primer °¡ 18~22°³ Á¤µµÀ̰í Tm°ªÀÌ 54.5~55.5 Á¤µµ°¡ Àû´çÇÏ´Ù.


- ¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

Sequence

Motif

Forward Primer

Tm

Reverse Primer

13974520

(ACAT)4

TTCACCGACTCTATACCTTTG

54.59

ATGCACAGGTTACTCATAGCA

55.96

13975538

(CATA)6

AACTACAAGGACCACACACAC

54.97

TGAAAAACTCTCCACTTACCA

55.03

13986408

(GCAAC)4

GCTCGATAAAAATTACACGAA

54.86

ACATTCTTTCACACACACACA

54.82

13986432

(ATGT)4

GGATTCAGTGATGAGATTGAA

55.09

GTGAACTAGTGCGCTTATGTC

55.24

13997532

(TTTA)6

TGAAAAGGAAAGTTGAGATGA

55.08

CAATACACAACCCAGATAAGC

54.86

13997724

(GTAC)4

ACGACAAGTGAAAAAGCATTA

55.23

ACACACACACACACACACTTC

55.1

14008308

(ACAT)4

CCATGACGTAAAAACAAGGT

55.17

GAACGAAATAAAAAGGTGGAT

55.02

14008602

(TGTT)4

GTATCGCCTAGTTAATGTTGC

54.33

AATGTGTGATGTGAAAGGAAC

54.95

14018664

(CATA)4

TTTCTCAAATCAGTAGCTTGC

55.02

TCGTTTATTCATTGCTATGGT

54.96

14019002

(GAGC)6

ATGTGACGTCATCGTTAAGAG

55.34

ACACTGGCACTTCTTGTCTC

55.34

14028586

(GTTT)4

CCTACAACCATACGATAGTGC

54.9

TTGACCGGTACTGTAGAAGAC

54.46

14028684

(GCAA)6

ACGATGGAGATATTCGTTGTA

54.78

CTCTCTTCGTGTGTGTGTGTA

54.73

14038135

(TACA)4

TCGTAACAAAACAGAAACCAT

54.91

AACAACCTACTCTTGCATTGA

55.05

14048286

(TATC)4

CTTATGTTGTGCCCATAATCT

54.4

AAACATGGATGACATGAGATG

55.78

 

C. ¹ÙÄÚµå ¼­¿­


    ¿ä¹ø 16³âµµ¿¡ äÁýÇÑ 11Á¾ÀÇ À¯ÀüÀÚ¸¦ ºÐ¼®ÇÏ¿© ±× Á¾À» ±¸º°ÇÒ ¼ö ÀÖ´Â Barcode ¼­¿­À» ¸¸µé¾ú°í, ±× Barcode ¼­¿­À» °¡Áö°í NCBI (National Center for Biotechnology Information) »çÀÌÆ®¿¡ ÀÖ´Â BLAST (Basic Local Alignment Search Tool)À» ÀÌ¿ëÇÏ¿© °á°ú¸¦ È®ÀÎÇÏ¿´´Ù. °¢ Á¾ÀÇ ¼­¿­µéÀÇ BLAST °á°ú ¸¶Ä¿·Î¼­ ÁÖ·Î »ç¿ëµÇ´Â COI ¼­¿­°ú ¸ÅÄ¡µÇ´Â °ÍÀ¸·Î º¸¾Æ ¹ÙÄÚµå ¼­¿­·Î »ç¿ë°¡´ÉÇÒ °ÍÀ¸·Î »ç·áµÈ´Ù.



   ±×¸²ÀÔ´Ï´Ù.
¿øº» ±×¸²ÀÇ À̸§: CLP000012080001.bmp
¿øº» ±×¸²ÀÇ Å©±â: °¡·Î 920pixel, ¼¼·Î 682pixel


(5) QC °á°ú

DNA Library QC

Name

Concentration 

 (ng/ul)

Volume

(ul)

Quantity 

(ug)

Main peak Size (bp)

Result

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

26.15

3.27

85.6

470

pass


RNA Library QC

Name

Concentration 

 (ng/ul)

Volume

(ul)

Quantity 

(ng)

Main peak Size (bp)

Result

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

71.53

5.15

368.05

299

Pass


(6) Gene prediction

»ùÇÃÁ¾

Number of  Contig (>1kb)

Number of Bases (>1kb)

N50 Contig Size

Number of gene prediction

¸ÚÁ¶·Õ¹ÚµüÁ¤¹ú·¹

37,784

52,940,062

1,337

27,379


°¢Á¾º°·Î transcriptome ¼­¿­À» ´ë»óÀ¸·Î 1kb ÀÌ»ó¸¸À» ¼±ÅÃÇÏ¿© Gene predcition ÇÑ °á°ú À§ÀÇ ¸ÅĪµÈ contig, base ¼ö´Â À§ÀÇ Ç¥¿Í °°°í N50µÇ´Â ContigÀÇ ±æÀ̵µ º¸Åë 1~3kb Á¤µµ·Î ³ªÅ¸³µ´Ù. ±×¸®°í À̵éÀ» ÅëÇØ ¿¹ÃøµÈ À¯ÀüÀÚÀÇ ¼ö´Â 2õ°³¿¡¼­ 10¸¸°³ ÀÌ»óÀ¸·Î ´Ù¾çÇÏ°Ô ¿¹ÃøµÇ¾ú´Ù.


¡Ø Long contigÀÇ Gene prediction °á°ú ±×¸² ¿¹½Ã

±×¸²ÀÔ´Ï´Ù.
¿øº» ±×¸²ÀÇ À̸§: CLP000068380005.bmp
¿øº» ±×¸²ÀÇ Å©±â: °¡·Î 952pixel, ¼¼·Î 583pixel