ClCG05G005620 (gene) Watermelon (Charleston Gray)

Name: ClCG05G005620
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Endonuclease/exonuclease/phosphatase family protein
Location: CG_Chr05 : 5484327 .. 5486114 (+)
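
The span length implied by the location line can be derived from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome annotations; the record itself does not state it). The helper name parse_location is hypothetical and not part of any database tool.

```python
import re

def parse_location(text):
    """Split 'CHROM : START .. END (STRAND)' into its parts (hypothetical helper)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CG_Chr05 : 5484327 .. 5486114 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, strand, span_length)
```

Under that assumption the gene spans 1788 bp on the plus strand of CG_Chr05.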
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGAGGATCCTGAGGAGTGAAGTTAGACGTCTCTGCTCAAGGTTATGGTGGCTGATATGGAAGCACCCAAAGCGCAGAGTTATTGTCAAAAGGTTTGGGAAGATGAATGTGAAAAGTCGACAAAAAGGCAAACCAGATAAAAATAAAGCCATAGTTTATAAGAATAGCCAGTTGCGTGATTCTGTCATCAAAAGGCCATTTCGAGTTGCTACATTCAATGTTGCCATGTTCTCTCTTGCCCCTGCTGTACCAGTTGCAGATAAACCAGCAACATTCGGGTTTGGTAGAAAGGAATGCAATTTTAGAAGCCCTGTGAATCATTGCCCAAAGAGTATACTAAAGCAATCCCCATTGCATACTGCATTAAGCAAGACTGAGTCTCTCTCCAGATCAAAGCCAAAAGTTTCCATTAATCTTCCTGACAACGAGATTTCATTAGCAAACAAGAAGTTGTCTGCTTCCATGGAAGACGAAACGTCGGGCTTATCGAAGACAACTGAAAAAACCTACTTTAAAAGTCAAGTCCCAGTAAGGTCTCCCGTGTGCTTTCCTTTCTCCATAGCTAATTGGCACTGTGAAGATCACTTGACAAGTAGTAGAACCATCCTAGAGGTTCTCAAAGAGGCAGATGCTGATATGTTGGCTTTACAAGATGTGAAGGCTGAGGAATCAAAAGGCATGAAGCCTCTATCTGATTTAGCAGCTGCTTTAGGGATGCATTATGTATTTGCTGAGAGCTGGGCTCCTGAGTATGGAAATGCTGTTTTGTCCAAATGGCCAATCAAAAGATGGAAAGTTCAGAAGATCGCTGATGATGATGATTTCAGGTTCTTAATTATCTTCTCCCATCACTCATTAAGAACTTCAATGGGAAATTACCCATGTTATTTCCTCTTTCCTTTTCATTTTCTATAATCGGTTTGTGTTTTTTGCCTTCCGGGAAATGGTATTAATGCTTAACTTATGGATTATTCAATTTGGGTCACAGAAACTTGCTGAAAGTCGCAATTGACGTGCCAGGGGCGGGAGAAGTTAACATCTACTGCACTCAACTTGACCATCTTGATGAGAATTGGAGGATGAGGCAGATTCATGCAATAACAAAATCAGTTGATTGTCCGCACATCTTGGTGGGTGGTCTCAATTCTTTGGAGAGATCCGATTACTCACCTGAAAGATGGACCAACATAGTCGAGGTAATCTTCATGCTTGAACGAGTATTCACCGAACATTTAAACAAAATTTATTGATCAGCATGTGAATGACACCGCTATGTAAACCATAAGAAGTTAATACGCTCGAATTTTTTCGGATGCAGTACTATGAGAAGGTTGGGAAGCCGACTCCAAAGGTGGAAGTGATGAAGTATTTGAGGGGAAAAGGCTACATAGATTCAAAAGACTATGCAGGAGATTGTGTGCCAGTGGTGATCATGGCCAAAGGCCAAAGTATGACACATTGCTTATGCTCATGAACCATTCTTCTTTGAAATCTACTGAAAAACTTTGCTAATTTCATATGTTTTGCAACACAGGTGTGCAGGGAACCTGTAAATACGGCACGAGAGTCGACTATATCTTAGCTTCACAGGATTCAACGTTTAAATTTGTCCCGGGTTCATACTCGGTTGTTTCGTCAAAAGGCACTTCGGACCACCACATTGTGAAGGCAGAGTTTGTAGGAATAGGAAAGAAAGCTAGCAGAGGACAAAAGGATTTAAGGCACAGGATTGCAAGGTTGACTCGAACATGTTCTTCAATAGGAATGTCATTGATGCATACTTAG

mRNA sequence

ATGTTGAGGATCCTGAGGAGTGAAGTTAGACGTCTCTGCTCAAGGTTATGGTGGCTGATATGGAAGCACCCAAAGCGCAGAGTTATTGTCAAAAGGTTTGGGAAGATGAATGTGAAAAGTCGACAAAAAGGCAAACCAGATAAAAATAAAGCCATAGTTTATAAGAATAGCCAGTTGCGTGATTCTGTCATCAAAAGGCCATTTCGAGTTGCTACATTCAATGTTGCCATGTTCTCTCTTGCCCCTGCTGTACCAGTTGCAGATAAACCAGCAACATTCGGGTTTGGTAGAAAGGAATGCAATTTTAGAAGCCCTGTGAATCATTGCCCAAAGAGTATACTAAAGCAATCCCCATTGCATACTGCATTAAGCAAGACTGAGTCTCTCTCCAGATCAAAGCCAAAAGTTTCCATTAATCTTCCTGACAACGAGATTTCATTAGCAAACAAGAAGTTGTCTGCTTCCATGGAAGACGAAACGTCGGGCTTATCGAAGACAACTGAAAAAACCTACTTTAAAAGTCAAGTCCCAGTAAGGTCTCCCGTGTGCTTTCCTTTCTCCATAGCTAATTGGCACTGTGAAGATCACTTGACAAGTAGTAGAACCATCCTAGAGGTTCTCAAAGAGGCAGATGCTGATATGTTGGCTTTACAAGATGTGAAGGCTGAGGAATCAAAAGGCATGAAGCCTCTATCTGATTTAGCAGCTGCTTTAGGGATGCATTATGTATTTGCTGAGAGCTGGGCTCCTGAGTATGGAAATGCTGTTTTGTCCAAATGGCCAATCAAAAGATGGAAAGTTCAGAAGATCGCTGATGATGATGATTTCAGAAACTTGCTGAAAGTCGCAATTGACGTGCCAGGGGCGGGAGAAGTTAACATCTACTGCACTCAACTTGACCATCTTGATGAGAATTGGAGGATGAGGCAGATTCATGCAATAACAAAATCAGTTGATTGTCCGCACATCTTGGTGGGTGGTCTCAATTCTTTGGAGAGATCCGATTACTCACCTGAAAGATGGACCAACATAGTCGAGTACTATGAGAAGGTTGGGAAGCCGACTCCAAAGGTGGAAGTGATGAAGTATTTGAGGGGAAAAGGCTACATAGATTCAAAAGACTATGCAGGAGATTGTGTGCCAGTGGTGATCATGGCCAAAGGCCAAAGTGTGCAGGGAACCTGTAAATACGGCACGAGAGTCGACTATATCTTAGCTTCACAGGATTCAACGTTTAAATTTGTCCCGGGTTCATACTCGGTTGTTTCGTCAAAAGGCACTTCGGACCACCACATTGTGAAGGCAGAGTTTGTAGGAATAGGAAAGAAAGCTAGCAGAGGACAAAAGGATTTAAGGCACAGGATTGCAAGGTTGACTCGAACATGTTCTTCAATAGGAATGTCATTGATGCATACTTAG

Coding sequence (CDS)

ATGTTGAGGATCCTGAGGAGTGAAGTTAGACGTCTCTGCTCAAGGTTATGGTGGCTGATATGGAAGCACCCAAAGCGCAGAGTTATTGTCAAAAGGTTTGGGAAGATGAATGTGAAAAGTCGACAAAAAGGCAAACCAGATAAAAATAAAGCCATAGTTTATAAGAATAGCCAGTTGCGTGATTCTGTCATCAAAAGGCCATTTCGAGTTGCTACATTCAATGTTGCCATGTTCTCTCTTGCCCCTGCTGTACCAGTTGCAGATAAACCAGCAACATTCGGGTTTGGTAGAAAGGAATGCAATTTTAGAAGCCCTGTGAATCATTGCCCAAAGAGTATACTAAAGCAATCCCCATTGCATACTGCATTAAGCAAGACTGAGTCTCTCTCCAGATCAAAGCCAAAAGTTTCCATTAATCTTCCTGACAACGAGATTTCATTAGCAAACAAGAAGTTGTCTGCTTCCATGGAAGACGAAACGTCGGGCTTATCGAAGACAACTGAAAAAACCTACTTTAAAAGTCAAGTCCCAGTAAGGTCTCCCGTGTGCTTTCCTTTCTCCATAGCTAATTGGCACTGTGAAGATCACTTGACAAGTAGTAGAACCATCCTAGAGGTTCTCAAAGAGGCAGATGCTGATATGTTGGCTTTACAAGATGTGAAGGCTGAGGAATCAAAAGGCATGAAGCCTCTATCTGATTTAGCAGCTGCTTTAGGGATGCATTATGTATTTGCTGAGAGCTGGGCTCCTGAGTATGGAAATGCTGTTTTGTCCAAATGGCCAATCAAAAGATGGAAAGTTCAGAAGATCGCTGATGATGATGATTTCAGAAACTTGCTGAAAGTCGCAATTGACGTGCCAGGGGCGGGAGAAGTTAACATCTACTGCACTCAACTTGACCATCTTGATGAGAATTGGAGGATGAGGCAGATTCATGCAATAACAAAATCAGTTGATTGTCCGCACATCTTGGTGGGTGGTCTCAATTCTTTGGAGAGATCCGATTACTCACCTGAAAGATGGACCAACATAGTCGAGTACTATGAGAAGGTTGGGAAGCCGACTCCAAAGGTGGAAGTGATGAAGTATTTGAGGGGAAAAGGCTACATAGATTCAAAAGACTATGCAGGAGATTGTGTGCCAGTGGTGATCATGGCCAAAGGCCAAAGTGTGCAGGGAACCTGTAAATACGGCACGAGAGTCGACTATATCTTAGCTTCACAGGATTCAACGTTTAAATTTGTCCCGGGTTCATACTCGGTTGTTTCGTCAAAAGGCACTTCGGACCACCACATTGTGAAGGCAGAGTTTGTAGGAATAGGAAAGAAAGCTAGCAGAGGACAAAAGGATTTAAGGCACAGGATTGCAAGGTTGACTCGAACATGTTCTTCAATAGGAATGTCATTGATGCATACTTAG

Protein sequence

MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLRDSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPVNHCPKSILKQSPLHTALSKTESLSRSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYFKSQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFKFVPGSYSVVSSKGTSDHHIVKAEFVGIGKKASRGQKDLRHRIARLTRTCSSIGMSLMHT
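
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating a few codons. This is a minimal sketch that assumes the standard genetic code; the function name translate is illustrative, and only the first 21 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTTGAGGATCCTGAGGAGT"   # first 21 nt of the CDS above
cds_end = "TCATTGATGCATACTTAG"        # last 18 nt of the CDS above

print(translate(cds_start))  # MLRILRS
print(translate(cds_end))    # SLMHT
```

Both fragments agree with the start (MLRILRS...) and end (...SLMHT) of the protein sequence listed above.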
BLAST of ClCG05G005620 vs. TrEMBL
Match: A0A0A0L5Q2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G150770 PE=4 SV=1)

HSP 1 Score: 886.3 bits (2289), Expect = 1.6e-254
Identity = 436/472 (92.37%), Positives = 454/472 (96.19%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           MLRILRSE+RRLCSRLWWLIWKHPKRRVIVKRFGKMNVK RQKG+PDKNKA VY  +QL 
Sbjct: 1   MLRILRSELRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKGRQKGRPDKNKARVYTKNQLC 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPVNHCPKSILKQSPLH 120
           DSVI RPFRVATFNVAMFSLAPAVPVA+KPATFGFGRKE +FRSPVNHCPKSILKQSPLH
Sbjct: 61  DSVITRPFRVATFNVAMFSLAPAVPVAEKPATFGFGRKEYSFRSPVNHCPKSILKQSPLH 120

Query: 121 TALSKTESLSRSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYFKSQVPVRS 180
           TALSKTESLSRSKPKVSINLPDNEISLAN KLSASME+ T GL+KTT+K YFKSQVPVRS
Sbjct: 121 TALSKTESLSRSKPKVSINLPDNEISLANNKLSASMENGTPGLTKTTDKRYFKSQVPVRS 180

Query: 181 PVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSDLAAALGM 240
           PVCFPFSIANWHCED LTSSRTILEVLKEADAD+LALQDVKAEESKGMKPLSDLAAALGM
Sbjct: 181 PVCFPFSIANWHCEDDLTSSRTILEVLKEADADILALQDVKAEESKGMKPLSDLAAALGM 240

Query: 241 HYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVNIYCTQLD 300
            YVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPG GEVNIYCTQLD
Sbjct: 241 DYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGTGEVNIYCTQLD 300

Query: 301 HLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGKPTPKVEV 360
           HLDENWRM+QI+AITKSVDCPHILVGGLNSLE+SDYSPERWTNIVEYYEKVGKPTPKVEV
Sbjct: 301 HLDENWRMKQINAITKSVDCPHILVGGLNSLEKSDYSPERWTNIVEYYEKVGKPTPKVEV 360

Query: 361 MKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFKFVPGSYS 420
           MK+L GKGYIDSKDYAGDC PVVIMAKGQ+VQGTCKYGTRVDYILASQDSTFKFVPGSYS
Sbjct: 361 MKFLSGKGYIDSKDYAGDCEPVVIMAKGQNVQGTCKYGTRVDYILASQDSTFKFVPGSYS 420

Query: 421 VVSSKGTSDHHIVKAEFVGIGKKASRGQKDLRHRIARLTRTCSSIGMSLMHT 473
           VVSSKGTSDHHIVKAEFVGIG+KASRG KDL+ RI+RLT+TCSSIGMSLMHT
Sbjct: 421 VVSSKGTSDHHIVKAEFVGIGQKASRGHKDLKKRISRLTQTCSSIGMSLMHT 472
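
The percentages on the HSP summary line follow directly from the match counts. The sketch below recomputes them from a summary line in the format shown in this report; the function name parse_hsp_stats is hypothetical, and the regular expression is only an assumption about that line layout, not a general BLAST parser.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, percent)} for 'Label = a/b (p%)' pairs."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+) \([\d.]+%\)", line):
        matches, length = int(matches), int(length)
        # Recompute the percentage rather than trusting the printed value.
        stats[label] = (matches, length, round(100.0 * matches / length, 2))
    return stats

line = "Identity = 436/472 (92.37%), Positives = 454/472 (96.19%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["Identity"])   # (436, 472, 92.37)
print(stats["Positives"])  # (454, 472, 96.19)
```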

BLAST of ClCG05G005620 vs. TrEMBL
Match: A0A061EMS5_THECC (DNAse I-like superfamily protein OS=Theobroma cacao GN=TCM_021088 PE=4 SV=1)

HSP 1 Score: 552.7 bits (1423), Expect = 4.2e-154
Identity = 287/489 (58.69%), Positives = 356/489 (72.80%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           ML I R ++ +LCSR+ WLI K P+ +VI++R G++N K ++KG      + ++    L 
Sbjct: 18  MLGIFRRKMCQLCSRIRWLIRKRPRPKVIIRRLGRLNSKGQRKGDLGTKNSSIHLYGDLG 77

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKEC---------NFRSPVNHC-P 120
            S  KRP R+ATFNVAMFSLAP +  A++   F +G ++          N  +   +C P
Sbjct: 78  FSNPKRPIRIATFNVAMFSLAPVISEAEEAGLFSYGEEDYMALKSPFQFNLHTKSPNCYP 137

Query: 121 KSILKQSPLHTA------LSKTESLSRSKPKVSINLPDNEISLANKKLSASMEDETSGLS 180
           KSILKQSPLH +      +SK +  SRSK KVSINLPDNEISLA +KL   +ED   G S
Sbjct: 138 KSILKQSPLHNSHTSPDSISKQKKFSRSKQKVSINLPDNEISLAQRKLLTFVEDVKEGAS 197

Query: 181 KTTEKTYFKSQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEE 240
                   ++ V +RSPVC P S+ N+  E  L S R+I EVL+E DAD+LALQDVKA+E
Sbjct: 198 DMITSRINRNNVIMRSPVCLPSSMINFWNEGSLRSGRSIAEVLREVDADILALQDVKAQE 257

Query: 241 SKGMKPLSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAI 300
            KGMKPLSDLAAALGM YVFAESWAP+YGNA+LSKWPIKRW VQKIADDDDFRN+LK  I
Sbjct: 258 EKGMKPLSDLAAALGMKYVFAESWAPDYGNAILSKWPIKRWTVQKIADDDDFRNVLKATI 317

Query: 301 DVPGAGEVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNI 360
           +VP AGEVN YCTQLDHLDENWRM+QI AIT+S +  H+L+GGLNSL  SDYS ERWT+I
Sbjct: 318 EVPWAGEVNFYCTQLDHLDENWRMKQIKAITESNNSSHLLLGGLNSLNGSDYSSERWTDI 377

Query: 361 VEYYEKVGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYI 420
           V+YYE +GKP P+ EVMK LRG+ Y D+KDYAG+C PVVI+AKGQ+VQGTCKYGTRVDYI
Sbjct: 378 VKYYEDIGKPRPRTEVMKLLRGREYTDAKDYAGECEPVVIIAKGQNVQGTCKYGTRVDYI 437

Query: 421 LASQDSTFKFVPGSYSVVSSKGTSDHHIVKAEFVGIGKKA-----SRGQKDLRHRIARLT 468
           LAS +S + FVPGSYSV+SSKGTSDHHIVK + V  G+K+      R +K  + ++ R+T
Sbjct: 438 LASSNSPYNFVPGSYSVISSKGTSDHHIVKVDLVKGGEKSQQNVIKRDRKPTQKKVIRMT 497

BLAST of ClCG05G005620 vs. TrEMBL
Match: I1MFA3_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G100400 PE=4 SV=2)

HSP 1 Score: 547.4 bits (1409), Expect = 1.8e-152
Identity = 279/481 (58.00%), Positives = 348/481 (72.35%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           M  ++R ++R L SR+ WL+WK P+ +V++KRF K+N K   K +  KNK+    N  L 
Sbjct: 1   MFGVIRRKLRHLYSRILWLLWKRPRSKVVIKRFRKLNFKGHHKAELCKNKSTSDPNGLLV 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATF----GFGRKECNFRSPVNHCPKSILKQ 120
           +S   R  R+ATFNVAMFSLAPAV   ++        G  +K    +      PKSILKQ
Sbjct: 61  ESQSGRAIRIATFNVAMFSLAPAVSEFNEWVVSNHENGSNKKSLLAKGDF---PKSILKQ 120

Query: 121 SPLHTALSKTESLS------RSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKT 180
           SPLH +L K +SLS      RS  KVSINLPDNEISLAN +L AS+E +           
Sbjct: 121 SPLHASLDKAQSLSDSEILPRSNLKVSINLPDNEISLANSRLLASIESKEGTSDTIMGNV 180

Query: 181 YFKSQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKP 240
             + QVP RSPVCFPF +      +  T SR+ILEVL+E DAD+LALQDVKAEE K MKP
Sbjct: 181 SGRHQVPARSPVCFPFIMNYCEGTERFTCSRSILEVLREIDADVLALQDVKAEEEKNMKP 240

Query: 241 LSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAG 300
           LSDLAAALGM YVFAESWAPEYGNA+LSKWPIK+W+VQKIADDDDFRN+LK  +DVP AG
Sbjct: 241 LSDLAAALGMKYVFAESWAPEYGNAILSKWPIKKWRVQKIADDDDFRNVLKATVDVPWAG 300

Query: 301 EVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEK 360
           E+N + TQLDHLDENWRM+Q+HAI +S D PHIL GGLNSL  +DYS ERWT+I +YYEK
Sbjct: 301 EINFHSTQLDHLDENWRMKQVHAIIRSNDPPHILAGGLNSLYGADYSSERWTDIFKYYEK 360

Query: 361 VGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDS 420
           +GKP P+ EVM +++ KGY+D+KDYAG+C P+ I+AKGQ+VQGTCKYGTRVDYILAS +S
Sbjct: 361 LGKPRPRSEVMNFVKSKGYVDAKDYAGECEPIAIIAKGQNVQGTCKYGTRVDYILASPNS 420

Query: 421 TFKFVPGSYSVVSSKGTSDHHIVKAEFVGIG----KKASRGQKDLRHRIARLTRTCSSIG 468
            +K+VPGSYSV+SSKGTSDHHIVK + + +     K A R  + L+ ++ ++T  CS+ G
Sbjct: 421 HYKYVPGSYSVISSKGTSDHHIVKVDIMKVNASAQKNAIRQCRKLKRKVVKITPPCSATG 478

BLAST of ClCG05G005620 vs. TrEMBL
Match: A0A0B2RIL1_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_010086 PE=4 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 1.8e-152
Identity = 279/481 (58.00%), Positives = 348/481 (72.35%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           M  ++R ++R L SR+ WL+WK P+ +V++KRF K+N K   K +  KNK+    N  L 
Sbjct: 1   MFGVIRRKLRHLYSRILWLLWKRPRSKVVIKRFRKLNFKGHHKAELCKNKSTSDPNGLLV 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATF----GFGRKECNFRSPVNHCPKSILKQ 120
           +S   R  R+ATFNVAMFSLAPAV   ++        G  +K    +      PKSILKQ
Sbjct: 61  ESQSGRAIRIATFNVAMFSLAPAVSEFNEWVVSNHENGSNKKSLLAKGDF---PKSILKQ 120

Query: 121 SPLHTALSKTESLS------RSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKT 180
           SPLH +L K +SLS      RS  KVSINLPDNEISLAN +L AS+E +           
Sbjct: 121 SPLHASLDKAQSLSDSEILPRSNLKVSINLPDNEISLANSRLLASIESKEGTSDTIMGNV 180

Query: 181 YFKSQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKP 240
             + QVP RSPVCFPF +      +  T SR+ILEVL+E DAD+LALQDVKAEE K MKP
Sbjct: 181 SGRHQVPARSPVCFPFIMNYCEGTERFTCSRSILEVLREIDADVLALQDVKAEEEKNMKP 240

Query: 241 LSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAG 300
           LSDLAAALGM YVFAESWAPEYGNA+LSKWPIK+W+VQKIADDDDFRN+LK  +DVP AG
Sbjct: 241 LSDLAAALGMKYVFAESWAPEYGNAILSKWPIKKWRVQKIADDDDFRNVLKATVDVPWAG 300

Query: 301 EVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEK 360
           E+N + TQLDHLDENWRM+Q+HAI +S D PHIL GGLNSL  +DYS ERWT+I +YYEK
Sbjct: 301 EINFHSTQLDHLDENWRMKQVHAIIRSNDPPHILAGGLNSLYGADYSSERWTDIFKYYEK 360

Query: 361 VGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDS 420
           +GKP P+ EVM +++ KGY+D+KDYAG+C P+ I+AKGQ+VQGTCKYGTRVDYILAS +S
Sbjct: 361 LGKPRPRSEVMNFVKSKGYVDAKDYAGECEPIAIIAKGQNVQGTCKYGTRVDYILASPNS 420

Query: 421 TFKFVPGSYSVVSSKGTSDHHIVKAEFVGIG----KKASRGQKDLRHRIARLTRTCSSIG 468
            +K+VPGSYSV+SSKGTSDHHIVK + + +     K A R  + L+ ++ ++T  CS+ G
Sbjct: 421 HYKYVPGSYSVISSKGTSDHHIVKVDIMKVNASAQKNAIRQCRKLKRKVVKITPPCSATG 478

BLAST of ClCG05G005620 vs. TrEMBL
Match: A0A0B2S062_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_007240 PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 2.3e-152
Identity = 283/481 (58.84%), Positives = 353/481 (73.39%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           M  ++R ++R L SR+ WL+WK PK +V++KRF K+N K   + +  KNK+    N  L 
Sbjct: 1   MFGVIRRKLRHLYSRILWLLWKRPKSKVVIKRFRKLNFKGHHRAELRKNKSTSDSNGLLV 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPV--NHCPKSILKQSP 120
           +S   RP R+ATFNVAMFSLAPAV   D+           N ++ +     PKSILKQSP
Sbjct: 61  ESQSGRPIRIATFNVAMFSLAPAVSEFDEWVVSNHEHVS-NKKNLLAKGDFPKSILKQSP 120

Query: 121 LHTALSKTESLS------RSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYF 180
           LH +L K ++LS      RS  KVSINLPDNEISLAN +L ASME +       T     
Sbjct: 121 LHASLDKAQNLSASNILPRSNLKVSINLPDNEISLANSRLLASMERKEGTSDTITGNVSG 180

Query: 181 KSQVPVRSPVCFPFSIANWHCED--HLTSSRTILEVLKEADADMLALQDVKAEESKGMKP 240
           + QVP RSPVCFPF + N +CED    T SR+I+EVL+E DAD+LALQDVKAEE K MKP
Sbjct: 181 RHQVPARSPVCFPF-VMN-YCEDTERFTCSRSIMEVLREIDADVLALQDVKAEEEKNMKP 240

Query: 241 LSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAG 300
           LSDLAAALGM YVFAESWAPEYGNA+LSKWPIK+ +VQKIADDDDFRN+LK  IDVP AG
Sbjct: 241 LSDLAAALGMKYVFAESWAPEYGNAILSKWPIKKSRVQKIADDDDFRNVLKATIDVPWAG 300

Query: 301 EVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEK 360
           E+N + TQLDHLDE+WRM+Q+HAI +S D PHIL GGLNSL  +DYS ERWT+I +YYEK
Sbjct: 301 EINFHSTQLDHLDESWRMKQVHAIIRSNDPPHILAGGLNSLYGADYSSERWTDIFKYYEK 360

Query: 361 VGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDS 420
           +GKP P+ EVM +++ KGY+D+KDYAG+C P+VI+AKGQ+VQGTCKYGTRVDYILAS +S
Sbjct: 361 LGKPRPRSEVMNFMKSKGYVDAKDYAGECEPIVIIAKGQNVQGTCKYGTRVDYILASPNS 420

Query: 421 TFKFVPGSYSVVSSKGTSDHHIVKAEFVGIG----KKASRGQKDLRHRIARLTRTCSSIG 468
            +K+VPGSYSV+SSKGTSDHHIVK + + +     K   R  + L+ ++ ++T  CS+ G
Sbjct: 421 PYKYVPGSYSVISSKGTSDHHIVKVDIMKVNAPAQKNVIRQCRKLKRKVVKITPPCSATG 478

BLAST of ClCG05G005620 vs. TAIR10
Match: AT3G21530.1 (AT3G21530.1 DNAse I-like superfamily protein)

HSP 1 Score: 486.9 bits (1252), Expect = 1.4e-137
Identity = 264/471 (56.05%), Positives = 331/471 (70.28%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           ML + R ++  L SRL W+I K  + RVIV+RF K   ++R+K  P+   + ++ +S   
Sbjct: 1   MLCVFRRKLGCLFSRLRWVIKKRVRARVIVRRFRKARWRARRKESPESEVSSIHLSSNSG 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPVNHCPKSILKQSPLH 120
                R  RVATFNVAMFSLAP V   ++ A  G      N   P    PK ILKQSPLH
Sbjct: 61  -----RHIRVATFNVAMFSLAPVVQTMEETAFLGH-LDSSNITCP---SPKGILKQSPLH 120

Query: 121 TALSKTESLSRSKPKVSINLPDNEISLANKKLSASM-EDETSGLSKTTEKTYFKSQVPVR 180
           ++  +       KPKV INLPDNEISLA      SM E++  G          +  + +R
Sbjct: 121 SSAVR-------KPKVCINLPDNEISLAQSYSFLSMVENDNDGKEN-------RGSLSMR 180

Query: 181 SPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSDLAAALG 240
           SPVC P    +    +  +S R+I E+L+E DAD+LALQDVKAEE   MKPLSDLA+ALG
Sbjct: 181 SPVCLPSCWWDQESFNGYSSRRSIAELLRELDADILALQDVKAEEETLMKPLSDLASALG 240

Query: 241 MHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVNIYCTQL 300
           M YVFAESWAPEYGNA+LSKWPIK+W+VQ+IAD DDFRN+LKV +++P AG+VN+YCTQL
Sbjct: 241 MKYVFAESWAPEYGNAILSKWPIKKWRVQRIADVDDFRNVLKVTVEIPWAGDVNVYCTQL 300

Query: 301 DHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGKPTPKVE 360
           DHLDENWRM+QI AIT+  + PHIL+GGLNSL+ SDYS  RW +IV+YYE  GKPTP+VE
Sbjct: 301 DHLDENWRMKQIDAITRGDESPHILLGGLNSLDGSDYSIARWNHIVKYYEDSGKPTPRVE 360

Query: 361 VMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFKFVPGSY 420
           VM++L+GKGY+DSK++AG+C PVVI+AKGQ+VQGTCKYGTRVDYILAS +S ++FVPGSY
Sbjct: 361 VMRFLKGKGYLDSKEFAGECEPVVIIAKGQNVQGTCKYGTRVDYILASPESPYEFVPGSY 420

Query: 421 SVVSSKGTSDHHIVKAEFVGIGKKASRGQKDLRHRIARLTRTCSSIGMSLM 471
           SVVSSKGTSDHHIVK + V I K+ SRG  + +H   +  +    I  +LM
Sbjct: 421 SVVSSKGTSDHHIVKVDLV-ITKERSRG--NFKHSRKKAKQKIFQIKANLM 445

BLAST of ClCG05G005620 vs. TAIR10
Match: AT2G48030.1 (AT2G48030.1 DNAse I-like superfamily protein)

HSP 1 Score: 400.6 bits (1028), Expect = 1.3e-111
Identity = 225/409 (55.01%), Positives = 279/409 (68.22%), Query Frame = 1

Query: 70  VATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPVNHCPKSILKQ-----SPLHTALS 129
           VATFN AMFS+APAVP ++K   F         +S V+  PKSILK      SP H +  
Sbjct: 58  VATFNAAMFSMAPAVP-SNKGLPF-------RSKSTVDR-PKSILKPMNAAASPTHDS-R 117

Query: 130 KTESLSRSKPK-VSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYFKSQVPVRSPVC 189
           K +  ++S+P+ VSINLPDNEIS   ++LS   + + S               P+R    
Sbjct: 118 KQQRFAKSRPRRVSINLPDNEIS---RQLSFREDPQHS---------------PLRPG-- 177

Query: 190 FPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSDLAAALGMHYV 249
                     E  L S+RT LEVL E DAD+LALQDVKA+E+  M+PLSDLAAALGM+YV
Sbjct: 178 ----------EIGLRSTRTALEVLSELDADVLALQDVKADEADQMRPLSDLAAALGMNYV 237

Query: 250 FAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVNIYCTQLDHLD 309
           FAESWAPEYGNA+LSKWPIK   V +I D  DFRN+LK +I+VPG+GEV  +CT LDHLD
Sbjct: 238 FAESWAPEYGNAILSKWPIKSSNVLRIFDHTDFRNVLKASIEVPGSGEVEFHCTHLDHLD 297

Query: 310 ENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGKPTPKVEVMKY 369
           E WRM+Q+ AI +S + PHIL G LNSL+ SDYSPERWT+IV+YYE++GKP PK +VM++
Sbjct: 298 EKWRMKQVDAIIQSTNVPHILAGALNSLDESDYSPERWTDIVKYYEEMGKPIPKAQVMRF 357

Query: 370 LRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFKFVPGSYSVVS 429
           L+ K Y D+KD+AG+C  VV++AKGQSVQGTCKYGTRVDYILAS DS ++FVPGSYSV+S
Sbjct: 358 LKSKEYTDAKDFAGECESVVVVAKGQSVQGTCKYGTRVDYILASSDSPYRFVPGSYSVLS 417

Query: 430 SKGTSDHHIVKAEFV---GIGKKASRGQKDLRHRIARLTRTCSSIGMSL 470
           SKGTSDHHIVK + V    I       +    H++ R+T T  +   SL
Sbjct: 418 SKGTSDHHIVKVDVVKATSINVNEQEQRPIRSHKLQRITATTYNNNSSL 426

BLAST of ClCG05G005620 vs. NCBI nr
Match: gi|449432773|ref|XP_004134173.1| (PREDICTED: uncharacterized protein LOC101215085 [Cucumis sativus])

HSP 1 Score: 886.3 bits (2289), Expect = 2.3e-254
Identity = 436/472 (92.37%), Positives = 454/472 (96.19%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           MLRILRSE+RRLCSRLWWLIWKHPKRRVIVKRFGKMNVK RQKG+PDKNKA VY  +QL 
Sbjct: 1   MLRILRSELRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKGRQKGRPDKNKARVYTKNQLC 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPVNHCPKSILKQSPLH 120
           DSVI RPFRVATFNVAMFSLAPAVPVA+KPATFGFGRKE +FRSPVNHCPKSILKQSPLH
Sbjct: 61  DSVITRPFRVATFNVAMFSLAPAVPVAEKPATFGFGRKEYSFRSPVNHCPKSILKQSPLH 120

Query: 121 TALSKTESLSRSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYFKSQVPVRS 180
           TALSKTESLSRSKPKVSINLPDNEISLAN KLSASME+ T GL+KTT+K YFKSQVPVRS
Sbjct: 121 TALSKTESLSRSKPKVSINLPDNEISLANNKLSASMENGTPGLTKTTDKRYFKSQVPVRS 180

Query: 181 PVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSDLAAALGM 240
           PVCFPFSIANWHCED LTSSRTILEVLKEADAD+LALQDVKAEESKGMKPLSDLAAALGM
Sbjct: 181 PVCFPFSIANWHCEDDLTSSRTILEVLKEADADILALQDVKAEESKGMKPLSDLAAALGM 240

Query: 241 HYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVNIYCTQLD 300
            YVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPG GEVNIYCTQLD
Sbjct: 241 DYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGTGEVNIYCTQLD 300

Query: 301 HLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGKPTPKVEV 360
           HLDENWRM+QI+AITKSVDCPHILVGGLNSLE+SDYSPERWTNIVEYYEKVGKPTPKVEV
Sbjct: 301 HLDENWRMKQINAITKSVDCPHILVGGLNSLEKSDYSPERWTNIVEYYEKVGKPTPKVEV 360

Query: 361 MKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFKFVPGSYS 420
           MK+L GKGYIDSKDYAGDC PVVIMAKGQ+VQGTCKYGTRVDYILASQDSTFKFVPGSYS
Sbjct: 361 MKFLSGKGYIDSKDYAGDCEPVVIMAKGQNVQGTCKYGTRVDYILASQDSTFKFVPGSYS 420

Query: 421 VVSSKGTSDHHIVKAEFVGIGKKASRGQKDLRHRIARLTRTCSSIGMSLMHT 473
           VVSSKGTSDHHIVKAEFVGIG+KASRG KDL+ RI+RLT+TCSSIGMSLMHT
Sbjct: 421 VVSSKGTSDHHIVKAEFVGIGQKASRGHKDLKKRISRLTQTCSSIGMSLMHT 472

BLAST of ClCG05G005620 vs. NCBI nr
Match: gi|659076584|ref|XP_008438759.1| (PREDICTED: uncharacterized protein LOC103483772 [Cucumis melo])

HSP 1 Score: 871.3 bits (2250), Expect = 7.6e-250
Identity = 427/473 (90.27%), Positives = 451/473 (95.35%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           MLRILRSE+RRLCSRLWWLIWKHPKRRVI+KRFGKMNVK RQKG+PDK+KA +Y  +QL 
Sbjct: 1   MLRILRSELRRLCSRLWWLIWKHPKRRVIIKRFGKMNVKGRQKGRPDKSKARIYTKNQLC 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKECNFRSPVNHCPKSILKQSPLH 120
           DSVI RPFRVATFNVAMFSLAPAVPVA+KPATFGF RKEC+FRSPVNHCPKSILKQSPLH
Sbjct: 61  DSVITRPFRVATFNVAMFSLAPAVPVAEKPATFGFRRKECSFRSPVNHCPKSILKQSPLH 120

Query: 121 TALSKTESLSRSKPKVSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYFKSQVPVRS 180
            ALSKTESLSRSKPKVSINLPDNEISLAN KLSASME+ET  L+KTT+K YFKSQVPVRS
Sbjct: 121 NALSKTESLSRSKPKVSINLPDNEISLANNKLSASMEEETPSLTKTTDKRYFKSQVPVRS 180

Query: 181 PVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSDLAAALGM 240
           PVCFPFSIANWHCEDHLTSS+TILEVLKEADAD+LALQDVKAEESKGMKPLSDLAAALGM
Sbjct: 181 PVCFPFSIANWHCEDHLTSSKTILEVLKEADADILALQDVKAEESKGMKPLSDLAAALGM 240

Query: 241 HYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVNIYCTQLD 300
            YVFAESWAPEYGNAVLSKWPI+RWKVQKIADDDDFRNLLKVAIDVPG GEVNIYCTQLD
Sbjct: 241 DYVFAESWAPEYGNAVLSKWPIRRWKVQKIADDDDFRNLLKVAIDVPGTGEVNIYCTQLD 300

Query: 301 HLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGKPTPKVEV 360
           HLDENWRM+QI+AITKSVDCPHILVGGLNSLERSDYSP RWT+IVEYYEKVGKPTPKVEV
Sbjct: 301 HLDENWRMKQINAITKSVDCPHILVGGLNSLERSDYSPGRWTDIVEYYEKVGKPTPKVEV 360

Query: 361 MKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFKFVPGSYS 420
           MK+L GKGYIDSKDYAGDC PVVIMAKGQ+VQGTCKYGTRVDYILAS DSTFKFVPGSYS
Sbjct: 361 MKFLSGKGYIDSKDYAGDCEPVVIMAKGQNVQGTCKYGTRVDYILASPDSTFKFVPGSYS 420

Query: 421 VVSSKGTSDHHIVKAEFVGIGKKASR-GQKDLRHRIARLTRTCSSIGMSLMHT 473
           V+SSKGTSDHHIVKAEFVGIG+K SR GQK L+ RI+RLT+TCSSIGMSLMHT
Sbjct: 421 VISSKGTSDHHIVKAEFVGIGQKVSRGGQKVLKQRISRLTQTCSSIGMSLMHT 473

BLAST of ClCG05G005620 vs. NCBI nr
Match: gi|225432458|ref|XP_002277228.1| (PREDICTED: uncharacterized protein LOC100259606 [Vitis vinifera])

HSP 1 Score: 563.5 bits (1451), Expect = 3.4e-157
Identity = 285/486 (58.64%), Positives = 365/486 (75.10%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           ML IL  +++R+C+RL WL+W+ PK +V+++RFGK+ +K + KG P  +K+ ++ NSQL 
Sbjct: 1   MLSILHRKLQRICARLRWLMWRRPKPKVVIRRFGKL-MKRQPKGVPGSSKSAIHLNSQLG 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKE-CNFRSPV---------NHCP 120
            S+  +P R+ATFN AMF LAPAVP  +K   F    ++   F+S +         N+ P
Sbjct: 61  RSISGKPIRIATFNAAMFCLAPAVPKPEKSVVFCQEEEDYLRFKSHMKIDTWAKSENYRP 120

Query: 121 KSILKQSPLHTALSKTESLS-----RSKPKVSINLPDNEISLANKKLSASMEDETSGLSK 180
           KSILKQSPLH+  +  + LS     RS+ KVSINLPDNEISLAN KL +  E E  G S 
Sbjct: 121 KSILKQSPLHSTPNTPDHLSQQKLTRSRLKVSINLPDNEISLANSKLLSFWESEKEGSSS 180

Query: 181 TTEKTYFKSQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEES 240
                 +K  VP+RSPVC+P S++++   + L SSR+ILEVL+E +AD+LALQDVKAEE 
Sbjct: 181 NGRNYRYK--VPMRSPVCYPSSMSDYPIGEGLRSSRSILEVLREVNADILALQDVKAEEE 240

Query: 241 KGMKPLSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAID 300
           KGMKPLSDLA ALGM YVFAESWAPE+GNA+LSKWPIKRWK QKI D +DFRN+LK  ID
Sbjct: 241 KGMKPLSDLAGALGMKYVFAESWAPEFGNAILSKWPIKRWKAQKIIDGEDFRNVLKATID 300

Query: 301 VPGAGEVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIV 360
           VP AGEVN +CTQLDHLDENWRM+Q +AI +S DCPHIL GGLNSL  SDYS ERW +I+
Sbjct: 301 VPWAGEVNFHCTQLDHLDENWRMKQTNAIIQSSDCPHILAGGLNSLNGSDYSRERWMDII 360

Query: 361 EYYEKVGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYIL 420
           +YYE++GKPTPKV+VM++L+GK Y+D+K++AG+C PVVI+AKGQ+VQGTCKYGTRVDYIL
Sbjct: 361 KYYEEIGKPTPKVDVMEFLKGKEYVDAKNFAGECEPVVIIAKGQNVQGTCKYGTRVDYIL 420

Query: 421 ASQDSTFKFVPGSYSVVSSKGTSDHHIVKAEFVGIGKKAS----RGQKDLRHRIARLTRT 468
           ASQDS +KFVP SYSV+SSKGTSDHH+VK + V + + A     R  +  + +I ++   
Sbjct: 421 ASQDSPYKFVPRSYSVISSKGTSDHHVVKVDIVKVDENAEENFIRRHRKPKQKIVKMRNP 480

BLAST of ClCG05G005620 vs. NCBI nr
Match: gi|590660455|ref|XP_007035404.1| (DNAse I-like superfamily protein [Theobroma cacao])

HSP 1 Score: 552.7 bits (1423), Expect = 6.0e-154
Identity = 287/489 (58.69%), Positives = 356/489 (72.80%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           ML I R ++ +LCSR+ WLI K P+ +VI++R G++N K ++KG      + ++    L 
Sbjct: 18  MLGIFRRKMCQLCSRIRWLIRKRPRPKVIIRRLGRLNSKGQRKGDLGTKNSSIHLYGDLG 77

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATFGFGRKEC---------NFRSPVNHC-P 120
            S  KRP R+ATFNVAMFSLAP +  A++   F +G ++          N  +   +C P
Sbjct: 78  FSNPKRPIRIATFNVAMFSLAPVISEAEEAGLFSYGEEDYMALKSPFQFNLHTKSPNCYP 137

Query: 121 KSILKQSPLHTA------LSKTESLSRSKPKVSINLPDNEISLANKKLSASMEDETSGLS 180
           KSILKQSPLH +      +SK +  SRSK KVSINLPDNEISLA +KL   +ED   G S
Sbjct: 138 KSILKQSPLHNSHTSPDSISKQKKFSRSKQKVSINLPDNEISLAQRKLLTFVEDVKEGAS 197

Query: 181 KTTEKTYFKSQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEE 240
                   ++ V +RSPVC P S+ N+  E  L S R+I EVL+E DAD+LALQDVKA+E
Sbjct: 198 DMITSRINRNNVIMRSPVCLPSSMINFWNEGSLRSGRSIAEVLREVDADILALQDVKAQE 257

Query: 241 SKGMKPLSDLAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAI 300
            KGMKPLSDLAAALGM YVFAESWAP+YGNA+LSKWPIKRW VQKIADDDDFRN+LK  I
Sbjct: 258 EKGMKPLSDLAAALGMKYVFAESWAPDYGNAILSKWPIKRWTVQKIADDDDFRNVLKATI 317

Query: 301 DVPGAGEVNIYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNI 360
           +VP AGEVN YCTQLDHLDENWRM+QI AIT+S +  H+L+GGLNSL  SDYS ERWT+I
Sbjct: 318 EVPWAGEVNFYCTQLDHLDENWRMKQIKAITESNNSSHLLLGGLNSLNGSDYSSERWTDI 377

Query: 361 VEYYEKVGKPTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYI 420
           V+YYE +GKP P+ EVMK LRG+ Y D+KDYAG+C PVVI+AKGQ+VQGTCKYGTRVDYI
Sbjct: 378 VKYYEDIGKPRPRTEVMKLLRGREYTDAKDYAGECEPVVIIAKGQNVQGTCKYGTRVDYI 437

Query: 421 LASQDSTFKFVPGSYSVVSSKGTSDHHIVKAEFVGIGKKA-----SRGQKDLRHRIARLT 468
           LAS +S + FVPGSYSV+SSKGTSDHHIVK + V  G+K+      R +K  + ++ R+T
Sbjct: 438 LASSNSPYNFVPGSYSVISSKGTSDHHIVKVDLVKGGEKSQQNVIKRDRKPTQKKVIRMT 497

BLAST of ClCG05G005620 vs. NCBI nr
Match: gi|1021027662|gb|KZM85449.1| (hypothetical protein DCAR_027129 [Daucus carota subsp. sativus])

HSP 1 Score: 547.7 bits (1410), Expect = 1.9e-152
Identity = 280/474 (59.07%), Positives = 350/474 (73.84%), Query Frame = 1

Query: 1   MLRILRSEVRRLCSRLWWLIWKHPKRRVIVKRFGKMNVKSRQKGKPDKNKAIVYKNSQLR 60
           ML   R ++  LCSR+ WLIW+ PK +V+++RFGK++ K + + KP  N A+  KN  + 
Sbjct: 1   MLSQFRKKLSHLCSRIRWLIWRRPKSKVVIRRFGKLSSKGQLRRKPS-NLAVRRKNDLMG 60

Query: 61  DSVIKRPFRVATFNVAMFSLAPAVPVADKPATF-GFGRKECNFRSPVNHCPKSILKQSPL 120
            +++++  RVATFN A+FSLA AVP A+K   F              N+ PKSILKQSPL
Sbjct: 61  GALLQKSVRVATFNAALFSLALAVPRAEKAVVFLNEDDLFTTMEKSANNVPKSILKQSPL 120

Query: 121 HTALSKTES----LSRSKPK--VSINLPDNEISLANKKLSASMEDETSGLSKTTEKTYFK 180
           H++ S T S    LS  KPK  VSINLPDNEISLA KK+   +++     S+   +   +
Sbjct: 121 HSSFSGTASPEYLLSPIKPKMKVSINLPDNEISLAQKKVLGKIDEP----SRPIRQGSVR 180

Query: 181 SQVPVRSPVCFPFSIANWHCEDHLTSSRTILEVLKEADADMLALQDVKAEESKGMKPLSD 240
           +Q P+RSPV  PF + NW  +  L  SRTIL+VLKE DAD+LALQDVKAEE K MKPLSD
Sbjct: 181 NQGPMRSPVNIPFGMTNWMNDGSLIGSRTILDVLKEVDADILALQDVKAEEEKNMKPLSD 240

Query: 241 LAAALGMHYVFAESWAPEYGNAVLSKWPIKRWKVQKIADDDDFRNLLKVAIDVPGAGEVN 300
           LA ALGM+YVFAESWAPEYGNA+LSKWPIKRWKVQ+I DD DFRN+LK  IDVP  G++N
Sbjct: 241 LAYALGMNYVFAESWAPEYGNAILSKWPIKRWKVQRIYDDQDFRNVLKATIDVPWTGDIN 300

Query: 301 IYCTQLDHLDENWRMRQIHAITKSVDCPHILVGGLNSLERSDYSPERWTNIVEYYEKVGK 360
            YCTQLDHLDE+WR++QI+AI +S D PHIL GGLNSL  SDYSPERW +IV YY+ +GK
Sbjct: 301 FYCTQLDHLDESWRLKQINAIIQSSDHPHILAGGLNSLNISDYSPERWADIVRYYQAIGK 360

Query: 361 PTPKVEVMKYLRGKGYIDSKDYAGDCVPVVIMAKGQSVQGTCKYGTRVDYILASQDSTFK 420
           PTPKVEV  +L+G+ YID+KD++GDC PVV++AKGQ+VQGTCKYGTRVDYI+ASQD  +K
Sbjct: 361 PTPKVEVTNFLKGEEYIDAKDFSGDCEPVVMIAKGQNVQGTCKYGTRVDYIMASQDLHYK 420

Query: 421 FVPGSYSVVSSKGTSDHHIVKAEFVGIGKKASRGQKDLRHRIARLTRTCSSIGM 468
           FVP +YSV+SSKGTSDHH+VK + V   K A R  K L  ++AR+  +CSS GM
Sbjct: 421 FVPETYSVISSKGTSDHHLVKVDIV---KAADRRPKKLTQKVARIASSCSSSGM 466

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0L5Q2_CUCSA  1.6e-254  92.37     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G150770 PE=4 SV=1
A0A061EMS5_THECC  4.2e-154  58.69     DNAse I-like superfamily protein OS=Theobroma cacao GN=TCM_021088 PE=4 SV=1
I1MFA3_SOYBN      1.8e-152  58.00     Uncharacterized protein OS=Glycine max GN=GLYMA_15G100400 PE=4 SV=2
A0A0B2RIL1_GLYSO  1.8e-152  58.00     Uncharacterized protein OS=Glycine soja GN=glysoja_010086 PE=4 SV=1
A0A0B2S062_GLYSO  2.3e-152  58.84     Uncharacterized protein OS=Glycine soja GN=glysoja_007240 PE=4 SV=1

Match Name   E-value   Identity  Description
AT3G21530.1  1.4e-137  56.05     DNAse I-like superfamily protein
AT2G48030.1  1.3e-111  55.01     DNAse I-like superfamily protein

Match Name                        E-value   Identity  Description
gi|449432773|ref|XP_004134173.1|  2.3e-254  92.37     PREDICTED: uncharacterized protein LOC101215085 [Cucumis sativus]
gi|659076584|ref|XP_008438759.1|  7.6e-250  90.27     PREDICTED: uncharacterized protein LOC103483772 [Cucumis melo]
gi|225432458|ref|XP_002277228.1|  3.4e-157  58.64     PREDICTED: uncharacterized protein LOC100259606 [Vitis vinifera]
gi|590660455|ref|XP_007035404.1|  6.0e-154  58.69     DNAse I-like superfamily protein [Theobroma cacao]
gi|1021027662|gb|KZM85449.1|      1.9e-152  59.07     hypothetical protein DCAR_027129 [Daucus carota subsp. sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005135  Endo/exonuclease/phosphatase
IPR027317  PGAP2-interacting protein

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
biological_process  GO:0008150      biological_process
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004518      nuclease activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0004527      exonuclease activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG05G005620.1  ClCG05G005620.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                       Source   Source Term       Source Description                                  Alignment
IPR005135  Endonuclease/exonuclease/phosphatase  GENE3D   G3DSA:3.60.10.10  -                                                   coord: 188..437, score: 1.7
IPR005135  Endonuclease/exonuclease/phosphatase  PFAM     PF03372           Exo_endo_phos                                       coord: 200..430, score: 8.3
IPR005135  Endonuclease/exonuclease/phosphatase  unknown  SSF56219          DNase I-like                                        coord: 194..437, score: 4.19
IPR027317  PGAP2-interacting protein             PANTHER  PTHR14859         CALCOFLUOR WHITE HYPERSENSITIVE PROTEIN PRECURSOR   coord: 1..467, score: 1.3E
None       No IPR available                      PANTHER  PTHR14859:SF2     DNASE I-LIKE SUPERFAMILY PROTEIN                    coord: 1..467, score: 1.3E

The following gene(s) are paralogous to this gene:

None