CmoCh19G000050 (gene) Cucurbita moschata (Rifu)

Name: CmoCh19G000050
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Endonuclease/exonuclease/phosphatase family protein
Location: Cmo_Chr19 : 41350 .. 44068 (-)
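
The Location field can be turned into a feature length with a few lines of code. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `parse_location` is a hypothetical helper written for this example, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location such as 'Cmo_Chr19 : 41350 .. 44068 (-)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: both endpoints are part of the feature (1-based, inclusive).
        "length": end - start + 1,
    }

print(parse_location("Cmo_Chr19 : 41350 .. 44068 (-)"))
```

Under that assumption the gene spans 2719 bases on the minus strand of Cmo_Chr19.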
The following sequences are available for this feature:

Gene sequence (with intron)

AGATGAAAACGAGTGGCCCCAAAAGTGGAAGTCAACCATTTGTAGGAGTCACCGAGTTGGACTAAAATTGGGGTTTTGTTATTAATTTCTGCACTCACACCAATGCCATACAGGTAAGCCCAGTTGAATGTCACTCTGCAACTCTCACACTTTCCTACTCCTTTTCTAAAATGCTCAACTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTGCCCGGCGGGTCAGACCTAGGGTAGTCGTCATCAAGAAATTTGGAAAAACCACCTCCAAACCTCACGCCGATCCCCACAATACCCTCGACTCCTTCGTTAATGCCTCCTTGGCGTCGCCTGTGCATCCCAAACCTCAATTTCACGGTCGTAATACTCAGAGACCTGTGCGGATTGCGACATTTAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCGTACGCTGAAAAATCTAATTCCTCTGCCAAATTCCGACGGAGTTTGGATTCCAGTTTACGGACAAAATCCGTAAATGATCGCCCCAAAAGCATTTTGAAACAGTCTCCACTGCATCCGAATTCCGTGAATGGAGTTGTTGCTAATCATAACCTCCACACCCAACCGAAGTTCGTGAAACCCAAGCCGCGGGTTTCGATCAACCTGCCTGATAACGAGATATCCTTACTCAGAAATCGACAGGCGAGCTTTTCTGAGTACGAAATGGAGAAGGAGGATCCGTCCTCTTCAGGTAACGATGGGAATGGGATGCGGATCGCTAAGAGTCGGCCTCCACTGAGATCGATTGTAAGCATGCCTTTGGAGCGCGAAAAAGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGGGAGTTGGATGCTGACATATTGGCATTGCAAGATGTGAAGGCGGTGGAAGAGAAAGGCATGAGACCGCTCTCGGATTTGGCAGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAGTATGGAAATGCGGTCCTGTCGAGATGGCCCATCAAACGCTGGAAAGTCGAGAAGATTTTCGACCACACCGATTTCAGGTTAGCAGCTGTACACCACACTCAATAGTATTACACTATTATTAAACTGAAGCAAACCTTTTAGATTAATTTGTGTGTTCTTGCTCTGCTCCTCATTTCATAAACATGCTTTTGATTGATGACACTAGTAGAGTAGATGGAGTCAAGGCGATTGACTGCTTTTGTTTTCCTTTTAAACAATGAATTCAGCAAAAACTAAGCTAAAAACTTAACTTATGGTCAACGTTAGATCTGTGATAGAAGGCAGAGGTCGAAGATTATTGAGAAGGAGTCCCACGTTGTCTAATTTAGAGAATGATTATGGGTTTATAGGTAAGGAATACATCTCCATTGGTATGAGGCGGCCTTTTGGGGAATCCCAAAGCAAAGCCATGAGAACTTATGCTCAAAGTAGACAATATCATACGATTGTGGAGAGTCGTTATTACCAACTAGAAACTTATGAAAAGTAGTAAATAGTTGATAAACTCTGTGAGAATGATAGAATTGGTTATTGGGTTGGGCGGTGGCAGGAATGTGTTAAAAGCGACCATTGATGTGGGAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAATCCATAATCCGATCGACCAACGACGAACCCCATATCTTATTAGGAGGCCTTAATTCTCTGGATCCCACGGATTACTCGCAGCAACGGTGGACGGACATCGTGAAGGTATCAATGTTTATTTTGAACTTTGTTTAGCTTAGTTGAGTTTTGTTTATGGCATGCATCCCATGTTGGTTGGTTGGTTGGTTGATTGATTGATTCTTCAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTGATTAAGTTCTTAAAGAGCGGTATGCACTATAGGGATGCAAAGGAGTTTGGAGGAGAATGCGA
ATCAGTGGTGATGATCGCCAAAGGCCAAAGTATGAAACTGAGTGATGAATATGAAAAAAAAAGATAGGACAATGAGAATGAAAGGTTTTTACTAATGGAATGTATGGCATTGGTTGGCAATGGCAGGTGTTCAAGGGACGTGCAAGTACGGGACACGAGTCGACTACATATTGGCCTCTCCCGATGCAGATTACGAGTTTGTTAAAGGATCCTACTCTGTCCTTTCCTCAAAAGGAACCTCCGATCATCACATTGTCAAGGTTGATTTCCTCAAACCTCCTCATTCTCGAGGTTGATTTCTTCAAACCTCCTCCTCGGCCTCGGCCTCGGCCCCAAACCCGTTCGCATTCCCTTTCCCCTTGGAAGAGATGGACACGAAACAACGATCAACACGCCCACACCCTTTGTCTCCTTTAATTTCTCAAACCTTTTCACTTATATTTACACACTCATTTGTATTACATTTAGGAATGGAACTCATTCCTTTTGTTCTATGCCGTATGTAACCAAAGAACCCATGTACACCTTAATTTCAAACAAAAGTATTAGACACAAAAACTTCAAAACTAAAATGAGATTTGAAGGAAGCTGATGTATTTACCAACACACAAGCTACAGTTACTACTGGAACTGCCTGGATTTCTAGTTAAAGTTCTAATGAAAGCTCTAAGTATATCGTTGTCATCAACAATCTACAGAATTACAGAAGAATAGTGTTG

mRNA sequence

AGATGAAAACGAGTGGCCCCAAAAGTGGAAGTCAACCATTTGTAGGAGTCACCGAGTTGGACTAAAATTGGGGTTTTGTTATTAATTTCTGCACTCACACCAATGCCATACAGGTAAGCCCAGTTGAATGTCACTCTGCAACTCTCACACTTTCCTACTCCTTTTCTAAAATGCTCAACTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTGCCCGGCGGGTCAGACCTAGGGTAGTCGTCATCAAGAAATTTGGAAAAACCACCTCCAAACCTCACGCCGATCCCCACAATACCCTCGACTCCTTCGTTAATGCCTCCTTGGCGTCGCCTGTGCATCCCAAACCTCAATTTCACGGTCGTAATACTCAGAGACCTGTGCGGATTGCGACATTTAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCGTACGCTGAAAAATCTAATTCCTCTGCCAAATTCCGACGGAGTTTGGATTCCAGTTTACGGACAAAATCCGTAAATGATCGCCCCAAAAGCATTTTGAAACAGTCTCCACTGCATCCGAATTCCGTGAATGGAGTTGTTGCTAATCATAACCTCCACACCCAACCGAAGTTCGTGAAACCCAAGCCGCGGGTTTCGATCAACCTGCCTGATAACGAGATATCCTTACTCAGAAATCGACAGGCGAGCTTTTCTGAGTACGAAATGGAGAAGGAGGATCCGTCCTCTTCAGGTAACGATGGGAATGGGATGCGGATCGCTAAGAGTCGGCCTCCACTGAGATCGATTGTAAGCATGCCTTTGGAGCGCGAAAAAGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGGGAGTTGGATGCTGACATATTGGCATTGCAAGATGTGAAGGCGGTGGAAGAGAAAGGCATGAGACCGCTCTCGGATTTGGCAGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAGTATGGAAATGCGGTCCTGTCGAGATGGCCCATCAAACGCTGGAAAGTCGAGAAGATTTTCGACCACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGGAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAATCCATAATCCGATCGACCAACGACGAACCCCATATCTTATTAGGAGGCCTTAATTCTCTGGATCCCACGGATTACTCGCAGCAACGGTGGACGGACATCGTGAAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTGATTAAGTTCTTAAAGAGCGGTATGCACTATAGGGATGCAAAGGAGTTTGGAGGAGAATGCGAATCAGTGGTGATGATCGCCAAAGGCCAAAGTGTTCAAGGGACGTGCAAGTACGGGACACGAGTCGACTACATATTGGCCTCTCCCGATGCAGATTACGAGTTTGTTAAAGGATCCTACTCTGTCCTTTCCTCAAAAGGAACCTCCGATCATCACATTGTCAAGGTTGATTTCCTCAAACCTCCTCATTCTCGAGGTTGATTTCTTCAAACCTCCTCCTCGGCCTCGGCCTCGGCCCCAAACCCGTTCGCATTCCCTTTCCCCTTGGAAGAGATGGACACGAAACAACGATCAACACGCCCACACCCTTTGTCTCCTTTAATTTCTCAAACCTTTTCACTTATATTTACACACTCATTTGTATTACATTTAGGAATGGAACTCATTCCTTTTGTTCTATGCCGTATGTAACCAAAGAACCCATGTACACCTTAATTTCAAACAAAAGTATTAGACACAAAAACTTCAAAACTAAAATGAGATTTGAAGGAAGCTGATGTATTTACCAACACACAAGCTACAGTTACTACTGGAACTGCCTGGATTTCTAGTTAAAGTTCTAATGAAAGCTCTAAGTATATCGTTGTCATCAACAATCTACAGAATTACAGAAGA
ATAGTGTTG

Coding sequence (CDS)

ATGCTCAACTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTGCCCGGCGGGTCAGACCTAGGGTAGTCGTCATCAAGAAATTTGGAAAAACCACCTCCAAACCTCACGCCGATCCCCACAATACCCTCGACTCCTTCGTTAATGCCTCCTTGGCGTCGCCTGTGCATCCCAAACCTCAATTTCACGGTCGTAATACTCAGAGACCTGTGCGGATTGCGACATTTAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCGTACGCTGAAAAATCTAATTCCTCTGCCAAATTCCGACGGAGTTTGGATTCCAGTTTACGGACAAAATCCGTAAATGATCGCCCCAAAAGCATTTTGAAACAGTCTCCACTGCATCCGAATTCCGTGAATGGAGTTGTTGCTAATCATAACCTCCACACCCAACCGAAGTTCGTGAAACCCAAGCCGCGGGTTTCGATCAACCTGCCTGATAACGAGATATCCTTACTCAGAAATCGACAGGCGAGCTTTTCTGAGTACGAAATGGAGAAGGAGGATCCGTCCTCTTCAGGTAACGATGGGAATGGGATGCGGATCGCTAAGAGTCGGCCTCCACTGAGATCGATTGTAAGCATGCCTTTGGAGCGCGAAAAAGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGGGAGTTGGATGCTGACATATTGGCATTGCAAGATGTGAAGGCGGTGGAAGAGAAAGGCATGAGACCGCTCTCGGATTTGGCAGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAGTATGGAAATGCGGTCCTGTCGAGATGGCCCATCAAACGCTGGAAAGTCGAGAAGATTTTCGACCACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGGAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAATCCATAATCCGATCGACCAACGACGAACCCCATATCTTATTAGGAGGCCTTAATTCTCTGGATCCCACGGATTACTCGCAGCAACGGTGGACGGACATCGTGAAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTGATTAAGTTCTTAAAGAGCGGTATGCACTATAGGGATGCAAAGGAGTTTGGAGGAGAATGCGAATCAGTGGTGATGATCGCCAAAGGCCAAAGTGTTCAAGGGACGTGCAAGTACGGGACACGAGTCGACTACATATTGGCCTCTCCCGATGCAGATTACGAGTTTGTTAAAGGATCCTACTCTGTCCTTTCCTCAAAAGGAACCTCCGATCATCACATTGTCAAGGTTGATTTCCTCAAACCTCCTCATTCTCGAGGTTGA
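
The coding sequence can be checked against the BLAST query protein by translating it. The sketch below assumes the standard genetic code and an in-frame sequence starting at the first base; `translate` is a hypothetical helper for this example, and only the first and last codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First eight codons and last three codons of the CDS shown above.
print(translate("ATGCTCAACTTCCTCAACCGGAGC"))  # MLNFLNRS
print(translate("CGAGGTTGA"))                 # RG*
```

The translated start, MLNFLNRS, agrees with the first residues of the Query lines in the alignments below, and the final codon TGA is a stop codon.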
BLAST of CmoCh19G000050 vs. TrEMBL
Match: A0A0A0KDU0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G409370 PE=4 SV=1)

HSP 1 Score: 739.2 bits (1907), Expect = 3.1e-210
Identity = 390/469 (83.16%), Positives = 408/469 (86.99%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVN-ASLAS 60
           ML FLNR LRRLCSRLRWPR R +RPRV++IKKFGKTTS+  + P  T+DSFVN AS  S
Sbjct: 1   MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPS 60

Query: 61  PVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVND 120
            VHP  QFH   TQRP+RIATFNAASFSMAPAVP  EKSNSSAKFRRSLDS+ RTKSVND
Sbjct: 61  AVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSNSSAKFRRSLDSNSRTKSVND 120

Query: 121 RPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEY 180
           RPKSILKQSPLH NS+N  VA           + KPRVSINLPDNEISLLRNRQAS  EY
Sbjct: 121 RPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQAS--EY 180

Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADIL 240
           EME E+ SSSGND  GMRIAKS  PLR  VSMP ER    +YRCSRTVVEVLRELDADIL
Sbjct: 181 EME-ENLSSSGNDRKGMRIAKSGTPLRWTVSMPSERG---TYRCSRTVVEVLRELDADIL 240

Query: 241 ALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTD 300
           ALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Sbjct: 241 ALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD 300

Query: 301 FRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPT 360
           FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPT
Sbjct: 301 FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPT 360

Query: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQG 420
           DYSQQRW DIVKYYEEIGKPTPEAKV KFLKS M YRDAKEFGGECESVVMIAKGQSVQG
Sbjct: 361 DYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQG 420

Query: 421 TCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPH 469
           TCKYGTRVDYI+ASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH
Sbjct: 421 TCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH 450
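
The percentages in each HSP summary line follow directly from the match counts and the alignment length. As a small sketch (the helper name `hsp_percentages` is hypothetical, and the parsing assumes the first two `n/m` pairs on the line are identity and positives, in that order):

```python
import re

def hsp_percentages(summary):
    """Recompute (identity %, positives %) from an HSP summary line."""
    # Collect every 'matches/length' pair found on the line.
    pairs = [(int(n), int(m)) for n, m in re.findall(r"(\d+)/(\d+)", summary)]
    if len(pairs) < 2:
        raise ValueError("expected identity and positives counts")
    (ident, ident_len), (pos, pos_len) = pairs[:2]
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 390/469 (83.16%), Positives = 408/469 (86.99%), Query Frame = 1"
print(hsp_percentages(line))
```

For the Cucumis sativus hit this reproduces 83.16% identity and 86.99% positives over an alignment of 469 columns.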

BLAST of CmoCh19G000050 vs. TrEMBL
Match: F6HQ43_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g00790 PE=4 SV=1)

HSP 1 Score: 589.7 bits (1519), Expect = 3.1e-165
Identity = 322/475 (67.79%), Positives = 359/475 (75.58%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  +N  LRRLCSRLRWP   R RPRVV IK  GK++SK H D     ++  N S    
Sbjct: 18  MLKIINTRLRRLCSRLRWPLRGRSRPRVV-IKTLGKSSSKSHFDAKR--EAAANGSAV-- 77

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNS-------SAKFRRSLDSSLR 120
           VHP  Q       + +RIATFNAA FSMAPAVP AEK  +         K +RS+D + R
Sbjct: 78  VHPNGQLGAEKPNKTIRIATFNAALFSMAPAVPRAEKGENFGNGNVEGLKEKRSVDMNFR 137

Query: 121 TKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQ 180
           TKS N+RPKSILKQSPLHPNS+   +   NL  Q KF K KPRVSINLPDNEISL RNR+
Sbjct: 138 TKSANERPKSILKQSPLHPNSM---ILPENLSKQQKFAKSKPRVSINLPDNEISLGRNRK 197

Query: 181 ASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREK---GESYRCSRTVVEV 240
            SF    +E+++ SSS   G   RI + + PLRS VS P   EK   GE+YR  RTVVEV
Sbjct: 198 LSF----VEEKEGSSSSTIG---RILRGKAPLRSTVSFPASLEKVTEGEAYRSRRTVVEV 257

Query: 241 LRELDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWK 300
           LRELDADILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNA+LS+WPIKRWK
Sbjct: 258 LRELDADILALQDVKAEEEKAMKPLSDLAAALGMNYVFAESWAPEYGNAILSKWPIKRWK 317

Query: 301 VEKIFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILL 360
           V+KIFD TDFRNVLKATIDV + GEVN  CTHLDHLDENWRMKQI SII+S N+ PHIL 
Sbjct: 318 VQKIFDDTDFRNVLKATIDVPQAGEVNFHCTHLDHLDENWRMKQINSIIQS-NEGPHILA 377

Query: 361 GGLNSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVM 420
           GGLNSLD TDYS +RWTDIVKYYEE+GKPTP+ +V+KFLKS   Y DAK+F GECESVVM
Sbjct: 378 GGLNSLDETDYSSERWTDIVKYYEEMGKPTPKVEVMKFLKS-KQYTDAKDFAGECESVVM 437

Query: 421 IAKGQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           IAKGQSVQGTCKYGTRVDYILASP + Y+FV GSYSV SSKGTSDHHIVKVD  K
Sbjct: 438 IAKGQSVQGTCKYGTRVDYILASPSSPYKFVPGSYSVFSSKGTSDHHIVKVDIAK 475

BLAST of CmoCh19G000050 vs. TrEMBL
Match: A0A067JCJ1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21993 PE=4 SV=1)

HSP 1 Score: 579.7 bits (1493), Expect = 3.2e-162
Identity = 314/472 (66.53%), Positives = 362/472 (76.69%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  LN+ LRR CSRLRWP  R+ + +++ I+KFGK+  KP  D  N  +S  N S  + 
Sbjct: 1   MLKILNKRLRRFCSRLRWPIRRKSKSKII-IRKFGKSNPKPQNDTKN--ESITNGS--AT 60

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNS-------SAKFRRSLDSSLR 120
           VHP  Q     T++P++IATFNAA FSMAPAVP    S+S       S K  RS D S R
Sbjct: 61  VHPNGQLDSLKTEQPIKIATFNAALFSMAPAVPKLPNSSSFDFDNEDSQKGARSTDFSFR 120

Query: 121 TKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQ 180
           TKS NDRPKSILK SPLHPNS+N   +N  L  Q K+ K K RVSINLPDNEISLLRNRQ
Sbjct: 121 TKSANDRPKSILKPSPLHPNSMN---SNEILPKQQKYAKSKLRVSINLPDNEISLLRNRQ 180

Query: 181 ASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRE 240
             F+E E EKE  S+S    N  RI + + P+RS  +  +  E  ESYR +RTV+EVL+E
Sbjct: 181 LGFAE-EKEKETTSAS----NLTRILRGKAPMRSQSARSIGNEV-ESYRSTRTVLEVLKE 240

Query: 241 LDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK 300
           LDADILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNA+LS+WPIK+W+V+K
Sbjct: 241 LDADILALQDVKAEEEKAMKPLSDLASALGMNYVFAESWAPEYGNAILSKWPIKKWRVQK 300

Query: 301 IFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGL 360
           IFD TDFRNVLKATIDV E GE+N  CTHLDHLDENWRMKQI +II+S ND PHIL GGL
Sbjct: 301 IFDDTDFRNVLKATIDVPEKGELNFYCTHLDHLDENWRMKQINAIIQS-NDGPHILAGGL 360

Query: 361 NSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAK 420
           NSLD TDYS +RWTDIVKYYEE+GKPTP+ +V++FLKS  HY DAKEF GECESVVMIAK
Sbjct: 361 NSLDETDYSAERWTDIVKYYEEMGKPTPKVEVMRFLKS-KHYSDAKEFAGECESVVMIAK 420

Query: 421 GQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           GQ+VQGTCKYGTRVDYILAS ++ Y+FV GSYSV SSKGTSDHHIVKVD +K
Sbjct: 421 GQNVQGTCKYGTRVDYILASSNSPYKFVPGSYSVFSSKGTSDHHIVKVDMIK 456

BLAST of CmoCh19G000050 vs. TrEMBL
Match: A0A061DQ80_THECC (DNAse I-like superfamily protein OS=Theobroma cacao GN=TCM_004482 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 7.1e-162
Identity = 314/476 (65.97%), Positives = 363/476 (76.26%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  LN+ ++R CSR+RWP  RR + ++V IK+FGK+ S+ ++D  +   + VN +  S 
Sbjct: 19  MLKLLNKRIKRFCSRIRWPVRRRSKSKIV-IKRFGKSNSRANSDTKD--HTIVNGT--SK 78

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSS-------AKFRRSLDSSLR 120
           VH   Q  G ++ RP+RIATFNAA FSMAPA+P AE S+S           RRS+D SLR
Sbjct: 79  VHQDGQLGGLDSVRPIRIATFNAALFSMAPAIPKAENSSSFDFENEGFKDARRSMDLSLR 138

Query: 121 TKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQ 180
            KS NDRPKSILKQSP+HPNS+N      NL  Q KFVK K RVSINLPDNEISLLRNRQ
Sbjct: 139 AKSTNDRPKSILKQSPMHPNSIND---KENLSNQQKFVKSKLRVSINLPDNEISLLRNRQ 198

Query: 181 ASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKG----ESYRCSRTVVE 240
            SF+E    KE  SS G    G RI + + PLRS VS       G    E YR  +TV+E
Sbjct: 199 LSFAE--RGKEGSSSGG----GSRILRGKAPLRSTVSFSTNMGNGVDSFERYRSRKTVLE 258

Query: 241 VLRELDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRW 300
           VLRELDADILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNAVLS+WPIKRW
Sbjct: 259 VLRELDADILALQDVKAEEEKAMKPLSDLAAALGMNYVFAESWAPEYGNAVLSKWPIKRW 318

Query: 301 KVEKIFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHIL 360
           KV+KIFD TDFRNVLKATIDV + GEV+  CTHLDHLDENWRMKQI +II+S ND PHIL
Sbjct: 319 KVQKIFDDTDFRNVLKATIDVPQAGEVDFHCTHLDHLDENWRMKQINAIIQS-NDGPHIL 378

Query: 361 LGGLNSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVV 420
            GGLNSL+ TDYS +RWTDIVKYYEE+GKP P+ +V+KFLK+   Y DAK+F GECE VV
Sbjct: 379 AGGLNSLEETDYSTERWTDIVKYYEEMGKPIPKVEVMKFLKN-KQYTDAKDFAGECEPVV 438

Query: 421 MIAKGQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           +IAKGQSVQGTCKYGTRVDYILASP++ Y+FV GSYSVLSSKGTSDHH+VKVD +K
Sbjct: 439 VIAKGQSVQGTCKYGTRVDYILASPNSPYKFVPGSYSVLSSKGTSDHHMVKVDIIK 478

BLAST of CmoCh19G000050 vs. TrEMBL
Match: W9RXD9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008386 PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 3.0e-160
Identity = 310/477 (64.99%), Positives = 364/477 (76.31%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  LNR+LR LCSRLRWP  RR +P+VV I++FGK++SK      N  ++      ++ 
Sbjct: 1   MLKLLNRNLRLLCSRLRWPIRRRSKPKVV-IRRFGKSSSKAR----NGAETEPRVPASAA 60

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVP----YAEKSNSSAKFRRSLDSSLRTKS 120
           VHP  QF G  ++ P+RIATFNAA FSMAPAVP    + E +N      R+L+  +R+KS
Sbjct: 61  VHPNGQFVGAKSETPLRIATFNAALFSMAPAVPKAARFVEDTNGDVLKVRNLN--VRSKS 120

Query: 121 VNDRPKSILKQSPLHPNSVNGVVANHNLHT---QPKFVKPKPRVSINLPDNEISLLRNRQ 180
           +NDRPKSILKQS LHPNS++    N+NL     Q KF + K RVSINLPDNEISLLRNRQ
Sbjct: 121 LNDRPKSILKQSRLHPNSMSSNNNNNNLDNLSNQQKFARSKLRVSINLPDNEISLLRNRQ 180

Query: 181 ASFSEYEMEKEDPSSSGNDGNG--MRIAKSRPPLRSIVSMPLEREKG---ESYRCSRTVV 240
            SFSE           G +G+    R  + + PLRS VS       G   +SYR +RTV+
Sbjct: 181 LSFSE----------DGKEGSSSISRFLRGKAPLRSSVSFSANIGSGTDEDSYRSTRTVL 240

Query: 241 EVLRELDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKR 300
           EVLRELD DILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNA+LS+WPIKR
Sbjct: 241 EVLRELDTDILALQDVKAEEEKAMKPLSDLASALGMNYVFAESWAPEYGNAILSKWPIKR 300

Query: 301 WKVEKIFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHI 360
           WKV+KIFD TDFRNVLKATIDV +VGE+N  CTHLDHLDENWRMKQI +II+S N+EPHI
Sbjct: 301 WKVQKIFDDTDFRNVLKATIDVPQVGEINFHCTHLDHLDENWRMKQINAIIQS-NNEPHI 360

Query: 361 LLGGLNSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESV 420
           L+GGLNSL+ TDYSQ+RWTDIVKYYEE+GKPTP+ +V+++LKS   Y DAK+F GECESV
Sbjct: 361 LVGGLNSLEETDYSQERWTDIVKYYEEMGKPTPKVEVMRYLKS-KQYTDAKDFAGECESV 420

Query: 421 VMIAKGQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           VMIAKGQSVQGTCKYGTRVDYILASP++ Y+FV GSYSV SSKGTSDHHIVKVD  K
Sbjct: 421 VMIAKGQSVQGTCKYGTRVDYILASPNSPYKFVPGSYSVFSSKGTSDHHIVKVDIRK 458

BLAST of CmoCh19G000050 vs. TAIR10
Match: AT3G21530.1 (AT3G21530.1 DNAse I-like superfamily protein)

HSP 1 Score: 431.8 bits (1109), Expect = 5.4e-121
Identity = 258/477 (54.09%), Positives = 313/477 (65.62%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTT--SKPHADPHNTLDSFVNASLA 60
           ML    R L  L SRLRW   +RVR RV+V ++F K    ++    P + + S   +S  
Sbjct: 1   MLCVFRRKLGCLFSRLRWVIKKRVRARVIV-RRFRKARWRARRKESPESEVSSIHLSS-- 60

Query: 61  SPVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVN 120
                       N+ R +R+ATFN A FS+AP V   E++     F   LDSS  T    
Sbjct: 61  ------------NSGRHIRVATFNVAMFSLAPVVQTMEET----AFLGHLDSSNITCP-- 120

Query: 121 DRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSE 180
             PK ILKQSPLH ++V                  KP+V INLPDNEISL +    S+S 
Sbjct: 121 -SPKGILKQSPLHSSAVR-----------------KPKVCINLPDNEISLAQ----SYSF 180

Query: 181 YEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPL---EREKGESYRCSRTVVEVLRELD 240
             M + D     NDG   R + S   +RS V +P    ++E    Y   R++ E+LRELD
Sbjct: 181 LSMVEND-----NDGKENRGSLS---MRSPVCLPSCWWDQESFNGYSSRRSIAELLRELD 240

Query: 241 ADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIF 300
           ADILALQDVKA EE  M+PLSDLA ALGMKYVFAESWAPEYGNA+LS+WPIK+W+V++I 
Sbjct: 241 ADILALQDVKAEEETLMKPLSDLASALGMKYVFAESWAPEYGNAILSKWPIKKWRVQRIA 300

Query: 301 DHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNS 360
           D  DFRNVLK T+++   G+VNV CT LDHLDENWRMKQI +I R  ++ PHILLGGLNS
Sbjct: 301 DVDDFRNVLKVTVEIPWAGDVNVYCTQLDHLDENWRMKQIDAITRG-DESPHILLGGLNS 360

Query: 361 LDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQ 420
           LD +DYS  RW  IVKYYE+ GKPTP  +V++FLK G  Y D+KEF GECE VV+IAKGQ
Sbjct: 361 LDGSDYSIARWNHIVKYYEDSGKPTPRVEVMRFLK-GKGYLDSKEFAGECEPVVIIAKGQ 420

Query: 421 SVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDF-LKPPHSRG 472
           +VQGTCKYGTRVDYILASP++ YEFV GSYSV+SSKGTSDHHIVKVD  +    SRG
Sbjct: 421 NVQGTCKYGTRVDYILASPESPYEFVPGSYSVVSSKGTSDHHIVKVDLVITKERSRG 424

BLAST of CmoCh19G000050 vs. TAIR10
Match: AT2G48030.1 (AT2G48030.1 DNAse I-like superfamily protein)

HSP 1 Score: 427.2 bits (1097), Expect = 1.3e-119
Identity = 263/467 (56.32%), Positives = 301/467 (64.45%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           MLN +   LRR   RLR PR    + R+ V        S P    H         S A+ 
Sbjct: 1   MLNLI-AFLRR---RLRRPR----KARISVNHHHLSVDSSPETHHHQN-----GFSSAAA 60

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR 120
           +HP P        + + +ATFNAA FSMAPAVP    SN    FR        +KS  DR
Sbjct: 61  IHPNPD-------KTITVATFNAAMFSMAPAVP----SNKGLPFR--------SKSTVDR 120

Query: 121 PKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPR-VSINLPDNEISLLRNRQASFSEY 180
           PKSILK  P++          H+   Q +F K +PR VSINLPDNEIS    RQ SF   
Sbjct: 121 PKSILK--PMNA----AASPTHDSRKQQRFAKSRPRRVSINLPDNEIS----RQLSF--- 180

Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGE-SYRCSRTVVEVLRELDADI 240
              +EDP  S              PLR           GE   R +RT +EVL ELDAD+
Sbjct: 181 ---REDPQHS--------------PLRP----------GEIGLRSTRTALEVLSELDADV 240

Query: 241 LALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHT 300
           LALQDVKA E   MRPLSDLA ALGM YVFAESWAPEYGNA+LS+WPIK   V +IFDHT
Sbjct: 241 LALQDVKADEADQMRPLSDLAAALGMNYVFAESWAPEYGNAILSKWPIKSSNVLRIFDHT 300

Query: 301 DFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDP 360
           DFRNVLKA+I+V   GEV   CTHLDHLDE WRMKQ+ +II+STN  PHIL G LNSLD 
Sbjct: 301 DFRNVLKASIEVPGSGEVEFHCTHLDHLDEKWRMKQVDAIIQSTN-VPHILAGALNSLDE 360

Query: 361 TDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQ 420
           +DYS +RWTDIVKYYEE+GKP P+A+V++FLKS   Y DAK+F GECESVV++AKGQSVQ
Sbjct: 361 SDYSPERWTDIVKYYEEMGKPIPKAQVMRFLKS-KEYTDAKDFAGECESVVVVAKGQSVQ 393

Query: 421 GTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           GTCKYGTRVDYILAS D+ Y FV GSYSVLSSKGTSDHHIVKVD +K
Sbjct: 421 GTCKYGTRVDYILASSDSPYRFVPGSYSVLSSKGTSDHHIVKVDVVK 393

BLAST of CmoCh19G000050 vs. NCBI nr
Match: gi|659097139|ref|XP_008449463.1| (PREDICTED: uncharacterized protein LOC103491341 [Cucumis melo])

HSP 1 Score: 743.8 bits (1919), Expect = 1.8e-211
Identity = 393/469 (83.80%), Positives = 411/469 (87.63%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTS-KPHADPHNTLDSFVNASLAS 60
           ML FLNR LRRLCSRLRWPR RR+RPRV+VIKKFGKTTS + ++ P  T+DSFVNAS  S
Sbjct: 1   MLKFLNRKLRRLCSRLRWPRRRRIRPRVLVIKKFGKTTSYETNSHPEKTIDSFVNASSPS 60

Query: 61  PVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVND 120
            VHP  QF+  NTQRP+RIATFNAASFSMAPAVP  EKSNSSAKFRRSLDS+ RTKSVND
Sbjct: 61  AVHPNSQFYLLNTQRPIRIATFNAASFSMAPAVP--EKSNSSAKFRRSLDSNSRTKSVND 120

Query: 121 RPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEY 180
           RPKSILKQSPLH NS+N  VA           K KPRVSINLPDNEISLLRNRQAS  EY
Sbjct: 121 RPKSILKQSPLHTNSINSGVA-----------KTKPRVSINLPDNEISLLRNRQAS--EY 180

Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADIL 240
           EME E+ SSSGND  GM IAKS  PLR  VSMP ER    SYRCSRTVVEVLR+LDADIL
Sbjct: 181 EME-ENLSSSGNDRRGMGIAKSGTPLRWTVSMPSERG---SYRCSRTVVEVLRDLDADIL 240

Query: 241 ALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTD 300
           ALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Sbjct: 241 ALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD 300

Query: 301 FRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPT 360
           FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPT
Sbjct: 301 FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPT 360

Query: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQG 420
           DYSQQRWTDIVKYYEEIGKPTPEAKV KFLKS M YRDAKE+GGECESVVMIAKGQSVQG
Sbjct: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEYGGECESVVMIAKGQSVQG 420

Query: 421 TCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPH 469
           TCKYGTRVDYILASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH
Sbjct: 421 TCKYGTRVDYILASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH 450

BLAST of CmoCh19G000050 vs. NCBI nr
Match: gi|778716168|ref|XP_004140218.2| (PREDICTED: uncharacterized protein LOC101212223 [Cucumis sativus])

HSP 1 Score: 739.2 bits (1907), Expect = 4.5e-210
Identity = 390/469 (83.16%), Positives = 408/469 (86.99%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVN-ASLAS 60
           ML FLNR LRRLCSRLRWPR R +RPRV++IKKFGKTTS+  + P  T+DSFVN AS  S
Sbjct: 1   MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPS 60

Query: 61  PVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVND 120
            VHP  QFH   TQRP+RIATFNAASFSMAPAVP  EKSNSSAKFRRSLDS+ RTKSVND
Sbjct: 61  AVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSNSSAKFRRSLDSNSRTKSVND 120

Query: 121 RPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEY 180
           RPKSILKQSPLH NS+N  VA           + KPRVSINLPDNEISLLRNRQAS  EY
Sbjct: 121 RPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQAS--EY 180

Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADIL 240
           EME E+ SSSGND  GMRIAKS  PLR  VSMP ER    +YRCSRTVVEVLRELDADIL
Sbjct: 181 EME-ENLSSSGNDRKGMRIAKSGTPLRWTVSMPSERG---TYRCSRTVVEVLRELDADIL 240

Query: 241 ALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTD 300
           ALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Sbjct: 241 ALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD 300

Query: 301 FRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPT 360
           FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPT
Sbjct: 301 FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPT 360

Query: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQG 420
           DYSQQRW DIVKYYEEIGKPTPEAKV KFLKS M YRDAKEFGGECESVVMIAKGQSVQG
Sbjct: 361 DYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQG 420

Query: 421 TCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPH 469
           TCKYGTRVDYI+ASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH
Sbjct: 421 TCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH 450

BLAST of CmoCh19G000050 vs. NCBI nr
Match: gi|359479852|ref|XP_002271044.2| (PREDICTED: uncharacterized protein LOC100247717 [Vitis vinifera])

HSP 1 Score: 589.7 bits (1519), Expect = 4.4e-165
Identity = 322/475 (67.79%), Positives = 359/475 (75.58%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  +N  LRRLCSRLRWP   R RPRVV IK  GK++SK H D     ++  N S    
Sbjct: 18  MLKIINTRLRRLCSRLRWPLRGRSRPRVV-IKTLGKSSSKSHFDAKR--EAAANGSAV-- 77

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNS-------SAKFRRSLDSSLR 120
           VHP  Q       + +RIATFNAA FSMAPAVP AEK  +         K +RS+D + R
Sbjct: 78  VHPNGQLGAEKPNKTIRIATFNAALFSMAPAVPRAEKGENFGNGNVEGLKEKRSVDMNFR 137

Query: 121 TKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQ 180
           TKS N+RPKSILKQSPLHPNS+   +   NL  Q KF K KPRVSINLPDNEISL RNR+
Sbjct: 138 TKSANERPKSILKQSPLHPNSM---ILPENLSKQQKFAKSKPRVSINLPDNEISLGRNRK 197

Query: 181 ASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREK---GESYRCSRTVVEV 240
            SF    +E+++ SSS   G   RI + + PLRS VS P   EK   GE+YR  RTVVEV
Sbjct: 198 LSF----VEEKEGSSSSTIG---RILRGKAPLRSTVSFPASLEKVTEGEAYRSRRTVVEV 257

Query: 241 LRELDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWK 300
           LRELDADILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNA+LS+WPIKRWK
Sbjct: 258 LRELDADILALQDVKAEEEKAMKPLSDLAAALGMNYVFAESWAPEYGNAILSKWPIKRWK 317

Query: 301 VEKIFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILL 360
           V+KIFD TDFRNVLKATIDV + GEVN  CTHLDHLDENWRMKQI SII+S N+ PHIL 
Sbjct: 318 VQKIFDDTDFRNVLKATIDVPQAGEVNFHCTHLDHLDENWRMKQINSIIQS-NEGPHILA 377

Query: 361 GGLNSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVM 420
           GGLNSLD TDYS +RWTDIVKYYEE+GKPTP+ +V+KFLKS   Y DAK+F GECESVVM
Sbjct: 378 GGLNSLDETDYSSERWTDIVKYYEEMGKPTPKVEVMKFLKS-KQYTDAKDFAGECESVVM 437

Query: 421 IAKGQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           IAKGQSVQGTCKYGTRVDYILASP + Y+FV GSYSV SSKGTSDHHIVKVD  K
Sbjct: 438 IAKGQSVQGTCKYGTRVDYILASPSSPYKFVPGSYSVFSSKGTSDHHIVKVDIAK 475

BLAST of CmoCh19G000050 vs. NCBI nr
Match: gi|802794201|ref|XP_012092317.1| (PREDICTED: uncharacterized protein LOC105650052 [Jatropha curcas])

HSP 1 Score: 579.7 bits (1493), Expect = 4.6e-162
Identity = 314/472 (66.53%), Positives = 362/472 (76.69%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  LN+ LRR CSRLRWP  R+ + +++ I+KFGK+  KP  D  N  +S  N S  + 
Sbjct: 1   MLKILNKRLRRFCSRLRWPIRRKSKSKII-IRKFGKSNPKPQNDTKN--ESITNGS--AT 60

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNS-------SAKFRRSLDSSLR 120
           VHP  Q     T++P++IATFNAA FSMAPAVP    S+S       S K  RS D S R
Sbjct: 61  VHPNGQLDSLKTEQPIKIATFNAALFSMAPAVPKLPNSSSFDFDNEDSQKGARSTDFSFR 120

Query: 121 TKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQ 180
           TKS NDRPKSILK SPLHPNS+N   +N  L  Q K+ K K RVSINLPDNEISLLRNRQ
Sbjct: 121 TKSANDRPKSILKPSPLHPNSMN---SNEILPKQQKYAKSKLRVSINLPDNEISLLRNRQ 180

Query: 181 ASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRE 240
             F+E E EKE  S+S    N  RI + + P+RS  +  +  E  ESYR +RTV+EVL+E
Sbjct: 181 LGFAE-EKEKETTSAS----NLTRILRGKAPMRSQSARSIGNEV-ESYRSTRTVLEVLKE 240

Query: 241 LDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK 300
           LDADILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNA+LS+WPIK+W+V+K
Sbjct: 241 LDADILALQDVKAEEEKAMKPLSDLASALGMNYVFAESWAPEYGNAILSKWPIKKWRVQK 300

Query: 301 IFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGL 360
           IFD TDFRNVLKATIDV E GE+N  CTHLDHLDENWRMKQI +II+S ND PHIL GGL
Sbjct: 301 IFDDTDFRNVLKATIDVPEKGELNFYCTHLDHLDENWRMKQINAIIQS-NDGPHILAGGL 360

Query: 361 NSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAK 420
           NSLD TDYS +RWTDIVKYYEE+GKPTP+ +V++FLKS  HY DAKEF GECESVVMIAK
Sbjct: 361 NSLDETDYSAERWTDIVKYYEEMGKPTPKVEVMRFLKS-KHYSDAKEFAGECESVVMIAK 420

Query: 421 GQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           GQ+VQGTCKYGTRVDYILAS ++ Y+FV GSYSV SSKGTSDHHIVKVD +K
Sbjct: 421 GQNVQGTCKYGTRVDYILASSNSPYKFVPGSYSVFSSKGTSDHHIVKVDMIK 456

BLAST of CmoCh19G000050 vs. NCBI nr
Match: gi|590717956|ref|XP_007050713.1| (DNAse I-like superfamily protein [Theobroma cacao])

HSP 1 Score: 578.6 bits (1490), Expect = 1.0e-161
Identity = 314/476 (65.97%), Positives = 363/476 (76.26%), Query Frame = 1

Query: 1   MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
           ML  LN+ ++R CSR+RWP  RR + ++V IK+FGK+ S+ ++D  +   + VN +  S 
Sbjct: 19  MLKLLNKRIKRFCSRIRWPVRRRSKSKIV-IKRFGKSNSRANSDTKD--HTIVNGT--SK 78

Query: 61  VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSS-------AKFRRSLDSSLR 120
           VH   Q  G ++ RP+RIATFNAA FSMAPA+P AE S+S           RRS+D SLR
Sbjct: 79  VHQDGQLGGLDSVRPIRIATFNAALFSMAPAIPKAENSSSFDFENEGFKDARRSMDLSLR 138

Query: 121 TKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQ 180
            KS NDRPKSILKQSP+HPNS+N      NL  Q KFVK K RVSINLPDNEISLLRNRQ
Sbjct: 139 AKSTNDRPKSILKQSPMHPNSIND---KENLSNQQKFVKSKLRVSINLPDNEISLLRNRQ 198

Query: 181 ASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKG----ESYRCSRTVVE 240
            SF+E    KE  SS G    G RI + + PLRS VS       G    E YR  +TV+E
Sbjct: 199 LSFAE--RGKEGSSSGG----GSRILRGKAPLRSTVSFSTNMGNGVDSFERYRSRKTVLE 258

Query: 241 VLRELDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRW 300
           VLRELDADILALQDVKA EEK M+PLSDLA ALGM YVFAESWAPEYGNAVLS+WPIKRW
Sbjct: 259 VLRELDADILALQDVKAEEEKAMKPLSDLAAALGMNYVFAESWAPEYGNAVLSKWPIKRW 318

Query: 301 KVEKIFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHIL 360
           KV+KIFD TDFRNVLKATIDV + GEV+  CTHLDHLDENWRMKQI +II+S ND PHIL
Sbjct: 319 KVQKIFDDTDFRNVLKATIDVPQAGEVDFHCTHLDHLDENWRMKQINAIIQS-NDGPHIL 378

Query: 361 LGGLNSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVV 420
            GGLNSL+ TDYS +RWTDIVKYYEE+GKP P+ +V+KFLK+   Y DAK+F GECE VV
Sbjct: 379 AGGLNSLEETDYSTERWTDIVKYYEEMGKPIPKVEVMKFLKN-KQYTDAKDFAGECEPVV 438

Query: 421 MIAKGQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466
           +IAKGQSVQGTCKYGTRVDYILASP++ Y+FV GSYSVLSSKGTSDHH+VKVD +K
Sbjct: 439 VIAKGQSVQGTCKYGTRVDYILASPNSPYKFVPGSYSVLSSKGTSDHHMVKVDIIK 478

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name         E-value     Identity (%)   Description
A0A0A0KDU0_CUCSA   3.1e-210    83.16          Uncharacterized protein OS=Cucumis sativus GN=Csa_6G409370 PE=4 SV=1
F6HQ43_VITVI       3.1e-165    67.79          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g00790 PE=4 SV=...
A0A067JCJ1_JATCU   3.2e-162    66.53          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21993 PE=4 SV=1
A0A061DQ80_THECC   7.1e-162    65.97          DNAse I-like superfamily protein OS=Theobroma cacao GN=TCM_004482 PE=4 SV=1
W9RXD9_9ROSA       3.0e-160    64.99          Uncharacterized protein OS=Morus notabilis GN=L484_008386 PE=4 SV=1

vs. TAIR10
Match Name         E-value     Identity (%)   Description
AT3G21530.1        5.4e-121    54.09          DNAse I-like superfamily protein
AT2G48030.1        1.3e-119    56.32          DNAse I-like superfamily protein

vs. NCBI nr
Match Name                         E-value     Identity (%)   Description
gi|659097139|ref|XP_008449463.1|   1.8e-211    83.80          PREDICTED: uncharacterized protein LOC103491341 [Cucumis melo]
gi|778716168|ref|XP_004140218.2|   4.5e-210    83.16          PREDICTED: uncharacterized protein LOC101212223 [Cucumis sativus]
gi|359479852|ref|XP_002271044.2|   4.4e-165    67.79          PREDICTED: uncharacterized protein LOC100247717 [Vitis vinifera]
gi|802794201|ref|XP_012092317.1|   4.6e-162    66.53          PREDICTED: uncharacterized protein LOC105650052 [Jatropha curcas]
gi|590717956|ref|XP_007050713.1|   1.0e-161    65.97          DNAse I-like superfamily protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005135   Endo/exonuclease/phosphatase
IPR027317   PGAP2-interacting protein
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh19G000050.1   CmoCh19G000050.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                       Source    Source Term        Source Description                                  Alignment
IPR005135  Endonuclease/exonuclease/phosphatase  GENE3D    G3DSA:3.60.10.10   -                                                   coord: 224..463; score: 2.4
IPR005135  Endonuclease/exonuclease/phosphatase  PFAM      PF03372            Exo_endo_phos                                       coord: 227..456; score: 5.0
IPR005135  Endonuclease/exonuclease/phosphatase  unknown   SSF56219           DNase I-like                                        coord: 225..464; score: 2.62
IPR027317  PGAP2-interacting protein             PANTHER   PTHR14859          CALCOFLUOR WHITE HYPERSENSITIVE PROTEIN PRECURSOR   coord: 1..465; score: 4.1E
None       No IPR available                      PANTHER   PTHR14859:SF2      DNASE I-LIKE SUPERFAMILY PROTEIN                    coord: 1..465; score: 4.1E
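
To locate an annotated domain on the coding sequence, residue coordinates can be converted to nucleotide positions. This is a sketch under two assumptions that the record does not state explicitly: the `coord` values are 1-based inclusive residue positions on the translated protein, and the CDS starts at the first base of the start codon. `protein_to_cds` is a hypothetical helper for illustration.

```python
def protein_to_cds(aa_start, aa_end):
    """Map a 1-based inclusive residue range to 1-based inclusive CDS positions."""
    if aa_start < 1 or aa_end < aa_start:
        raise ValueError("expected 1 <= aa_start <= aa_end")
    # Residue k is encoded by CDS bases 3k-2 .. 3k.
    return (aa_start - 1) * 3 + 1, aa_end * 3

# PFAM PF03372 (Exo_endo_phos) is reported at residues 227..456.
nt_start, nt_end = protein_to_cds(227, 456)
print(nt_start, nt_end)        # 679 1368
print(456 - 227 + 1)           # 230 residues in the domain match
```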

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                     Block
CmoCh19G000050   Cucurbita moschata (Rifu)    cmocmoB387
CmoCh19G000050   Cucurbita maxima (Rimu)      cmacmoB598
CmoCh19G000050   Cucumber (Chinese Long) v2   cmocuB511
CmoCh19G000050   Cucurbita pepo (Zucchini)    cmocpeB498