Sequences
The following sequences are available for this feature:
Gene sequence (with intron) Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR Hold the cursor over a type above to highlight its positions in the sequence below. AGATGAAAACGAGTGGCCCCAAAAGTGGAAGTCAACCATTTGTAGGAGTCACCGAGTTGGACTAAAATTGGGGTTTTGTTATTAATTTCTGCACTCACACCAATGCCATACAGGTAAGCCCAGTTGAATGTCACTCTGCAACTCTCACACTTTCCTACTCCTTTTCTAAAATGCTCAACTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTGCCCGGCGGGTCAGACCTAGGGTAGTCGTCATCAAGAAATTTGGAAAAACCACCTCCAAACCTCACGCCGATCCCCACAATACCCTCGACTCCTTCGTTAATGCCTCCTTGGCGTCGCCTGTGCATCCCAAACCTCAATTTCACGGTCGTAATACTCAGAGACCTGTGCGGATTGCGACATTTAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCGTACGCTGAAAAATCTAATTCCTCTGCCAAATTCCGACGGAGTTTGGATTCCAGTTTACGGACAAAATCCGTAAATGATCGCCCCAAAAGCATTTTGAAACAGTCTCCACTGCATCCGAATTCCGTGAATGGAGTTGTTGCTAATCATAACCTCCACACCCAACCGAAGTTCGTGAAACCCAAGCCGCGGGTTTCGATCAACCTGCCTGATAACGAGATATCCTTACTCAGAAATCGACAGGCGAGCTTTTCTGAGTACGAAATGGAGAAGGAGGATCCGTCCTCTTCAGGTAACGATGGGAATGGGATGCGGATCGCTAAGAGTCGGCCTCCACTGAGATCGATTGTAAGCATGCCTTTGGAGCGCGAAAAAGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGGGAGTTGGATGCTGACATATTGGCATTGCAAGATGTGAAGGCGGTGGAAGAGAAAGGCATGAGACCGCTCTCGGATTTGGCAGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAGTATGGAAATGCGGTCCTGTCGAGATGGCCCATCAAACGCTGGAAAGTCGAGAAGATTTTCGACCACACCGATTTCAGGTTAGCAGCTGTACACCACACTCAATAGTATTACACTATTATTAAACTGAAGCAAACCTTTTAGATTAATTTGTGTGTTCTTGCTCTGCTCCTCATTTCATAAACATGCTTTTGATTGATGACACTAGTAGAGTAGATGGAGTCAAGGCGATTGACTGCTTTTGTTTTCCTTTTAAACAATGAATTCAGCAAAAACTAAGCTAAAAACTTAACTTATGGTCAACGTTAGATCTGTGATAGAAGGCAGAGGTCGAAGATTATTGAGAAGGAGTCCCACGTTGTCTAATTTAGAGAATGATTATGGGTTTATAGGTAAGGAATACATCTCCATTGGTATGAGGCGGCCTTTTGGGGAATCCCAAAGCAAAGCCATGAGAACTTATGCTCAAAGTAGACAATATCATACGATTGTGGAGAGTCGTTATTACCAACTAGAAACTTATGAAAAGTAGTAAATAGTTGATAAACTCTGTGAGAATGATAGAATTGGTTATTGGGTTGGGCGGTGGCAGGAATGTGTTAAAAGCGACCATTGATGTGGGAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAATCCATAATCCGATCGACCAACGACGAACCCCATATCTTATTAGGAGGCCTTAATTCTCTGGATCCCACGGATTACTCGCAGCAACGGTGGACGGACATCGTGAAGGTATCAATGTTTATTTTGAACTTTGTTTAGCTTAGTT
GAGTTTTGTTTATGGCATGCATCCCATGTTGGTTGGTTGGTTGGTTGATTGATTGATTCTTCAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTGATTAAGTTCTTAAAGAGCGGTATGCACTATAGGGATGCAAAGGAGTTTGGAGGAGAATGCGAATCAGTGGTGATGATCGCCAAAGGCCAAAGTATGAAACTGAGTGATGAATATGAAAAAAAAAGATAGGACAATGAGAATGAAAGGTTTTTACTAATGGAATGTATGGCATTGGTTGGCAATGGCAGGTGTTCAAGGGACGTGCAAGTACGGGACACGAGTCGACTACATATTGGCCTCTCCCGATGCAGATTACGAGTTTGTTAAAGGATCCTACTCTGTCCTTTCCTCAAAAGGAACCTCCGATCATCACATTGTCAAGGTTGATTTCCTCAAACCTCCTCATTCTCGAGGTTGATTTCTTCAAACCTCCTCCTCGGCCTCGGCCTCGGCCCCAAACCCGTTCGCATTCCCTTTCCCCTTGGAAGAGATGGACACGAAACAACGATCAACACGCCCACACCCTTTGTCTCCTTTAATTTCTCAAACCTTTTCACTTATATTTACACACTCATTTGTATTACATTTAGGAATGGAACTCATTCCTTTTGTTCTATGCCGTATGTAACCAAAGAACCCATGTACACCTTAATTTCAAACAAAAGTATTAGACACAAAAACTTCAAAACTAAAATGAGATTTGAAGGAAGCTGATGTATTTACCAACACACAAGCTACAGTTACTACTGGAACTGCCTGGATTTCTAGTTAAAGTTCTAATGAAAGCTCTAAGTATATCGTTGTCATCAACAATCTACAGAATTACAGAAGAATAGTGTTG mRNA sequence AGATGAAAACGAGTGGCCCCAAAAGTGGAAGTCAACCATTTGTAGGAGTCACCGAGTTGGACTAAAATTGGGGTTTTGTTATTAATTTCTGCACTCACACCAATGCCATACAGGTAAGCCCAGTTGAATGTCACTCTGCAACTCTCACACTTTCCTACTCCTTTTCTAAAATGCTCAACTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTGCCCGGCGGGTCAGACCTAGGGTAGTCGTCATCAAGAAATTTGGAAAAACCACCTCCAAACCTCACGCCGATCCCCACAATACCCTCGACTCCTTCGTTAATGCCTCCTTGGCGTCGCCTGTGCATCCCAAACCTCAATTTCACGGTCGTAATACTCAGAGACCTGTGCGGATTGCGACATTTAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCGTACGCTGAAAAATCTAATTCCTCTGCCAAATTCCGACGGAGTTTGGATTCCAGTTTACGGACAAAATCCGTAAATGATCGCCCCAAAAGCATTTTGAAACAGTCTCCACTGCATCCGAATTCCGTGAATGGAGTTGTTGCTAATCATAACCTCCACACCCAACCGAAGTTCGTGAAACCCAAGCCGCGGGTTTCGATCAACCTGCCTGATAACGAGATATCCTTACTCAGAAATCGACAGGCGAGCTTTTCTGAGTACGAAATGGAGAAGGAGGATCCGTCCTCTTCAGGTAACGATGGGAATGGGATGCGGATCGCTAAGAGTCGGCCTCCACTGAGATCGATTGTAAGCATGCCTTTGGAGCGCGAAAAAGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGGGAGTTGGATGCTGACATATTGGCATTGCAAGATGTGAAGGCGGTGGAAGAGAAAGGCATGAGACCGCTCTCGGATTTGGCAGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAGTATGGAAATGCGGTCCTGTCGAGATGGCCCATCAAACGCTGGAAAGTCGAGAAGATTTTCGACCACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATG
TGGGAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAATCCATAATCCGATCGACCAACGACGAACCCCATATCTTATTAGGAGGCCTTAATTCTCTGGATCCCACGGATTACTCGCAGCAACGGTGGACGGACATCGTGAAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTGATTAAGTTCTTAAAGAGCGGTATGCACTATAGGGATGCAAAGGAGTTTGGAGGAGAATGCGAATCAGTGGTGATGATCGCCAAAGGCCAAAGTGTTCAAGGGACGTGCAAGTACGGGACACGAGTCGACTACATATTGGCCTCTCCCGATGCAGATTACGAGTTTGTTAAAGGATCCTACTCTGTCCTTTCCTCAAAAGGAACCTCCGATCATCACATTGTCAAGGTTGATTTCCTCAAACCTCCTCATTCTCGAGGTTGATTTCTTCAAACCTCCTCCTCGGCCTCGGCCTCGGCCCCAAACCCGTTCGCATTCCCTTTCCCCTTGGAAGAGATGGACACGAAACAACGATCAACACGCCCACACCCTTTGTCTCCTTTAATTTCTCAAACCTTTTCACTTATATTTACACACTCATTTGTATTACATTTAGGAATGGAACTCATTCCTTTTGTTCTATGCCGTATGTAACCAAAGAACCCATGTACACCTTAATTTCAAACAAAAGTATTAGACACAAAAACTTCAAAACTAAAATGAGATTTGAAGGAAGCTGATGTATTTACCAACACACAAGCTACAGTTACTACTGGAACTGCCTGGATTTCTAGTTAAAGTTCTAATGAAAGCTCTAAGTATATCGTTGTCATCAACAATCTACAGAATTACAGAAGAATAGTGTTG Coding sequence (CDS) ATGCTCAACTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTGCCCGGCGGGTCAGACCTAGGGTAGTCGTCATCAAGAAATTTGGAAAAACCACCTCCAAACCTCACGCCGATCCCCACAATACCCTCGACTCCTTCGTTAATGCCTCCTTGGCGTCGCCTGTGCATCCCAAACCTCAATTTCACGGTCGTAATACTCAGAGACCTGTGCGGATTGCGACATTTAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCGTACGCTGAAAAATCTAATTCCTCTGCCAAATTCCGACGGAGTTTGGATTCCAGTTTACGGACAAAATCCGTAAATGATCGCCCCAAAAGCATTTTGAAACAGTCTCCACTGCATCCGAATTCCGTGAATGGAGTTGTTGCTAATCATAACCTCCACACCCAACCGAAGTTCGTGAAACCCAAGCCGCGGGTTTCGATCAACCTGCCTGATAACGAGATATCCTTACTCAGAAATCGACAGGCGAGCTTTTCTGAGTACGAAATGGAGAAGGAGGATCCGTCCTCTTCAGGTAACGATGGGAATGGGATGCGGATCGCTAAGAGTCGGCCTCCACTGAGATCGATTGTAAGCATGCCTTTGGAGCGCGAAAAAGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGGGAGTTGGATGCTGACATATTGGCATTGCAAGATGTGAAGGCGGTGGAAGAGAAAGGCATGAGACCGCTCTCGGATTTGGCAGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAGTATGGAAATGCGGTCCTGTCGAGATGGCCCATCAAACGCTGGAAAGTCGAGAAGATTTTCGACCACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGGAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAATCCATAATCCGATCGACCAACGACGAACCCCATATCTTATTAGGAGGCCTTAATTCTC
TGGATCCCACGGATTACTCGCAGCAACGGTGGACGGACATCGTGAAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTGATTAAGTTCTTAAAGAGCGGTATGCACTATAGGGATGCAAAGGAGTTTGGAGGAGAATGCGAATCAGTGGTGATGATCGCCAAAGGCCAAAGTGTTCAAGGGACGTGCAAGTACGGGACACGAGTCGACTACATATTGGCCTCTCCCGATGCAGATTACGAGTTTGTTAAAGGATCCTACTCTGTCCTTTCCTCAAAAGGAACCTCCGATCATCACATTGTCAAGGTTGATTTCCTCAAACCTCCTCATTCTCGAGGTTGA Protein sequence MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASPVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPHSRG
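As a quick consistency check between the coding sequence and the protein sequence listed above, the first few codons of the CDS can be translated by hand. The following is a minimal Python sketch. The codon table is deliberately partial (it holds only the standard-code codons needed for this prefix), and the function name `translate_prefix` is hypothetical, not part of any tool used to build this page.

```python
# Partial codon table (standard genetic code); covers only the codons
# that occur in the example prefix. Illustrative, not a full table.
CODON_TABLE = {
    "ATG": "M", "CTC": "L", "AAC": "N",
    "TTC": "F", "CGG": "R", "AGC": "S",
}


def translate_prefix(cds: str) -> str:
    """Translate a DNA string codon by codon (KeyError on codons not in the table)."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)


# First 24 nucleotides of the CDS listed above.
cds_prefix = "ATGCTCAACTTCCTCAACCGGAGC"
print(translate_prefix(cds_prefix))  # MLNFLNRS
```

The result agrees with the first eight residues of the protein sequence (MLNFLNRS). A full check would need the complete 64-codon table, for example from an established sequence-analysis library.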
Homology
BLAST of CmoCh19G000050 vs. ExPASy TrEMBL
Match: A0A6J1HGD4 (uncharacterized protein LOC111464053 OS=Cucurbita moschata OX=3662 GN=LOC111464053 PE=4 SV=1)
HSP 1 Score: 948.7 bits (2451), Expect = 9.0e-273 Identity = 471/471 (100.00%), Positives = 471/471 (100.00%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP Sbjct: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60
Query: 61 VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR 120 VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR Sbjct: 61 VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR 120
Query: 121 PKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYE 180 PKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYE Sbjct: 121 PKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYE 180
Query: 181 MEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADILA 240 MEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADILA Sbjct: 181 MEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADILA 240
Query: 241 LQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDF 300 LQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDF Sbjct: 241 LQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDF 300
Query: 301 RNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTD 360 RNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTD Sbjct: 301 RNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTD 360
Query: 361 YSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQGT 420 YSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQGT Sbjct: 361 YSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQGT 420
Query: 421 CKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPHSRG 472 CKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPHSRG Sbjct: 421 CKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPHSRG 471
BLAST of CmoCh19G000050 vs. ExPASy TrEMBL
Match: A0A6J1HR38 (uncharacterized protein LOC111467014 OS=Cucurbita maxima OX=3661 GN=LOC111467014 PE=4 SV=1)
HSP 1 Score: 921.4 bits (2380), Expect = 1.5e-264 Identity = 457/471 (97.03%), Positives = 463/471 (98.30%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60 ML FLNRSLRRLC+RLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVN SLASP Sbjct: 23 MLKFLNRSLRRLCTRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNGSLASP 82
Query: 61 VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR 120 VHPKPQF G N RPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR Sbjct: 83 VHPKPQFCGLNAHRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR 142
Query: 121 PKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYE 180 PKSILKQSPLHPN+VNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYE Sbjct: 143 PKSILKQSPLHPNTVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEYE 202
Query: 181 MEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADILA 240 MEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMP EREKGESYRC+RTVVEVLRELDADILA Sbjct: 203 MEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPSEREKGESYRCNRTVVEVLRELDADILA 262
Query: 241 LQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDF 300 LQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDF Sbjct: 263 LQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTDF 322
Query: 301 RNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTD 360 RNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTD Sbjct: 323 RNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPTD 382
Query: 361 YSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQGT 420 YSQQRWTDIVKYYEEIGKPTPEAKVIKFLKS MHYRDAKEFGGECESVVMIAKGQSVQGT Sbjct: 383 YSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSSMHYRDAKEFGGECESVVMIAKGQSVQGT 442
Query: 421 CKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPHSRG 472 CKYGTRVDYILASPDADY+FV+GSYSVLSSKGTSDHHIVKVDFLKPPHS+G Sbjct: 443 CKYGTRVDYILASPDADYKFVEGSYSVLSSKGTSDHHIVKVDFLKPPHSQG 493
BLAST of CmoCh19G000050 vs. ExPASy TrEMBL
Match: A0A5A7UUR9 (DNAse I-like superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold496G00650 PE=4 SV=1)
HSP 1 Score: 743.8 bits (1919), Expect = 4.4e-211 Identity = 393/469 (83.80%), Positives = 411/469 (87.63%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTS-KPHADPHNTLDSFVNASLAS 60 ML FLNR LRRLCSRLRWPR RR+RPRV+VIKKFGKTTS + ++ P T+DSFVNAS S Sbjct: 1 MLKFLNRKLRRLCSRLRWPRRRRIRPRVLVIKKFGKTTSYETNSHPEKTIDSFVNASSPS 60
Query: 61 PVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVND 120 VHP QF+ NTQRP+RIATFNAASFSMAPAVP EKSNSSAKFRRSLDS+ RTKSVND Sbjct: 61 AVHPNSQFYLLNTQRPIRIATFNAASFSMAPAVP--EKSNSSAKFRRSLDSNSRTKSVND 120
Query: 121 RPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEY 180 RPKSILKQSPLH NS+N VA K KPRVSINLPDNEISLLRNRQA SEY Sbjct: 121 RPKSILKQSPLHTNSINSGVA-----------KTKPRVSINLPDNEISLLRNRQA--SEY 180
Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADIL 240 EME E+ SSSGND GM IAKS PLR VSMP ER SYRCSRTVVEVLR+LDADIL Sbjct: 181 EME-ENLSSSGNDRRGMGIAKSGTPLRWTVSMPSER---GSYRCSRTVVEVLRDLDADIL 240
Query: 241 ALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTD 300 ALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD Sbjct: 241 ALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD 300
Query: 301 FRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPT 360 FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPT Sbjct: 301 FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPT 360
Query: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQG 420 DYSQQRWTDIVKYYEEIGKPTPEAKV KFLKS M YRDAKE+GGECESVVMIAKGQSVQG Sbjct: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEYGGECESVVMIAKGQSVQG 420
Query: 421 TCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPH 469 TCKYGTRVDYILASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH Sbjct: 421 TCKYGTRVDYILASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH 450
BLAST of CmoCh19G000050 vs. ExPASy TrEMBL
Match: A0A1S3BLG5 (uncharacterized protein LOC103491341 OS=Cucumis melo OX=3656 GN=LOC103491341 PE=4 SV=1)
HSP 1 Score: 743.8 bits (1919), Expect = 4.4e-211 Identity = 393/469 (83.80%), Positives = 411/469 (87.63%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTS-KPHADPHNTLDSFVNASLAS 60 ML FLNR LRRLCSRLRWPR RR+RPRV+VIKKFGKTTS + ++ P T+DSFVNAS S Sbjct: 1 MLKFLNRKLRRLCSRLRWPRRRRIRPRVLVIKKFGKTTSYETNSHPEKTIDSFVNASSPS 60
Query: 61 PVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVND 120 VHP QF+ NTQRP+RIATFNAASFSMAPAVP EKSNSSAKFRRSLDS+ RTKSVND Sbjct: 61 AVHPNSQFYLLNTQRPIRIATFNAASFSMAPAVP--EKSNSSAKFRRSLDSNSRTKSVND 120
Query: 121 RPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEY 180 RPKSILKQSPLH NS+N VA K KPRVSINLPDNEISLLRNRQA SEY Sbjct: 121 RPKSILKQSPLHTNSINSGVA-----------KTKPRVSINLPDNEISLLRNRQA--SEY 180
Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADIL 240 EME E+ SSSGND GM IAKS PLR VSMP ER SYRCSRTVVEVLR+LDADIL Sbjct: 181 EME-ENLSSSGNDRRGMGIAKSGTPLRWTVSMPSER---GSYRCSRTVVEVLRDLDADIL 240
Query: 241 ALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTD 300 ALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD Sbjct: 241 ALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD 300
Query: 301 FRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPT 360 FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPT Sbjct: 301 FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPT 360
Query: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQG 420 DYSQQRWTDIVKYYEEIGKPTPEAKV KFLKS M YRDAKE+GGECESVVMIAKGQSVQG Sbjct: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEYGGECESVVMIAKGQSVQG 420
Query: 421 TCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPH 469 TCKYGTRVDYILASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH Sbjct: 421 TCKYGTRVDYILASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH 450
BLAST of CmoCh19G000050 vs. ExPASy TrEMBL
Match: A0A0A0KDU0 (Endo/exonuclease/phosphatase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G409370 PE=4 SV=1)
HSP 1 Score: 739.2 bits (1907), Expect = 1.1e-209 Identity = 390/469 (83.16%), Positives = 408/469 (86.99%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVN-ASLAS 60 ML FLNR LRRLCSRLRWPR R +RPRV++IKKFGKTTS+ + P T+DSFVN AS S Sbjct: 1 MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPS 60
Query: 61 PVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVND 120 VHP QFH TQRP+RIATFNAASFSMAPAVP EKSNSSAKFRRSLDS+ RTKSVND Sbjct: 61 AVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSNSSAKFRRSLDSNSRTKSVND 120
Query: 121 RPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSEY 180 RPKSILKQSPLH NS+N VA + KPRVSINLPDNEISLLRNRQA SEY Sbjct: 121 RPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEY 180
Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGESYRCSRTVVEVLRELDADIL 240 EME E+ SSSGND GMRIAKS PLR VSMP ER +YRCSRTVVEVLRELDADIL Sbjct: 181 EME-ENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADIL 240
Query: 241 ALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHTD 300 ALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD Sbjct: 241 ALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD 300
Query: 301 FRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDPT 360 FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPT Sbjct: 301 FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPT 360
Query: 361 DYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQG 420 DYSQQRW DIVKYYEEIGKPTPEAKV KFLKS M YRDAKEFGGECESVVMIAKGQSVQG Sbjct: 361 DYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQG 420
Query: 421 TCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLKPPH 469 TCKYGTRVDYI+ASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH Sbjct: 421 TCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH 450
BLAST of CmoCh19G000050 vs. TAIR 10
Match: AT3G21530.1 (DNAse I-like superfamily protein)
HSP 1 Score: 432.2 bits (1110), Expect = 5.4e-121 Identity = 258/477 (54.09%), Positives = 313/477 (65.62%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKT--TSKPHADPHNTLDSFVNASLA 60 ML R L L SRLRW +RVR RV+V ++F K ++ P + + S +S Sbjct: 1 MLCVFRRKLGCLFSRLRWVIKKRVRARVIV-RRFRKARWRARRKESPESEVSSIHLSS-- 60
Query: 61 SPVHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVN 120 N+ R +R+ATFN A FS+AP V E++ F LDSS T Sbjct: 61 ------------NSGRHIRVATFNVAMFSLAPVVQTMEET----AFLGHLDSSNIT---C 120
Query: 121 DRPKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKPRVSINLPDNEISLLRNRQASFSE 180 PK ILKQSPLH ++V KP+V INLPDNEISL + S+S Sbjct: 121 PSPKGILKQSPLHSSAVR-----------------KPKVCINLPDNEISLAQ----SYSF 180
Query: 181 YEMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMP---LEREKGESYRCSRTVVEVLRELD 240 M + D NDG R + S +RS V +P ++E Y R++ E+LRELD Sbjct: 181 LSMVEND-----NDGKENRGSLS---MRSPVCLPSCWWDQESFNGYSSRRSIAELLRELD 240
Query: 241 ADILALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIF 300 ADILALQDVKA EE M+PLSDLA ALGMKYVFAESWAPEYGNA+LS+WPIK+W+V++I Sbjct: 241 ADILALQDVKAEEETLMKPLSDLASALGMKYVFAESWAPEYGNAILSKWPIKKWRVQRIA 300
Query: 301 DHTDFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNS 360 D DFRNVLK T+++ G+VNV CT LDHLDENWRMKQI +I R ++ PHILLGGLNS Sbjct: 301 DVDDFRNVLKVTVEIPWAGDVNVYCTQLDHLDENWRMKQIDAITRG-DESPHILLGGLNS 360
Query: 361 LDPTDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQ 420 LD +DYS RW IVKYYE+ GKPTP +V++FLK G Y D+KEF GECE VV+IAKGQ Sbjct: 361 LDGSDYSIARWNHIVKYYEDSGKPTPRVEVMRFLK-GKGYLDSKEFAGECEPVVIIAKGQ 420
Query: 421 SVQGTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDF-LKPPHSRG 472 +VQGTCKYGTRVDYILASP++ YEFV GSYSV+SSKGTSDHHIVKVD + SRG Sbjct: 421 NVQGTCKYGTRVDYILASPESPYEFVPGSYSVVSSKGTSDHHIVKVDLVITKERSRG 424
BLAST of CmoCh19G000050 vs. TAIR 10
Match: AT2G48030.1 (DNAse I-like superfamily protein)
HSP 1 Score: 427.2 bits (1097), Expect = 1.7e-119 Identity = 263/467 (56.32%), Positives = 301/467 (64.45%), Query Frame = 0
Query: 1 MLNFLNRSLRRLCSRLRWPRARRVRPRVVVIKKFGKTTSKPHADPHNTLDSFVNASLASP 60 MLN + LRR RLR PR + R+ V S P H S A+ Sbjct: 1 MLNLI-AFLRR---RLRRPR----KARISVNHHHLSVDSSPETHHHQN-----GFSSAAA 60
Query: 61 VHPKPQFHGRNTQRPVRIATFNAASFSMAPAVPYAEKSNSSAKFRRSLDSSLRTKSVNDR 120 +HP P + + +ATFNAA FSMAPAVP SN F R+KS DR Sbjct: 61 IHPNP-------DKTITVATFNAAMFSMAPAVP----SNKGLPF--------RSKSTVDR 120
Query: 121 PKSILKQSPLHPNSVNGVVANHNLHTQPKFVKPKP-RVSINLPDNEISLLRNRQASFSEY 180 PKSILK P++ H+ Q +F K +P RVSINLPDNEIS RQ SF Sbjct: 121 PKSILK--PMNA----AASPTHDSRKQQRFAKSRPRRVSINLPDNEIS----RQLSF--- 180
Query: 181 EMEKEDPSSSGNDGNGMRIAKSRPPLRSIVSMPLEREKGE-SYRCSRTVVEVLRELDADI 240 +EDP S PLR GE R +RT +EVL ELDAD+ Sbjct: 181 ---REDPQHS--------------PLR----------PGEIGLRSTRTALEVLSELDADV 240
Query: 241 LALQDVKAVEEKGMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDHT 300 LALQDVKA E MRPLSDLA ALGM YVFAESWAPEYGNA+LS+WPIK V +IFDHT Sbjct: 241 LALQDVKADEADQMRPLSDLAAALGMNYVFAESWAPEYGNAILSKWPIKSSNVLRIFDHT 300
Query: 301 DFRNVLKATIDVGEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNDEPHILLGGLNSLDP 360 DFRNVLKA+I+V GEV CTHLDHLDE WRMKQ+ +II+STN PHIL G LNSLD Sbjct: 301 DFRNVLKASIEVPGSGEVEFHCTHLDHLDEKWRMKQVDAIIQSTN-VPHILAGALNSLDE 360
Query: 361 TDYSQQRWTDIVKYYEEIGKPTPEAKVIKFLKSGMHYRDAKEFGGECESVVMIAKGQSVQ 420 +DYS +RWTDIVKYYEE+GKP P+A+V++FLKS Y DAK+F GECESVV++AKGQSVQ Sbjct: 361 SDYSPERWTDIVKYYEEMGKPIPKAQVMRFLKS-KEYTDAKDFAGECESVVVVAKGQSVQ 393
Query: 421 GTCKYGTRVDYILASPDADYEFVKGSYSVLSSKGTSDHHIVKVDFLK 466 GTCKYGTRVDYILAS D+ Y FV GSYSVLSSKGTSDHHIVKVD +K Sbjct: 421 GTCKYGTRVDYILASSDSPYRFVPGSYSVLSSKGTSDHHIVKVDVVK 393
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description |
A0A6J1HGD4 | 9.0e-273 | 100.00 | uncharacterized protein LOC111464053 OS=Cucurbita moschata OX=3662 GN=LOC1114640... |
A0A6J1HR38 | 1.5e-264 | 97.03 | uncharacterized protein LOC111467014 OS=Cucurbita maxima OX=3661 GN=LOC111467014... |
A0A5A7UUR9 | 4.4e-211 | 83.80 | DNAse I-like superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676... |
A0A1S3BLG5 | 4.4e-211 | 83.80 | uncharacterized protein LOC103491341 OS=Cucumis melo OX=3656 GN=LOC103491341 PE=... |
A0A0A0KDU0 | 1.1e-209 | 83.16 | Endo/exonuclease/phosphatase domain-containing protein OS=Cucumis sativus OX=365... |
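The identity values in the summary table follow from the HSP headers earlier on this page: each percentage is the number of identical (or positive) positions divided by the alignment length. A small Python sketch of that arithmetic is shown below. The helper name `percent` and the dictionary layout are illustrative assumptions; the counts are copied from the HSP headers for three of the TrEMBL hits.

```python
def percent(matches: int, aligned: int) -> str:
    """Return matches/aligned as a percentage with two decimals."""
    return f"{100 * matches / aligned:.2f}"


# (identical positions, positive positions, alignment length) per hit,
# taken from the "Identity = a/n" and "Positives = p/n" fields.
hsps = {
    "A0A6J1HGD4": (471, 471, 471),
    "A0A6J1HR38": (457, 463, 471),
    "A0A5A7UUR9": (393, 411, 469),
}

for name, (identical, positive, length) in hsps.items():
    print(name, percent(identical, length), percent(positive, length))
```

For A0A6J1HR38 this gives 97.03 and 98.30, matching the values reported in its HSP header.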
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment |
IPR005135 | Endonuclease/exonuclease/phosphatase | PFAM | PF03372 | Exo_endo_phos | coord: 217..456 e-value: 4.4E-7 score: 29.8 |
IPR036691 | Endonuclease/exonuclease/phosphatase superfamily | GENE3D | 3.60.10.10 | Endonuclease/exonuclease/phosphatase | coord: 211..465 e-value: 7.7E-31 score: 109.5 |
IPR036691 | Endonuclease/exonuclease/phosphatase superfamily | SUPERFAMILY | 56219 | DNase I-like | coord: 225..464 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 178..201 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 97..134 |
None | No IPR available | PANTHER | PTHR14859:SF10 | ENDONUCLEASE/EXONUCLEASE/PHOSPHATASE FAMILY PROTEIN | coord: 1..465 |
None | No IPR available | PANTHER | PTHR14859 | CALCOFLUOR WHITE HYPERSENSITIVE PROTEIN PRECURSOR | coord: 1..465 |
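The Alignment column gives each match as a `coord: start..end` range on the protein. Assuming these are 1-based, inclusive residue positions (an assumption; the page does not state the convention), the length of each matched region can be derived as sketched below in Python. The function name `span_length` is hypothetical.

```python
import re

# Matches the "coord: 217..456" part of an Alignment cell.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment: str) -> int:
    """Length of a coord range, assuming 1-based inclusive positions."""
    match = COORD_RE.search(alignment)
    if match is None:
        raise ValueError(f"no coord range found in {alignment!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# Alignment cells copied from the table above.
print(span_length("coord: 217..456 e-value: 4.4E-7 score: 29.8"))    # 240
print(span_length("coord: 211..465 e-value: 7.7E-31 score: 109.5"))  # 255
print(span_length("coord: 178..201"))                                # 24
```

Under that assumption, the Pfam Exo_endo_phos match covers 240 of the 471 residues, and the two predicted disordered regions cover 24 and 38 residues.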
Relationships
The following mRNA feature(s) are a part of this gene:
GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name |
biological_process | GO:0006506 | GPI anchor biosynthetic process |
cellular_component | GO:0005783 | endoplasmic reticulum |
molecular_function | GO:0003824 | catalytic activity |