Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGATAGAATTGTTGTGTTTATGGGAAGCAATAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCTTCATCAACATTTTCTCTCAAACCTCAAAATCCCAATTCCACAGCTCTCCAGCAATCCTCATCAATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCGATCAGACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGATCTGCGAGTGATTTGAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTATTCAATGCCGAAAAAAAGGTACGAAAATTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCATGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAATCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAAAAGCTCTGAGAATAATTGTGAAGGAAATAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGATAAGAGAGAGAGGATTCGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCCCTGAAAAAATCAGGATCTCTTCATTCTTATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTCCGTTGTCATCGACGACGAGAAAGCCCCGTAAAATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTACGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCAAAAGCATTATCCCAAGGAAAATCGGTTCGAAGAATTCGAGCAAATGCAGCTGCCATAGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACATGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACAGGGATGGCCTGATGTTGTTAACCCAAATGCTGGTAATATGAATCGTTTTCCAAAGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTCTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGATGAAGAAGCTGGTTCAAGTATGGCTGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGAGGATGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
mRNA sequence
ATGATAGAATTGTTGTGTTTATGGGAAGCAATAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCTTCATCAACATTTTCTCTCAAACCTCAAAATCCCAATTCCACAGCTCTCCAGCAATCCTCATCAATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCGATCAGACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGATCTGCGAGTGATTTGAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTATTCAATGCCGAAAAAAAGGTACGAAAATTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCATGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAATCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAAAAGCTCTGAGAATAATTGTGAAGGAAATAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGATAAGAGAGAGAGGATTCGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCCCTGAAAAAATCAGGATCTCTTCATTCTTATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTCCGTTGTCATCGACGACGAGAAAGCCCCGTAAAATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTACGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCAAAAGCATTATCCCAAGGAAAATCGGTTCGAAGAATTCGAGCAAATGCAGCTGCCATAGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACATGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACAGGGATGGCCTGATGTTGTTAACCCAAATGCTGGTAATATGAATCGTTTTCCAAAGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTCTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGATGAAGAAGCTGGTTCAAGTATGGCTGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGAGGATGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
Coding sequence (CDS)
ATGATAGAATTGTTGTGTTTATGGGAAGCAATAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCTTCATCAACATTTTCTCTCAAACCTCAAAATCCCAATTCCACAGCTCTCCAGCAATCCTCATCAATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCGATCAGACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGATCTGCGAGTGATTTGAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTATTCAATGCCGAAAAAAAGGTACGAAAATTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCATGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAATCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAAAAGCTCTGAGAATAATTGTGAAGGAAATAGTGAATTTGGTGATAATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGATAAGAGAGAGAGGATTCGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCCCTGAAAAAATCAGGATCTCTTCATTCTTATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTCCGTTGTCATCGACGACGAGAAAGCCCCGTAAAATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTACGAGTTCTCTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCAAAAGCATTATCCCAAGGAAAATCGGTTCGAAGAATTCGAGCAAATGCAGCTGCCATAGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACATGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACAGGGATGGCCTGATGTTGTTAACCCAAATGCTGGTAATATGAATCGTTTTCCAAAGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTCTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGATGAAGAAGCTGGTTCAAGTATGGCTGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGAGGATGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA
Protein sequence
MIELLCLWEAITLPCLQVFLSSSSTFSLKPQNPNSTALQQSSSMASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKPRKMSSLSNISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGTGQGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSSSYFS
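The protein sequence above should follow from the CDS by ordinary translation. The sketch below is a minimal illustration, assuming the standard nuclear genetic code; the `translate` helper and `CODON_TABLE` name are hypothetical and not part of the database. It is applied only to the first 30 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (the third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Assumes only A, C, G, T; any other symbol raises KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS shown above.
cds_start = "ATGATAGAATTGTTGTGTTTATGGGAAGCA"
cds_end = "TATTTCAGTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in the same way would allow a comparison against the complete protein sequence.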
Homology
BLAST of CmoCh17G004880 vs. ExPASy TrEMBL
Match:
A0A6J1H4M0 (uncharacterized protein LOC111459998 OS=Cucurbita moschata OX=3662 GN=LOC111459998 PE=4 SV=1)
HSP 1 Score: 1132.5 bits (2928), Expect = 0.0e+00
Identity = 592/592 (100.00%), Positives = 592/592 (100.00%), Query Frame = 0
Query: 44 MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVDQTLF 103
MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVDQTLF
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVDQTLF 60
Query: 104 TKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDD 163
TKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDD
Sbjct: 61 TKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDD 120
Query: 164 FGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTDNVAHAC 223
FGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTDNVAHAC
Sbjct: 121 FGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTDNVAHAC 180
Query: 224 KSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSK 283
KSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSK
Sbjct: 181 KSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSK 240
Query: 284 SSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL 343
SSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL
Sbjct: 241 SSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVL 300
Query: 344 RPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKPRKMSSLSNISYK 403
RPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKPRKMSSLSNISYK
Sbjct: 301 RPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKPRKMSSLSNISYK 360
Query: 404 SLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQGKSV 463
SLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQGKSV
Sbjct: 361 SLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQGKSV 420
Query: 464 RRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGTGQGWPDVVNPNA 523
RRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGTGQGWPDVVNPNA
Sbjct: 421 RRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGTGQGWPDVVNPNA 480
Query: 524 GNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGA 583
GNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGA
Sbjct: 481 GNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGA 540
Query: 584 FEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSSSYFS 636
FEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSSSYFS
Sbjct: 541 FEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSSSYFS 592
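The identity and positives percentages in each HSP header are ratios of matched positions to alignment length. The sketch below is a hypothetical helper, not part of any BLAST tool. It parses a header line of the form used on this page and recomputes each percentage from the counts. The example line is the header of the Cucurbita maxima hit.

```python
import re

# Matches fields such as "Identity = 560/596 (93.96%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_stats(line: str) -> dict:
    """Return {label: (recomputed_percent, agrees_with_reported)}."""
    results = {}
    for label, matches, length, reported in HSP_STAT.findall(line):
        recomputed = f"{100 * int(matches) / int(length):.2f}"
        results[label] = (recomputed, recomputed == reported)
    return results


line = "Identity = 560/596 (93.96%), Positives = 569/596 (95.47%), Query Frame = 0"
print(check_hsp_stats(line))
```

The "Query Frame" field is skipped because it carries no matches/length ratio.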
BLAST of CmoCh17G004880 vs. ExPASy TrEMBL
Match:
A0A6J1KUS4 (uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900 PE=4 SV=1)
HSP 1 Score: 1052.4 bits (2720), Expect = 7.8e-304
Identity = 560/596 (93.96%), Positives = 569/596 (95.47%), Query Frame = 0
Query: 44 MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 103
MASSASSPFTKLHFPHSPLPQPPA NSCAQFLCKS+FFCFFLLLLPLFPSEAPDFVD
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVD 60
Query: 104 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 163
QTLFTKFWELFHLM VGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD
Sbjct: 61 QTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120
Query: 164 DVDDFGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTDNV 223
DVDDF VSDERK+SEVLYIQP LGSASDLNAQSR QEKLRYS+PKKRYENSYEFADTDNV
Sbjct: 121 DVDDFSVSDERKLSEVLYIQPNLGSASDLNAQSRQQEKLRYSIPKKRYENSYEFADTDNV 180
Query: 224 AHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESC 283
AHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSL+SSLTESDDVEFDCGDESC
Sbjct: 181 AHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLKSSLTESDDVEFDCGDESC 240
Query: 284 LSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG 343
LSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG
Sbjct: 241 LSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFG 300
Query: 344 NAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKPRKMSSLSN 403
NAVLRPSHFRPPSIDETQFESLKKSGSLHS LSQSSQTSSLSS LSSTTRK KMSSLSN
Sbjct: 301 NAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKHHKMSSLSN 360
Query: 404 ISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQ 463
ISYKSLHSRQYS SSLSENSRGSSEDPLIEQENSSECNESVVSSPRSD NF SIPKALSQ
Sbjct: 361 ISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDMNFRSIPKALSQ 420
Query: 464 GKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGTGQGWPDVV 523
GKS+RRI+ANAAAIED+KAQEMHRKQVKHDDIIGNKFEEGG S PY+REDGTG GWPDV
Sbjct: 421 GKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTS-PYIREDGTGHGWPDVA 480
Query: 524 NPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDS 583
NPNA NM+RFP TTFLGIKEQKEETESLVADDSKDDSEGEDES FASSDEEA SSMAGDS
Sbjct: 481 NPNASNMSRFPTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAASSMAGDS 540
Query: 584 ESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSSSYFS 636
ESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR GGGGWGSFSSTSSSYFS
Sbjct: 541 ESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLR----GGGGWGSFSSTSSSYFS 591
BLAST of CmoCh17G004880 vs. ExPASy TrEMBL
Match:
A0A5D3DMA5 (DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G002840 PE=4 SV=1)
HSP 1 Score: 857.4 bits (2214), Expect = 3.7e-245
Identity = 475/607 (78.25%), Positives = 514/607 (84.68%), Query Frame = 0
Query: 44 MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 103
MASS S+PFTK HFPHSPLP +NSC FLCKS+FFC FLLLLPLFPSEAP+FV+
Sbjct: 1 MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 104 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNV--DEPRYSSFENPQSYLSKMLYVASI 163
QTL TKFWELFHLMFVGIAVSYGLFS RN Q++V DEPR+S+FENPQSYLSKML+VASI
Sbjct: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120
Query: 164 FDDVDDFGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTD 223
F+DVDDF VSDERK+SEVLYIQP LGS NA SR QE YS+PKKRYENS EF DT+
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVRGFNAISRQQENFHYSIPKKRYENSLEFDDTN 180
Query: 224 NVAHACKSRYTRGGSVVVVPETNRSS------SGGIVNYKPLGLPVRSLRSSLTESDDVE 283
+V HACKSRYTRGGSVVVV ETNRS+ SG IVNYKPLGLPVRSLRS+LTE DDVE
Sbjct: 181 SVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPDDVE 240
Query: 284 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 343
FDCGDESCLSSKSS K+SE+NCE SEFGDNCCVNLEEKFDET IA MS FQLRE FGK
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENFGKN 300
Query: 344 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKP 403
++RERG NAVLRPSHFRP SIDETQFESLKKS SLHS LSQSSQTSSLS LSSTTRK
Sbjct: 301 MMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTTRKH 360
Query: 404 RKMSSLSNISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 463
RKMSSL NISYKS HSRQYS SSLSENSRGSSEDPLIE ENSSECNES++SSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDRNFA 420
Query: 464 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 523
IPKALS+GKSVR IRAN +AIE+MKAQEM+R QV+HDD +GNKF EGGMS PYMREDGT
Sbjct: 421 HIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKF-EGGMS-PYMREDGT 480
Query: 524 GQGWPDVVNPNAGNMNRFPK-TTFLGIKEQKEETESLVADD--SKDDSEGEDESLFASSD 583
G GWP + +PNAG NR PK TTF GI+EQKE+ ES + DD +D+SE ED S F SSD
Sbjct: 481 GHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESSD 540
Query: 584 EEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSS 636
EEA SSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLR GGWGSFSS
Sbjct: 541 EEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLR------GGWGSFSS 599
BLAST of CmoCh17G004880 vs. ExPASy TrEMBL
Match:
E5GCN2 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)
HSP 1 Score: 857.4 bits (2214), Expect = 3.7e-245
Identity = 475/607 (78.25%), Positives = 514/607 (84.68%), Query Frame = 0
Query: 44 MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 103
MASS S+PFTK HFPHSPLP +NSC FLCKS+FFC FLLLLPLFPSEAP+FV+
Sbjct: 1 MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 104 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNV--DEPRYSSFENPQSYLSKMLYVASI 163
QTL TKFWELFHLMFVGIAVSYGLFS RN Q++V DEPR+S+FENPQSYLSKML+VASI
Sbjct: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120
Query: 164 FDDVDDFGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTD 223
F+DVDDF VSDERK+SEVLYIQP LGS NA SR QE YS+PKKRYENS EF DT+
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVRGFNAISRQQENFHYSIPKKRYENSLEFDDTN 180
Query: 224 NVAHACKSRYTRGGSVVVVPETNRSS------SGGIVNYKPLGLPVRSLRSSLTESDDVE 283
+V HACKSRYTRGGSVVVV ETNRS+ SG IVNYKPLGLPVRSLRS+LTE DDVE
Sbjct: 181 SVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPDDVE 240
Query: 284 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 343
FDCGDESCLSSKSS K+SE+NCE SEFGDNCCVNLEEKFDET IA MS FQLRE FGK
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENFGKN 300
Query: 344 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKP 403
++RERG NAVLRPSHFRP SIDETQFESLKKS SLHS LSQSSQTSSLS LSSTTRK
Sbjct: 301 MMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTTRKH 360
Query: 404 RKMSSLSNISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 463
RKMSSL NISYKS HSRQYS SSLSENSRGSSEDPLIE ENSSECNES++SSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDRNFA 420
Query: 464 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 523
IPKALS+GKSVR IRAN +AIE+MKAQEM+R QV+HDD +GNKF EGGMS PYMREDGT
Sbjct: 421 HIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKF-EGGMS-PYMREDGT 480
Query: 524 GQGWPDVVNPNAGNMNRFPK-TTFLGIKEQKEETESLVADD--SKDDSEGEDESLFASSD 583
G GWP + +PNAG NR PK TTF GI+EQKE+ ES + DD +D+SE ED S F SSD
Sbjct: 481 GHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESSD 540
Query: 584 EEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSS 636
EEA SSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLR GGWGSFSS
Sbjct: 541 EEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLR------GGWGSFSS 599
BLAST of CmoCh17G004880 vs. ExPASy TrEMBL
Match:
A0A0A0K9X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G103540 PE=4 SV=1)
HSP 1 Score: 850.9 bits (2197), Expect = 3.4e-243
Identity = 471/608 (77.47%), Positives = 511/608 (84.05%), Query Frame = 0
Query: 44 MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 103
MA S S+PFTK HFPHSPLP +NSC QF+CKS+FFC FLLLLPLFPSEAP+FV+
Sbjct: 1 MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 104 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNV--DEPRYSSFENPQSYLSKMLYVASI 163
QT TKFWELFHLMF+GIAVSYGLFS RN Q++V DEPR+S+FENPQSYLSKM +VASI
Sbjct: 61 QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120
Query: 164 FDDVDDFGVSDERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTD 223
F+DVDDF VSDERK+SEVLYIQP LGS S LNA SR QE YS+PKKRYENS EFA+TD
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVSGLNAISRQQENFHYSIPKKRYENSLEFAETD 180
Query: 224 NVAHACKSRYTRGGSVVVVPETNRSS------SGGIVNYKPLGLPVRSLRSSLTESDDVE 283
NV HACKSRYTRGGSVVVV ETNRS+ SG IVNYKPLGLPVRSL+SSLTE DDVE
Sbjct: 181 NVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPDDVE 240
Query: 284 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 343
FDCGDESCLSSKSS K+SE+NCE SEFGDNCCVNLEEKFDET IASMS FQLREKF K
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKFEKN 300
Query: 344 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKP 403
++RER NAVLRPSHFRP SIDETQFESLKKS SLHS LSQSSQTSSLSSPLSS TRK
Sbjct: 301 MMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRTRKH 360
Query: 404 RKMSSLSNISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 463
RKMSSL NISYKS HSRQYS SSLSENSRGSSEDPLI+ ENSSECNESVVSSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDRNFA 420
Query: 464 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 523
+ PKALS+GKSVR +RA+ +AIE+MKAQEM+R QV+HDD + NKF EGGMS PYMRED T
Sbjct: 421 NTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKF-EGGMS-PYMREDET 480
Query: 524 GQGWPDVVNPNAGNMNRFPK----TTFLGIKEQKEETESLVADDSKDDSEGEDESLFASS 583
G GWP + N NA NR+ K TTF GI+EQKE+TES V DD KD+SE ED+S F SS
Sbjct: 481 GHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFESS 540
Query: 584 DEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFS 636
DEEA SM GDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR GGWGSFS
Sbjct: 541 DEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR------GGWGSFS 600
BLAST of CmoCh17G004880 vs. TAIR 10
Match:
AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink).)
HSP 1 Score: 139.4 bits (350), Expect = 9.8e-33
Identity = 139/392 (35.46%), Positives = 188/392 (47.96%), Query Frame = 0
Query: 47 SASSPFTKLHFPHSPL--PQPPANSC--AQFLCKSIFFCFFLLLLPLFPSEAPDFVDQTL 106
++ +P+TK P + + PQP S F CKS+ F FLL LPLFPS+APDFV +T+
Sbjct: 2 ASPNPYTKRRSPPNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETV 61
Query: 107 FTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD-DV 166
TKFWEL HL+FVGIAV+YGLFS RN + VD E+ SY+S++ V+S+FD +
Sbjct: 62 LTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRIFQVSSVFDEEF 121
Query: 167 DDFGVS--DERKVSEVLYIQPKLGSASDLNAQSRHQEKLRYSMPKKRYENSYEFADTDNV 226
DD D R V +G + +S E S EF +T+ V
Sbjct: 122 DDNSCEFVDVRSDESVSARASVVGKSESFVVES------------GELEESSEFGETNEV 181
Query: 227 AHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESC 286
A S+Y +G S VVV G +V ++PLGLP+R LRSSL D +
Sbjct: 182 -RAWNSQYFQGKSKVVVARPAYGLDGHVV-HQPLGLPIRRLRSSLR----------DNAA 241
Query: 287 LSSKSSPKSSEN--NCEGNSEFGDNCCVNLEEKFDE--TAIASMSSFQLREKFGKKVIRE 346
L KS S + N E S DN FDE A AS +Q R +
Sbjct: 242 LQDKSFADSCDGAVNAEAESLLADNF-------FDEVLAAPASPVPWQARPEM------- 301
Query: 347 RGFGNAVLRPSHFRPPSIDET-QFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKPRKM 406
G G+ PS+F+P S+DET + S + +GS S S +SQ + SP S + +
Sbjct: 302 MGIGDNY--PSNFQPISVDETLKSISSRSTGSSSSQTSYASQNQNRFSPSRSVSAESLNS 353
Query: 407 SSLSNISYKSLHSRQYSTSSLSENSRGSSEDP 427
+ + KS S S+S S S P
Sbjct: 362 NVEELVKEKSRQSSSRSSSPSLPPSPSLSPSP 353
HSP 2 Score: 42.0 bits (97), Expect = 2.1e-03
Identity = 35/79 (44.30%), Positives = 48/79 (60.76%), Query Frame = 0
Query: 541 KEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQ 600
K E E + ++ + +E + E F +E A S + S EVD+KAGEFIAKFREQ
Sbjct: 661 KSEPEEVAMEEPQ--AEQQPEVTFEEEEEAAWESQSNASHDHN-EVDRKAGEFIAKFREQ 720
Query: 601 IQLQRMASVEKRLRGGGGG 620
I+LQ++ S E+ RGGG G
Sbjct: 721 IRLQKLISGEQP-RGGGTG 735
BLAST of CmoCh17G004880 vs. TAIR 10
Match:
AT4G16790.1 (hydroxyproline-rich glycoprotein family protein)
HSP 1 Score: 63.2 bits (152), Expect = 8.9e-10
Identity = 43/123 (34.96%), Positives = 64/123 (52.03%), Query Frame = 0
Query: 64 QPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVDQTLFTKFWELFHLMFVGIAVSYGL 123
Q P ++F+ K++ ++P+F S+ P+ +Q T+ EL HL+FVGIAVSYGL
Sbjct: 19 QNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQ---TRLLELLHLVFVGIAVSYGL 78
Query: 124 FSTRN-------NQMNVDEPRYS-SFENPQSYLSKMLYVASIFD-------DVDDFGVSD 172
FS RN N D + S N SY+ K+L V+S+F+ + D D
Sbjct: 79 FSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESESEPSDDSSGD 138
HSP 2 Score: 34.7 bits (78), Expect = 3.4e-01
Identity = 27/74 (36.49%), Positives = 43/74 (58.11%), Query Frame = 0
Query: 539 EQKEETESLVADDSKDDSEGEDESLFASSDEEAGSSMAGDSE-SGAFEVDKKAGEFIAKF 598
+Q+ S ++S++ + E+ E+ G SE + +VDKKA EFIAKF
Sbjct: 389 DQRSNLGSKAVEESENGEQRRGENEIHDEVEKKIVEEEGVSEINNGSDVDKKADEFIAKF 448
Query: 599 REQIQLQRMASVEK 612
REQI+LQR+ S+++
Sbjct: 449 REQIRLQRIESIKR 462
The following BLAST results are available for this feature:
BLAST of CmoCh17G004880 vs. ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1H4M0 | 0.0e+00 | 100.00 | uncharacterized protein LOC111459998 OS=Cucurbita moschata OX=3662 GN=LOC1114599...
A0A6J1KUS4 | 7.8e-304 | 93.96 | uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900...
A0A5D3DMA5 | 3.7e-245 | 78.25 | DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
E5GCN2 | 3.7e-245 | 78.25 | Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A0A0K9X1 | 3.4e-243 | 77.47 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G103540 PE=4 SV=1
BLAST of CmoCh17G004880 vs. TAIR 10
Match Name | E-value | Identity | Description
AT3G60380.1 | 9.8e-33 | 35.46 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT4G16790.1 | 8.9e-10 | 34.96 | hydroxyproline-rich glycoprotein family protein
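The pipe-delimited summary rows can be read programmatically to filter hits. The sketch below is an illustration only: the `parse_hit_row` helper is hypothetical, and the `1e-20` cutoff is an arbitrary assumption rather than a threshold used by the database. Only the first four fields of each row are read, so any trailing cells are ignored.

```python
def parse_hit_row(row: str) -> dict:
    """Split one summary row into match name, E-value, identity, description."""
    fields = [f.strip() for f in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),      # e.g. "7.8e-304" parses as a float
        "identity": float(identity),  # percent identity of the top HSP
        "description": description,
    }


# Three rows copied from the summary tables above.
rows = [
    "A0A6J1KUS4 | 7.8e-304 | 93.96 | uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900...",
    "AT3G60380.1 | 9.8e-33 | 35.46 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...",
    "AT4G16790.1 | 8.9e-10 | 34.96 | hydroxyproline-rich glycoprotein family protein",
]

hits = [parse_hit_row(r) for r in rows]

# Hypothetical cutoff: keep hits with an E-value of 1e-20 or lower.
strong = [h["match"] for h in hits if h["evalue"] <= 1e-20]
print(strong)
```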