Cp4.1LG12g04410.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG12g04410.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionHydroxyproline-rich glycoprotein family protein, putative
LocationCp4.1LG12 : 3731848 .. 3733623 (+)
Sequence length1776
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCACATTTCGTCGATCACACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGATCTGCGAGTGATTTCAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTACTCAATACCGAAAAAAAGGTACGAAAACTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAACCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAGAAGCTCTGAGAATAATTGTGAAGGAAACAGTGAATTTGGTGATGATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGATTAGAGAGAGAGGATTTGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAAATCAGGATCTCTTCATTCTAATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACGACGAGACAGCACCGTAAAATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTATGAGTTCTGTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCGGTTCGAAGAATTCGAGCAAATGCAGCTGCCATGGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACAAGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACACGGATGGCCTGATGTTGTTAACCCGAATGCTAGTAATATGAATCGTTTTCCGAAGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTGTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGATGAAGAAGCTGGTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGGTGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA

mRNA sequence

ATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCACATTTCGTCGATCACACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGATCTGCGAGTGATTTCAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTACTCAATACCGAAAAAAAGGTACGAAAACTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAACCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAGAAGCTCTGAGAATAATTGTGAAGGAAACAGTGAATTTGGTGATGATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGATTAGAGAGAGAGGATTTGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAAATCAGGATCTCTTCATTCTAATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACGACGAGACAGCACCGTAAAATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTATGAGTTCTGTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCGGTTCGAAGAATTCGAGCAAATGCAGCTGCCATGGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACAAGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACACGGATGGCCTGATGTTGTTAACCCGAATGCTAGTAATATGAATCGTTTTCCGAAGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTGTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGATGAAGAAGCTGGTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGGTGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA

Coding sequence (CDS)

ATGGCGTCTTCAGCTTCTAGCCCCTTCACGAAGCTCCATTTCCCCCATTCTCCACTTCCACAACCACCAGCCAACTCCTGCGCACAGTTTCTCTGTAAATCCATCTTCTTCTGCTTTTTTCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCACATTTCGTCGATCACACTTTGTTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTCAGCACAAGGAACAACCAGATGAATGTAGACGAACCTCGCTACTCCAGTTTTGAGAATCCGCAGTCTTATTTGTCTAAGATGCTTTACGTCGCTTCAATTTTTGATGATGTTGACGATTTTGGTGTTTCTGATGAAAGGAAAGTGAGTGAAGTTCTGTACATTCAGCCGAAACTTGGATCTGCGAGTGATTTCAATGCGCAATCTCGCCACCAGGAAAAACTCCGTTACTCAATACCGAAAAAAAGGTACGAAAACTCTTATGAATTTGCTGATACTGATAATGTCGCTCATGCTTGTAAATCGAGATATACTCGTGGTGGATCTGTGGTGGTTGTGCCTGAAACAAACCGTAGTTCATCAGGAGGCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTGAGATCGAGTCTTACTGAATCCGACGATGTCGAATTCGATTGTGGTGATGAATCTTGTTTGAGTTCTAAAAGTTCACCCAGAAGCTCTGAGAATAATTGTGAAGGAAACAGTGAATTTGGTGATGATTGTTGTGTGAATTTGGAGGAGAAGTTTGATGAGACTGCAATTGCATCAATGTCCTCATTTCAATTGCGTGAGAAATTTGGAAAGAAGGTGATTAGAGAGAGAGGATTTGGGAATGCTGTTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAAATCAGGATCTCTTCATTCTAATCTATCTCAGTCATCACAAACTAGTTCCCTCTCTTCTTCGTTGTCATCGACGACGAGACAGCACCGTAAAATGTCGTCACTCAGTAACATTTCCTATAAGTCGTTGCATTCTCGACAATACAGTATGAGTTCTGTGTCTGAAAACAGTAGAGGGAGCTCTGAAGACCCTCTGATTGAACAAGAAAACTCATCCGAGTGCAATGAATCCGTGGTGAGTTCGCCACGTTCGGACAGGAATTTCGCAAGTATTCCGAAAGCTTTATCCCAAGGAAAATCGGTTCGAAGAATTCGAGCAAATGCAGCTGCCATGGAGGATATGAAAGCTCAAGAGATGCACAGAAAGCAAGTTAAACAAGATGACATTATAGGGAATAAGTTTGAAGAAGGTGGAATGTCACCACCATATATGAGAGAAGATGGAACGGGACACGGATGGCCTGATGTTGTTAACCCGAATGCTAGTAATATGAATCGTTTTCCGAAGACGACGTTCTTGGGGATTAAGGAGCAGAAGGAAGAGACAGAGAGTGTGGTGGCAGATGATAGTAAAGATGACTCTGAGGGGGAGGATGAAAGTTTGTTTGCAAGTTCAGATGAAGAAGCTGGTTCAAGTATGGCCGGAGATTCGGAGTCGGGGGCTTTCGAGGTCGACAAGAAGGCGGGCGAGTTCATAGCCAAGTTCAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGTTGAAAAAAGATTGAGAGGAGGAGGAGGAGGAGGAGGGTGGGGGTCATTCAGCAGCACAAGCAGCAGCTATTTCAGTTGA

Protein sequence

MASSASSPFTKLHFPHSPLPQPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHGWPDVVNPNASNMNRFPKTTFLGIKEQKEETESVVADDSKDDSEGEDESLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSSTSSSYFS
BLAST of Cp4.1LG12g04410.1 vs. TrEMBL
Match: E5GCN2_CUCME (Putative uncharacterized protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 1.7e-245
Identity = 473/606 (78.05%), Postives = 516/606 (85.15%), Query Frame = 1

Query: 1   MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVD 60
           MASS S+PFTK HFPHSPLP       +NSC  FLCKS+FFC FLLLLPLFPSEAP FV+
Sbjct: 1   MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  HTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPRYSSFENPQSYLSKMLYVASI 120
            TL TKFWELFHLMFVGIAVSYGLFS RN Q++VD  EPR+S+FENPQSYLSKML+VASI
Sbjct: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120

Query: 121 FDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTD 180
           F+DVDDF VSDERK+SEVLYIQP LGS   FNA SR QE   YSIPKKRYENS EF DT+
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVRGFNAISRQQENFHYSIPKKRYENSLEFDDTN 180

Query: 181 NVAHACKSRYTRGGSVVVVPETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVE 240
           +V HACKSRYTRGGSVVVV ETNRS+SG       IVNYKPLGLPVRSLRS+LTE DDVE
Sbjct: 181 SVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPDDVE 240

Query: 241 FDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK 300
           FDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IA MS FQLRE FGK 
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENFGKN 300

Query: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQH 360
           ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTR+H
Sbjct: 301 MMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTTRKH 360

Query: 361 RKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
           RKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLIE ENSSECNES++SSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDRNFA 420

Query: 421 SIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGT 480
            IPKALS+GKSVR IRAN +A+E+MKAQEM+R QV+ DD +GNKF EGGMS PYMREDGT
Sbjct: 421 HIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKF-EGGMS-PYMREDGT 480

Query: 481 GHGWPDVVNPNASNMNRFPK-TTFLGIKEQKEETESVVADD--SKDDSEGEDESLFASSD 540
           GHGWP + +PNA   NR PK TTF GI+EQKE+ ES + DD   +D+SE ED S F SSD
Sbjct: 481 GHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESSD 540

Query: 541 EEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSST 592
           EEA SSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLR     GGWGSFSST
Sbjct: 541 EEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLR-----GGWGSFSST 599

BLAST of Cp4.1LG12g04410.1 vs. TrEMBL
Match: A0A0A0K9X1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G103540 PE=4 SV=1)

HSP 1 Score: 844.0 bits (2179), Expect = 1.1e-241
Identity = 466/607 (76.77%), Postives = 511/607 (84.18%), Query Frame = 1

Query: 1   MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVD 60
           MA S S+PFTK HFPHSPLP       +NSC QF+CKS+FFC FLLLLPLFPSEAP FV+
Sbjct: 1   MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  HTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPRYSSFENPQSYLSKMLYVASI 120
            T  TKFWELFHLMF+GIAVSYGLFS RN Q++VD  EPR+S+FENPQSYLSKM +VASI
Sbjct: 61  QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120

Query: 121 FDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTD 180
           F+DVDDF VSDERK+SEVLYIQP LGS S  NA SR QE   YSIPKKRYENS EFA+TD
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVSGLNAISRQQENFHYSIPKKRYENSLEFAETD 180

Query: 181 NVAHACKSRYTRGGSVVVVPETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVE 240
           NV HACKSRYTRGGSVVVV ETNRS+SG       IVNYKPLGLPVRSL+SSLTE DDVE
Sbjct: 181 NVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPDDVE 240

Query: 241 FDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK 300
           FDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IASMS FQLREKF K 
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKFEKN 300

Query: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQH 360
           ++RER   NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLSS LSS TR+H
Sbjct: 301 MMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRTRKH 360

Query: 361 RKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
           RKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLI+ ENSSECNESVVSSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDRNFA 420

Query: 421 SIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGT 480
           + PKALS+GKSVR +RA+ +A+E+MKAQEM+R QV+ DD + NKF EGGMS PYMRED T
Sbjct: 421 NTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKF-EGGMS-PYMREDET 480

Query: 481 GHGWPDVVNPNASNMNRFPK----TTFLGIKEQKEETESVVADDSKDDSEGEDESLFASS 540
           GHGWP + N NA+  NR+ K    TTF GI+EQKE+TES V DD KD+SE ED+S F SS
Sbjct: 481 GHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFESS 540

Query: 541 DEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSS 592
           DEEA  SM GDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR     GGWGSFSS
Sbjct: 541 DEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR-----GGWGSFSS 600

BLAST of Cp4.1LG12g04410.1 vs. TrEMBL
Match: A0A061DWI6_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_005691 PE=4 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 2.4e-50
Identity = 209/589 (35.48%), Postives = 286/589 (48.56%), Query Frame = 1

Query: 25  NSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTR 84
           N+  +F  K +FF FFL+ +PLFPS+AP FV+ T+  KFWEL HLMF+GIAVSYGLF  R
Sbjct: 25  NTYTRFAGKLLFFTFFLIAIPLFPSQAPDFVNRTILNKFWELLHLMFIGIAVSYGLFGRR 84

Query: 85  NNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASD 144
               NVD     + ++ QS +S M +++ IF+D  D            +Y   + G A  
Sbjct: 85  ----NVDN---GNLDDSQSNVSGMFHLSPIFEDGFDHS----------MYYSGQ-GKAGF 144

Query: 145 FNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETN-----RS 204
           FNA++              +EN YE    +NV  A  S+Y +G  +VV+ + N       
Sbjct: 145 FNAKN------------DSFENPYE----ENVVQAWSSKYIQGEPIVVLAQPNCGIEKYG 204

Query: 205 SSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDD 264
            SG  ++YKPLGLPVRSL+S +      EF  G      S     S  ++   +  F D 
Sbjct: 205 ESGSNIDYKPLGLPVRSLKSRVGSRGSPEFGNGSSESSGSSVKDLSDSSDKWRSERFNDL 264

Query: 265 CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLK 324
              NLE KF E+ +   S    R + G+   R RG      RPSHFRP S+DETQF+SL 
Sbjct: 265 GSENLEGKFSESHVLG-SPIPWRARSGRTKERVRG---GATRPSHFRPLSVDETQFDSL- 324

Query: 325 KSGSLHSNLSQSSQTSSLSSSLSSTTRQH--------RKMSSL----------------- 384
           KS SL S +S SSQ  S S S S+ +  H          MS L                 
Sbjct: 325 KSRSLRSTVSFSSQVGSQSHSPSNLSPSHSNSSESPKSNMSELVKERSPRRSFPPTSSSI 384

Query: 385 -----SNISYKSLHSRQYSMSS-VSENSRGSSEDPLIE-----QENSSECNESVVSSPRS 444
                S  S  + HSRQYS  S ++ ++R   ED L E     + +SS   E +  S   
Sbjct: 385 PKPMSSKASVTASHSRQYSDGSLLAIHARKCFEDELKEFCDSRKNDSSSSKEWISGSFEF 444

Query: 445 DRNFASIPKALSQGKSV---RRIRANAAAMEDMKAQEMHRKQVK-QDDIIGNKFEEGGMS 504
           + N A+  KA S+GKSV   R  RAN  A+   +A E +   +K +  +  ++ EE    
Sbjct: 445 EANPAAPSKASSRGKSVRTFRTFRANGNAVGAREAGEKNENHLKGKLAVASDEVEEAYTD 504

Query: 505 PPYMREDGTGHGWPDVVNPNASNMNRFPKTTFL-GIKEQKEETESVVADDSKDDSEGEDE 564
               + +G  +        N       PK T L    ++K+E     A +  +DSE E+E
Sbjct: 505 KSEPKIEGLNNLSLGFNRQNLGGDCYMPKPTSLENQNKEKQEYSEHPAVEFGEDSESENE 564

Query: 565 SLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVE 568
               SSDEE  S       S   EVD+KAGEFIAKF+EQI+LQR  S++
Sbjct: 565 DFQVSSDEETMSGTFCVEGSDTSEVDRKAGEFIAKFKEQIRLQRTTSID 574

BLAST of Cp4.1LG12g04410.1 vs. TrEMBL
Match: M5W8Q2_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa022289mg PE=4 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 1.0e-48
Identity = 207/588 (35.20%), Postives = 297/588 (50.51%), Query Frame = 1

Query: 7   SPFTKLHFPHSPL-----PQPPANSCA---QFLCKSIFFCFFLLLLPLFPSEAPHFVDHT 66
           SP+ K HFP+S L     P P     +    FL K++FF   ++++PLFPS+AP F++HT
Sbjct: 5   SPYRKPHFPYSELNSAIHPNPIKQGKSYTMHFLFKALFFALVIMIIPLFPSQAPDFINHT 64

Query: 67  LFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDD- 126
           + TKFWEL HL+F+GIAVSYGLFS RN +   + P  S+  + +SY+ ++  V+S FDD 
Sbjct: 65  ILTKFWELIHLVFIGIAVSYGLFSRRNVERGFENP--SNLGSSESYMPRIFPVSSNFDDG 124

Query: 127 VDDFGVSDERKV-------SEVLYIQPKLGSASD---FNAQSRHQEKLRYSIPKKRYENS 186
            ++   SDE++V       S+     P   S+ +   F+AQ     K    + ++  ENS
Sbjct: 125 YENPCGSDEKRVVGLGSWNSQYFVGNPVTVSSHESTGFDAQC----KPSLPVHERGSENS 184

Query: 187 YEFADTDNVAHACKSRYTRGGSVVVVPETNR-----SSSGGIVNYKPLGLPVRSLRSSLT 246
           Y + + +N+  A  S+Y  G  +V V + N           IV+ +PLGLP+RSL+S + 
Sbjct: 185 YGYKE-NNLTQAWSSQYFHGEPMVFVAQPNYGFDEWGKPRSIVDSEPLGLPIRSLKSRVI 244

Query: 247 ESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASM----SS 306
           + D  EF  G ES  SS  SP SS+ +   N +FGD   +NLEE+F+E   A       S
Sbjct: 245 DQDSSEFVTGSESGSSSNFSPNSSDKS--RNGKFGDLGPLNLEEEFNEATAAPFPVHRGS 304

Query: 307 FQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLS 366
              R + GK+V        +  RPSHFRP S+DETQFES+ K+ S  S LS SS++S  S
Sbjct: 305 SSGRMEMGKRV-------GSSSRPSHFRPLSVDETQFESM-KTRSFRSTLSFSSESSQTS 364

Query: 367 SSLSSTTRQHRKMSSLSNISYKSLHSRQYSMSSVSE----NSRGSSEDPLIEQENSSECN 426
           S           MSS       S H      SS +     +  GS ED L  +E      
Sbjct: 365 S-----------MSSSPKEDIGSFHEEDLRRSSENYFKGLSGSGSEEDQLGNKELGP--- 424

Query: 427 ESVVSSPRSDRNFASIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFE 486
               +S RSD   AS+ KA  +G+SVR IR +               ++  DD +    +
Sbjct: 425 ----ASLRSDVKPASLTKASLRGRSVRTIRPS---------------RLTTDDKVEKMCD 484

Query: 487 EGG---MSPPYMREDGTGHGWPDVVNP--NASNMNRFPKTTF--LGIKEQKEETESVVAD 546
            GG   M    ++  GT   + D V    +  N    PK T      KE +E   +VVA+
Sbjct: 485 NGGAISMRKDIIQNGGTDKKFFDNVTGKLDLGNSLHMPKPTIPKYQKKEMQEFHGNVVAE 542

Query: 547 DSKDDSEGEDESLFASSDEE-----AGSSMAGDSESGA---FEVDKKA 548
           +S+DDSE E E+   SS++E     A ++   +S + A    EVDKKA
Sbjct: 545 ESEDDSESEAENFLVSSEDEDADPPAAAAATCNSVNVAGPDSEVDKKA 542

BLAST of Cp4.1LG12g04410.1 vs. TrEMBL
Match: A0A067HCP1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006972mg PE=4 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 2.6e-44
Identity = 194/602 (32.23%), Postives = 297/602 (49.34%), Query Frame = 1

Query: 26  SCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTRN 85
           S   FL K +FF  FL+ +PLFPS+AP F++ T+ TKFWEL HL+FVG+AVSYGLF  RN
Sbjct: 34  SFTHFLGKFLFFALFLIAIPLFPSQAPDFINQTVLTKFWELVHLLFVGLAVSYGLFCRRN 93

Query: 86  NQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASDF 145
           +  ++ E   ++ ++  SY+S++L+V+S+FD+  D       K +       + G +   
Sbjct: 94  DDGDI-ETHSNNTDDSYSYVSRVLHVSSLFDNGFDNSYGFNEKYAY------QTGCSDSG 153

Query: 146 NAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACK-SRYTRGGSVVVVPETN-----RS 205
           ++    Q K+R   P+  ++N     + +NV+ A   S+Y +  S+VVV + N       
Sbjct: 154 SSVIGDQSKVRSLNPEVWFQNPSGCGE-NNVSQAWNYSQYVQSESMVVVNQENCAVNEYG 213

Query: 206 SSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDD 265
            S  ++++KPLGLPVRSLRS +   D  E + G ES   S SS  S + + E  + FG+ 
Sbjct: 214 ESELMMDHKPLGLPVRSLRSGVRNQDFSEINNGTESSSVSTSSSISPKESIE--NTFGEV 273

Query: 266 CCVNLEEKFDETAIASMSS-FQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESL 325
             +NLE K +E   A++SS    R   G+  +RE         PSHFRP S+DETQFESL
Sbjct: 274 SPLNLENKCNEGESAALSSPIPWRSISGRMEMREN--VGIASHPSHFRPHSVDETQFESL 333

Query: 326 KKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMSSLSN---------------------- 385
            KS S  S  + SSQ SS+S S +  +  H   S L N                      
Sbjct: 334 -KSQSFWSTENFSSQISSMSDSPNRLSPSHAVSSELENQEMEDLGEEQSYRNSYPPANMP 393

Query: 386 ----ISYKSLHSRQYSMSSVSE-NSRGSSEDPLIEQENSSECNESVVSSPRSDRNFASIP 445
                S  + H R+Y+  S+ E N + S ED      NS   +ES       ++ + S  
Sbjct: 394 TNGKASLNAFHIRRYTSGSLFEKNVQKSFED------NSKNRDESQRKDQLDNQEWRS-- 453

Query: 446 KALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGTGHG 505
            +L    S+ +  +   ++   +++    +  K+ +  GN      +  P    +    G
Sbjct: 454 GSLKWNGSLDKASSRGKSVRTFRSRRYEPEAAKRGEKSGNCI-SNDVGKPKEEIEAVCKG 513

Query: 506 WPDVVNPNASNMNR------------FPKTTFLGIKEQKEETESV-VADDSKDDSEGEDE 565
             ++ N    N++              PK T    +++K +  S  VA +SK+ SE + E
Sbjct: 514 KSEMANGGLDNLSASAKKQDVDYHFPMPKPTHSQFQKRKNKEHSQRVAVESKEISESKAE 573

Query: 566 SLFASSDEEAGSSMAGDSESGAF------------EVDKKAGEFIAKFREQIQLQRMASV 569
           +    SDE + ++   D+E                EVDKKAGEFIA+FREQI+LQ+MAS+
Sbjct: 574 NYEVKSDEGSMTNSGNDAEPDPMTNSGNDAEPDPNEVDKKAGEFIARFREQIRLQKMASI 613

BLAST of Cp4.1LG12g04410.1 vs. TAIR10
Match: AT3G60380.1 (AT3G60380.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 138.3 bits (347), Expect = 1.6e-32
Identity = 142/388 (36.60%), Postives = 188/388 (48.45%), Query Frame = 1

Query: 4   SASSPFTKLHFPHSPL--PQPPANSCAQ--FLCKSIFFCFFLLLLPLFPSEAPHFVDHTL 63
           ++ +P+TK   P + +  PQP   S     F CKS+ F  FLL LPLFPS+AP FV  T+
Sbjct: 2   ASPNPYTKRRSPPNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETV 61

Query: 64  FTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD-DV 123
            TKFWEL HL+FVGIAV+YGLFS RN +  VD       E+  SY+S++  V+S+FD + 
Sbjct: 62  LTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRIFQVSSVFDEEF 121

Query: 124 DDFGVS--DERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTDNV 183
           DD      D R    V      +G +  F  +S               E S EF +T+ V
Sbjct: 122 DDNSCEFVDVRSDESVSARASVVGKSESFVVES------------GELEESSEFGETNEV 181

Query: 184 AHACKSRYTRGGSVVVVPETNRSSSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESC 243
             A  S+Y +G S VVV        G +V ++PLGLP+R LRSSL           D + 
Sbjct: 182 -RAWNSQYFQGKSKVVVARPAYGLDGHVV-HQPLGLPIRRLRSSLR----------DNAA 241

Query: 244 LSSKSSPRSSEN--NCEGNSEFGDDCCVNLEEKFDE--TAIASMSSFQLREKFGKKVIRE 303
           L  KS   S +   N E  S   D+        FDE   A AS   +Q R +        
Sbjct: 242 LQDKSFADSCDGAVNAEAESLLADNF-------FDEVLAAPASPVPWQARPEM------- 301

Query: 304 RGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQHRKMS 363
            G G+    PS+F+P S+DET      KS S  S  S SSQTS  S       +   + S
Sbjct: 302 MGIGDNY--PSNFQPISVDET-----LKSISSRSTGSSSSQTSYAS-------QNQNRFS 335

Query: 364 SLSNISYKSLHSRQYSMSSVSENSRGSS 381
              ++S +SL+S    +  V E SR SS
Sbjct: 362 PSRSVSAESLNSNVEEL--VKEKSRQSS 335

BLAST of Cp4.1LG12g04410.1 vs. TAIR10
Match: AT4G16790.1 (AT4G16790.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 59.7 bits (143), Expect = 7.0e-09
Identity = 42/123 (34.15%), Postives = 62/123 (50.41%), Query Frame = 1

Query: 21  QPPANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGL 80
           Q P    ++F+ K++       ++P+F S+ P   + T   +  EL HL+FVGIAVSYGL
Sbjct: 19  QNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQT---RLLELLHLVFVGIAVSYGL 78

Query: 81  FSTRN-------NQMNVDEPRYS-SFENPQSYLSKMLYVASIFD-------DVDDFGVSD 129
           FS RN          N D  +   S  N  SY+ K+L V+S+F+       +  D    D
Sbjct: 79  FSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESESEPSDDSSGD 138

BLAST of Cp4.1LG12g04410.1 vs. NCBI nr
Match: gi|307136424|gb|ADN34231.1| (hypothetical protein [Cucumis melo subsp. melo])

HSP 1 Score: 856.7 bits (2212), Expect = 2.4e-245
Identity = 473/606 (78.05%), Postives = 516/606 (85.15%), Query Frame = 1

Query: 1   MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVD 60
           MASS S+PFTK HFPHSPLP       +NSC  FLCKS+FFC FLLLLPLFPSEAP FV+
Sbjct: 1   MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  HTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPRYSSFENPQSYLSKMLYVASI 120
            TL TKFWELFHLMFVGIAVSYGLFS RN Q++VD  EPR+S+FENPQSYLSKML+VASI
Sbjct: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120

Query: 121 FDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTD 180
           F+DVDDF VSDERK+SEVLYIQP LGS   FNA SR QE   YSIPKKRYENS EF DT+
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVRGFNAISRQQENFHYSIPKKRYENSLEFDDTN 180

Query: 181 NVAHACKSRYTRGGSVVVVPETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVE 240
           +V HACKSRYTRGGSVVVV ETNRS+SG       IVNYKPLGLPVRSLRS+LTE DDVE
Sbjct: 181 SVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPDDVE 240

Query: 241 FDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK 300
           FDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IA MS FQLRE FGK 
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENFGKN 300

Query: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQH 360
           ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTR+H
Sbjct: 301 MMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTTRKH 360

Query: 361 RKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
           RKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLIE ENSSECNES++SSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDRNFA 420

Query: 421 SIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGT 480
            IPKALS+GKSVR IRAN +A+E+MKAQEM+R QV+ DD +GNKF EGGMS PYMREDGT
Sbjct: 421 HIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKF-EGGMS-PYMREDGT 480

Query: 481 GHGWPDVVNPNASNMNRFPK-TTFLGIKEQKEETESVVADD--SKDDSEGEDESLFASSD 540
           GHGWP + +PNA   NR PK TTF GI+EQKE+ ES + DD   +D+SE ED S F SSD
Sbjct: 481 GHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESSD 540

Query: 541 EEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSST 592
           EEA SSMAG+SESGA+EVDKKAGEFIAKFREQIQLQRMASV+KRLR     GGWGSFSST
Sbjct: 541 EEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLR-----GGWGSFSST 599

BLAST of Cp4.1LG12g04410.1 vs. NCBI nr
Match: gi|449445742|ref|XP_004140631.1| (PREDICTED: uncharacterized protein LOC101220435 [Cucumis sativus])

HSP 1 Score: 844.0 bits (2179), Expect = 1.6e-241
Identity = 466/607 (76.77%), Postives = 511/607 (84.18%), Query Frame = 1

Query: 1   MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVD 60
           MA S S+PFTK HFPHSPLP       +NSC QF+CKS+FFC FLLLLPLFPSEAP FV+
Sbjct: 1   MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  HTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPRYSSFENPQSYLSKMLYVASI 120
            T  TKFWELFHLMF+GIAVSYGLFS RN Q++VD  EPR+S+FENPQSYLSKM +VASI
Sbjct: 61  QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120

Query: 121 FDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTD 180
           F+DVDDF VSDERK+SEVLYIQP LGS S  NA SR QE   YSIPKKRYENS EFA+TD
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVSGLNAISRQQENFHYSIPKKRYENSLEFAETD 180

Query: 181 NVAHACKSRYTRGGSVVVVPETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVE 240
           NV HACKSRYTRGGSVVVV ETNRS+SG       IVNYKPLGLPVRSL+SSLTE DDVE
Sbjct: 181 NVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPDDVE 240

Query: 241 FDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK 300
           FDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IASMS FQLREKF K 
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKFEKN 300

Query: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQH 360
           ++RER   NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLSS LSS TR+H
Sbjct: 301 MMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRTRKH 360

Query: 361 RKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
           RKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLI+ ENSSECNESVVSSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDRNFA 420

Query: 421 SIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGT 480
           + PKALS+GKSVR +RA+ +A+E+MKAQEM+R QV+ DD + NKF EGGMS PYMRED T
Sbjct: 421 NTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKF-EGGMS-PYMREDET 480

Query: 481 GHGWPDVVNPNASNMNRFPK----TTFLGIKEQKEETESVVADDSKDDSEGEDESLFASS 540
           GHGWP + N NA+  NR+ K    TTF GI+EQKE+TES V DD KD+SE ED+S F SS
Sbjct: 481 GHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFESS 540

Query: 541 DEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGWGSFSS 592
           DEEA  SM GDSESGA EVDKKAGEFIAKFREQIQLQRMASV+KRLR     GGWGSFSS
Sbjct: 541 DEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLR-----GGWGSFSS 600

BLAST of Cp4.1LG12g04410.1 vs. NCBI nr
Match: gi|659119650|ref|XP_008459770.1| (PREDICTED: uncharacterized protein LOC103498804 [Cucumis melo])

HSP 1 Score: 753.4 bits (1944), Expect = 2.9e-214
Identity = 412/534 (77.15%), Postives = 452/534 (84.64%), Query Frame = 1

Query: 1   MASSASSPFTKLHFPHSPLPQPP----ANSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVD 60
           MASS S+PFTK HFPHSPLP       +NSC  FLCKS+FFC FLLLLPLFPSEAP FV+
Sbjct: 1   MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  HTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVD--EPRYSSFENPQSYLSKMLYVASI 120
            TL TKFWELFHLMFVGIAVSYGLFS RN Q++VD  EPR+S+FENPQSYLSKML+VASI
Sbjct: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120

Query: 121 FDDVDDFGVSDERKVSEVLYIQPKLGSASDFNAQSRHQEKLRYSIPKKRYENSYEFADTD 180
           F+DVDDF VSDERK+SEVLYIQP LGS   FNA SR QE   YSIPKKRYENS EF DT+
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVRGFNAISRQQENFHYSIPKKRYENSLEFDDTN 180

Query: 181 NVAHACKSRYTRGGSVVVVPETNRSSSG------GIVNYKPLGLPVRSLRSSLTESDDVE 240
           +V HACKSRYTRGGSVVVV ETNRS+SG       IVNYKPLGLPVRSLRS+LTE DDVE
Sbjct: 181 SVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPDDVE 240

Query: 241 FDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQLREKFGKK 300
           FDCGDESCLSSKSS ++SE+NCE  SEFGD+CCVNLEEKFDET IA MS FQLRE FGK 
Sbjct: 241 FDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENFGKN 300

Query: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRQH 360
           ++RERG  NAVLRPSHFRP SIDETQFESLKKS SLHSNLSQSSQTSSLS SLSSTTR+H
Sbjct: 301 MMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTTRKH 360

Query: 361 RKMSSLSNISYKSLHSRQYSMSSVSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
           RKMSSL NISYKS HSRQYS+SS+SENSRGSSEDPLIE ENSSECNES++SSPR DRNFA
Sbjct: 361 RKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDRNFA 420

Query: 421 SIPKALSQGKSVRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGGMSPPYMREDGT 480
            IPKALS+GKSVR IRAN +A+E+MKAQEM+R QV+ DD +GNKF EGGMS PYMREDGT
Sbjct: 421 HIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKF-EGGMS-PYMREDGT 480

Query: 481 GHGWPDVVNPNASNMNRFPK-TTFLGIKEQKEETESVVADD--SKDDSEGEDES 520
           GHGWP + +PNA   NR PK TTF GI+EQKE+ ES + DD   +D+SE ED S
Sbjct: 481 GHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVS 532

BLAST of Cp4.1LG12g04410.1 vs. NCBI nr
Match: gi|645253169|ref|XP_008232452.1| (PREDICTED: uncharacterized serine-rich protein C215.13 [Prunus mume])

HSP 1 Score: 232.6 bits (592), Expect = 1.7e-57
Identity = 234/653 (35.83%), Postives = 339/653 (51.91%), Query Frame = 1

Query: 7   SPFTKLHFPHSPL-----PQPPANSCA---QFLCKSIFFCFFLLLLPLFPSEAPHFVDHT 66
           SP+ K HFP+S L     P P     +    FL K++FF   +++LPLFPS+AP F++HT
Sbjct: 5   SPYRKPHFPYSELNSAIHPNPIKQGKSYMMHFLFKALFFALVIMVLPLFPSQAPDFINHT 64

Query: 67  LFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFDD- 126
           + TKFWEL HL+F+GIAVSYGLFS RN +   + P  S+  + +SY+ ++  V+S FDD 
Sbjct: 65  ILTKFWELIHLVFIGIAVSYGLFSRRNVERGFENP--SNLGSSESYMPRIFPVSSNFDDG 124

Query: 127 VDDFGVSDERKV-------SEVLYIQPKLGSASD---FNAQSRHQEKLRYSIPKKRY--E 186
            ++   SDE++V       S+     P   S+ +   F+AQ +       S+P   +  E
Sbjct: 125 YENPCGSDEKRVVGLGTWNSQYFVGNPVTVSSHESTGFDAQCKP------SLPVHEHGSE 184

Query: 187 NSYEFADTDNVAHACKSRYTRGGSVVVVPETNRSSSG-----GIVNYKPLGLPVRSLRSS 246
           NSY + + +N+  A  S+Y +G  +V V + N   +       IV+ +PLGLP+RSL+S 
Sbjct: 185 NSYGYKE-NNLTQAWSSQYFQGEPMVFVAQPNYGLNEWGKPRSIVDSEPLGLPIRSLKSR 244

Query: 247 LTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDDCCVNLEEKFDETAIASMSSFQ 306
           + + D  EF    ES  SS  SP SS+ +   N EFGD   +NLEE+F+E   A   +  
Sbjct: 245 VRDQDSSEFVTRSESGSSSNFSPNSSDKS--RNGEFGDLGPLNLEEEFNEATAAPFPAH- 304

Query: 307 LREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLS---QSSQTSSL 366
            R     ++ R +  G++  RPSHFRP S+DETQFES+K + SL S LS   +SSQTSS+
Sbjct: 305 -RGSSSGRMERGKRVGSSG-RPSHFRPLSVDETQFESMK-TRSLRSTLSFSSESSQTSSM 364

Query: 367 SSS--------------LSSTTRQHRKMSS-----------------LSNISYKSLHSRQ 426
           SSS              L+S     +K  S                    +S  +LHSR 
Sbjct: 365 SSSPKEESFARSISSEALNSKVNNLKKRKSSQGSSPSGLPSSPPKPITEKVSMSTLHSRG 424

Query: 427 YSMSSVSENSRGSSEDPLIEQENSSECNESVV-------SSPRSDRNFASIPKALSQGKS 486
           YS+ S  E     S +   +  + S   E  +       +S RSD   AS+ KA  +G+S
Sbjct: 425 YSIGSFHEEDLRRSSENYFKDLSGSGSEEDQLGNKELGPASLRSDVKPASLTKASLRGRS 484

Query: 487 VRRIRANAAAMEDMKAQEMHRKQVKQDDIIGNKFEEGG---MSPPYMREDGTGHGWPDVV 546
           VR IR +               ++  DD +    + GG   M    ++  GT   + D V
Sbjct: 485 VRTIRPS---------------RLTTDDKVEKMCDNGGAISMRKDIIQNGGTDKKFFDNV 544

Query: 547 NP--NASNMNRFPKTTFLGI--KEQKEETESVVADDSKDDSEGEDESLFASSDEE----- 578
               +  N    PK T      KE +E   +VVA++S+DDSE E ++   SS++E     
Sbjct: 545 TGKLDLGNSLHMPKPTIPKYQKKEMQEFHGNVVAEESEDDSESEAKNFLVSSEDEDADPP 604

BLAST of Cp4.1LG12g04410.1 vs. NCBI nr
Match: gi|590723796|ref|XP_007052284.1| (Uncharacterized protein TCM_005691 [Theobroma cacao])

HSP 1 Score: 208.4 bits (529), Expect = 3.5e-50
Identity = 209/589 (35.48%), Postives = 286/589 (48.56%), Query Frame = 1

Query: 25  NSCAQFLCKSIFFCFFLLLLPLFPSEAPHFVDHTLFTKFWELFHLMFVGIAVSYGLFSTR 84
           N+  +F  K +FF FFL+ +PLFPS+AP FV+ T+  KFWEL HLMF+GIAVSYGLF  R
Sbjct: 25  NTYTRFAGKLLFFTFFLIAIPLFPSQAPDFVNRTILNKFWELLHLMFIGIAVSYGLFGRR 84

Query: 85  NNQMNVDEPRYSSFENPQSYLSKMLYVASIFDDVDDFGVSDERKVSEVLYIQPKLGSASD 144
               NVD     + ++ QS +S M +++ IF+D  D            +Y   + G A  
Sbjct: 85  ----NVDN---GNLDDSQSNVSGMFHLSPIFEDGFDHS----------MYYSGQ-GKAGF 144

Query: 145 FNAQSRHQEKLRYSIPKKRYENSYEFADTDNVAHACKSRYTRGGSVVVVPETN-----RS 204
           FNA++              +EN YE    +NV  A  S+Y +G  +VV+ + N       
Sbjct: 145 FNAKN------------DSFENPYE----ENVVQAWSSKYIQGEPIVVLAQPNCGIEKYG 204

Query: 205 SSGGIVNYKPLGLPVRSLRSSLTESDDVEFDCGDESCLSSKSSPRSSENNCEGNSEFGDD 264
            SG  ++YKPLGLPVRSL+S +      EF  G      S     S  ++   +  F D 
Sbjct: 205 ESGSNIDYKPLGLPVRSLKSRVGSRGSPEFGNGSSESSGSSVKDLSDSSDKWRSERFNDL 264

Query: 265 CCVNLEEKFDETAIASMSSFQLREKFGKKVIRERGFGNAVLRPSHFRPPSIDETQFESLK 324
              NLE KF E+ +   S    R + G+   R RG      RPSHFRP S+DETQF+SL 
Sbjct: 265 GSENLEGKFSESHVLG-SPIPWRARSGRTKERVRG---GATRPSHFRPLSVDETQFDSL- 324

Query: 325 KSGSLHSNLSQSSQTSSLSSSLSSTTRQH--------RKMSSL----------------- 384
           KS SL S +S SSQ  S S S S+ +  H          MS L                 
Sbjct: 325 KSRSLRSTVSFSSQVGSQSHSPSNLSPSHSNSSESPKSNMSELVKERSPRRSFPPTSSSI 384

Query: 385 -----SNISYKSLHSRQYSMSS-VSENSRGSSEDPLIE-----QENSSECNESVVSSPRS 444
                S  S  + HSRQYS  S ++ ++R   ED L E     + +SS   E +  S   
Sbjct: 385 PKPMSSKASVTASHSRQYSDGSLLAIHARKCFEDELKEFCDSRKNDSSSSKEWISGSFEF 444

Query: 445 DRNFASIPKALSQGKSV---RRIRANAAAMEDMKAQEMHRKQVK-QDDIIGNKFEEGGMS 504
           + N A+  KA S+GKSV   R  RAN  A+   +A E +   +K +  +  ++ EE    
Sbjct: 445 EANPAAPSKASSRGKSVRTFRTFRANGNAVGAREAGEKNENHLKGKLAVASDEVEEAYTD 504

Query: 505 PPYMREDGTGHGWPDVVNPNASNMNRFPKTTFL-GIKEQKEETESVVADDSKDDSEGEDE 564
               + +G  +        N       PK T L    ++K+E     A +  +DSE E+E
Sbjct: 505 KSEPKIEGLNNLSLGFNRQNLGGDCYMPKPTSLENQNKEKQEYSEHPAVEFGEDSESENE 564

Query: 565 SLFASSDEEAGSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVE 568
               SSDEE  S       S   EVD+KAGEFIAKF+EQI+LQR  S++
Sbjct: 565 DFQVSSDEETMSGTFCVEGSDTSEVDRKAGEFIAKFKEQIRLQRTTSID 574

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
E5GCN2_CUCME1.7e-24578.05Putative uncharacterized protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
A0A0A0K9X1_CUCSA1.1e-24176.77Uncharacterized protein OS=Cucumis sativus GN=Csa_6G103540 PE=4 SV=1[more]
A0A061DWI6_THECC2.4e-5035.48Uncharacterized protein OS=Theobroma cacao GN=TCM_005691 PE=4 SV=1[more]
M5W8Q2_PRUPE1.0e-4835.20Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa022289mg PE=4 S... [more]
A0A067HCP1_CITSI2.6e-4432.23Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006972mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G60380.11.6e-3236.60 FUNCTIONS IN: molecular_function unknown[more]
AT4G16790.17.0e-0934.15 hydroxyproline-rich glycoprotein family protein[more]
Match NameE-valueIdentityDescription
gi|307136424|gb|ADN34231.1|2.4e-24578.05hypothetical protein [Cucumis melo subsp. melo][more]
gi|449445742|ref|XP_004140631.1|1.6e-24176.77PREDICTED: uncharacterized protein LOC101220435 [Cucumis sativus][more]
gi|659119650|ref|XP_008459770.1|2.9e-21477.15PREDICTED: uncharacterized protein LOC103498804 [Cucumis melo][more]
gi|645253169|ref|XP_008232452.1|1.7e-5735.83PREDICTED: uncharacterized serine-rich protein C215.13 [Prunus mume][more]
gi|590723796|ref|XP_007052284.1|3.5e-5035.48Uncharacterized protein TCM_005691 [Theobroma cacao][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR008480DUF761_pln
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG12g04410Cp4.1LG12g04410gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG12g04410.1:cds:001Cp4.1LG12g04410.1:cds:001CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG12g04410.1Cp4.1LG12g04410.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008480Protein of unknown function DUF761, plantPFAMPF05553DUF761coord: 542..568
score: 1.
NoneNo IPR availablePANTHERPTHR34059FAMILY NOT NAMEDcoord: 4..575
score: 2.5E
NoneNo IPR availablePANTHERPTHR34059:SF1SUBFAMILY NOT NAMEDcoord: 4..575
score: 2.5E