Cp4.1LG03g04670 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g04670
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function, DUF642
Location: Cp4.1LG03 : 5376829 .. 5380310 (-)
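
As a rough illustration of how the Location value can be read programmatically, the sketch below splits it into sequence ID, start, end, and strand, then derives the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for a location written as "SEQID : START .. END (STRAND)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG03 : 5376829 .. 5380310 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 3482 bp on the minus strand of Cp4.1LG03.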
The following sequences are available for this feature:

Gene sequence (with intron)

GTCCCAAATGGAAATAAGCAGACAGATACACAAACAATGTTTCAGGTTACACCTCACTTAAACGCGGATTTCTCTACTAATCTCCTTCTCTTTCCTCGCGTCTTCGTGGTTTTATCATTTTTCCCTCACTATAAATACAATCTCTTAATCCCACTTCCTCTCCCATCCATTGAATACCCTTTTCCCCTTCTCAAACCCCCCAAAAATGGCTTCTCTTTCATGTTCATCACTGCCATTGCTAGCGTTCTTCTTGCTATTGGCTTCTTCAGCCTTTGCAGGGTCGACTTTGGAAGGTATAATGCTTATTTTCGAAACCCCTTTTGCTTAGTTTTGTAATTTGGCTATCTTTTATGTGGGTTTGTTGTTTAGCTGCATGATGAACTGATGTTTGTTTGTGGGCTTCTCTCTTTTGTTGTTGGTACTGTTTTGGATCTTGTTTGTTGGATTTATTGTAGAAATGTGTAGTTTTAGCTTTCTGGTTTGCTATAATGAATGTTTTGTACTGGTTTTACTTATGAAATCCATTTAGATCTGCCCCTTATAGAGTTTGTTTTACTCTCCAAACCAGTTTGAGGCTTTGTATTTGGTGTTAGGATGCTAAGAGTGTCAAATGAGCCTAGGCTCACCGCTAGCAGATATTGTACTCTTTGAACTTTTCCTTCTTGGCTTGGATTGTTACAAATGGTATCAGAGCTAGACACCGGGCGGTGTGCTAGTNAAAAATGGCTTCTCTTTCATGTTCATCACTGCCATTGCTAGCGTTCTTCTTGCTATTGGCTTCTTCAGCCTTTGCAGGGTCGATTTTGGAAGGTATAATGCTTATTTTCGAAACCCCTTTTGCTTAGTTTTGTAATTTGGCTATCTTTTATGTGGGTTTGTTGTTTAGCTGCATGATGAACTGATGTTTGTTTGTGGGCTTCTCTCTTTTGTTGTTGGTACTGTTTTGGATCTTGTTTGTTGGATTTATTGTAGAAATGTGTAGTTTTAGCTTTCTGGTTTGCTATATTGAATGTTTTGTACTGGTTTTACTTATGAAATCCATTTAGATCTGCCCCTTATAGAGTTTGTTTTACTCTCCAAACCCGTTTGAGGCTTTGTATTTGGTGTTAGGATGCTAAGAGTGTCAAATGAGCCTAGGCTCACCGCTAGCAGATATTGTACTCTTTGAACTTTTCCTTCTTGGCTTGGATTGTTACAAATGGTATCAGAGCTAGACACCGGGCGGTGTGCTAGTAGGGATGTTGGGCCCTCAATGGGGTGGATTGTGAGATCTCACATTGGTTGGAGAGGGGAAGAAGCATTCCTTATAAGGGTGTGAAACCCCTCTCCCTAGTAGACGTGTTTTAGAACTGTGAGGTTGATGGTGATATGTAATGGGCCAAAGCAGACAATATGTGCTAGCGATGGGCTTGGACTGTTACAAATGGTATCAGAGCTAGACATTGGTCAGTGTGCTAACGAGGATGCTGGGCCCTCAAGGGGGTGGATTGTGAGATCCCAAGTCGGTTGAAGAGGGGAACGGAACATATTTTGTAAGGGTGGAGCCCGGAAGGGAAACCTCTCCCTAGCAGACGTGTTTTCAAACCGTGAGGTTGACGGTGATACGTAACTGGCCAAAACGGACAACGGTGGGCTTGGGCTGTTACAAATGGTATTAGAGCTAGATATCGGGCGGTGTGTCAGCGAGGACGCTAGCTCTCAAAGGGGTGTATTGTGAGATCCCACATCAGTTGGAGAGAGGAACGAAGCATTCTTTAGAGAAAGGTGTGGAAATCTCTCTCTAGTGGGCGTGTTTTAAAACCTTGAGGGGAAGCCGAAAAGAGAGCCTAAACAAAATAATATCTGCTAGCGGTGAGCTTGGCCTGTTATATGTCAATCGAGTTGGATCTTTAACTTTCTGCCTCCCTCTTAGAGCATTTCTTGATCTTGAACGTTGCTTCTTTATTCACACAGGACTTCTAGCAAATGGGAACTTTGAGGAACCACCATTAAAAACCAAC
TTAAAGAAAACAGTCATAGTAGGCAAAGACTCTCTGCCAAGCTGGGAGATCAATGGCTTTGTTGAGTACATCTCAGGTGGCCCTCAACCCGGCGGAATGTTCTTCCCCGTTGCTCACGGCGTCCATGCCGTAAGGCTCGGCAACGAAGCATCGATATCTCAGATCATAAAAGTGAAAAAAGGGTCTCTTTATGCTCTAACTTTTGGAGCTTCAAGAACATGTGCACAAGATGAAGTCTTGTCTGTGTTAGTACCTCCCCAGAATGGAAGTTTGCCTCTGCAGACTCTTTACAGCAGCGATGGAGGCGATGTTTACGCTTTCGGATTTGTAGCTTCGTCGGATTCTGTTAAGGTTACGTTCCACAATCCCGGAGTTCAGGAAGATCCCGCTTGTGGACCTCTGTTGGACGCCGTTGCCATCAAAGAGCTTATTCGTCCAGTCCCAACAAGAGGTAAACATGCTCGTCTGTCTTTTCGAGATCTAAAAACTCAGGTCATTTGATAACGAACTTGTGAATTTGGTTCGCTGCTTAATCTCGATGTTTGTTAATGGTAATCTTGCAGATAACTTGGTTAGGAACCCGAGCTTCGAGGTTGGTCCTCATCGGTTAGTAAACTCGACCAACGGAGTTCTTCTTCCTCCAAGACAAGAAGACCTTACATCTCCACTCCCAGGCTGGATCATAGAGTCACTCAAGGCTGTAAAGTTCATTGATTCAAAGCATTTCAACGTTCCCGTTGGACTTGCAGCGGTCGAGCTCGTTGCAGGGAGAGAAAGCGCGGTGGCTCAAATCATCAGAACCATCCCCAACAAGCTATACTCGCTAACGTTCAAAGTCGGTGACGCCAAGAACGGATGCCATGGATCAATGATGGTGGAAGCATTTGCTGCTAAAGACACTCTCAAAGTTCCCTTCCAATCTCAAGGAAAAGGGCTTTACAAAACTGCCATTCTCAAGTTCAAAGCAATCTCGCCTAGAACCAGAATCACATTCTTCAGTTCATACTACCATACCAGAACAGACGACTTCGGATCCCTTTGTGGCCCCGTGCTCGATGATGTTCGTGTTGTTCCTACGTATTAGAGATATTGTTTCCATGGCTGCTGCTACCGCGCCCCGGGTTTGGTATCGAAATACTCGTAGACGTTGCATCGGCTGCTTAATGTAATTAAAAACCTTTGATGAGGCTAGATGTGTTCTTCAACTGTTTGCAATTATGGATTATTTCATAGTATCCATGATCTTAGTTTGAATGTGGCGTGTCACGGTCGCATCATGACACTCTTCTATCTTAGCCTTCACGTGACTCGTTGAACACCATTTGAGAGTCACCGCTTTACTTATCGTAATGTAGTGTCAAAATCTATTTTATTTATAATATGTTTATAACATTTTTCTCCACTTAAATTAGTTATCGTGGTCGGTGATTTTGTAGATTATGATACTATGCACAAATCAAATTATTTTTTTATTTGTTTAAT

mRNA sequence

GTCCCAAATGGAAATAAGCAGACAGATACACAAACAATGTTTCAGGTTACACCTCACTTAAACGCGGATTTCTCTACTAATCTCCTTCTCTTTCCTCGCGTCTTCGTGGTTTTATCATTTTTCCCTCACTATAAATACAATCTCTTAATCCCACTTCCTCTCCCATCCATTGAATACCCTTTTCCCCTTCTCAAACCCCCCAAAAATGGCTTCTCTTTCATGTTCATCACTGCCATTGCTAGCGTTCTTCTTGCTATTGGCTTCTTCAGCCTTTGCAGGGTCGACTTTGGAAGGACTTCTAGCAAATGGGAACTTTGAGGAACCACCATTAAAAACCAACTTAAAGAAAACAGTCATAGTAGGCAAAGACTCTCTGCCAAGCTGGGAGATCAATGGCTTTGTTGAGTACATCTCAGGTGGCCCTCAACCCGGCGGAATGTTCTTCCCCGTTGCTCACGGCGTCCATGCCGTAAGGCTCGGCAACGAAGCATCGATATCTCAGATCATAAAAGTGAAAAAAGGGTCTCTTTATGCTCTAACTTTTGGAGCTTCAAGAACATGTGCACAAGATGAAGTCTTGTCTGTGTTAGTACCTCCCCAGAATGGAAGTTTGCCTCTGCAGACTCTTTACAGCAGCGATGGAGGCGATGTTTACGCTTTCGGATTTGTAGCTTCGTCGGATTCTGTTAAGGTTACGTTCCACAATCCCGGAGTTCAGGAAGATCCCGCTTGTGGACCTCTGTTGGACGCCGTTGCCATCAAAGAGCTTATTCGTCCAGTCCCAACAAGAGATAACTTGGTTAGGAACCCGAGCTTCGAGGTTGGTCCTCATCGGTTAGTAAACTCGACCAACGGAGTTCTTCTTCCTCCAAGACAAGAAGACCTTACATCTCCACTCCCAGGCTGGATCATAGAGTCACTCAAGGCTGTAAAGTTCATTGATTCAAAGCATTTCAACGTTCCCGTTGGACTTGCAGCGGTCGAGCTCGTTGCAGGGAGAGAAAGCGCGGTGGCTCAAATCATCAGAACCATCCCCAACAAGCTATACTCGCTAACGTTCAAAGTCGGTGACGCCAAGAACGGATGCCATGGATCAATGATGGTGGAAGCATTTGCTGCTAAAGACACTCTCAAAGTTCCCTTCCAATCTCAAGGAAAAGGGCTTTACAAAACTGCCATTCTCAAGTTCAAAGCAATCTCGCCTAGAACCAGAATCACATTCTTCAGTTCATACTACCATACCAGAACAGACGACTTCGGATCCCTTTGTGGCCCCGTGCTCGATGATGTTCGTGTTGTTCCTACGTATTAGAGATATTGTTTCCATGGCTGCTGCTACCGCGCCCCGGGTTTGGTATCGAAATACTCGTAGACGTTGCATCGGCTGCTTAATGTAATTAAAAACCTTTGATGAGGCTAGATGTGTTCTTCAACTGTTTGCAATTATGGATTATTTCATAGTATCCATGATCTTAGTTTGAATGTGGCGTGTCACGGTCGCATCATGACACTCTTCTATCTTAGCCTTCACGTGACTCGTTGAACACCATTTGAGAGTCACCGCTTTACTTATCGTAATGTAGTGTCAAAATCTATTTTATTTATAATATGTTTATAACATTTTTCTCCACTTAAATTAGTTATCGTGGTCGGTGATTTTGTAGATTATGATACTATGCACAAATCAAATTATTTTTTTATTTGTTTAAT

Coding sequence (CDS)

ATGGCTTCTCTTTCATGTTCATCACTGCCATTGCTAGCGTTCTTCTTGCTATTGGCTTCTTCAGCCTTTGCAGGGTCGACTTTGGAAGGACTTCTAGCAAATGGGAACTTTGAGGAACCACCATTAAAAACCAACTTAAAGAAAACAGTCATAGTAGGCAAAGACTCTCTGCCAAGCTGGGAGATCAATGGCTTTGTTGAGTACATCTCAGGTGGCCCTCAACCCGGCGGAATGTTCTTCCCCGTTGCTCACGGCGTCCATGCCGTAAGGCTCGGCAACGAAGCATCGATATCTCAGATCATAAAAGTGAAAAAAGGGTCTCTTTATGCTCTAACTTTTGGAGCTTCAAGAACATGTGCACAAGATGAAGTCTTGTCTGTGTTAGTACCTCCCCAGAATGGAAGTTTGCCTCTGCAGACTCTTTACAGCAGCGATGGAGGCGATGTTTACGCTTTCGGATTTGTAGCTTCGTCGGATTCTGTTAAGGTTACGTTCCACAATCCCGGAGTTCAGGAAGATCCCGCTTGTGGACCTCTGTTGGACGCCGTTGCCATCAAAGAGCTTATTCGTCCAGTCCCAACAAGAGATAACTTGGTTAGGAACCCGAGCTTCGAGGTTGGTCCTCATCGGTTAGTAAACTCGACCAACGGAGTTCTTCTTCCTCCAAGACAAGAAGACCTTACATCTCCACTCCCAGGCTGGATCATAGAGTCACTCAAGGCTGTAAAGTTCATTGATTCAAAGCATTTCAACGTTCCCGTTGGACTTGCAGCGGTCGAGCTCGTTGCAGGGAGAGAAAGCGCGGTGGCTCAAATCATCAGAACCATCCCCAACAAGCTATACTCGCTAACGTTCAAAGTCGGTGACGCCAAGAACGGATGCCATGGATCAATGATGGTGGAAGCATTTGCTGCTAAAGACACTCTCAAAGTTCCCTTCCAATCTCAAGGAAAAGGGCTTTACAAAACTGCCATTCTCAAGTTCAAAGCAATCTCGCCTAGAACCAGAATCACATTCTTCAGTTCATACTACCATACCAGAACAGACGACTTCGGATCCCTTTGTGGCCCCGTGCTCGATGATGTTCGTGTTGTTCCTACGTATTAG
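
The relationship between the coding sequence above and the protein sequence listed after it can be checked with a small translation routine. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 60 and the last 12 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        residue = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGCTTCTCTTTCATGTTCATCACTGCCATTGCTAGCGTTCTTCTTGCTATTGGCTTCT"
cds_end = "CCTACGTATTAG"
print(translate(cds_start))  # first 20 residues
print(translate(cds_end))    # last residues before the stop codon
```

The two fragments translate to MASLSCSSLPLLAFFLLLAS and PTY, which agree with the start and end of the protein sequence given for this feature.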

Protein sequence

MASLSCSSLPLLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVVPTY
BLAST of Cp4.1LG03g04670 vs. TrEMBL
Match: A0A0A0L9H0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G689790 PE=4 SV=1)

HSP 1 Score: 681.8 bits (1758), Expect = 4.7e-193
Identity = 338/367 (92.10%), Positives = 350/367 (95.37%), Query Frame = 1

Query: 1   MASLSCSSLPLLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSW 60
           M S   SSLP L FFLLLASSA AG+ LEGLLANGNFEEPP +TNLKKTVI+GK+SLPSW
Sbjct: 1   MPSPPSSSLPFLTFFLLLASSALAGTILEGLLANGNFEEPPAQTNLKKTVIIGKNSLPSW 60

Query: 61  EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCA 120
           EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQII VKKGSLYALTFGASRTCA
Sbjct: 61  EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIINVKKGSLYALTFGASRTCA 120

Query: 121 QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLL 180
           QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYA+GFVA SD VKVTFHNPGVQEDPACGPLL
Sbjct: 121 QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAYGFVAQSDLVKVTFHNPGVQEDPACGPLL 180

Query: 181 DAVAIKELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLK 240
           DAVAIKEL RP+PTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQED+TSPLPGWIIESLK
Sbjct: 181 DAVAIKELARPLPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDVTSPLPGWIIESLK 240

Query: 241 AVKFIDSKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMV 300
           AVKFIDSKHFNVPVGLAA+ELVAGRESAVAQIIRTIPNK+YSLTFKVGDAKNGCHGSMMV
Sbjct: 241 AVKFIDSKHFNVPVGLAAIELVAGRESAVAQIIRTIPNKVYSLTFKVGDAKNGCHGSMMV 300

Query: 301 EAFAAKDTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLD 360
           EAFAAK+T+KVPFQSQGKGLYK AILKFKA S RTRITFFSSYYHTRTDDFGSLCGPVLD
Sbjct: 301 EAFAAKETVKVPFQSQGKGLYKNAILKFKATSRRTRITFFSSYYHTRTDDFGSLCGPVLD 360

Query: 361 DVRVVPT 368
           DVRV+ T
Sbjct: 361 DVRVIST 367
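
The percentages in an HSP summary line are simply the counts divided by the alignment length. The sketch below, which assumes summary lines follow the `Label = count/length (percent%)` pattern seen in these reports, recomputes each percentage so it can be compared with the reported one; `check_hsp_stats` is a hypothetical helper name.

```python
import re

# Matches fields such as "Identity = 338/367 (92.10%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def check_hsp_stats(line):
    """Return (label, reported, recomputed) for each count/length field."""
    results = []
    for label, count, length, reported in STAT.findall(line):
        recomputed = f"{100 * int(count) / int(length):.2f}"
        results.append((label, reported, recomputed))
    return results

line = "Identity = 338/367 (92.10%), Positives = 350/367 (95.37%), Query Frame = 1"
for label, reported, recomputed in check_hsp_stats(line):
    print(label, reported, recomputed)
```

For the Cucumis sativus hit, both recomputed values match the reported 92.10% identity and 95.37% positives.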

BLAST of Cp4.1LG03g04670 vs. TrEMBL
Match: A0A059BJN3_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G03366 PE=4 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 1.8e-165
Identity = 292/365 (80.00%), Positives = 321/365 (87.95%), Query Frame = 1

Query: 4   LSCSSLPLLAFFLLLASSAFAGST--LEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWE 63
           ++ +S  +LAF LL+A+SAFA      EGLL NGNFE+PP +T+LKKTVI GK+ LP WE
Sbjct: 1   MAVTSSAVLAFLLLIAASAFAAPPPPSEGLLPNGNFEDPPKRTDLKKTVIQGKNGLPKWE 60

Query: 64  INGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQ 123
           ING VEYISGGPQPGGMFF VAHGVHAVRLGN+ASISQ I VK GSLYALTFGASRTCAQ
Sbjct: 61  INGLVEYISGGPQPGGMFFAVAHGVHAVRLGNDASISQSIPVKPGSLYALTFGASRTCAQ 120

Query: 124 DEVLSVLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLD 183
           DEVL V V PQ G LPLQTLYSS+GGD YA+GF A+S+  KV FHNPGVQEDPACGPLLD
Sbjct: 121 DEVLRVSVHPQTGDLPLQTLYSSNGGDTYAWGFRAASNVAKVVFHNPGVQEDPACGPLLD 180

Query: 184 AVAIKELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKA 243
           AVAIKEL  P PTRDNLV+N  FE GPHRL+NS+NGVLLPPRQEDLTSPLPGWIIESLKA
Sbjct: 181 AVAIKELFPPRPTRDNLVKNAGFEEGPHRLINSSNGVLLPPRQEDLTSPLPGWIIESLKA 240

Query: 244 VKFIDSKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVE 303
           VKFID KHFNVP GLAAVELVAGRESA+AQ+IRT+P+K Y+LTF VGDA+NGCHGSMMVE
Sbjct: 241 VKFIDKKHFNVPFGLAAVELVAGRESAIAQMIRTLPSKCYNLTFAVGDARNGCHGSMMVE 300

Query: 304 AFAAKDTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDD 363
           AFAAKDTLKVPFQSQGKG +KTA LKFKA+S RTR+TFFSS+YHTR DDFGSLCGPVLD+
Sbjct: 301 AFAAKDTLKVPFQSQGKGRFKTASLKFKALSARTRLTFFSSFYHTRIDDFGSLCGPVLDE 360

Query: 364 VRVVP 367
           VRVVP
Sbjct: 361 VRVVP 365

BLAST of Cp4.1LG03g04670 vs. TrEMBL
Match: B9RVM8_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0965260 PE=4 SV=1)

HSP 1 Score: 584.7 bits (1506), Expect = 7.7e-164
Identity = 283/357 (79.27%), Positives = 309/357 (86.55%), Query Frame = 1

Query: 11  LLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYIS 70
           LLAFFLL   S  A + +EG L NGNFE+ P   ++ KTV+ GK++LP WE NG VEYIS
Sbjct: 8   LLAFFLLFNGSGLAATYMEGFLKNGNFEQKPKPRDINKTVLKGKNALPGWETNGLVEYIS 67

Query: 71  GGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVP 130
            GPQPGGM+F VAHGVHAVRLGNEASISQ + VK GSLYALTFGASRTCAQDEVL V VP
Sbjct: 68  AGPQPGGMYFAVAHGVHAVRLGNEASISQTLAVKAGSLYALTFGASRTCAQDEVLRVSVP 127

Query: 131 PQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIR 190
           P +G LPLQTLYSS+GGD YA+GF+A S+ VKVTFHNPGVQEDPACGPL+DAVAIKEL  
Sbjct: 128 PLSGDLPLQTLYSSNGGDTYAWGFIAKSNVVKVTFHNPGVQEDPACGPLVDAVAIKELFP 187

Query: 191 PVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF 250
           P PTRDNLV+NP FE GPHRLVN++NGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF
Sbjct: 188 PRPTRDNLVKNPGFEEGPHRLVNTSNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF 247

Query: 251 NVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLK 310
           NVP GLAAVELVAGRESA+AQI+RTIPNK+Y LTF VGDAKNGCHGSMMVEAFAAKDT K
Sbjct: 248 NVPFGLAAVELVAGRESAIAQILRTIPNKVYDLTFSVGDAKNGCHGSMMVEAFAAKDTFK 307

Query: 311 VPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVVPT 368
           VPF+SQGKG +KT    FKA+S RTRITF+SSYYHTR DDFGSLCGPVLD VRV PT
Sbjct: 308 VPFESQGKGKFKTVSFNFKAVSARTRITFYSSYYHTRIDDFGSLCGPVLDQVRVFPT 364

BLAST of Cp4.1LG03g04670 vs. TrEMBL
Match: M5X0A4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007461mg PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 2.3e-160
Identity = 278/358 (77.65%), Positives = 311/358 (86.87%), Query Frame = 1

Query: 11  LLAFFLLLASSAFAGST--LEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEY 70
           LLAF  +  + A+A     L+GLL NGNFEEPP  TNLKKTV++GK +LP WEINGFVEY
Sbjct: 8   LLAFSFMFTNPAYAAVQVPLDGLLNNGNFEEPPKPTNLKKTVLIGKYALPKWEINGFVEY 67

Query: 71  ISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVL 130
           ISGGPQPGGM+F VAHGVHAVRLGNEASISQ IKVK GSLYALTFGASRTCAQ+EVL V 
Sbjct: 68  ISGGPQPGGMYFSVAHGVHAVRLGNEASISQTIKVKPGSLYALTFGASRTCAQEEVLRVS 127

Query: 131 VPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKEL 190
           VPPQ G LPLQTLYSS+GGD YA+GF A+S+ VKVTFHNPGVQEDPACGPLLDA+AIKEL
Sbjct: 128 VPPQAGDLPLQTLYSSNGGDTYAWGFRATSNVVKVTFHNPGVQEDPACGPLLDAIAIKEL 187

Query: 191 IRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSK 250
              +PTRDNLVRNP FE  PHRL NS++GVLLPP+Q D+TSPLPGWIIESLKAVKFIDS+
Sbjct: 188 FPALPTRDNLVRNPGFEEAPHRLFNSSHGVLLPPKQLDVTSPLPGWIIESLKAVKFIDSQ 247

Query: 251 HFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDT 310
           HFNVP G  AVELVAGRESA+AQ++RT+PNK+Y L+F VGDA+NGCHGSMMVEAFA KDT
Sbjct: 248 HFNVPFGKGAVELVAGRESAIAQVLRTVPNKIYDLSFVVGDARNGCHGSMMVEAFAGKDT 307

Query: 311 LKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVVP 367
           LKVPF SQGKG +K A LKFKA SPRTRITF+SS+YHTR DD+G+LCGP+LD VRV P
Sbjct: 308 LKVPFTSQGKGGFKAASLKFKAASPRTRITFYSSFYHTRVDDYGALCGPILDQVRVYP 365

BLAST of Cp4.1LG03g04670 vs. TrEMBL
Match: A0A061DVE3_THECC (F17A17.37 protein OS=Theobroma cacao GN=TCM_005853 PE=4 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 5.7e-159
Identity = 273/347 (78.67%), Positives = 303/347 (87.32%), Query Frame = 1

Query: 20  SSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYISGGPQPGGMF 79
           S+  A    EG L NGNFEE P  T+LKKTV++GK +LP W I+G VEYI+GGPQPGGMF
Sbjct: 37  SAKIAAKPFEGYLENGNFEEQPKPTDLKKTVLLGKYALPKWTISGLVEYITGGPQPGGMF 96

Query: 80  FPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVPPQNGSLPLQ 139
           FPVAHGVHAV+LGNEASISQ I VK G+LYALTFGASRTCAQDEVL V VP Q+G LPLQ
Sbjct: 97  FPVAHGVHAVKLGNEASISQTIPVKPGTLYALTFGASRTCAQDEVLRVSVPAQSGDLPLQ 156

Query: 140 TLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIRPVPTRDNLV 199
           TLYSS G DVYA+GF+A S  + VTFHNPGVQEDP CGPLLDAVAIKEL+RP+PTRDNLV
Sbjct: 157 TLYSSYGDDVYAWGFIAKSKYITVTFHNPGVQEDPTCGPLLDAVAIKELVRPMPTRDNLV 216

Query: 200 RNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHFNVPVGLAAV 259
           +NP FE GPHRLVNSTNGVLLPPRQED TSPLPGWIIESLKAVKFIDSKHFNVP G AAV
Sbjct: 217 KNPGFEEGPHRLVNSTNGVLLPPRQEDSTSPLPGWIIESLKAVKFIDSKHFNVPAGKAAV 276

Query: 260 ELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLKVPFQSQGKG 319
           ELVAGRESA+AQI+RT+PN+LY LTF +GDA+NGCHG MMVEAFA K+T+KVPF S+GKG
Sbjct: 277 ELVAGRESAIAQILRTVPNQLYDLTFIIGDARNGCHGEMMVEAFADKNTVKVPFTSRGKG 336

Query: 320 LYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVVP 367
            +KTA LKFKA++ RTRITFFSSYYHTR +DFGSLCGPVLD+VRV P
Sbjct: 337 EFKTASLKFKAVTARTRITFFSSYYHTRINDFGSLCGPVLDEVRVSP 383

BLAST of Cp4.1LG03g04670 vs. TAIR10
Match: AT3G08030.1 (AT3G08030.1 Protein of unknown function, DUF642)

HSP 1 Score: 529.3 bits (1362), Expect = 2.0e-150
Identity = 257/354 (72.60%), Positives = 294/354 (83.05%), Query Frame = 1

Query: 11  LLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYIS 70
           +L   LL+  +A      EG L NGNFEE P KT++KKTV++GK++LP WE  GFVEYI+
Sbjct: 8   ILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYIA 67

Query: 71  GGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVP 130
           GGPQPGGM+FPVAHGVHAVRLGNEA+ISQ ++VK GSLYALTFGASRTCAQDEVL V VP
Sbjct: 68  GGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALTFGASRTCAQDEVLRVSVP 127

Query: 131 PQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIR 190
            Q+G LPLQTLY+S GGDVYA+ FVA +  V VTFHNPGVQEDPACGPLLDAVAIKEL+ 
Sbjct: 128 SQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAVAIKELVH 187

Query: 191 PVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF 250
           P+ TR NLV+N  FE GPHRLVNST GVLLPP+QEDLTSPLPGWIIESLKAVKFIDSK+F
Sbjct: 188 PIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYF 247

Query: 251 NVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLK 310
           NVP G AA+ELVAG+ESA+AQ+IRT P + Y+L+F VGDAKN CHGSMMVEAFAA+DTLK
Sbjct: 248 NVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLK 307

Query: 311 VPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRV 365
           VP  S G G  KTA  KFKA+  RTRITFFS +YHT+  D  SLCGPV+D++ V
Sbjct: 308 VPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEIVV 361

BLAST of Cp4.1LG03g04670 vs. TAIR10
Match: AT2G41800.1 (AT2G41800.1 Protein of unknown function, DUF642)

HSP 1 Score: 453.4 bits (1165), Expect = 1.4e-127
Identity = 216/337 (64.09%), Positives = 260/337 (77.15%), Query Frame = 1

Query: 28  LEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYISGGPQPGGMFFPVAHGVH 87
           L+G+L NGNFE  PLK+N+K   I+G +SLP WEI G VE +SGGPQPGG +FPV  GVH
Sbjct: 30  LDGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPRGVH 89

Query: 88  AVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVPPQNGSLPLQTLYSSDGG 147
           AVRLGN  +ISQ ++VK G +Y+LTFGA+RTCAQDE + V VP Q   LPLQT++SSDGG
Sbjct: 90  AVRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGG 149

Query: 148 DVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIRPVPTRDNLVRNPSFEVG 207
           D YA+ F A+SD VKVTFHNPGVQED  CGPLLD VAIKE++    TR NLV+N  FE+G
Sbjct: 150 DTYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIG 209

Query: 208 PHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHFNVPVGLAAVELVAGRES 267
           PH   N + G+L+P R +D  SPLPGWI+ESLK VK+ID +HF VP G  AVELVAGRES
Sbjct: 210 PHVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRES 269

Query: 268 AVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLKVPFQSQGKGLYKTAILK 327
           A+AQIIRTI  K Y L+F VGDA+NGCHGSMMVEAFA ++  K+ F S+GKG +KT   +
Sbjct: 270 AIAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFR 329

Query: 328 FKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRV 365
           F A S RTR+TF+S++YHT+  DFG LCGPVLD V V
Sbjct: 330 FVADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVV 366

BLAST of Cp4.1LG03g04670 vs. TAIR10
Match: AT2G41810.1 (AT2G41810.1 Protein of unknown function, DUF642)

HSP 1 Score: 446.8 bits (1148), Expect = 1.3e-125
Identity = 212/337 (62.91%), Positives = 260/337 (77.15%), Query Frame = 1

Query: 28  LEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYISGGPQPGGMFFPVAHGVH 87
           L+GLL NGNFE+ P K+N++K  I+GK SLP WEI+G VE +SGGPQPGG +F V  GVH
Sbjct: 30  LDGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRGVH 89

Query: 88  AVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVPPQNGSLPLQTLYSSDGG 147
           A RLGN ASISQ +KVK G +Y+LTFG +RTCAQDE + + VP Q   LP+QTL+S++GG
Sbjct: 90  AARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGG 149

Query: 148 DVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIRPVPTRDNLVRNPSFEVG 207
           D YA+ F A+SD VKVTF+NPGVQEDP CGP++DAVAIKE++    T+ NLV+N  FE G
Sbjct: 150 DTYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETG 209

Query: 208 PHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHFNVPVGLAAVELVAGRES 267
           PH   N + G+L+P + +DL SPLPGWI+ESLK VK+ID++HF VP GLAA+ELVAGRES
Sbjct: 210 PHVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRES 269

Query: 268 AVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLKVPFQSQGKGLYKTAILK 327
           A+AQIIRT+  K Y L+F VGDA NGCHGSMMVEAFA     KV F+S  KG +K     
Sbjct: 270 AIAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFA 329

Query: 328 FKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRV 365
           F+A S RTRITF+S +YHT+  DFG LCGPVLD+V V
Sbjct: 330 FRADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366

BLAST of Cp4.1LG03g04670 vs. TAIR10
Match: AT4G32460.1 (AT4G32460.1 Protein of unknown function, DUF642)

HSP 1 Score: 411.4 bits (1056), Expect = 5.9e-115
Identity = 193/337 (57.27%), Positives = 253/337 (75.07%), Query Frame = 1

Query: 29  EGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYISGGPQPGGMFFPVAHGVHA 88
           +GLL NG+FE  P  +++K T ++   ++P+WE++GFVEYI  G + G M   V  G  A
Sbjct: 24  DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83

Query: 89  VRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVPPQNGSLPLQTLYSSDGGD 148
           VRLGNEASI Q I VKKGS Y++TF A+RTCAQDE L+V V P +  +P+QT+YSS G D
Sbjct: 84  VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143

Query: 149 VYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIRPVPTRDNLVRNPSFEVGP 208
           +Y++ F A SD   +  HNPGV+EDPACGPL+D VA++ L  P PT  N+++N  FE GP
Sbjct: 144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203

Query: 209 HRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHFNVPVGLAAVELVAGRESA 268
             L N ++GVL+PP   D  SPLPGW++ESLKAVK+IDS HF+VP G  AVELVAG+ESA
Sbjct: 204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263

Query: 269 VAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLKVPFQSQGKGLYKTAILKF 328
           VAQ++RTIP K Y L+F VGDA N C GSM+VEAFA KDT+KVP++S+GKG +K + L+F
Sbjct: 264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323

Query: 329 KAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVV 366
            A+S RTR+ F+S++Y  R DDF SLCGPV+DDV+++
Sbjct: 324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLL 360

BLAST of Cp4.1LG03g04670 vs. TAIR10
Match: AT5G25460.1 (AT5G25460.1 Protein of unknown function, DUF642)

HSP 1 Score: 409.5 bits (1051), Expect = 2.3e-114
Identity = 197/359 (54.87%), Positives = 265/359 (73.82%), Query Frame = 1

Query: 11  LLAFFLLLASSAFAGSTL----EGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFV 70
           +++FFLL  ++A A  +     +G+L NG+FE  P  +++K T I+ K ++P+WE+ GFV
Sbjct: 6   VVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWEVTGFV 65

Query: 71  EYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLS 130
           EYI  G + G M   V  G  AVRLGNEASI Q +KV KG  Y+LTF A+RTCAQDE L+
Sbjct: 66  EYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQDERLN 125

Query: 131 VLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIK 190
           + V P +G +P+QT+YSS G D+YA+ F A SD  +V  HNPGV+EDPACGPL+D VA++
Sbjct: 126 ISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLIDGVAMR 185

Query: 191 ELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFID 250
            L  P PT  N+++N  FE GP  L  ST GVL+PP  ED  SPLPGW++ESLKAVK++D
Sbjct: 186 SLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKAVKYVD 245

Query: 251 SKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAK 310
            +HF+VP G  A+ELVAG+ESA+AQ++RT+  K Y L+F VGDA N C GSM+VEAFA K
Sbjct: 246 VEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVEAFAGK 305

Query: 311 DTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVV 366
           DTLKVP++S+G G +K A ++F A+S R+RI F+S++Y  R+DDF SLCGPV+DDV+++
Sbjct: 306 DTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDDVKLI 364

BLAST of Cp4.1LG03g04670 vs. NCBI nr
Match: gi|449464002|ref|XP_004149718.1| (PREDICTED: uncharacterized protein LOC101216438 [Cucumis sativus])

HSP 1 Score: 681.8 bits (1758), Expect = 6.7e-193
Identity = 338/367 (92.10%), Positives = 350/367 (95.37%), Query Frame = 1

Query: 1   MASLSCSSLPLLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSW 60
           M S   SSLP L FFLLLASSA AG+ LEGLLANGNFEEPP +TNLKKTVI+GK+SLPSW
Sbjct: 1   MPSPPSSSLPFLTFFLLLASSALAGTILEGLLANGNFEEPPAQTNLKKTVIIGKNSLPSW 60

Query: 61  EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCA 120
           EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQII VKKGSLYALTFGASRTCA
Sbjct: 61  EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIINVKKGSLYALTFGASRTCA 120

Query: 121 QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLL 180
           QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYA+GFVA SD VKVTFHNPGVQEDPACGPLL
Sbjct: 121 QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAYGFVAQSDLVKVTFHNPGVQEDPACGPLL 180

Query: 181 DAVAIKELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLK 240
           DAVAIKEL RP+PTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQED+TSPLPGWIIESLK
Sbjct: 181 DAVAIKELARPLPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDVTSPLPGWIIESLK 240

Query: 241 AVKFIDSKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMV 300
           AVKFIDSKHFNVPVGLAA+ELVAGRESAVAQIIRTIPNK+YSLTFKVGDAKNGCHGSMMV
Sbjct: 241 AVKFIDSKHFNVPVGLAAIELVAGRESAVAQIIRTIPNKVYSLTFKVGDAKNGCHGSMMV 300

Query: 301 EAFAAKDTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLD 360
           EAFAAK+T+KVPFQSQGKGLYK AILKFKA S RTRITFFSSYYHTRTDDFGSLCGPVLD
Sbjct: 301 EAFAAKETVKVPFQSQGKGLYKNAILKFKATSRRTRITFFSSYYHTRTDDFGSLCGPVLD 360

Query: 361 DVRVVPT 368
           DVRV+ T
Sbjct: 361 DVRVIST 367

BLAST of Cp4.1LG03g04670 vs. NCBI nr
Match: gi|659123735|ref|XP_008461812.1| (PREDICTED: uncharacterized protein LOC103500323 [Cucumis melo])

HSP 1 Score: 662.1 bits (1707), Expect = 5.5e-187
Identity = 330/364 (90.66%), Positives = 343/364 (94.23%), Query Frame = 1

Query: 1   MASLSCSSLPLLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSW 60
           MAS   SSLP L FFLLLASSA AG+ LEGLLANGNFE PP KTNLKKTVI+GK+SLPSW
Sbjct: 1   MASPPFSSLPFLTFFLLLASSALAGTILEGLLANGNFEVPPAKTNLKKTVIIGKNSLPSW 60

Query: 61  EINGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCA 120
           EING VEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQII+VKKGSLYALTFGASRTCA
Sbjct: 61  EINGLVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIQVKKGSLYALTFGASRTCA 120

Query: 121 QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLL 180
           QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYA+GFVA SDSVKVTFHNPGVQEDPACGPLL
Sbjct: 121 QDEVLSVLVPPQNGSLPLQTLYSSDGGDVYAYGFVAPSDSVKVTFHNPGVQEDPACGPLL 180

Query: 181 DAVAIKELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLK 240
           DAVAIKEL+RP+PT  NLVRNPSFEVGPHRL NSTNGVLLPPRQED+TSPLPGWIIESLK
Sbjct: 181 DAVAIKELVRPLPTTVNLVRNPSFEVGPHRLGNSTNGVLLPPRQEDVTSPLPGWIIESLK 240

Query: 241 AVKFIDSKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMV 300
           AVKFIDSKHFNVP G AA+ELVAGRESAVAQIIRTIPNK+YSL FKVGDAKNGCHGSMMV
Sbjct: 241 AVKFIDSKHFNVPDGEAAIELVAGRESAVAQIIRTIPNKVYSLKFKVGDAKNGCHGSMMV 300

Query: 301 EAFAAKDTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLD 360
           EAFAAK+T+KVPFQS+GKGLYK AILKF A SPRTRITFFSSYYHTRTDDFGSLCGPVLD
Sbjct: 301 EAFAAKETVKVPFQSEGKGLYKDAILKFTATSPRTRITFFSSYYHTRTDDFGSLCGPVLD 360

Query: 361 DVRV 365
           DV V
Sbjct: 361 DVSV 364

BLAST of Cp4.1LG03g04670 vs. NCBI nr
Match: gi|702426478|ref|XP_010067858.1| (PREDICTED: uncharacterized protein LOC104454644 [Eucalyptus grandis])

HSP 1 Score: 590.1 bits (1520), Expect = 2.6e-165
Identity = 292/365 (80.00%), Positives = 321/365 (87.95%), Query Frame = 1

Query: 4   LSCSSLPLLAFFLLLASSAFAGST--LEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWE 63
           ++ +S  +LAF LL+A+SAFA      EGLL NGNFE+PP +T+LKKTVI GK+ LP WE
Sbjct: 1   MAVTSSAVLAFLLLIAASAFAAPPPPSEGLLPNGNFEDPPKRTDLKKTVIQGKNGLPKWE 60

Query: 64  INGFVEYISGGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQ 123
           ING VEYISGGPQPGGMFF VAHGVHAVRLGN+ASISQ I VK GSLYALTFGASRTCAQ
Sbjct: 61  INGLVEYISGGPQPGGMFFAVAHGVHAVRLGNDASISQSIPVKPGSLYALTFGASRTCAQ 120

Query: 124 DEVLSVLVPPQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLD 183
           DEVL V V PQ G LPLQTLYSS+GGD YA+GF A+S+  KV FHNPGVQEDPACGPLLD
Sbjct: 121 DEVLRVSVHPQTGDLPLQTLYSSNGGDTYAWGFRAASNVAKVVFHNPGVQEDPACGPLLD 180

Query: 184 AVAIKELIRPVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKA 243
           AVAIKEL  P PTRDNLV+N  FE GPHRL+NS+NGVLLPPRQEDLTSPLPGWIIESLKA
Sbjct: 181 AVAIKELFPPRPTRDNLVKNAGFEEGPHRLINSSNGVLLPPRQEDLTSPLPGWIIESLKA 240

Query: 244 VKFIDSKHFNVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVE 303
           VKFID KHFNVP GLAAVELVAGRESA+AQ+IRT+P+K Y+LTF VGDA+NGCHGSMMVE
Sbjct: 241 VKFIDKKHFNVPFGLAAVELVAGRESAIAQMIRTLPSKCYNLTFAVGDARNGCHGSMMVE 300

Query: 304 AFAAKDTLKVPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDD 363
           AFAAKDTLKVPFQSQGKG +KTA LKFKA+S RTR+TFFSS+YHTR DDFGSLCGPVLD+
Sbjct: 301 AFAAKDTLKVPFQSQGKGRFKTASLKFKALSARTRLTFFSSFYHTRIDDFGSLCGPVLDE 360

Query: 364 VRVVP 367
           VRVVP
Sbjct: 361 VRVVP 365

BLAST of Cp4.1LG03g04670 vs. NCBI nr
Match: gi|255553512|ref|XP_002517797.1| (PREDICTED: uncharacterized protein LOC8259300 [Ricinus communis])

HSP 1 Score: 584.7 bits (1506), Expect = 1.1e-163
Identity = 283/357 (79.27%), Positives = 309/357 (86.55%), Query Frame = 1

Query: 11  LLAFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYIS 70
           LLAFFLL   S  A + +EG L NGNFE+ P   ++ KTV+ GK++LP WE NG VEYIS
Sbjct: 8   LLAFFLLFNGSGLAATYMEGFLKNGNFEQKPKPRDINKTVLKGKNALPGWETNGLVEYIS 67

Query: 71  GGPQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVP 130
            GPQPGGM+F VAHGVHAVRLGNEASISQ + VK GSLYALTFGASRTCAQDEVL V VP
Sbjct: 68  AGPQPGGMYFAVAHGVHAVRLGNEASISQTLAVKAGSLYALTFGASRTCAQDEVLRVSVP 127

Query: 131 PQNGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIR 190
           P +G LPLQTLYSS+GGD YA+GF+A S+ VKVTFHNPGVQEDPACGPL+DAVAIKEL  
Sbjct: 128 PLSGDLPLQTLYSSNGGDTYAWGFIAKSNVVKVTFHNPGVQEDPACGPLVDAVAIKELFP 187

Query: 191 PVPTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF 250
           P PTRDNLV+NP FE GPHRLVN++NGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF
Sbjct: 188 PRPTRDNLVKNPGFEEGPHRLVNTSNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHF 247

Query: 251 NVPVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLK 310
           NVP GLAAVELVAGRESA+AQI+RTIPNK+Y LTF VGDAKNGCHGSMMVEAFAAKDT K
Sbjct: 248 NVPFGLAAVELVAGRESAIAQILRTIPNKVYDLTFSVGDAKNGCHGSMMVEAFAAKDTFK 307

Query: 311 VPFQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVVPT 368
           VPF+SQGKG +KT    FKA+S RTRITF+SSYYHTR DDFGSLCGPVLD VRV PT
Sbjct: 308 VPFESQGKGKFKTVSFNFKAVSARTRITFYSSYYHTRIDDFGSLCGPVLDQVRVFPT 364

BLAST of Cp4.1LG03g04670 vs. NCBI nr
Match: gi|729323613|ref|XP_010534242.1| (PREDICTED: uncharacterized protein LOC104809851 [Tarenaya hassleriana])

HSP 1 Score: 574.3 bits (1479), Expect = 1.5e-160
Identity = 275/354 (77.68%), Positives = 308/354 (87.01%), Query Frame = 1

Query: 13  AFFLLLASSAFAGSTLEGLLANGNFEEPPLKTNLKKTVIVGKDSLPSWEINGFVEYISGG 72
           A  L+L+ +A      EG LANGNFEE P K+++KKTV++GK++LP WEI G VEYISGG
Sbjct: 10  ALLLILSGAALGAPAYEGYLANGNFEEQPKKSDMKKTVLIGKNALPKWEITGMVEYISGG 69

Query: 73  PQPGGMFFPVAHGVHAVRLGNEASISQIIKVKKGSLYALTFGASRTCAQDEVLSVLVPPQ 132
           PQPGGMFFPVAHGVHAVRLGNEASISQ I+VK GSLYALTFGASRTCAQDEVL V V PQ
Sbjct: 70  PQPGGMFFPVAHGVHAVRLGNEASISQKIQVKPGSLYALTFGASRTCAQDEVLRVSVLPQ 129

Query: 133 NGSLPLQTLYSSDGGDVYAFGFVASSDSVKVTFHNPGVQEDPACGPLLDAVAIKELIRPV 192
           +G LPLQTLYSS GGDVYA+GF+  +  V VTFHNPGVQEDPACGPLLDAVAIK+L  P+
Sbjct: 130 SGDLPLQTLYSSFGGDVYAWGFIPKTQEVTVTFHNPGVQEDPACGPLLDAVAIKKLAFPL 189

Query: 193 PTRDNLVRNPSFEVGPHRLVNSTNGVLLPPRQEDLTSPLPGWIIESLKAVKFIDSKHFNV 252
           PTRDNLVRNP FE GPHRLVNST GVLLPP+QED TSPLPGWIIESLKAVKFIDSKHFNV
Sbjct: 190 PTRDNLVRNPGFEEGPHRLVNSTGGVLLPPKQEDSTSPLPGWIIESLKAVKFIDSKHFNV 249

Query: 253 PVGLAAVELVAGRESAVAQIIRTIPNKLYSLTFKVGDAKNGCHGSMMVEAFAAKDTLKVP 312
           P G AA+ELVAG+ESA+AQ+IRT+PNK+YSL+F VGDA+NGCHGSM+VEAFA KDTLKVP
Sbjct: 250 PYGEAAIELVAGKESAIAQVIRTVPNKVYSLSFAVGDARNGCHGSMVVEAFAGKDTLKVP 309

Query: 313 FQSQGKGLYKTAILKFKAISPRTRITFFSSYYHTRTDDFGSLCGPVLDDVRVVP 367
             S G G YKTA LKFKA+  RTRITFFS+YYHT+ DDFGSLCGP++DD+RVVP
Sbjct: 310 HNSVGAGHYKTASLKFKALEARTRITFFSTYYHTKKDDFGSLCGPIIDDIRVVP 363

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name          E-value     Identity (%)   Description
A0A0A0L9H0_CUCSA    4.7e-193    92.10          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G689790 PE=4 SV=1
A0A059BJN3_EUCGR    1.8e-165    80.00          Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G03366 PE=4 SV=1
B9RVM8_RICCO        7.7e-164    79.27          Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0965260 PE=4 SV=1
M5X0A4_PRUPE        2.3e-160    77.65          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007461mg PE=4 SV=1
A0A061DVE3_THECC    5.7e-159    78.67          F17A17.37 protein OS=Theobroma cacao GN=TCM_005853 PE=4 SV=1

vs. TAIR10
Match Name          E-value     Identity (%)   Description
AT3G08030.1         2.0e-150    72.60          Protein of unknown function, DUF642
AT2G41800.1         1.4e-127    64.09          Protein of unknown function, DUF642
AT2G41810.1         1.3e-125    62.91          Protein of unknown function, DUF642
AT4G32460.1         5.9e-115    57.27          Protein of unknown function, DUF642
AT5G25460.1         2.3e-114    54.87          Protein of unknown function, DUF642

vs. NCBI nr
Match Name                          E-value     Identity (%)   Description
gi|449464002|ref|XP_004149718.1|    6.7e-193    92.10          PREDICTED: uncharacterized protein LOC101216438 [Cucumis sativus]
gi|659123735|ref|XP_008461812.1|    5.5e-187    90.66          PREDICTED: uncharacterized protein LOC103500323 [Cucumis melo]
gi|702426478|ref|XP_010067858.1|    2.6e-165    80.00          PREDICTED: uncharacterized protein LOC104454644 [Eucalyptus grandis]
gi|255553512|ref|XP_002517797.1|    1.1e-163    79.27          PREDICTED: uncharacterized protein LOC8259300 [Ricinus communis]
gi|729323613|ref|XP_010534242.1|    1.5e-160    77.68          PREDICTED: uncharacterized protein LOC104809851 [Tarenaya hassleriana]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR008979   Galactose-bd-like_sf
IPR006946   DUF642
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0015995       chlorophyll biosynthetic process
biological_process   GO:0009740       gibberellic acid mediated signaling pathway
biological_process   GO:0010162       seed dormancy process
cellular_component   GO:0005618       cell wall
cellular_component   GO:0005886       plasma membrane
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG03g04670.1   Cp4.1LG03g04670.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                     Source    Source Term     Source Description              Alignment
IPR006946   Domain of unknown function DUF642   PFAM      PF04862         DUF642                          coord: 197..364, score: 2.9E-14; coord: 30..186, score: 1.4
IPR008979   Galactose-binding domain-like       unknown   SSF49785        Galactose-binding domain-like   coord: 31..187, score: 4.2
None        No IPR available                    PANTHER   PTHR31265       FAMILY NOT NAMED                coord: 1..366, score: 4.3E
None        No IPR available                    PANTHER   PTHR31265:SF2   F17A17.37 PROTEIN               coord: 1..366, score: 4.3E

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG03g04670   Cp4.1LG02g15600   Cucurbita pepo (Zucchini)   cpecpeB449