Cp4.1LG17g00210 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG17g00210
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Seed maturation protein PM36
Location: Cp4.1LG17 : 388078 .. 390993 (-)
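The location field can be read programmatically. The following is a minimal Python sketch, not part of the database record: the names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record's location field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG17 : 388078 .. 390993 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG17 : 388078 .. 390993 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that assumption the gene spans 2916 bp on the minus strand of Cp4.1LG17.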
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACAGTACGCATGTAAACCGTGTCAATCTGGATAATGAATGAGACATCAATGATTCGATGGCAACTCCAAACAAAACAAATATTTAAAATAAAAAAATATTGGTTCAATGCCACGAAGAATTCGACGGTGTATGAACGTTGTAACCAAATTACACGCGTACAGTTTCACCCACGCTTTATCTATCCCACTGAGTTGCAAATTTGCAAACTGAGCTTAGAAGCAAGGAACAAAAATGGCGGACCCGAAAACCAGAGCACAACTCGCCGGAGGAATGACCGCCACTGACTCATGGATCAGAAAGCACCGTCTCATCTACACCGACGCCACACGCCATCCTCTCGTCCTCTCCATTCGCGACGGCACCGTCGATCTCAACGCCTTCAGAACTTGGGTGGTAAGTTCGTTTACGATAATCCACAAAATAATTAATTAGGTCTTCGAAAACGTCCGATTTCTCGACGTGATTAGTGTTCGAGTTCAAATCCTTCAGGAACAGGAATGCGAATTTCTGAGATCCTTCACGGCGTTCGTCGCAAGTGTGTTGGTTAAAGCATGGAAAGAATCGGACGATAGAGCGGACGAGGAAGTGATCCTTGGTAGTTTGGCTTCTCTCAACGACGAATTCGCGTGGTTCAAAAAAGAAGCCCTAAAACGAGATATCGACTTGACTAAAATTGTTCCTCAGAACGCAACCGCCGGCTATTCCAGGTACGTATCGAGTTCTGCATCAAATTTTCTTGAGAGAGAATATCCAGTGAATGTTCGGTTTAGGTTTCTGGAGAGTTTGATGCGGCCGGAAATGGAGTACACGGTGGCGATTACGGCTCTTTGGGCTATCGAAGCTGTGTACCACGAGAGCTTTGCGTACTGCATGGAAGATGGATCAAAAACGCCATTGGAATTGAGAGAAGCCTGCGAAAGATGGGGGAACGAAGGATTTGGAAACTACTGCAATACCTTGAAGAAGATTGTGGATAGACGATTGGAGATGGCTGCTGGCGAAATAAGTAAAAAAACAGAAGTGGCTTTGTTGCGGGTTCTTGAATGTGAGGTTGCGTTTTGGAATATGAGCCGGGATGGTCCACGACAAACTATTTGAGCTATGAATTAGCATCAAAAGAATTCCCACGTTTGTTCTTGAACTAGAAACAAGAAACTAAATGAACTCAAAATCTCTAGAATGACCAGAGTTGCATATATATGGAGCTTTGTAATTTCATTTCAATCAGATGGATTACGGTTATCCTAGCGCCTTCTGCTGTGCCTTCGGCTCTTCCTCCGACTACGTTCATCATCGGAATCAGATTCTGATTCAGAACCGGAGTCAGAATCAGAAGATTCAGAAGATGAACTGTACCTCCTCTTTCTTCTCTCCCTTTTCTGCTTCTTTCTCCTTCTCCGACTCCTCTCTGATTCAGATTCAGATGAAGAACTGTAATCTGAACTGCTTCCAGAAACTGATGATGACCCACTTTCAGTCTCTAAAGCAGAGGCCTCACTATCGCTACTGGACCGCCCATGCTTTCTTTTGCTTTTCTTCTTCGAATGCTTCTCAGTCTTCTTCTTCTCGTTCTTCATATCAGAATCCGGGTTGTCCAGATCATAAGAGGCTGCCATCTTCATCCTCAACTTGGGATTCTTAAGTTGTTGAGTTCGAGATGGCCGAGATATGTAAACTCGTTCGTTTTTGCATTCATACGTCCAATGCCCCACCTGATAACATTTCTGGCACTGTGCAGCAGCACCTCCACCAGCGCAAGACATAGAGCCCTTAAGGTCCTTTCTTTCACCCAATCTGACTGCTTTTTCAGTACTCATCAAATACATCTGCCTCTTTGCTTCCCTTTTCTCCTGCCATCTGCTAGGCCCTTCTTCCTTCTGCCCATAAGCATTAACATTTCGAGCAGCAGCCGCAGCTGCTCGTTCAGCCTGAGCACGACTGAGCCCCTTTGCTGCACTAAGAGCAGCGGCCTTAATCCTTTCAGCAGCAGCTTGCG
ACTTCTCTTCCTTCTTACTAGACATGCTTTTGTTTCAATATCTGAATCTAATAAACTGGCACAGAACAAAAGGAAAATGAAGAATAAGAATGCCAAATCAAGAATCAATGAACACTACCAAAAGAGAAGACGGCACAAAAACGAAGTGGGAAGCATTAGGTTTCCTTCTTATAAGACACGCTCTAGCTTCTTCCACATTTCAACAGCTCTCCAAACTAGGATCAGATAGACCTTACAAAAAAACGAAGGAAGAACAAAATATAGAGCAAGTCTTAGGGTTTCTACCAGTACCGTAAAAGGCGAAAAAATAGAAGAAAATCGGATCAAATGCTAGCCCAAAGACCATAATCCCAGAACACTCCATTGTTTCCGATTGATGAGCCGCAATCGAAATAGAACGCCGGCTTTATATCGATCTGCTTCGTAAACCAAATGGGAAAGTCGAAAGATCGGGACAATACGCGAAGCTTGCAGCTACTAAACCGCAAAAATAAGTTGAAAGAAGAGGAATCGCAGTTGGAACATACCTCGGGGATCCAAGATGAATTGGAATCCGTGAAGAAGATCGCAATGAGAAGGCGATGTCGAATGAAACGCGAAGGCCAAGCGGAAGAAATACTGGGGATATCGGGAGAGGATGCGCAGTAGTCGGAACGGAATGGATCGAGGCTGTGTGAGTTCTTTATAGACCGGGGGGGTTTGATTTCCGTGTTTTGTGAATAGATCAAAATACCGACTCCCTCCATATTTAAGGCCTCAAATTTCGTTGAAGCTAATATCTTATTTCTTTTAGGTCTTTGAACCATTTTGATTTTATTTTATTTATTATTATTATTATTATTATTATTATTTTTTGTATATGAAAATATCAAATAAATTTATAAACATTCAACGAATTTTTAAAATGAAATAATTT

mRNA sequence

ATGACAAAGCAAGGAACAAAAATGGCGGACCCGAAAACCAGAGCACAACTCGCCGGAGGAATGACCGCCACTGACTCATGGATCAGAAAGCACCGTCTCATCTACACCGACGCCACACGCCATCCTCTCGTCCTCTCCATTCGCGACGGCACCGTCGATCTCAACGCCTTCAGAACTTGGGTGGAACAGGAATGCGAATTTCTGAGATCCTTCACGGCGTTCGTCGCAAGTGTGTTGGTTAAAGCATGGAAAGAATCGGACGATAGAGCGGACGAGGAAGTGATCCTTGGTAGTTTGGCTTCTCTCAACGACGAATTCGCGTGGTTCAAAAAAGAAGCCCTAAAACGAGATATCGACTTGACTAAAATTGTTCCTCAGAACGCAACCGCCGGCTATTCCAGGTTTCTGGAGAGTTTGATGCGGCCGGAAATGGAGTACACGGTGGCGATTACGGCTCTTTGGGCTATCGAAGCTGTGTACCACGAGAGCTTTGCGTACTGCATGGAAGATGGATCAAAAACGCCATTGGAATTGAGAGAAGCCTGCGAAAGATGGGGGAACGAAGGATTTGGAAACTACTGCAATACCTTGAAGAAGATTGTGGATAGACGATTGGAGATGGCTGCTGGCGAAATAAGTAAAAAAACAGAAGTGGCTTTGTTGCGGGTTCTTGAATGTGAGGTTGCGTTTTGGAATATGAGCCGGGATGGTCCACGACAAACTATTTGAGCTATGAATTAGCATCAAAAGAATTCCCACGTTTGTTCTTGAACTAGAAACAAGAAACTAAATGAACTCAAAATCTCTAGAATGACCAGAGTTGCATATATATGGAGCTTTGTAATTTCATTTCAATCAGATGGATTACGGTTATCCTAGCGCCTTCTGCTGTGCCTTCGGCTCTTCCTCCGACTACGTTCATCATCGGAATCAGATTCTGATTCAGAACCGGAGTCAGAATCAGAAGATTCAGAAGATGAACTGTACCTCCTCTTTCTTCTCTCCCTTTTCTGCTTCTTTCTCCTTCTCCGACTCCTCTCTGATTCAGATTCAGATGAAGAACTGTAATCTGAACTGCTTCCAGAAACTGATGATGACCCACTTTCAGTCTCTAAAGCAGAGGCCTCACTATCGCTACTGGACCGCCCATGCTTTCTTTTGCTTTTCTTCTTCGAATGCTTCTCAGTCTTCTTCTTCTCGTTCTTCATATCAGAATCCGGGTTGTCCAGATCATAAGAGGCTGCCATCTTCATCCTCAACTTGGGATTCTTAAGTTGTTGAGTTCGAGATGGCCGAGATATGTAAACTCGTTCGTTTTTGCATTCATACGTCCAATGCCCCACCTGATAACATTTCTGGCACTGTGCAGCAGCACCTCCACCAGCGCAAGACATAGAGCCCTTAAGGTCCTTTCTTTCACCCAATCTGACTGCTTTTTCAGTACTCATCAAATACATCTGCCTCTTTGCTTCCCTTTTCTCCTGCCATCTGCTAGGCCCTTCTTCCTTCTGCCCATAAGCATTAACATTTCGAGCAGCAGCCGCAGCTGCTCGTTCAGCCTGAGCACGACTGAGCCCCTTTGCTGCACTAAGAGCAGCGGCCTTAATCCTTTCAGCAGCAGCTTGCGACTTCTCTTCCTTCTTACTAGACATGCTTTTGTTTCAATATCTGAATCTAATAAACTGGCACAGAACAAAAGGAAAATGAAGAATAAGAATGCCAAATCAAGAATCAATGAACACTACCAAAAGAGAAGACGGCACAAAAACGAAGTGGGAAGCATTAGGTTTCCTTCTTATAAGACACGCTCTAGCTTCTTCCACATTTCAACAGCTCTCCAAACTAGGATCAGATAGACCTTACAAAAAAACGAAGGAAGAACAAAATATAGAGCAAGTCTTAGGGTTTCTACCAGTACCGTAAAAGGCGAAAAAATAGAAGAAAATCGGATCAAATGCTAGCCCAAAGACCATAATCCCAGAACACTCCATTGTTTCCGA
TTGATGAGCCGCAATCGAAATAGAACGCCGGCTTTATATCGATCTGCTTCGTAAACCAAATGGGAAAGTCGAAAGATCGGGACAATACGCGAAGCTTGCAGCTACTAAACCGCAAAAATAAGTTGAAAGAAGAGGAATCGCAGTTGGAACATACCTCGGGGATCCAAGATGAATTGGAATCCGTGAAGAAGATCGCAATGAGAAGGCGATGTCGAATGAAACGCGAAGGCCAAGCGGAAGAAATACTGGGGATATCGGGAGAGGATGCGCAGTAGTCGGAACGGAATGGATCGAGGCTGTGTGAGTTCTTTATAGACCGGGGGGGTTTGATTTCCGTGTTTTGTGAATAGATCAAAATACCGACTCCCTCCATATTTAAGGCCTCAAATTTCGTTGAAGCTAATATCTTATTTCTTTTAGGTCTTTGAACCATTTTGATTTTATTTTATTTATTATTATTATTATTATTATTATTATTTTTTGTATATGAAAATATCAAATAAATTTATAAACATTCAACGAATTTTTAAAATGAAATAATTT

Coding sequence (CDS)

ATGACAAAGCAAGGAACAAAAATGGCGGACCCGAAAACCAGAGCACAACTCGCCGGAGGAATGACCGCCACTGACTCATGGATCAGAAAGCACCGTCTCATCTACACCGACGCCACACGCCATCCTCTCGTCCTCTCCATTCGCGACGGCACCGTCGATCTCAACGCCTTCAGAACTTGGGTGGAACAGGAATGCGAATTTCTGAGATCCTTCACGGCGTTCGTCGCAAGTGTGTTGGTTAAAGCATGGAAAGAATCGGACGATAGAGCGGACGAGGAAGTGATCCTTGGTAGTTTGGCTTCTCTCAACGACGAATTCGCGTGGTTCAAAAAAGAAGCCCTAAAACGAGATATCGACTTGACTAAAATTGTTCCTCAGAACGCAACCGCCGGCTATTCCAGGTTTCTGGAGAGTTTGATGCGGCCGGAAATGGAGTACACGGTGGCGATTACGGCTCTTTGGGCTATCGAAGCTGTGTACCACGAGAGCTTTGCGTACTGCATGGAAGATGGATCAAAAACGCCATTGGAATTGAGAGAAGCCTGCGAAAGATGGGGGAACGAAGGATTTGGAAACTACTGCAATACCTTGAAGAAGATTGTGGATAGACGATTGGAGATGGCTGCTGGCGAAATAAGTAAAAAAACAGAAGTGGCTTTGTTGCGGGTTCTTGAATGTGAGGTTGCGTTTTGGAATATGAGCCGGGATGGTCCACGACAAACTATTTGA

Protein sequence

MTKQGTKMADPKTRAQLAGGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSRDGPRQTI
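The protein sequence corresponds to a translation of the CDS. As a hedged illustration, the Python sketch below translates only the first 13 codons and the last 7 codons of the CDS listed above, using the standard genetic code. The helper names (`CODON_TABLE`, `translate`) are hypothetical, and the use of the standard code is an assumption that is consistent with the listed protein but is not stated in the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts copied from the CDS above: its first 39 nt and its last 21 nt.
cds_start = "ATGACAAAGCAAGGAACAAAAATGGCGGACCCGAAAACC"
cds_end = "GGTCCACGACAAACTATTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to `MTKQGTKMADPKT` and `GPRQTI`, matching the start and end of the protein sequence above. Passing the full CDS string to `translate` would be the natural way to check the whole protein.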
BLAST of Cp4.1LG17g00210 vs. Swiss-Prot
Match: TENAE_SOYBN (Probable bifunctional TENA-E protein OS=Glycine max GN=TENA_E PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 9.2e-74
Identity = 130/212 (61.32%), Positives = 160/212 (75.47%), Query Frame = 1

Query: 24  TDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAW 83
           T++W++KHRL+Y  ATRHPL++SIRDGT++  +F+TW+ Q+  F+R+F  FVASVL+KAW
Sbjct: 15  TETWLKKHRLLYNGATRHPLIISIRDGTINTASFKTWLAQDYLFVRAFVPFVASVLIKAW 74

Query: 84  KESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPE 143
           KESD   D EVILG +ASL DE +WFK EA K  I L+ +VPQ A   Y   LESLM P+
Sbjct: 75  KESDCSGDMEVILGGMASLEDEISWFKTEANKWGISLSDVVPQQANKNYCGLLESLMSPD 134

Query: 144 MEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDR 203
            EYTVAITA WAIE VY ESFA+C+E+GSKTP EL+E C RWGNE FG YC +L+ I +R
Sbjct: 135 AEYTVAITAFWAIETVYQESFAHCIEEGSKTPPELKETCVRWGNEAFGKYCQSLQNIANR 194

Query: 204 RLEMAAGEISKKTEVALLRVLECEVAFWNMSR 236
            L+ A+ E  KK EV LL VLE EV FWNMSR
Sbjct: 195 CLQKASDEELKKAEVMLLSVLEHEVEFWNMSR 226

BLAST of Cp4.1LG17g00210 vs. Swiss-Prot
Match: TENAE_ARATH (Bifunctional TENA-E protein OS=Arabidopsis thaliana GN=TENA_E PE=1 SV=3)

HSP 1 Score: 272.7 bits (696), Expect = 3.9e-72
Identity = 126/213 (59.15%), Positives = 161/213 (75.59%), Query Frame = 1

Query: 25  DSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAWK 84
           D+WI KHR IYT ATRH  V+SIRDG+VDL++FRTW+ Q+  F+R F  FVASVL++A K
Sbjct: 8   DTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIRACK 67

Query: 85  ESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPEM 144
           +S + +D EV+LG +ASLNDE  WFK+E  K D+D + +VPQ A   Y RFLE LM  E+
Sbjct: 68  DSGESSDMEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLMSSEV 127

Query: 145 EYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDRR 204
           +Y V +TA WAIEAVY ESFA+C+EDG+KTP+EL  AC RWGN+GF  YC+++K I +R 
Sbjct: 128 KYPVIMTAFWAIEAVYQESFAHCLEDGNKTPVELTGACHRWGNDGFKQYCSSVKNIAERC 187

Query: 205 LEMAAGEISKKTEVALLRVLECEVAFWNMSRDG 238
           LE A+GE+  + E  L+RVLE EVAFW MSR G
Sbjct: 188 LENASGEVLGEAEDVLVRVLELEVAFWEMSRGG 220

BLAST of Cp4.1LG17g00210 vs. Swiss-Prot
Match: TENA2_MAIZE (Bifunctional TENA2 protein OS=Zea mays GN=TENA2 PE=1 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 7.5e-60
Identity = 105/211 (49.76%), Positives = 143/211 (67.77%), Query Frame = 1

Query: 24  TDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAW 83
           T +W+ KHR +Y  ATRHP  +SIRDGTVD++AF+ W+ Q+  F+R F AF+ASVL+K  
Sbjct: 11  TAAWMEKHRHMYERATRHPFTVSIRDGTVDMSAFKRWLSQDYLFVREFVAFIASVLLKCC 70

Query: 84  KESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPE 143
           K+ +D +D E+ILG +AS++DE +WFK EA    +DL  + P  A   Y RFL S   PE
Sbjct: 71  KQ-EDSSDMEIILGGVASISDEISWFKNEATVWGVDLASVSPLKANLEYHRFLRSFTEPE 130

Query: 144 MEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDR 203
           + Y VA+T  W IE VY +SF +C++DG+KTP EL   C+RWG+ GF  YC +L+ IVDR
Sbjct: 131 ISYAVAVTTFWTIETVYQDSFGFCIQDGNKTPPELLGTCQRWGSAGFRQYCQSLQSIVDR 190

Query: 204 RLEMAAGEISKKTEVALLRVLECEVAFWNMS 235
            L  A  +  +  E A +RVLE E+ FW+MS
Sbjct: 191 CLANAPADAVQSAEEAFVRVLELEIGFWDMS 220

BLAST of Cp4.1LG17g00210 vs. TrEMBL
Match: A0A0A0KEX4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G307400 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 2.2e-106
Identity = 184/226 (81.42%), Positives = 207/226 (91.59%), Query Frame = 1

Query: 8   MADPKTRAQLAGGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEF 67
           MADPK RAQLAG MTAT+SW+RKHRLIYT ATRHP +L+IRDGT+DL+AF+TW+EQ+  F
Sbjct: 1   MADPKARAQLAGAMTATESWLRKHRLIYTGATRHPFILTIRDGTIDLSAFKTWLEQDFGF 60

Query: 68  LRSFTAFVASVLVKAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQN 127
           LRSF AFV SVLVKAWKESDDRADEEVIL  LA+LNDEFAWFKKE+LKRDI+L+++VPQN
Sbjct: 61  LRSFAAFVGSVLVKAWKESDDRADEEVILACLAALNDEFAWFKKESLKRDINLSEVVPQN 120

Query: 128 ATAGYSRFLESLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGN 187
           ATAGYSRFLESLMRPE+EYTVAITALW IEAVYHESFA+C+E+G+KTPLELREACERWGN
Sbjct: 121 ATAGYSRFLESLMRPEVEYTVAITALWLIEAVYHESFAHCLEEGTKTPLELREACERWGN 180

Query: 188 EGFGNYCNTLKKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNM 234
           EGFG+YCNTLKKI DRRLEM + E+SKK EV  LRVLE EV FWNM
Sbjct: 181 EGFGSYCNTLKKIADRRLEMGSEEVSKKAEVGFLRVLEYEVEFWNM 226

BLAST of Cp4.1LG17g00210 vs. TrEMBL
Match: A0A061EPD9_THECC (Heme oxygenase-like, multi-helical OS=Theobroma cacao GN=TCM_021348 PE=4 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 5.8e-75
Identity = 136/220 (61.82%), Positives = 168/220 (76.36%), Query Frame = 1

Query: 19  GGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASV 78
           G    T++W+RKHRL+Y  ATRHP + SIRDG +DL++F+TW+ Q+  F+R+F  FVASV
Sbjct: 10  GKTLMTETWLRKHRLLYVGATRHPFIRSIRDGNIDLSSFKTWLGQDYVFVRAFVPFVASV 69

Query: 79  LVKAWKESDDRA-DEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLE 138
           L KA K SD+ + D EV+LG +A+LNDE AWFKKEA K  + L+ IVPQ A   Y RFLE
Sbjct: 70  LSKACKGSDNSSNDVEVMLGGMAALNDEIAWFKKEASKWGVQLSDIVPQKANQNYCRFLE 129

Query: 139 SLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTL 198
           SLM PE+EYTVAITA WAIEAVY ESFA+C+EDG+K P EL+E C+RWGNEGFG YCN L
Sbjct: 130 SLMSPEVEYTVAITAFWAIEAVYQESFAHCLEDGTKPPPELQETCQRWGNEGFGQYCNAL 189

Query: 199 KKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSRDG 238
           +KI DR+LE A+ ++  K EV  LRVLE EV FWN+S  G
Sbjct: 190 RKIADRQLEKASDDVITKAEVTFLRVLEHEVDFWNISHGG 229

BLAST of Cp4.1LG17g00210 vs. TrEMBL
Match: A0A151ST20_CAJCA (Seed maturation protein PM36 OS=Cajanus cajan GN=KK1_004174 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 6.4e-74
Identity = 131/215 (60.93%), Positives = 164/215 (76.28%), Query Frame = 1

Query: 21  MTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLV 80
           M   + W+RKHR +Y  ATRHPL+LSIRDGT++++ F+TW+ Q+  F+R+F  F AS+L+
Sbjct: 12  MGMIEVWLRKHRSLYNGATRHPLILSIRDGTINIDCFKTWLAQDYLFVRAFVPFAASLLI 71

Query: 81  KAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLM 140
           KAWKESD+  D EVILG +ASL DE +WFK+EA K  + ++ +VPQ A   Y   LESLM
Sbjct: 72  KAWKESDESGDVEVILGGMASLEDEISWFKREANKWGVSVSDVVPQRANVKYCGLLESLM 131

Query: 141 RPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKI 200
            PE+EYTVAITA WAIEAVY ESFA+C+E+GSKTP EL+E C RWGNE FGNYC +L+KI
Sbjct: 132 SPEVEYTVAITAFWAIEAVYQESFAHCIEEGSKTPPELKETCARWGNEAFGNYCQSLQKI 191

Query: 201 VDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSR 236
            +RRL+ A+ E  KK EV  L VLE EV FWNMSR
Sbjct: 192 ANRRLQKASDEELKKAEVTFLSVLENEVEFWNMSR 226

BLAST of Cp4.1LG17g00210 vs. TrEMBL
Match: A0A068TRX7_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00025696001 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 6.4e-74
Identity = 128/215 (59.53%), Positives = 166/215 (77.21%), Query Frame = 1

Query: 21  MTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLV 80
           ++  ++W+RKHRL+Y  ATRHP + SIRDG+VD+++F+ W+EQ+  F+R+F  FVA+VL+
Sbjct: 14  VSVIETWLRKHRLLYVGATRHPFIHSIRDGSVDVSSFKRWLEQDYIFVRAFVPFVATVLL 73

Query: 81  KAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLM 140
           KAWKES D  D +VIL  +ASLNDEFAWF KEA K  + LT + PQ A   Y RFLESLM
Sbjct: 74  KAWKESFDATDLDVILSGMASLNDEFAWFNKEASKWGVSLTNVAPQKANLDYCRFLESLM 133

Query: 141 RPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKI 200
             E+EYTVAITA WAIE VY +SFA+C+EDGS TP ELR+ C+RWGN+GFG YC+ L+ I
Sbjct: 134 SSEVEYTVAITAFWAIETVYQDSFAHCLEDGSNTPEELRDTCQRWGNDGFGRYCHALQSI 193

Query: 201 VDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSR 236
            +  LE A+ ++ K+TE A+L VLE EVAFWNMS+
Sbjct: 194 AEHHLEKASDDVRKRTEAAVLDVLEYEVAFWNMSQ 228

BLAST of Cp4.1LG17g00210 vs. TrEMBL
Match: A0A022RZK6_ERYGU (Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a024518mg PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 8.3e-74
Identity = 127/212 (59.91%), Positives = 166/212 (78.30%), Query Frame = 1

Query: 24  TDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAW 83
           T +W++KHRL+Y  ATRHP +L IRDG+VD+++F+ W+ Q+  F+R+F  FVASVL+KAW
Sbjct: 10  TVTWLKKHRLLYVGATRHPFILGIRDGSVDISSFKRWLGQDYIFVRAFVPFVASVLLKAW 69

Query: 84  KESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPE 143
           KESDD+AD +VILG +++LNDE +WFK EA K  + L  +VPQ A   Y RFLE LM PE
Sbjct: 70  KESDDKADVDVILGGISALNDEVSWFKNEASKWAVALDSVVPQQANLNYCRFLERLMSPE 129

Query: 144 MEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDR 203
           ++YT AITA WAIE VY ESFA+C+E+GS+TP EL+E C+RWGN+GFG YC++L+ I +R
Sbjct: 130 VDYTQAITAFWAIETVYRESFAHCLEEGSRTPQELKETCQRWGNDGFGQYCSSLQSIANR 189

Query: 204 RLEMAAGEISKKTEVALLRVLECEVAFWNMSR 236
           RLE A+ E+  K EV LL +LE EV FWNMSR
Sbjct: 190 RLEKASDEVVAKAEVNLLEILEHEVEFWNMSR 221

BLAST of Cp4.1LG17g00210 vs. TAIR10
Match: AT3G16990.1 (AT3G16990.1 Haem oxygenase-like, multi-helical)

HSP 1 Score: 272.7 bits (696), Expect = 2.2e-73
Identity = 126/213 (59.15%), Positives = 161/213 (75.59%), Query Frame = 1

Query: 25  DSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAWK 84
           D+WI KHR IYT ATRH  V+SIRDG+VDL++FRTW+ Q+  F+R F  FVASVL++A K
Sbjct: 8   DTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIRACK 67

Query: 85  ESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPEM 144
           +S + +D EV+LG +ASLNDE  WFK+E  K D+D + +VPQ A   Y RFLE LM  E+
Sbjct: 68  DSGESSDMEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLMSSEV 127

Query: 145 EYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDRR 204
           +Y V +TA WAIEAVY ESFA+C+EDG+KTP+EL  AC RWGN+GF  YC+++K I +R 
Sbjct: 128 KYPVIMTAFWAIEAVYQESFAHCLEDGNKTPVELTGACHRWGNDGFKQYCSSVKNIAERC 187

Query: 205 LEMAAGEISKKTEVALLRVLECEVAFWNMSRDG 238
           LE A+GE+  + E  L+RVLE EVAFW MSR G
Sbjct: 188 LENASGEVLGEAEDVLVRVLELEVAFWEMSRGG 220

BLAST of Cp4.1LG17g00210 vs. NCBI nr
Match: gi|659116976|ref|XP_008458358.1| (PREDICTED: seed maturation protein PM36 isoform X1 [Cucumis melo])

HSP 1 Score: 407.5 bits (1046), Expect = 1.6e-110
Identity = 193/234 (82.48%), Positives = 213/234 (91.03%), Query Frame = 1

Query: 8   MADPKTRAQLAGGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEF 67
           MAD K RAQLAG MTATDSW+RKHRLIYT ATRHP +L+IRDGTVDL+AF+TW+EQECEF
Sbjct: 1   MADTKARAQLAGAMTATDSWLRKHRLIYTGATRHPFILTIRDGTVDLSAFKTWLEQECEF 60

Query: 68  LRSFTAFVASVLVKAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQN 127
           LRSF AFV SVLVKAWKESDDRADEEVILGSLA+LNDEFAWFKKEALKRDI+L++IVPQ 
Sbjct: 61  LRSFAAFVGSVLVKAWKESDDRADEEVILGSLAALNDEFAWFKKEALKRDINLSEIVPQK 120

Query: 128 ATAGYSRFLESLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGN 187
           ATAGYSRFLESLMRPE+EYTVAITALWAIEAVYHESFAYC+E+G+KTPLELREACERWG+
Sbjct: 121 ATAGYSRFLESLMRPEVEYTVAITALWAIEAVYHESFAYCLEEGTKTPLELREACERWGS 180

Query: 188 EGFGNYCNTLKKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSRDGPRQT 242
           EGF  YC+TLKKI DRRLEM +GE++KK EV LLRVLE EV FWNM R    +T
Sbjct: 181 EGFDKYCSTLKKIADRRLEMGSGEVNKKAEVGLLRVLEYEVGFWNMIRPPSHRT 234

BLAST of Cp4.1LG17g00210 vs. NCBI nr
Match: gi|659116978|ref|XP_008458359.1| (PREDICTED: seed maturation protein PM36 isoform X2 [Cucumis melo])

HSP 1 Score: 398.3 bits (1022), Expect = 9.7e-108
Identity = 191/234 (81.62%), Positives = 211/234 (90.17%), Query Frame = 1

Query: 8   MADPKTRAQLAGGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEF 67
           MAD K RAQLAG MTATDSW+RKHRLIYT ATRHP +L+IRDGTVDL+AF+TW+E  CEF
Sbjct: 1   MADTKARAQLAGAMTATDSWLRKHRLIYTGATRHPFILTIRDGTVDLSAFKTWLE--CEF 60

Query: 68  LRSFTAFVASVLVKAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQN 127
           LRSF AFV SVLVKAWKESDDRADEEVILGSLA+LNDEFAWFKKEALKRDI+L++IVPQ 
Sbjct: 61  LRSFAAFVGSVLVKAWKESDDRADEEVILGSLAALNDEFAWFKKEALKRDINLSEIVPQK 120

Query: 128 ATAGYSRFLESLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGN 187
           ATAGYSRFLESLMRPE+EYTVAITALWAIEAVYHESFAYC+E+G+KTPLELREACERWG+
Sbjct: 121 ATAGYSRFLESLMRPEVEYTVAITALWAIEAVYHESFAYCLEEGTKTPLELREACERWGS 180

Query: 188 EGFGNYCNTLKKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSRDGPRQT 242
           EGF  YC+TLKKI DRRLEM +GE++KK EV LLRVLE EV FWNM R    +T
Sbjct: 181 EGFDKYCSTLKKIADRRLEMGSGEVNKKAEVGLLRVLEYEVGFWNMIRPPSHRT 232

BLAST of Cp4.1LG17g00210 vs. NCBI nr
Match: gi|449460951|ref|XP_004148207.1| (PREDICTED: seed maturation protein PM36 [Cucumis sativus])

HSP 1 Score: 393.3 bits (1009), Expect = 3.1e-106
Identity = 184/226 (81.42%), Positives = 207/226 (91.59%), Query Frame = 1

Query: 8   MADPKTRAQLAGGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEF 67
           MADPK RAQLAG MTAT+SW+RKHRLIYT ATRHP +L+IRDGT+DL+AF+TW+EQ+  F
Sbjct: 1   MADPKARAQLAGAMTATESWLRKHRLIYTGATRHPFILTIRDGTIDLSAFKTWLEQDFGF 60

Query: 68  LRSFTAFVASVLVKAWKESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQN 127
           LRSF AFV SVLVKAWKESDDRADEEVIL  LA+LNDEFAWFKKE+LKRDI+L+++VPQN
Sbjct: 61  LRSFAAFVGSVLVKAWKESDDRADEEVILACLAALNDEFAWFKKESLKRDINLSEVVPQN 120

Query: 128 ATAGYSRFLESLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGN 187
           ATAGYSRFLESLMRPE+EYTVAITALW IEAVYHESFA+C+E+G+KTPLELREACERWGN
Sbjct: 121 ATAGYSRFLESLMRPEVEYTVAITALWLIEAVYHESFAHCLEEGTKTPLELREACERWGN 180

Query: 188 EGFGNYCNTLKKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNM 234
           EGFG+YCNTLKKI DRRLEM + E+SKK EV  LRVLE EV FWNM
Sbjct: 181 EGFGSYCNTLKKIADRRLEMGSEEVSKKAEVGFLRVLEYEVEFWNM 226

BLAST of Cp4.1LG17g00210 vs. NCBI nr
Match: gi|1009132042|ref|XP_015883172.1| (PREDICTED: probable bifunctional TENA-E protein [Ziziphus jujuba])

HSP 1 Score: 295.8 bits (756), Expect = 6.8e-77
Identity = 139/219 (63.47%), Positives = 172/219 (78.54%), Query Frame = 1

Query: 25  DSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASVLVKAWK 84
           ++W++KHRLIYT ATRHPL+LSIRDGTV L++F+ W+ Q+C F+R+F  F AS+L+KAWK
Sbjct: 17  ETWLKKHRLIYTGATRHPLILSIRDGTVHLSSFKRWLGQDCIFVRAFVPFAASLLIKAWK 76

Query: 85  ESDDRADEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLESLMRPEM 144
           ESD+R+D +V+LG LA+LNDE AWFK+EA K  + L++IVPQ     Y RFLESLM PE+
Sbjct: 77  ESDNRSDLDVLLGGLAALNDEIAWFKQEAAKWGVLLSEIVPQKTNEEYCRFLESLMSPEV 136

Query: 145 EYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTLKKIVDRR 204
           EYTVAITA WAIEAVY ESFA+C+EDGSKTP EL E C+RWGN GFG YC+ L+ IV+RR
Sbjct: 137 EYTVAITAYWAIEAVYQESFAHCLEDGSKTPPELLETCQRWGNPGFGQYCSALRNIVNRR 196

Query: 205 LEMAA--------GEISKKTEVALLRVLECEVAFWNMSR 236
           LE A+         ++ KK EV  LRVLE EV FWNMSR
Sbjct: 197 LERASDDQLKEGLDDVLKKAEVVFLRVLEYEVDFWNMSR 235

BLAST of Cp4.1LG17g00210 vs. NCBI nr
Match: gi|590661807|ref|XP_007035775.1| (Heme oxygenase-like, multi-helical [Theobroma cacao])

HSP 1 Score: 288.9 bits (738), Expect = 8.3e-75
Identity = 136/220 (61.82%), Positives = 168/220 (76.36%), Query Frame = 1

Query: 19  GGMTATDSWIRKHRLIYTDATRHPLVLSIRDGTVDLNAFRTWVEQECEFLRSFTAFVASV 78
           G    T++W+RKHRL+Y  ATRHP + SIRDG +DL++F+TW+ Q+  F+R+F  FVASV
Sbjct: 10  GKTLMTETWLRKHRLLYVGATRHPFIRSIRDGNIDLSSFKTWLGQDYVFVRAFVPFVASV 69

Query: 79  LVKAWKESDDRA-DEEVILGSLASLNDEFAWFKKEALKRDIDLTKIVPQNATAGYSRFLE 138
           L KA K SD+ + D EV+LG +A+LNDE AWFKKEA K  + L+ IVPQ A   Y RFLE
Sbjct: 70  LSKACKGSDNSSNDVEVMLGGMAALNDEIAWFKKEASKWGVQLSDIVPQKANQNYCRFLE 129

Query: 139 SLMRPEMEYTVAITALWAIEAVYHESFAYCMEDGSKTPLELREACERWGNEGFGNYCNTL 198
           SLM PE+EYTVAITA WAIEAVY ESFA+C+EDG+K P EL+E C+RWGNEGFG YCN L
Sbjct: 130 SLMSPEVEYTVAITAFWAIEAVYQESFAHCLEDGTKPPPELQETCQRWGNEGFGQYCNAL 189

Query: 199 KKIVDRRLEMAAGEISKKTEVALLRVLECEVAFWNMSRDG 238
           +KI DR+LE A+ ++  K EV  LRVLE EV FWN+S  G
Sbjct: 190 RKIADRQLEKASDDVITKAEVTFLRVLEHEVDFWNISHGG 229
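The percentages in each HSP summary line are the match counts divided by the alignment length. The short Python sketch below recomputes them from one such line as a sanity check. The function names `parse_hsp_counts` and `percent` are hypothetical helpers, not part of BLAST or of this database, and the regular expression only assumes the `Name = matches/length` layout seen in the lines above.

```python
import re
from typing import Dict, Tuple


def parse_hsp_counts(line: str) -> Dict[str, Tuple[int, int]]:
    """Extract every 'Name = matches/length' pair from an HSP summary line."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", line)
    return {name: (int(matches), int(length)) for name, matches, length in pairs}


def percent(matches: int, length: int) -> float:
    """Percentage of aligned positions, rounded to two decimals as in the report."""
    return round(100 * matches / length, 2)


# Summary line of the TENAE_SOYBN hit from the Swiss-Prot search above.
line = "Identity = 130/212 (61.32%), Positives = 160/212 (75.47%), Query Frame = 1"
counts = parse_hsp_counts(line)

for name, (matches, length) in counts.items():
    print(f"{name}: {matches}/{length} = {percent(matches, length)}%")
```

For this line the recomputed values are 61.32% identity and 75.47% positives, in agreement with the figures in parentheses.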

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
TENAE_SOYBN   9.2e-74   61.32         Probable bifunctional TENA-E protein OS=Glycine max GN=TENA_E PE=2 SV=1
TENAE_ARATH   3.9e-72   59.15         Bifunctional TENA-E protein OS=Arabidopsis thaliana GN=TENA_E PE=1 SV=3
TENA2_MAIZE   7.5e-60   49.76         Bifunctional TENA2 protein OS=Zea mays GN=TENA2 PE=1 SV=1

TrEMBL
Match Name         E-value    Identity (%)  Description
A0A0A0KEX4_CUCSA   2.2e-106   81.42         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G307400 PE=4 SV=1
A0A061EPD9_THECC   5.8e-75    61.82         Heme oxygenase-like, multi-helical OS=Theobroma cacao GN=TCM_021348 PE=4 SV=1
A0A151ST20_CAJCA   6.4e-74    60.93         Seed maturation protein PM36 OS=Cajanus cajan GN=KK1_004174 PE=4 SV=1
A0A068TRX7_COFCA   6.4e-74    59.53         Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00025696001 PE=4 SV=1
A0A022RZK6_ERYGU   8.3e-74    59.91         Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a024518mg PE=4 SV=1

TAIR10
Match Name    E-value   Identity (%)  Description
AT3G16990.1   2.2e-73   59.15         Haem oxygenase-like, multi-helical

NCBI nr
Match Name                          E-value    Identity (%)  Description
gi|659116976|ref|XP_008458358.1|    1.6e-110   82.48         PREDICTED: seed maturation protein PM36 isoform X1 [Cucumis melo]
gi|659116978|ref|XP_008458359.1|    9.7e-108   81.62         PREDICTED: seed maturation protein PM36 isoform X2 [Cucumis melo]
gi|449460951|ref|XP_004148207.1|    3.1e-106   81.42         PREDICTED: seed maturation protein PM36 [Cucumis sativus]
gi|1009132042|ref|XP_015883172.1|   6.8e-77    63.47         PREDICTED: probable bifunctional TENA-E protein [Ziziphus jujuba]
gi|590661807|ref|XP_007035775.1|    8.3e-75    61.82         Heme oxygenase-like, multi-helical [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR026285   TenA_E
IPR016084   Haem_Oase-like_multi-hlx
IPR004305   Thiaminase-2/PQQC
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG17g00210.1   Cp4.1LG17g00210.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                      Source    Source Term         Source Description               Alignment
IPR004305   Thiaminase-2/PQQC                    PFAM      PF03070             TENA_THI-4                       coord: 29..236, score: 6.8
IPR016084   Haem oxygenase-like, multi-helical   GENE3D    G3DSA:1.20.910.10   -                                coord: 21..234, score: 3.4
IPR016084   Haem oxygenase-like, multi-helical   unknown   SSF48613            Heme oxygenase-like              coord: 38..234, score: 1.38
IPR026285   TenA_E protein                       PIR       PIRSF003170         Pet18p                           coord: 11..237, score: 1.4
None        No IPR available                     PANTHER   PTHR20858           PHOSPHOMETHYLPYRIMIDINE KINASE   coord: 4..221, score: 6.0
None        No IPR available                     PANTHER   PTHR20858:SF18      PROTEIN PET18                    coord: 4..221, score: 6.0

The following gene(s) are paralogous to this gene:

None