Cp4.1LG10g10460.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g10460.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: DUF241 domain protein
Location: Cp4.1LG10 : 4043288 .. 4044148 (-)
Sequence length: 861
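
The reported sequence length can be cross-checked against the genomic coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that is consistent with the stated length of 861); `feature_length` is a hypothetical helper name, not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(4043288, 4044148)

# A complete CDS should be a whole number of codons (including the stop codon).
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # 861 287 0
```

Under that assumption, 287 codons correspond to 286 amino acids plus one stop codon.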
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCCAAGTACCAAGCGAGGTCCATAAGCTTGCCTTCTAGATTGCATCCCTGTGCTGTGAAGGTTGAGGAGGAGTTGAGGAAGGTGAAGACATGGGTGTCTTCTTCTTCTTCTTCTTCTTCTGTTTGGGGTGCTCTTTTGGGATTACAGGATTTGTATGACTCCATTGATGAGCTTCTCAAAATGGGTTCCACTCAGCAGGTTTTGTCTTGTCCCCAAAACAAACAGTTGGTGGACGAGTTGTTGGATGATTCTATGAAGCTTTTGGATGTCTGCAGCTTAGCAAAGGACATAGCATTAGAAACCCAAACGCATGTTGGGGCTCTTCACTCTGCCTTTCGCCGGAGGAAAGGCGATTCCGCCATGAAAACCACCACTGCTGCTTACATTTCCTATAGAAAAAAGATGAAGAAAGAAGGTAAAAACTTGATAACATCAATGAAGAAGATGAATGAGAAATTCAACTCATCCCCAATGCAAAATCCAGATAATCACCTGAGCTCTGTGGTTGAAGCGCTGAGACAAGTTTGCTCAACCAACAGCTCTATTTTCGAATCCGTGTTGTTGTACTTAACGCCATTGACGAAGCCAAAAGCTCGAGGATGGTCTTTGGTTACTAAGTGGGTGCACAAGGGGGCGATTGCCTGCGAGTCAAACAGTGCCATGAACGAATTTGAGAACGTGGATGTGGCTTTGAGCTCTGTTGTTGAAGAAATGGAGGTTGAGAAGTTGCAGGTTGCTCAGAGAAGATTGGAGAGTTTGGAAATGGCGGCACAAGAAATTGAGAGTGGGTTGGATGGTGTGTTCAGGAGATTGATCAAAATAAGAGCCTCTCTGTTGAACATAATATCTCAATAG

mRNA sequence

ATGGCTGCCAAGTACCAAGCGAGGTCCATAAGCTTGCCTTCTAGATTGCATCCCTGTGCTGTGAAGGTTGAGGAGGAGTTGAGGAAGGTGAAGACATGGGTGTCTTCTTCTTCTTCTTCTTCTTCTGTTTGGGGTGCTCTTTTGGGATTACAGGATTTGTATGACTCCATTGATGAGCTTCTCAAAATGGGTTCCACTCAGCAGGTTTTGTCTTGTCCCCAAAACAAACAGTTGGTGGACGAGTTGTTGGATGATTCTATGAAGCTTTTGGATGTCTGCAGCTTAGCAAAGGACATAGCATTAGAAACCCAAACGCATGTTGGGGCTCTTCACTCTGCCTTTCGCCGGAGGAAAGGCGATTCCGCCATGAAAACCACCACTGCTGCTTACATTTCCTATAGAAAAAAGATGAAGAAAGAAGGTAAAAACTTGATAACATCAATGAAGAAGATGAATGAGAAATTCAACTCATCCCCAATGCAAAATCCAGATAATCACCTGAGCTCTGTGGTTGAAGCGCTGAGACAAGTTTGCTCAACCAACAGCTCTATTTTCGAATCCGTGTTGTTGTACTTAACGCCATTGACGAAGCCAAAAGCTCGAGGATGGTCTTTGGTTACTAAGTGGGTGCACAAGGGGGCGATTGCCTGCGAGTCAAACAGTGCCATGAACGAATTTGAGAACGTGGATGTGGCTTTGAGCTCTGTTGTTGAAGAAATGGAGGTTGAGAAGTTGCAGGTTGCTCAGAGAAGATTGGAGAGTTTGGAAATGGCGGCACAAGAAATTGAGAGTGGGTTGGATGGTGTGTTCAGGAGATTGATCAAAATAAGAGCCTCTCTGTTGAACATAATATCTCAATAG

Coding sequence (CDS)

ATGGCTGCCAAGTACCAAGCGAGGTCCATAAGCTTGCCTTCTAGATTGCATCCCTGTGCTGTGAAGGTTGAGGAGGAGTTGAGGAAGGTGAAGACATGGGTGTCTTCTTCTTCTTCTTCTTCTTCTGTTTGGGGTGCTCTTTTGGGATTACAGGATTTGTATGACTCCATTGATGAGCTTCTCAAAATGGGTTCCACTCAGCAGGTTTTGTCTTGTCCCCAAAACAAACAGTTGGTGGACGAGTTGTTGGATGATTCTATGAAGCTTTTGGATGTCTGCAGCTTAGCAAAGGACATAGCATTAGAAACCCAAACGCATGTTGGGGCTCTTCACTCTGCCTTTCGCCGGAGGAAAGGCGATTCCGCCATGAAAACCACCACTGCTGCTTACATTTCCTATAGAAAAAAGATGAAGAAAGAAGGTAAAAACTTGATAACATCAATGAAGAAGATGAATGAGAAATTCAACTCATCCCCAATGCAAAATCCAGATAATCACCTGAGCTCTGTGGTTGAAGCGCTGAGACAAGTTTGCTCAACCAACAGCTCTATTTTCGAATCCGTGTTGTTGTACTTAACGCCATTGACGAAGCCAAAAGCTCGAGGATGGTCTTTGGTTACTAAGTGGGTGCACAAGGGGGCGATTGCCTGCGAGTCAAACAGTGCCATGAACGAATTTGAGAACGTGGATGTGGCTTTGAGCTCTGTTGTTGAAGAAATGGAGGTTGAGAAGTTGCAGGTTGCTCAGAGAAGATTGGAGAGTTTGGAAATGGCGGCACAAGAAATTGAGAGTGGGTTGGATGGTGTGTTCAGGAGATTGATCAAAATAAGAGCCTCTCTGTTGAACATAATATCTCAATAG

Protein sequence

MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSSVWGALLGLQDLYDSIDELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKGDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVCSTNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEFENVDVALSSVVEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ
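
The relationship between the coding sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code and uses only short fragments copied from the start and end of the CDS above; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTGCCAAGTACCAAGCGAGGTCCATA"))  # first 30 nt of the CDS -> MAAKYQARSI
print(translate("ATATCTCAATAG"))                    # last 12 nt of the CDS  -> ISQ
```

Both fragments agree with the first ten and last three residues of the protein sequence listed above.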
BLAST of Cp4.1LG10g10460.1 vs. TrEMBL
Match: A0A0A0KGU4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G507480 PE=4 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 1.0e-123
Identity = 239/288 (82.99%), Positives = 260/288 (90.28%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSS--SSSSSSVWGALLGLQDLYDSID 60
           MA KY ARSISLPSR HP  +KVEEEL KVKTWVSS  SSSSSSV G LLGLQDLYDSID
Sbjct: 1   MATKYHARSISLPSRSHPSTLKVEEELAKVKTWVSSTTSSSSSSVCGGLLGLQDLYDSID 60

Query: 61  ELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK 120
           ELLKMGSTQ+VLSCPQ+KQ V+ELLD SMKLLDVCSLAK++ LETQ HVGALHSA RRRK
Sbjct: 61  ELLKMGSTQKVLSCPQHKQFVEELLDGSMKLLDVCSLAKEVTLETQQHVGALHSAVRRRK 120

Query: 121 GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVC 180
           GDSA+KT T AY  YRK+MKKE K LITSMKKMNEKFN++PM+NPD+HLSSV+ ALRQ C
Sbjct: 121 GDSAVKTATVAYNCYRKRMKKEAKKLITSMKKMNEKFNTTPMENPDHHLSSVIGALRQAC 180

Query: 181 STNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEFENVDVALSSVVE 240
           STN+ IFESVL+YLTPLTK KARGWSLV+KWVHKGAIACESNS +NEFENVDVALSSVV+
Sbjct: 181 STNNLIFESVLVYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVQ 240

Query: 241 EMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
           EMEVEK Q+AQ+RLESLEMAAQEIESGLDGVFRRLIK RAS+LNIISQ
Sbjct: 241 EMEVEKSQIAQKRLESLEMAAQEIESGLDGVFRRLIKTRASMLNIISQ 288
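
The percentages in an HSP header are simply the identical or positive-scoring positions divided by the alignment length. A minimal sketch for recomputing them from a header line follows; the regular expression and the `parse_hsp_stats` helper are assumptions written for this page's layout, not part of any BLAST tool, and the pattern tolerates the "Postives" spelling found in this output.

```python
import re

# Matches e.g. "Identity = 239/288 (82.99%), Positives = 260/288 (90.28%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, aln_len, positive, pos_len = map(int, match.groups())
    return {
        "alignment_length": aln_len,
        "identity_pct": round(100 * identical / aln_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 239/288 (82.99%), Postives = 260/288 (90.28%), Query Frame = 1"
)
print(stats)
```

For the Cucumis sativus hit above this gives 82.99% identity and 90.28% positives over an alignment length of 288 columns, which includes gap positions.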

BLAST of Cp4.1LG10g10460.1 vs. TrEMBL
Match: M5VXA1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026133mg PE=4 SV=1)

HSP 1 Score: 286.6 bits (732), Expect = 3.4e-74
Identity = 156/297 (52.53%), Positives = 219/297 (73.74%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSS--VWGALLGLQDLYDSID 60
           MAAKY  RSISLPSR HP  ++VEEEL +++ W +SSS+S+S  +  AL GL++LY+ +D
Sbjct: 1   MAAKYHVRSISLPSRSHPTTLRVEEELSRLQAWEASSSASTSDSICRALCGLEELYECVD 60

Query: 61  ELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK 120
           +LL M STQQ+LS PQ ++ ++ELLD S++LLD+C + KD   + + H  AL SA RRRK
Sbjct: 61  DLLHMASTQQLLSQPQQEKYMNELLDGSVRLLDICGITKDAISQIKEHARALQSALRRRK 120

Query: 121 GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVC 180
           GDS+++T  A Y  +RKKMKKE K LITS+K+++ K  +S M   D HLS+V+  LR+ C
Sbjct: 121 GDSSIETGIANYTCFRKKMKKEAKKLITSLKQVDSKIGASQMVEQDQHLSAVIRVLREAC 180

Query: 181 STNSSIFESVLLYLT-PLTKPKARGWSLVTKWVHKGAIACESNSA-MNEFENVDVALSSV 240
           S N SIF+S+L++L+ P++KPK+  WSLV+K++HKG IACE     +NE + VD AL+++
Sbjct: 181 SNNMSIFQSLLVFLSVPVSKPKSNKWSLVSKFMHKGVIACEGQKEDINEMDGVDAALNTL 240

Query: 241 -------VEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
                  +E  +VEK+Q AQ+RLE+LE+  + ++SGL+ VFRRLIK RASLLNIISQ
Sbjct: 241 RKSSAAPIECTDVEKIQSAQKRLEALEVTIEGLDSGLESVFRRLIKTRASLLNIISQ 297

BLAST of Cp4.1LG10g10460.1 vs. TrEMBL
Match: U5GEW5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s10660g PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 1.5e-69
Identity = 150/288 (52.08%), Positives = 208/288 (72.22%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTW-VSSSSSSSSVWGALLGLQDLYDSIDE 60
           MA KY  RSISLPSR HP   ++EEEL K+K W VSS+S+S S+   L GL+DLY  +D+
Sbjct: 1   MACKYHVRSISLPSRSHPTTQRIEEELNKLKAWEVSSNSTSGSICNGLSGLEDLYKCMDD 60

Query: 61  LLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKG 120
           LL + STQQVLS  +N++ +DELLD S++LLDVCS+ +DI L  +  V AL SAFRRRKG
Sbjct: 61  LLNLASTQQVLSRYENEKCLDELLDGSVRLLDVCSIGRDILLRFREQVQALQSAFRRRKG 120

Query: 121 DSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVCS 180
           DS+++++ A +  +RKKMKK+ K LI S+K+M+ K  +S + + D HLS+V+  +R+V  
Sbjct: 121 DSSIESSVATFTCFRKKMKKDAKKLIASLKQMDNKLGASSLLDQDQHLSAVIRVIREVNV 180

Query: 181 TNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSA-MNEFENVDVALSSVVE 240
            N SIF+S+L++L+  +KP    WSLV+K +HKG IACE     +NE E VD ALS V +
Sbjct: 181 INCSIFQSLLMFLSKSSKPNQSRWSLVSKLMHKGVIACEEKQENVNEIETVDAALSEVSD 240

Query: 241 EMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
               EK+++AQ+RLE+LEM+  ++E+ L+ + R LIK RASLLNIISQ
Sbjct: 241 S---EKVKIAQKRLEALEMSIDDLENCLERLSRPLIKSRASLLNIISQ 285

BLAST of Cp4.1LG10g10460.1 vs. TrEMBL
Match: A0A061DGD3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_000101 PE=4 SV=1)

HSP 1 Score: 256.9 bits (655), Expect = 2.9e-65
Identity = 140/288 (48.61%), Positives = 202/288 (70.14%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSS-SSSSSVWGALLGLQDLYDSIDE 60
           MAAKY  RSISLPSR HP  +++E+EL ++KTW +S  S+S S+   L GL+DLY  +D+
Sbjct: 1   MAAKYHVRSISLPSRSHPTTLRIEDELNRLKTWEASPLSTSESICAGLSGLEDLYQCMDD 60

Query: 61  LLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKG 120
           LL + STQQVLS  Q+++ +DELLD S++LLD+CS+A+D   + +  V AL SA RRRK 
Sbjct: 61  LLNLASTQQVLSQHQHEKCIDELLDGSVRLLDICSIARDYMFQLKERVHALQSALRRRKR 120

Query: 121 DSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVCS 180
           DS+++     Y  +RK+MKK+GK LIT +K+M+ K  +SP+ + D+H S+V+  LR+V +
Sbjct: 121 DSSIENDIINYTCFRKEMKKQGKKLITELKQMDNKLGASPLLDQDHHFSAVIRVLREVNA 180

Query: 181 TNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSA-MNEFENVDVALSSVVE 240
            N+SIF+S+  +L+ L   K   WSLV+K +HKG I+CE     +NE E+VD AL     
Sbjct: 181 MNTSIFQSLFSFLSALVSSKQTRWSLVSKLMHKGVISCEEKQENVNELESVDAALCR--H 240

Query: 241 EMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
             +VEK+Q+A +RL +LE + + +E+ L+ VFR LIK R SLLNI+SQ
Sbjct: 241 TSDVEKMQIAHKRLVALESSIEGLENRLECVFRHLIKARTSLLNIVSQ 286

BLAST of Cp4.1LG10g10460.1 vs. TrEMBL
Match: A0A0L9TPR4_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan01g171500 PE=4 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 4.6e-63
Identity = 145/287 (50.52%), Positives = 199/287 (69.34%), Query Frame = 1

Query: 4   KYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSS-SSVWGALLGLQDLYDSIDELLK 63
           KY  RSISLPSR HP  ++VEEEL K+KTW  +S+ S  S+   L  LQ+LY ++D+LL 
Sbjct: 5   KYHVRSISLPSRSHPSTIRVEEELSKLKTWEGTSTPSLQSIQNGLSLLQELYIALDDLLN 64

Query: 64  MGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKGDSA 123
           M STQQV+S  +  + V+E+LD SM++LD+C + +D  L+ + +V ALHSA RRRKGDS+
Sbjct: 65  MSSTQQVISLHKGHKCVEEVLDGSMRILDMCGITRDTLLQIKENVQALHSALRRRKGDSS 124

Query: 124 MKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVCSTNS 183
           ++T+ A Y  + KKMKK    LITS+K M+ KF  SP+ + D+HL+SV   LR+V   N 
Sbjct: 125 VETSVAEYKFFAKKMKKNVNKLITSLKHMDAKFGVSPLLDLDHHLASVTRVLREVIVINL 184

Query: 184 SIFESVLLYLT-PLTKPKARGWSLVTKWVHKGAIACESNSAM-NEFENVDVALSSVV-EE 243
           S+F+S+L +LT   +K KA  W LV K +HKG    E NS   NE  +V++ALS+++ E 
Sbjct: 185 SVFQSILSFLTVSSSKSKATKWLLVAKLMHKGVKPSEENSENDNELHSVEMALSTLLNES 244

Query: 244 MEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
              E ++VA  RLE+LE A + +E+GL+ VFRRLIK RASLLNIISQ
Sbjct: 245 THDESIRVAHERLEALENAIESVENGLESVFRRLIKTRASLLNIISQ 291

BLAST of Cp4.1LG10g10460.1 vs. TAIR10
Match: AT4G35690.1 (AT4G35690.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 176.4 bits (446), Expect = 2.5e-44
Identity = 113/294 (38.44%), Positives = 170/294 (57.82%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSSVWGALLGLQDLYDSIDEL 60
           M  K Q RSISLPS  HP    +EE L KVKT  + + SS SV   L GL++LY+  ++ 
Sbjct: 4   MLVKNQLRSISLPSSSHPSTTGIEESLNKVKTINTMTGSSESVLMGLEGLEELYNCTEDF 63

Query: 61  LKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK-- 120
           LKMGSTQ+V+S     + ++E+LD S++L+D+CS+++D+ +ETQ HV  + S  RR+K  
Sbjct: 64  LKMGSTQRVMSSSDGSEFMEEMLDGSLRLMDICSVSRDLMVETQEHVRGVQSCVRRKKVV 123

Query: 121 -GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQN---PDNHLSSVVEAL 180
            G+  +    A Y+ +RK M+KE K L+ S+K ++   +SS   N    + HL  VV+A+
Sbjct: 124 GGEDQLDVAVAGYVGFRKNMRKEAKRLLGSLKNIDGGLSSSSSVNNGEQEEHLVVVVDAM 183

Query: 181 RQVCSTNSSIFESVLLYLTPLTKPKAR---GWSLVTKWVHKGAIACESNSAMNEFENVDV 240
           RQV S + ++  S L +L+   +   +      L  K VH            NE EN+D+
Sbjct: 184 RQVVSVSVAVLRSFLEFLSGRRQSNIKSKLASVLKKKKVH------HVEETKNELENLDL 243

Query: 241 ALSSVVEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIIS 286
            +     ++        Q++LE +EM+    E  L+G+FRRLI+ RASLLNIIS
Sbjct: 244 EIFCSRNDL--------QKKLEEVEMSIDGFEKKLEGLFRRLIRTRASLLNIIS 283

BLAST of Cp4.1LG10g10460.1 vs. TAIR10
Match: AT4G35710.1 (AT4G35710.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 146.7 bits (369), Expect = 2.1e-35
Identity = 103/291 (35.40%), Positives = 167/291 (57.39%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSSVWGALLGLQDLYDSIDEL 60
           M  K Q RSISLPSR  P    +EE L K+KT  +++ SS S+   L GL++LY  ++E 
Sbjct: 4   MIIKKQLRSISLPSRSQPSTSGLEESLNKIKTINTTTGSSESILMGLAGLEELYIFLEEF 63

Query: 61  LKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK-- 120
           LKMGS Q+V+S     + ++E+LD S++L+D+CS+++D+ +ET  HV  + S  RR+K  
Sbjct: 64  LKMGSKQRVMS-SGGSEFMEEMLDGSLRLMDICSVSRDLMVETHEHVRGVQSYVRRKKVS 123

Query: 121 ---GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALR 180
              G   +    + Y+ +RK M+KE K L+ S+KK++    S    + D  L +V++ +R
Sbjct: 124 GGGGGDKIDVAVSDYVGFRKNMRKEAKKLLGSLKKVDGGTRSCDNDHEDEQLVAVIDRVR 183

Query: 181 QVCSTNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNS-AMNEFENVDVALS 240
           +V S +  + +S L  L+       R  ++ +K      +  ++++ A N  E +D A+ 
Sbjct: 184 RVVSVSVVVLKSFLELLS------RRKSNIKSKLASVLKMKKDNHAPAKNVLETLDSAIF 243

Query: 241 SVVEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIIS 286
              + +  + L   Q  LE +EM     E  L+G+FRRLI+ RAS+LNIIS
Sbjct: 244 G--DFLSHDDL---QNELEEVEMCIGGFERNLEGLFRRLIRTRASILNIIS 282

BLAST of Cp4.1LG10g10460.1 vs. TAIR10
Match: AT2G17680.1 (AT2G17680.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 144.8 bits (364), Expect = 8.1e-35
Identity = 109/300 (36.33%), Positives = 161/300 (53.67%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSS--SSSSVWGALLGLQDLYDSID 60
           M  K   RSISL SR HP    +EE L K    +++S+  SS SV   L GL+DLYD  +
Sbjct: 4   MMIKNHVRSISLQSRSHPSTAAIEESLDKFLITMNTSTMASSESVHSGLSGLEDLYDCSE 63

Query: 61  ELLKMGSTQQVLSCPQNK---------QLVDELLDDSMKLLDVCSLAKDIALETQTHVGA 120
           +LLKMGSTQ+VLS    K         + ++E+LD S++L+D+C++++D+ +ET  HV  
Sbjct: 64  DLLKMGSTQRVLSFSDEKKKKKRKVKGEFMEEMLDGSLRLMDICNVSRDLMVETHEHVLG 123

Query: 121 LHSAFRRRKGDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKF---NSSPMQNPDNH 180
           L S  RRRK         + Y+ +RK M+KE K L+ S+K +N      +    Q+ D H
Sbjct: 124 LQSCVRRRK-----DVDVSGYVGFRKNMRKEVKKLLGSLKNINVGLVMRDHGYDQDGDIH 183

Query: 181 LSSVVEALRQVCSTNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEF 240
             +V+ A+R+V     S+ +S   +L+           L    ++K           NE 
Sbjct: 184 FLAVIHAMRRVVYMTVSVLKSFFEFLSGRQNGNDVRSKLALVLMNK-KFHDHDKMVKNEL 243

Query: 241 ENVDVALSSVVEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
           ENVD A+    + +  + L     +LE +E+   + E  L+G+FR LIK RASLLNIISQ
Sbjct: 244 ENVDSAICG--DSISHDDL---HEKLEEVEVWIGKFEKSLEGLFRGLIKTRASLLNIISQ 292

BLAST of Cp4.1LG10g10460.1 vs. TAIR10
Match: AT2G17080.1 (AT2G17080.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 130.2 bits (326), Expect = 2.1e-30
Identity = 97/285 (34.04%), Positives = 160/285 (56.14%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWV-SSSSSSSSVWGALLGLQDLYDSIDE 60
           MA  +  RS S PSR HP A  V+E+L ++++   +SSSSSSS+   L  LQ+L++S+D+
Sbjct: 1   MAVSFHVRSNSFPSRSHPQAAHVDEQLARLRSSEQASSSSSSSICQRLDNLQELHESLDK 60

Query: 61  LLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKG 120
           L+    TQQ LS   NK+ V++LLD S+++LD+C+++KD   E +  +  + S  RR++G
Sbjct: 61  LISRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRG 120

Query: 121 DSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVCS 180
           D  +      Y++ RK +KK  + +  S+K    +       N D+ L+   EA     +
Sbjct: 121 D--LSEEVKKYLTSRKSLKKSFQKVQKSLKVTQAE------DNNDDTLAVFGEAE----A 180

Query: 181 TNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEFENVDVALSSVVEE 240
              S+F+S+L Y++         WS+V+K ++K  + CE+    NEF  VD        E
Sbjct: 181 ITLSLFDSLLSYMS--GSKTCSKWSVVSKLMNKKKVTCEAQE--NEFTKVD-------SE 240

Query: 241 MEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNII 285
            + EK  +    +++LE   Q++E GL+ + + LIK R S LNI+
Sbjct: 241 FQSEK-TLKMDDVQNLESCIQDLEDGLESLSKSLIKYRVSFLNIL 261

BLAST of Cp4.1LG10g10460.1 vs. TAIR10
Match: AT4G35660.1 (AT4G35660.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 122.5 bits (306), Expect = 4.3e-28
Identity = 98/285 (34.39%), Positives = 157/285 (55.09%), Query Frame = 1

Query: 7   ARSISLPSRL-HPCAVKVEEELRKVKTWVSSSSSSSSVWGALLGLQDLYDSIDE-LLKMG 66
           ARSISLP+RL HP A +VEEEL+K++   SSSS+SS +   L  L +LYD ++E ++   
Sbjct: 16  ARSISLPTRLIHPKAQRVEEELKKIQALNSSSSASSRIQLGLAKLVELYDFVNEQVISSP 75

Query: 67  STQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKGD-SAM 126
             QQ L   +N++LV++ LD+S+ LLDV    +D+      H+  L SA RRR+G+ S++
Sbjct: 76  QGQQALRLCRNRKLVEDALDESIVLLDVSDFTRDLIGTLMEHIQELQSALRRRRGNLSSV 135

Query: 127 KTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNP---DNHLSSVVEALRQVCST 186
           ++   +YIS+ KK K E    + S+ +   K  +  ++     D H S V   LRQ  ++
Sbjct: 136 QSEIRSYISFHKKSKTEAARQVKSLARRQTKKKAWVIKQSGGLDEHSSMVSNILRQSNAS 195

Query: 187 NSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEFENVDVALSSVVEEM 246
             SI +S+L +L+   +   +           G I C  NS +  F    +    +V+E+
Sbjct: 196 TISILQSLLQFLSTSGENNEK---------KNGEIGCVDNSMIRSFFGRIIG-RKMVKEI 255

Query: 247 EVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIIS 286
           +    Q    RL  + ++ + I+  L  + RRLI+ RASLLNI++
Sbjct: 256 DA---QTILGRLAMVNVSLEAIKDELSYLSRRLIQHRASLLNIVT 287

BLAST of Cp4.1LG10g10460.1 vs. NCBI nr
Match: gi|659080626|ref|XP_008440893.1| (PREDICTED: uncharacterized protein LOC103485177 [Cucumis melo])

HSP 1 Score: 462.2 bits (1188), Expect = 6.5e-127
Identity = 243/286 (84.97%), Positives = 262/286 (91.61%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSSVWGALLGLQDLYDSIDEL 60
           MA KY ARSISLPSR HP   KVEEELRKVKTWVSSSSSSSSV G LLGLQDLYDSIDEL
Sbjct: 1   MATKYHARSISLPSRSHPSTSKVEEELRKVKTWVSSSSSSSSVCGGLLGLQDLYDSIDEL 60

Query: 61  LKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRKGD 120
           LKMGSTQ+VLSCPQ+KQLV+ELLD SMKLLDVCSLAK++ LETQ HVGALHSA RRRKGD
Sbjct: 61  LKMGSTQKVLSCPQHKQLVEELLDGSMKLLDVCSLAKEVTLETQQHVGALHSAVRRRKGD 120

Query: 121 SAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVCST 180
           SA+KT TAAY  YRK+MKKE K LITSMKKMNEKFN++PM+NPD+HLSSV+ ALRQ CST
Sbjct: 121 SAVKTATAAYNCYRKRMKKEAKKLITSMKKMNEKFNTTPMENPDHHLSSVIGALRQACST 180

Query: 181 NSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEFENVDVALSSVVEEM 240
           NS IFESVL+YLTPLTK KARGWSLV+KWVHKGAIACESNS +NEFENVDVALSS+VEEM
Sbjct: 181 NSLIFESVLVYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSIVEEM 240

Query: 241 EVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
           EVEK Q+AQ+RLESLEMAAQEIESGLDGVFRRLIK RAS+LNIISQ
Sbjct: 241 EVEKSQIAQKRLESLEMAAQEIESGLDGVFRRLIKTRASMLNIISQ 286

BLAST of Cp4.1LG10g10460.1 vs. NCBI nr
Match: gi|778719853|ref|XP_011658068.1| (PREDICTED: uncharacterized protein LOC101217795 [Cucumis sativus])

HSP 1 Score: 451.1 bits (1159), Expect = 1.5e-123
Identity = 239/288 (82.99%), Positives = 260/288 (90.28%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSS--SSSSSSVWGALLGLQDLYDSID 60
           MA KY ARSISLPSR HP  +KVEEEL KVKTWVSS  SSSSSSV G LLGLQDLYDSID
Sbjct: 1   MATKYHARSISLPSRSHPSTLKVEEELAKVKTWVSSTTSSSSSSVCGGLLGLQDLYDSID 60

Query: 61  ELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK 120
           ELLKMGSTQ+VLSCPQ+KQ V+ELLD SMKLLDVCSLAK++ LETQ HVGALHSA RRRK
Sbjct: 61  ELLKMGSTQKVLSCPQHKQFVEELLDGSMKLLDVCSLAKEVTLETQQHVGALHSAVRRRK 120

Query: 121 GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVC 180
           GDSA+KT T AY  YRK+MKKE K LITSMKKMNEKFN++PM+NPD+HLSSV+ ALRQ C
Sbjct: 121 GDSAVKTATVAYNCYRKRMKKEAKKLITSMKKMNEKFNTTPMENPDHHLSSVIGALRQAC 180

Query: 181 STNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMNEFENVDVALSSVVE 240
           STN+ IFESVL+YLTPLTK KARGWSLV+KWVHKGAIACESNS +NEFENVDVALSSVV+
Sbjct: 181 STNNLIFESVLVYLTPLTKSKARGWSLVSKWVHKGAIACESNSGLNEFENVDVALSSVVQ 240

Query: 241 EMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
           EMEVEK Q+AQ+RLESLEMAAQEIESGLDGVFRRLIK RAS+LNIISQ
Sbjct: 241 EMEVEKSQIAQKRLESLEMAAQEIESGLDGVFRRLIKTRASMLNIISQ 288

BLAST of Cp4.1LG10g10460.1 vs. NCBI nr
Match: gi|595821705|ref|XP_007204858.1| (hypothetical protein PRUPE_ppa026133mg [Prunus persica])

HSP 1 Score: 286.6 bits (732), Expect = 4.9e-74
Identity = 156/297 (52.53%), Positives = 219/297 (73.74%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSS--VWGALLGLQDLYDSID 60
           MAAKY  RSISLPSR HP  ++VEEEL +++ W +SSS+S+S  +  AL GL++LY+ +D
Sbjct: 1   MAAKYHVRSISLPSRSHPTTLRVEEELSRLQAWEASSSASTSDSICRALCGLEELYECVD 60

Query: 61  ELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK 120
           +LL M STQQ+LS PQ ++ ++ELLD S++LLD+C + KD   + + H  AL SA RRRK
Sbjct: 61  DLLHMASTQQLLSQPQQEKYMNELLDGSVRLLDICGITKDAISQIKEHARALQSALRRRK 120

Query: 121 GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVC 180
           GDS+++T  A Y  +RKKMKKE K LITS+K+++ K  +S M   D HLS+V+  LR+ C
Sbjct: 121 GDSSIETGIANYTCFRKKMKKEAKKLITSLKQVDSKIGASQMVEQDQHLSAVIRVLREAC 180

Query: 181 STNSSIFESVLLYLT-PLTKPKARGWSLVTKWVHKGAIACESNSA-MNEFENVDVALSSV 240
           S N SIF+S+L++L+ P++KPK+  WSLV+K++HKG IACE     +NE + VD AL+++
Sbjct: 181 SNNMSIFQSLLVFLSVPVSKPKSNKWSLVSKFMHKGVIACEGQKEDINEMDGVDAALNTL 240

Query: 241 -------VEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
                  +E  +VEK+Q AQ+RLE+LE+  + ++SGL+ VFRRLIK RASLLNIISQ
Sbjct: 241 RKSSAAPIECTDVEKIQSAQKRLEALEVTIEGLDSGLESVFRRLIKTRASLLNIISQ 297

BLAST of Cp4.1LG10g10460.1 vs. NCBI nr
Match: gi|645273856|ref|XP_008242078.1| (PREDICTED: uncharacterized protein LOC103340436 [Prunus mume])

HSP 1 Score: 282.0 bits (720), Expect = 1.2e-72
Identity = 154/297 (51.85%), Positives = 218/297 (73.40%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTWVSSSSSSSS--VWGALLGLQDLYDSID 60
           MAAKY  RSISLPSR HP  ++VEEEL +++ W +SSS+S+S  +  AL GL++LY+ +D
Sbjct: 1   MAAKYHVRSISLPSRSHPTTLRVEEELSRLQAWEASSSASTSDSICRALCGLEELYECLD 60

Query: 61  ELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK 120
           +LL M STQQ+LS  Q ++ +DELLD S++LLD+C + KD   + + H  AL SA RRRK
Sbjct: 61  DLLHMASTQQLLSQHQQEKYMDELLDGSVRLLDICGITKDAISQIKEHARALQSALRRRK 120

Query: 121 GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVC 180
           GDS+++T+ A Y  +RKKMKK+ K LITS+K+++ K  +S M   D HLS+V+  LR+ C
Sbjct: 121 GDSSIETSIANYTCFRKKMKKDAKKLITSLKQVDSKIGASQMVEQDQHLSAVIRVLREAC 180

Query: 181 STNSSIFESVLLYLT-PLTKPKARGWSLVTKWVHKGAIACESNSA-MNEFENVDVALSSV 240
           S N SIF+S L++L+ P++KPK+  WSLV+K++HKG IACE     +NE + VD AL+++
Sbjct: 181 SKNMSIFQSFLVFLSVPVSKPKSNKWSLVSKFMHKGVIACEGQEEDINEMDGVDAALNTL 240

Query: 241 -------VEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
                  +E  +VEK++ AQ+RLE+LE+  + +E+GL+ VFRRLIK RASLLNIISQ
Sbjct: 241 RKSSTAPIECTDVEKIRSAQKRLEALEVTIEGLENGLESVFRRLIKTRASLLNIISQ 297

BLAST of Cp4.1LG10g10460.1 vs. NCBI nr
Match: gi|470105356|ref|XP_004289051.1| (PREDICTED: uncharacterized protein LOC101310646 [Fragaria vesca subsp. vesca])

HSP 1 Score: 277.3 bits (708), Expect = 3.0e-71
Identity = 151/290 (52.07%), Positives = 213/290 (73.45%), Query Frame = 1

Query: 1   MAAKYQARSISLPSRLHPCAVKVEEELRKVKTW--VSSSSSSSSVWGALLGLQDLYDSID 60
           MAAKY  RSISLP+R HP  V+VEEEL ++++W   SS+S+S S+   L GL++LYD +D
Sbjct: 1   MAAKYHVRSISLPTRSHPTTVRVEEELGRLQSWESTSSASTSDSILRGLSGLEELYDCVD 60

Query: 61  ELLKMGSTQQVLSCPQNKQLVDELLDDSMKLLDVCSLAKDIALETQTHVGALHSAFRRRK 120
           +LL+M STQQ+LS  Q ++ +DELLD S+KLLD+C + +D  L+ + HV AL SA RRRK
Sbjct: 61  DLLQMASTQQLLSQHQQEKCMDELLDGSVKLLDICGITRDFMLQVKEHVFALQSALRRRK 120

Query: 121 GDSAMKTTTAAYISYRKKMKKEGKNLITSMKKMNEKFNSSPMQNPDNHLSSVVEALRQVC 180
           GDS+++T+ A+Y S+ KKMKK+ K LI+ +K+ + K  SS     D HL++V+  LRQVC
Sbjct: 121 GDSSIETSIASYTSFSKKMKKDAKKLISQLKQADSKIVSSQSLEQDQHLAAVIRVLRQVC 180

Query: 181 STNSSIFESVLLYLTPLTKPKARGWSLVTKWVHKGAIACESNSAMN--EFENVDVALSSV 240
           + N SIF+S+L++L  +   K+  WSLV+K +HKG +ACE+   +N  E + VD  LSS+
Sbjct: 181 AKNMSIFQSLLVFLA-VPVSKSNRWSLVSKLMHKGVVACETQVNINGHELDGVDSVLSSL 240

Query: 241 VEEMEVEKLQVAQRRLESLEMAAQEIESGLDGVFRRLIKIRASLLNIISQ 287
            +  EVEK+Q AQ++LE+LE+  + +ESGL+ VF+RLIK RASLLNIISQ
Sbjct: 241 CKSAEVEKIQSAQKKLEALEVCIEGLESGLESVFKRLIKTRASLLNIISQ 289

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name                        E-value   Identity (%)  Description
A0A0A0KGU4_CUCSA                  1.0e-123  82.99         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G507480 PE=4 SV=1
M5VXA1_PRUPE                      3.4e-74   52.53         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026133mg PE=4 SV=1
U5GEW5_POPTR                      1.5e-69   52.08         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s10660g PE=4 SV=1
A0A061DGD3_THECC                  2.9e-65   48.61         Uncharacterized protein OS=Theobroma cacao GN=TCM_000101 PE=4 SV=1
A0A0L9TPR4_PHAAN                  4.6e-63   50.52         Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan01g171500 PE=4 SV=1

BLAST vs. TAIR10
Match Name                        E-value   Identity (%)  Description
AT4G35690.1                       2.5e-44   38.44         Arabidopsis protein of unknown function (DUF241)
AT4G35710.1                       2.1e-35   35.40         Arabidopsis protein of unknown function (DUF241)
AT2G17680.1                       8.1e-35   36.33         Arabidopsis protein of unknown function (DUF241)
AT2G17080.1                       2.1e-30   34.04         Arabidopsis protein of unknown function (DUF241)
AT4G35660.1                       4.3e-28   34.39         Arabidopsis protein of unknown function (DUF241)

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659080626|ref|XP_008440893.1|  6.5e-127  84.97         PREDICTED: uncharacterized protein LOC103485177 [Cucumis melo]
gi|778719853|ref|XP_011658068.1|  1.5e-123  82.99         PREDICTED: uncharacterized protein LOC101217795 [Cucumis sativus]
gi|595821705|ref|XP_007204858.1|  4.9e-74   52.53         hypothetical protein PRUPE_ppa026133mg [Prunus persica]
gi|645273856|ref|XP_008242078.1|  1.2e-72   51.85         PREDICTED: uncharacterized protein LOC103340436 [Prunus mume]
gi|470105356|ref|XP_004289051.1|  3.0e-71   52.07         PREDICTED: uncharacterized protein LOC101310646 [Fragaria vesca subsp. vesca]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR004320  DUF241_pln
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG10g10460  Cp4.1LG10g10460  gene


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG10g10460.1:cds:001  Cp4.1LG10g10460.1:cds:001  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG10g10460.1  Cp4.1LG10g10460.1-protein  polypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                            Source   Source Term     Source Description   Alignment
IPR004320  Protein of unknown function DUF241, plant  PFAM     PF03087         DUF241               coord: 47..283; score: 8.7
None       No IPR available                           unknown  Coil            Coil                 coord: 245..265; scor
None       No IPR available                           PANTHER  PTHR31509       FAMILY NOT NAMED     coord: 1..286; score: 2.3E
None       No IPR available                           PANTHER  PTHR31509:SF14  SUBFAMILY NOT NAMED  coord: 1..286; score: 2.3E
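
To put the annotation coordinates in context, the fraction of the protein covered by each hit can be computed from the `coord:` ranges. The sketch below is illustrative only: it assumes inclusive residue coordinates and a protein length of 286 residues (inferred from the PANTHER range 1..286 and the 861-nt CDS), and `parse_coord` and `coverage` are hypothetical helper names.

```python
def parse_coord(text: str) -> tuple[int, int]:
    """Parse a range such as 'coord: 47..283' into (start, end)."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def coverage(coord: str, protein_length: int) -> float:
    """Fraction of the protein covered by an inclusive coordinate range."""
    start, end = parse_coord(coord)
    return (end - start + 1) / protein_length


PROTEIN_LENGTH = 286  # assumption: inferred from the record, not stated explicitly

for source, coord in [("PFAM DUF241", "coord: 47..283"),
                      ("Coil", "coord: 245..265"),
                      ("PANTHER", "coord: 1..286")]:
    print(f"{source}: {coverage(coord, PROTEIN_LENGTH):.1%}")
```

Under these assumptions the PFAM DUF241 hit spans 237 of the 286 residues.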