CmoCh07G005530.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh07G005530.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionEPIDERMAL PATTERNING FACTOR-like protein 1
LocationCmo_Chr07 : 2488400 .. 2491716 (-)
Sequence length3223
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGTTTCCATTTTTCTGCTGGGTGTGACTGATTTACTGTTGAATCTCAAAAGAAATCAGAAAGAAGTAATCATGTTTATGTTGATTAGATCTTAAAAAGCTACAGCAAAAAGGTATTTATTTTGGGTCTTTTGCAAAAATAATATGTGCGAGGATGTAATAGTTGGTAGGGAAAAATAGTATTAAAAAAACATGGAGAAGAACAAGGAAATACCCCCACAGCATATGTCATTAGGTAAAATCTCATCTGTACATATGAGTATACCTTATGCCTTCTTGAGAGTAGGAGTGACTGACTTACTCGATTCTATGCCTCAATGATGGACATAATATTTTTATTATGAACATCCTCTTCAACTTGAGAACCTTACAAGTTCTAATCCATCGACCTTTCGTTGAGTTATCCTTAGCATTGTATGAGCAAAACTCAATTTGTTTATAACCATAATTCTAGATCAATGAATTTATCAGATAATGTCTTCTGCTACTTTTGGCTTGGCCTGAGTTGATTTAGTTAGATTATACGAATATGTCTTTGCTTAGTCATGTCTAGTTCATGAAATCTGTACATGGAGACACACTCTGGTCTCTTGTTAGCTGCTTAGGGTAGTTAGATTATACCAATCGTTGTTTGCTTATCATATCTTGACCTCCCCAACTTGGCCTAAGTATCTAAACCTCCATCGCTTACAATTTATTTCTTTGTCGGGCCAACTCAAAAACTCAATGGCTATATACCTTCATCACTCGCTTGACTTATTGATGGTGTTTTGACAAACACCACGTTGATTCTAGTTTTGCTACTTATAGGCGGATGCTAGATCCCCACATCTACTATCATCGGGAAAATCAAGATTATACCTCTAAAAAAATAGAAAAAATGGAAGCAATGCAATTATTAATCTTAGAATTTTCAATACATTATTAGTTACAAACTCGAGAAAGGGACAAACCTCGTTTTAAGCACCTGTTCGTGCCACTATGAACAGCGTTCGAGATTCCGATATCCAAATAAAGTTTGCTTCATGATTTTTAAAGAAATTATACATTCTCCGTGAGAGCAAGGGTTTCAGAGTAGAGAGCCAAAATTCATTTCGGCATGGGTTCTCTTCCCTCGATTTCTTTCCCACTTCGTGAAAGAGTCCTACGAAGTTGAAAAAGCTAAAAGTACTCCTCTTGTGTTTTCTTCTTCTTTGGCCTTCAGAATCAGACGAGGGGTTCTGGAGTTGAGTTTGCTCACTGCGTCTCAGACTCTGTTTCTTGGAGTTTTGTAATGTTTTGGAATGGTCTGATGCATGGTTGATAATTTAGTTGACTAGTTGCCCTGCAAAAATAAATCAGCACATCTGGTGAGACAGGCGAGCAATGTGGCTGGGGACATCCGATCAAATTGATACAATACAGAGAAGATTAGCATGACCCTTACGCAAAGATACACGCACAAATTGAGAAATGTATCAAAATCAAACACTGTTTAGTGTGCTAGTGAGAATGCTGGGTTCTCAAGAGGATGAATTGTTAGATTCCACATCGGTTGTTAGATCCCAACCTCTCCATAGTAGATGCGTTTTAAAATCGTGAGGTTGACGACAATACGTAATTGATAAAAACGGACGATATTTGCTAGTGGTGGATTTCTTTTCACCTTTAACGAGCAAACACAACGTATTTTTTTTTATTATTATTATACATCATTATATTATATTTAGGATGGCTAGGATGTTTTAACATGACACTTTTTGATTCATTTTGGATCAATTAGGTGGCTACAGATCTATAGAATTTGATAATATTTTGCCCAGTTGGATGTTAGTGGGTTGTAGTACTAGGTTTTCTTTTTCCTTTTCTTGGATTTTGTAGTTAGGTTTAGGCTAAATTTGATTCTTTTAGCCGAGACATAAATATTTTCAAATACCTCATGAAGTTATAGCCAAGAGATAACTACATCTCCTTTATGAGCAACTTTGGTGCAATGAGGCATTTAAGGAAGAGTAATTGAGTAAATGCTAGTTTGGAGTTTAGTGATTCAAATCTGTGCTCGATATTCTTGAACATTGGTACATTCGTAGATGGGGAGAAGGGTTTTATGAACAAGAATAACAAATAAATTTGGTCCACAGTCAAATGTGAAATCTCGATGGTCCAATCCATCAGCTATTATTTCTTGCTTAGTTGCAAGATGAGATGAGGTAGTCGCTGCATGGCCATCAGCGCAGCGTTGAAATGTAAAATCAATGGGCAGTGAAGATTGAAGAAGAGTGAAACAAATGCAAAGTGGAGAGTAGGGGAGTAAATGCGGTCCCTTTGCAGCAGCCAGGCTAGTGATGGACAGCCTAATGTTGCATAGTGGAAGAATAGGGCGTCCTTTGGATGTGATGTGGATACTACTGTATTGTTCCTTTCACTGTCTGCTACACTCTGTCCTCTTACTCAGAACTTCTCTGCTTTCTCTCTTCTCTTGTTCTTGTCGGATAACTTATGACTCACCCACCCTTTCCTACCCAAATTTATGCTAACTCTCTCTCCTTTGCACTTGCCTCTATTAAATGTTCATCACTCTCTCTCTCTCTTAGTTACTTGTGGAGTGGTGTGGTGTGGTGTGTTGGGTGGAAGTGGATCAAAGTAACAGAAAAATCTGGCAACTGGGTCTTTATTGCATATGATTTTTTTACTCCTTTGCTGCAATGGATCCATGAGTTTTGGGCATCAATGACTTGATGACTACCATGAATTTGCCTACTGTTTGGGCCACCTCATTACTTATTGTGCTTCTTCATCTTCTTCTTTTTCTCTCCGCCTCTTCCCCTTCCAGGGTGTGCCTTCTTTCTCTCTTCCATGAATTTTCTCATATTGCTCGTTACTTTATTCCTGAGCTTCGTTGTTTTGGTTGCTTGCTGATTTGATCAGGGTATATTTTTCGAAGACAAAACCAGGCTTGGATCCACCCCACCAAGCTGCCACAACAAGTGCAACGAGTGTCATCCATGCATGGCGGTGCAAGTTCCTAGCATGCCTCGGATGGATTCGCATTCACCCTCGGCCTTGCCATTGGGATTTTTTGACTCATCCTCACAAGGGAACAGATACTCATTTTACAAGCCATTGGGTTGGAAATGCCGCTGTGGGAACCACTTCTTCAATCCTTGAGCCATATTTGCATGTCCAAAACACAGGGTTGGTCTCCCATTCTTCTGGTTTATGTAAATGACATTGAATTGTTTCTTTTTTTCATTATAAGGATGGAATTCTTTGTTCAGAGTGAAAGCAGGTCGTTTGGTTTCTGCATTTATTGTACATAAACCTTCTACCCTTAACCCC

mRNA sequence

AAGTTTCCATTTTTCTGCTGGGTGTGACTGATTTACTGTTGAATCTCAAAAGAAATCAGAAAGAAGTAATCATGTTTATGTTGATTAGATCTTAAAAAGCTACAGCAAAAAGGTATTTATTTTGGGTCTTTTGCAAAAATAATATGTGCGAGGATGTAATAGTTGGTAGGGAAAAATAGTATTAAAAAAACATGGAGAAGAACAAGGAAATACCCCCACAGCATATGTCATTAGGTAAAATCTCATCTGTACATATGAGTATACCTTATGCCTTCTTGAGAGTAGGAGTGACTGACTTACTCGATTCTATGCCTCAATGATGGACATAATATTTTTATTATGAACATCCTCTTCAACTTGAGAACCTTACAAGTTCTAATCCATCGACCTTTCGTTGAGTTATCCTTAGCATTGTATGAGCAAAACTCAATTTGTTTATAACCATAATTCTAGATCAATGAATTTATCAGATAATGTCTTCTGCTACTTTTGGCTTGGCCTGAGTTGATTTAGTTAGATTATACGAATATGTCTTTGCTTAGTCATGTCTAGTTCATGAAATCTGTACATGGAGACACACTCTGGTCTCTTGTTAGCTGCTTAGGGTAGTTAGATTATACCAATCGTTGTTTGCTTATCATATCTTGACCTCCCCAACTTGGCCTAAGTATCTAAACCTCCATCGCTTACAATTTATTTCTTTGTCGGGCCAACTCAAAAACTCAATGGCTATATACCTTCATCACTCGCTTGACTTATTGATGGTGTTTTGACAAACACCACGTTGATTCTAGTTTTGCTACTTATAGGCGGATGCTAGATCCCCACATCTACTATCATCGGGAAAATCAAGATTATACCTCTAAAAAAATAGAAAAAATGGAAGCAATGCAATTATTAATCTTAGAATTTTCAATACATTATTAGTTACAAACTCGAGAAAGGGACAAACCTCGTTTTAAGCACCTGTTCGTGCCACTATGAACAGCGTTCGAGATTCCGATATCCAAATAAAGTTTGCTTCATGATTTTTAAAGAAATTATACATTCTCCGTGAGAGCAAGGGTTTCAGAGTAGAGAGCCAAAATTCATTTCGGCATGGGTTCTCTTCCCTCGATTTCTTTCCCACTTCGTGAAAGAGTCCTACGAAGTTGAAAAAGCTAAAAGTACTCCTCTTGTGTTTTCTTCTTCTTTGGCCTTCAGAATCAGACGAGGGGTTCTGGAGTTGAGTTTGCTCACTGCGTCTCAGACTCTGTTTCTTGGAGTTTTGTAATGTTTTGGAATGGTCTGATGCATGGTTGATAATTTAGTTGACTAGTTGCCCTGCAAAAATAAATCAGCACATCTGGTGAGACAGGCGAGCAATGTGGCTGGGGACATCCGATCAAATTGATACAATACAGAGAAGATTAGCATGACCCTTACGCAAAGATACACGCACAAATTGAGAAATGTATCAAAATCAAACACTGTTTAGTGTGCTAGTGAGAATGCTGGGTTCTCAAGAGGATGAATTGTTAGATTCCACATCGGTTGTTAGATCCCAACCTCTCCATAGTAGATGCGTTTTAAAATCGTGAGGTTGACGACAATACGTAATTGATAAAAACGGACGATATTTGCTAGTGGTGGATTTCTTTTCACCTTTAACGAGCAAACACAACGTATTTTTTTTTATTATTATTATACATCATTATATTATATTTAGGATGGCTAGGATGTTTTAACATGACACTTTTTGATTCATTTTGGATCAATTAGGTGGCTACAGATCTATAGAATTTGATAATATTTTGCCCAGTTGGATGTTAGTGGGTTGTAGTACTAGGTTTTCTTTTTCCTTTTCTTGGATTTTGTAGTTAGGTTTAGGCTAAATTTGATTCTTTTAGCCGAGACATAAATATTTTCAAATACCTCATGAAGTTATAGCCAAGAGATAACTACATCTCCTTTATGAGCAACTTTGGTGCAATGAGGCATTTAAGGAAGAGTAATTGAGTAAATGCTAGTTTGGAGTTTAGTGATTCAAATCTGTGCTCGATATTCTTGAACATTGGTACATTCGTAGATGGGGAGAAGGGTTTTATGAACAAGAATAACAAATAAATTTGGTCCACAGTCAAATGTGAAATCTCGATGGTCCAATCCATCAGCTATTATTTCTTGCTTAGTTGCAAGATGAGATGAGGTAGTCGCTGCATGGCCATCAGCGCAGCGTTGAAATGTAAAATCAATGGGCAGTGAAGATTGAAGAAGAGTGAAACAAATGCAAAGTGGAGAGTAGGGGAGTAAATGCGGTCCCTTTGCAGCAGCCAGGCTAGTGATGGACAGCCTAATGTTGCATAGTGGAAGAATAGGGCGTCCTTTGGATGTGATGTGGATACTACTGTATTGTTCCTTTCACTGTCTGCTACACTCTGTCCTCTTACTCAGAACTTCTCTGCTTTCTCTCTTCTCTTGTTCTTGTCGGATAACTTATGACTCACCCACCCTTTCCTACCCAAATTTATGCTAACTCTCTCTCCTTTGCACTTGCCTCTATTAAATGTTCATCACTCTCTCTCTCTCTTAGTTACTTGTGGAGTGGTGTGGTGTGGTGTGTTGGGTGGAAGTGGATCAAAGTAACAGAAAAATCTGGCAACTGGGTCTTTATTGCATATGATTTTTTTACTCCTTTGCTGCAATGGATCCATGAGTTTTGGGCATCAATGACTTGATGACTACCATGAATTTGCCTACTGTTTGGGCCACCTCATTACTTATTGTGCTTCTTCATCTTCTTCTTTTTCTCTCCGCCTCTTCCCCTTCCAGGGGTATATTTTTCGAAGACAAAACCAGGCTTGGATCCACCCCACCAAGCTGCCACAACAAGTGCAACGAGTGTCATCCATGCATGGCGGTGCAAGTTCCTAGCATGCCTCGGATGGATTCGCATTCACCCTCGGCCTTGCCATTGGGATTTTTTGACTCATCCTCACAAGGGAACAGATACTCATTTTACAAGCCATTGGGTTGGAAATGCCGCTGTGGGAACCACTTCTTCAATCCTTGAGCCATATTTGCATGTCCAAAACACAGGGTTGGTCTCCCATTCTTCTGGTTTATGTAAATGACATTGAATTGTTTCTTTTTTTCATTATAAGGATGGAATTCTTTGTTCAGAGTGAAAGCAGGTCGTTTGGTTTCTGCATTTATTGTACATAAACCTTCTACCCTTAACCCC

Coding sequence (CDS)

ATGACTACCATGAATTTGCCTACTGTTTGGGCCACCTCATTACTTATTGTGCTTCTTCATCTTCTTCTTTTTCTCTCCGCCTCTTCCCCTTCCAGGGGTATATTTTTCGAAGACAAAACCAGGCTTGGATCCACCCCACCAAGCTGCCACAACAAGTGCAACGAGTGTCATCCATGCATGGCGGTGCAAGTTCCTAGCATGCCTCGGATGGATTCGCATTCACCCTCGGCCTTGCCATTGGGATTTTTTGACTCATCCTCACAAGGGAACAGATACTCATTTTACAAGCCATTGGGTTGGAAATGCCGCTGTGGGAACCACTTCTTCAATCCTTGA
BLAST of CmoCh07G005530.1 vs. Swiss-Prot
Match: EPFL1_ARATH (EPIDERMAL PATTERNING FACTOR-like protein 1 OS=Arabidopsis thaliana GN=EPFL1 PE=1 SV=1)

HSP 1 Score: 94.0 bits (232), Expect = 1.1e-18
Identity = 53/116 (45.69%), Postives = 67/116 (57.76%), Query Frame = 1

Query: 11  ATSLLIVLLHLLL-------FLSASSPS---RGIFFEDKTRLGSTPPSCHNKCNECHPCM 70
           +T LL+ L+ +LL       FL    P    +    EDK RLGSTPPSCHN+CN CHPCM
Sbjct: 7   STLLLLPLILILLITPQVSSFLQPIQPPISPQVALIEDKARLGSTPPSCHNRCNNCHPCM 66

Query: 71  AVQVPSMP-RMDSHSPSALPLGFFDSSSQ----GNRYSFYKPLGWKCRCGNHFFNP 112
           A+QVP++P R      +    GF    S      ++YS YKP+GWKC C  HF+NP
Sbjct: 67  AIQVPTLPTRSRFTRVNPFSGGFVRPPSSLTTVLDQYSNYKPMGWKCHCNGHFYNP 122

BLAST of CmoCh07G005530.1 vs. Swiss-Prot
Match: EPFL2_ARATH (EPIDERMAL PATTERNING FACTOR-like protein 2 OS=Arabidopsis thaliana GN=EPFL2 PE=2 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 8.1e-09
Identity = 28/76 (36.84%), Postives = 41/76 (53.95%), Query Frame = 1

Query: 42  LGSTPPSCHN-KCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSS-----SQGNRYSFY 101
           +GS PP C   +C  C  C A+QVP+ P+   HSP          +     ++G+  + Y
Sbjct: 53  IGSRPPRCERVRCRSCGHCEAIQVPTNPQTKLHSPLTTSSSSSSETIHLDYTRGDDSTNY 112

Query: 102 KPLGWKCRCGNHFFNP 112
           KP+ WKC+CGN  +NP
Sbjct: 113 KPMSWKCKCGNSIYNP 128

BLAST of CmoCh07G005530.1 vs. Swiss-Prot
Match: EPFL3_ARATH (EPIDERMAL PATTERNING FACTOR-like protein 3 OS=Arabidopsis thaliana GN=EPFL3 PE=1 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 1.3e-06
Identity = 24/66 (36.36%), Postives = 33/66 (50.00%), Query Frame = 1

Query: 39  KTRLGSTPPSCHNKCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSSSQGNRYSFYKPL 98
           + R+GS PPSC  KC  C PC A+Q P++  +   SP                Y+ Y+P 
Sbjct: 54  RRRIGSKPPSCEKKCYGCEPCEAIQFPTISSIPHLSP---------------HYANYQPE 104

Query: 99  GWKCRC 105
           GW+C C
Sbjct: 114 GWRCHC 104

BLAST of CmoCh07G005530.1 vs. Swiss-Prot
Match: EPFL4_ARATH (EPIDERMAL PATTERNING FACTOR-like protein 4 OS=Arabidopsis thaliana GN=EPFL4 PE=1 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 1.3e-06
Identity = 24/69 (34.78%), Postives = 35/69 (50.72%), Query Frame = 1

Query: 43  GSTPPSCHNKCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSSSQGNRYSFYKPLGWKC 102
           GS+PP+C +KC +C PC  V VP  P +      ++PL ++             P  W+C
Sbjct: 60  GSSPPTCRSKCGKCQPCKPVHVPIQPGL------SMPLEYY-------------PEAWRC 109

Query: 103 RCGNHFFNP 112
           +CGN  F P
Sbjct: 120 KCGNKLFMP 109

BLAST of CmoCh07G005530.1 vs. TrEMBL
Match: A0A0A0KGR1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499820 PE=4 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 5.9e-51
Identity = 101/113 (89.38%), Postives = 103/113 (91.15%), Query Frame = 1

Query: 1   MTTMNLPTVWATSLLIVLLHLLLFLSASSPSRGIFFEDKTRLGSTPPSCHNKCNECHPCM 60
           MTTMNL TVWATSLLIVLLHLLL LSASSP RG+FFEDKTRLGSTPPSCHNKCNECHPCM
Sbjct: 5   MTTMNLATVWATSLLIVLLHLLLLLSASSPPRGMFFEDKTRLGSTPPSCHNKCNECHPCM 64

Query: 61  AVQVPSMPRMDSH--SPSALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           AVQVPSMP   S   SPSALP+ FFDSSSQGNRYSFYKPLGWKCRCGNHFFNP
Sbjct: 65  AVQVPSMPGRASRLDSPSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 117

BLAST of CmoCh07G005530.1 vs. TrEMBL
Match: C6SZ30_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G134100 PE=2 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 3.5e-27
Identity = 68/115 (59.13%), Postives = 79/115 (68.70%), Query Frame = 1

Query: 12  TSLLIVLL-HLLLFLSASSPS---------RGIFFEDKTRLGSTPPSCHNKCNECHPCMA 71
           TSLLI+LL H L  LS +S S         R + FE+K RLGS PPSCHNKCN+CHPCMA
Sbjct: 13  TSLLIILLLHNLFSLSLASASNHPQPAISTRELLFEEKNRLGSIPPSCHNKCNDCHPCMA 72

Query: 72  VQVPSMPRMDSHSP-----SALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           VQVP++P  DS+ P     +A+      SS QGNRYS YKPLGWKC CG+HFFNP
Sbjct: 73  VQVPTLPSHDSNPPDLTKTAAMATFLNPSSPQGNRYSNYKPLGWKCHCGDHFFNP 127

BLAST of CmoCh07G005530.1 vs. TrEMBL
Match: A0A0B2RA79_GLYSO (EPIDERMAL PATTERNING FACTOR-like protein 1 OS=Glycine soja GN=glysoja_008379 PE=4 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 3.5e-27
Identity = 68/115 (59.13%), Postives = 79/115 (68.70%), Query Frame = 1

Query: 12  TSLLIVLL-HLLLFLSASSPS---------RGIFFEDKTRLGSTPPSCHNKCNECHPCMA 71
           TSLLI+LL H L  LS +S S         R + FE+K RLGS PPSCHNKCN+CHPCMA
Sbjct: 13  TSLLIILLLHNLFSLSLASASNHPQPAISTRELLFEEKNRLGSIPPSCHNKCNDCHPCMA 72

Query: 72  VQVPSMPRMDSHSP-----SALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           VQVP++P  DS+ P     +A+      SS QGNRYS YKPLGWKC CG+HFFNP
Sbjct: 73  VQVPTLPSHDSNPPDLTKTAAMATFLNPSSPQGNRYSNYKPLGWKCHCGDHFFNP 127

BLAST of CmoCh07G005530.1 vs. TrEMBL
Match: V7CQQ4_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G257600g PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 6.0e-27
Identity = 68/113 (60.18%), Postives = 79/113 (69.91%), Query Frame = 1

Query: 12  TSLLIVL-LHLLLFLSASS-------PSRGIFFEDKTRLGSTPPSCHNKCNECHPCMAVQ 71
           TSLLIVL LH  L L ++S        +R + FE+K RLGS PPSCHNKCN+CHPCMAVQ
Sbjct: 13  TSLLIVLVLHNFLSLVSASNHPHTSISTRELLFEEKNRLGSIPPSCHNKCNDCHPCMAVQ 72

Query: 72  VPSMPRMDSHSP-----SALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           VP++P  DS+ P     SA+      SS QGNRYS YKPLGWKC CG+HFFNP
Sbjct: 73  VPTLPSHDSNPPDLTKTSAMASFLNPSSPQGNRYSNYKPLGWKCHCGDHFFNP 125

BLAST of CmoCh07G005530.1 vs. TrEMBL
Match: F6HJL0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0391g00030 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 7.8e-27
Identity = 66/126 (52.38%), Postives = 87/126 (69.05%), Query Frame = 1

Query: 1   MTTMNLPTVWATSLLIVLLHLLLFLSA-------SSPSRGIFFEDKTRLGSTPPSCHNKC 60
           M ++    ++  +++  +LHLLL  ++       S  +RG+ FE+KTRLGSTPPSCHNKC
Sbjct: 1   MNSLGSYLLYTITVVTAVLHLLLSPASCFSQQQGSDSTRGLLFEEKTRLGSTPPSCHNKC 60

Query: 61  NECHPCMAVQVPSMPRMDSHS------PSALPLGFFDS--SSQGNRYSFYKPLGWKCRCG 112
           NECHPCMAVQVP++P   SHS       + +P+ FF+S  S+  NRYS YKPLGWKC CG
Sbjct: 61  NECHPCMAVQVPTLP---SHSGPRLGLTATVPMEFFESSPSTAANRYSNYKPLGWKCHCG 120

BLAST of CmoCh07G005530.1 vs. TAIR10
Match: AT5G10310.1 (AT5G10310.1 unknown protein)

HSP 1 Score: 94.0 bits (232), Expect = 6.3e-20
Identity = 53/116 (45.69%), Postives = 67/116 (57.76%), Query Frame = 1

Query: 11  ATSLLIVLLHLLL-------FLSASSPS---RGIFFEDKTRLGSTPPSCHNKCNECHPCM 70
           +T LL+ L+ +LL       FL    P    +    EDK RLGSTPPSCHN+CN CHPCM
Sbjct: 7   STLLLLPLILILLITPQVSSFLQPIQPPISPQVALIEDKARLGSTPPSCHNRCNNCHPCM 66

Query: 71  AVQVPSMP-RMDSHSPSALPLGFFDSSSQ----GNRYSFYKPLGWKCRCGNHFFNP 112
           A+QVP++P R      +    GF    S      ++YS YKP+GWKC C  HF+NP
Sbjct: 67  AIQVPTLPTRSRFTRVNPFSGGFVRPPSSLTTVLDQYSNYKPMGWKCHCNGHFYNP 122

BLAST of CmoCh07G005530.1 vs. TAIR10
Match: AT4G37810.1 (AT4G37810.1 unknown protein)

HSP 1 Score: 61.2 bits (147), Expect = 4.6e-10
Identity = 28/76 (36.84%), Postives = 41/76 (53.95%), Query Frame = 1

Query: 42  LGSTPPSCHN-KCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSS-----SQGNRYSFY 101
           +GS PP C   +C  C  C A+QVP+ P+   HSP          +     ++G+  + Y
Sbjct: 53  IGSRPPRCERVRCRSCGHCEAIQVPTNPQTKLHSPLTTSSSSSSETIHLDYTRGDDSTNY 112

Query: 102 KPLGWKCRCGNHFFNP 112
           KP+ WKC+CGN  +NP
Sbjct: 113 KPMSWKCKCGNSIYNP 128

BLAST of CmoCh07G005530.1 vs. TAIR10
Match: AT3G13898.1 (AT3G13898.1 unknown protein)

HSP 1 Score: 53.9 bits (128), Expect = 7.3e-08
Identity = 24/66 (36.36%), Postives = 33/66 (50.00%), Query Frame = 1

Query: 39  KTRLGSTPPSCHNKCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSSSQGNRYSFYKPL 98
           + R+GS PPSC  KC  C PC A+Q P++  +   SP                Y+ Y+P 
Sbjct: 54  RRRIGSKPPSCEKKCYGCEPCEAIQFPTISSIPHLSP---------------HYANYQPE 104

Query: 99  GWKCRC 105
           GW+C C
Sbjct: 114 GWRCHC 104

BLAST of CmoCh07G005530.1 vs. TAIR10
Match: AT4G14723.1 (AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1))

HSP 1 Score: 53.9 bits (128), Expect = 7.3e-08
Identity = 24/69 (34.78%), Postives = 35/69 (50.72%), Query Frame = 1

Query: 43  GSTPPSCHNKCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSSSQGNRYSFYKPLGWKC 102
           GS+PP+C +KC +C PC  V VP  P +      ++PL ++             P  W+C
Sbjct: 60  GSSPPTCRSKCGKCQPCKPVHVPIQPGL------SMPLEYY-------------PEAWRC 109

Query: 103 RCGNHFFNP 112
           +CGN  F P
Sbjct: 120 KCGNKLFMP 109

BLAST of CmoCh07G005530.1 vs. TAIR10
Match: AT2G30370.1 (AT2G30370.1 allergen-related)

HSP 1 Score: 47.4 bits (111), Expect = 6.8e-06
Identity = 24/70 (34.29%), Postives = 31/70 (44.29%), Query Frame = 1

Query: 42  LGSTPPSCHNKCNECHPCMAVQVPSMPRMDSHSPSALPLGFFDSSSQGNRYSFYKPLGWK 101
           LGS+PP C +KC  C PC  V VP         P   P+            + Y P  W+
Sbjct: 180 LGSSPPRCSSKCGRCTPCKPVHVP--------VPPGTPV-----------TAEYYPEAWR 230

Query: 102 CRCGNHFFNP 112
           C+CGN  + P
Sbjct: 240 CKCGNKLYMP 230

BLAST of CmoCh07G005530.1 vs. NCBI nr
Match: gi|700193534|gb|KGN48738.1| (hypothetical protein Csa_6G499820 [Cucumis sativus])

HSP 1 Score: 208.0 bits (528), Expect = 8.5e-51
Identity = 101/113 (89.38%), Postives = 103/113 (91.15%), Query Frame = 1

Query: 1   MTTMNLPTVWATSLLIVLLHLLLFLSASSPSRGIFFEDKTRLGSTPPSCHNKCNECHPCM 60
           MTTMNL TVWATSLLIVLLHLLL LSASSP RG+FFEDKTRLGSTPPSCHNKCNECHPCM
Sbjct: 5   MTTMNLATVWATSLLIVLLHLLLLLSASSPPRGMFFEDKTRLGSTPPSCHNKCNECHPCM 64

Query: 61  AVQVPSMPRMDSH--SPSALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           AVQVPSMP   S   SPSALP+ FFDSSSQGNRYSFYKPLGWKCRCGNHFFNP
Sbjct: 65  AVQVPSMPGRASRLDSPSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 117

BLAST of CmoCh07G005530.1 vs. NCBI nr
Match: gi|659080050|ref|XP_008440584.1| (PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Cucumis melo])

HSP 1 Score: 206.1 bits (523), Expect = 3.2e-50
Identity = 101/113 (89.38%), Postives = 102/113 (90.27%), Query Frame = 1

Query: 1   MTTMNLPTVWATSLLIVLLHLLLFLSASSPSRGIFFEDKTRLGSTPPSCHNKCNECHPCM 60
           MTTMNL TVWATSLLIVLLHLLL LSASSP RGIFFEDKTRLGSTPPSCHNKCNECHPCM
Sbjct: 22  MTTMNLATVWATSLLIVLLHLLLLLSASSPPRGIFFEDKTRLGSTPPSCHNKCNECHPCM 81

Query: 61  AVQVPSMPRMDSH--SPSALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           AVQVPSMP   S   S SALP+ FFDSSSQGNRYSFYKPLGWKCRCGNHFFNP
Sbjct: 82  AVQVPSMPGRASRLDSSSALPMRFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 134

BLAST of CmoCh07G005530.1 vs. NCBI nr
Match: gi|645219144|ref|XP_008234219.1| (PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Prunus mume])

HSP 1 Score: 130.2 bits (326), Expect = 2.3e-27
Identity = 68/121 (56.20%), Postives = 85/121 (70.25%), Query Frame = 1

Query: 2   TTMNLPT--VWATSLLIVLLHLLLFLSASSPS----RGIFFEDKTRLGSTPPSCHNKCNE 61
           T MN P   ++  +  I+LLH LL   AS  S    RG+ FE+KTRLGSTPPSCH+KCN+
Sbjct: 5   TAMNNPLYLLFKATRAIMLLHFLLLSPASCFSTHSLRGLLFEEKTRLGSTPPSCHSKCNQ 64

Query: 62  CHPCMAVQVPSMP---RMDSHSPSALPLGFFDSSSQG--NRYSFYKPLGWKCRCGNHFFN 112
           CHPCMAVQVP++P   R+++    + P+ FFD S  G  N+YS YKPLGWKC CG+HFFN
Sbjct: 65  CHPCMAVQVPTIPSHDRVETGMTRSFPMMFFDPSHPGTNNKYSNYKPLGWKCHCGDHFFN 124

BLAST of CmoCh07G005530.1 vs. NCBI nr
Match: gi|951005401|ref|XP_014508135.1| (PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Vigna radiata var. radiata])

HSP 1 Score: 129.8 bits (325), Expect = 3.0e-27
Identity = 68/125 (54.40%), Postives = 83/125 (66.40%), Query Frame = 1

Query: 1   MTTMNLPTVWATSLLIVLLHLLLFLSASSPS---------RGIFFEDKTRLGSTPPSCHN 60
           M ++N    + T+ L+++L L  FLS  S S         R + FE+K RLGS PPSCHN
Sbjct: 1   MASLNSYHYYTTTSLLIVLLLHNFLSLVSASNHPHTAISPRELLFEEKNRLGSIPPSCHN 60

Query: 61  KCNECHPCMAVQVPSMPRMDSHSP-----SALPLGFFDSSSQGNRYSFYKPLGWKCRCGN 112
           KCN+CHPCMAVQVP++P  DS+ P     SA+   F  SS QGNRYS YKPLGWKC CG+
Sbjct: 61  KCNDCHPCMAVQVPTLPSHDSNPPDLTKTSAMATFFNPSSPQGNRYSNYKPLGWKCHCGD 120

BLAST of CmoCh07G005530.1 vs. NCBI nr
Match: gi|470103522|ref|XP_004288185.1| (PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 129.0 bits (323), Expect = 5.0e-27
Identity = 65/109 (59.63%), Postives = 76/109 (69.72%), Query Frame = 1

Query: 11  ATSLLIVLLHLLLFLSASS---PSRGIFFEDKTRLGSTPPSCHNKCNECHPCMAVQVPSM 70
           AT   IVLL LLL LS +S     RG+ F +K RLGS PPSCHNKCN+CHPCMAVQVP+M
Sbjct: 25  ATRATIVLLLLLLVLSPASCFTSLRGLLFAEKARLGSKPPSCHNKCNQCHPCMAVQVPTM 84

Query: 71  PRMDSHSP-----SALPLGFFDSSSQGNRYSFYKPLGWKCRCGNHFFNP 112
           P  D   P      A P+  FD + + NRYS YKPLGWKC+CG+HF+NP
Sbjct: 85  PSHDRVKPVGKTRPAKPMMLFDPAHRDNRYSNYKPLGWKCQCGDHFYNP 133

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
EPFL1_ARATH1.1e-1845.69EPIDERMAL PATTERNING FACTOR-like protein 1 OS=Arabidopsis thaliana GN=EPFL1 PE=1... [more]
EPFL2_ARATH8.1e-0936.84EPIDERMAL PATTERNING FACTOR-like protein 2 OS=Arabidopsis thaliana GN=EPFL2 PE=2... [more]
EPFL3_ARATH1.3e-0636.36EPIDERMAL PATTERNING FACTOR-like protein 3 OS=Arabidopsis thaliana GN=EPFL3 PE=1... [more]
EPFL4_ARATH1.3e-0634.78EPIDERMAL PATTERNING FACTOR-like protein 4 OS=Arabidopsis thaliana GN=EPFL4 PE=1... [more]
Match NameE-valueIdentityDescription
A0A0A0KGR1_CUCSA5.9e-5189.38Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499820 PE=4 SV=1[more]
C6SZ30_SOYBN3.5e-2759.13Uncharacterized protein OS=Glycine max GN=GLYMA_08G134100 PE=2 SV=1[more]
A0A0B2RA79_GLYSO3.5e-2759.13EPIDERMAL PATTERNING FACTOR-like protein 1 OS=Glycine soja GN=glysoja_008379 PE=... [more]
V7CQQ4_PHAVU6.0e-2760.18Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G257600g PE=4 SV=1[more]
F6HJL0_VITVI7.8e-2752.38Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0391g00030 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT5G10310.16.3e-2045.69 unknown protein[more]
AT4G37810.14.6e-1036.84 unknown protein[more]
AT3G13898.17.3e-0836.36 unknown protein[more]
AT4G14723.17.3e-0834.78 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:A... [more]
AT2G30370.16.8e-0634.29 allergen-related[more]
Match NameE-valueIdentityDescription
gi|700193534|gb|KGN48738.1|8.5e-5189.38hypothetical protein Csa_6G499820 [Cucumis sativus][more]
gi|659080050|ref|XP_008440584.1|3.2e-5089.38PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Cucumis melo][more]
gi|645219144|ref|XP_008234219.1|2.3e-2756.20PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Prunus mume][more]
gi|951005401|ref|XP_014508135.1|3.0e-2754.40PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Vigna radiata var. radiat... [more]
gi|470103522|ref|XP_004288185.1|5.0e-2759.63PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 1 [Fragaria vesca subsp. ves... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010374 stomatal complex development
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh07G005530CmoCh07G005530gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh07G005530.1CmoCh07G005530.1-proteinpolypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh07G005530.1.three_prime_UTR.1CmoCh07G005530.1.three_prime_UTR.1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh07G005530.1.CDS.2CmoCh07G005530.1.CDS.2CDS
CmoCh07G005530.1.CDS.1CmoCh07G005530.1.CDS.1CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh07G005530.1.five_prime_UTR.1CmoCh07G005530.1.five_prime_UTR.1five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh07G005530.1.exon.2CmoCh07G005530.1.exon.2exon
CmoCh07G005530.1.exon.1CmoCh07G005530.1.exon.1exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33109FAMILY NOT NAMEDcoord: 86..111
score: 5.4E-39coord: 14..68
score: 5.4
NoneNo IPR availablePANTHERPTHR33109:SF3EPIDERMAL PATTERNING FACTOR-LIKE PROTEIN 1coord: 14..68
score: 5.4E-39coord: 86..111
score: 5.4
NoneNo IPR availablePFAMPF17181EPFcoord: 41..111
score: 9.9