CmaCh01G015740 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh01G015740
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: transcription initiation factor TFIID subunit 7-like
Location: Cma_Chr01: 10878638 .. 10879836 (-)
RNA-Seq Expression: CmaCh01G015740
Synteny: CmaCh01G015740
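
The Location field packs the chromosome, start, end, and strand into one string. As a minimal sketch of how such a string might be unpacked, the snippet below parses it and derives the span of the gene. The names `Location` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cma_Chr01: 10878638 .. 10879836 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cma_Chr01: 10878638 .. 10879836 (-)")
print(loc.seqid, loc.strand, loc.length)  # Cma_Chr01 - 1199
```

Under that assumption the gene spans 1199 bp on the minus strand of Cma_Chr01.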
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAGTCCCCCCCTTATCCCCTTCTCTCTTTCTGTCTCTCCATCTTCTCTCTCTCCCTATCAGAAAAGTCGGTACCAATCTGCCGCCGGTTGATGACAACGAACCCTAACCCTTGGGCGTTGCCGGCGAGGAGAAGCAGAGCGGAGGAAGGTTCAGAAATCTCTACTGCGACGGACTTGATCTCCCGCCTGTGAGTGATGGAGGTTTTGCTCGGTTCTCCGACGTTCAGCATCGAAGTCCCGTCGGAGAACTCCTCCGCCGTCGCGGAGACTCATAATCGGGACGGGTCGGGTTTTCGCGAGTCCGGATCTGGTAGTTCGATTGGGGAAAACTCGTCGGAGTCGTCGTCGATTGGAGTTCCCGACGACGATTCCGACGACGACGACGAGGTGCAGAGCAAGCTAAAGGAAGGAGGATTGCTCGGATTGGATTCTCTTGAAGACGCTCTTGTAATCAAGTGAGTTTCTCTATCCCTATCTCTTAAATTTTCGATCTCCTATAACCTAGAAATCGAAAATTGAAGCCGAAAACGGCGATTGAACCCTAAAATTTCCGATTAATCACACGAATTAAAGATTTAATACTCTGTAATGTGATGGAAATCCTCGGGGGGATTGGAATTGTAGAGGAGGCTTATCGAGGCATTTCTCGGGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGCCGGATAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCAAAATGGTCGAGAAAAGCCTCATTCTACAGCTGGCGAAACCCTAAATCGATGCCTCTGCTTGCCCTAAACGAAGACGAAGAAGAACAACAACCGGCGGATGGTTCCGATTCAGAGGAAAGAAATCGAGAGAGCGACGAAGATGATGATCAAAACGAACGAAGAAGACAAACCCTAGGGCAAAGGTACCACGATCGGAAGCTCGTTAATGGCTTCAAATCAATGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAACGCGCAGTGTAATGAAATCGTTTGGCCTTTCTTCACTTATTGCTCTGAACAAAAAGAAAAGGAATTTTTTTTTTCTATTTTTCTTTTTTATTTTTTCTTTCTATGTTTCATGAGTCCATGTAGATAAAATTAAAAAATGTATTTGGAAATTATTTCTATATTTTTATTGTTTAATTATCTAATTTTA

mRNA sequence

TAGTCCCCCCCTTATCCCCTTCTCTCTTTCTGTCTCTCCATCTTCTCTCTCTCCCTATCAGAAAAGTCGGTACCAATCTGCCGCCGGTTGATGACAACGAACCCTAACCCTTGGGCGTTGCCGGCGAGGAGAAGCAGAGCGGAGGAAGGTTCAGAAATCTCTACTGCGACGGACTTGATCTCCCGCCTGTGAGTGATGGAGGTTTTGCTCGGTTCTCCGACGTTCAGCATCGAAGTCCCGTCGGAGAACTCCTCCGCCGTCGCGGAGACTCATAATCGGGACGGGTCGGGTTTTCGCGAGTCCGGATCTGGTAGTTCGATTGGGGAAAACTCGTCGGAGTCGTCGTCGATTGGAGTTCCCGACGACGATTCCGACGACGACGACGAGGTGCAGAGCAAGCTAAAGGAAGGAGGATTGCTCGGATTGGATTCTCTTGAAGACGCTCTTGTAATCAAAGGAGGCTTATCGAGGCATTTCTCGGGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGCCGGATAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCAAAATGGTCGAGAAAAGCCTCATTCTACAGCTGGCGAAACCCTAAATCGATGCCTCTGCTTGCCCTAAACGAAGACGAAGAAGAACAACAACCGGCGGATGGTTCCGATTCAGAGGAAAGAAATCGAGAGAGCGACGAAGATGATGATCAAAACGAACGAAGAAGACAAACCCTAGGGCAAAGGTACCACGATCGGAAGCTCGTTAATGGCTTCAAATCAATGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAACGCGCAGTGTAATGAAATCGTTTGGCCTTTCTTCACTTATTGCTCTGAACAAAAAGAAAAGGAATTTTTTTTTTCTATTTTTCTTTTTTATTTTTTCTTTCTATGTTTCATGAGTCCATGTAGATAAAATTAAAAAATGTATTTGGAAATTATTTCTATATTTTTATTGTTTAATTATCTAATTTTA

Coding sequence (CDS)

ATGGAGGTTTTGCTCGGTTCTCCGACGTTCAGCATCGAAGTCCCGTCGGAGAACTCCTCCGCCGTCGCGGAGACTCATAATCGGGACGGGTCGGGTTTTCGCGAGTCCGGATCTGGTAGTTCGATTGGGGAAAACTCGTCGGAGTCGTCGTCGATTGGAGTTCCCGACGACGATTCCGACGACGACGACGAGGTGCAGAGCAAGCTAAAGGAAGGAGGATTGCTCGGATTGGATTCTCTTGAAGACGCTCTTGTAATCAAAGGAGGCTTATCGAGGCATTTCTCGGGGAAATCGAAGTCGTTTGCGAATTTATCAGAGGTGATTCAAGTGAAAGATTTAGAGAAGCCGGATAATCCTTTCAACAAGAGGAGAAGAATTTTAATGGCGTCAAAATGGTCGAGAAAAGCCTCATTCTACAGCTGGCGAAACCCTAAATCGATGCCTCTGCTTGCCCTAAACGAAGACGAAGAAGAACAACAACCGGCGGATGGTTCCGATTCAGAGGAAAGAAATCGAGAGAGCGACGAAGATGATGATCAAAACGAACGAAGAAGACAAACCCTAGGGCAAAGGTACCACGATCGGAAGCTCGTTAATGGCTTCAAATCAATGAGCTGTTTTGATCTGCAAGAATATGAACAGCAATAA

Protein sequence

MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
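
The protein sequence above corresponds to a codon-by-codon reading of the CDS. The following is a minimal sketch of that translation using the standard genetic code. The function name `translate` and its `to_stop` parameter are illustrative choices, not part of the database; a library such as Biopython would normally be used instead.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the polypeptide
        protein.append(aa)
    return "".join(protein)


# First six and last four codons of the CDS listed above.
print(translate("ATGGAGGTTTTGCTCGGT"))  # MEVLLG
print(translate("GAACAGCAATAA"))        # EQQ
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown in this record, provided the annotation uses the standard code.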
Homology
BLAST of CmaCh01G015740 vs. ExPASy TrEMBL
Match: A0A6J1J0I5 (uncharacterized protein LOC111480811 OS=Cucurbita maxima OX=3661 GN=LOC111480811 PE=4 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 1.1e-108
Identity = 215/215 (100.00%), Positives = 215/215 (100.00%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60
           MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD
Sbjct: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60

Query: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120
           DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF
Sbjct: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120

Query: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180
           NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ
Sbjct: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180

Query: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
           NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
Sbjct: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 215

BLAST of CmaCh01G015740 vs. ExPASy TrEMBL
Match: A0A6J1FKY0 (uncharacterized protein LOC111446357 OS=Cucurbita moschata OX=3662 GN=LOC111446357 PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.6e-104
Identity = 210/215 (97.67%), Positives = 212/215 (98.60%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60
           MEVLLGSPTFSIEVP ENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS 
Sbjct: 1   MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS- 60

Query: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120
           DDDEVQSK KEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF
Sbjct: 61  DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120

Query: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180
           NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEE+QQPA+GSDSEERNRESDEDDDQ
Sbjct: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQ 180

Query: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
           NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
Sbjct: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 214

BLAST of CmaCh01G015740 vs. ExPASy TrEMBL
Match: A0A6J1FA94 (uncharacterized protein LOC111443706 OS=Cucurbita moschata OX=3662 GN=LOC111443706 PE=4 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 1.2e-70
Identity = 166/231 (71.86%), Positives = 184/231 (79.65%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPS-----------ENSSAVAETHNRDGSGFRESGSGSSIGENSS-E 60
           MEV+ G PTF+IEV +           EN +AV ET NR  + FR SGSGSSIGENSS  
Sbjct: 1   MEVMFGPPTFNIEVAAATAFDGVSLTPENPAAVVETQNRGRTSFRGSGSGSSIGENSSGS 60

Query: 61  SSSIGVPDDDSDDD---DEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLS 120
           SSSIGVPD DSDDD    EVQSK KEGGL  LDSLEDAL IK GLS HFSGKSKSFANLS
Sbjct: 61  SSSIGVPDGDSDDDGGSGEVQSKSKEGGLCRLDSLEDALPIKRGLSSHFSGKSKSFANLS 120

Query: 121 EVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEE---QQPA 180
           EVIQVKDLEKP+NPFNKR+RILMASKWSRKASFY+W NPKSMPLLAL+EDEEE   ++ A
Sbjct: 121 EVIQVKDLEKPENPFNKRKRILMASKWSRKASFYNWPNPKSMPLLALDEDEEEKYHKEAA 180

Query: 181 DGSDSEERNR---ESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQ 211
            GSDSE+R+R   E DE+D++NERR +TLG R+HDRKLVNGFKS SCFDLQ
Sbjct: 181 AGSDSEDRDRGSDEEDEEDEENERRGRTLGHRFHDRKLVNGFKSKSCFDLQ 231

BLAST of CmaCh01G015740 vs. ExPASy TrEMBL
Match: A0A6J1ILJ2 (uncharacterized protein LOC111476326 OS=Cucurbita maxima OX=3661 GN=LOC111476326 PE=4 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 2.0e-70
Identity = 165/234 (70.51%), Positives = 185/234 (79.06%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPS-----------ENSSAVAETHNRDGSGFRESGSGSSIGENSS-E 60
           MEV+ G PTF++EV +           EN +AV ET NR  +GFR SGS SSIGENSS  
Sbjct: 1   MEVMFGPPTFNVEVAAATAFDGVSLTPENPAAVVETQNRGRTGFRGSGSDSSIGENSSGS 60

Query: 61  SSSIGVPDDDSDDD---DEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLS 120
           SSSIGVPD DSDDD    EVQSK KEGGL  LDSLEDAL IK GLS HFSGKSKSFANLS
Sbjct: 61  SSSIGVPDGDSDDDGGSGEVQSKSKEGGLCRLDSLEDALPIKRGLSSHFSGKSKSFANLS 120

Query: 121 EVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEE---QQPA 180
           EVIQVKDLEKP+NPFNKR+RILMASKWSRKASFY+W NPKSMPLLAL+EDEEE   ++ A
Sbjct: 121 EVIQVKDLEKPENPFNKRKRILMASKWSRKASFYNWPNPKSMPLLALDEDEEEKYHKEAA 180

Query: 181 DGSDSEERNR------ESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQ 211
            GSDSE+R+R      E DE+D++NERRR+TLG R+HD+KLVNGFKS SCFDLQ
Sbjct: 181 AGSDSEDRDRGSDEEDEEDEEDEENERRRRTLGHRFHDQKLVNGFKSKSCFDLQ 234

BLAST of CmaCh01G015740 vs. ExPASy TrEMBL
Match: A0A1S3CQX1 (uncharacterized protein LOC103503753 OS=Cucumis melo OX=3656 GN=LOC103503753 PE=4 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 2.2e-69
Identity = 170/231 (73.59%), Positives = 181/231 (78.35%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEV-----------PSENSSAVAETHNRDGSGFRESGSGSSIGENSSE- 60
           MEVLLG PTFSIEV           PSEN SA AE  N   SGF  SGSGSSIGENSSE 
Sbjct: 1   MEVLLGPPTFSIEVPPPSAFSGVSLPSENPSA-AEAQNLARSGFLRSGSGSSIGENSSES 60

Query: 61  SSSIGVPDDDSDDD---DEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLS 120
           SSSIGVPD DSDDD   DEVQSK KEGGL GL+SLE AL IK GLS HFSGKSKSFANLS
Sbjct: 61  SSSIGVPDGDSDDDGGGDEVQSKRKEGGLCGLESLEKALPIKRGLSSHFSGKSKSFANLS 120

Query: 121 EVIQVKDLEKPDNPFNKRRRILMASKWSR-KASFYSWRNPKSMPLLALNEDEEEQQPADG 180
           EVIQVKDLEKP+NPFNKRRRILMASKWSR KASFY+W NPKSMPLLALNE++E++Q  DG
Sbjct: 121 EVIQVKDLEKPENPFNKRRRILMASKWSRKKASFYNWPNPKSMPLLALNENDEQKQEEDG 180

Query: 181 SDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
            DS E   ESDE+D+    RR+ LGQR+HD KLVNGFK  SCFDLQE EQQ
Sbjct: 181 KDSGE---ESDEEDEGKGGRRRNLGQRFHDGKLVNGFKFKSCFDLQECEQQ 227

BLAST of CmaCh01G015740 vs. NCBI nr
Match: XP_022981755.1 (uncharacterized protein LOC111480811 [Cucurbita maxima])

HSP 1 Score: 402.5 bits (1033), Expect = 2.3e-108
Identity = 215/215 (100.00%), Positives = 215/215 (100.00%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60
           MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD
Sbjct: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60

Query: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120
           DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF
Sbjct: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120

Query: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180
           NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ
Sbjct: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180

Query: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
           NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
Sbjct: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 215

BLAST of CmaCh01G015740 vs. NCBI nr
Match: XP_022940912.1 (uncharacterized protein LOC111446357 [Cucurbita moschata] >KAG7037565.1 hypothetical protein SDJN02_01193 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 388.7 bits (997), Expect = 3.4e-104
Identity = 210/215 (97.67%), Positives = 212/215 (98.60%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60
           MEVLLGSPTFSIEVP ENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS 
Sbjct: 1   MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDS- 60

Query: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120
           DDDEVQSK KEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF
Sbjct: 61  DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120

Query: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180
           NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEE+QQPA+GSDSEERNRESDEDDDQ
Sbjct: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQ 180

Query: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
           NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
Sbjct: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 214

BLAST of CmaCh01G015740 vs. NCBI nr
Match: XP_023524376.1 (uncharacterized protein LOC111788285 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 388.7 bits (997), Expect = 3.4e-104
Identity = 210/215 (97.67%), Positives = 212/215 (98.60%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60
           MEVLLGSPTFSIEVPSENSSAVAETHNRDG GFRESGSGSSIGENSSESSSIGVPDDDS 
Sbjct: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGLGFRESGSGSSIGENSSESSSIGVPDDDS- 60

Query: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120
           DDDEVQSK KEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF
Sbjct: 61  DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120

Query: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180
           NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEE+QQPA+GSDSEERNRESDEDDDQ
Sbjct: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQ 180

Query: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
           NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ
Sbjct: 181 NERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 214

BLAST of CmaCh01G015740 vs. NCBI nr
Match: KAG6608207.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 337.4 bits (864), Expect = 8.9e-89
Identity = 185/191 (96.86%), Positives = 188/191 (98.43%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSAVAETHNRDGSGFRESGSGSSIGENSSESSSIGVPDDDSD 60
           MEVLLGSPTFSIEVP ENSSAVAETHNRDGSGFRESGSGSSIGENSS+SSSIGVPDDDS 
Sbjct: 1   MEVLLGSPTFSIEVPLENSSAVAETHNRDGSGFRESGSGSSIGENSSDSSSIGVPDDDS- 60

Query: 61  DDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120
           DDDEVQSK KEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF
Sbjct: 61  DDDEVQSKPKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKDLEKPDNPF 120

Query: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQ 180
           NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEE+QQPA+GSDSEERNRESDEDDDQ
Sbjct: 121 NKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEQQQPAEGSDSEERNRESDEDDDQ 180

Query: 181 NERRRQTLGQR 192
           NERRRQTLGQR
Sbjct: 181 NERRRQTLGQR 190

BLAST of CmaCh01G015740 vs. NCBI nr
Match: XP_038898503.1 (uncharacterized protein LOC120086121 [Benincasa hispida])

HSP 1 Score: 288.1 bits (736), Expect = 6.2e-74
Identity = 172/231 (74.46%), Positives = 189/231 (81.82%), Query Frame = 0

Query: 1   MEVLLGSPTFSIEVPSENSSA----------VAETHNRDGSGFRESGSGSSIGENSS-ES 60
           MEVL G PTFSIEVP   + A            ET NR  SGFRESGSGSSIGENSS  S
Sbjct: 1   MEVLFG-PTFSIEVPPPTAFAGVSIPLENPSPPETQNRARSGFRESGSGSSIGENSSASS 60

Query: 61  SSIGVPDDDSDDD---DEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSE 120
           SSIG+PD DSDDD   DEVQSK  EGGL GL+SLE+AL IK GLS HFSGKSKSFANLSE
Sbjct: 61  SSIGIPDADSDDDGDTDEVQSKPMEGGLCGLESLEEALPIKRGLSSHFSGKSKSFANLSE 120

Query: 121 VIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEE--QQPADG 180
           VIQVKDLEKP+NPFNKRRRILMASKWSRKASFY+W NPKSMPLLALNEDEEE  ++ ++ 
Sbjct: 121 VIQVKDLEKPENPFNKRRRILMASKWSRKASFYNWPNPKSMPLLALNEDEEEDRKEASEE 180

Query: 181 SDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYEQQ 216
           SDSE+ + ESDE+D++ ERRR+TLGQR+HDRKLVNGFKS SCFDLQEYEQQ
Sbjct: 181 SDSEDGDGESDEEDEEKERRRRTLGQRFHDRKLVNGFKSKSCFDLQEYEQQ 230

BLAST of CmaCh01G015740 vs. TAIR 10
Match: AT5G24890.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24550.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 117.9 bits (294), Expect = 1.0e-26
Identity = 92/231 (39.83%), Positives = 133/231 (57.58%), Query Frame = 0

Query: 4   LLGSPTFSIEV------------PSENSSAVAETHNRDG---SGFRESGSGSSIGENSSE 63
           L+  PTFSIEV             + +SS+  ET N +G   SG     SG +  + SS+
Sbjct: 3   LMAKPTFSIEVSQYGTTDLPATEKASSSSSSFETTNEEGVEESGLSRIWSGQT-ADYSSD 62

Query: 64  SSSIGVPDDDSDDDDEVQSK-----LKEGGLLGL---DSLEDALVIKGGLSRHFSGKSKS 123
           SSSIG P D  +D++E +++      KE GL GL    SLED+L  K GLS H+ GKSKS
Sbjct: 63  SSSIGTPGDSEEDEEESENENDDVSSKELGLRGLASMSSLEDSLPSKRGLSNHYKGKSKS 122

Query: 124 FANLSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYSWRNPKSMPLLALNEDEEEQQ 183
           F NL E+  VK++ K +NP NKRRR+ + +K +RK SFYSW+NPKSMPLL +NEDE++  
Sbjct: 123 FGNLGEIGSVKEVAKQENPLNKRRRLQICNKLARK-SFYSWQNPKSMPLLPVNEDEDDDD 182

Query: 184 PADGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQE 212
             D  +  +   + ++     E  ++ + ++   +     +KS SCF L +
Sbjct: 183 EDDDEEDLKSGFDENKSSSDEEGVKKVVVRKGSFKN--RAYKSRSCFALSD 229

BLAST of CmaCh01G015740 vs. TAIR 10
Match: AT2G24550.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31510.1); Has 219 Blast hits to 219 proteins in 33 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 2; Plants - 184; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 97.8 bits (242), Expect = 1.1e-20
Identity = 77/196 (39.29%), Positives = 115/196 (58.67%), Query Frame = 0

Query: 30  GSGFRESGSGSSIGENSSE-SSSIGVPDDDSDDDDEVQSKLKEGGLLG--LDSLEDALVI 89
           G G R S + +   E SS+ SSSIG   ++ ++++E  +   + G L     SLED+L I
Sbjct: 48  GIGLRMSNNNNKSPEESSDSSSSIGESSENEEEEEEDDAVSCQRGTLDSFSSSLEDSLPI 107

Query: 90  KGGLSRHFSGKSKSFANLSEVI-QVKDLEKPDNPFNKRRRILMASKWSRK------ASFY 149
           K GLS H+ GKSKSF NL E   + KDLEK +NPFNKRRR+++A+K  R+      ++FY
Sbjct: 108 KRGLSNHYVGKSKSFGNLMEAASKAKDLEKVENPFNKRRRLVIANKLRRRGRSMSASNFY 167

Query: 150 SWRNPKSMPLLALNEDEEEQQPADGSDSEERNRESDEDDDQNERRRQTLGQRYHDRKLVN 209
           SW+NP SMPLLAL E  EE       D    N + ++DD   +  R+ +    + ++L+ 
Sbjct: 168 SWQNPNSMPLLALQEPNEE-------DHHIHNDDYEDDDGDGDDHRKIMMMMKNKKELM- 227

Query: 210 GFKSMSCFDLQEYEQQ 216
             ++ SCF L   +++
Sbjct: 228 -AQTRSCFCLSSLQEE 234

BLAST of CmaCh01G015740 vs. TAIR 10
Match: AT4G31510.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24550.1); Has 205 Blast hits to 205 proteins in 31 species: Archae - 0; Bacteria - 0; Metazoa - 5; Fungi - 3; Plants - 187; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 85.5 bits (210), Expect = 5.7e-17
Identity = 81/226 (35.84%), Positives = 117/226 (51.77%), Query Frame = 0

Query: 1   MEVLLGSPTF--SIEVPSENSSAVAETHNRDGSGFRESG------SGSSIGENSSESSSI 60
           MEVL+GS TF     V + + +  A   +R   G R  G      S SS+GE S      
Sbjct: 1   MEVLVGS-TFRDRSSVTTHDQAVPASLSSR--IGLRRCGRSPPPESSSSVGETS------ 60

Query: 61  GVPDDDSDDDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFSGKSKSFANLSEVIQVKD 120
              +++ D+DD V S           SLED+L IK GLS H+ GKSKSF NL E     D
Sbjct: 61  ---ENEEDEDDAVSSSQGRWLNSFSSSLEDSLPIKRGLSNHYIGKSKSFGNLMEASNTND 120

Query: 121 LEKPDNPFNKRRRILMASKWSRKA-----SFYSWRNPKSMPLLALNEDEEEQQPADGSDS 180
           L K ++P NKRRR+L+A+K  R++     S Y+  NP SMPLLAL E + E    +  D 
Sbjct: 121 LVKVESPLNKRRRLLIANKLRRRSSLSSFSIYTKINPNSMPLLALQESDNEDHKLNDDDD 180

Query: 181 EERNRESDEDDDQNERRRQTLGQRYHDRKLVNGFKSMSCFDLQEYE 214
           ++   +S  DD+ ++ + + +    H   +V   ++ SCF L  ++
Sbjct: 181 DD---DSSSDDETSKLKEKRMKMTNHRDFMVP--QTKSCFSLTSFQ 209

BLAST of CmaCh01G015740 vs. TAIR 10
Match: AT5G21940.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G43850.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 62.8 bits (151), Expect = 4.0e-10
Identity = 46/128 (35.94%), Positives = 68/128 (53.12%), Query Frame = 0

Query: 29  DGSGFRESGSGSSIGENSSESSSIGVPDDDSDDDDEVQSKLKEGGLLGLDSLEDALVIKG 88
           D S    S + SSIG NS +         D   ++EV+S  K G L  ++SLE  L ++ 
Sbjct: 31  DSSSSPSSSASSSIGRNSDDGEKSSEDGGDDAGENEVESPYK-GPLEMMESLEQVLPVRK 90

Query: 89  GLSRHFSGKSKSFAN--------LSEVIQVKDLEKPDNPFNKRRRILMASKWSRKASFYS 148
           G+S+++SGKSKSF N        L+    +KDL KP+NP+++RRR L+  +         
Sbjct: 91  GISKYYSGKSKSFTNLTAEAASALTSSSSMKDLAKPENPYSRRRRNLLCHQ--------I 149

BLAST of CmaCh01G015740 vs. TAIR 10
Match: AT3G43850.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: vacuole; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G21940.1); Has 215 Blast hits to 215 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 61.2 bits (147), Expect = 1.1e-09
Identity = 43/98 (43.88%), Positives = 62/98 (63.27%), Query Frame = 0

Query: 36  SGSGSSIGENSSESSSIGVPDDDSDDDDEVQSKLKEGGLLGLDSLEDALVIKGGLSRHFS 95
           S S  SIGENS         DDD   ++E++S    G L  ++SLE+AL IK  +S+ + 
Sbjct: 24  STSSDSIGENS---------DDDEGGENEIESSY-NGPLDMMESLEEALPIKRAISKFYK 83

Query: 96  GKSKSFANLSEV--IQVKDLEKPDNPFNKRRRILMASK 132
           GKSKSF +LSE   + VKDL KP+N +++RRR L++ +
Sbjct: 84  GKSKSFMSLSETSSLPVKDLTKPENLYSRRRRNLLSHR 111

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1J0I5      1.1e-108  100.00        uncharacterized protein LOC111480811 OS=Cucurbita maxima OX=3661 GN=LOC111480811...
A0A6J1FKY0      1.6e-104  97.67         uncharacterized protein LOC111446357 OS=Cucurbita moschata OX=3662 GN=LOC1114463...
A0A6J1FA94      1.2e-70   71.86         uncharacterized protein LOC111443706 OS=Cucurbita moschata OX=3662 GN=LOC1114437...
A0A6J1ILJ2      2.0e-70   70.51         uncharacterized protein LOC111476326 OS=Cucurbita maxima OX=3661 GN=LOC111476326...
A0A1S3CQX1      2.2e-69   73.59         uncharacterized protein LOC103503753 OS=Cucumis melo OX=3656 GN=LOC103503753 PE=...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022981755.1  2.3e-108  100.00        uncharacterized protein LOC111480811 [Cucurbita maxima]
XP_022940912.1  3.4e-104  97.67         uncharacterized protein LOC111446357 [Cucurbita moschata] >KAG7037565.1 hypothet...
XP_023524376.1  3.4e-104  97.67         uncharacterized protein LOC111788285 [Cucurbita pepo subsp. pepo]
KAG6608207.1    8.9e-89   96.86         Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_038898503.1  6.2e-74   74.46         uncharacterized protein LOC120086121 [Benincasa hispida]

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G24890.1     1.0e-26   39.83         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G24550.1     1.1e-20   39.29         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G31510.1     5.7e-17   35.84         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G21940.1     4.0e-10   35.94         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G43850.1     1.1e-09   43.88         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term      Source Description    Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction   coord: 1..72
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction   coord: 176..196
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction   coord: 149..196
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction   coord: 11..53
None      No IPR available  PANTHER      PTHR33172        OS08G0516900 PROTEIN  coord: 20..214
None      No IPR available  PANTHER      PTHR33172:SF46   OS08G0516900 PROTEIN  coord: 20..214
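
The four mobidb-lite disorder predictions overlap, so their lengths cannot simply be summed. As a small sketch, assuming the coordinates are 1-based inclusive residue positions, the ranges can be merged before counting how much of the 215-residue protein they cover. The helper name `merge_intervals` is illustrative.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates as listed in the InterPro table above.
disorder = [(1, 72), (176, 196), (149, 196), (11, 53)]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)

print(merged)   # [(1, 72), (149, 196)]
print(covered)  # 120
```

On these inputs the predictions collapse to two regions covering 120 of the 215 residues.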

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh01G015740.1  CmaCh01G015740.1  mRNA