Cp4.1LG01g24190 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG01g24190
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionTFIIF_beta domain-containing protein
LocationCp4.1LG01: 19107955 .. 19110911 (-)
RNA-Seq ExpressionCp4.1LG01g24190
SyntenyCp4.1LG01g24190
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTTGTTGTAATTGTTGGTGTTTTTGTGCGATCTGACTTGCTAGACGTACATCTGCTTCTTACGAGCTCAATACATTCGAATTGTTTCGGATTTGTGTTTTAGTTTTGTATTTTCAGGATCCGTTAGCTATGGATTCAGTTCAGAGTTGGTTGTTGTGGAATTTTTTCGCATATAGCCTTAATTGAATGACAATTTTGCTCGTAAGTGTTGATCGAGTTGATTGAATAAACAACCGGTTTTGTTAGTTGTGTCTTGAATTGCTTGCATATACTTGTTTTCAACTACTGCGTGTTGTTGCATCTCCTGAATATTGAAGTGGTTTTAGTGATTAGGGCCTCTAGATGAATTGGGAAGATTCTTTTAGGAGAAAAGCTAATTCCTAGTTCATTTTACTTTGGTTGACTTTTCTTCGTGAAAATTGGCTTTAAGCAGTCGTGTGGTTTTGGGGCTCTTATGCTTGAATTGAAAAGAAAAACCTTCATGCTATGGATATTGTTTCTTTGTGTACTCTTCTTCTCTCTCTATGTATATGTACAGATTTTATGGTAAAAAACTTATGGTGACTATTCCATCTTTGAGTATTTTAGGTAATTGAAGGTAGAAATGGATAATTTCCTTGGAGGCTTTACAGAGGATATCCAAAGATGTCCATTCTTGAGGAATATTAATGAGCCGACTCATTTCTCATTTTCCTCGTCCATGGCATTCCCCATGCCTGTAAGTTTTTTCTTTCTATAATTTTATGTTTGAATTTGTACAAATGAATTAATTCAACTCGTTTTGTCCTATAAATTACAACATTGGTCAATGTTCAACTATGGAAAACACTTGAATATCAGACATGCATCTGTATTTCCCTAAAATTATACTAAGGGTTCAACATAATAGGAATGAATTAATAGTCTTCCTGTCAAAGTTATTCAGTGTTCCATTATTTTCATAACTATTTATAATACTCTACATATATAAATATATAAATATTAGTTTTACAAAACTTGACACATGTAAATATTAGTTCTTGTTATATATAAATCATTTCTTATAGTTTAGTGTTGCCAGCCCAAGCACTACTTATTTTAGTGTATATGGTATATGTATACCTGTTTCAGTTTCTTGCTAGTATGACTTGATAGATGAATAAGTGGATACTTTCTTATCTCTGGTGACATGGCTGAGGTGAATTAAATGCACTTTTTGTACCTCTGTTAGTTATCCTCTTATTGTGTTGTTATAACTATTAGCAGTTCGCTCCCGGATTCGAGTGTTATATTTAGTTCTTCAATAAATCTTACAATCTTCAATGTAGGCTGTTGTTGGGTCTCCCTCGATGTTCTTGTAACTCAGTTTTGCATATAATTCTGACATTTTTCTGATTTCTAGTTAATCTAATAACTGCTTTTCCACTTTTTATAGGTGCGTGGAGCTAAAGGGCCAATTTTTGAAGATGGTCCCAATTTTGATATGGCATTTAGGCTTTTCCATGGTCGAGATGGAGTGGTTCCACTCTCTGGAAGATCTATGCATCCAGAGAGTGTAGAACTCAAACCAGCCCCATCGCAGTTCAATCCTTTAGCTGCCAAAGCTGCCACTATTAGTCTCTCTTCCTTCGGACCTGGAGGTCCCTTCAGCTTTGATTCATTTTCAGACAAGTGGAAAAATCAGAAAAAGAAATTTGAATCATCCAAGAAAGAGTCTTCTTCAAAGGTGATCAATTGTTAATCTTTGATGGGTTATATCTATTGAGTACTTTAGCTGCTAGGTGAAAAATGATGGAATTCATATGCTAAAGAATATTGTGATCGGAATGGATGAAAATCTATAGGATGTATGTGATCTTTTCTCTTGACTTATCATTATGTTCATTTTCAGGGTGGAAATTCACATGAAGCTGTGGGTAATGAATGGCTGCAAATGGGGAACTGCCCGATTGCGAAGTCATACCGTGCTGTTAGCAATGTCATACCACTTGTTGCGAAAGCTCTTCAGCCCCCACCAGGCATGAAATTTAGATGTCCACCAGCCGTAGTTGCAGCAAGAGCAGCATTAGCAAAGACTGCATTTGCAAAGAATCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATCGGTCTACTCGGCATGGCAGCAAATGTACCTTTAGGAATATGGAGAGAACACACTGAAAAATTCTCACCCTCTTGGTTTGCTGCCGTTCATGCAGCTGTTCCATTTATAGCTATGCTGAGGAAATCCATCTTAATGCCGAAGTCAGCAATGGCATTTACAATTGCAGCATCAGTCTTAGGCCAGGTTATCGGCTCAAGGGCAGAGCGATTCCGACTCAAGGCAGTAGCTTCGGAAAAACTAACCCTCCAAGATTCAATCGGCAAATCGACCCCTTTTCCCGTCGTTACTGTGAAAAACGGTCATTGTGGCGACATCGAGAGCTGGAATCCAGTTACCACTCTTCAGGTAGCAGGTCCTCCAACACCAACCAACATACCCTGCTGATCTCTCCTTCACCATTTATTGTGAGTGTTCTTCTCTGTATCCTATTGTGTTTAATTCTCCATTATACACCAGTTTGTTCATTTGGCAGTATTTGACAGTTCAAAGTATAAATTGATGAAATAAAAAGTGTCTTTGTGTGTGTATTAAATCCTGGTGGGAATGTTGTTCAATTTTGCTTTGTCATATTTTGGCTCAGAAGGAAGGTGATGAGAAACACAACGTATTATAGAGGAAAGATGGTGAAATGTGAGGAGATGGGAGACTTTGTCATGAATTGAATTCAATATTGTCGGCTAAATCAATCACTATTTTAGTTGACCCATGAATTGAATTCCAAAAGTGTTCTTGGAAAACTCATGGCAAATGGAGCACAATTTCTGTTTTGCATGTCTGATCTTGAATTGGCGTTTATGACGTTTTCTCCCTTGTAAAGGCGAGTCAAGTTGACTTTGTGTTCTCGAT

mRNA sequence

TTTTTGTTGTAATTGTTGGTGTTTTTGTGCGATCTGACTTGCTAGACGTACATCTGCTTCTTACGAGCTCAATACATTCGAATTGTTTCGGATTTGTGTTTTAGTTTTGTATTTTCAGGATCCGTTAGCTATGGATTCAGTTCAGAGTTGGTTGTTGTGGAATTTTTTCGCATATAGCCTTAATTGAATGACAATTTTGCTCGTAATTGAAGGTAGAAATGGATAATTTCCTTGGAGGCTTTACAGAGGATATCCAAAGATGTCCATTCTTGAGGAATATTAATGAGCCGACTCATTTCTCATTTTCCTCGTCCATGGCATTCCCCATGCCTGTGCGTGGAGCTAAAGGGCCAATTTTTGAAGATGGTCCCAATTTTGATATGGCATTTAGGCTTTTCCATGGTCGAGATGGAGTGGTTCCACTCTCTGGAAGATCTATGCATCCAGAGAGTGTAGAACTCAAACCAGCCCCATCGCAGTTCAATCCTTTAGCTGCCAAAGCTGCCACTATTAGTCTCTCTTCCTTCGGACCTGGAGGTCCCTTCAGCTTTGATTCATTTTCAGACAAGTGGAAAAATCAGAAAAAGAAATTTGAATCATCCAAGAAAGAGTCTTCTTCAAAGGGTGGAAATTCACATGAAGCTGTGGGTAATGAATGGCTGCAAATGGGGAACTGCCCGATTGCGAAGTCATACCGTGCTGTTAGCAATGTCATACCACTTGTTGCGAAAGCTCTTCAGCCCCCACCAGGCATGAAATTTAGATGTCCACCAGCCGTAGTTGCAGCAAGAGCAGCATTAGCAAAGACTGCATTTGCAAAGAATCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATCGGTCTACTCGGCATGGCAGCAAATGTACCTTTAGGAATATGGAGAGAACACACTGAAAAATTCTCACCCTCTTGGTTTGCTGCCGTTCATGCAGCTGTTCCATTTATAGCTATGCTGAGGAAATCCATCTTAATGCCGAAGTCAGCAATGGCATTTACAATTGCAGCATCAGTCTTAGGCCAGGTTATCGGCTCAAGGGCAGAGCGATTCCGACTCAAGGCAGTAGCTTCGGAAAAACTAACCCTCCAAGATTCAATCGGCAAATCGACCCCTTTTCCCGTCGTTACTGTGAAAAACGGTCATTGTGGCGACATCGAGAGCTGGAATCCAGTTACCACTCTTCAGGTAGCAGGTCCTCCAACACCAACCAACATACCCTGCTGATCTCTCCTTCACCATTTATTAAGGAAGGTGATGAGAAACACAACGTATTATAGAGGAAAGATGGTGAAATGTGAGGAGATGGGAGACTTTGTCATGAATTGAATTCAATATTGTCGGCTAAATCAATCACTATTTTAGTTGACCCATGAATTGAATTCCAAAAGTGTTCTTGGAAAACTCATGGCAAATGGAGCACAATTTCTGTTTTGCATGTCTGATCTTGAATTGGCGTTTATGACGTTTTCTCCCTTGTAAAGGCGAGTCAAGTTGACTTTGTGTTCTCGAT

Coding sequence (CDS)

ATGGATAATTTCCTTGGAGGCTTTACAGAGGATATCCAAAGATGTCCATTCTTGAGGAATATTAATGAGCCGACTCATTTCTCATTTTCCTCGTCCATGGCATTCCCCATGCCTGTGCGTGGAGCTAAAGGGCCAATTTTTGAAGATGGTCCCAATTTTGATATGGCATTTAGGCTTTTCCATGGTCGAGATGGAGTGGTTCCACTCTCTGGAAGATCTATGCATCCAGAGAGTGTAGAACTCAAACCAGCCCCATCGCAGTTCAATCCTTTAGCTGCCAAAGCTGCCACTATTAGTCTCTCTTCCTTCGGACCTGGAGGTCCCTTCAGCTTTGATTCATTTTCAGACAAGTGGAAAAATCAGAAAAAGAAATTTGAATCATCCAAGAAAGAGTCTTCTTCAAAGGGTGGAAATTCACATGAAGCTGTGGGTAATGAATGGCTGCAAATGGGGAACTGCCCGATTGCGAAGTCATACCGTGCTGTTAGCAATGTCATACCACTTGTTGCGAAAGCTCTTCAGCCCCCACCAGGCATGAAATTTAGATGTCCACCAGCCGTAGTTGCAGCAAGAGCAGCATTAGCAAAGACTGCATTTGCAAAGAATCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATCGGTCTACTCGGCATGGCAGCAAATGTACCTTTAGGAATATGGAGAGAACACACTGAAAAATTCTCACCCTCTTGGTTTGCTGCCGTTCATGCAGCTGTTCCATTTATAGCTATGCTGAGGAAATCCATCTTAATGCCGAAGTCAGCAATGGCATTTACAATTGCAGCATCAGTCTTAGGCCAGGTTATCGGCTCAAGGGCAGAGCGATTCCGACTCAAGGCAGTAGCTTCGGAAAAACTAACCCTCCAAGATTCAATCGGCAAATCGACCCCTTTTCCCGTCGTTACTGTGAAAAACGGTCATTGTGGCGACATCGAGAGCTGGAATCCAGTTACCACTCTTCAGGTAGCAGGTCCTCCAACACCAACCAACATACCCTGCTGA

Protein sequence

MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKNQKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDSIGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC
Homology
BLAST of Cp4.1LG01g24190 vs. NCBI nr
Match: XP_023543023.1 (uncharacterized protein LOC111802764 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 680 bits (1755), Expect = 1.12e-246
Identity = 342/342 (100.00%), Postives = 342/342 (100.00%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN
Sbjct: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC
Sbjct: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342

BLAST of Cp4.1LG01g24190 vs. NCBI nr
Match: XP_022990398.1 (uncharacterized protein LOC111487268 [Cucurbita maxima])

HSP 1 Score: 679 bits (1751), Expect = 4.54e-246
Identity = 341/342 (99.71%), Postives = 342/342 (100.00%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN
Sbjct: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPP+PTNIPC
Sbjct: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPSPTNIPC 342

BLAST of Cp4.1LG01g24190 vs. NCBI nr
Match: XP_022964843.1 (uncharacterized protein LOC111464821 [Cucurbita moschata])

HSP 1 Score: 676 bits (1745), Expect = 3.73e-245
Identity = 340/342 (99.42%), Postives = 341/342 (99.71%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN
Sbjct: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           I KSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPP+PTNIPC
Sbjct: 301 ISKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPSPTNIPC 342

BLAST of Cp4.1LG01g24190 vs. NCBI nr
Match: KAG7033009.1 (hypothetical protein SDJN02_07061, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 674 bits (1739), Expect = 3.70e-244
Identity = 339/342 (99.12%), Postives = 340/342 (99.42%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 6   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 65

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN
Sbjct: 66  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 125

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAV NEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 126 QKKKFESSKKESSSKGGNSHEAVSNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 185

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 186 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 245

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 246 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 305

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           I KSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPP+PTNIPC
Sbjct: 306 ISKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPSPTNIPC 347

BLAST of Cp4.1LG01g24190 vs. NCBI nr
Match: KAG6602326.1 (General transcription factor IIF subunit 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 672 bits (1735), Expect = 3.17e-238
Identity = 338/342 (98.83%), Postives = 340/342 (99.42%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 346 MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 405

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKW+N
Sbjct: 406 HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWQN 465

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAV NEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 466 QKKKFESSKKESSSKGGNSHEAVSNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 525

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 526 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 585

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 586 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 645

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           I KSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPP+PTNIPC
Sbjct: 646 ISKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPSPTNIPC 687

BLAST of Cp4.1LG01g24190 vs. ExPASy TrEMBL
Match: A0A6J1JT60 (uncharacterized protein LOC111487268 OS=Cucurbita maxima OX=3661 GN=LOC111487268 PE=4 SV=1)

HSP 1 Score: 679 bits (1751), Expect = 2.20e-246
Identity = 341/342 (99.71%), Postives = 342/342 (100.00%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN
Sbjct: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPP+PTNIPC
Sbjct: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPSPTNIPC 342

BLAST of Cp4.1LG01g24190 vs. ExPASy TrEMBL
Match: A0A6J1HM37 (uncharacterized protein LOC111464821 OS=Cucurbita moschata OX=3662 GN=LOC111464821 PE=4 SV=1)

HSP 1 Score: 676 bits (1745), Expect = 1.81e-245
Identity = 340/342 (99.42%), Postives = 341/342 (99.71%), Query Frame = 0

Query: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60
           MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF
Sbjct: 1   MDNFLGGFTEDIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLF 60

Query: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120
           HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN
Sbjct: 61  HGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKN 120

Query: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180
           QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK
Sbjct: 121 QKKKFESSKKESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMK 180

Query: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240
           FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW
Sbjct: 181 FRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSW 240

Query: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300
           FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS
Sbjct: 241 FAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDS 300

Query: 301 IGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           I KSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPP+PTNIPC
Sbjct: 301 ISKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPSPTNIPC 342

BLAST of Cp4.1LG01g24190 vs. ExPASy TrEMBL
Match: A0A1S3CPS7 (uncharacterized protein LOC103503307 OS=Cucumis melo OX=3656 GN=LOC103503307 PE=4 SV=1)

HSP 1 Score: 625 bits (1612), Expect = 4.54e-225
Identity = 320/350 (91.43%), Postives = 332/350 (94.86%), Query Frame = 0

Query: 1   MDNFLGGFTED-------IQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNF 60
           MD+FLGGFTED       IQRCPFLRNINEPT+FSFSSSMAFP+PVRGAKGPIFEDGPNF
Sbjct: 1   MDSFLGGFTEDSTTFNQDIQRCPFLRNINEPTNFSFSSSMAFPIPVRGAKGPIFEDGPNF 60

Query: 61  DMAFRLFHGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDS 120
           DMAFRLFHGRDGVVPLSGRSMHP SVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF S
Sbjct: 61  DMAFRLFHGRDGVVPLSGRSMHPGSVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFGS 120

Query: 121 FSDKWKNQKKKFESSKKESSSKGGNS-HEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKA 180
           FS+KWKNQKKKFESSKKESSS+GGNS HEAVGNEWLQMGNCPIAKSYRAVS+VIPLVAKA
Sbjct: 121 FSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSSVIPLVAKA 180

Query: 181 LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH 240
           LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH
Sbjct: 181 LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH 240

Query: 241 TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVAS 300
           TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS
Sbjct: 241 TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKAVAS 300

Query: 301 EKLTLQDSIGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           +KLTLQDS+ +ST  PVV +KNGHCGDIESWNPVTTLQVAGP +P  +PC
Sbjct: 301 KKLTLQDSLAESTLLPVVNMKNGHCGDIESWNPVTTLQVAGPASPNKVPC 350

BLAST of Cp4.1LG01g24190 vs. ExPASy TrEMBL
Match: A0A5A7TBC4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G001170 PE=4 SV=1)

HSP 1 Score: 625 bits (1612), Expect = 4.54e-225
Identity = 320/350 (91.43%), Postives = 332/350 (94.86%), Query Frame = 0

Query: 1   MDNFLGGFTED-------IQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNF 60
           MD+FLGGFTED       IQRCPFLRNINEPT+FSFSSSMAFP+PVRGAKGPIFEDGPNF
Sbjct: 1   MDSFLGGFTEDSTTFNQDIQRCPFLRNINEPTNFSFSSSMAFPIPVRGAKGPIFEDGPNF 60

Query: 61  DMAFRLFHGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDS 120
           DMAFRLFHGRDGVVPLSGRSMHP SVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF S
Sbjct: 61  DMAFRLFHGRDGVVPLSGRSMHPGSVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFGS 120

Query: 121 FSDKWKNQKKKFESSKKESSSKGGNS-HEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKA 180
           FS+KWKNQKKKFESSKKESSS+GGNS HEAVGNEWLQMGNCPIAKSYRAVS+VIPLVAKA
Sbjct: 121 FSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSSVIPLVAKA 180

Query: 181 LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH 240
           LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH
Sbjct: 181 LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH 240

Query: 241 TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVAS 300
           TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS
Sbjct: 241 TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKAVAS 300

Query: 301 EKLTLQDSIGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           +KLTLQDS+ +ST  PVV +KNGHCGDIESWNPVTTLQVAGP +P  +PC
Sbjct: 301 KKLTLQDSLAESTLLPVVNMKNGHCGDIESWNPVTTLQVAGPASPNKVPC 350

BLAST of Cp4.1LG01g24190 vs. ExPASy TrEMBL
Match: A0A0A0KNM8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G492350 PE=4 SV=1)

HSP 1 Score: 625 bits (1612), Expect = 4.54e-225
Identity = 320/350 (91.43%), Postives = 332/350 (94.86%), Query Frame = 0

Query: 1   MDNFLGGFTED-------IQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNF 60
           MDNFLGGFTED       IQRCPFLRNINEPT+FSFSSSMAFP+PVRGAKGPIFEDGPNF
Sbjct: 1   MDNFLGGFTEDSTTFNQDIQRCPFLRNINEPTNFSFSSSMAFPVPVRGAKGPIFEDGPNF 60

Query: 61  DMAFRLFHGRDGVVPLSGRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDS 120
           DMAFRLFHGRDGVVPLSGRSMHP SVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF S
Sbjct: 61  DMAFRLFHGRDGVVPLSGRSMHPGSVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFGS 120

Query: 121 FSDKWKNQKKKFESSKKESSSKGGNS-HEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKA 180
           FS+KWKNQKKKFESSKKESSS+GGNS HEAVGNEWLQMGNCPIAKSYRAVS+VIPLVAKA
Sbjct: 121 FSEKWKNQKKKFESSKKESSSQGGNSQHEAVGNEWLQMGNCPIAKSYRAVSSVIPLVAKA 180

Query: 181 LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH 240
           LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH
Sbjct: 181 LQPPPGMKFRCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREH 240

Query: 241 TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVAS 300
           TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS
Sbjct: 241 TEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERLRLKAVAS 300

Query: 301 EKLTLQDSIGKSTPFPVVTVKNGHCGDIESWNPVTTLQVAGPPTPTNIPC 342
           +KLTLQDS+ ++T  PVV +KNGHCGDIESWNPVTTLQVAGP +P  +PC
Sbjct: 301 KKLTLQDSLTEATLLPVVNMKNGHCGDIESWNPVTTLQVAGPASPNKVPC 350

BLAST of Cp4.1LG01g24190 vs. TAIR 10
Match: AT4G25030.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G45410.3); Has 125 Blast hits to 125 proteins in 36 species: Archae - 2; Bacteria - 31; Metazoa - 0; Fungi - 4; Plants - 88; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 429.5 bits (1103), Expect = 2.6e-120
Identity = 229/330 (69.39%), Postives = 261/330 (79.09%), Query Frame = 0

Query: 11  DIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLS 70
           +I RCPFLRNINEPT+ SFSSS+ FP+P R  KGPIFEDGPNFD AFRLFHG+DGVVPLS
Sbjct: 15  NILRCPFLRNINEPTNLSFSSSLPFPIPARAGKGPIFEDGPNFDTAFRLFHGQDGVVPLS 74

Query: 71  GRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKNQKKKFESSKK 130
             +    +   KP P  F+PLAAKAATISLSSFG GGPF FD+FSD +KNQKKK +SSK 
Sbjct: 75  DTA---RTEAQKPVP-VFHPLAAKAATISLSSFGSGGPFGFDAFSDMFKNQKKKSDSSK- 134

Query: 131 ESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAVVAA 190
              +KGGN HEA+G+EWL+ GNCPIAKSYRAVS V PLVAK LQPPPGMKF+CP A+V A
Sbjct: 135 ---NKGGN-HEAMGDEWLKTGNCPIAKSYRAVSGVAPLVAKILQPPPGMKFKCPQAIVTA 194

Query: 191 RAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPF 250
           RAA++KT FAKNLRPQPLPAKVL IG+LGMA NVPLG+WREHTEKFS SWF A+HAAVPF
Sbjct: 195 RAAISKTPFAKNLRPQPLPAKVLVIGMLGMALNVPLGVWREHTEKFSASWFIALHAAVPF 254

Query: 251 IAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQ----DSIGKSTP 310
           I +LRKS+LMPK+AM FTIAASVLGQVIGSRAER RLK+VA +KLTL+     S+     
Sbjct: 255 IGILRKSVLMPKTAMVFTIAASVLGQVIGSRAERRRLKSVAEKKLTLEVPNPSSVEADQM 314

Query: 311 FPVVTVKNGHCGD--IESWNPVTTLQVAGP 335
                  +G CGD  +  WNP+  L VA P
Sbjct: 315 QFAGVSSDGRCGDKVVMKWNPM-MLDVASP 334

BLAST of Cp4.1LG01g24190 vs. TAIR 10
Match: AT4G25030.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G45410.3); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 429.5 bits (1103), Expect = 2.6e-120
Identity = 229/330 (69.39%), Postives = 261/330 (79.09%), Query Frame = 0

Query: 11  DIQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLS 70
           +I RCPFLRNINEPT+ SFSSS+ FP+P R  KGPIFEDGPNFD AFRLFHG+DGVVPLS
Sbjct: 15  NILRCPFLRNINEPTNLSFSSSLPFPIPARAGKGPIFEDGPNFDTAFRLFHGQDGVVPLS 74

Query: 71  GRSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKNQKKKFESSKK 130
             +    +   KP P  F+PLAAKAATISLSSFG GGPF FD+FSD +KNQKKK +SSK 
Sbjct: 75  DTA---RTEAQKPVP-VFHPLAAKAATISLSSFGSGGPFGFDAFSDMFKNQKKKSDSSK- 134

Query: 131 ESSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAVVAA 190
              +KGGN HEA+G+EWL+ GNCPIAKSYRAVS V PLVAK LQPPPGMKF+CP A+V A
Sbjct: 135 ---NKGGN-HEAMGDEWLKTGNCPIAKSYRAVSGVAPLVAKILQPPPGMKFKCPQAIVTA 194

Query: 191 RAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPF 250
           RAA++KT FAKNLRPQPLPAKVL IG+LGMA NVPLG+WREHTEKFS SWF A+HAAVPF
Sbjct: 195 RAAISKTPFAKNLRPQPLPAKVLVIGMLGMALNVPLGVWREHTEKFSASWFIALHAAVPF 254

Query: 251 IAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQ----DSIGKSTP 310
           I +LRKS+LMPK+AM FTIAASVLGQVIGSRAER RLK+VA +KLTL+     S+     
Sbjct: 255 IGILRKSVLMPKTAMVFTIAASVLGQVIGSRAERRRLKSVAEKKLTLEVPNPSSVEADQM 314

Query: 311 FPVVTVKNGHCGD--IESWNPVTTLQVAGP 335
                  +G CGD  +  WNP+  L VA P
Sbjct: 315 QFAGVSSDGRCGDKVVMKWNPM-MLDVASP 334

BLAST of Cp4.1LG01g24190 vs. TAIR 10
Match: AT5G45410.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G25030.2); Has 124 Blast hits to 124 proteins in 34 species: Archae - 2; Bacteria - 31; Metazoa - 0; Fungi - 0; Plants - 91; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 400.2 bits (1027), Expect = 1.7e-111
Identity = 203/308 (65.91%), Postives = 245/308 (79.55%), Query Frame = 0

Query: 12  IQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSG 71
           IQ+CPFLRNIN+PT+ SF SS++FP+PV+G KGPIFEDGP FD AF+LFHG+DG+VPLSG
Sbjct: 13  IQKCPFLRNINKPTNLSF-SSLSFPIPVQGGKGPIFEDGPGFDSAFKLFHGKDGIVPLSG 72

Query: 72  RSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKNQKKKFESSKKE 131
            +   E  E      QFNPLA K ATISLS+FGPGGPF F  FS+KWK Q+KK + SK +
Sbjct: 73  FADDSED-EAGRRALQFNPLAGKVATISLSAFGPGGPFGFGPFSEKWKKQQKKPKPSKNQ 132

Query: 132 SSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAVVAAR 191
            S    + HEAVG+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK+RCP  +VAAR
Sbjct: 133 QSG-DSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKALTLPPGMKYRCPAPIVAAR 192

Query: 192 AALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFI 251
           AAL+KTA  K+LRPQPLP K+LAI L+GMAANVPLG+WREHT+KFSP+WF AVHAAVPFI
Sbjct: 193 AALSKTALVKSLRPQPLPEKMLAIALMGMAANVPLGVWREHTKKFSPAWFLAVHAAVPFI 252

Query: 252 AMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDSIGKSTPFPVVT 311
           AMLRKS+LMPK+AMA TI AS+LGQVIGSRAER+RLKAVA + + +   +      P  +
Sbjct: 253 AMLRKSVLMPKTAMALTIGASILGQVIGSRAERYRLKAVAEKMVPVTAMVSGYNQSPGDS 312

Query: 312 -VKNGHCG 319
            +  GHCG
Sbjct: 313 GISGGHCG 317

BLAST of Cp4.1LG01g24190 vs. TAIR 10
Match: AT5G45410.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G25030.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 400.2 bits (1027), Expect = 1.7e-111
Identity = 203/308 (65.91%), Postives = 245/308 (79.55%), Query Frame = 0

Query: 12  IQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSG 71
           IQ+CPFLRNIN+PT+ SF SS++FP+PV+G KGPIFEDGP FD AF+LFHG+DG+VPLSG
Sbjct: 13  IQKCPFLRNINKPTNLSF-SSLSFPIPVQGGKGPIFEDGPGFDSAFKLFHGKDGIVPLSG 72

Query: 72  RSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKNQKKKFESSKKE 131
            +   E  E      QFNPLA K ATISLS+FGPGGPF F  FS+KWK Q+KK + SK +
Sbjct: 73  FADDSED-EAGRRALQFNPLAGKVATISLSAFGPGGPFGFGPFSEKWKKQQKKPKPSKNQ 132

Query: 132 SSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAVVAAR 191
            S    + HEAVG+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK+RCP  +VAAR
Sbjct: 133 QSG-DSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKALTLPPGMKYRCPAPIVAAR 192

Query: 192 AALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFI 251
           AAL+KTA  K+LRPQPLP K+LAI L+GMAANVPLG+WREHT+KFSP+WF AVHAAVPFI
Sbjct: 193 AALSKTALVKSLRPQPLPEKMLAIALMGMAANVPLGVWREHTKKFSPAWFLAVHAAVPFI 252

Query: 252 AMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDSIGKSTPFPVVT 311
           AMLRKS+LMPK+AMA TI AS+LGQVIGSRAER+RLKAVA + + +   +      P  +
Sbjct: 253 AMLRKSVLMPKTAMALTIGASILGQVIGSRAERYRLKAVAEKMVPVTAMVSGYNQSPGDS 312

Query: 312 -VKNGHCG 319
            +  GHCG
Sbjct: 313 GISGGHCG 317

BLAST of Cp4.1LG01g24190 vs. TAIR 10
Match: AT5G45410.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G25030.2); Has 124 Blast hits to 124 proteins in 34 species: Archae - 2; Bacteria - 31; Metazoa - 0; Fungi - 0; Plants - 91; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 400.2 bits (1027), Expect = 1.7e-111
Identity = 203/308 (65.91%), Postives = 245/308 (79.55%), Query Frame = 0

Query: 12  IQRCPFLRNINEPTHFSFSSSMAFPMPVRGAKGPIFEDGPNFDMAFRLFHGRDGVVPLSG 71
           IQ+CPFLRNIN+PT+ SF SS++FP+PV+G KGPIFEDGP FD AF+LFHG+DG+VPLSG
Sbjct: 13  IQKCPFLRNINKPTNLSF-SSLSFPIPVQGGKGPIFEDGPGFDSAFKLFHGKDGIVPLSG 72

Query: 72  RSMHPESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSDKWKNQKKKFESSKKE 131
            +   E  E      QFNPLA K ATISLS+FGPGGPF F  FS+KWK Q+KK + SK +
Sbjct: 73  FADDSED-EAGRRALQFNPLAGKVATISLSAFGPGGPFGFGPFSEKWKKQQKKPKPSKNQ 132

Query: 132 SSSKGGNSHEAVGNEWLQMGNCPIAKSYRAVSNVIPLVAKALQPPPGMKFRCPPAVVAAR 191
            S    + HEAVG+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK+RCP  +VAAR
Sbjct: 133 QSG-DSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKALTLPPGMKYRCPAPIVAAR 192

Query: 192 AALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWREHTEKFSPSWFAAVHAAVPFI 251
           AAL+KTA  K+LRPQPLP K+LAI L+GMAANVPLG+WREHT+KFSP+WF AVHAAVPFI
Sbjct: 193 AALSKTALVKSLRPQPLPEKMLAIALMGMAANVPLGVWREHTKKFSPAWFLAVHAAVPFI 252

Query: 252 AMLRKSILMPKSAMAFTIAASVLGQVIGSRAERFRLKAVASEKLTLQDSIGKSTPFPVVT 311
           AMLRKS+LMPK+AMA TI AS+LGQVIGSRAER+RLKAVA + + +   +      P  +
Sbjct: 253 AMLRKSVLMPKTAMALTIGASILGQVIGSRAERYRLKAVAEKMVPVTAMVSGYNQSPGDS 312

Query: 312 -VKNGHCG 319
            +  GHCG
Sbjct: 313 GISGGHCG 317

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023543023.11.12e-246100.00uncharacterized protein LOC111802764 [Cucurbita pepo subsp. pepo][more]
XP_022990398.14.54e-24699.71uncharacterized protein LOC111487268 [Cucurbita maxima][more]
XP_022964843.13.73e-24599.42uncharacterized protein LOC111464821 [Cucurbita moschata][more]
KAG7033009.13.70e-24499.12hypothetical protein SDJN02_07061, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG6602326.13.17e-23898.83General transcription factor IIF subunit 2, partial [Cucurbita argyrosperma subs... [more]
Match NameE-valueIdentityDescription
A0A6J1JT602.20e-24699.71uncharacterized protein LOC111487268 OS=Cucurbita maxima OX=3661 GN=LOC111487268... [more]
A0A6J1HM371.81e-24599.42uncharacterized protein LOC111464821 OS=Cucurbita moschata OX=3662 GN=LOC1114648... [more]
A0A1S3CPS74.54e-22591.43uncharacterized protein LOC103503307 OS=Cucumis melo OX=3656 GN=LOC103503307 PE=... [more]
A0A5A7TBC44.54e-22591.43Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A0A0KNM84.54e-22591.43Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G492350 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G25030.12.6e-12069.39unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G25030.22.6e-12069.39unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G45410.11.7e-11165.91unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G45410.21.7e-11165.91unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G45410.31.7e-11165.91unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31033PROTEIN, PUTATIVE-RELATEDcoord: 7..326
NoneNo IPR availablePANTHERPTHR31033:SF23SUBFAMILY NOT NAMEDcoord: 7..326

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g24190.1Cp4.1LG01g24190.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042742 defense response to bacterium
cellular_component GO:0009507 chloroplast
cellular_component GO:0012505 endomembrane system