CmUC05G098490 (gene) Watermelon (USVL531) v1

Overview
NameCmUC05G098490
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionSAGA-Tad1 domain-containing protein
LocationCmU531Chr05: 28073423 .. 28075829 (+)
RNA-Seq ExpressionCmUC05G098490
SyntenyCmUC05G098490
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAACCTCCGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACATACTGGATCTGCCTTCCCAAATCAGAATCAGAATATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCCCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTCGGGATAGGCCAAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTTGGATCCCTTAAGCTTCCTGAGTGGTCCTCTACTTCCGCCTCTTGGTATTCCATTTTGTTCAGCTAGTGTTGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTAGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAATGTGCAATCTGAATTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAGTTTGTATGCGTGCCTTTGAGGAATAAGCAGTTGCAAAGGTTTTATTTATTCTACAGGCATCCAAGGCTGGTCTGTTCTGGTCCGTAGGGCTTTAAAATTGACCTGGGATGTCTCACCGTTCTTCCCATCTCACTTGGTCTCTCCCTTTTGGTTCAGGCTCCAAGTTTCCATGCTAAAATATTGTAACATCATTGCCTGCTCCATTGAGGAAGTTGAATTAACTCGACTAAGAAAGGGGAGCTTTCAGCCAGAATGCCCATCAAAGGTTGTTCTGTTGCATTTTTAGCATTCATGTTGGCCCCATTTGCCACTACCATGTAACTTAGTTTTTCTTGTAACTTATTTCTGAGAGAATCAAAAAAGTTTTGCAGGGATACTGTCATTTAGTTTGGTGAAAGATGAATCACCTGGGTGAGAGCTTCAGGTGCTCTGTATGTATCTTTAGAATCTGAGAATTTATTGGAATTTCAATATCAATAACAAATTAACATCCATTTCAATGTGCTTTATAGTTTGGTCCCTTGTTCAATTATATGATTATTCAATATATAATGATGCTGCTCTCGTCTTCTAACTGAAATTAGTTATTTTTAATCCCACTTTTGCGATATGAATAATGTGAGCGGCCCCTCTCGATATATGTGTGGGTATATGTGCGTGGGAAGTTCAAGTCTTAGGAGCAGAAAAGCCTGATTTCAAGGTGTCTGCAACCAGATATTCTGCTTGCTTCTGAACACATCAATGGTCTACAAATGCGCTGACCTATTGGATTCAGATAATTCTTATTAAAGGTTAACGTTGTGATTGAAAGTTTGAAGTAGACAAGAGTTGCAAGGCCATAATAACTCGGAAGGTACCTGCTCCTTTTAGCAATTGCAAATTTTCTTTCCCGCTCAGATAACCTACAGTTCTATGTGCTGGTGCGTTGGATTTTGTAGGATATGACTTGATTGCAAAATATATTTGCATCTGAAGTACTATTAACCATTGTATTAATCGGGGAAAGCTTGACCTTGTCTTATGATGCCTGCGGTTTTATGAATCATAAGCACCTTGCCATCATCATATTATTGATTTAGTTTTCTTCTGAGAGATTTACCCAATTTTTTCTTATGATATTGCTTACTTGTCAGCAAAATATTTTCTTCATTTTTAGCTGAATATGGTCGTATTATGAAGTAA

mRNA sequence

ATGCAACCTCCGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACATACTGGATCTGCCTTCCCAAATCAGAATCAGAATATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCCCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTCGGGATAGGCCAAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTTGGATCCCTTAAGCTTCCTGAGTGGTCCTCTACTTCCGCCTCTTGGTATTCCATTTTGTTCAGCTAGTGTTGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTAGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAATGTGCAATCTGAATTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAGGCTTTAAAATTGACCTGGGATGTCTCACCGTTCTTCCCATCTCACTTGGTCTCTCCCTTTTGGTTCAGGCTCCAAGTTTCCATGCTAAAATATTCATTCATGTTGGCCCCATTTGCCACTACCATATGAATCACCTGGGTGAGAGCTTCAGGTGCTCTCAATTGCAAATTTTCTTTCCCGCTCAGATAACCTACAGTTCTATGTGCTGCAAAATATTTTCTTCATTTTTAGCTGAATATGGTCGTATTATGAAGTAA

Coding sequence (CDS)

ATGCAACCTCCGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACATACTGGATCTGCCTTCCCAAATCAGAATCAGAATATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCCCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTCGGGATAGGCCAAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTTGGATCCCTTAAGCTTCCTGAGTGGTCCTCTACTTCCGCCTCTTGGTATTCCATTTTGTTCAGCTAGTGTTGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTAGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAATGTGCAATCTGAATTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAGGCTTTAAAATTGACCTGGGATGTCTCACCGTTCTTCCCATCTCACTTGGTCTCTCCCTTTTGGTTCAGGCTCCAAGTTTCCATGCTAAAATATTCATTCATGTTGGCCCCATTTGCCACTACCATATGAATCACCTGGGTGAGAGCTTCAGGTGCTCTCAATTGCAAATTTTCTTTCCCGCTCAGATAACCTACAGTTCTATGTGCTGCAAAATATTTTCTTCATTTTTAGCTGAATATGGTCGTATTATGAAGTAA

Protein sequence

MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKGFKIDLGCLTVLPISLGLSLLVQAPSFHAKIFIHVGPICHYHMNHLGESFRCSQLQIFFPAQITYSSMCCKIFSSFLAEYGRIMK
Homology
BLAST of CmUC05G098490 vs. NCBI nr
Match: XP_038899147.1 (uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] >XP_038899148.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida])

HSP 1 Score: 773.9 bits (1997), Expect = 8.5e-220
Identity = 387/410 (94.39%), Postives = 395/410 (96.34%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPP IN SGHAQSVLQPSN SPCR+DGPE TGSAFPNQNQ+I
Sbjct: 61  QLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRDDGPEQTGSAFPNQNQSI 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+WSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDS+SKVITENGNV
Sbjct: 121 PIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           TMCDYQRPVQHLQAVAELPENDIDGAV RPSEKPRIHPTEAAILEEGEEVEQ DPLSFL 
Sbjct: 181 TMCDYQRPVQHLQAVAELPENDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPV+SSG  DFLSCYDSIGLSDS TVRKRMEQIATAQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKV+N MWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPT 360

Query: 361 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           NHLRVQNSN +SE LQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 410

BLAST of CmUC05G098490 vs. NCBI nr
Match: XP_004136450.1 (uncharacterized protein LOC101212293 [Cucumis sativus] >KGN60169.1 hypothetical protein Csa_001313 [Cucumis sativus])

HSP 1 Score: 764.2 bits (1972), Expect = 6.8e-217
Identity = 381/410 (92.93%), Postives = 395/410 (96.34%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSPCREDGPE TGSAFPNQNQ+ 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDSSSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           T+CDYQRPV++LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPLSFL 
Sbjct: 181 TLCDYQRPVRYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 360
           LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPT
Sbjct: 301 LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           NHLRVQNSN +SE LQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 410

BLAST of CmUC05G098490 vs. NCBI nr
Match: XP_008466308.1 (PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903596.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903597.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903598.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >KAA0038755.1 SAGA-Tad1 domain-containing protein [Cucumis melo var. makuwa] >TYK31368.1 SAGA-Tad1 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 760.8 bits (1963), Expect = 7.5e-216
Identity = 381/410 (92.93%), Postives = 394/410 (96.10%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSPCREDGPE TGSAFPNQNQ+ 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           T+CDYQRPVQ+LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPL FL 
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           NHLRVQN+N +SE LQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 NHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409

BLAST of CmUC05G098490 vs. NCBI nr
Match: XP_022976270.1 (uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976271.1 uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976272.1 uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976273.1 uncharacterized protein LOC111476715 [Cucurbita maxima])

HSP 1 Score: 748.4 bits (1931), Expect = 3.8e-212
Identity = 377/411 (91.73%), Postives = 388/411 (94.40%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ  QSSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENI
Sbjct: 1   MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+PCRED PE TGSAFPNQNQ+I
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNV
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           TMCDYQRPVQ LQAVAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQ DPLSFL 
Sbjct: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300

Query: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWP 360
           GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKVINGMWP
Sbjct: 301 GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360

Query: 361 TNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           TNHLRVQNSN +SE L+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409

BLAST of CmUC05G098490 vs. NCBI nr
Match: XP_022142878.1 (uncharacterized protein LOC111012883 [Momordica charantia])

HSP 1 Score: 746.9 bits (1927), Expect = 1.1e-211
Identity = 370/410 (90.24%), Postives = 389/410 (94.88%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSPCREDGPE TGSAFPNQNQ +
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQTV 120

Query: 121 PLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGN 180
           P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGN
Sbjct: 121 PIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKVTCLSYPSTGTEDSGSKVITENGN 180

Query: 181 VTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFL 240
           VT+CDYQRPVQHLQAVAELPENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQ DPLSFL
Sbjct: 181 VTLCDYQRPVQHLQAVAELPENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFL 240

Query: 241 SGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
            GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Sbjct: 241 RGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL 300

Query: 301 EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           EGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKVINGMWP+N
Sbjct: 301 EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSN 360

Query: 361 HLRVQN-SNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           HLRVQN SN + E LQEKSL+CSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 HLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDWPLLLEK 410

BLAST of CmUC05G098490 vs. ExPASy TrEMBL
Match: A0A0A0LGS9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881820 PE=4 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 3.3e-217
Identity = 381/410 (92.93%), Postives = 395/410 (96.34%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSPCREDGPE TGSAFPNQNQ+ 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDSSSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           T+CDYQRPV++LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPLSFL 
Sbjct: 181 TLCDYQRPVRYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 360
           LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPT
Sbjct: 301 LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           NHLRVQNSN +SE LQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 410

BLAST of CmUC05G098490 vs. ExPASy TrEMBL
Match: A0A1S4E5S7 (uncharacterized protein LOC103503757 OS=Cucumis melo OX=3656 GN=LOC103503757 PE=4 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 3.6e-216
Identity = 381/410 (92.93%), Postives = 394/410 (96.10%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSPCREDGPE TGSAFPNQNQ+ 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           T+CDYQRPVQ+LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPL FL 
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           NHLRVQN+N +SE LQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 NHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409

BLAST of CmUC05G098490 vs. ExPASy TrEMBL
Match: A0A5A7TBJ9 (SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G006740 PE=4 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 3.6e-216
Identity = 381/410 (92.93%), Postives = 394/410 (96.10%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSPCREDGPE TGSAFPNQNQ+ 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           T+CDYQRPVQ+LQ+VAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPL FL 
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           NHLRVQN+N +SE LQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 NHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409

BLAST of CmUC05G098490 vs. ExPASy TrEMBL
Match: A0A6J1IIZ9 (uncharacterized protein LOC111476715 OS=Cucurbita maxima OX=3661 GN=LOC111476715 PE=4 SV=1)

HSP 1 Score: 748.4 bits (1931), Expect = 1.9e-212
Identity = 377/411 (91.73%), Postives = 388/411 (94.40%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ  QSSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENI
Sbjct: 1   MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+PCRED PE TGSAFPNQNQ+I
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
           P+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNV
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLS 240
           TMCDYQRPVQ LQAVAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQ DPLSFL 
Sbjct: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300

Query: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWP 360
           GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKVINGMWP
Sbjct: 301 GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360

Query: 361 TNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           TNHLRVQNSN +SE L+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409

BLAST of CmUC05G098490 vs. ExPASy TrEMBL
Match: A0A6J1CPD1 (uncharacterized protein LOC111012883 OS=Momordica charantia OX=3673 GN=LOC111012883 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 5.4e-212
Identity = 370/410 (90.24%), Postives = 389/410 (94.88%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
           QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSPCREDGPE TGSAFPNQNQ +
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQTV 120

Query: 121 PLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGN 180
           P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGN
Sbjct: 121 PIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKVTCLSYPSTGTEDSGSKVITENGN 180

Query: 181 VTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFL 240
           VT+CDYQRPVQHLQAVAELPENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQ DPLSFL
Sbjct: 181 VTLCDYQRPVQHLQAVAELPENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFL 240

Query: 241 SGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
            GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Sbjct: 241 RGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL 300

Query: 301 EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           EGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKVINGMWP+N
Sbjct: 301 EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSN 360

Query: 361 HLRVQN-SNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           HLRVQN SN + E LQEKSL+CSVSLLDFKVAMELNPKQLGEDWPLLLEK
Sbjct: 361 HLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDWPLLLEK 410

BLAST of CmUC05G098490 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 396.7 bits (1018), Expect = 2.6e-110
Identity = 218/413 (52.78%), Postives = 279/413 (67.55%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ  Q  RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
            LHNQLIRSIL+NA VAK+PPP + +GH+      +N    R DG E +G+  PN +Q+ 
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGHSTK----ANAFQSRGDGLEQSGTLIPNHSQHE 120

Query: 121 PLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGN 180
           P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG 
Sbjct: 121 PVWSNGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG- 180

Query: 181 VTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQLDPL 240
               DYQR  +++        ++ DG   RP EKPRI   E   A  + + +  E+   +
Sbjct: 181 ----DYQRSGRYV-------ADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARV 240

Query: 241 SFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATA 300
           +    PL+ PLGIPFCSASVGG+ + +PVS++ + +SCYDS GL D E +RKRME IA A
Sbjct: 241 NLSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVA 300

Query: 301 QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVINGM 360
           QGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q K++NG+
Sbjct: 301 QGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGV 360

Query: 361 WPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
           WPTN L++Q  N  S+  Q+     SVS+LDF+ AMELNP+QLGEDWP L E+
Sbjct: 361 WPTNSLKIQTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRER 394

BLAST of CmUC05G098490 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 311.2 bits (796), Expect = 1.5e-84
Identity = 192/411 (46.72%), Postives = 253/411 (61.56%), Query Frame = 0

Query: 1   MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ  Q  RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNI 120
            LHN+LIRSIL+NA +AK+PP ++ SGH    L        +EDGPE + S  P+  +N 
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLG-----KEDGPEESRSLNPDHIRND 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNV 180
              SNGVL    R G    R   RD+P PLG NGK+                       +
Sbjct: 121 LALSNGVL-AKVRPGTCDDR-TIRDKPCPLGSNGKV-----------------------L 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE--AAILEEGEEVEQLDPLSF 240
               Y RP ++         ++ D A   P+E+  +   +  AA +   +E  Q+  LS 
Sbjct: 181 GPFAYSRPGRY--------PDERDSAFLCPAEQKAVSGKDQVAAPISRDDEA-QVRILS- 240

Query: 241 LSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG 300
            + P++ PLGIPFCSASVGG R+ +PVS+S   +SCYDS GLSD+E +RKRME IA  QG
Sbjct: 241 -TPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVTQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVINGMWP 360
           L GVS EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+  
Sbjct: 301 LGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNGVRT 360

Query: 361 TNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
            N   +Q SN  S+  +E+    SVSLLDF+VAMELNP QLGEDWPLL E+
Sbjct: 361 NNSFHIQTSNQPSDITREQH---SVSLLDFRVAMELNPHQLGEDWPLLRER 367

BLAST of CmUC05G098490 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 177.2 bits (448), Expect = 3.3e-44
Identity = 140/410 (34.15%), Postives = 202/410 (49.27%), Query Frame = 0

Query: 6   SSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQ 65
           SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 66  LIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNIPLWSN 125
           LIRSI+KNAC+AK+PP I   G   S ++  N                   +Q  PL  +
Sbjct: 67  LIRSIIKNACIAKSPPFIKKGG---SFVRFGNGDS-------------KKNSQIQPLHGD 126

Query: 126 GVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDY 185
                S RK RS    K RDRPSPLGP GK   L   +T  E+S SK             
Sbjct: 127 SAFSPSTRKCRS---RKLRDRPSPLGPLGKPHSL---TTTNEESMSKA------------ 186

Query: 186 QRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQL---DPLSFLSGP 245
                  Q+  EL          RP       P E   +EEGEEVEQ+    P      P
Sbjct: 187 -------QSATELL-----SLGSRP-------PVEVVSVEEGEEVEQIAGGSPSVQSRCP 246

Query: 246 LLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG 305
           L  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Sbjct: 247 LTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLERRLEMEG 306

Query: 306 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 365
           L+ ++M+  ++LN+ LDV++++LI+ CL L   R   +                      
Sbjct: 307 LK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD---------------------- 334

Query: 366 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
              RV+  N Q  + Q+      VS+ DF+  MELN + LGEDWP+ +EK
Sbjct: 367 ---RVREMNYQ--YTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEK 334

BLAST of CmUC05G098490 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 177.2 bits (448), Expect = 3.3e-44
Identity = 140/410 (34.15%), Postives = 202/410 (49.27%), Query Frame = 0

Query: 6   SSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQ 65
           SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 66  LIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNIPLWSN 125
           LIRSI+KNAC+AK+PP I   G   S ++  N                   +Q  PL  +
Sbjct: 67  LIRSIIKNACIAKSPPFIKKGG---SFVRFGNGDS-------------KKNSQIQPLHGD 126

Query: 126 GVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDY 185
                S RK RS    K RDRPSPLGP GK   L   +T  E+S SK             
Sbjct: 127 SAFSPSTRKCRS---RKLRDRPSPLGPLGKPHSL---TTTNEESMSKA------------ 186

Query: 186 QRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQL---DPLSFLSGP 245
                  Q+  EL          RP       P E   +EEGEEVEQ+    P      P
Sbjct: 187 -------QSATELL-----SLGSRP-------PVEVVSVEEGEEVEQIAGGSPSVQSRCP 246

Query: 246 LLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG 305
           L  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Sbjct: 247 LTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLERRLEMEG 306

Query: 306 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPT 365
           L+ ++M+  ++LN+ LDV++++LI+ CL L   R   +                      
Sbjct: 307 LK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD---------------------- 334

Query: 366 NHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
              RV+  N Q  + Q+      VS+ DF+  MELN + LGEDWP+ +EK
Sbjct: 367 ---RVREMNYQ--YTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEK 334

BLAST of CmUC05G098490 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 162.5 bits (410), Expect = 8.4e-40
Identity = 129/404 (31.93%), Postives = 184/404 (45.54%), Query Frame = 0

Query: 7   SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQL 66
           SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L
Sbjct: 8   SRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRL 67

Query: 67  IRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTGSAFPNQNQNIPLWSNG 126
           +RSILKNA VAK+PPP                              +P ++    L+ + 
Sbjct: 68  VRSILKNASVAKSPPP-----------------------------RYPKKS----LYGDP 127

Query: 127 VLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQ 186
           V P SPRK RS    KFRDRPSPLGP GK   L   +T  ++S SK              
Sbjct: 128 VFPPSPRKCRS---RKFRDRPSPLGPLGKPQSL---TTTNDESMSK-------------- 187

Query: 187 RPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQL--DPLSFLSGPLL 246
              Q L                         P E   +E+GEEVEQ+   P      PL 
Sbjct: 188 --AQRL-------------------------PMEVVSVEDGEEVEQMTGSPSVQSRSPLT 247

Query: 247 PPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSM 306
            PLG+ F   S    +      +  +  +C  S  L D  T+R R+E+    +G++ +SM
Sbjct: 248 APLGVSFHLKS----KARFSTYNGINRETCQSSGELPDMITLRARLEKKLEMEGIK-LSM 283

Query: 307 ECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQ 366
           +  N+LN  L+ Y+++LI+ CL L                                    
Sbjct: 308 DSANLLNRGLNAYMRRLIEPCLSLAS---------------------------------- 283

Query: 367 NSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK 409
                    Q+K    +VS+LDF  AME+NP+ LGE+WP+ LEK
Sbjct: 368 ---------QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEK 283

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899147.18.5e-22094.39uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] >XP_03889914... [more]
XP_004136450.16.8e-21792.93uncharacterized protein LOC101212293 [Cucumis sativus] >KGN60169.1 hypothetical ... [more]
XP_008466308.17.5e-21692.93PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903596.1 P... [more]
XP_022976270.13.8e-21291.73uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976271.1 uncharac... [more]
XP_022142878.11.1e-21190.24uncharacterized protein LOC111012883 [Momordica charantia][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LGS93.3e-21792.93Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881820 PE=4 SV=1[more]
A0A1S4E5S73.6e-21692.93uncharacterized protein LOC103503757 OS=Cucumis melo OX=3656 GN=LOC103503757 PE=... [more]
A0A5A7TBJ93.6e-21692.93SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A6J1IIZ91.9e-21291.73uncharacterized protein LOC111476715 OS=Cucurbita maxima OX=3661 GN=LOC111476715... [more]
A0A6J1CPD15.4e-21290.24uncharacterized protein LOC111012883 OS=Momordica charantia OX=3673 GN=LOC111012... [more]
Match NameE-valueIdentityDescription
AT2G24530.12.6e-11052.78unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G31440.11.5e-8446.72unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.13.3e-4434.15unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.23.3e-4434.15unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G14850.18.4e-4031.93unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PFAMPF12767SAGA-Tad1coord: 5..329
e-value: 1.1E-59
score: 202.0
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PANTHERPTHR21277TRANSCRIPTIONAL ADAPTER 1coord: 1..408
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 87..118
NoneNo IPR availablePANTHERPTHR21277:SF38TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNITcoord: 1..408

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC05G098490.1CmUC05G098490.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0000124 SAGA complex
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003713 transcription coactivator activity