CaUC05G096940 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC05G096940
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionSAGA-Tad1 domain-containing protein
LocationCiama_Chr05: 28291587 .. 28292837 (+)
RNA-Seq ExpressionCaUC05G096940
SyntenyCaUC05G096940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAACCTGAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTAGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGATCTGCCTTCCCAAATCAGAATCAGAGTATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGAGATAGGCCAAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATGGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTTCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGAAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCGAATCCGTTAAGCTTCCTGAGTGGTCCTCTACTTCCACCACTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTCCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGTTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATATCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGTTGTTGGAGAAAGTTTGTATGCGTGCCTTCGAGGAATAA

mRNA sequence

ATGCAACCTGAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTAGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGATCTGCCTTCCCAAATCAGAATCAGAGTATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGAGATAGGCCAAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATGGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTTCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGAAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCGAATCCGTTAAGCTTCCTGAGTGGTCCTCTACTTCCACCACTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTCCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGTTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATATCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGTTGTTGGAGAAAGTTTGTATGCGTGCCTTCGAGGAATAA

Coding sequence (CDS)

ATGCAACCTGAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTAGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATTTTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGATCTGCCTTCCCAAATCAGAATCAGAGTATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGAGATAGGCCAAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATGGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTTCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGAAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCGAATCCGTTAAGCTTCCTGAGTGGTCCTCTACTTCCACCACTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTCCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGTTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATATCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGTTGTTGGAGAAAGTTTGTATGCGTGCCTTCGAGGAATAA

Protein sequence

MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE
Homology
BLAST of CaUC05G096940 vs. NCBI nr
Match: XP_038899147.1 (uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] >XP_038899148.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida])

HSP 1 Score: 792.0 bits (2044), Expect = 2.6e-225
Identity = 395/418 (94.50%), Postives = 404/418 (96.65%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL ND+SKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP IN SGHAQSVLQPSN SPCR+DGPEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRDDGPEQTGSAFPNQNQSI 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+WSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTED +SKVITENGNV
Sbjct: 121 PIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           TMCDYQRPVQHLQAVAELPENDID AV RPSEKPRIHPTEAAILEEGEEVEQS+PLSFL 
Sbjct: 181 TMCDYQRPVQHLQAVAELPENDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPV+SSG  DFLSCYDSIGLSDS TVRKRMEQIATAQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGK +N MWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPT 360

Query: 361 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK+CMRAFEE
Sbjct: 361 NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRAFEE 418

BLAST of CaUC05G096940 vs. NCBI nr
Match: XP_004136450.1 (uncharacterized protein LOC101212293 [Cucumis sativus] >KGN60169.1 hypothetical protein Csa_001313 [Cucumis sativus])

HSP 1 Score: 778.5 bits (2009), Expect = 2.9e-221
Identity = 388/418 (92.82%), Postives = 403/418 (96.41%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSPCREDGPEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+ED SSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           T+CDYQRPV++LQ+VAELPENDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PLSFL 
Sbjct: 181 TLCDYQRPVRYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 360
           LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK +NGMWPT
Sbjct: 301 LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 418

BLAST of CaUC05G096940 vs. NCBI nr
Match: XP_008466308.1 (PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903596.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903597.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903598.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >KAA0038755.1 SAGA-Tad1 domain-containing protein [Cucumis melo var. makuwa] >TYK31368.1 SAGA-Tad1 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 776.2 bits (2003), Expect = 1.5e-220
Identity = 389/418 (93.06%), Postives = 402/418 (96.17%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSPCREDGPEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+ED SSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           T+CDYQRPVQ+LQ+VAELPENDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PL FL 
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK +NGMWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 NHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of CaUC05G096940 vs. NCBI nr
Match: XP_022142878.1 (uncharacterized protein LOC111012883 [Momordica charantia])

HSP 1 Score: 762.7 bits (1968), Expect = 1.7e-216
Identity = 376/418 (89.95%), Postives = 398/418 (95.22%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSPCREDGPEQTGSAFPNQNQ++
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQTV 120

Query: 121 PLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGN 180
           P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTED  SKVITENGN
Sbjct: 121 PIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKVTCLSYPSTGTEDSGSKVITENGN 180

Query: 181 VTMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFL 240
           VT+CDYQRPVQHLQAVAELPENDI+ AVQRPSEKPRIHPTEAAILE+GEEVEQS+PLSFL
Sbjct: 181 VTLCDYQRPVQHLQAVAELPENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFL 240

Query: 241 SGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
            GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Sbjct: 241 RGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL 300

Query: 301 EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPTN 360
           EGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK INGMWP+N
Sbjct: 301 EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSN 360

Query: 361 HLRVQN-INGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           HLRVQN  NGR EVLQEKSL+CSVSLLDFKVAMELNPKQLGEDWPLLLEK+CMR FEE
Sbjct: 361 HLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRTFEE 418

BLAST of CaUC05G096940 vs. NCBI nr
Match: XP_022976270.1 (uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976271.1 uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976272.1 uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976273.1 uncharacterized protein LOC111476715 [Cucurbita maxima])

HSP 1 Score: 759.6 bits (1960), Expect = 1.4e-215
Identity = 383/419 (91.41%), Postives = 395/419 (94.27%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ + SSRIDLGDLKAQIVKKL NDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENI
Sbjct: 1   MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+PCRED PEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNV
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           TMCDYQRPVQ LQAVAELPENDID +VQRPS KPRI PTEA+ILEEGEEVEQS+PLSFL 
Sbjct: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300

Query: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWP 360
           GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGK INGMWP
Sbjct: 301 GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360

Query: 361 TNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           TNHLRVQN NGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of CaUC05G096940 vs. ExPASy TrEMBL
Match: A0A0A0LGS9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881820 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 1.4e-221
Identity = 388/418 (92.82%), Postives = 403/418 (96.41%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSPCREDGPEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+ED SSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           T+CDYQRPV++LQ+VAELPENDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PLSFL 
Sbjct: 181 TLCDYQRPVRYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 360
           LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK +NGMWPT
Sbjct: 301 LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 418

BLAST of CaUC05G096940 vs. ExPASy TrEMBL
Match: A0A1S4E5S7 (uncharacterized protein LOC103503757 OS=Cucumis melo OX=3656 GN=LOC103503757 PE=4 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 7.0e-221
Identity = 389/418 (93.06%), Postives = 402/418 (96.17%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSPCREDGPEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+ED SSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           T+CDYQRPVQ+LQ+VAELPENDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PL FL 
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK +NGMWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 NHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of CaUC05G096940 vs. ExPASy TrEMBL
Match: A0A5A7TBJ9 (SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G006740 PE=4 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 7.0e-221
Identity = 389/418 (93.06%), Postives = 402/418 (96.17%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSPCREDGPEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+ED SSKVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           T+CDYQRPVQ+LQ+VAELPENDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PL FL 
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 360
           LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK +NGMWPT
Sbjct: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPT 360

Query: 361 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 NHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of CaUC05G096940 vs. ExPASy TrEMBL
Match: A0A6J1CPD1 (uncharacterized protein LOC111012883 OS=Momordica charantia OX=3673 GN=LOC111012883 PE=4 SV=1)

HSP 1 Score: 762.7 bits (1968), Expect = 8.1e-217
Identity = 376/418 (89.95%), Postives = 398/418 (95.22%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSPCREDGPEQTGSAFPNQNQ++
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQTV 120

Query: 121 PLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGN 180
           P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTED  SKVITENGN
Sbjct: 121 PIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKVTCLSYPSTGTEDSGSKVITENGN 180

Query: 181 VTMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFL 240
           VT+CDYQRPVQHLQAVAELPENDI+ AVQRPSEKPRIHPTEAAILE+GEEVEQS+PLSFL
Sbjct: 181 VTLCDYQRPVQHLQAVAELPENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFL 240

Query: 241 SGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
            GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Sbjct: 241 RGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL 300

Query: 301 EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPTN 360
           EGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK INGMWP+N
Sbjct: 301 EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSN 360

Query: 361 HLRVQN-INGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           HLRVQN  NGR EVLQEKSL+CSVSLLDFKVAMELNPKQLGEDWPLLLEK+CMR FEE
Sbjct: 361 HLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRTFEE 418

BLAST of CaUC05G096940 vs. ExPASy TrEMBL
Match: A0A6J1IIZ9 (uncharacterized protein LOC111476715 OS=Cucurbita maxima OX=3661 GN=LOC111476715 PE=4 SV=1)

HSP 1 Score: 759.6 bits (1960), Expect = 6.8e-216
Identity = 383/419 (91.41%), Postives = 395/419 (94.27%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ + SSRIDLGDLKAQIVKKL NDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENI
Sbjct: 1   MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+PCRED PEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
           P+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNV
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNV 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLS 240
           TMCDYQRPVQ LQAVAELPENDID +VQRPS KPRI PTEA+ILEEGEEVEQS+PLSFL 
Sbjct: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300

Query: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWP 360
           GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGK INGMWP
Sbjct: 301 GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360

Query: 361 TNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
           TNHLRVQN NGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEK+ MRAFEE
Sbjct: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of CaUC05G096940 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 402.1 bits (1032), Expect = 5.3e-112
Identity = 221/421 (52.49%), Postives = 283/421 (67.22%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ     RI L +LK  IVKK   ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
            LHNQLIRSIL+NA VAK+PPP + +GH+      +N    R DG EQ+G+  PN +Q  
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGHSTK----ANAFQSRGDGLEQSGTLIPNHSQHE 120

Query: 121 PLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGN 180
           P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED    V  ENG 
Sbjct: 121 PVWSNGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG- 180

Query: 181 VTMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTE---AAILEEGEEVEQSNPL 240
               DYQR  +++        ++ D    RP EKPRI   E   A  + + +  E+   +
Sbjct: 181 ----DYQRSGRYV-------ADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARV 240

Query: 241 SFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATA 300
           +    PL+ PLGIPFCSASVGG+ + +PVS++ + +SCYDS GL D E +RKRME IA A
Sbjct: 241 NLSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVA 300

Query: 301 QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKFINGM 360
           QGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q K +NG+
Sbjct: 301 QGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGV 360

Query: 361 WPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFE 417
           WPTN L++Q  NG S++ Q+     SVS+LDF+ AMELNP+QLGEDWP L E++ +R+FE
Sbjct: 361 WPTNSLKIQTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRERISLRSFE 402

BLAST of CaUC05G096940 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 308.5 bits (789), Expect = 8.0e-84
Identity = 191/419 (45.58%), Postives = 255/419 (60.86%), Query Frame = 0

Query: 1   MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60
           MQ     RIDL +LK  IVKK+  ++S RYF+YL RFL QKL+K EFDK C R+LGREN+
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  QLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSI 120
            LHN+LIRSIL+NA +AK+PP ++ SGH    L        +EDGPE++ S  P+  ++ 
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLG-----KEDGPEESRSLNPDHIRND 120

Query: 121 PLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNV 180
              SNGVL    R G    R   RD+P PLG NGK+                       +
Sbjct: 121 LALSNGVL-AKVRPGTCDDR-TIRDKPCPLGSNGKV-----------------------L 180

Query: 181 TMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTE--AAILEEGEEVEQSNPLSF 240
               Y RP ++         ++ D A   P+E+  +   +  AA +   +E  Q   LS 
Sbjct: 181 GPFAYSRPGRY--------PDERDSAFLCPAEQKAVSGKDQVAAPISRDDEA-QVRILS- 240

Query: 241 LSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG 300
            + P++ PLGIPFCSASVGG R+ +PVS+S   +SCYDS GLSD+E +RKRME IA  QG
Sbjct: 241 -TPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVTQG 300

Query: 301 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKFINGMWP 360
           L GVS EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ + +NG+  
Sbjct: 301 LGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNGVRT 360

Query: 361 TNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
            N   +Q  N  S++ +E+    SVSLLDF+VAMELNP QLGEDWPLL E++ +  FEE
Sbjct: 361 NNSFHIQTSNQPSDITREQH---SVSLLDFRVAMELNPHQLGEDWPLLRERISISLFEE 375

BLAST of CaUC05G096940 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 177.6 bits (449), Expect = 2.1e-44
Identity = 140/418 (33.49%), Postives = 206/418 (49.28%), Query Frame = 0

Query: 6   SSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQ 65
           SSR+D  ++KA I +++ N +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 66  LIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSIPLWSN 125
           LIRSI+KNAC+AK+PP I   G   S ++  N                   +Q  PL  +
Sbjct: 67  LIRSIIKNACIAKSPPFIKKGG---SFVRFGNGDS-------------KKNSQIQPLHGD 126

Query: 126 GVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDY 185
                S RK RS    K RDRPSPLGP GK   L   +T  E+  SK             
Sbjct: 127 SAFSPSTRKCRS---RKLRDRPSPLGPLGKPHSL---TTTNEESMSKA------------ 186

Query: 186 QRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQ---SNPLSFLSGP 245
                  Q+  EL          RP       P E   +EEGEEVEQ    +P      P
Sbjct: 187 -------QSATELL-----SLGSRP-------PVEVVSVEEGEEVEQIAGGSPSVQSRCP 246

Query: 246 LLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG 305
           L  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Sbjct: 247 LTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLERRLEMEG 306

Query: 306 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 365
           L+ ++M+  ++LN+ LDV++++LI+ CL L   R   +                      
Sbjct: 307 LK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD---------------------- 342

Query: 366 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
              RV+ +N   +  Q+      VS+ DF+  MELN + LGEDWP+ +EK+C RA ++
Sbjct: 367 ---RVREMN--YQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342

BLAST of CaUC05G096940 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 177.6 bits (449), Expect = 2.1e-44
Identity = 140/418 (33.49%), Postives = 206/418 (49.28%), Query Frame = 0

Query: 6   SSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQ 65
           SSR+D  ++KA I +++ N +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 66  LIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSIPLWSN 125
           LIRSI+KNAC+AK+PP I   G   S ++  N                   +Q  PL  +
Sbjct: 67  LIRSIIKNACIAKSPPFIKKGG---SFVRFGNGDS-------------KKNSQIQPLHGD 126

Query: 126 GVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDY 185
                S RK RS    K RDRPSPLGP GK   L   +T  E+  SK             
Sbjct: 127 SAFSPSTRKCRS---RKLRDRPSPLGPLGKPHSL---TTTNEESMSKA------------ 186

Query: 186 QRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQ---SNPLSFLSGP 245
                  Q+  EL          RP       P E   +EEGEEVEQ    +P      P
Sbjct: 187 -------QSATELL-----SLGSRP-------PVEVVSVEEGEEVEQIAGGSPSVQSRCP 246

Query: 246 LLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG 305
           L  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Sbjct: 247 LTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLERRLEMEG 306

Query: 306 LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPT 365
           L+ ++M+  ++LN+ LDV++++LI+ CL L   R   +                      
Sbjct: 307 LK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD---------------------- 342

Query: 366 NHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
              RV+ +N   +  Q+      VS+ DF+  MELN + LGEDWP+ +EK+C RA ++
Sbjct: 367 ---RVREMN--YQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342

BLAST of CaUC05G096940 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 167.2 bits (422), Expect = 2.9e-41
Identity = 132/412 (32.04%), Postives = 186/412 (45.15%), Query Frame = 0

Query: 7   SRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQL 66
           SR++  ++KA I +K+ + ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L
Sbjct: 8   SRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRL 67

Query: 67  IRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTGSAFPNQNQSIPLWSNG 126
           +RSILKNA VAK+PPP                              +P ++    L+ + 
Sbjct: 68  VRSILKNASVAKSPPP-----------------------------RYPKKS----LYGDP 127

Query: 127 VLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQ 186
           V P SPRK RS    KFRDRPSPLGP GK   L    T T D S                
Sbjct: 128 VFPPSPRKCRS---RKFRDRPSPLGPLGKPQSL----TTTNDES---------------- 187

Query: 187 RPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQ--SNPLSFLSGPLL 246
                                     K +  P E   +E+GEEVEQ   +P      PL 
Sbjct: 188 ------------------------MSKAQRLPMEVVSVEDGEEVEQMTGSPSVQSRSPLT 247

Query: 247 PPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSM 306
            PLG+ F   S    +      +  +  +C  S  L D  T+R R+E+    +G++ +SM
Sbjct: 248 APLGVSFHLKS----KARFSTYNGINRETCQSSGELPDMITLRARLEKKLEMEGIK-LSM 291

Query: 307 ECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKFINGMWPTNHLRVQ 366
           +  N+LN  L+ Y+++LI+ CL L                                    
Sbjct: 308 DSANLLNRGLNAYMRRLIEPCLSLAS---------------------------------- 291

Query: 367 NINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVCMRAFEE 417
                    Q+K    +VS+LDF  AME+NP+ LGE+WP+ LEK+C RA EE
Sbjct: 368 ---------QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899147.12.6e-22594.50uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] >XP_03889914... [more]
XP_004136450.12.9e-22192.82uncharacterized protein LOC101212293 [Cucumis sativus] >KGN60169.1 hypothetical ... [more]
XP_008466308.11.5e-22093.06PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] >XP_016903596.1 P... [more]
XP_022142878.11.7e-21689.95uncharacterized protein LOC111012883 [Momordica charantia][more]
XP_022976270.11.4e-21591.41uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976271.1 uncharac... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LGS91.4e-22192.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881820 PE=4 SV=1[more]
A0A1S4E5S77.0e-22193.06uncharacterized protein LOC103503757 OS=Cucumis melo OX=3656 GN=LOC103503757 PE=... [more]
A0A5A7TBJ97.0e-22193.06SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A6J1CPD18.1e-21789.95uncharacterized protein LOC111012883 OS=Momordica charantia OX=3673 GN=LOC111012... [more]
A0A6J1IIZ96.8e-21691.41uncharacterized protein LOC111476715 OS=Cucurbita maxima OX=3661 GN=LOC111476715... [more]
Match NameE-valueIdentityDescription
AT2G24530.15.3e-11252.49unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G31440.18.0e-8445.58unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.12.1e-4433.49unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.22.1e-4433.49unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G14850.12.9e-4132.04unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PFAMPF12767SAGA-Tad1coord: 5..329
e-value: 3.4E-59
score: 200.4
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PANTHERPTHR21277TRANSCRIPTIONAL ADAPTER 1coord: 1..416
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 86..126
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 86..125
NoneNo IPR availablePANTHERPTHR21277:SF38TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNITcoord: 1..416

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC05G096940.1CaUC05G096940.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0000124 SAGA complex
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003713 transcription coactivator activity