Clc01G21840 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G21840
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: U-box domain-containing protein 4
Location: ClcChr01: 33236501 .. 33241269 (-)
RNA-Seq Expression: Clc01G21840
Synteny: Clc01G21840
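
As a rough illustration of how the Location field can be read, the sketch below parses the coordinate string and computes the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Hypothetical helper: split "Chrom: start .. end (strand)" into its parts.
def parse_location(text):
    match = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

# Assumption: coordinates are 1-based and inclusive, so both ends count.
def span_length(start, end):
    return end - start + 1

chrom, start, end, strand = parse_location("ClcChr01: 33236501 .. 33241269 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 4769 bp on the minus strand of ClcChr01.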
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGTCGCTTGAAGATTCTCATTCCGCTTCTAATCGGTTTCCTTTAAGTAGAAATTGTTACAGTCCGTCGTCTACCACTTCCAGTAAAATCAGCAGGAACATTGGCCGTTCTATGCGTACTATTCGCTCTAATTTCTTTCAAGATGATAATAGTTGCACCTTTAATGGCTCCGTGGCAGGAAAGTCCGGCTGTGTCTCTGAGAATCTCACTGATTCTGTCATCGACCTTAGGCTTGGCGAGTTGGCATCTCGTAGCCCTAAATGGCCTAATAAGCAATCCTCTGAACAGGAACAGGATTTTCTTGAACTTTCTCATGCTTTCAGTGATTTCTCTGCTTGTAGCAGTGATATTTCCGGAGAGTTGCAGAGACTTGCAAGTTTACCCTCTATGGCGGTTGTTCCTAAAAGAGAAGGTGAAGATGCTGACCCCGAGCCCGAACCGTGCTTAGGGTTTTTGCAGAGGGAGAATTTCTCCACTGAGATTATTGAGAGTATTTCGCCTGAAGATCTACAGCCCACTGTCAAGATCTGTATCGATGGGCTTCAGTCCTCGTCAATTGCAGTGAAGCGATCTGCGGCGGCTAAGCTGAGGCTTCTGGCGAAGAATCGGTCTGATAATCGGCTACTCATTGGGGAATCGGGCGCCGTTCCTGCTTTAATTCCTTTGCTTCGATCTACCGATCCATGGACACAAGAGCACGCTGTAACTGCCTTGCTGAATCTCTCACTCCATGAGTCTAATAAAGTTATAATCACGAATGCTGGAGCCATAAAGTCGCTGGTTTATGCGCTCAAAACCGGCACCGAAACTTCGAAACAGAATGCGGCTTGTGCTCTGATGAGCCTTGCTTTGTTGGAGGAAAACAAGACTTCGATTGGGGTTTGCGGGGCCATTCCACCACTGGTATCTTTACTACTGAATGGATCAAATAGAGGGAAGAAGGACGCGCTCACGACGCTCTACAAGCTTTGCTCGATCAAACCGAACAAGGAACGGGCCGTCACTGCTGGAGCTGTGAAGCCATTGGTGGCGCTAGTAGCGGAGCAAGGCACGGGTTTAGCGGAGAAGGCGATGGTGGTTTTGAGTAGCCTGGCTGGGATTCAAGAGGGGAAGGATGCGATTGTGGAAGAGGGTGGAATTGCTGCACTCGTGGAAGCAATTGAAGATGGGTCGGTGAAAGGGAAAGAGTTTGCAGTATTGACACTCTTGCAATTGTGTGTTGAGAGTGTGAGAAATAGAGGGTTGCTCGTAAGGGAAGGTGGAATTCCACCTCTGGTTGCACTTTCTCAGACTGGAAGCGTTCGAGCAAAGCATAAGGTATTTACCAAAAACTGACTAGTCACCAATTATTACACGATTCTCGTTAACTGATTCTATGAAATGATCACCTTGTTGATGGCAATTTCCTAATCTTTGCAGGCAGAAACGCTTCTGGGTTATTTAAGAGAACCAAGACAAGAAGCATCCTCATCAAGTCCTTAACAGAAGGCCTGAGATGGGTTTGTTTAGAAGGTAATTGTATTATTAAGTAGTGGTATCGTGGTTGCTGTATATAGGGAGTGGTTTTTTGATGGTTTGTACAGCGTTGGTTGAGAATGTTGCAAGAATGAAGATCGTAGAATGAAGTAGTGCAGTTAGTAAAGCCAAAAACTGACGGGTAGAGAACAAGATGTATGATTTTTAACTAGGTTGTGGGATATCCTTCTCAATGACATTCCGTTTTTTACTAAATAGTGGAATATCACTTGGTTTGCTTTTCTGGGTTTGGGGATTTTATGAAATTCTGATTGGTTGTATGTCTTGTATGGTTGAACCCAAATTATGTGGAACTGATCGTCCTGTTTTACATATTAAAGCTGGAATTGCACAAATTTTGTATATTCAGAGTGTTGGATACTCAGCACAAGTTAATGTCTCTTTTTCTTCTCCTTTAATTGTGATTGAAACTTCACTGAATTCAGTGGTCTGAGAAGCTTGAAGTTGTTCTATTTTCATAC
AAAAGGAGAAATATCTCTCTTTCTTGCCCTACTTGCAATTGCATGAACATATGAACATCATCATCTCAGGCGAAGGCCACAGGCTGGGTGGTGTGGATGCCCCACCATAGATTTAGAAATATTTTGTAGGGTTTTTAGTCTCTTACTGCGTCTTGTCGTTTAGGGGTGAATTATTAGTTGGCAGAGCAGGTATGTTATTGTCCTGCCGCTGCTTGCTGTGCTGTTGATCATTTGCTATTTCTATTCAAAATCTCCAATTTTTTACACCACCCAACTGTCGGTGTCCTTCAGCTGAAGGGCACCCTACAAAGTCTCCCCATCTCTCTCTGTGGCTTGTTATTATTTTTTATTTCTTTTAACGTTAGTTGAGCTGTATTTGAACGGCCTTTCCTTTCCTTTCCTTTCCTTTCATCGCAAACAAAGGTTCCACAGTTAATCTTTCAATTCTTTCCTTTTCCATTTCTATTCGAGTTCATTTAATGATCGTTTCATTACTCTCCATTTCACTCTATTGAAGACGGAAAAAAACAATGAACGAACTAGTAAGAAGAAAAGACAATAAAGCAAGAGTTGCAAGTGGATAGAAGTTAATAAATAATAACCAATCATGCATTGTTGTCGTCCACAATTTATTCCAACTTACCAAAAGTTTCTCACAATGAAACACCAAGAGAAACATCAAGTGTTGTTAAATAAAAGTTCACGTCAATTCTGCTGAAGATTATGAACTTAAATACATTCTAAAGTAAAGAATCATTGTGCAGATGCTTCCTTCTGGCCTTTACTGCTCTTATGCTTCTTCCCTTGCTTAAGCAGCTCTTCAACCTCCTCGAATTGCAGTCCTTTAGTTTCAGGTACTAAGAAATATATACCAATCAATCCAAGCAATGAGAATCCTGCAAACAGCAGGAATGTTCCAGCTGCCCCAAGATTCTCCACCAATGTTAAAAATGTCTGACTCACTATCAGATTTGATACCCAATTTGAAACTGCAGCAATTCCTCCTCCAGTTCCTCTATATCTAAGTGGGTAAATCTCTGAGTTCAACACCCATGGTACTGTTCCCATTCCAGGTGCATATGAAATTATGTAAAGCCCCATCACCACCACTGCCAAGAAGCCGATCTTGCTTGGGCAGCCTTCTGTGAACCATACTCGTCGGTTCGACCGACATTCGCCCCTCACGTTCTTTGTCAAATCCAAGCATGCTCCAGGGAGATACTGATATTTCATTCAACATATAAACTTTGTGAGTTTCTGTTGATTGTTACGGTTGGAATGACTTCAATAAGGGAAAGAAAGCAAGTTGATGTTTTAATACCTCATTATCTCCATTGGCACAGAAACCGCATTTTTGTCTTAAACATGACATGCAGTTCCATGAAGATGCATCTGGAGCTGAGACATAGGCTGGACAGGTAGAGTTACTACCAAAGTGAGTAGATTCGAGGGCGTTGACGGGTGGAGCATGGCTAGCAGATTGGAAGAACACTCCGGCCAACACTACAAGGCAAGCGATAATTCCGAACATTGAGATGATCATAATTCGTCTTCTCCCGTATCTGTCAACTGTAAGCATGCTGACAACAGTGCCAGCAGCATTGAGAAATGATGTAACGAGAGATAACGCCATGGCTGTTGTATTAGAAGCATATCCAGCAAACTGCATGATGGTTGGACTGTAGTACATAACAGTGTTGATACCACAGAACTGCTGAGCTACTTGGACGATGATCCCGGCCCAGAGCCCTCTTCGAACGACTTGGCTACTCAGAGCACCTTTAACTTTGGCTATTATACTGCCATCTCCAATTGCCCCTTCTTCTGCCTTTTCTGTTTCCACAGATTCATGCAACAATCTCATCTCTTCATCAACTTGATTAGCAGGATATATCTTCTCCAGTATTGCTCTTGCTTCATCTACTTTGTCCTGATCATCAGAAAAATGAGTATATCAGTAACTTTGATCAATCACTTCTAATCCAAGGGAAACTTTACTC
ACAAACACTACTAATGCACGCTCATAAGGAAACACAAATAGACATGAGAAACTTTAGGAAGTGCTACTGACTTATGAAGTCGGTGGGTCAAACACCATTAAAATACTAAAACAGTTTACTATTATTGTTTGAGAAGCTCAAAAGGAAAAAAAAGTTGTAGCATTTTTCAAACATTACCCGTCTATAAAGCCACCTAGGAGACTCAGGCAGCGATAACATTAAAACAAACTGAACCACAGCAGGAAGTCCTGCTACTCCAAGCATTAGACGCCATGTTAACTTGGTCTGCAACACAAATAGATGCAAATTCAAAACCTCAAAATAAGAACATCAAATTTAACACTGCATAACTTTAACCATCAGAAAGGAAGGTCCATTTTACCTTAGTGAAGGCCAAGTTGATTAGATACGAAAGAAATTGTCCTCCAGTAATTAGCAACCCATTAGTGCTAACAAGAGCACCCCTGATTCTAGCAGGGGAAGCTTCTGATATGTAAAGAGGAGCAGTCATGGATGCCATTCCAACTCCAAAACCAACTATAAGTCTCCCAACGATAATGAATCCGGGAAAGGGAGCAACAGCCATGACAATTGCACCGACGAAGAACACAACATCGGCAACTAGGATCGAATTTTTCCGACCGAATTTATCGTTCATCCAACCACCAATTGCAGCACCCACAATAGCACCTGCTACAGCCATACTTACAATGGTTTCCTGTATATATAATCAATGAACCAAATAGTGAAAATAAGCACTATTTAAAAG

mRNA sequence

ATGGTGTCGCTTGAAGATTCTCATTCCGCTTCTAATCGGTTTCCTTTAAGTAGAAATTGTTACAGTCCGTCGTCTACCACTTCCAGTAAAATCAGCAGGAACATTGGCCGTTCTATGCGTACTATTCGCTCTAATTTCTTTCAAGATGATAATAGTTGCACCTTTAATGGCTCCGTGGCAGGAAAGTCCGGCTGTGTCTCTGAGAATCTCACTGATTCTGTCATCGACCTTAGGCTTGGCGAGTTGGCATCTCGTAGCCCTAAATGGCCTAATAAGCAATCCTCTGAACAGGAACAGGATTTTCTTGAACTTTCTCATGCTTTCAGTGATTTCTCTGCTTGTAGCAGTGATATTTCCGGAGAGTTGCAGAGACTTGCAAGTTTACCCTCTATGGCGGTTGTTCCTAAAAGAGAAGGTGAAGATGCTGACCCCGAGCCCGAACCGTGCTTAGGGTTTTTGCAGAGGGAGAATTTCTCCACTGAGATTATTGAGAGTATTTCGCCTGAAGATCTACAGCCCACTGTCAAGATCTGTATCGATGGGCTTCAGTCCTCGTCAATTGCAGTGAAGCGATCTGCGGCGGCTAAGCTGAGGCTTCTGGCGAAGAATCGGTCTGATAATCGGCTACTCATTGGGGAATCGGGCGCCGTTCCTGCTTTAATTCCTTTGCTTCGATCTACCGATCCATGGACACAAGAGCACGCTGTAACTGCCTTGCTGAATCTCTCACTCCATGAGTCTAATAAAGTTATAATCACGAATGCTGGAGCCATAAAGTCGCTGGTTTATGCGCTCAAAACCGGCACCGAAACTTCGAAACAGAATGCGGCTTGTGCTCTGATGAGCCTTGCTTTGTTGGAGGAAAACAAGACTTCGATTGGGGTTTGCGGGGCCATTCCACCACTGGTATCTTTACTACTGAATGGATCAAATAGAGGGAAGAAGGACGCGCTCACGACGCTCTACAAGCTTTGCTCGATCAAACCGAACAAGGAACGGGCCGTCACTGCTGGAGCTGTGAAGCCATTGGTGGCGCTAGTAGCGGAGCAAGGCACGGGTTTAGCGGAGAAGGCGATGGTGGTTTTGAGTAGCCTGGCTGGGATTCAAGAGGGGAAGGATGCGATTGTGGAAGAGGGTGGAATTGCTGCACTCGTGGAAGCAATTGAAGATGGGTCGGTGAAAGGGAAAGAGTTTGCAGTATTGACACTCTTGCAATTGTGTGTTGAGAGTGTGAGAAATAGAGGGTTGCTCGTAAGGGAAGGTGGAATTCCACCTCTGGTTGCACTTTCTCAGACTGGAAGCGTTCGAGCAAAGCATAAGGCAGAAACGCTTCTGGGTTATTTAAGAGAACCAAGACAAGAAGCATCCTCATCAAGTCCTTAACAGAAGGCCTGAGATGGGTTTGTTTAGAAGATGCTTCCTTCTGGCCTTTACTGCTCTTATGCTTCTTCCCTTGCTTAAGCAGCTCTTCAACCTCCTCGAATTGCAGTCCTTTAGTTTCAGGTACTAAGAAATATATACCAATCAATCCAAGCAATGAGAATCCTGCAAACAGCAGGAATGTTCCAGCTGCCCCAAGATTCTCCACCAATGTTAAAAATGTCTGACTCACTATCAGATTTGATACCCAATTTGAAACTGCAGCAATTCCTCCTCCAGTTCCTCTATATCTAAGTGGGTAAATCTCTGAGTTCAACACCCATGGTACTGTTCCCATTCCAGGTGCATATGAAATTATGTAAAGCCCCATCACCACCACTGCCAAGAAGCCGATCTTGCTTGGGCAGCCTTCTGTGAACCATACTCGTCGGTTCGACCGACATTCGCCCCTCACGTTCTTTGTCAAATCCAAGCATGCTCCAGGGAGATACTGATATTTCATTCAACATATAAACTTTGTGAGTTTCTGTTGATTGTTACGGTTGGAATGACTTCAATAAGGGAAAGAAAGCAAGTTGATGTTTTAATACCTCATTATCTCCATTGGCACAGAAACCGCA
TTTTTGTCTTAAACATGACATGCAGTTCCATGAAGATGCATCTGGAGCTGAGACATAGGCTGGACAGGTAGAGTTACTACCAAAGTGAGTAGATTCGAGGGCGTTGACGGGTGGAGCATGGCTAGCAGATTGGAAGAACACTCCGGCCAACACTACAAGGCAAGCGATAATTCCGAACATTGAGATGATCATAATTCGTCTTCTCCCGTATCTGTCAACTGTAAGCATGCTGACAACAGTGCCAGCAGCATTGAGAAATGATGTAACGAGAGATAACGCCATGGCTGTTGTATTAGAAGCATATCCAGCAAACTGCATGATGGTTGGACTGTAGTACATAACAGTGTTGATACCACAGAACTGCTGAGCTACTTGGACGATGATCCCGGCCCAGAGCCCTCTTCGAACGACTTGGCTACTCAGAGCACCTTTAACTTTGGCTATTATACTGCCATCTCCAATTGCCCCTTCTTCTGCCTTTTCTGTTTCCACAGATTCATGCAACAATCTCATCTCTTCATCAACTTGATTAGCAGGATATATCTTCTCCAGTATTGCTCTTGCTTCATCTACTTTGTCCTGATCATCAGAAAAATGAGTATATCAGTAACTTTGATCAATCACTTCTAATCCAAGGGAAACTTTACTCACAAACACTACTAATGCACGCTCATAAGGAAACACAAATAGACATGAGAAACTTTAGGAAGTGCTACTGACTTATGAAGTCGGTGGGTCAAACACCATTAAAATACTAAAACAGTTTACTATTATTGTTTGAGAAGCTCAAAAGGAAAAAAAAGTTGTAGCATTTTTCAAACATTACCCGTCTATAAAGCCACCTAGGAGACTCAGGCAGCGATAACATTAAAACAAACTGAACCACAGCAGGAAGTCCTGCTACTCCAAGCATTAGACGCCATGTTAACTTGGTCTGCAACACAAATAGATGCAAATTCAAAACCTCAAAATAAGAACATCAAATTTAACACTGCATAACTTTAACCATCAGAAAGGAAGGTCCATTTTACCTTAGTGAAGGCCAAGTTGATTAGATACGAAAGAAATTGTCCTCCAGTAATTAGCAACCCATTAGTGCTAACAAGAGCACCCCTGATTCTAGCAGGGGAAGCTTCTGATATGTAAAGAGGAGCAGTCATGGATGCCATTCCAACTCCAAAACCAACTATAAGTCTCCCAACGATAATGAATCCGGGAAAGGGAGCAACAGCCATGACAATTGCACCGACGAAGAACACAACATCGGCAACTAGGATCGAATTTTTCCGACCGAATTTATCGTTCATCCAACCACCAATTGCAGCACCCACAATAGCACCTGCTACAGCCATACTTACAATGGTTTCCTGTATATATAATCAATGAACCAAATAGTGAAAATAAGCACTATTTAAAAG

Coding sequence (CDS)

ATGGTGTCGCTTGAAGATTCTCATTCCGCTTCTAATCGGTTTCCTTTAAGTAGAAATTGTTACAGTCCGTCGTCTACCACTTCCAGTAAAATCAGCAGGAACATTGGCCGTTCTATGCGTACTATTCGCTCTAATTTCTTTCAAGATGATAATAGTTGCACCTTTAATGGCTCCGTGGCAGGAAAGTCCGGCTGTGTCTCTGAGAATCTCACTGATTCTGTCATCGACCTTAGGCTTGGCGAGTTGGCATCTCGTAGCCCTAAATGGCCTAATAAGCAATCCTCTGAACAGGAACAGGATTTTCTTGAACTTTCTCATGCTTTCAGTGATTTCTCTGCTTGTAGCAGTGATATTTCCGGAGAGTTGCAGAGACTTGCAAGTTTACCCTCTATGGCGGTTGTTCCTAAAAGAGAAGGTGAAGATGCTGACCCCGAGCCCGAACCGTGCTTAGGGTTTTTGCAGAGGGAGAATTTCTCCACTGAGATTATTGAGAGTATTTCGCCTGAAGATCTACAGCCCACTGTCAAGATCTGTATCGATGGGCTTCAGTCCTCGTCAATTGCAGTGAAGCGATCTGCGGCGGCTAAGCTGAGGCTTCTGGCGAAGAATCGGTCTGATAATCGGCTACTCATTGGGGAATCGGGCGCCGTTCCTGCTTTAATTCCTTTGCTTCGATCTACCGATCCATGGACACAAGAGCACGCTGTAACTGCCTTGCTGAATCTCTCACTCCATGAGTCTAATAAAGTTATAATCACGAATGCTGGAGCCATAAAGTCGCTGGTTTATGCGCTCAAAACCGGCACCGAAACTTCGAAACAGAATGCGGCTTGTGCTCTGATGAGCCTTGCTTTGTTGGAGGAAAACAAGACTTCGATTGGGGTTTGCGGGGCCATTCCACCACTGGTATCTTTACTACTGAATGGATCAAATAGAGGGAAGAAGGACGCGCTCACGACGCTCTACAAGCTTTGCTCGATCAAACCGAACAAGGAACGGGCCGTCACTGCTGGAGCTGTGAAGCCATTGGTGGCGCTAGTAGCGGAGCAAGGCACGGGTTTAGCGGAGAAGGCGATGGTGGTTTTGAGTAGCCTGGCTGGGATTCAAGAGGGGAAGGATGCGATTGTGGAAGAGGGTGGAATTGCTGCACTCGTGGAAGCAATTGAAGATGGGTCGGTGAAAGGGAAAGAGTTTGCAGTATTGACACTCTTGCAATTGTGTGTTGAGAGTGTGAGAAATAGAGGGTTGCTCGTAAGGGAAGGTGGAATTCCACCTCTGGTTGCACTTTCTCAGACTGGAAGCGTTCGAGCAAAGCATAAGGCAGAAACGCTTCTGGGTTATTTAAGAGAACCAAGACAAGAAGCATCCTCATCAAGTCCTTAA

Protein sequence

MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVAGKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISGELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICIDGLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALLNLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVREGGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP
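
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. This assumes the standard nuclear genetic code and reading frame 0, and uses only the first 30 and last 21 nucleotides of the CDS as test input; `translate` is a hypothetical helper written for this illustration rather than a function from a specific library.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical helper: translate in frame 0, stopping at the first stop codon.
def translate(cds):
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGTCGCTTGAAGATTCTCATTCCGCT"  # first 30 nt of the CDS
cds_end = "GCATCCTCATCAAGTCCTTAA"             # last 21 nt, ending in TAA
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first ten residues (MVSLEDSHSA) and the last six residues (ASSSSP) of the listed protein, with the terminal TAA read as the stop codon.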
Homology
BLAST of Clc01G21840 vs. NCBI nr
Match: XP_038881710.1 (U-box domain-containing protein 4 [Benincasa hispida])

HSP 1 Score: 849.0 bits (2192), Expect = 2.0e-242
Identity = 451/460 (98.04%), Positives = 456/460 (99.13%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVSLEDSHS SNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSLEDSHSTSNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
            KS C+SENLTDSVIDLRLGELASRSPKWP KQSSEQEQDFLELSHAFSDFSACSSDISG
Sbjct: 61  AKSSCISENLTDSVIDLRLGELASRSPKWP-KQSSEQEQDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQ+SSIAVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQASSIAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VL+SLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVES+RNRGLLVRE
Sbjct: 361 VLNSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESMRNRGLLVRE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 459
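
The percentages in each HSP header are simple ratios of matching columns to alignment length. A minimal sketch, using the counts reported for the Benincasa hispida hit above; `percent` is a hypothetical helper and the two-decimal rounding is an assumption chosen to match how the values are displayed.

```python
# Hypothetical helper: percentage of alignment columns that match.
def percent(matches, alignment_length):
    return round(100 * matches / alignment_length, 2)

identical = 451         # identical residues in the HSP
similar = 456           # identical plus conservative substitutions
alignment_length = 460  # aligned columns, including the single gap

print(f"{percent(identical, alignment_length):.2f}%")
print(f"{percent(similar, alignment_length):.2f}%")
```

The denominator is the alignment length rather than the query length, which is why a single gap column in the subject still counts toward the total of 460.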

BLAST of Clc01G21840 vs. NCBI nr
Match: XP_004134799.1 (U-box domain-containing protein 4 [Cucumis sativus] >KGN49052.1 hypothetical protein Csa_003608 [Cucumis sativus])

HSP 1 Score: 841.6 bits (2173), Expect = 3.1e-240
Identity = 448/460 (97.39%), Positives = 454/460 (98.70%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVSLEDSHS SNRFPL+RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSLEDSHSTSNRFPLTRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
            KSGCVSENLTDSVIDLRLGELASRSPKW +KQSSEQEQDFLELSHAFSDFSACSSDISG
Sbjct: 61  AKSGCVSENLTDSVIDLRLGELASRSPKW-SKQSSEQEQDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVVP+REGED DPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVPQREGEDGDPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSSIAVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGA+KSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAVKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGS+KGKEFAVLTLLQLCVESVRNRGLLV E
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSLKGKEFAVLTLLQLCVESVRNRGLLVSE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQ ASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQVASSSSP 459

BLAST of Clc01G21840 vs. NCBI nr
Match: XP_008440076.1 (PREDICTED: U-box domain-containing protein 4 [Cucumis melo] >TYK12996.1 U-box domain-containing protein 4 [Cucumis melo var. makuwa])

HSP 1 Score: 839.3 bits (2167), Expect = 1.5e-239
Identity = 448/460 (97.39%), Positives = 453/460 (98.48%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVSLEDSHS SNRFPL+RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSLEDSHSTSNRFPLTRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
            KS CVSENLTDSVIDLRLGELASRSPKW +KQSSEQEQDFLELSHAFSDFSACSSDISG
Sbjct: 61  AKSVCVSENLTDSVIDLRLGELASRSPKW-SKQSSEQEQDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVVP+REGED DPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVPQREGEDGDPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSSIAVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGA+KSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAVKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLV E
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVSE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQ ASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQVASSSSP 459

BLAST of Clc01G21840 vs. NCBI nr
Match: XP_023518630.1 (U-box domain-containing protein 4-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 833.6 bits (2152), Expect = 8.5e-238
Identity = 443/460 (96.30%), Positives = 451/460 (98.04%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVS+EDSHS+SNRFPL RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSIEDSHSSSNRFPLGRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
           G+SGCVSENLTDSVID+RLGELASRSPKW N QSSEQE+DFLELSHAFSDFSACSSDISG
Sbjct: 61  GESGCVSENLTDSVIDIRLGELASRSPKWAN-QSSEQEEDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVV KREG DA+ EPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVSKREGGDAETEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSS+AVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSVAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGAIKSLVY LKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAIKSLVYVLKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKER VTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERTVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPR EASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRHEASSSSP 459

BLAST of Clc01G21840 vs. NCBI nr
Match: KAG6594698.1 (U-box domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia] >KAG7026666.1 U-box domain-containing protein 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 832.8 bits (2150), Expect = 1.4e-237
Identity = 442/460 (96.09%), Positives = 451/460 (98.04%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVS+EDSHS+SNRFPL RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSIEDSHSSSNRFPLGRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
           GKSGC+SENLTDSVID+RLGELASRSPKW + QSSEQE+DFLELSHAFSDFSACSSDISG
Sbjct: 61  GKSGCISENLTDSVIDIRLGELASRSPKWAS-QSSEQEEDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVV KREG DA+ EPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVSKREGGDAETEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSS+AVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSVAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGAIKSLVY LKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAIKSLVYVLKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKER VTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERTVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPR EASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRHEASSSSP 459

BLAST of Clc01G21840 vs. ExPASy Swiss-Prot
Match: O22193 (U-box domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 GN=PUB4 PE=1 SV=3)

HSP 1 Score: 257.7 bits (657), Expect = 2.5e-67
Identity = 165/387 (42.64%), Positives = 239/387 (61.76%), Query Frame = 0

Query: 85  RSPKWPNKQSSEQ--EQDFLELSHAFSDFSACSSDISGELQRLASLPSMAVVPKREGEDA 144
           RSP   +  S+E+    D  E S   +  +  SSD SGE++      + +   +R+  D 
Sbjct: 440 RSPSATSTVSNEEFPRADANENSEESAHATPYSSDASGEIRSGPLAATTSAATRRDLSDF 499

Query: 145 DPEPEPCLGFLQR-----------ENFSTEIIESISPE------DLQPTVKICIDGLQSS 204
            P+      F+ R           E   + I+ + S E      +++  VK  ++ L+SS
Sbjct: 500 SPK------FMDRRTRGQFWRRPSERLGSRIVSAPSNETRRDLSEVETQVKKLVEELKSS 559

Query: 205 SIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALLNLSLH 264
           S+  +R A A+LRLLAK+  DNR++IG SGA+  L+ LL STD  TQE+AVTALLNLS++
Sbjct: 560 SLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYSTDSATQENAVTALLNLSIN 619

Query: 265 ESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIPPLVSL 324
           ++NK  I +AGAI+ L++ L+ G+  +K+N+A  L SL+++EENK  IG  GAI PLV L
Sbjct: 620 DNNKKAIADAGAIEPLIHVLENGSSEAKENSAATLFSLSVIEENKIKIGQSGAIGPLVDL 679

Query: 325 LLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMVVLSSL 384
           L NG+ RGKKDA T L+ L   + NK   V +GAV+ L+ L+ +   G+ +KA+ VL++L
Sbjct: 680 LGNGTPRGKKDAATALFNLSIHQENKAMIVQSGAVRYLIDLM-DPAAGMVDKAVAVLANL 739

Query: 385 AGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVREGGIPP 444
           A I EG++AI +EGGI  LVE +E GS +GKE A   LLQL   S R   ++++EG +PP
Sbjct: 740 ATIPEGRNAIGQEGGIPLLVEVVELGSARGKENAAAALLQLSTNSGRFCNMVLQEGAVPP 799

Query: 445 LVALSQTGSVRAKHKAETLLGYLREPR 453
           LVALSQ+G+ RA+ KA+ LL Y R  R
Sbjct: 800 LVALSQSGTPRAREKAQALLSYFRNQR 819

BLAST of Clc01G21840 vs. ExPASy Swiss-Prot
Match: Q5XEZ8 (U-box domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=PUB2 PE=2 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 2.2e-63
Identity = 144/291 (49.48%), Positives = 197/291 (67.70%), Query Frame = 0

Query: 164 ESISPEDLQPTVKICIDGLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPL 223
           E+ S   ++  VK  ID L+SSS+  +R A A++R+LA+N +DNR++I    A+P+L+ L
Sbjct: 412 ETGSSSSIETEVKKLIDDLKSSSLDTQREATARIRILARNSTDNRIVIARCEAIPSLVSL 471

Query: 224 LRSTDPWTQEHAVTALLNLSLHESNKVIITNAGAIKSLVYALKTG-TETSKQNAACALMS 283
           L STD   Q  AVT LLNLS++++NK +I  +GAI  L++ LKTG  E +K N+A  L S
Sbjct: 472 LYSTDERIQADAVTCLLNLSINDNNKSLIAESGAIVPLIHVLKTGYLEEAKANSAATLFS 531

Query: 284 LALLEENKTSIGVCGAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKP 343
           L+++EE KT IG  GAI PLV LL +GS  GKKDA T L+ L     NK + + AGAV+ 
Sbjct: 532 LSVIEEYKTEIGEAGAIEPLVDLLGSGSLSGKKDAATALFNLSIHHENKTKVIEAGAVRY 591

Query: 344 LVALVAEQGTGLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLT 403
           LV L+ +   G+ EKA+VVL++LA ++EGK AI EEGGI  LVE +E GS +GKE A   
Sbjct: 592 LVELM-DPAFGMVEKAVVVLANLATVREGKIAIGEEGGIPVLVEVVELGSARGKENATAA 651

Query: 404 LLQLCVESVRNRGLLVREGGIPPLVALSQTGSVRAKHKAETLLGYLREPRQ 454
           LLQLC  S +    ++REG IPPLVAL+++G+ R K KA+ LL Y +  RQ
Sbjct: 652 LLQLCTHSPKFCNNVIREGVIPPLVALTKSGTARGKEKAQNLLKYFKAHRQ 701

BLAST of Clc01G21840 vs. ExPASy Swiss-Prot
Match: Q8GWV5 (U-box domain-containing protein 3 OS=Arabidopsis thaliana OX=3702 GN=PUB3 PE=2 SV=2)

HSP 1 Score: 218.0 bits (554), Expect = 2.2e-55
Identity = 134/280 (47.86%), Positives = 187/280 (66.79%), Query Frame = 0

Query: 174 TVKICIDGLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQE 233
           T+K+ ++ L+S S  VK +AAA++R L  N  +NR+ IG  GA+  L+ LL S +  TQE
Sbjct: 474 TIKL-VEDLKSGSNKVKTAAAAEIRHLTINSIENRVHIGRCGAITPLLSLLYSEEKLTQE 533

Query: 234 HAVTALLNLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSI 293
           HAVTALLNLS+ E NK +I   GAI+ LV+ L TG + +K+N+A +L SL++L+ N+  I
Sbjct: 534 HAVTALLNLSISELNKAMIVEVGAIEPLVHVLNTGNDRAKENSAASLFSLSVLQVNRERI 593

Query: 294 GVC-GAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGT 353
           G    AI  LV+LL  G+ RGKKDA + L+ L     NK R V A AVK LV L+ +   
Sbjct: 594 GQSNAAIQALVNLLGKGTFRGKKDAASALFNLSITHDNKARIVQAKAVKYLVELL-DPDL 653

Query: 354 GLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVR 413
            + +KA+ +L++L+ + EG+ AIV EGGI  LVE ++ GS +GKE A   LLQLC+ S +
Sbjct: 654 EMVDKAVALLANLSAVGEGRQAIVREGGIPLLVETVDLGSQRGKENAASVLLQLCLNSPK 713

Query: 414 NRGLLVREGGIPPLVALSQTGSVRAKHKAETLLGYLREPR 453
              L+++EG IPPLVALSQ+G+ RAK KA+ LL + R  R
Sbjct: 714 FCTLVLQEGAIPPLVALSQSGTQRAKEKAQQLLSHFRNQR 751

BLAST of Clc01G21840 vs. ExPASy Swiss-Prot
Match: Q8VZ40 (U-box domain-containing protein 14 OS=Arabidopsis thaliana OX=3702 GN=PUB14 PE=1 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 1.1e-54
Identity = 128/302 (42.38%), Positives = 186/302 (61.59%), Query Frame = 0

Query: 159 STEIIESISPEDLQPTVKICIDGLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVP 218
           +T+I  S S +  +  V   ++ L + +   +R+AA +LRLLAK   DNR+ I E+GA+P
Sbjct: 331 TTKIGGSSSSDCDRTFVLSLLEKLANGTTEQQRAAAGELRLLAKRNVDNRVCIAEAGAIP 390

Query: 219 ALIPLLRSTDPWTQEHAVTALLNLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAAC 278
            L+ LL S DP TQEH+VTALLNLS++E NK  I +AGAI  +V  LK G+  +++NAA 
Sbjct: 391 LLVELLSSPDPRTQEHSVTALLNLSINEGNKGAIVDAGAITDIVEVLKNGSMEARENAAA 450

Query: 279 ALMSLALLEENKTSIGVCGAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAG 338
            L SL++++ENK +IG  GAI  L+SLL  G+ RGKKDA T ++ LC  + NK RAV  G
Sbjct: 451 TLFSLSVIDENKVAIGAAGAIQALISLLEEGTRRGKKDAATAIFNLCIYQGNKSRAVKGG 510

Query: 339 AVKPLVALVAEQGTGLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEF 398
            V PL  L+ + G G+ ++A+ +L+ L+  QEGK AI E   I  LVE I  GS + +E 
Sbjct: 511 IVDPLTRLLKDAGGGMVDEALAILAILSTNQEGKTAIAEAESIPVLVEIIRTGSPRNREN 570

Query: 399 AVLTLLQLCVESVRNRGLLVREGGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSS 458
           A   L  LC+ ++    +    G    L  L++ G+ RAK KA +LL  +++    A ++
Sbjct: 571 AAAILWYLCIGNIERLNVAREVGADVALKELTENGTDRAKRKAASLLELIQQTEGVAVTT 630

Query: 459 SP 461
            P
Sbjct: 631 VP 632

BLAST of Clc01G21840 vs. ExPASy Swiss-Prot
Match: Q5VRH9 (U-box domain-containing protein 12 OS=Oryza sativa subsp. japonica OX=39947 GN=PUB12 PE=2 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 1.8e-52
Identity = 122/274 (44.53%), Positives = 174/274 (63.50%), Query Frame = 0

Query: 182 LQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALLN 241
           L+S +   +R+AA ++RLLAK   +NR+ I E+GA+P L+ LL S+DP TQEHAVTALLN
Sbjct: 332 LRSGNQDEQRAAAGEIRLLAKRNVNNRICIAEAGAIPLLVNLLSSSDPRTQEHAVTALLN 391

Query: 242 LSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIPP 301
           LS+HE+NK  I ++ AI  +V  LKTG+  +++NAA  L SL++++ENK +IG  GAIPP
Sbjct: 392 LSIHENNKASIVDSHAIPKIVEVLKTGSMETRENAAATLFSLSVVDENKVTIGAAGAIPP 451

Query: 302 LVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMVV 361
           L++LL +GS RGKKDA T ++ LC  + NK RAV AG V  L+  + +   G+ ++A+ +
Sbjct: 452 LINLLCDGSPRGKKDAATAIFNLCIYQGNKVRAVKAGIVIHLMNFLVDPTGGMIDEALSL 511

Query: 362 LSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVREG 421
           LS LAG  EGK  I     I  LVE I+ GS + +E A   L  LC             G
Sbjct: 512 LSILAGNPEGKIVIARSEPIPPLVEVIKTGSPRNRENAAAILWLLCSADTEQTLAAKAAG 571

Query: 422 GIPPLVALSQTGSVRAKHKAETLLGYLREPRQEA 456
               L  LS+TG+ RAK KA ++L  + +  +++
Sbjct: 572 VEDALKELSETGTDRAKRKASSILELMHQANEDS 605

BLAST of Clc01G21840 vs. ExPASy TrEMBL
Match: A0A0A0KL34 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511670 PE=4 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 1.5e-240
Identity = 448/460 (97.39%), Positives = 454/460 (98.70%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVSLEDSHS SNRFPL+RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSLEDSHSTSNRFPLTRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
            KSGCVSENLTDSVIDLRLGELASRSPKW +KQSSEQEQDFLELSHAFSDFSACSSDISG
Sbjct: 61  AKSGCVSENLTDSVIDLRLGELASRSPKW-SKQSSEQEQDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVVP+REGED DPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVPQREGEDGDPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSSIAVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGA+KSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAVKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGS+KGKEFAVLTLLQLCVESVRNRGLLV E
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSLKGKEFAVLTLLQLCVESVRNRGLLVSE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQ ASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQVASSSSP 459

BLAST of Clc01G21840 vs. ExPASy TrEMBL
Match: A0A5D3CRG4 (U-box domain-containing protein 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G005940 PE=4 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 7.5e-240
Identity = 448/460 (97.39%), Positives = 453/460 (98.48%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVSLEDSHS SNRFPL+RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSLEDSHSTSNRFPLTRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
            KS CVSENLTDSVIDLRLGELASRSPKW +KQSSEQEQDFLELSHAFSDFSACSSDISG
Sbjct: 61  AKSVCVSENLTDSVIDLRLGELASRSPKW-SKQSSEQEQDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVVP+REGED DPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVPQREGEDGDPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSSIAVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGA+KSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAVKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLV E
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVSE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQ ASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQVASSSSP 459

BLAST of Clc01G21840 vs. ExPASy TrEMBL
Match: A0A1S3B089 (U-box domain-containing protein 4 OS=Cucumis melo OX=3656 GN=LOC103484662 PE=4 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 7.5e-240
Identity = 448/460 (97.39%), Positives = 453/460 (98.48%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVSLEDSHS SNRFPL+RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSLEDSHSTSNRFPLTRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
            KS CVSENLTDSVIDLRLGELASRSPKW +KQSSEQEQDFLELSHAFSDFSACSSDISG
Sbjct: 61  AKSVCVSENLTDSVIDLRLGELASRSPKW-SKQSSEQEQDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVVP+REGED DPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVPQREGEDGDPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSSIAVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGA+KSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAVKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLV E
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVSE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQ ASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQVASSSSP 459

BLAST of Clc01G21840 vs. ExPASy TrEMBL
Match: A0A6J1EFA6 (U-box domain-containing protein 4-like OS=Cucurbita moschata OX=3662 GN=LOC111433635 PE=4 SV=1)

HSP 1 Score: 831.6 bits (2147), Expect = 1.6e-237
Identity = 441/460 (95.87%), Positives = 451/460 (98.04%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVS+EDSHS+SNRFPL RNCYSP+STTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSIEDSHSSSNRFPLGRNCYSPTSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
           GKSGC+SENLTDSVID+RLGELASRSPKW + QSSEQE+DFLELSHAFSDFSACSSDISG
Sbjct: 61  GKSGCISENLTDSVIDIRLGELASRSPKWAS-QSSEQEEDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVV KREG DA+ EPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVSKREGGDAETEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSS+AVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSVAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGAIKSLVY LKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAIKSLVYVLKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKER VTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERTVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE
Sbjct: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPR EASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRHEASSSSP 459

BLAST of Clc01G21840 vs. ExPASy TrEMBL
Match: A0A6J1KQR1 (U-box domain-containing protein 4-like OS=Cucurbita maxima OX=3661 GN=LOC111496781 PE=4 SV=1)

HSP 1 Score: 829.7 bits (2142), Expect = 5.9e-237
Identity = 441/460 (95.87%), Positives = 450/460 (97.83%), Query Frame = 0

Query: 1   MVSLEDSHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60
           MVS+EDSHS+SNRFPL RNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA
Sbjct: 1   MVSIEDSHSSSNRFPLGRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQDDNSCTFNGSVA 60

Query: 61  GKSGCVSENLTDSVIDLRLGELASRSPKWPNKQSSEQEQDFLELSHAFSDFSACSSDISG 120
           GKSGCVSENLTDSVID+RLGELASRSPKW + QSSEQE+DFLELSHAFSDFSACSSDISG
Sbjct: 61  GKSGCVSENLTDSVIDIRLGELASRSPKWAS-QSSEQEEDFLELSHAFSDFSACSSDISG 120

Query: 121 ELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180
           ELQRLASLPSMAVV KREG DA+ EPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID
Sbjct: 121 ELQRLASLPSMAVVSKREGGDAETEPEPCLGFLQRENFSTEIIESISPEDLQPTVKICID 180

Query: 181 GLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240
           GLQSSS+AVKRSAAAKLRLLAKNRSDNR+LIGESGAVPALIPLLRSTDPWTQEHAVTALL
Sbjct: 181 GLQSSSVAVKRSAAAKLRLLAKNRSDNRVLIGESGAVPALIPLLRSTDPWTQEHAVTALL 240

Query: 241 NLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300
           NLSLHESNKVIITNAGAIKSLVY LKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP
Sbjct: 241 NLSLHESNKVIITNAGAIKSLVYVLKTGTETSKQNAACALMSLALLEENKTSIGVCGAIP 300

Query: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMV 360
           PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKER VTAGAVKPLVALVAEQGTGLAEKAMV
Sbjct: 301 PLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERTVTAGAVKPLVALVAEQGTGLAEKAMV 360

Query: 361 VLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVRE 420
           VLS LAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCV+SVRNRGLLVRE
Sbjct: 361 VLSILAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVDSVRNRGLLVRE 420

Query: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRQEASSSSP 461
           GGIPPLVALSQTGSVRAKHKAETLLGYLREPR EASSSSP
Sbjct: 421 GGIPPLVALSQTGSVRAKHKAETLLGYLREPRHEASSSSP 459

BLAST of Clc01G21840 vs. TAIR 10
Match: AT4G16490.1 (ARM repeat superfamily protein)

HSP 1 Score: 634.0 bits (1634), Expect = 9.2e-182
Identity = 352/472 (74.58%), Positives = 396/472 (83.90%), Query Frame = 0

Query: 1   MVSLED--SHSASNRFPLSRNCYSPSSTTSSKISRNIGRSMRTIRSNFFQD-DNSCTFNG 60
           MVS+E+  SHS S RFPL+ + Y  SS +++++ R  GRSMRT+RSNF+Q  D SC+F G
Sbjct: 1   MVSVEEPLSHSNSTRFPLTTDFYGSSSPSAARLHRQAGRSMRTVRSNFYQSGDQSCSFVG 60

Query: 61  SVAGKSGCVSENLTDSVIDLRLGELASRSPKWPNKQ-SSEQEQDFLELSHAFSDFSACSS 120
           S+  KS   SE L+DSVID+RLGELA ++    N   SS +E+ FL++S AFSDFSACSS
Sbjct: 61  SIGDKSEYASEFLSDSVIDMRLGELALKNSNSLNSNASSMKEEAFLDISQAFSDFSACSS 120

Query: 121 DISGELQRLASLPSMAVVPKREG------EDADPEPEPCLGFLQRENFSTEIIESISPED 180
           DISGELQRLA LPS        G       D + E EPCLGFLQRENFSTEIIE ISPED
Sbjct: 121 DISGELQRLACLPSPEADRNESGGDNEAEHDPELEREPCLGFLQRENFSTEIIECISPED 180

Query: 181 LQPTVKICIDGLQSSSIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPW 240
           LQPTVK+CIDGL+SSS+A+KRSAAAKLRLLAKNR+DNR+LIGESGA+ ALIPLLR  DPW
Sbjct: 181 LQPTVKLCIDGLRSSSVAIKRSAAAKLRLLAKNRADNRVLIGESGAIQALIPLLRCNDPW 240

Query: 241 TQEHAVTALLNLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENK 300
           TQEHAVTALLNLSLH+ NK +I   GAIKSLV+ LKTGTETSKQNAACAL+SLALLEENK
Sbjct: 241 TQEHAVTALLNLSLHDQNKAVIAAGGAIKSLVWVLKTGTETSKQNAACALLSLALLEENK 300

Query: 301 TSIGVCGAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQ 360
            SIG CGAIPPLVSLLLNGS RGKKDALTTLYKLC+++ NKERAVTAGAVKPLV LVAE+
Sbjct: 301 GSIGACGAIPPLVSLLLNGSCRGKKDALTTLYKLCTLQQNKERAVTAGAVKPLVDLVAEE 360

Query: 361 GTGLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVES 420
           GTG+AEKAMVVLSSLA I +GK+AIVEEGGIAALVEAIEDGSVKGKEFA+LTLLQLC +S
Sbjct: 361 GTGMAEKAMVVLSSLAAIDDGKEAIVEEGGIAALVEAIEDGSVKGKEFAILTLLQLCSDS 420

Query: 421 VRNRGLLVREGGIPPLVALSQTG--SVRAKHKAETLLGYLREPRQEASSSSP 461
           VRNRGLLVREG IPPLV LSQ+G  SVRAK KAE LLGYLREPR+EASSSSP
Sbjct: 421 VRNRGLLVREGAIPPLVGLSQSGSVSVRAKRKAERLLGYLREPRKEASSSSP 472

BLAST of Clc01G21840 vs. TAIR 10
Match: AT3G01400.1 (ARM repeat superfamily protein)

HSP 1 Score: 264.6 bits (675), Expect = 1.5e-70
Identity = 165/352 (46.88%), Positives = 218/352 (61.93%), Query Frame = 0

Query: 102 LELSHAFSDFSACSSDISGELQRLASLPSMAVVPKREGEDADPEPEPCLGFLQRENFSTE 161
           L L+   S FS C+SD SGE                              F    + S  
Sbjct: 21  LSLNDDSSAFSDCNSDRSGE------------------------------FPTASSESRR 80

Query: 162 IIESISPEDLQPTVKICIDGLQSS-SIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPAL 221
           ++ S + E+    +   +  L SS SI  ++ AA ++RLL+KN+ +NR+ I ++GA+  L
Sbjct: 81  LLLSCASENSDDLINHLVSHLDSSYSIDEQKQAAMEIRLLSKNKPENRIKIAKAGAIKPL 140

Query: 222 IPLLRSTDPWTQEHAVTALLNLSLHESNKVIITNAGAIKSLVYALKTGTETSKQNAACAL 281
           I L+ S+D   QE+ VTA+LNLSL + NK  I ++GAIK LV ALK GT T+K+NAACAL
Sbjct: 141 ISLISSSDLQLQEYGVTAILNLSLCDENKESIASSGAIKPLVRALKMGTPTAKENAACAL 200

Query: 282 MSLALLEENKTSIGVCGAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAV 341
           + L+ +EENK +IG  GAIP LV+LL  G  R KKDA T LY LCS K NK RAV +G +
Sbjct: 201 LRLSQIEENKVAIGRSGAIPLLVNLLETGGFRAKKDASTALYSLCSAKENKIRAVQSGIM 260

Query: 342 KPLVALVAEQGTGLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAV 401
           KPLV L+A+ G+ + +K+  V+S L  + E K AIVEEGG+  LVE +E G+ + KE AV
Sbjct: 261 KPLVELMADFGSNMVDKSAFVMSLLMSVPESKPAIVEEGGVPVLVEIVEVGTQRQKEMAV 320

Query: 402 LTLLQLCVESVRNRGLLVREGGIPPLVALSQTGSVRAKHKAETLLGYLREPR 453
             LLQLC ESV  R ++ REG IPPLVALSQ G+ RAK KAE L+  LR+PR
Sbjct: 321 SILLQLCEESVVYRTMVAREGAIPPLVALSQAGTSRAKQKAEALIELLRQPR 342

BLAST of Clc01G21840 vs. TAIR 10
Match: AT2G23140.1 (RING/U-box superfamily protein with ARM repeat domain)

HSP 1 Score: 257.7 bits (657), Expect = 1.8e-68
Identity = 165/387 (42.64%), Positives = 239/387 (61.76%), Query Frame = 0

Query: 85  RSPKWPNKQSSEQ--EQDFLELSHAFSDFSACSSDISGELQRLASLPSMAVVPKREGEDA 144
           RSP   +  S+E+    D  E S   +  +  SSD SGE++      + +   +R+  D 
Sbjct: 443 RSPSATSTVSNEEFPRADANENSEESAHATPYSSDASGEIRSGPLAATTSAATRRDLSDF 502

Query: 145 DPEPEPCLGFLQR-----------ENFSTEIIESISPE------DLQPTVKICIDGLQSS 204
            P+      F+ R           E   + I+ + S E      +++  VK  ++ L+SS
Sbjct: 503 SPK------FMDRRTRGQFWRRPSERLGSRIVSAPSNETRRDLSEVETQVKKLVEELKSS 562

Query: 205 SIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALLNLSLH 264
           S+  +R A A+LRLLAK+  DNR++IG SGA+  L+ LL STD  TQE+AVTALLNLS++
Sbjct: 563 SLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYSTDSATQENAVTALLNLSIN 622

Query: 265 ESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIPPLVSL 324
           ++NK  I +AGAI+ L++ L+ G+  +K+N+A  L SL+++EENK  IG  GAI PLV L
Sbjct: 623 DNNKKAIADAGAIEPLIHVLENGSSEAKENSAATLFSLSVIEENKIKIGQSGAIGPLVDL 682

Query: 325 LLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMVVLSSL 384
           L NG+ RGKKDA T L+ L   + NK   V +GAV+ L+ L+ +   G+ +KA+ VL++L
Sbjct: 683 LGNGTPRGKKDAATALFNLSIHQENKAMIVQSGAVRYLIDLM-DPAAGMVDKAVAVLANL 742

Query: 385 AGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVREGGIPP 444
           A I EG++AI +EGGI  LVE +E GS +GKE A   LLQL   S R   ++++EG +PP
Sbjct: 743 ATIPEGRNAIGQEGGIPLLVEVVELGSARGKENAAAALLQLSTNSGRFCNMVLQEGAVPP 802

Query: 445 LVALSQTGSVRAKHKAETLLGYLREPR 453
           LVALSQ+G+ RA+ KA+ LL Y R  R
Sbjct: 803 LVALSQSGTPRAREKAQALLSYFRNQR 822

BLAST of Clc01G21840 vs. TAIR 10
Match: AT2G23140.2 (RING/U-box superfamily protein with ARM repeat domain)

HSP 1 Score: 257.7 bits (657), Expect = 1.8e-68
Identity = 165/387 (42.64%), Positives = 239/387 (61.76%), Query Frame = 0

Query: 85  RSPKWPNKQSSEQ--EQDFLELSHAFSDFSACSSDISGELQRLASLPSMAVVPKREGEDA 144
           RSP   +  S+E+    D  E S   +  +  SSD SGE++      + +   +R+  D 
Sbjct: 440 RSPSATSTVSNEEFPRADANENSEESAHATPYSSDASGEIRSGPLAATTSAATRRDLSDF 499

Query: 145 DPEPEPCLGFLQR-----------ENFSTEIIESISPE------DLQPTVKICIDGLQSS 204
            P+      F+ R           E   + I+ + S E      +++  VK  ++ L+SS
Sbjct: 500 SPK------FMDRRTRGQFWRRPSERLGSRIVSAPSNETRRDLSEVETQVKKLVEELKSS 559

Query: 205 SIAVKRSAAAKLRLLAKNRSDNRLLIGESGAVPALIPLLRSTDPWTQEHAVTALLNLSLH 264
           S+  +R A A+LRLLAK+  DNR++IG SGA+  L+ LL STD  TQE+AVTALLNLS++
Sbjct: 560 SLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYSTDSATQENAVTALLNLSIN 619

Query: 265 ESNKVIITNAGAIKSLVYALKTGTETSKQNAACALMSLALLEENKTSIGVCGAIPPLVSL 324
           ++NK  I +AGAI+ L++ L+ G+  +K+N+A  L SL+++EENK  IG  GAI PLV L
Sbjct: 620 DNNKKAIADAGAIEPLIHVLENGSSEAKENSAATLFSLSVIEENKIKIGQSGAIGPLVDL 679

Query: 325 LLNGSNRGKKDALTTLYKLCSIKPNKERAVTAGAVKPLVALVAEQGTGLAEKAMVVLSSL 384
           L NG+ RGKKDA T L+ L   + NK   V +GAV+ L+ L+ +   G+ +KA+ VL++L
Sbjct: 680 LGNGTPRGKKDAATALFNLSIHQENKAMIVQSGAVRYLIDLM-DPAAGMVDKAVAVLANL 739

Query: 385 AGIQEGKDAIVEEGGIAALVEAIEDGSVKGKEFAVLTLLQLCVESVRNRGLLVREGGIPP 444
           A I EG++AI +EGGI  LVE +E GS +GKE A   LLQL   S R   ++++EG +PP
Sbjct: 740 ATIPEGRNAIGQEGGIPLLVEVVELGSARGKENAAAALLQLSTNSGRFCNMVLQEGAVPP 799

Query: 445 LVALSQTGSVRAKHKAETLLGYLREPR 453
           LVALSQ+G+ RA+ KA+ LL Y R  R
Sbjct: 800 LVALSQSGTPRAREKAQALLSYFRNQR 819

BLAST of Clc01G21840 vs. TAIR 10
Match: AT5G58680.1 (ARM repeat superfamily protein)

HSP 1 Score: 249.2 bits (635), Expect = 6.4e-66
Identity = 157/365 (43.01%), Positives = 222/365 (60.82%), Query Frame = 0

Query: 101 FLELSHAFSD---------FSACSSDISGELQRLASLPSMAVVPKREGEDADPEPEPCLG 160
           F  + H+FSD         FS C+SDIS E     S                        
Sbjct: 7   FTYMDHSFSDISLNFDSSAFSDCNSDISCEFPTTTS------------------------ 66

Query: 161 FLQRENFSTEIIESISPEDLQPTVKICIDGLQ-SSSIAVKRSAAAKLRLLAKNRSDNRLL 220
               E+   ++  S + ++    ++  I  L+ SSSI  ++ AA ++RLL+KN+ +NR+ 
Sbjct: 67  ----ESRQRKLFLSCAVDNSDDVIRNLITHLESSSSIEEQKQAAMEIRLLSKNKPENRIK 126

Query: 221 IGESGAVPALIPLLRSTDPWTQEHAVTALLNLSLHESNKVIITNAGAIKSLVYALKTGTE 280
           + ++GA+  L+ L+ S+D   QE+ VTA+LNLSL + NK +I ++GA+K LV AL+ GT 
Sbjct: 127 LAKAGAIKPLVSLISSSDLQLQEYGVTAVLNLSLCDENKEMIVSSGAVKPLVNALRLGTP 186

Query: 281 TSKQNAACALMSLALLEENKTSIGVCGAIPPLVSLLLNGSNRGKKDALTTLYKLCSIKPN 340
           T+K+NAACAL+ L+ +EENK +IG  GAIP LV+LL NG  R KKDA T LY LCS   N
Sbjct: 187 TTKENAACALLRLSQVEENKITIGRSGAIPLLVNLLENGGFRAKKDASTALYSLCSTNEN 246

Query: 341 KERAVTAGAVKPLVALVAEQGTGLAEKAMVVLSSLAGIQEGKDAIVEEGGIAALVEAIED 400
           K RAV +G +KPLV L+ +  + + +K+  V++ L    E K A+VEEGG+  LVE +E 
Sbjct: 247 KTRAVESGIMKPLVELMIDFESDMVDKSAFVMNLLMSAPESKPAVVEEGGVPVLVEIVEA 306

Query: 401 GSVKGKEFAVLTLLQLCVESVRNRGLLVREGGIPPLVALSQTGSVR-AKHKAETLLGYLR 455
           G+ + KE +V  LLQLC ESV  R ++ REG +PPLVALSQ  + R AK KAE L+  LR
Sbjct: 307 GTQRQKEISVSILLQLCEESVVYRTMVAREGAVPPLVALSQGSASRGAKVKAEALIELLR 343
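
Each HSP above reports identity and positives as a count over the aligned length plus a percentage. The sketch below recomputes those percentages from the counts as a consistency check. It is a minimal illustration, not part of the database tooling: the function name `parse_hsp_stats` and the dictionary keys are hypothetical, and the regular expression matches the label for positives loosely so that spelling variants of that word are accepted.

```python
import re

# Matches the summary line printed under each "HSP n Score" header on this page.
# "Posi?tives" is deliberately loose so spelling variants of the label still match.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, id_len, id_pct, positive, pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "aligned_length": int(id_len),
        "identity_pct_reported": float(id_pct),
        "positives_pct_reported": float(pos_pct),
        # Percent = matching columns / alignment length (gaps included).
        "identity_pct": round(100 * int(identical) / int(id_len), 2),
        "positives_pct": round(100 * int(positive) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 448/460 (97.39%), Positives = 453/460 (98.48%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

The denominator is the alignment length, which includes gap columns. This is why the TAIR 10 hit AT4G16490.1 is reported over 472 positions.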

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038881710.1  2.0e-242  98.04         U-box domain-containing protein 4 [Benincasa hispida]
XP_004134799.1  3.1e-240  97.39         U-box domain-containing protein 4 [Cucumis sativus] >KGN49052.1 hypothetical pro...
XP_008440076.1  1.5e-239  97.39         PREDICTED: U-box domain-containing protein 4 [Cucumis melo] >TYK12996.1 U-box do...
XP_023518630.1  8.5e-238  96.30         U-box domain-containing protein 4-like [Cucurbita pepo subsp. pepo]
KAG6594698.1    1.4e-237  96.09         U-box domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity (%)  Description
O22193          2.5e-67   42.64         U-box domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 GN=PUB4 PE=1 S...
Q5XEZ8          2.2e-63   49.48         U-box domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=PUB2 PE=2 S...
Q8GWV5          2.2e-55   47.86         U-box domain-containing protein 3 OS=Arabidopsis thaliana OX=3702 GN=PUB3 PE=2 S...
Q8VZ40          1.1e-54   42.38         U-box domain-containing protein 14 OS=Arabidopsis thaliana OX=3702 GN=PUB14 PE=1...
Q5VRH9          1.8e-52   44.53         U-box domain-containing protein 12 OS=Oryza sativa subsp. japonica OX=39947 GN=P...

Match Name      E-value   Identity (%)  Description
A0A0A0KL34      1.5e-240  97.39         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511670 PE=4 SV=1
A0A5D3CRG4      7.5e-240  97.39         U-box domain-containing protein 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3B089      7.5e-240  97.39         U-box domain-containing protein 4 OS=Cucumis melo OX=3656 GN=LOC103484662 PE=4 S...
A0A6J1EFA6      1.6e-237  95.87         U-box domain-containing protein 4-like OS=Cucurbita moschata OX=3662 GN=LOC11143...
A0A6J1KQR1      5.9e-237  95.87         U-box domain-containing protein 4-like OS=Cucurbita maxima OX=3661 GN=LOC1114967...

Match Name      E-value   Identity (%)  Description
AT4G16490.1     9.2e-182  74.58         ARM repeat superfamily protein
AT3G01400.1     1.5e-70   46.88         ARM repeat superfamily protein
AT2G23140.1     1.8e-68   42.64         RING/U-box superfamily protein with ARM repeat domain
AT2G23140.2     1.8e-68   42.64         RING/U-box superfamily protein with ARM repeat domain
AT5G58680.1     6.4e-66   43.01         ARM repeat superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description         Source       Source Term      Source Description                      Alignment
IPR000225  Armadillo               SMART        SM00185          arm_5                                   coord: 327..367; e-value: 210.0; score: 2.9
IPR000225  Armadillo               SMART        SM00185          arm_5                                   coord: 245..285; e-value: 46.0; score: 8.1
IPR000225  Armadillo               SMART        SM00185          arm_5                                   coord: 204..244; e-value: 2.8E-4; score: 30.2
IPR000225  Armadillo               SMART        SM00185          arm_5                                   coord: 410..450; e-value: 19.0; score: 11.1
IPR000225  Armadillo               SMART        SM00185          arm_5                                   coord: 286..326; e-value: 0.83; score: 18.7
IPR000225  Armadillo               SMART        SM00185          arm_5                                   coord: 368..408; e-value: 8.1; score: 13.9
IPR000225  Armadillo               PFAM         PF00514          Arm                                     coord: 205..243; e-value: 2.0E-8; score: 34.0
IPR000225  Armadillo               PFAM         PF00514          Arm                                     coord: 288..326; e-value: 9.7E-6; score: 25.5
IPR000225  Armadillo               PROSITE      PS50176          ARM_REPEAT                              coord: 215..257; score: 12.5474
IPR000225  Armadillo               PROSITE      PS50176          ARM_REPEAT                              coord: 338..380; score: 9.9574
IPR011989  Armadillo-like helical  GENE3D       1.25.10.10                                               coord: 159..458; e-value: 8.9E-53; score: 181.4
None       No IPR available        PANTHER      PTHR23315        U BOX DOMAIN-CONTAINING                 coord: 2..456
None       No IPR available        PANTHER      PTHR23315:SF304  U-BOX DOMAIN-CONTAINING PROTEIN 4-LIKE  coord: 2..456
IPR016024  Armadillo-type fold     SUPERFAMILY  48371            ARM repeat                              coord: 172..448

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc01G21840.1  Clc01G21840.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0005634      nucleus
molecular_function  GO:0016874      ligase activity
molecular_function  GO:0005515      protein binding