CmUC10G199930 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC10G199930
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: GATA zinc finger protein
Location: CmU531Chr10: 32012695 .. 32018645 (+)
RNA-Seq Expression: CmUC10G199930
Synteny: CmUC10G199930
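
The Location value above packs a sequence ID, a start, an end and a strand into one string. A minimal sketch of how it could be split and the genomic span computed, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); `Location`, `parse_location` and `span_length` are hypothetical helper names.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "CmU531Chr10: 32012695 .. 32018645 (+)".
LOCATION_RE = re.compile(r"^([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CmU531Chr10: 32012695 .. 32018645 (+)")
print(loc.seqid, loc.strand, span_length(loc))  # CmU531Chr10 + 5951
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.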
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACGTAATTACGAAACGACATGTCACACTTACAACGCCAGACCTAAAACCCTTATCCGACGTCGTTTTACACTTCCAGATCCCTTTCTTCTCTCTCCGAACCATCGTCTTCTTCCTCTTCGAAGGGATCAGATTTCTCACGGCGGCACTTCCCTCTCTTCACTCCCTCCGTTCCCGCCTCCGAAATGTCTGATCAATCTCCTCCTCCTCCTCCTCCCGCCGGCGAGTCCGAGTCCCGTCCCGTCGGTGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACTGTCCTCGGCCTACTCCTCTCAAAACCTCCCGATATTCCCCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCACCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTTAACCTCCAGATCCTCGACCTCGCCGCCACCGCAAGCGCTATCGCCTCTCATCCCGACGCCAAGGATCCTTCCGTCTCCGATTTCCACAAGATCCACGAAACGGAGATCAACCGCGCCACGTGGTTCGATCCAAACCATCCGTCGTACTCCGACACCGACGTGATGTTCGCTACCGTCTACACCGTATGCGACAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCGGCACTGTTGAGAGAACTGCTAGTGCTTGTGGCGGCCGGAGGGGAAATAGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATCGGATTAGGGATTGAAAATCTAATCCCTAACGGTAAAGCGAATAAATCTCTGTGGGCGCGTGGATTAGATATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAAAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACTCAGAAACTTCTCGCTGTAAGTTTTTTCTCTTTCTCCGCATAATTTCCTTCATTTTCTCGGGAAAATGTTGGGAGTTTTTTCTTTTTAATTTTACGTAGGTATATCAATATTCAAATTTCAACAATGATTAATGCAAGTGGGACCCTCTTCTATCTCAAAACCCAATCAGATTTCACTTTCAATCTAAATGATAATATTCATATACAATTTTCCCATTTTCTGTTTTTTTTATGTACCAAAATATTTCTAGCAGTTTTTAATCTTTACTTTTGTATTTATATTGTTTATAAACCCTGAATTTAATCCTTTTGTGTATTATCATTTATCAACCTAGGTTTTCATTCTTCATAAAACATTCAAATTCTTTCATTTTAATACTTAGTGAATTCACGATTCTTTTAAGACATAATTAACAGATATTTAAAGGAGAAAAAAAGTATAAGAAAAGTTTATAATTTCTACTAAATTGTAAAAAATTTAGAGGATGAATTAGAAACTTTCATAATTAAAAAAAAAAAAACACTATTGGTCCTTAACTTTCATGGAGTACCAATATAATTTCTAAACTTTAGTTTGTAACTATTTAGTTATTATCAATTTACTAATAATTTAATCTTGAACTTTAGTACGTAACAATTTAGTCATTTTAATTTTAAATTTATTAGATATCAACTTTATGATGTATACTTTGTATTTATAAGAATATTGAATTTCTAATTAATTTATCGATTTATTTATATAAAAAATCTTAAGATTAATTTCTACAAAAGTGATTAAACCATTACAAATTTCAAAGTATGAGAAGTGAATTGTTACACAGTAAATTTTAAGGCCTAAATTATTACAAAATTAATAATACATGAACTAAATCATTACAAAAATAAAAATTAGAGACTTAATTGTTACTTTACCTAAAAGTGATTTTTTAGCATGATAAAAGGATTAAAATCTTTTAGGAATTTTTTTAAAAAATTTATTTATAGGGAAATAACATTTGTATTTGCTAATTTCAACTAAACAATCATATT
TGTAAATCTTAGAAGAAAAAAAAAGTTGTCTTTTCAATATATTTATTGAGTAAAAGTCTTTTCAATATTTTAATTTATACTCGACACAATGTTTGATTATTATTATTTCAAAAGGGAATGAGTTTTATTACGGTGTGGTTAGGTGTAAGGAGTCAGGTTTGGTAGTGTGAGATGTCCACGCTGTCACTGTCGTTTGGCTTTGTACGTTTGATGTAAAGTGCGGGTGGTGGGGACGACATGTAGCCTCGAAATAATAAATATTTATTTTCTATTATAAAGCCTTCCATTGTTATAAACATGCTTTTATATGGTATTATAATTATAAAAAAATACTCTTTTTCTTTTTTTAGGTATGTTTTCAAATATATCAAAATAAACTAAATTATCTATAAAATATAAAAAATTTATATTGTCTATTTGCAATAAACTACAATAAATTTCTAGCGATAGAATACAATAATTTATATATTTGTAAATAATTTGATATTTTTATTTATTTATAATAAATTTTCTTCCTTTCATTATTTTGTTTGTTTGTGTTTAATGATGTACTTTTTCTTTTCAACTAAACAAGGGAAAAAAAACTTATATTATTTTGTTTGTTTGTGTTTAATGATGTACTTTTTCTTTTCAACTTAAAAAAAAACTTATGCCCTCAAAATTTTGAGAAATAATAGGACAAATTTCGTTGGCAATTTATATTTTATTTTATATTTTTAGATTTATTGTGTTAAACTACTACTTAGATTATATTTTTAAAAATTGTTTTTGCATTTTGTGCTAAACAATGCAAATGAAATTAACATATTTTTTTTATTGTATCTATATTTTAAATTTTAAATTTAGAATTATATTAAATAAAGATTAAAATATTTATTAATACAATATTTTTGTAAATTCAATTTATTTGTTTTAAAATTAAAATGTCTATAAATATTACAAAATAATAGTTATACAATTTTTTTAACATAACATGTTCACAAAGAAATTACTAATAGTTTGTTGTCAACTAATATTTATTGGATAACTATTATTAGTTTTATGATATATAAATATATTATCAAATACATTTTAAATAATTGTTCAAGTGAGATATTCAATTTTTGAATAAAAAATACTTGAAATAGACTTTTAGTTTTTCTTAAGGTGCTTTTTTAGTTACCAAAATTTTAAGAATAGGTTTTAAATGTGATTGAAGTTTTTTTTAAAATTTTTATTTTATTTTATTTTTTTTTCATTTTAAACAATAGTATGATATTTTGAATCAGAAATTGTTTTAAAAAAAATTATTTATTTCTCTTCCACTTTCTTTTCTGTTTCTTTCTTTAGTTCAATATGCGAACAAATGGCTACCAGTTTCAAACTACACGTTTCCATTTAACTATTTCTAGTAAATTAATTTTATTTTTAACCGAATATAATAGTTATGCTTTTTTTCTAACCAAAAAAATAATAATAATAATTAAAAAAAAAATAATAACACTATCGAAGAAGAAGAGAGAGGATATGATGAGAGGAGAGAAGGAGAATGTTGGAAGGGAGGGAGAGGAGATAAATTTTTAAAACTATTTTTTAAATTTAAAATGTAAAAAATTGTTTAAAGGATCATATGAACCTATTCTTAAAGGTTAGACTAACTAAAAATTATTTTAAAAATTAAGTCTTCAAAGTTTTTGAAAATCAAGCTTATTTCTTCTCAATTTTTTTTTGATGATTTTTATCCCTTTTACGTAAAAGAGTTTAATTTTTTACCAAATTTTAAAAATAATAATATATTTGTTTTTAGTTTTAAAAAATATGATTTGGTTTTTAAAAAATTAGTAGAAAGTAGACAACCAAATGTATAAATTTTATGGTAAAAATTAGTGCTTATAAACTTAATTTTAAAAAACAAAAACTAAGGCCACGTTTTATTGGGCCGATAAACACTTGTTCCATCTCTAAATTCTTTAAATCTATTTTTTAGTTTTTACCAATGTTTTTAAAAACCAAACCTAATT
TTGAAAAAAAAATAGTTTTAGAAAAACTTTTTGTATTTTTTAAATTTGACTAAGAATACAACTCTTCTACTCAACTACTCAAGTAAGAGTGATATCATTGTCATAAATTGAGAGAAAGTTGGCTTAAATTAAAAAAAAAAAACGAAATAGTTATCAAACAAAACCTAAATAATAGGTTATTAAACCATGCTTGTATTTTCTCTGATTTGATATGAATGTTTGAAGGGCTGCAAATCGAGAGGCATTAAGCTCTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAGGACCTTCCTCCTCACCAGAGGGAGAAATATGCTGTTGTTACTCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACAAGCCACCATTTAGGTAAATTCAACTCCCCTTCTCTTTTCTGGACTCGCTCCTATATCGCTCCATTAGCTCTACGTTTTTTCAACTTTCTCTGCATATTAAGAACTACGGATACTTCAGTTTTTCAAAAGAGTATAAAGTCTCACTATGTACGTGTGTCGTATATGTGCCTTGTATTTATATCCATGTTTCTTAACTAGTGTGTCATGTGAACATGTCAACCTCCCACGTTGGATGATTTGTCTTGTAGATTGTTAAATCGCCAATGTAATCAACCCAAATGCTTAAGTTAATGGGTGATGATAAATTTAATTAATCACTTGACATTTCCCGTTCACTTGTAGGTTTGAAAATTTGTAAAAGTCCTAACAAGTGGAAAGCATAGTTAGTTGGTGAGAAAATGACATTACATAGGTTTGAACGCACACTCTGATACCATATTAAAACACTAATCAACTCAATAAAGCTTAAGTTGATGGGTTATGGTGAATTTAATTATATCAACACTTTAACATGATTTAACCTTATAACTATTGTAGTTTTTGCTAACTTAAGGATAGTTAGGATGGTTAAGGCATATATGTCCACAACTAAGAGGTCAGAAGTTCAAATATTTCATACTCGGTCATTGAACTAAGGAAACGGTTGTAATTTTTCATGTAAAAATACTCTAAATTCTCGATTGATAACAATTTGGCGTATTTTGTTTTTGAAAATTAAGCTTATAAACAATTTGGTGTGTTTTGTTTTTGAAAATTAAGCTTACAAATGCAACTTTCGCCTATAAGTTTATTTGCTTTGTTATCTTTTCCAAAACTCAAAACAAAAAGTCAAACAGTTATCGAACATGACCTTGATTCTCAGTTGAACAAACAAAAAAGCATATAAATTGAATAGAAGATAATCATTGGGATTAAGTTTTTTAGTTATTAATTATTTTTCTTTTTTACATTTCAGGATTCTATCACTCTGCCATCCTCAACACACATGACATATCAGCTGAAGATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAGCAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCCTCTTCGTCCATGAGAACTGCCCTGATCTCGGTCTTTGAAGATCCCATCATCGAAACTTCCGGTCCTGCGCAGCAGCACATCGGCCTACACGACTACATCGGTTGTGCCTCTGCACACGGTGTTGGGCCATCGATCGCCTTCTTTGATATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTCTCTCGAGAACAAATGAACCAAATTTTTGATGAGATGAAGAAAATTCTGGTGAATAATGCCATGGAAGTAGTTGAAGGTTAAGAATTTAGTTGTTGGACTGGGATCGAAGTTCCAGGTAGCTCGATAAGGGGAAGATCCACGATATATTAGCGTGGACAATTATCTTTTATATTCAAAATAAAGTCACGAACACTTATATTCAAAAAAGTTAATATCATATAACTGTGAAGTA

mRNA sequence

ACGTAATTACGAAACGACATGTCACACTTACAACGCCAGACCTAAAACCCTTATCCGACGTCGTTTTACACTTCCAGATCCCTTTCTTCTCTCTCCGAACCATCGTCTTCTTCCTCTTCGAAGGGATCAGATTTCTCACGGCGGCACTTCCCTCTCTTCACTCCCTCCGTTCCCGCCTCCGAAATGTCTGATCAATCTCCTCCTCCTCCTCCTCCCGCCGGCGAGTCCGAGTCCCGTCCCGTCGGTGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACTGTCCTCGGCCTACTCCTCTCAAAACCTCCCGATATTCCCCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCACCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTTAACCTCCAGATCCTCGACCTCGCCGCCACCGCAAGCGCTATCGCCTCTCATCCCGACGCCAAGGATCCTTCCGTCTCCGATTTCCACAAGATCCACGAAACGGAGATCAACCGCGCCACGTGGTTCGATCCAAACCATCCGTCGTACTCCGACACCGACGTGATGTTCGCTACCGTCTACACCGTATGCGACAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCGGCACTGTTGAGAGAACTGCTAGTGCTTGTGGCGGCCGGAGGGGAAATAGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATCGGATTAGGGATTGAAAATCTAATCCCTAACGGTAAAGCGAATAAATCTCTGTGGGCGCGTGGATTAGATATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAAAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACTCAGAAACTTCTCGCTGGCTGCAAATCGAGAGGCATTAAGCTCTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAGGACCTTCCTCCTCACCAGAGGGAGAAATATGCTGTTGTTACTCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACAAGCCACCATTTAGGATTCTATCACTCTGCCATCCTCAACACACATGACATATCAGCTGAAGATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAGCAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCCTCTTCGTCCATGAGAACTGCCCTGATCTCGGTCTTTGAAGATCCCATCATCGAAACTTCCGGTCCTGCGCAGCAGCACATCGGCCTACACGACTACATCGGTTGTGCCTCTGCACACGGTGTTGGGCCATCGATCGCCTTCTTTGATATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTCTCTCGAGAACAAATGAACCAAATTTTTGATGAGATGAAGAAAATTCTGGTGAATAATGCCATGGAAGTAGTTGAAGGTTAAGAATTTAGTTGTTGGACTGGGATCGAAGTTCCAGGTAGCTCGATAAGGGGAAGATCCACGATATATTAGCGTGGACAATTATCTTTTATATTCAAAATAAAGTCACGAACACTTATATTCAAAAAAGTTAATATCATATAACTGTGAAGTA

Coding sequence (CDS)

ATGTCTGATCAATCTCCTCCTCCTCCTCCTCCCGCCGGCGAGTCCGAGTCCCGTCCCGTCGGTGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACTGTCCTCGGCCTACTCCTCTCAAAACCTCCCGATATTCCCCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCACCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTTAACCTCCAGATCCTCGACCTCGCCGCCACCGCAAGCGCTATCGCCTCTCATCCCGACGCCAAGGATCCTTCCGTCTCCGATTTCCACAAGATCCACGAAACGGAGATCAACCGCGCCACGTGGTTCGATCCAAACCATCCGTCGTACTCCGACACCGACGTGATGTTCGCTACCGTCTACACCGTATGCGACAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCGGCACTGTTGAGAGAACTGCTAGTGCTTGTGGCGGCCGGAGGGGAAATAGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATCGGATTAGGGATTGAAAATCTAATCCCTAACGGTAAAGCGAATAAATCTCTGTGGGCGCGTGGATTAGATATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAAAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACTCAGAAACTTCTCGCTGGCTGCAAATCGAGAGGCATTAAGCTCTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAGGACCTTCCTCCTCACCAGAGGGAGAAATATGCTGTTGTTACTCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACAAGCCACCATTTAGGATTCTATCACTCTGCCATCCTCAACACACATGACATATCAGCTGAAGATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAGCAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCCTCTTCGTCCATGAGAACTGCCCTGATCTCGGTCTTTGAAGATCCCATCATCGAAACTTCCGGTCCTGCGCAGCAGCACATCGGCCTACACGACTACATCGGTTGTGCCTCTGCACACGGTGTTGGGCCATCGATCGCCTTCTTTGATATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTCTCTCGAGAACAAATGAACCAAATTTTTGATGAGATGAAGAAAATTCTGGTGAATAATGCCATGGAAGTAGTTGAAGGTTAA

Protein sequence

MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG
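
The CDS and protein sequences above can be spot-checked against each other by translating short fragments with the standard genetic code. The sketch below does this for the first 15 and last 18 nucleotides of the CDS; the last fragment is assumed to be in frame because it ends with the stop codon. `translate` and the fragment variable names are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("fragment length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


cds_start = "ATGTCTGATCAATCT"    # first 15 nt of the CDS above
cds_end = "GAAGTAGTTGAAGGTTAA"   # last 18 nt of the CDS above

print(translate(cds_start))  # MSDQS
print(translate(cds_end))    # EVVEG*
```

The two outputs correspond to the first five and last five residues of the protein sequence, followed by the stop codon.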
Homology
BLAST of CmUC10G199930 vs. NCBI nr
Match: XP_038905440.1 (uncharacterized protein LOC120091472 [Benincasa hispida])

HSP 1 Score: 899.4 bits (2323), Expect = 1.3e-257
Identity = 444/478 (92.89%), Positives = 459/478 (96.03%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS   PPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1   MSDQS---PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDPSRRDFSFLI PSPPL+LQILDL ATA AIASHPDA DPSVSDFHK
Sbjct: 61  QNLHPILRSKIHHDPSRRDFSFLISPSPPLHLQILDLPATARAIASHPDANDPSVSDFHK 120

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EIN ATWFDPNHPSYSDTDVMFATVYT+ DSQWA+FLRLHTATCDRAAAAALLREL
Sbjct: 121 IHEQEINSATWFDPNHPSYSDTDVMFATVYTMSDSQWAIFLRLHTATCDRAAAAALLREL 180

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGGGFEIGDNGEIGLGIE+LIPNGKANKSLWARGLDMLGYSLNSFRLANL
Sbjct: 181 LVLTATGGEIEGGGFEIGDNGEIGLGIEDLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQR 300
           EFKDANSERFSQMIRLKMNS ETQKLLAGCK RG+KLCGALAAAGL+ATRCSKDLP HQ+
Sbjct: 241 EFKDANSERFSQMIRLKMNSHETQKLLAGCKLRGVKLCGALAAAGLLATRCSKDLPLHQK 300

Query: 301 EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDS 360
           EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFS+SNAKD+
Sbjct: 301 EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSYSNAKDN 360

Query: 361 NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGC 420
           NKHFSDMSDLNFLMCKAIENP LTPSSSMRTALISVFEDPII+TSGP QQ++GLHDY GC
Sbjct: 361 NKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIIDTSGPEQQNLGLHDYSGC 420

Query: 421 ASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
           ASAHGVGPSIA FDMIRDGQLDCACVYPSPLFSR+QMN+IFDEMKKILVNNAMEVVEG
Sbjct: 421 ASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILVNNAMEVVEG 475
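
The percentages in the HSP summary line are the reported counts divided by the reported alignment length. A small sketch that reproduces the two figures for this first hit; the counts 444, 459 and 478 are copied from the summary line as is, and `percent` is a hypothetical helper name.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches / alignment_length as a percentage, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


identity = percent(444, 478)    # identical positions
positives = percent(459, 478)   # identical or conservatively substituted positions
print(identity, positives)      # 92.89 96.03
```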

BLAST of CmUC10G199930 vs. NCBI nr
Match: XP_008442855.1 (PREDICTED: uncharacterized protein LOC103486623 [Cucumis melo])

HSP 1 Score: 869.8 bits (2246), Expect = 1.1e-248
Identity = 437/479 (91.23%), Positives = 450/479 (93.95%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS  PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1   MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT  AIASHPDA DPSVSDFHK
Sbjct: 61  QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EINR  WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
           EFKD NSERFSQMIRLKMNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 300

Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
           +EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360

Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
           +NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP  Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420

Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
            ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 477

BLAST of CmUC10G199930 vs. NCBI nr
Match: XP_004149221.3 (uncharacterized protein LOC101208906 [Cucumis sativus] >KGN59146.1 hypothetical protein Csa_000929 [Cucumis sativus])

HSP 1 Score: 868.6 bits (2243), Expect = 2.5e-248
Identity = 433/479 (90.40%), Positives = 452/479 (94.36%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS PPPP   ES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 50  MSDQSLPPPP--AESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 109

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDPSRRDFSFLIPPSPPL+LQILDLAATA AIASHPDA DPSVSDFHK
Sbjct: 110 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDADDPSVSDFHK 169

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EINR  WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 170 IHEHEINRVMWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 229

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGGGFE GDNGE+GLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 230 LVLAAGGGEIEGGGFETGDNGEVGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 289

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
           EFKD N+ERFSQMIRL+MNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 290 EFKDPNTERFSQMIRLRMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 349

Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
           +EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDT+WEVA RCYFSFSNAKD
Sbjct: 350 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTVWEVASRCYFSFSNAKD 409

Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
           +NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIE SGP QQ++GLHDYIG
Sbjct: 410 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIEISGPEQQNLGLHDYIG 469

Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
            ASAHGVGPSIA FD IRDGQLD ACVYPSPLFSR+QMN+IFD+MKKILVN+++EV EG
Sbjct: 470 YASAHGVGPSIAIFDTIRDGQLDSACVYPSPLFSRDQMNRIFDDMKKILVNSSVEVNEG 526

BLAST of CmUC10G199930 vs. NCBI nr
Match: KAA0043871.1 (GATA zinc finger domain-containing protein isoform 1 [Cucumis melo var. makuwa] >TYK25265.1 GATA zinc finger domain-containing protein isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 826.6 bits (2134), Expect = 1.1e-235
Identity = 420/479 (87.68%), Positives = 433/479 (90.40%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS  PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1   MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT  AIASHPDA DPSVSDFHK
Sbjct: 61  QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EINR  WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
           EFKD NSERFSQMI                  RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMI------------------RGIKLCGALAAAGLIATRCSKDHLPPYQ 300

Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
           +EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360

Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
           +NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP  Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420

Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
            ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 459

BLAST of CmUC10G199930 vs. NCBI nr
Match: XP_022982938.1 (uncharacterized protein LOC111481632 [Cucurbita maxima])

HSP 1 Score: 823.5 bits (2126), Expect = 9.1e-235
Identity = 405/465 (87.10%), Positives = 431/465 (92.69%), Query Frame = 0

Query: 14  ESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHH 73
           E   RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDI HLQ+SLH LQNLHPILRSKIHH
Sbjct: 4   EIRIRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHH 63

Query: 74  DPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFD 133
           DPSRRDFSFLIPPSP ++LQILDLAA A AIASHPDA DPS+SDFHKI E EINRA W +
Sbjct: 64  DPSRRDFSFLIPPSPSIHLQILDLAAAARAIASHPDADDPSISDFHKILEHEINRAKWLN 123

Query: 134 PNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGG 193
           P+HPSYSDTDVMFATVY V D QWAVFL LHTA CDR AAA+LLRELLVL AA G+IEGG
Sbjct: 124 PSHPSYSDTDVMFATVYAVSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAEGKIEGG 183

Query: 194 GFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQM 253
           GF+IGDNGEIG GIE+LIP+GKA+K LWARGLDMLGYSLNSFR ANLEFKDA+SERFSQM
Sbjct: 184 GFKIGDNGEIGSGIEDLIPSGKAHKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQM 243

Query: 254 IRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRS 313
           IRLK+NSDETQKLLAGCKSRGIKLCGAL AAGLIATRCSKDLPPHQ EKYAVVTL DCRS
Sbjct: 244 IRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPHQTEKYAVVTLIDCRS 303

Query: 314 LLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFL 373
           LLDPPLT+HHLGFYHSAILNTHDISAEDTLWEV++RCYFSFSNAKD+NKHF+DMSDLNFL
Sbjct: 304 LLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVSERCYFSFSNAKDNNKHFTDMSDLNFL 363

Query: 374 MCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFF 433
           M KAIENP LTPSSSMRTALIS FEDPII TS PAQQH+G+ DYIGCASAHGVGPSIA F
Sbjct: 364 MGKAIENPGLTPSSSMRTALISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALF 423

Query: 434 DMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
           D+IRDGQLDCACVYPSPLFSR+QMNQ+FDEMKKILV++AMEVVEG
Sbjct: 424 DLIRDGQLDCACVYPSPLFSRDQMNQLFDEMKKILVSSAMEVVEG 468

BLAST of CmUC10G199930 vs. ExPASy TrEMBL
Match: A0A1S3B7G8 (uncharacterized protein LOC103486623 OS=Cucumis melo OX=3656 GN=LOC103486623 PE=4 SV=1)

HSP 1 Score: 869.8 bits (2246), Expect = 5.4e-249
Identity = 437/479 (91.23%), Positives = 450/479 (93.95%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS  PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1   MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT  AIASHPDA DPSVSDFHK
Sbjct: 61  QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EINR  WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
           EFKD NSERFSQMIRLKMNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 300

Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
           +EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360

Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
           +NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP  Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420

Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
            ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 477

BLAST of CmUC10G199930 vs. ExPASy TrEMBL
Match: A0A0A0LGP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G777540 PE=4 SV=1)

HSP 1 Score: 868.6 bits (2243), Expect = 1.2e-248
Identity = 433/479 (90.40%), Positives = 452/479 (94.36%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS PPPP   ES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 50  MSDQSLPPPP--AESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 109

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDPSRRDFSFLIPPSPPL+LQILDLAATA AIASHPDA DPSVSDFHK
Sbjct: 110 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDADDPSVSDFHK 169

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EINR  WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 170 IHEHEINRVMWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 229

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGGGFE GDNGE+GLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 230 LVLAAGGGEIEGGGFETGDNGEVGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 289

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
           EFKD N+ERFSQMIRL+MNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 290 EFKDPNTERFSQMIRLRMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 349

Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
           +EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDT+WEVA RCYFSFSNAKD
Sbjct: 350 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTVWEVASRCYFSFSNAKD 409

Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
           +NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIE SGP QQ++GLHDYIG
Sbjct: 410 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIEISGPEQQNLGLHDYIG 469

Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
            ASAHGVGPSIA FD IRDGQLD ACVYPSPLFSR+QMN+IFD+MKKILVN+++EV EG
Sbjct: 470 YASAHGVGPSIAIFDTIRDGQLDSACVYPSPLFSRDQMNRIFDDMKKILVNSSVEVNEG 526

BLAST of CmUC10G199930 vs. ExPASy TrEMBL
Match: A0A5A7TQM7 (GATA zinc finger domain-containing protein isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003980 PE=4 SV=1)

HSP 1 Score: 826.6 bits (2134), Expect = 5.2e-236
Identity = 420/479 (87.68%), Positives = 433/479 (90.40%), Query Frame = 0

Query: 1   MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
           MSDQS  PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1   MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60

Query: 61  QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
           QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT  AIASHPDA DPSVSDFHK
Sbjct: 61  QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120

Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
           IHE EINR  WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180

Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
           LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240

Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
           EFKD NSERFSQMI                  RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMI------------------RGIKLCGALAAAGLIATRCSKDHLPPYQ 300

Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
           +EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360

Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
           +NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP  Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420

Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
            ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 459

BLAST of CmUC10G199930 vs. ExPASy TrEMBL
Match: A0A6J1J6C5 (uncharacterized protein LOC111481632 OS=Cucurbita maxima OX=3661 GN=LOC111481632 PE=4 SV=1)

HSP 1 Score: 823.5 bits (2126), Expect = 4.4e-235
Identity = 405/465 (87.10%), Positives = 431/465 (92.69%), Query Frame = 0

Query: 14  ESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHH 73
           E   RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDI HLQ+SLH LQNLHPILRSKIHH
Sbjct: 4   EIRIRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHH 63

Query: 74  DPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFD 133
           DPSRRDFSFLIPPSP ++LQILDLAA A AIASHPDA DPS+SDFHKI E EINRA W +
Sbjct: 64  DPSRRDFSFLIPPSPSIHLQILDLAAAARAIASHPDADDPSISDFHKILEHEINRAKWLN 123

Query: 134 PNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGG 193
           P+HPSYSDTDVMFATVY V D QWAVFL LHTA CDR AAA+LLRELLVL AA G+IEGG
Sbjct: 124 PSHPSYSDTDVMFATVYAVSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAEGKIEGG 183

Query: 194 GFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQM 253
           GF+IGDNGEIG GIE+LIP+GKA+K LWARGLDMLGYSLNSFR ANLEFKDA+SERFSQM
Sbjct: 184 GFKIGDNGEIGSGIEDLIPSGKAHKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQM 243

Query: 254 IRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRS 313
           IRLK+NSDETQKLLAGCKSRGIKLCGAL AAGLIATRCSKDLPPHQ EKYAVVTL DCRS
Sbjct: 244 IRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPHQTEKYAVVTLIDCRS 303

Query: 314 LLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFL 373
           LLDPPLT+HHLGFYHSAILNTHDISAEDTLWEV++RCYFSFSNAKD+NKHF+DMSDLNFL
Sbjct: 304 LLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVSERCYFSFSNAKDNNKHFTDMSDLNFL 363

Query: 374 MCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFF 433
           M KAIENP LTPSSSMRTALIS FEDPII TS PAQQH+G+ DYIGCASAHGVGPSIA F
Sbjct: 364 MGKAIENPGLTPSSSMRTALISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALF 423

Query: 434 DMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
           D+IRDGQLDCACVYPSPLFSR+QMNQ+FDEMKKILV++AMEVVEG
Sbjct: 424 DLIRDGQLDCACVYPSPLFSRDQMNQLFDEMKKILVSSAMEVVEG 468

BLAST of CmUC10G199930 vs. ExPASy TrEMBL
Match: A0A6J1F424 (uncharacterized protein LOC111442199 OS=Cucurbita moschata OX=3662 GN=LOC111442199 PE=4 SV=1)

HSP 1 Score: 815.8 bits (2106), Expect = 9.2e-233
Identity = 402/465 (86.45%), Positives = 428/465 (92.04%), Query Frame = 0

Query: 14  ESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHH 73
           E + RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDI HLQ+SLH LQNLHPILRSKIHH
Sbjct: 4   EIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHH 63

Query: 74  DPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFD 133
           DPSRRDFS LIPPSP ++LQILDLAA A AIASHPDA +PS+SDFHKI E EINRA W +
Sbjct: 64  DPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDADNPSISDFHKILEHEINRAKWLN 123

Query: 134 PNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGG 193
           P+HPSYSDTDVMFATVY + D QWAVFL LHTA CDR AAA+LLRELLVL AAGG+IEGG
Sbjct: 124 PSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGG 183

Query: 194 GFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQM 253
           GFEIGDNGEIG GIE+LIP+GKA K LWARGLDMLGYSLNSFR ANLEFKDA+SERFSQM
Sbjct: 184 GFEIGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQM 243

Query: 254 IRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRS 313
           IRLK+NSDETQKLLAGCKSRGIKLCGAL AAGLIATRCSKDLPP+Q EKYAVVTL DCRS
Sbjct: 244 IRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLIDCRS 303

Query: 314 LLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFL 373
           LLDPPLT+HHLGFYHSAILNTHDISAEDTLWEVA+RCYFSFSN K++NKHF+DMSDLNFL
Sbjct: 304 LLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFL 363

Query: 374 MCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFF 433
           M KAIENP LTPSSSMRTALIS FEDPII TS PAQQH+G+ DYIGCASAHGVGPSIA F
Sbjct: 364 MGKAIENPGLTPSSSMRTALISAFEDPIIYTSDPAQQHLGIFDYIGCASAHGVGPSIALF 423

Query: 434 DMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
           DMIRDGQLDCACVYPSPLFSR+QMN +FDEMKKILV+ AMEVVEG
Sbjct: 424 DMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVEG 468

BLAST of CmUC10G199930 vs. TAIR 10
Match: AT3G52610.1 (unknown protein; Has 68 Blast hits to 67 proteins in 21 species: Archae - 0; Bacteria - 11; Metazoa - 0; Fungi - 0; Plants - 55; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 473.0 bits (1216), Expect = 2.8e-133
Identity = 252/472 (53.39%), Positives = 331/472 (70.13%), Query Frame = 0

Query: 9   PPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILR 68
           P    +S +RPVGGTE+SWCRA+ GGTG  V+ LLLS+ P + +LQ++L  LQ  HP LR
Sbjct: 4   PNRVPKSMTRPVGGTEYSWCRAIDGGTGIAVIALLLSRTPKLQNLQNTLDKLQIYHPTLR 63

Query: 69  SKIHHDPSRRDFSFLIPPSPPLNLQI--LDLAATASAIASHPDAKDPSVSDFHKIHETEI 128
           S I  D S   FSF++  +   +++I   D  +TA  I    D+ DP       I E E+
Sbjct: 64  SNIRFDASANSFSFVVTSAADSHVEIHPFDSVSTAQIIR---DSDDPCADPHRIILEHEM 123

Query: 129 NRATWFDPNHPSYSDTDVMFATVYTVCD--SQWAVFLRLHTATCDRAAAAALLRELLVLV 188
           N+ TW +P+    S++ V   ++Y + D   Q  +  RL+TA  DR AA  LLRE +   
Sbjct: 124 NKNTWINPHRWIKSESRVFIVSLYDLTDDGEQRILTFRLNTAAVDRTAAVTLLREFMKET 183

Query: 189 AAGGEIEGGGFEIGDNGEIGLG--IENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEF 248
           AA G    G         +GLG  IE LIP+GK +K  WARG+D+LGYSLN+FR +NL F
Sbjct: 184 AADG-FGNGPVVAATETAVGLGKAIEELIPSGKGDKPFWARGIDVLGYSLNAFRFSNLNF 243

Query: 249 KDA-NSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQRE 308
            DA NS R SQ++RLK++ D+T KL+AGCK+RG+KL  ALA++ LIA   SK+LPP+Q E
Sbjct: 244 VDAENSNRRSQLVRLKLDRDQTLKLVAGCKARGLKLWAALASSALIAAYSSKNLPPYQGE 303

Query: 309 KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSN 368
           KYAVVTL+DCRS+L+PPLTS+  GFYH+ IL+THD++ E+ LW++AKRCY SF+++K+SN
Sbjct: 304 KYAVVTLSDCRSILEPPLTSNDFGFYHAGILHTHDLTGEEKLWDLAKRCYDSFTSSKNSN 363

Query: 369 KHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPII-ETSGPAQQHIGLHDYIGC 428
           K F+DMSDLNFLMCKAIENP+LTPSSS+RTA IS+FEDP+I E+  P    +G+ DYIGC
Sbjct: 364 KQFTDMSDLNFLMCKAIENPNLTPSSSLRTAFISIFEDPVIDESPEPELASLGVQDYIGC 423

Query: 429 ASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNA 473
           AS HGVGPS+A FD +RDG+LDCA VYPSPL SREQM+ +   MK IL+  +
Sbjct: 424 ASIHGVGPSVAVFDALRDGKLDCAFVYPSPLHSREQMDGLIQHMKTILLEGS 471

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity  Description
XP_038905440.1  1.3e-257  92.89     uncharacterized protein LOC120091472 [Benincasa hispida]
XP_008442855.1  1.1e-248  91.23     PREDICTED: uncharacterized protein LOC103486623 [Cucumis melo]
XP_004149221.3  2.5e-248  90.40     uncharacterized protein LOC101208906 [Cucumis sativus] >KGN59146.1 hypothetical ...
KAA0043871.1    1.1e-235  87.68     GATA zinc finger domain-containing protein isoform 1 [Cucumis melo var. makuwa] ...
XP_022982938.1  9.1e-235  87.10     uncharacterized protein LOC111481632 [Cucurbita maxima]

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A1S3B7G8      5.4e-249  91.23     uncharacterized protein LOC103486623 OS=Cucumis melo OX=3656 GN=LOC103486623 PE=...
A0A0A0LGP2      1.2e-248  90.40     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G777540 PE=4 SV=1
A0A5A7TQM7      5.2e-236  87.68     GATA zinc finger domain-containing protein isoform 1 OS=Cucumis melo var. makuwa...
A0A6J1J6C5      4.4e-235  87.10     uncharacterized protein LOC111481632 OS=Cucurbita maxima OX=3661 GN=LOC111481632...
A0A6J1F424      9.2e-233  86.45     uncharacterized protein LOC111442199 OS=Cucurbita moschata OX=3662 GN=LOC1114421...

TAIR 10
Match Name      E-value   Identity  Description
AT3G52610.1     2.8e-133  53.39     unknown protein; Has 68 Blast hits to 67 proteins in 21 species: Archae - 0; Bac...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                            Source       Source Term    Source Description                                    Alignment
IPR023213  Chloramphenicol acetyltransferase-like domain superfamily  GENE3D       3.30.559.10    -                                                     coord: 37..191; e-value: 2.5E-9; score: 39.2
None       No IPR available                                           GENE3D       3.30.559.30    Nonribosomal peptide synthetase, condensation domain  coord: 226..349; e-value: 3.3E-5; score: 25.9
None       No IPR available                                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                   coord: 1..31
None       No IPR available                                           MOBIDB_LITE  mobidb-lite    disorder_prediction                                   coord: 1..15
None       No IPR available                                           PANTHER      PTHR34375      GATA ZINC FINGER PROTEIN-RELATED                      coord: 1..472
None       No IPR available                                           PANTHER      PTHR34375:SF2  GATA ZINC FINGER PROTEIN                              coord: 1..472
None       No IPR available                                           SUPERFAMILY  52777          CoA-dependent acyltransferases                        coord: 41..186
None       No IPR available                                           SUPERFAMILY  52777          CoA-dependent acyltransferases                        coord: 226..464

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CmUC10G199930.1  CmUC10G199930.1  mRNA