Cla97C10G187750 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C10G187750
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: acidic leucine-rich nuclear phosphoprotein 32 family member B-like
Location: Cla97Chr10: 3756582 .. 3757339 (-)
RNA-Seq Expression: Cla97C10G187750
Synteny: Cla97C10G187750
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGATGAACGAGAAAGTGAATGGAAAAATGGGGATGAGAATTATGGTGGGATTGGCACCAATTTTGGTGGTGGTGATTTTGTGGGTAACATCAAAATCATCTCAATTTTTTCAAAGTAGTAATCTTTTTGGAAGTCCTCATTTCATATTCTCAATCTTCAACTCCATAATTCTTCTCATAACTGTGAGAAACCATGATGAGCCACAACCTCATGAACAATTTGAGATAATGGAACTTATTCTTCAATTATCTCCCTATCATAAGCCTCATCAAGACATTGAATTTAGTGCTCGTGAAGATACAAGAGAAGATGACGGTGAAAATGCCGAATATGGAAACCAAAATGATCAGAAAGATCAAGACGATAGTGATGAAGATGAAATCGAGCAATGGAATGAGAAAGAAGATGAAGATCATGAAGATAACGAGCGTGACGACGATCAAGATGAGGTCGAGGAGTGGGACGAGAAAGAGGAGGAAGAAGAAGAATTGGAGAAGAGGATTGAAGAATTCATAGCAAAGGTGAACAAGAGATGGAGAGAAGAGAAGTTGAGAGATCATTTTCTTATTCAAATTTGTTCTAGTAACTATGTTAGTAGTACCTGAAATTAAATTATTAATTTCTTTTTCCATTCATAATTTTGTTACTCATTTATATTATATGATCAAGCTTTTTTTTTGGTACTAATATTATATGATGAAGCTCTATTTTGCATGATGCCAAAAGAATATATATACATATAAAGATTTCTATTCCTT

mRNA sequence

TGATGAACGAGAAAGTGAATGGAAAAATGGGGATGAGAATTATGGTGGGATTGGCACCAATTTTGGTGGTGGTGATTTTGTGGGTAACATCAAAATCATCTCAATTTTTTCAAAGTAGTAATCTTTTTGGAAGTCCTCATTTCATATTCTCAATCTTCAACTCCATAATTCTTCTCATAACTGTGAGAAACCATGATGAGCCACAACCTCATGAACAATTTGAGATAATGGAACTTATTCTTCAATTATCTCCCTATCATAAGCCTCATCAAGACATTGAATTTAGTGCTCGTGAAGATACAAGAGAAGATGACGGTGAAAATGCCGAATATGGAAACCAAAATGATCAGAAAGATCAAGACGATAGTGATGAAGATGAAATCGAGCAATGGAATGAGAAAGAAGATGAAGATCATGAAGATAACGAGCGTGACGACGATCAAGATGAGGTCGAGGAGTGGGACGAGAAAGAGGAGGAAGAAGAAGAATTGGAGAAGAGGATTGAAGAATTCATAGCAAAGGTGAACAAGAGATGGAGAGAAGAGAAGTTGAGAGATCATTTTCTTATTCAAATTTGTTCTAGTAACTATGTTAGTAGTACCTGAAATTAAATTATTAATTTCTTTTTCCATTCATAATTTTGTTACTCATTTATATTATATGATCAAGCTTTTTTTTTGGTACTAATATTATATGATGAAGCTCTATTTTGCATGATGCCAAAAGAATATATATACATATAAAGATTTCTATTCCTT

Coding sequence (CDS)

ATGAACGAGAAAGTGAATGGAAAAATGGGGATGAGAATTATGGTGGGATTGGCACCAATTTTGGTGGTGGTGATTTTGTGGGTAACATCAAAATCATCTCAATTTTTTCAAAGTAGTAATCTTTTTGGAAGTCCTCATTTCATATTCTCAATCTTCAACTCCATAATTCTTCTCATAACTGTGAGAAACCATGATGAGCCACAACCTCATGAACAATTTGAGATAATGGAACTTATTCTTCAATTATCTCCCTATCATAAGCCTCATCAAGACATTGAATTTAGTGCTCGTGAAGATACAAGAGAAGATGACGGTGAAAATGCCGAATATGGAAACCAAAATGATCAGAAAGATCAAGACGATAGTGATGAAGATGAAATCGAGCAATGGAATGAGAAAGAAGATGAAGATCATGAAGATAACGAGCGTGACGACGATCAAGATGAGGTCGAGGAGTGGGACGAGAAAGAGGAGGAAGAAGAAGAATTGGAGAAGAGGATTGAAGAATTCATAGCAAAGGTGAACAAGAGATGGAGAGAAGAGAAGTTGAGAGATCATTTTCTTATTCAAATTTGTTCTAGTAACTATGTTAGTAGTACCTGA

Protein sequence

MNEKVNGKMGMRIMVGLAPILVVVILWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILLITVRNHDEPQPHEQFEIMELILQLSPYHKPHQDIEFSAREDTREDDGENAEYGNQNDQKDQDDSDEDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIEEFIAKVNKRWREEKLRDHFLIQICSSNYVSST
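The protein sequence above can be checked against the CDS by translating it in frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this nuclear gene uses the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGAACGAGAAAGTGAATGGAAAAATGGGG"))  # MNEKVNGKMG
```

Passing the full CDS string to `translate` should give a result that can be compared directly with the listed protein sequence.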
Homology
BLAST of Cla97C10G187750 vs. NCBI nr
Match: XP_038876855.1 (glutamic acid-rich protein-like [Benincasa hispida])

HSP 1 Score: 237.3 bits (604), Expect = 1.2e-58
Identity = 154/204 (75.49%), Positives = 170/204 (83.33%), Query Frame = 0

Query: 3   EKVNGKMGMRIMVGLAPILVVVILWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILLITVR 62
           EKVNGKMGMRIMVGL PILVV+ILWVTSKSSQ F SS+LF SPH IFSIFNSIILLIT+R
Sbjct: 9   EKVNGKMGMRIMVGLVPILVVMILWVTSKSSQIFLSSHLFVSPHLIFSIFNSIILLITMR 68

Query: 63  NHDEPQPHEQFEIMELILQLSPYHKPHQDIEFSA---REDTR---EDDGENAEYGNQNDQ 122
           NH EP P EQFEIMEL LQLSPYHKPHQDIEFSA   REDTR   ED+ +N EYGN+N++
Sbjct: 69  NH-EPHPREQFEIMELTLQLSPYHKPHQDIEFSACDQREDTREEDEDEDKNVEYGNKNEK 128

Query: 123 KDQDDSDEDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIEEFIAKVNK 182
           +D + +DEDEI+Q NEKE ED EDN+RD+  D        EEE+EELEKRIEEFIAKVNK
Sbjct: 129 ED-EFNDEDEIKQLNEKEHED-EDNDRDEVGD--------EEEDEELEKRIEEFIAKVNK 188

Query: 183 RWREEKLRDHFLIQICSSNYVSST 201
           RWREEKLRDH LIQICS+N VSST
Sbjct: 189 RWREEKLRDHLLIQICSNN-VSST 200
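The percentages in the HSP header are the reported counts divided by the alignment length, which includes gap columns. A small sketch of that arithmetic, using the counts reported for this hit; the helper name `percent` is hypothetical.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"


# Counts reported for the XP_038876855.1 HSP above.
print(percent(154, 204))  # 75.49 (identity)
print(percent(170, 204))  # 83.33 (positives)
```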

BLAST of Cla97C10G187750 vs. NCBI nr
Match: KAE8650077.1 (hypothetical protein Csa_010314 [Cucumis sativus])

HSP 1 Score: 204.1 bits (518), Expect = 1.1e-48
Identity = 136/199 (68.34%), Positives = 161/199 (80.90%), Query Frame = 0

Query: 9   MGMRIMVGLAPILVVVILWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILLITVRNHDEPQ 68
           MGMRIM+GL PILVV++L VTSKSSQFF    LFGSPHFIFSIFNSIILLIT+RNH+EPQ
Sbjct: 1   MGMRIMLGLIPILVVMVLLVTSKSSQFFL---LFGSPHFIFSIFNSIILLITMRNHEEPQ 60

Query: 69  PHEQFEIMELILQLSPYH-KP-HQDIEF-SAREDTR--EDDGE---NAEYGNQNDQKDQD 128
           PHEQF+IMELILQLSPYH KP HQD EF    +D R  +DDG+   + +Y N+N+++D+ 
Sbjct: 61  PHEQFDIMELILQLSPYHNKPHHQDTEFHDDHKDKRVEDDDGDVNTDDQYDNRNEKEDK- 120

Query: 129 DSDEDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIEEFIAKVNKRWRE 188
            SDEDE+++W+EKE E       DD+ DE E WDE EEE+EELE RIEEFIAKVNKRWRE
Sbjct: 121 VSDEDEMKEWDEKEGE-------DDEYDEAEGWDE-EEEDEELEMRIEEFIAKVNKRWRE 180

Query: 189 EKLRDHFLIQICSSNYVSS 200
           EKLRDH LIQICS+N V++
Sbjct: 181 EKLRDHLLIQICSTNNVTN 187

BLAST of Cla97C10G187750 vs. NCBI nr
Match: XP_023005569.1 (acidic leucine-rich nuclear phosphoprotein 32 family member B-like [Cucurbita maxima])

HSP 1 Score: 125.2 bits (313), Expect = 6.5e-25
Identity = 98/196 (50.00%), Positives = 122/196 (62.24%), Query Frame = 0

Query: 9   MGMRIMVGLAPIL-----VVVILWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILLITVRN 68
           MGMRIMVGL  ++       V+L  TSKSSQ F SS L GSPHF+FSIFNSIILLI V  
Sbjct: 1   MGMRIMVGLVAVMAAAAAAAVVLLATSKSSQVFLSSKLLGSPHFMFSIFNSIILLIIVTY 60

Query: 69  HDEPQPHEQFEIME---LILQLSPYHKPHQDIEFSAREDTREDDGENAEYGNQNDQKDQD 128
           H  P     + I +    +     Y+ P Q+IEF A +D  + D E          +D  
Sbjct: 61  HRPPLHQSPYHIRKGSCYLESYQGYYTPCQNIEFYACDDEHDSDDE---------YEDDS 120

Query: 129 DSDEDEIEQWNEKEDEDHEDNERD-DDQDEVEEWDEKEEEEEELEKRIEEFIAKVNKRWR 188
            +++DEIEQ NE E E  ++ + D DD+D+ E  DEK + ++ELEKRIEEFIAKVNKRWR
Sbjct: 121 QNEKDEIEQLNEDETEQCDEEDEDVDDEDDTERCDEK-DNDDELEKRIEEFIAKVNKRWR 180

Query: 189 EEKLRDHFLIQICSSN 196
           EEKLRD+   Q CSSN
Sbjct: 181 EEKLRDNLFNQFCSSN 186

BLAST of Cla97C10G187750 vs. NCBI nr
Match: TYK19467.1 (hypothetical protein E5676_scaffold443G001210 [Cucumis melo var. makuwa])

HSP 1 Score: 121.3 bits (303), Expect = 9.3e-24
Identity = 91/152 (59.87%), Positives = 113/152 (74.34%), Query Frame = 0

Query: 61  VRNHDEPQ-PHEQFEIM--ELILQLSPYH-KPHQDIEFSAREDTRE---DDGENA----E 120
           +RNH+EPQ PHEQFEIM  +LIL+LSPYH KPH        +DTRE   DDG+N     +
Sbjct: 1   MRNHEEPQRPHEQFEIMDDQLILRLSPYHNKPH-------HQDTREEDDDDGDNQNTDDQ 60

Query: 121 YGNQNDQKDQDDSD-EDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIE 180
           Y N+N+++D+  SD EDE+++W+EKE E        +D+ E E WDE EEE+EELEKRIE
Sbjct: 61  YDNRNEKEDKLISDIEDEMKEWDEKEGE--------EDEYEAERWDE-EEEDEELEKRIE 120

Query: 181 EFIAKVNKRWREEKLRDHFLIQICSSNYVSST 201
           EFIAKVNKRWREEKLRDH LIQICS+N +S+T
Sbjct: 121 EFIAKVNKRWREEKLRDHLLIQICSTNVISNT 136

BLAST of Cla97C10G187750 vs. NCBI nr
Match: KAG7028360.1 (RNA polymerase II transcription factor B subunit 5, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 111.7 bits (278), Expect = 7.4e-21
Identity = 97/201 (48.26%), Positives = 122/201 (60.70%), Query Frame = 0

Query: 9   MGMRIMVGLAPIL----------VVVILWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILL 68
           MGMRIMVGL  ++           VV+L VTSKSSQFF S+ L GSPHF+FSIFNSIILL
Sbjct: 1   MGMRIMVGLVAVMAAAAAAAATTTVVVLLVTSKSSQFFLSTKLLGSPHFMFSIFNSIILL 60

Query: 69  ITVRNHDEPQPHEQFEIMELILQLSPYHKPH---QDIEFSAREDTREDDGENAEYGNQND 128
           I V  H  P     + I +    +  YH+ +   Q IEF     T   D E  +    N+
Sbjct: 61  IIVTYHPPPPHQSPYHIRKGSCYVESYHEYYTTCQHIEFC----THNGDDEYED----NN 120

Query: 129 QKDQDDSDEDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIEEFIAKVN 188
           Q ++DD D DE EQ +E EDED       DD++E    DE E+ ++ELEKRIEEFIAKVN
Sbjct: 121 QNEKDDRDNDETEQCDE-EDED------VDDENETRRCDE-EDNDDELEKRIEEFIAKVN 180

Query: 189 KRWREEKLRDHFLIQICSSNY 197
           KRWREEKL+D+ L Q  + ++
Sbjct: 181 KRWREEKLQDNLLNQFWAFSF 185

BLAST of Cla97C10G187750 vs. ExPASy TrEMBL
Match: A0A0A0L728 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G036520 PE=4 SV=1)

HSP 1 Score: 132.1 bits (331), Expect = 2.6e-27
Identity = 93/147 (63.27%), Positives = 115/147 (78.23%), Query Frame = 0

Query: 61  VRNHDEPQPHEQFEIMELILQLSPYH-KP-HQDIEF-SAREDTR--EDDGE---NAEYGN 120
           +RNH+EPQPHEQF+IMELILQLSPYH KP HQD EF    +D R  +DDG+   + +Y N
Sbjct: 1   MRNHEEPQPHEQFDIMELILQLSPYHNKPHHQDTEFHDDHKDKRVEDDDGDVNTDDQYDN 60

Query: 121 QNDQKDQDDSDEDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIEEFIA 180
           +N+++D+  SDEDE+++W+EKE E       DD+ DE E WDE EEE+EELE RIEEFIA
Sbjct: 61  RNEKEDK-VSDEDEMKEWDEKEGE-------DDEYDEAEGWDE-EEEDEELEMRIEEFIA 120

Query: 181 KVNKRWREEKLRDHFLIQICSSNYVSS 200
           KVNKRWREEKLRDH LIQICS+N V++
Sbjct: 121 KVNKRWREEKLRDHLLIQICSTNNVTN 138

BLAST of Cla97C10G187750 vs. ExPASy TrEMBL
Match: A0A6J1KZL9 (acidic leucine-rich nuclear phosphoprotein 32 family member B-like OS=Cucurbita maxima OX=3661 GN=LOC111498513 PE=4 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 3.1e-25
Identity = 98/196 (50.00%), Positives = 122/196 (62.24%), Query Frame = 0

Query: 9   MGMRIMVGLAPIL-----VVVILWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILLITVRN 68
           MGMRIMVGL  ++       V+L  TSKSSQ F SS L GSPHF+FSIFNSIILLI V  
Sbjct: 1   MGMRIMVGLVAVMAAAAAAAVVLLATSKSSQVFLSSKLLGSPHFMFSIFNSIILLIIVTY 60

Query: 69  HDEPQPHEQFEIME---LILQLSPYHKPHQDIEFSAREDTREDDGENAEYGNQNDQKDQD 128
           H  P     + I +    +     Y+ P Q+IEF A +D  + D E          +D  
Sbjct: 61  HRPPLHQSPYHIRKGSCYLESYQGYYTPCQNIEFYACDDEHDSDDE---------YEDDS 120

Query: 129 DSDEDEIEQWNEKEDEDHEDNERD-DDQDEVEEWDEKEEEEEELEKRIEEFIAKVNKRWR 188
            +++DEIEQ NE E E  ++ + D DD+D+ E  DEK + ++ELEKRIEEFIAKVNKRWR
Sbjct: 121 QNEKDEIEQLNEDETEQCDEEDEDVDDEDDTERCDEK-DNDDELEKRIEEFIAKVNKRWR 180

Query: 189 EEKLRDHFLIQICSSN 196
           EEKLRD+   Q CSSN
Sbjct: 181 EEKLRDNLFNQFCSSN 186

BLAST of Cla97C10G187750 vs. ExPASy TrEMBL
Match: A0A5D3D7D0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold443G001210 PE=4 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 4.5e-24
Identity = 91/152 (59.87%), Positives = 113/152 (74.34%), Query Frame = 0

Query: 61  VRNHDEPQ-PHEQFEIM--ELILQLSPYH-KPHQDIEFSAREDTRE---DDGENA----E 120
           +RNH+EPQ PHEQFEIM  +LIL+LSPYH KPH        +DTRE   DDG+N     +
Sbjct: 1   MRNHEEPQRPHEQFEIMDDQLILRLSPYHNKPH-------HQDTREEDDDDGDNQNTDDQ 60

Query: 121 YGNQNDQKDQDDSD-EDEIEQWNEKEDEDHEDNERDDDQDEVEEWDEKEEEEEELEKRIE 180
           Y N+N+++D+  SD EDE+++W+EKE E        +D+ E E WDE EEE+EELEKRIE
Sbjct: 61  YDNRNEKEDKLISDIEDEMKEWDEKEGE--------EDEYEAERWDE-EEEDEELEKRIE 120

Query: 181 EFIAKVNKRWREEKLRDHFLIQICSSNYVSST 201
           EFIAKVNKRWREEKLRDH LIQICS+N +S+T
Sbjct: 121 EFIAKVNKRWREEKLRDHLLIQICSTNVISNT 136

BLAST of Cla97C10G187750 vs. ExPASy TrEMBL
Match: A0A6J1E6C0 (nucleolin-like OS=Cucurbita moschata OX=3662 GN=LOC111431005 PE=4 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 4.4e-11
Identity = 85/176 (48.30%), Positives = 105/176 (59.66%), Query Frame = 0

Query: 14  MVGLAPILVVVI-LWVTSKSSQFFQSSNLFGSPHFIFSIFNSIILLITVRNHDEPQPHEQ 73
           MVGL PILVVV+ LWV+S        S+LFGSP+ IFSIFN +ILL+TVRNH EP+    
Sbjct: 1   MVGLVPILVVVVLLWVSS------NPSHLFGSPYIIFSIFNFMILLVTVRNH-EPR---- 60

Query: 74  FEIMELILQLSPYHKPHQDIEFSAR---EDTREDD-GENAEYGNQNDQKDQDDSDEDEIE 133
                 +LQ S YHK  QD EFSAR   ED REDD  ENA+    + + ++ D D DEIE
Sbjct: 61  -----CLLQPS-YHKLCQDAEFSAREEDEDKREDDEDENAD----SSENEKIDRDRDEIE 120

Query: 134 QWNEKEDED------HEDNERDDDQ----------------DEVEEWDEKEEEEEE 163
           Q NE+++ED       ED +RD D+                DE E WD++EEEEEE
Sbjct: 121 QLNEEDEEDADNDGFDEDEDRDGDEIEQSNEEDEDNDSVDKDETERWDDEEEEEEE 155

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity  Description
XP_038876855.1  1.2e-58  75.49     glutamic acid-rich protein-like [Benincasa hispida]
KAE8650077.1    1.1e-48  68.34     hypothetical protein Csa_010314 [Cucumis sativus]
XP_023005569.1  6.5e-25  50.00     acidic leucine-rich nuclear phosphoprotein 32 family member B-like [Cucurbita ma...
TYK19467.1      9.3e-24  59.87     hypothetical protein E5676_scaffold443G001210 [Cucumis melo var. makuwa]
KAG7028360.1    7.4e-21  48.26     RNA polymerase II transcription factor B subunit 5, partial [Cucurbita argyrospe...

ExPASy TrEMBL
Match Name  E-value  Identity  Description
A0A0A0L728  2.6e-27  63.27     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G036520 PE=4 SV=1
A0A6J1KZL9  3.1e-25  50.00     acidic leucine-rich nuclear phosphoprotein 32 family member B-like OS=Cucurbita ...
A0A5D3D7D0  4.5e-24  59.87     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1E6C0  4.4e-11  48.30     nucleolin-like OS=Cucurbita moschata OX=3662 GN=LOC111431005 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term    Source Description     Alignment
None      No IPR available  COILS        Coil           Coil                   coord: 143..173
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 118..164
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 96..117
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 96..164
None      No IPR available  PANTHER      PTHR36595:SF2  SUBFAMILY NOT NAMED    coord: 44..193
None      No IPR available  PANTHER      PTHR36595      TRANSMEMBRANE PROTEIN  coord: 44..193

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C10G187750.2  Cla97C10G187750.2  mRNA