CaUC01G008970 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G008970
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Unknown protein
Location: Ciama_Chr01: 10607510 .. 10610271 (-)
RNA-Seq Expression: CaUC01G008970
Synteny: CaUC01G008970
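
The Location field above can be split into sequence ID, start, end, and strand. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this record does not state.

```python
import re

# Pattern for a location written as "<seqid>: <start> .. <end> (<strand>)".
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Ciama_Chr01: 10607510 .. 10610271 (-)")
print(seqid, strand, span_length(start, end))  # Ciama_Chr01 - 2762
```

Under that assumption the gene spans 2762 bp on the minus strand of Ciama_Chr01.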
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGAATACCCGACCGAGCTTCATCCGAGGACATCGGGTATTGGTCGGGCGTAATTTTGGGATGGCGAAAGTGGCCCAATCCCGTCGGGTATAGAGTGGGGAAGTGTGCACCCGGTACTTGCCCCTACCAAGTATTGACCGGGCGATGTAGTTGTGGGATCTCTCTGATTTTTTGGCCTGGTTTTGGTCTAGCCGAGTTTTACGAGGCAAGGTAGGTAGGGGTCTCTTTATTTTTTGGCCCGGTCTTGCCCTGGGCGGATTTTTGACCAGGCGGGGGAAGAATTTTTATGGCTGGGTATGGATCGGGCAAGATAGTTGGTGGAACCGTGGTAGGTATGCACCAAGCAAGACAAATTGAAGGAGAGTAGGGATGGGAAACAACAAACGTATAGTTTGTACGACCTCCGTCACGCATTCCAGGTCAGTTGTTATAATTAGTTTCAATATTGCTTTCCTCTTTGAATTATTAGTTGTCTAACGAAATATATTTTATTTCATTTTACGTAGGTATAGGCATATGAGACCATTTCGTCCTTGACTGGACGTGTTGCAAATCGATTGAGCAACAATGCCATATCACGTATTTTTCGATGGTCTTGCTCACACTCTCCTTCATATGCTTTGATTAGGGATGAGGTGTTCGGTGCTAAAGGGGTNTTTTTTTTTTTTTTTTTTTTTTTTTACTTATAGATAATATTCTGTTTAGTAGGCATTTGTTACGCCAGAGCTTGTGGCAACGAAAGAGGAGATGATGTTCATGGATCACGTTATGTTGCCACCGGAAGCATCGATCTTACCACCGACACCCATGTTATCACCGGTACCACAGAATGATCCCGAGCCTTCAACAAGTCACGCTAATGACGATGTTGATGCAAACGAGGATGTAGAAAGGAGGAATTGGAGGAGGAGGAAGAAGAAGAAGAGTCTGAAGAAGATGTTAAAAAAAATGAAAAGTTGCAGTAAAAAACTTGAGGCTAAAGTTAATGACCTTAATGATCGTGTTGGAGGCATCAAGAAGGACCTAAAAGCAATGAAGAAATATTTCCACCGAATGTCTAAGGTATGACAAAACAATAATAAAATTTTATTATTGACACTTCTTTTTCATTTATATTTTCGTTTATATTGAATCATTCAAGTTCATTTTGTTGGGATTTATGTCTTAAATCTTGTAGTCTATAAATTTGTAAATAAATTAATAAAGTATTATTTATTTTTATAATAATTGTTATTTTCTTGAGGTTATTTAATGTATCTTGAACAATATGTGGTTGACATACAATTGGATCGCGTTCAAGAAATAACCTAAAAGGTCTATAGTATATGAGTAAAGTTGGGTGCCTCATCCTGGTAACGCCATGGATACGACCTACTTTGTAAGTGTTACATACGGTTTCATCTAAATCGTTCGTGTAATGACACGTAAGTGGGGCTATCCTATATGGTGAGTCTGTATAAGACTAGACCATGAAATTAAATATCTCTTTGTAACTTCGTTAATTGAAGAGATTTATATTTCAAATGAGAACCATGTAACTTGATCTCAATCCTGAGTGAGTTATGAACTCTTGCTCATAAAGATCATTCTTTGATTTGTATGGGTGAGAGTGGTCTCAGCTACCGACTCAATACGCCTACCATTTTGGGAATGATGTCGAGTGAGGAGCTGGGAACATAACTTCACAAGATGAAATTCACTCCTTTCCTAACTTTAGGGAAAATAGATAGGTTGTTCCCTTAAATGTTGATTCTAGGACTTGAACAATGAGGTCTCACCCTCTCACTGATTTGAGAGTGACTTTGTTTATGATTTGACCATAAATAGTATTGTTCATTAGAGAATCAATGGTACTTAAGGTGCAAGAGGTAATTATAGAGGTAAAACGGTAATTTCACCCAGTTGTAAAACGAACAACCTGTGGAGGATTGACTTACAAAATATGGTCATTCAATGGACACATAAATATACCGCAGTTCATAAGAGTGCAACTATGG
GTCTTTAGTGGTATGCCTCATGGTTAGTGTATGTTGATTAATATAATTAATGAGTTTAATTTATTAATCTCAAATTATTGGAGCTTATAACGGTAGGTCCATTAGGTTCCTCTAGTAGCTCAAAACAGGTTAAAACAATGAAATAAAAGAATTTGAGTGTTCAAATTAATCAAGGGAATTAATTGTATATGATACAATTAACATTACGTATGAGATACATCATAATATAAAGTTTATAATGTGAGAAAAAACTTTATAATATGAGAATATTAATATTTGAATAGGATTTAAATATTAAATTAGTATGAATTGGATTCATATTAAAACTATAAGTTATAAGAGTATATGTGTATTATGACACATTGAATCAAATATATGGTATTTGATTTCCTTTAATTAATTAAATAAGATTTCATTAATTAAATGGTTATTTAATTAATAGTTAATTATTATATATAAATTAATTAAATTTTATTTATTAACTTATATTAATTAAAGATATTATTATTATTATTAAATAACAAAACAACTCTCATGCAATGCAGGGGAGTAATAAGTGGGGAGGGAAGTCCCTCACTTTAAATTGAGATGAGATCTCAATTGTTTCTTTACGTGAAATATTACAGAAAGAAACTCTCTCTCCTCTCAACTCTCAAAAGAAACTCTCCGCGTTTTTCCCTTTTGGTTCAGAAGGTTCCCATCAACTTCTTTGTCCCAAGAGTATATCAGAGAAGATCCATTGGTGGTGTTCTTGGAGAATGA

mRNA sequence

ATGATGAATACCCGACCGAGCTTCATCCGAGGACATCGGGTATTGGTCGGGCGTAATTTTGGGATGGCGAAAGTGGCCCAATCCCGTCGGGTATGCACCAAGCAAGACAAATTGAAGGAGAGTAGGGATGGGAAACAACAAACGTATAGTTTGTACGACCTCCGTCACGCATTCCAGGCATATGAGACCATTTCGTCCTTGACTGGACGTGTTGCAAATCGATTGAGCAACAATGCCATATCACGTATTTTTCGATGGTCTTGCTCACACTCTCCTTCATATGCTTTGATTAGGGATGAGGCATTTGTTACGCCAGAGCTTGTGGCAACGAAAGAGGAGATGATGTTCATGGATCACGTTATGTTGCCACCGGAAGCATCGATCTTACCACCGACACCCATGTTATCACCGGTACCACAGAATGATCCCGAGCCTTCAACAAGTCACGCTAATGACGATGTTGATGCAAACGAGGATGTAGAAAGGAGGAATTGGAGGAGGAGGAAGAAGAAGAAGAGTCTGAAGAAGATGTTAAAAAAAATGAAAAGTTGCAGTAAAAAACTTGAGGCTAAAGTTAATGACCTTAATGATCGTGTTGGAGGCATCAAGAAGGACCTAAAAGCAATGAAGAAATATTTCCACCGAATGTCTAAGAAGGTTCCCATCAACTTCTTTGTCCCAAGAGTATATCAGAGAAGATCCATTGGTGGTGTTCTTGGAGAATGA

Coding sequence (CDS)

ATGATGAATACCCGACCGAGCTTCATCCGAGGACATCGGGTATTGGTCGGGCGTAATTTTGGGATGGCGAAAGTGGCCCAATCCCGTCGGGTATGCACCAAGCAAGACAAATTGAAGGAGAGTAGGGATGGGAAACAACAAACGTATAGTTTGTACGACCTCCGTCACGCATTCCAGGCATATGAGACCATTTCGTCCTTGACTGGACGTGTTGCAAATCGATTGAGCAACAATGCCATATCACGTATTTTTCGATGGTCTTGCTCACACTCTCCTTCATATGCTTTGATTAGGGATGAGGCATTTGTTACGCCAGAGCTTGTGGCAACGAAAGAGGAGATGATGTTCATGGATCACGTTATGTTGCCACCGGAAGCATCGATCTTACCACCGACACCCATGTTATCACCGGTACCACAGAATGATCCCGAGCCTTCAACAAGTCACGCTAATGACGATGTTGATGCAAACGAGGATGTAGAAAGGAGGAATTGGAGGAGGAGGAAGAAGAAGAAGAGTCTGAAGAAGATGTTAAAAAAAATGAAAAGTTGCAGTAAAAAACTTGAGGCTAAAGTTAATGACCTTAATGATCGTGTTGGAGGCATCAAGAAGGACCTAAAAGCAATGAAGAAATATTTCCACCGAATGTCTAAGAAGGTTCCCATCAACTTCTTTGTCCCAAGAGTATATCAGAGAAGATCCATTGGTGGTGTTCTTGGAGAATGA

Protein sequence

MMNTRPSFIRGHRVLVGRNFGMAKVAQSRRVCTKQDKLKESRDGKQQTYSLYDLRHAFQAYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAFVTPELVATKEEMMFMDHVMLPPEASILPPTPMLSPVPQNDPEPSTSHANDDVDANEDVERRNWRRRKKKKSLKKMLKKMKSCSKKLEAKVNDLNDRVGGIKKDLKAMKKYFHRMSKKVPINFFVPRVYQRRSIGGVLGE
Homology
BLAST of CaUC01G008970 vs. NCBI nr
Match: XP_038881592.1 (uncharacterized protein LOC120073060 [Benincasa hispida])

HSP 1 Score: 100.1 bits (248), Expect = 2.7e-17
Identity = 74/163 (45.40%), Positives = 98/163 (60.12%), Query Frame = 0

Query: 60  AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAFVTP---ELVATKEEMMF 119
           A++T+SSL GRVAN LS NA   I++WSCSH PS+A+I+    V     +      E  +
Sbjct: 138 AHDTMSSLVGRVANWLSENASPHIYQWSCSHFPSHAVIKGNYHVEACGNQRGDGVHESHY 197

Query: 120 MDHVMLPPEASILPPTPMLSPVPQNDPEPSTSHANDDVDANEDVERRNWRRRKKKKSLKK 179
           +    +    +  P  P + PVP    E ST+ A D+V+ANED ERRN RRRK KKS+KK
Sbjct: 198 VATGSIDFTTATEPSLPPMPPVPPIGVETSTTQAPDNVNANEDEERRN-RRRKNKKSMKK 257

Query: 180 MLKK-MKSCSKKLEAKVNDLNDRVGGIKKDLKAMKKYFHRMSK 219
           M KK MK   K+ +A    L+  V  ++K+ KAMKKY  RMSK
Sbjct: 258 MFKKFMKEVGKRFDA----LDQHVERLEKNFKAMKKYMRRMSK 295
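
The percentages in an HSP statistics line are the ratio of matching columns to alignment length. The sketch below recomputes them from the first hit above; the helper names are hypothetical, and the parser reads only the `a/b (p%)` fields rather than the full BLAST report format.

```python
import re

# Matches each "= a/b (p%)" field on an HSP statistics line.
RATIO = re.compile(r"= (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_ratios(stats_line):
    """Return (matches, alignment_length, reported_percent) for each ratio field."""
    return [(int(a), int(b), float(p)) for a, b, p in RATIO.findall(stats_line)]

def percent(matches, length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)

line = "Identity = 74/163 (45.40%), Positives = 98/163 (60.12%), Query Frame = 0"
for matches, length, reported in hsp_ratios(line):
    # The first field is identity, the second is positives.
    print(matches, length, reported, percent(matches, length))
```

For this hit, 74 of 163 aligned columns are identical (45.40%) and 98 of 163 are scored as positive (60.12%), which agrees with the reported values.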

BLAST of CaUC01G008970 vs. NCBI nr
Match: XP_022154965.1 (uncharacterized protein LOC111022110 [Momordica charantia])

HSP 1 Score: 72.0 bits (175), Expect = 7.8e-09
Identity = 69/210 (32.86%), Positives = 103/210 (49.05%), Query Frame = 0

Query: 47  QTYSLYDLRHAFQ--AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF-- 106
           +TYSLY   +AFQ  AYETIS+L+ RVA RL+++AI R+ RWSC++S ++ ++  E F  
Sbjct: 64  ETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFEN 123

Query: 107 ----VTPELVATKEEMMFMDHVMLPPEASILPPTPM------LSPVPQNDPEPSTSHAND 166
               V   L AT  E   M  VM PP A + PP P       L+        P TS   D
Sbjct: 124 VKSKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTSEVGD 183

Query: 167 DV-------DANEDVER-----------------RNWRRRKKKKSLKKMLKKMKSCSKKL 219
            V       DA+  V+R                 +    +KKKKS  K  +++    ++L
Sbjct: 184 LVELDDVAKDASPLVDRVTEDIIGTDGGQDQLLPQKGTEKKKKKSKHKWSREL----RRL 243

BLAST of CaUC01G008970 vs. NCBI nr
Match: XP_022136676.1 (uncharacterized protein LOC111008328 [Momordica charantia])

HSP 1 Score: 65.5 bits (158), Expect = 7.3e-07
Identity = 56/153 (36.60%), Positives = 77/153 (50.33%), Query Frame = 0

Query: 47  QTYSLYDLRHAFQ--AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF-- 106
           +TYSLY   +AFQ  AYETIS+L+ RVA RL+++AI R+ RWSC++S ++ ++  E F  
Sbjct: 7   ETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFEN 66

Query: 107 ----VTPELVATKEEMMFMDHVMLPPEASILPPTPM------LSPVPQNDPEPSTSHAND 166
               V   L AT  E   M  VM PP   + PP P       LS          T    D
Sbjct: 67  VKSKVVVRLEATDVERQHMARVMHPPVTPVGPPAPTKLATEPLSTTSTTQKSLVTGEVGD 126

Query: 167 DV-------DANED----VERRNWRRRKKKKSL 175
            V       DA+ D      R+  RRR++ +S+
Sbjct: 127 PVELDDVAKDASPDKINCCHRKGRRRRRRSQSI 159

BLAST of CaUC01G008970 vs. NCBI nr
Match: XP_022153201.1 (uncharacterized protein LOC111020757 [Momordica charantia])

HSP 1 Score: 62.0 bits (149), Expect = 8.1e-06
Identity = 69/239 (28.87%), Positives = 109/239 (45.61%), Query Frame = 0

Query: 19  NFGMAKVAQSRRVCTKQDKLKESRDGKQQ----------TYSLYDLRHAFQ--AYETISS 78
           N+  + +   R + + ++ LK+     QQ          TYSLY   +AFQ  AYETIS+
Sbjct: 185 NYDWSSMIFDRTIWSLKNALKDKLSVYQQKATADPSHVETYSLYGFPYAFQVWAYETIST 244

Query: 79  LTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF------VTPELVATKEEMMFMDHV 138
                   LS++AI R+ RWSC +S  + ++  E F      V   L+AT  +   M  V
Sbjct: 245 --------LSDDAIPRLLRWSCIYSCGFRVLTSEVFDNTRSKVKEHLLATDAKEQHMVRV 304

Query: 139 MLPPEASILP-----------PTPMLSPVPQNDPEPST-------------SHANDD--- 198
           +LPPE  ++P           P P  SP     P+P               +HA D+   
Sbjct: 305 ILPPEVRVIPDPPAVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDAHAVDEARP 364

Query: 199 -VDANEDVERRNWRRRKKKKSLKKMLKKMKSCSKKLEAKVNDLNDRVGGIKKDLKAMKK 212
             +  E +E+R  ++ K KK + + LK++ +C   +E K+ D    + GI+  LK + K
Sbjct: 365 SANDGEGLEKR-LKKNKFKKRISRRLKRLDNCVGAIEDKLGDFGVALKGIQIYLKKLAK 414

BLAST of CaUC01G008970 vs. NCBI nr
Match: XP_022157020.1 (uncharacterized protein LOC111023847 [Momordica charantia])

HSP 1 Score: 60.8 bits (146), Expect = 1.8e-05
Identity = 30/58 (51.72%), Positives = 44/58 (75.86%), Query Frame = 0

Query: 47  QTYSLYDLRHAFQ--AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF 103
           +TYSLY   +AFQ  AYETIS+L+ RVA RL+++AI R+ RWSC++S ++ ++  E F
Sbjct: 223 ETYSLYXFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVF 280

BLAST of CaUC01G008970 vs. ExPASy TrEMBL
Match: A0A6J1DL40 (uncharacterized protein LOC111022110 OS=Momordica charantia OX=3673 GN=LOC111022110 PE=4 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 3.8e-09
Identity = 69/210 (32.86%), Positives = 103/210 (49.05%), Query Frame = 0

Query: 47  QTYSLYDLRHAFQ--AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF-- 106
           +TYSLY   +AFQ  AYETIS+L+ RVA RL+++AI R+ RWSC++S ++ ++  E F  
Sbjct: 64  ETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFEN 123

Query: 107 ----VTPELVATKEEMMFMDHVMLPPEASILPPTPM------LSPVPQNDPEPSTSHAND 166
               V   L AT  E   M  VM PP A + PP P       L+        P TS   D
Sbjct: 124 VKSKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTSEVGD 183

Query: 167 DV-------DANEDVER-----------------RNWRRRKKKKSLKKMLKKMKSCSKKL 219
            V       DA+  V+R                 +    +KKKKS  K  +++    ++L
Sbjct: 184 LVELDDVAKDASPLVDRVTEDIIGTDGGQDQLLPQKGTEKKKKKSKHKWSREL----RRL 243

BLAST of CaUC01G008970 vs. ExPASy TrEMBL
Match: A0A6J1C463 (uncharacterized protein LOC111008328 OS=Momordica charantia OX=3673 GN=LOC111008328 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 3.6e-07
Identity = 56/153 (36.60%), Positives = 77/153 (50.33%), Query Frame = 0

Query: 47  QTYSLYDLRHAFQ--AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF-- 106
           +TYSLY   +AFQ  AYETIS+L+ RVA RL+++AI R+ RWSC++S ++ ++  E F  
Sbjct: 7   ETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFEN 66

Query: 107 ----VTPELVATKEEMMFMDHVMLPPEASILPPTPM------LSPVPQNDPEPSTSHAND 166
               V   L AT  E   M  VM PP   + PP P       LS          T    D
Sbjct: 67  VKSKVVVRLEATDVERQHMARVMHPPVTPVGPPAPTKLATEPLSTTSTTQKSLVTGEVGD 126

Query: 167 DV-------DANED----VERRNWRRRKKKKSL 175
            V       DA+ D      R+  RRR++ +S+
Sbjct: 127 PVELDDVAKDASPDKINCCHRKGRRRRRRSQSI 159

BLAST of CaUC01G008970 vs. ExPASy TrEMBL
Match: A0A6J1DJX9 (uncharacterized protein LOC111020757 OS=Momordica charantia OX=3673 GN=LOC111020757 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 3.9e-06
Identity = 69/239 (28.87%), Positives = 109/239 (45.61%), Query Frame = 0

Query: 19  NFGMAKVAQSRRVCTKQDKLKESRDGKQQ----------TYSLYDLRHAFQ--AYETISS 78
           N+  + +   R + + ++ LK+     QQ          TYSLY   +AFQ  AYETIS+
Sbjct: 185 NYDWSSMIFDRTIWSLKNALKDKLSVYQQKATADPSHVETYSLYGFPYAFQVWAYETIST 244

Query: 79  LTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF------VTPELVATKEEMMFMDHV 138
                   LS++AI R+ RWSC +S  + ++  E F      V   L+AT  +   M  V
Sbjct: 245 --------LSDDAIPRLLRWSCIYSCGFRVLTSEVFDNTRSKVKEHLLATDAKEQHMVRV 304

Query: 139 MLPPEASILP-----------PTPMLSPVPQNDPEPST-------------SHANDD--- 198
           +LPPE  ++P           P P  SP     P+P               +HA D+   
Sbjct: 305 ILPPEVRVIPDPPAVPDRAVVPDPPASPERAAVPDPPADVEMGPLEDPVVDAHAVDEARP 364

Query: 199 -VDANEDVERRNWRRRKKKKSLKKMLKKMKSCSKKLEAKVNDLNDRVGGIKKDLKAMKK 212
             +  E +E+R  ++ K KK + + LK++ +C   +E K+ D    + GI+  LK + K
Sbjct: 365 SANDGEGLEKR-LKKNKFKKRISRRLKRLDNCVGAIEDKLGDFGVALKGIQIYLKKLAK 414

BLAST of CaUC01G008970 vs. ExPASy TrEMBL
Match: A0A6J1DRZ7 (uncharacterized protein LOC111023847 OS=Momordica charantia OX=3673 GN=LOC111023847 PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 8.7e-06
Identity = 30/58 (51.72%), Positives = 44/58 (75.86%), Query Frame = 0

Query: 47  QTYSLYDLRHAFQ--AYETISSLTGRVANRLSNNAISRIFRWSCSHSPSYALIRDEAF 103
           +TYSLY   +AFQ  AYETIS+L+ RVA RL+++AI R+ RWSC++S ++ ++  E F
Sbjct: 223 ETYSLYXFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVF 280

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name       E-value   Identity   Description
XP_038881592.1   2.7e-17   45.40      uncharacterized protein LOC120073060 [Benincasa hispida]
XP_022154965.1   7.8e-09   32.86      uncharacterized protein LOC111022110 [Momordica charantia]
XP_022136676.1   7.3e-07   36.60      uncharacterized protein LOC111008328 [Momordica charantia]
XP_022153201.1   8.1e-06   28.87      uncharacterized protein LOC111020757 [Momordica charantia]
XP_022157020.1   1.8e-05   51.72      uncharacterized protein LOC111023847 [Momordica charantia]

vs. ExPASy TrEMBL
Match Name   E-value   Identity   Description
A0A6J1DL40   3.8e-09   32.86      uncharacterized protein LOC111022110 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1C463   3.6e-07   36.60      uncharacterized protein LOC111008328 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1DJX9   3.9e-06   28.87      uncharacterized protein LOC111020757 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1DRZ7   8.7e-06   51.72      uncharacterized protein LOC111023847 OS=Momordica charantia OX=3673 GN=LOC111023...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description    Source        Source Term   Source Description    Alignment
None       No IPR available   COILS         Coil          Coil                  coord: 171..205
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 127..174
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 148..164
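
The alignment coordinates above are residue ranges on the protein. As a rough sketch, assuming they are 1-based and inclusive (not stated in this record), the lengths of the predicted regions and any overlap between them can be worked out as follows; `parse_coord` and `overlap` are hypothetical helpers.

```python
def parse_coord(text):
    """Turn 'coord: 171..205' into the tuple (171, 205)."""
    start, end = text.split(":", 1)[1].strip().split("..")
    return int(start), int(end)

def length(region):
    """Residue count, ASSUMING 1-based inclusive coordinates."""
    return region[1] - region[0] + 1

def overlap(a, b):
    """Shared residue range of two regions, or None if they are disjoint."""
    lo, hi = max(a[0], b[0]), min(a[1], b[1])
    return (lo, hi) if lo <= hi else None

coil = parse_coord("coord: 171..205")        # COILS prediction
disorder = parse_coord("coord: 127..174")    # first MobiDB-lite prediction
disorder_2 = parse_coord("coord: 148..164")  # second MobiDB-lite prediction

print(length(coil), length(disorder), length(disorder_2))  # 35 48 17
print(overlap(coil, disorder))    # (171, 174)
print(overlap(coil, disorder_2))  # None
```

Under that assumption, the coiled-coil prediction shares residues 171 to 174 with the longer disordered region, and the shorter disordered region lies entirely inside the longer one.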

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CaUC01G008970.1   CaUC01G008970.1   mRNA