Cla97C05G093810 (gene) Watermelon (97103) v2

Name: Cla97C05G093810
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC105851746
Location: Cla97Chr05 : 13938530 .. 13939192 (-)
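
The span covered by the Location field can be checked with a short calculation. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for records of this kind, but not stated on this page); the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = feature_length(13938530, 13939192)
print(gene_length)  # 663
```

Under that assumption the gene spans 663 bases on the minus strand of Cla97Chr05.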
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTAAAGAAGAGCTTTTTCAAGTTTTGTTTAGCAATGGATTAATCAAACAAGAGAATTTGCGAGGTGATGTCCCTGATACACAACTTAATGACAATCTAATGTGTTTATATCATGCTGGGGCAAAAGGTCATTCTATTGATCAATGCCCTCATTTCTCCCAGAAGATACAAGAGTTATTAGATTCTCGTTTCTTAGTAGTCTCACAGAAGACAATTAAAGAATCAGAGCGCTAACATGAAATACATGTTGTAGAAGAGCATGTTGCAGGCGAATTCTCGAAGGTTTCTCTAGAACGAAAACCATTAACGATATTTTACAAGGAGAAACCTAATACCACCACTTCCAATCCAAAACCAATTACCATACAGGTTCCATCTCCATTTGAGTATAAAAGCTCAAAAGCAGTACCATGGAATTATGAGTACAAGGTAATTGTTGAGTCTGGACCTATTCCTATAGACAATATTAATGAAATTAGAGGTATAACACGAAGTGGAAGGTGTTATATGCCAGAGGAATTATTGAAATACAAAGGAAAATCTAAGATAAATGATGCTATTGATTGTAAGGTAGAGGAACCCATGGTGGTGAGAAGTCGAGAAGCAAAAGAGCCAGCATCTGAAGATGACATACAAGAGTTTTTAAGACTTGTAAAGTAG

mRNA sequence

ATGTCTAAAGAAGAGCTTTTTCAAGTTTTGTTTAGCAATGGATTAATCAAACAAGAGAATTTGCGAGGTGATGTCCCTGATACACAACTTAATGACAATCTAATGTGTTTATATCATGCTGGGGCAAAAGAAGAGCATGTTGCAGGCGAATTCTCGAAGGTTTCTCTAGAACGAAAACCATTAACGATATTTTACAAGGAGAAACCTAATACCACCACTTCCAATCCAAAACCAATTACCATACAGGTTCCATCTCCATTTGAGTATAAAAGCTCAAAAGCAGTACCATGGAATTATGAGTACAAGGTAATTGTTGAGTCTGGACCTATTCCTATAGACAATATTAATGAAATTAGAGGTATAACACGAAGTGGAAGGTGTTATATGCCAGAGGAATTATTGAAATACAAAGGAAAATCTAAGATAAATGATGCTATTGATTGTAAGGTAGAGGAACCCATGGTGGTGAGAAGTCGAGAAGCAAAAGAGCCAGCATCTGAAGATGACATACAAGAGTTTTTAAGACTTGTAAAGTAG

Coding sequence (CDS)

ATGTCTAAAGAAGAGCTTTTTCAAGTTTTGTTTAGCAATGGATTAATCAAACAAGAGAATTTGCGAGGTGATGTCCCTGATACACAACTTAATGACAATCTAATGTGTTTATATCATGCTGGGGCAAAAGAAGAGCATGTTGCAGGCGAATTCTCGAAGGTTTCTCTAGAACGAAAACCATTAACGATATTTTACAAGGAGAAACCTAATACCACCACTTCCAATCCAAAACCAATTACCATACAGGTTCCATCTCCATTTGAGTATAAAAGCTCAAAAGCAGTACCATGGAATTATGAGTACAAGGTAATTGTTGAGTCTGGACCTATTCCTATAGACAATATTAATGAAATTAGAGGTATAACACGAAGTGGAAGGTGTTATATGCCAGAGGAATTATTGAAATACAAAGGAAAATCTAAGATAAATGATGCTATTGATTGTAAGGTAGAGGAACCCATGGTGGTGAGAAGTCGAGAAGCAAAAGAGCCAGCATCTGAAGATGACATACAAGAGTTTTTAAGACTTGTAAAGTAG

Protein sequence

MSKEELFQVLFSNGLIKQENLRGDVPDTQLNDNLMCLYHAGAKEEHVAGEFSKVSLERKPLTIFYKEKPNTTTSNPKPITIQVPSPFEYKSSKAVPWNYEYKVIVESGPIPIDNINEIRGITRSGRCYMPEELLKYKGKSKINDAIDCKVEEPMVVRSREAKEPASEDDIQEFLRLVK
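
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch only: it assumes the standard genetic code and uses a hypothetical helper named `translate`; it is not the pipeline that produced this annotation.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last three codons of the CDS listed above.
print(translate("ATGTCTAAAGAAGAG"))  # MSKEE
print(translate("GTAAAGTAG"))        # VK
```

The two fragments reproduce the start (MSKEE) and end (VK) of the protein sequence; passing the full CDS string to `translate` would be the natural next check.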
BLAST of Cla97C05G093810 vs. NCBI nr
Match: XP_022155098.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia])

HSP 1 Score: 132.1 bits (331), Expect = 1.9e-27
Identity = 82/241 (34.02%), Positives = 122/241 (50.62%), Query Frame = 0

Query: 4   EELFQVLFSNGLIKQENLRGDVPDTQLNDNLMCLYHAGAK-------------------- 63
           + LF++L+ +G +  E+L  D+   + ++NL C YHAGA+                    
Sbjct: 567 QXLFEILWXHGYMSMEHLCPDIRCERYDENLTCPYHAGARGHPLEQCSCFKEKVQELLDL 626

Query: 64  ----------EEHV-------AGEFSKVSLERKPLTIFYKEKPNTTTSNPKPITIQVPSP 123
                     E+ +         E S  + + KPLT+ Y+EKP   +++ +PITIQVP+P
Sbjct: 627 KILTVTQSHQEQRIDVVEYVSTAESSSAAYKPKPLTVLYREKPEVPSNSWRPITIQVPAP 686

Query: 124 FEYKSSKAVPWNYEYKVIV----ESGPIPIDNINEIRGITRSGRCYMPEELLKY------ 179
           FEY SSKAVPW YE KV V    +S  +P+DNI    G+TR+GRCY PE LLK+      
Sbjct: 687 FEYSSSKAVPWKYECKVTVGQKAQSSSLPVDNITRGGGMTRTGRCYTPESLLKHTNKPNS 746

BLAST of Cla97C05G093810 vs. NCBI nr
Match: XP_022158986.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia])

HSP 1 Score: 125.9 bits (315), Expect = 1.3e-25
Identity = 82/240 (34.17%), Positives = 115/240 (47.92%), Query Frame = 0

Query: 5   ELFQVLFSNGLIKQENLRGDVPDTQLNDNLMCLYHAGAK--------------------- 64
           ELF++L  +G +  E L  ++     +++L C +HAGAK                     
Sbjct: 466 ELFEILLGSGYVSVEYLCPNLKYKGYDESLTCPFHAGAKGHSLEQCNSFRMKVQELLDSK 525

Query: 65  ----------------EEHVAGEFSKVSLERKPLTIFYKEKPNTTTSNPKPITIQVPSPF 124
                           E+    E S  +L+ K LTIFY EKPN    + KPITI VP+PF
Sbjct: 526 ILTVANSHQKKGINIVEDVSVAEGSSDALKPKCLTIFYSEKPNAPNCSRKPITITVPAPF 585

Query: 125 EYKSSKAVPWNYEYKVI----VESGPIPIDNINEIRGITRSGRCYMPEELLKYKGKS--- 179
           EYKSSKAVPW Y+ KV     V S P+PIDNI  + G+TR+GRCY P+ LLK   ++   
Sbjct: 586 EYKSSKAVPWKYQCKVTVGQDVSSPPLPIDNITGVGGLTRTGRCYTPDSLLKCVNETTSE 645

BLAST of Cla97C05G093810 vs. NCBI nr
Match: XP_022143495.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia])

HSP 1 Score: 120.6 bits (301), Expect = 5.6e-24
Identity = 81/246 (32.93%), Positives = 114/246 (46.34%), Query Frame = 0

Query: 4   EELFQVLFSNGLIKQENLRGDVPDTQLNDNLMCLYHAGAK-------------------- 63
           EELF++L  +G +  E L  ++     +++L C +HAGAK                    
Sbjct: 400 EELFEILLGSGYVSVEYLCPNLKYKGYDESLTCPFHAGAKGHALEQCNSFRMIVQELLDS 459

Query: 64  ----------------------EEHVAGEFSKVSLERKPLTIFYKEKPNTTTSNPKPITI 123
                                  E    E S  +L+ K LTIFY EKP+    + KPITI
Sbjct: 460 KILTVANSHQKKGINVVEDVSVAEGSIAEGSSDALKPKRLTIFYSEKPDAPNCSRKPITI 519

Query: 124 QVPSPFEYKSSKAVPWNYEYKVI----VESGPIPIDNINEIRGITRSGRCYMPEELLKYK 179
            VP+PFEYKSSKAVPW YE KV     V S P+P+DNI  + G+T +GRCY P+ LLK  
Sbjct: 520 TVPAPFEYKSSKAVPWKYECKVTVGQDVSSPPLPVDNITGVGGLTXTGRCYTPDSLLKRV 579

BLAST of Cla97C05G093810 vs. NCBI nr
Match: XP_022147189.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia])

HSP 1 Score: 119.4 bits (298), Expect = 1.2e-23
Identity = 82/241 (34.02%), Positives = 113/241 (46.89%), Query Frame = 0

Query: 5   ELFQVLFSNGLIKQENLRGDVPDTQ-LNDNLMCLYHAGAK-------------------- 64
           ELF++L  +G I  E L    P  +  +++L C +H GAK                    
Sbjct: 506 ELFEILLGSGYISVEYL---CPKYKGYDESLTCXFHXGAKGHSLEQCNXFRMKVQELLDS 565

Query: 65  -----------------EEHVAGEFSKVSLERKPLTIFYKEKPNTTTSNPKPITIQVPSP 124
                            E+ +  E S  SL+ KPLTIFY+EKP+  + + KP  I VP P
Sbjct: 566 KILTXANSHXKKXTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCSRKPXXITVPXP 625

Query: 125 FEYKSSKAVPWNYEYKVI----VESGPIPIDNINEIRGITRSGRCYMPEELLKYKGKSKI 179
           FEYKSSKAVPW YE KV     V S  +P+DNI  + G+TR+GRCY P+ LLK   ++  
Sbjct: 626 FEYKSSKAVPWKYECKVTVGQDVSSPSLPVDNITGVGGLTRTGRCYTPDSLLKRVNETTS 685

BLAST of Cla97C05G093810 vs. NCBI nr
Match: XP_022150030.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia])

HSP 1 Score: 107.5 bits (267), Expect = 4.9e-20
Identity = 53/96 (55.21%), Positives = 68/96 (70.83%), Query Frame = 0

Query: 44  EEHVAGEFSKVSLERKPLTIFYKEKPNTTTSNPKPITIQVPSPFEYKSSKAVPWNYEYKV 103
           E+ +  E S  S++ K LTIFY+EKP+  + + KPITI VP+PFEYKSSKAVPW YE KV
Sbjct: 27  EDILVAEGSSDSIKPKRLTIFYREKPDAPSCSRKPITITVPAPFEYKSSKAVPWKYECKV 86

Query: 104 I----VESGPIPIDNINEIRGITRSGRCYMPEELLK 136
                V S  +P+DNI  + G+TR+GRCY P+ LLK
Sbjct: 87  TVGQDVSSPSLPVDNITGVGGLTRTGRCYTPDSLLK 122

BLAST of Cla97C05G093810 vs. TrEMBL
Match: tr|A0A1S4E260|A0A1S4E260_CUCME (uncharacterized protein LOC107991629 OS=Cucumis melo OX=3656 GN=LOC107991629 PE=4 SV=1)

HSP 1 Score: 102.8 bits (255), Expect = 8.0e-19
Identity = 70/197 (35.53%), Positives = 107/197 (54.31%), Query Frame = 0

Query: 1   MSKEELFQVLFSNGLIKQENLRGDVPDTQLNDNLMCLYHAGAKEEHVAGEFSKVS----- 60
           M  EELF+ LF  G + Q+ L  ++     +++  C++H G    HV  +  K       
Sbjct: 270 MPMEELFKGLFEAGYVSQKYLDPNIKYEGYDESRHCIFHQGV-VGHVVQQCKKFRSKVQQ 329

Query: 61  -LERKPLTIFYKE-KPNTTTSNPKPITIQVPSPFEYKSSKAVPWNYEYKVIVESGPIPID 120
            ++ K LT FY+E +  +T  NPK +TI V SPF+ K  K VPW Y+ +VI      P+D
Sbjct: 330 LMDSKILTFFYQESRSESTFYNPKKLTIHVSSPFKCKDLKVVPWWYDCQVITG----PVD 389

Query: 121 NINEIRGITRSGRCYMPEEL--------LKYKGKSKINDAID-CK---VEEPMVVRSREA 179
           NI  I GITRSGRCY P+ L        L+   K++  +  + CK   VE P++ +  E 
Sbjct: 390 NIIGISGITRSGRCYKPDNLTVPSDGLILQQGSKNEKRNVKELCKDQDVEMPIIAKYIEY 449

BLAST of Cla97C05G093810 vs. TrEMBL
Match: tr|A0A061E378|A0A061E378_THECC (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_008095 PE=4 SV=1)

HSP 1 Score: 94.0 bits (232), Expect = 3.7e-16
Identity = 67/177 (37.85%), Positives = 94/177 (53.11%), Query Frame = 0

Query: 29   QLNDNLMCLYHAGAKEEHVA-------GEFSKVSL---ERKPLTIFYKEKPN-----TTT 88
            +L D+ +  ++ GA+E  V         E +  S    + KPLTIFY+E  +     + T
Sbjct: 1297 ELMDSSVIEFYEGAEENLVGTINRDTPAEVASSSFGANKPKPLTIFYEENKSPMNDTSPT 1356

Query: 89   SNPKPITIQVPSPFEYKSSKAVPWNYEYKVI--VESGP-IPIDNINEIRGITRSGRCYMP 148
             +   ITI+VPSPF YKS KAVPWNYE  ++  V S P    ++I  + GITRSGRCY P
Sbjct: 1357 MSRNGITIEVPSPFPYKSDKAVPWNYECNILGTVSSTPQASFEDITGVGGITRSGRCYSP 1416

Query: 149  EELLKY-KGK--------SKINDAIDCKVEEPMVVRSREAKEPASEDDIQEFLRLVK 179
            E   K  KGK         K +     +V+E +V  + E K P +E +  EFL+ +K
Sbjct: 1417 EAAEKVGKGKPAQGEGGLKKADTFSKNQVDESVVAPNNEVKNPVTEKEEGEFLKFIK 1473

BLAST of Cla97C05G093810 vs. TrEMBL
Match: tr|A0A061EXR3|A0A061EXR3_THECC (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_024883 PE=4 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 2.4e-15
Identity = 60/177 (33.90%), Positives = 89/177 (50.28%), Query Frame = 0

Query: 29   QLNDNLMCLYHAGAKEEHVAGEFSKVSLE----------RKPLTIFYKEKPN-----TTT 88
            +L D+ +  ++ GA+E  V   +     E           KPLTIFY+E  +     + T
Sbjct: 1223 ELMDSSIIEFYEGAEENLVGTIYGDTPAEVASSSFGANKPKPLTIFYEENKSPMNDTSPT 1282

Query: 89   SNPKPITIQVPSPFEYKSSKAVPWNYEYKVIVESGPIP---IDNINEIRGITRSGRCYMP 148
                 ITI+VPSPF YK+ KAVPWNYE  ++  +   P    ++I  + GITRSGRCY P
Sbjct: 1283 MIRNGITIEVPSPFPYKNDKAVPWNYECNILGTASSAPQASFEDITGVGGITRSGRCYSP 1342

Query: 149  EELLKYK---------GKSKINDAIDCKVEEPMVVRSREAKEPASEDDIQEFLRLVK 179
            E   + +         G  K +     +V+E +V  + E K P +E +  EFL+ +K
Sbjct: 1343 EVAERVEKGKPAQGEGGLKKADTFSKDQVDEFVVAPNNEVKSPVTEKEAGEFLKFIK 1399

BLAST of Cla97C05G093810 vs. TrEMBL
Match: tr|A0A061E2I6|A0A061E2I6_THECC (RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein OS=Theobroma cacao OX=3641 GN=TCM_007834 PE=4 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 2.4e-15
Identity = 67/207 (32.37%), Positives = 99/207 (47.83%), Query Frame = 0

Query: 1   MSKEELFQVLFSNGLIKQENLRGDVPDTQLNDNLMCLYHAGA---------KEEHVAGEF 60
           M  +++F+ L     I  E +  D  +   +    C +H GA            H   E 
Sbjct: 476 MPMDKVFEALSKINAITPEPI--DTKELGHDLTYSCKFHMGAIGHSIQNCDGFRHTPAEV 535

Query: 61  SKVSL---ERKPLTIFYKEKPN-----TTTSNPKPITIQVPSPFEYKSSKAVPWNYEYKV 120
           +  S    + KPLTIFY+E  +     + T     ITI+VPSPF YKS KAVPWNY+  +
Sbjct: 536 ASSSFGANKPKPLTIFYEENKSPMNDTSPTMIRNGITIEVPSPFPYKSDKAVPWNYQCNI 595

Query: 121 IVESGPIP---IDNINEIRGITRSGRCYMPEELLKY-KGK--------SKINDAIDCKVE 179
              +   P    +++  + GITRSGRCY PE + +  KGK         K +     +V+
Sbjct: 596 SGTASSAPQASFEDLTGVGGITRSGRCYSPEVVERVGKGKPAQEEGGLKKADTFSKDQVD 655

BLAST of Cla97C05G093810 vs. TrEMBL
Match: tr|A0A061DPM2|A0A061DPM2_THECC (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_003960 PE=4 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 1.6e-14
Identity = 64/177 (36.16%), Positives = 91/177 (51.41%), Query Frame = 0

Query: 29   QLNDNLMCLYHAGAKEEHVA-------GEFSKVSL---ERKPLTIFYKE--KPNTTTSNP 88
            +L D+ +  ++ GA+E  V         E +  S    + KPLTIFY+E   P   TS  
Sbjct: 1129 ELMDSSVIEFYEGAEENLVGTINGDTPAEMASSSFGANKPKPLTIFYEENRSPMNDTSPT 1188

Query: 89   ---KPITIQVPSPFEYKSSKAVPWNYEYKVIVESGPIPIDNINEIRG---ITRSGRCYMP 148
                 ITI+VPSPF YKS KAVPWNYE  ++  +   P  +  ++ G   ITRSGRCY P
Sbjct: 1189 MIRSGITIEVPSPFPYKSDKAVPWNYECNILGTASSAPQASFEDLTGVGDITRSGRCYSP 1248

Query: 149  EELLKY-KGK--------SKINDAIDCKVEEPMVVRSREAKEPASEDDIQEFLRLVK 179
            E   +  KGK         K +     +V+E +V  + E K P ++ +  EFL+ +K
Sbjct: 1249 EVAERVGKGKPAQGEGGLKKADTFSKDQVDESVVAPNNEVKNPVTKKEAGEFLKFIK 1305
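
The percentages in the HSP summary lines follow directly from the counts printed beside them. The sketch below recomputes them for two of the hits above; the helper name `percent` is illustrative and not part of any BLAST tool.

```python
def percent(count: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts from the XP_022155098.1 HSP: 82 identical and 122 positive
# columns out of an alignment length of 241.
print(percent(82, 241))   # 34.02
print(percent(122, 241))  # 50.62

# Counts from the XP_022150030.1 HSP: 53 identical columns out of 96.
print(percent(53, 96))    # 55.21
```

The denominator is the alignment length, gaps included, which is why the shorter XP_022150030.1 alignment shows a higher identity than the longer, gapped alignments.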

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_022155098.1  1.9e-27  34.02         LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica ch...
XP_022158986.1  1.3e-25  34.17         LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]
XP_022143495.1  5.6e-24  32.93         LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]
XP_022147189.1  1.2e-23  34.02         LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]
XP_022150030.1  4.9e-20  55.21         LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia]

Match Name                      E-value  Identity (%)  Description
tr|A0A1S4E260|A0A1S4E260_CUCME  8.0e-19  35.53         uncharacterized protein LOC107991629 OS=Cucumis melo OX=3656 GN=LOC107991629 PE=...
tr|A0A061E378|A0A061E378_THECC  3.7e-16  37.85         Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_008095 PE=4 SV=1
tr|A0A061EXR3|A0A061EXR3_THECC  2.4e-15  33.90         Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_024883 PE=4 SV=1
tr|A0A061E2I6|A0A061E2I6_THECC  2.4e-15  32.37         RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein...
tr|A0A061DPM2|A0A061DPM2_THECC  1.6e-14  36.16         Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_003960 PE=4 SV=1
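
The TrEMBL match names pack three fields into one pipe-delimited string: a database prefix, an accession, and an entry name. A minimal sketch for splitting them is shown below; the `MatchName` type and `parse_match_name` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class MatchName(NamedTuple):
    database: str    # "tr" for the TrEMBL hits listed here
    accession: str   # e.g. "A0A1S4E260"
    entry_name: str  # e.g. "A0A1S4E260_CUCME"


def parse_match_name(name: str) -> MatchName:
    """Split a 'db|accession|entry_name' string into its three fields."""
    parts = name.strip().split("|")
    if len(parts) != 3:
        raise ValueError(f"expected three '|'-separated fields, got: {name!r}")
    return MatchName(*parts)


hit = parse_match_name("tr|A0A1S4E260|A0A1S4E260_CUCME")
print(hit.accession)   # A0A1S4E260
print(hit.entry_name)  # A0A1S4E260_CUCME
```

The NCBI nr match names (for example XP_022155098.1) contain no pipes, so this parser deliberately rejects them rather than guessing at their structure.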
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0090502      RNA phosphodiester bond hydrolysis, endonucleolytic
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C05G093810.1  Cla97C05G093810.1  mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None