Cla97C06G118620 (gene) Watermelon (97103) v2

NameCla97C06G118620
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionReverse transcriptase
LocationCla97Chr06 : 12775852 .. 12776310 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAACCCCACTGCGCATGGATTATTGACTACTTGGATAGTTTTGGTAAGATTCCAAATGATCCTTCCAGTTCTCAGCTTGCTTTCCACGTAGAGTATGATCTATTGGATAACAATCTTAAAATTATTCATTGTGATGCTGCTTTTAAGCCTAGTGAGCACATTGTTGGAATTGGTGGAGTTGTTGGATCGTCTTTTGGGCAGCTTCTAGCCTCTTTGTGTGAGTTTAGAACCTTTTCTGGCAACATGCTATGTGTTGAAGCTTTTACTGTTCTTGAGGGGTTTTGTTTGGCTGAAAGAATGGGTCTTTCTCATTTGTTAATTCTGTCTGATTCGTTAATGTTGATTCAGATTCTTATGGGCCTTTATGCCTCTCAGGTTGGCATGGCTAATTTTTTTGCTGATATTAAACAGCTTATAGCCTGTTTTGTTTCTGTTTCCTTTCAACATGTGCCTTGA

mRNA sequence

ATGAAACCCCACTGCGCATGGATTATTGACTACTTGGATAGTTTTGGTAAGATTCCAAATGATCCTTCCAGTTCTCAGCTTGCTTTCCACGTAGAGTATGATCTATTGGATAACAATCTTAAAATTATTCATTGTGATGCTGCTTTTAAGCCTAGTGAGCACATTGTTGGAATTGGTGGAGTTGTTGGATCGTCTTTTGGGCAGCTTCTAGCCTCTTTGTGTGAGTTTAGAACCTTTTCTGGCAACATGCTATGTGTTGAAGCTTTTACTGTTCTTGAGGGGTTTTGTTTGGCTGAAAGAATGGGTCTTTCTCATTTGTTAATTCTGTCTGATTCGTTAATGTTGATTCAGATTCTTATGGGCCTTTATGCCTCTCAGGTTGGCATGGCTAATTTTTTTGCTGATATTAAACAGCTTATAGCCTGTTTTGTTTCTGTTTCCTTTCAACATGTGCCTTGA

Coding sequence (CDS)

ATGAAACCCCACTGCGCATGGATTATTGACTACTTGGATAGTTTTGGTAAGATTCCAAATGATCCTTCCAGTTCTCAGCTTGCTTTCCACGTAGAGTATGATCTATTGGATAACAATCTTAAAATTATTCATTGTGATGCTGCTTTTAAGCCTAGTGAGCACATTGTTGGAATTGGTGGAGTTGTTGGATCGTCTTTTGGGCAGCTTCTAGCCTCTTTGTGTGAGTTTAGAACCTTTTCTGGCAACATGCTATGTGTTGAAGCTTTTACTGTTCTTGAGGGGTTTTGTTTGGCTGAAAGAATGGGTCTTTCTCATTTGTTAATTCTGTCTGATTCGTTAATGTTGATTCAGATTCTTATGGGCCTTTATGCCTCTCAGGTTGGCATGGCTAATTTTTTTGCTGATATTAAACAGCTTATAGCCTGTTTTGTTTCTGTTTCCTTTCAACATGTGCCTTGA

Protein sequence

MKPHCAWIIDYLDSFGKIPNDPSSSQLAFHVEYDLLDNNLKIIHCDAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMGLSHLLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHVP
BLAST of Cla97C06G118620 vs. NCBI nr
Match: POE96822.1 (hypothetical protein CFP56_48987 [Quercus suber])

HSP 1 Score: 54.7 bits (130), Expect = 3.2e-04
Identity = 33/106 (31.13%), Postives = 49/106 (46.23%), Query Frame = 0

Query: 46  DAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMGLSH 105
           D A+   E   GIG VV +  GQ++ASL E      ++  +EA          E +GL  
Sbjct: 670 DGAYFEEEEAAGIGVVVRNEMGQVMASLAEKIIMPSSVEILEAIAARRAMIFMEELGLRR 729

Query: 106 LLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
            +   DS  +++ L G    +  + +   D K L+ CF S SF HV
Sbjct: 730 AIFEGDSETVVKALSGDCPDRSSIGHIVKDCKSLMGCFQSCSFSHV 775

BLAST of Cla97C06G118620 vs. NCBI nr
Match: POF06673.1 (hypothetical protein CFP56_29261 [Quercus suber])

HSP 1 Score: 54.7 bits (130), Expect = 3.2e-04
Identity = 33/106 (31.13%), Postives = 49/106 (46.23%), Query Frame = 0

Query: 46  DAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMGLSH 105
           D A+   E   GIG VV +  GQ++ASL E      ++  +EA          E +GL  
Sbjct: 260 DGAYFEEEEAAGIGVVVRNEMGQVMASLAEKIIMPSSVEILEAIAARRAMIFMEELGLRR 319

Query: 106 LLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
            +   DS  +++ L G    +  + +   D K L+ CF S SF HV
Sbjct: 320 AIFEGDSETVVKALSGDCPDRSSIGHIVKDCKSLMGCFQSCSFSHV 365

BLAST of Cla97C06G118620 vs. NCBI nr
Match: XP_018821989.1 (PREDICTED: uncharacterized protein LOC108992010 [Juglans regia])

HSP 1 Score: 53.5 bits (127), Expect = 7.2e-04
Identity = 32/109 (29.36%), Postives = 54/109 (49.54%), Query Frame = 0

Query: 43  IHCDAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMG 102
           ++ D AF     + GIG V+  S G++L +       S N+  +E   +L G  L   +G
Sbjct: 624 LNVDGAFSQDGSVTGIGAVLRDSKGEVLMAAAIRERASLNVYELEGLAILRGLQLCLHLG 683

Query: 103 LSHLLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
           + HL I SDSL+++        S   M N  +++++L+ CF +    HV
Sbjct: 684 IYHLSIESDSLLVVNEFDRNGQSMATMGNVISEVRKLMFCFQTCELTHV 732

BLAST of Cla97C06G118620 vs. TrEMBL
Match: tr|A0A0A9VCQ3|A0A0A9VCQ3_ARUDO (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 1.6e-04
Identity = 32/115 (27.83%), Postives = 59/115 (51.30%), Query Frame = 0

Query: 37  DNNLKIIHCDAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFC 96
           +N +  ++ DAAF P       G V+  S+G    +   + +   ++L  EA  V +G  
Sbjct: 210 ENGIMKVNTDAAFYPDNMNGSSGVVIRDSYGNFKQAAAHWYSNLADVLTAEALAVRDGLI 269

Query: 97  LAERMGLSHLLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
           LA   G  H+ + SD+  +++++     +   +A+ + D+K+L   F+SVSF HV
Sbjct: 270 LAAEAGNRHVTVESDNSSVVKLMKTGEGALSSIASIWHDVKELSRKFISVSFNHV 324

BLAST of Cla97C06G118620 vs. TrEMBL
Match: tr|A0A2P4LN13|A0A2P4LN13_QUESU (Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_29261 PE=4 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 2.1e-04
Identity = 33/106 (31.13%), Postives = 49/106 (46.23%), Query Frame = 0

Query: 46  DAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMGLSH 105
           D A+   E   GIG VV +  GQ++ASL E      ++  +EA          E +GL  
Sbjct: 260 DGAYFEEEEAAGIGVVVRNEMGQVMASLAEKIIMPSSVEILEAIAARRAMIFMEELGLRR 319

Query: 106 LLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
            +   DS  +++ L G    +  + +   D K L+ CF S SF HV
Sbjct: 320 AIFEGDSETVVKALSGDCPDRSSIGHIVKDCKSLMGCFQSCSFSHV 365

BLAST of Cla97C06G118620 vs. TrEMBL
Match: tr|A0A2P4KV06|A0A2P4KV06_QUESU (Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_48987 PE=4 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 2.1e-04
Identity = 33/106 (31.13%), Postives = 49/106 (46.23%), Query Frame = 0

Query: 46  DAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMGLSH 105
           D A+   E   GIG VV +  GQ++ASL E      ++  +EA          E +GL  
Sbjct: 670 DGAYFEEEEAAGIGVVVRNEMGQVMASLAEKIIMPSSVEILEAIAARRAMIFMEELGLRR 729

Query: 106 LLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
            +   DS  +++ L G    +  + +   D K L+ CF S SF HV
Sbjct: 730 AIFEGDSETVVKALSGDCPDRSSIGHIVKDCKSLMGCFQSCSFSHV 775

BLAST of Cla97C06G118620 vs. TrEMBL
Match: tr|A0A2I4ERH4|A0A2I4ERH4_9ROSI (uncharacterized protein LOC108992010 OS=Juglans regia OX=51240 GN=LOC108992010 PE=4 SV=1)

HSP 1 Score: 53.5 bits (127), Expect = 4.7e-04
Identity = 32/109 (29.36%), Postives = 54/109 (49.54%), Query Frame = 0

Query: 43  IHCDAAFKPSEHIVGIGGVVGSSFGQLLASLCEFRTFSGNMLCVEAFTVLEGFCLAERMG 102
           ++ D AF     + GIG V+  S G++L +       S N+  +E   +L G  L   +G
Sbjct: 624 LNVDGAFSQDGSVTGIGAVLRDSKGEVLMAAAIRERASLNVYELEGLAILRGLQLCLHLG 683

Query: 103 LSHLLILSDSLMLIQILMGLYASQVGMANFFADIKQLIACFVSVSFQHV 152
           + HL I SDSL+++        S   M N  +++++L+ CF +    HV
Sbjct: 684 IYHLSIESDSLLVVNEFDRNGQSMATMGNVISEVRKLMFCFQTCELTHV 732

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POE96822.13.2e-0431.13hypothetical protein CFP56_48987 [Quercus suber][more]
POF06673.13.2e-0431.13hypothetical protein CFP56_29261 [Quercus suber][more]
XP_018821989.17.2e-0429.36PREDICTED: uncharacterized protein LOC108992010 [Juglans regia][more]
Match NameE-valueIdentityDescription
tr|A0A0A9VCQ3|A0A0A9VCQ3_ARUDO1.6e-0427.83Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1[more]
tr|A0A2P4LN13|A0A2P4LN13_QUESU2.1e-0431.13Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_29261 PE=4 SV=1[more]
tr|A0A2P4KV06|A0A2P4KV06_QUESU2.1e-0431.13Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_48987 PE=4 SV=1[more]
tr|A0A2I4ERH4|A0A2I4ERH4_9ROSI4.7e-0429.36uncharacterized protein LOC108992010 OS=Juglans regia OX=51240 GN=LOC108992010 P... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
Vocabulary: INTERPRO
TermDefinition
IPR012337RNaseH-like_sf
IPR036397RNaseH_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C06G118620.1Cla97C06G118620.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF13456RVT_3coord: 45..152
e-value: 9.5E-12
score: 44.7
IPR036397Ribonuclease H superfamilyGENE3DG3DSA:3.30.420.10coord: 41..152
e-value: 6.6E-9
score: 37.9
IPR012337Ribonuclease H-like superfamilySUPERFAMILYSSF53098Ribonuclease H-likecoord: 42..152

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C06G118620Cla004100Watermelon (97103) v1wmwmbB406
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C06G118620Cucumber (Chinese Long) v3cucwmbB567
Cla97C06G118620Watermelon (97103) v2wmbwmbB164
Cla97C06G118620Watermelon (97103) v2wmbwmbB074
Cla97C06G118620Silver-seed gourdcarwmbB0034
Cla97C06G118620Silver-seed gourdcarwmbB0134
Cla97C06G118620Silver-seed gourdcarwmbB0596
Cla97C06G118620Silver-seed gourdcarwmbB0780
Cla97C06G118620Silver-seed gourdcarwmbB0939
Cla97C06G118620Silver-seed gourdcarwmbB1087
Cla97C06G118620Cucumber (Gy14) v2cgybwmbB079
Cla97C06G118620Cucumber (Gy14) v2cgybwmbB143
Cla97C06G118620Cucumber (Gy14) v2cgybwmbB517
Cla97C06G118620Cucumber (Gy14) v1cgywmbB169
Cla97C06G118620Cucumber (Gy14) v1cgywmbB576
Cla97C06G118620Cucumber (Gy14) v1cgywmbB615
Cla97C06G118620Cucurbita maxima (Rimu)cmawmbB079
Cla97C06G118620Cucurbita maxima (Rimu)cmawmbB132
Cla97C06G118620Cucurbita maxima (Rimu)cmawmbB187
Cla97C06G118620Cucurbita maxima (Rimu)cmawmbB379
Cla97C06G118620Cucurbita maxima (Rimu)cmawmbB824
Cla97C06G118620Cucurbita moschata (Rifu)cmowmbB068
Cla97C06G118620Cucurbita moschata (Rifu)cmowmbB117
Cla97C06G118620Cucurbita moschata (Rifu)cmowmbB171
Cla97C06G118620Cucurbita moschata (Rifu)cmowmbB796
Cla97C06G118620Cucurbita moschata (Rifu)cmowmbB895
Cla97C06G118620Wild cucumber (PI 183967)cpiwmbB083
Cla97C06G118620Wild cucumber (PI 183967)cpiwmbB151
Cla97C06G118620Wild cucumber (PI 183967)cpiwmbB571
Cla97C06G118620Cucumber (Chinese Long) v3cucwmbB079
Cla97C06G118620Cucumber (Chinese Long) v3cucwmbB150
Cla97C06G118620Cucumber (Chinese Long) v2cuwmbB082
Cla97C06G118620Cucumber (Chinese Long) v2cuwmbB149
Cla97C06G118620Cucumber (Chinese Long) v2cuwmbB544
Cla97C06G118620Bottle gourd (USVL1VR-Ls)lsiwmbB096
Cla97C06G118620Bottle gourd (USVL1VR-Ls)lsiwmbB378
Cla97C06G118620Melon (DHL92) v3.6.1medwmbB183
Cla97C06G118620Melon (DHL92) v3.6.1medwmbB248
Cla97C06G118620Melon (DHL92) v3.6.1medwmbB249
Cla97C06G118620Melon (DHL92) v3.6.1medwmbB381
Cla97C06G118620Melon (DHL92) v3.5.1mewmbB197
Cla97C06G118620Melon (DHL92) v3.5.1mewmbB262
Cla97C06G118620Melon (DHL92) v3.5.1mewmbB263
Cla97C06G118620Melon (DHL92) v3.5.1mewmbB391
Cla97C06G118620Watermelon (Charleston Gray)wcgwmbB028
Cla97C06G118620Watermelon (Charleston Gray)wcgwmbB077
Cla97C06G118620Watermelon (Charleston Gray)wcgwmbB273
Cla97C06G118620Watermelon (97103) v1wmwmbB034
Cla97C06G118620Watermelon (97103) v1wmwmbB374
Cla97C06G118620Wax gourdwgowmbB140
Cla97C06G118620Wax gourdwgowmbB589
Cla97C06G118620Wax gourdwgowmbB651
Cla97C06G118620Wax gourdwgowmbB659