Cla97C04G070530 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C04G070530
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Reverse transcriptase domain-containing protein
Location: Cla97Chr04: 13109777 .. 13112247 (-)
RNA-Seq Expression: Cla97C04G070530
Synteny: Cla97C04G070530
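
The Location field in the overview packs the chromosome, start, end, and strand into one string. The sketch below splits such a string and derives the feature length. It assumes the coordinates are 1-based and inclusive (the GFF convention), which this page does not state; the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "Cla97Chr04: 13109777 .. 13112247 (-)".
LOCATION = re.compile(r"([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into its parts and compute the span length.

    Assumption: start and end are 1-based and inclusive, so the length is
    end - start + 1.
    """
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("Cla97Chr04: 13109777 .. 13112247 (-)"))
```

The strand symbol matters when extracting sequence from the chromosome: a feature on the minus strand is read as the reverse complement of the reference.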
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGGAAGAGAGAAGAGGGGAGAATCAAACGACCGAAAATAAAACTTGCCCGATTATTAGGGTAAGTTTTGAGGGGGTAAAACGGTAGGACAAACATGGGTAATAAATACCCACCCAACCCAACCCAACCCAACCCATACCCATTTATCAAACACCCCCTAAAATTGTTAGTGAATAAAAGCAAATTTGTTATCATCAAAACATCTTTATTATATAAAATTAATTAATTTAAAATTGAGGACAAAAAAGCCAACATAGTGATTTTTAAATTTTGGGAAATATCAAATTTTGCATAAGAATTAAGATTTGGAAAATACATAGAATCTCAATCTTAATCTTGCGTGAGATTTCACCAACAAAACCCAAATATATAGAACTTGGTGTTAATCCTACCCGAGACTAAATAGAAAAAAAGACCAAAATATTCGATAAGATTAAAAAAAGTTAATTTTTTAATCTAAATTTTGGAGACCGATCTTAGTCAAATCTCAGCCAAAACTGACCAAGAAACTATTTTAATAAATATTCTAAAAAGATGCTAATTTTAAAAGATGAAATTCTTCCGAGCTAATTCTAACAATTTTCCAAATTGATTCTTTAATTGGTAATTCTATCGTATAGTTTTATACTCAAGGCGATCAATCAATGCAACAAAGAGGTTTGATTTTATTATTACTTTTTGCTTTGACATGAATTTAATGCATCTCCAACCTTTTTCGTTGGTTTGTGGGTACTTCGTGTTCTGTTGGCTGGAACCAATTATTAAAACAATTTTGAATTACTACAAACATACTGCGGGATTTCAAATAAAACATTTTAAATTTACTCCTTTCAATGATCATTTGAATTTTGATTTTTAAAAATTAAGATTATTGACACTATATTGCCTATAAATTTTTCTTGCTTACAAACTACTTTTCCACCAATATTTTCAAAAATCACGCTAAATTTTGAGAACTAAAATACAATAATTTTCAAAAACTTAATTTTATTTTTAAAATTTAACTAAGAAATCAAGTGTTTCTCCCTCAAATGTGAAAACCATAGAATTTATGAGAAACTAGTTCAAATTTTAAAAACTAAGAATTAAAAACAAAATCATTACCAAACAAAACTTACTCTTCTTAATTTTGAAATGTGACTTAAATTTTTAAGAACGCCCTTAAAAAGTAGATATCACAAACAAAAAAATTAATAGGTCTACAGGACATAAGCTTAAATTTCAAAAGTCAAACCAAATGATTATCAACAAAACATCGTTTTTTAATTTAGCTTTAATTTTTTAATACATTCTCATAAATTACATAACAAAACAAAGAAATGGTAAAAATAATATTTATAAATTTAATTTTTAAAAGGAAAATACTAATCAAAGAAATTTGTCCACTCAGCTTCTGTGTAACAAAATAAGTTTCTTGGTCCGTAAACTTGGGAAGATGTTGCCAAAAAAAAAAAAATTAAAAATTAAAAATTAAAAATCTATGCATTGGAAAAAGCTAAAAAGAAAGTCACACTTTGAACCTGATGGGCTAAATGCTTTGTTCTCTCAACGTTATTGGGAGGTGGTGGGAACTGATGTTTTAAGCTTGTGTCTCAACATGCTGAGTGGGAACTGATGTTTCTAGCGCCTATCAGCGATGTTTAATACAAGACTGTGGCCAAAGTTTTTACTGATCGTTTATAAATGACAATGCACTCGTTAGCTTTGAATACAACCCTCAAACTAGACATGAGTTATGATCAAGTTGAATGGAGTTTCCTTTGACAGGCAATGACCAAAAAGGGTTTCTGTAGCCGCTAGATGCAAAAGATCGCAATGTGTTGAGATTGTTACATTCTCGATCTTTCTCAACGTGGGTAGTCACAGACACTTGTGATCATCGAGAGGATTGCATCAAGACCAGGAAATCCTCTTTCACCCTATCTGCTTCTCATATTCGAAAGGTCTTTCAACCCTTTTTCAAGTGGAGAGAAGAGGTTCTTTTCCAGGTTTGCGTA
TCAACCTATACAACCCTTCTATTTTTCACTCTTTTTTCTTTTTTTTTTTCTCTTTTTTTTTTTTTGGTTGTAGATGACAACTTGCGTTTCTTTTCGGCTCGAGAAAATGATTGACATGTGATTAAATAGGTGTTGAAGACATATGAGCATGCCTCTATGCAAGTGATCAACTATAGCAAATCCATGTTTATCAGCAAGAATACCCATTGGGAGGTGTTTAGTAAAATCCAAGAAGCCTTGGGGATTTGTTATTCTACCTCATTAGGTCAATACCTGGGACTACCTTCGCAAATCTCTAAGAGTAAATGTCAAGTCTTTAGTAATCTTAAGGATCAAATTTGGAAGCTTCTCCAAGGGTCGAAGGAAAAGCTCTTTCCGAGGGACGATAAAGAAGTGCTAATTAATGTCGTGACGCAAGCCATACCAATTTATACTATGAGTAGACTTCGTCTGCTTAGGGAGCCTTTGTGA

mRNA sequence

ATGAGGGAAGAGAGAAGAGGGGAGAATCAAACGACCGAAAATAAAACTTGCCCGATTATTAGGCCGCTAGATGCAAAAGATCGCAATGTGTTGAGATTGTTACATTCTCGATCTTTCTCAACGTGGGTAGTCACAGACACTTGTGATCATCGAGAGGATTGCATCAAGACCAGGAAATCCTCTTTCACCCTATCTGCTTCTCATATTCGAAAGGTCTTTCAACCCTTTTTCAAGTGGAGAGAAGAGGTTCTTTTCCAGGTGTTGAAGACATATGAGCATGCCTCTATGCAAGTGATCAACTATAGCAAATCCATGTTTATCAGCAAGAATACCCATTGGGAGGTGTTTAGTAAAATCCAAGAAGCCTTGGGGATTTGTTATTCTACCTCATTAGGTCAATACCTGGGACTACCTTCGCAAATCTCTAAGAGTAAATGTCAAGTCTTTAGTAATCTTAAGGATCAAATTTGGAAGCTTCTCCAAGGGTCGAAGGAAAAGCTCTTTCCGAGGGACGATAAAGAAGTGCTAATTAATGTCGTGACGCAAGCCATACCAATTTATACTATGAGTAGACTTCGTCTGCTTAGGGAGCCTTTGTGA

Coding sequence (CDS)

ATGAGGGAAGAGAGAAGAGGGGAGAATCAAACGACCGAAAATAAAACTTGCCCGATTATTAGGCCGCTAGATGCAAAAGATCGCAATGTGTTGAGATTGTTACATTCTCGATCTTTCTCAACGTGGGTAGTCACAGACACTTGTGATCATCGAGAGGATTGCATCAAGACCAGGAAATCCTCTTTCACCCTATCTGCTTCTCATATTCGAAAGGTCTTTCAACCCTTTTTCAAGTGGAGAGAAGAGGTTCTTTTCCAGGTGTTGAAGACATATGAGCATGCCTCTATGCAAGTGATCAACTATAGCAAATCCATGTTTATCAGCAAGAATACCCATTGGGAGGTGTTTAGTAAAATCCAAGAAGCCTTGGGGATTTGTTATTCTACCTCATTAGGTCAATACCTGGGACTACCTTCGCAAATCTCTAAGAGTAAATGTCAAGTCTTTAGTAATCTTAAGGATCAAATTTGGAAGCTTCTCCAAGGGTCGAAGGAAAAGCTCTTTCCGAGGGACGATAAAGAAGTGCTAATTAATGTCGTGACGCAAGCCATACCAATTTATACTATGAGTAGACTTCGTCTGCTTAGGGAGCCTTTGTGA

Protein sequence

MREERRGENQTTENKTCPIIRPLDAKDRNVLRLLHSRSFSTWVVTDTCDHREDCIKTRKSSFTLSASHIRKVFQPFFKWREEVLFQVLKTYEHASMQVINYSKSMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQISKSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRLLREPL
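
The protein sequence can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the page does not state which table was used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the CDS listed on this page.
    print(translate("ATGAGGGAAGAGAGAAGAGGGGAGAATCAA"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein is a quick consistency check on the annotation.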
Homology
BLAST of Cla97C04G070530 vs. NCBI nr
Match: XP_022145148.1 (uncharacterized protein LOC111014662 [Momordica charantia])

HSP 1 Score: 101.3 bits (251), Expect = 1.0e-17
Identity = 51/97 (52.58%), Positives = 69/97 (71.13%), Query Frame = 0

Query: 98  VINYSKSMF-ISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQISKSKCQVFSNLKDQI 157
           ++NY KSMF +S+NT   V + I+  L +  +  +GQYLGLPSQ S++KCQVF+N+ +++
Sbjct: 1   MVNYEKSMFMVSRNTSRCVAANIESELHVTRTNCMGQYLGLPSQTSRNKCQVFNNILNRV 60

Query: 158 WKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLR 194
           W+ LQG KEKLF    KEVLI  V QAIP Y+MS  R
Sbjct: 61  WQFLQGWKEKLFSSGGKEVLIKAVAQAIPNYSMSCFR 97
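
The percentages on each HSP statistics line are the matched-residue counts divided by the alignment length. The sketch below parses one such line and recomputes both values. The function name `recompute_hsp_percentages` is hypothetical, and the regular expression is written to tolerate spelling variants of the second label.

```python
import re

# Matches e.g. "Identity = 51/97 (52.58%), Positives = 69/97 (71.13%)".
# "Pos\w+" accepts spelling variants of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


if __name__ == "__main__":
    stats = "Identity = 51/97 (52.58%), Positives = 69/97 (71.13%), Query Frame = 0"
    print(recompute_hsp_percentages(stats))
```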

BLAST of Cla97C04G070530 vs. NCBI nr
Match: XP_030923017.1 (uncharacterized protein LOC115949892 [Quercus lobata])

HSP 1 Score: 100.5 bits (249), Expect = 1.7e-17
Identity = 54/109 (49.54%), Positives = 74/109 (67.89%), Query Frame = 0

Query: 87  VLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQISKSK 146
           +L  YEHAS Q IN  K ++F S NTH EV + I+  LG+  ++   QYLGLPS + +SK
Sbjct: 710 ILYRYEHASGQCINRGKTNLFFSSNTHPEVQAAIKAFLGLTVTSRFEQYLGLPSLVGRSK 769

Query: 147 CQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRL 195
            + FS +K++IWK L+G KE+L  +  +E+L+  V QAIPIYTMS  RL
Sbjct: 770 KKSFSLIKERIWKKLKGWKERLLSQAGREILVKAVIQAIPIYTMSCFRL 818

BLAST of Cla97C04G070530 vs. NCBI nr
Match: XP_030929772.1 (uncharacterized protein LOC115955672 [Quercus lobata])

HSP 1 Score: 98.2 bits (243), Expect = 8.4e-17
Identity = 53/109 (48.62%), Positives = 74/109 (67.89%), Query Frame = 0

Query: 87  VLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQISKSK 146
           +L  YEHAS Q IN  K ++F S NTH EV + I+  LG+  ++   QYLGLPS + +SK
Sbjct: 135 ILYRYEHASGQCINRGKTNLFFSSNTHPEVQAAIKAFLGLPVTSRFEQYLGLPSLVGRSK 194

Query: 147 CQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRL 195
            + FS +K++IWK L+G KE+L  +  +E+L+  V QAIPI+TMS  RL
Sbjct: 195 KKSFSLIKERIWKKLKGWKERLLSQAGREILVKAVIQAIPIFTMSCFRL 243

BLAST of Cla97C04G070530 vs. NCBI nr
Match: XP_030968750.1 (uncharacterized protein LOC115989220 [Quercus lobata])

HSP 1 Score: 93.6 bits (231), Expect = 2.1e-15
Identity = 54/108 (50.00%), Positives = 68/108 (62.96%), Query Frame = 0

Query: 84  LFQVLKTYEHASMQVINYSKS-MFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQIS 143
           L ++LK YE AS Q +N  KS +  S NT  E  S++ E LG    T  G+YLGLPS I 
Sbjct: 849 LVEILKLYEAASGQKVNADKSAVSFSHNTTPEARSEVLEILGPMQDTRQGKYLGLPSVIG 908

Query: 144 KSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMS 191
           KSK QVF+ +K+++ K L G KEK+     KE+LI  VTQ IP YTMS
Sbjct: 909 KSKNQVFAEIKERVGKKLSGWKEKMLSMGGKEILIKAVTQTIPTYTMS 956

BLAST of Cla97C04G070530 vs. NCBI nr
Match: XP_030936294.1 (uncharacterized protein LOC115961454 [Quercus lobata])

HSP 1 Score: 92.4 bits (228), Expect = 4.6e-15
Identity = 50/112 (44.64%), Positives = 72/112 (64.29%), Query Frame = 0

Query: 84  LFQVLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQIS 143
           L ++L TYE AS Q IN +K ++F SK+T ++V   I+EALG+       +YL LPS + 
Sbjct: 204 LLEILVTYERASGQQINRAKTTLFFSKSTSYDVQEVIKEALGVQVVQQYEKYLALPSLVG 263

Query: 144 KSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRL 195
           + K + F+NLK +IWK LQG + KL     +E+L+  V QA+P YTMS  +L
Sbjct: 264 RKKKESFANLKQRIWKKLQGWEAKLLSHVGREILLKAVAQALPTYTMSCFKL 315

BLAST of Cla97C04G070530 vs. ExPASy TrEMBL
Match: A0A6J1CV63 (uncharacterized protein LOC111014662 OS=Momordica charantia OX=3673 GN=LOC111014662 PE=4 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 4.8e-18
Identity = 51/97 (52.58%), Positives = 69/97 (71.13%), Query Frame = 0

Query: 98  VINYSKSMF-ISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQISKSKCQVFSNLKDQI 157
           ++NY KSMF +S+NT   V + I+  L +  +  +GQYLGLPSQ S++KCQVF+N+ +++
Sbjct: 1   MVNYEKSMFMVSRNTSRCVAANIESELHVTRTNCMGQYLGLPSQTSRNKCQVFNNILNRV 60

Query: 158 WKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLR 194
           W+ LQG KEKLF    KEVLI  V QAIP Y+MS  R
Sbjct: 61  WQFLQGWKEKLFSSGGKEVLIKAVAQAIPNYSMSCFR 97

BLAST of Cla97C04G070530 vs. ExPASy TrEMBL
Match: A0A2N9J936 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS60851 PE=4 SV=1)

HSP 1 Score: 94.0 bits (232), Expect = 7.7e-16
Identity = 52/112 (46.43%), Positives = 70/112 (62.50%), Query Frame = 0

Query: 84   LFQVLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQIS 143
            L+ +LK YE AS Q IN  K ++F SKNT   + + I    G   S+   +YLGLP  + 
Sbjct: 1611 LYAILKLYERASGQKINEEKTAIFFSKNTPNSIRANILSMFGTSSSSKFEKYLGLPPILG 1670

Query: 144  KSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRL 195
            +SK + F+ +KD+IWK LQG KEKL  +  +E+LI  V QAIPIY MS  +L
Sbjct: 1671 RSKKRAFNEIKDRIWKKLQGWKEKLLSQAGREILIKAVVQAIPIYAMSCFKL 1722

BLAST of Cla97C04G070530 vs. ExPASy TrEMBL
Match: A0A2N9I946 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48253 PE=4 SV=1)

HSP 1 Score: 94.0 bits (232), Expect = 7.7e-16
Identity = 52/112 (46.43%), Positives = 70/112 (62.50%), Query Frame = 0

Query: 84   LFQVLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQIS 143
            L+ +LK YE AS Q IN  K ++F SKNT   + + I    G   S+   +YLGLP  + 
Sbjct: 1396 LYAILKLYERASGQKINEEKTAIFFSKNTPNSIRANILSMFGTSSSSKFEKYLGLPPILG 1455

Query: 144  KSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRL 195
            +SK + F+ +KD+IWK LQG KEKL  +  +E+LI  V QAIPIY MS  +L
Sbjct: 1456 RSKKRAFNEIKDRIWKKLQGWKEKLLSQAGREILIKAVVQAIPIYAMSCFKL 1507

BLAST of Cla97C04G070530 vs. ExPASy TrEMBL
Match: A0A2N9G5C3 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS22630 PE=4 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 1.7e-15
Identity = 54/113 (47.79%), Positives = 70/113 (61.95%), Query Frame = 0

Query: 83   VLFQVLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQI 142
            VL  +LK YE AS Q IN  K + F SKNT   V S+I    G   ++   +YLGLPS +
Sbjct: 1088 VLQTILKLYERASGQKINEEKTAFFFSKNTPVAVRSEILSMFGSSPASQFEKYLGLPSIL 1147

Query: 143  SKSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMSRLRL 195
             +SK + F+ +KD+IWK LQG KE L  +  +E+LI  V QAIPIY MS  +L
Sbjct: 1148 GRSKKRAFNEIKDRIWKRLQGWKENLLSQAGREILIKAVVQAIPIYAMSCFKL 1200

BLAST of Cla97C04G070530 vs. ExPASy TrEMBL
Match: A0A2N9GHD2 (CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26712 PE=4 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 2.2e-15
Identity = 51/110 (46.36%), Positives = 67/110 (60.91%), Query Frame = 0

Query: 82  EVLFQVLKTYEHASMQVINYSK-SMFISKNTHWEVFSKIQEALGICYSTSLGQYLGLPSQ 141
           E L  +L+TYEHAS Q IN  K ++F S NT  +    I    G   +T   +YLGLP  
Sbjct: 809 EALLALLQTYEHASGQKINCGKTALFFSHNTQPDCRQIILNLFGTSATTQFEKYLGLPPV 868

Query: 142 ISKSKCQVFSNLKDQIWKLLQGSKEKLFPRDDKEVLINVVTQAIPIYTMS 191
           I KSK   F+++KD++W+ LQG KEK+  +  +EVLI  V QAIP Y MS
Sbjct: 869 IGKSKKNAFNDIKDRVWRRLQGWKEKMLSQAGREVLIKAVIQAIPTYAMS 918

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_022145148.1  1.0e-17  52.58         uncharacterized protein LOC111014662 [Momordica charantia]
XP_030923017.1  1.7e-17  49.54         uncharacterized protein LOC115949892 [Quercus lobata]
XP_030929772.1  8.4e-17  48.62         uncharacterized protein LOC115955672 [Quercus lobata]
XP_030968750.1  2.1e-15  50.00         uncharacterized protein LOC115989220 [Quercus lobata]
XP_030936294.1  4.6e-15  44.64         uncharacterized protein LOC115961454 [Quercus lobata]

Match Name      E-value  Identity (%)  Description
A0A6J1CV63      4.8e-18  52.58         uncharacterized protein LOC111014662 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A2N9J936      7.7e-16  46.43         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS60851 PE=4 SV=1
A0A2N9I946      7.7e-16  46.43         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48253 PE=4 SV=1
A0A2N9G5C3      1.7e-15  47.79         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS22630 PE=4 SV=1
A0A2N9GHD2      2.2e-15  46.36         CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2671...
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source   Source Term     Source Description                                                            Alignment
None      No IPR available  PANTHER  PTHR33116:SF45  SUBFAMILY NOT NAMED                                                           coord: 83..191
None      No IPR available  PANTHER  PTHR33116       REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATED  coord: 83..191

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C04G070530.1  Cla97C04G070530.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003824      catalytic activity