CaUC11G210560 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC11G210560
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Reverse transcriptase
Location: Ciama_Chr11: 22275654 .. 22276022 (-)
RNA-Seq Expression: CaUC11G210560
Synteny: CaUC11G210560
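
As a quick consistency check on the Location field, the span length can be worked out from the coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly); the variable names are illustrative only.

```python
# Coordinates copied from the Location field.
# Assumption: 1-based, inclusive (GFF-style) coordinates.
start, end = 22275654, 22276022

length = end - start + 1                 # inclusive span length in bp
codons, remainder = divmod(length, 3)    # whole codons and leftover bases

print(f"span length: {length} bp")
print(f"codons: {codons}, leftover bases: {remainder}")
```

Under that assumption the span is 369 bp, which is 123 codons with no leftover bases: 122 residues plus one stop codon, in agreement with the 122-residue protein sequence listed on this page.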
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGGGACCCTTTATCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAATGGTCAACGATTGAAGCACTACATCGGTGATGAAGAACGCGGATTTGAGAACCTGGCTTTCATTGCATGA
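
The feature is annotated on the minus strand, yet the sequence above begins with ATG and ends with TGA, so it appears to be shown in the gene's own 5'-to-3' orientation. Assuming that is the case, the corresponding forward-strand reference text would be its reverse complement. The helper below is a hypothetical illustration, not part of the database's tooling.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

# Last nine bases of the gene sequence shown above (ends with the TGA stop).
gene_tail = "ATTGCATGA"

# Assumption: on the forward strand, the reference span would start with
# the reverse complement of the gene's 3' end.
print(reverse_complement(gene_tail))
```

Applying the function twice returns the original string, which is a convenient sanity check when moving between strand orientations.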

mRNA sequence

ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGGGACCCTTTATCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAATGGTCAACGATTGAAGCACTACATCGGTGATGAAGAACGCGGATTTGAGAACCTGGCTTTCATTGCATGA

Coding sequence (CDS)

ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGGGACCCTTTATCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAATGGTCAACGATTGAAGCACTACATCGGTGATGAAGAACGCGGATTTGAGAACCTGGCTTTCATTGCATGA

Protein sequence

MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEERGFENLAFIA
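
The protein sequence can be reproduced from the coding sequence by a straightforward translation. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not say which table the annotation pipeline used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Coding sequence (CDS) copied from this page.
CDS = (
    "ATGGATTTCGAGAAAGCCGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTC"
    "CGTGCTCAAGCTTATGAGAATGCCAAACTTTACAAAGAGCGCACTGCGAGATGGCATGAC"
    "AAGAAGATCACCCCACGGACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGT"
    "TTACGCTTGTTTCCAGGAAAGCTTAGGACACGATGGTCGGGACCCTTTATCATTGTCAAG"
    "GTATCCCCACACGGAGCCGTGGAACTACAAGGCAACGATGGAACAACCTTCAAAGTGAAT"
    "GGTCAACGATTGAAGCACTACATCGGTGATGAAGAACGCGGATTTGAGAACCTGGCTTTC"
    "ATTGCATGA"
)

# Protein sequence copied from this page.
PROTEIN = (
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR"
    "LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEERGFENLAF"
    "IA"
)

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    usable = len(cds) - len(cds) % 3      # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

translated = translate(CDS)
print(translated.rstrip("*") == PROTEIN)
```

With the standard table, the translation matches the listed protein and ends in a single stop codon (TGA).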
Homology
BLAST of CaUC11G210560 vs. NCBI nr
Match: WP_217833156.1 (hypothetical protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 231.1 bits (588), Expect = 5.1e-57
Identity = 113/122 (92.62%), Positives = 116/122 (95.08%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           MDFEKAGEKRLLELNEMEEFRAQAYENAKLYK+RTARWHDKKITP TFLP QR+LLFNSR
Sbjct: 473 MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKKRTARWHDKKITPGTFLPRQRLLLFNSR 532

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEERGFENLAF 120
           LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGN+GTTFKVNG RLKHYIGDEER  ENLAF
Sbjct: 533 LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNNGTTFKVNGLRLKHYIGDEERVLENLAF 592

Query: 121 IA 122
            A
Sbjct: 593 TA 594
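
The identity figure reported for this HSP can be recomputed directly from the aligned query and subject rows. This is a minimal sketch that assumes an ungapped alignment (true for this HSP, where neither row contains "-"); the Positives count is not reproduced because it depends on the scoring matrix, which is not stated on this page.

```python
# Query and subject rows of the HSP above, concatenated across the three blocks.
QUERY = (
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR"
    "LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEERGFENLAF"
    "IA"
)
SBJCT = (
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKKRTARWHDKKITPGTFLPRQRLLLFNSR"
    "LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNNGTTFKVNGLRLKHYIGDEERVLENLAF"
    "TA"
)

# Assumption: ungapped alignment, so columns pair up one to one.
identities = sum(q == s for q, s in zip(QUERY, SBJCT))
percent = 100 * identities / len(QUERY)

print(f"Identity = {identities}/{len(QUERY)} ({percent:.2f}%)")
```

The result, 113 identical positions out of 122 (92.62%), agrees with the Identity field of the HSP header.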

BLAST of CaUC11G210560 vs. NCBI nr
Match: XP_038885822.1 (uncharacterized protein LOC120076116 [Benincasa hispida])

HSP 1 Score: 184.9 bits (468), Expect = 4.2e-43
Identity = 90/117 (76.92%), Positives = 96/117 (82.05%), Query Frame = 0

Query: 4   EKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRLRL 63
           +K GEKRLLEL E+EEF  QAYENAKLYKER ARWHDKKI   TF  GQ VLLFNSRLRL
Sbjct: 46  QKVGEKRLLELAELEEFHDQAYENAKLYKERIARWHDKKIIHHTFDLGQSVLLFNSRLRL 105

Query: 64  FPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEERGFENLAF 120
           FP KLRTRW GPF++VK SPHGAVE+QG DG  FKVNGQRL+HY GDEER  ENL F
Sbjct: 106 FPSKLRTRWLGPFVVVKDSPHGAVEVQGEDGLRFKVNGQRLEHYCGDEERKLENLVF 162

BLAST of CaUC11G210560 vs. NCBI nr
Match: XP_038885946.1 (uncharacterized protein LOC120076251 [Benincasa hispida])

HSP 1 Score: 176.4 bits (446), Expect = 1.5e-40
Identity = 82/111 (73.87%), Positives = 94/111 (84.68%), Query Frame = 0

Query: 2   DFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRL 61
           D + AG+ RLL+LNEMEEF+ QAYEN+K+YKERT +WHD  I PR FLPGQRVLLFNSRL
Sbjct: 40  DLQIAGKHRLLQLNEMEEFQNQAYENSKIYKERTTKWHDNWIVPRAFLPGQRVLLFNSRL 99

Query: 62  RLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEE 112
           RLFPGKL++RW GPF+I  V+P+ AVEL G DGTTFKVN QRLKHY GDEE
Sbjct: 100 RLFPGKLKSRWFGPFVIKDVTPYDAVELHGKDGTTFKVNAQRLKHYCGDEE 150

BLAST of CaUC11G210560 vs. NCBI nr
Match: XP_030498073.1 (uncharacterized protein LOC115713732 [Cannabis sativa])

HSP 1 Score: 164.9 bits (416), Expect = 4.5e-37
Identity = 75/111 (67.57%), Positives = 95/111 (85.59%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           M+ + AGEKRLL+LNE+EEFR +AYENAK+YKERT +WHD+ +  + F PGQ+VLLFNSR
Sbjct: 427 MELKAAGEKRLLQLNELEEFRNEAYENAKIYKERTKKWHDQGLVRKEFQPGQQVLLFNSR 486

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDE 111
           L+LFPGKL++RWSGPF +VKV P+GAVEL+G+   TFKVNGQRLK Y+G +
Sbjct: 487 LKLFPGKLKSRWSGPFTVVKVFPYGAVELKGDGPATFKVNGQRLKFYLGGQ 537

BLAST of CaUC11G210560 vs. NCBI nr
Match: XP_030502743.1 (uncharacterized protein LOC115717916 [Cannabis sativa])

HSP 1 Score: 164.5 bits (415), Expect = 5.9e-37
Identity = 75/109 (68.81%), Positives = 93/109 (85.32%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           MD + AG+KRLL+L+E+EEFR +AYENAK+YKERT RWHD+ +  + F PGQ+VLLFNSR
Sbjct: 1   MDLKAAGQKRLLQLDELEEFRNEAYENAKIYKERTKRWHDRNLVRKEFQPGQQVLLFNSR 60

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIG 109
           L+LFPGKL++RWSGPF +VKV P+GAVEL+G    TFKVNGQRLK Y+G
Sbjct: 61  LKLFPGKLKSRWSGPFTVVKVFPYGAVELKGEGPNTFKVNGQRLKLYLG 109

BLAST of CaUC11G210560 vs. ExPASy TrEMBL
Match: A0A6P4D1N6 (uncharacterized protein LOC107484509 OS=Arachis duranensis OX=130453 GN=LOC107484509 PE=4 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.1e-36
Identity = 75/112 (66.96%), Positives = 91/112 (81.25%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           +D + AGEKRLL+LNE+EEFR +AYENA++YKER  RWHDK+I+ RTF PGQRVLLFNSR
Sbjct: 126 LDAQAAGEKRLLQLNELEEFRLEAYENARIYKERAKRWHDKRISQRTFEPGQRVLLFNSR 185

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGN-DGTTFKVNGQRLKHYIGDE 111
           L++FPGKLR+RW+GP+ I+KVSPHG VEL       TF  NG R+KHY G E
Sbjct: 186 LKIFPGKLRSRWTGPYTIIKVSPHGYVELLDEASKQTFTANGHRVKHYFGGE 237

BLAST of CaUC11G210560 vs. ExPASy TrEMBL
Match: A0A2G9H400 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_15163 PE=4 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 4.1e-36
Identity = 77/109 (70.64%), Positives = 89/109 (81.65%), Query Frame = 0

Query: 2    DFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSRL 61
            D + AGEKRLL+LNE++EFR QAYENAK+YKE+T RWHDKKI  R F PGQ VLLFNSRL
Sbjct: 1282 DMQVAGEKRLLQLNELDEFRLQAYENAKIYKEKTKRWHDKKIVERRFEPGQYVLLFNSRL 1341

Query: 62   RLFPGKLRTRWSGPFIIVKVSPHGAVELQG-NDGTTFKVNGQRLKHYIG 109
            +LFPGKL++RWSGPF I +V PHGAVEL+  N    FKVN QR+KHY G
Sbjct: 1342 KLFPGKLKSRWSGPFRITEVFPHGAVELENKNSRNRFKVNAQRIKHYRG 1390

BLAST of CaUC11G210560 vs. ExPASy TrEMBL
Match: A0A1S4DFS8 (uncharacterized protein LOC107829365 OS=Nicotiana tabacum OX=4097 GN=LOC107829365 PE=4 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 7.0e-36
Identity = 77/112 (68.75%), Positives = 90/112 (80.36%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           MD + AGEKRLL+LNE++EFR  AYENAKLYK +T RWHDK I  R F PGQ VLLFNSR
Sbjct: 396 MDGDLAGEKRLLQLNELDEFRLHAYENAKLYKVKTKRWHDKHIQHREFEPGQEVLLFNSR 455

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGT-TFKVNGQRLKHYIGDE 111
           L+LFPGKL++RWSGPF++V V PHGAVEL+    T TF VNGQR+KHY G +
Sbjct: 456 LKLFPGKLKSRWSGPFVVVSVKPHGAVELRDMSSTGTFLVNGQRIKHYWGGD 507

BLAST of CaUC11G210560 vs. ExPASy TrEMBL
Match: A0A1U7YP28 (uncharacterized protein LOC104249065 OS=Nicotiana sylvestris OX=4096 GN=LOC104249065 PE=4 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 7.0e-36
Identity = 74/110 (67.27%), Positives = 92/110 (83.64%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           MD + AGEK+L++LNE++EFR  +YENAKLYKE+T RWHDK I PR F PGQ+VLLFNSR
Sbjct: 1   MDLKAAGEKKLMQLNELDEFRLHSYENAKLYKEKTKRWHDKHIKPRHFEPGQQVLLFNSR 60

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQG-NDGTTFKVNGQRLKHYIG 109
           LRLFPGKL++RWSGPF +V+V+P+GA+EL+  N G  F VNG R+KHY G
Sbjct: 61  LRLFPGKLKSRWSGPFEVVRVTPYGAIELRALNGGRKFLVNGHRVKHYWG 110

BLAST of CaUC11G210560 vs. ExPASy TrEMBL
Match: A0A6P6SHX4 (uncharacterized protein LOC113691659 OS=Coffea arabica OX=13443 GN=LOC113691659 PE=4 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 1.2e-35
Identity = 75/112 (66.96%), Positives = 89/112 (79.46%), Query Frame = 0

Query: 1   MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR 60
           MDF  AGEKRLLELNE+EE R  AYENAK+YKE+   WHDK I P+ F  GQ VLLFNSR
Sbjct: 31  MDFNLAGEKRLLELNELEEHRLHAYENAKIYKEKIKYWHDKHIIPKQFQVGQNVLLFNSR 90

Query: 61  LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEE 112
           LRLFPGKL++RWSGPF + +V P+GAVE++G +G  FKVNGQRLK Y+  E+
Sbjct: 91  LRLFPGKLKSRWSGPFEVTQVFPYGAVEIKGENGAPFKVNGQRLKLYLAGEK 142

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
WP_217833156.1  5.1e-57  92.62         hypothetical protein, partial [Synechococcus sp. PCC 7002]
XP_038885822.1  4.2e-43  76.92         uncharacterized protein LOC120076116 [Benincasa hispida]
XP_038885946.1  1.5e-40  73.87         uncharacterized protein LOC120076251 [Benincasa hispida]
XP_030498073.1  4.5e-37  67.57         uncharacterized protein LOC115713732 [Cannabis sativa]
XP_030502743.1  5.9e-37  68.81         uncharacterized protein LOC115717916 [Cannabis sativa]

ExPASy TrEMBL
Match Name  E-value  Identity (%)  Description
A0A6P4D1N6  3.1e-36  66.96         uncharacterized protein LOC107484509 OS=Arachis duranensis OX=130453 GN=LOC10748...
A0A2G9H400  4.1e-36  70.64         Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_15163 PE=...
A0A1S4DFS8  7.0e-36  68.75         uncharacterized protein LOC107829365 OS=Nicotiana tabacum OX=4097 GN=LOC10782936...
A0A1U7YP28  7.0e-36  67.27         uncharacterized protein LOC104249065 OS=Nicotiana sylvestris OX=4096 GN=LOC10424...
A0A6P6SHX4  1.2e-35  66.96         uncharacterized protein LOC113691659 OS=Coffea arabica OX=13443 GN=LOC113691659 ...

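
The bit scores and E-values reported for the hits are linked: under the standard BLAST statistics, E = m * n * 2^(-S) for bit score S and search space m * n, so two hits from the same search should have an E-value ratio of about 2 raised to the difference of their bit scores. The sketch below checks this for the first two NCBI nr hits. It assumes both E-values were computed against the same effective search space, which this page does not confirm.

```python
# (bit score, E-value) for the first two NCBI nr hits, copied from this page.
hit_a = (231.1, 5.1e-57)   # WP_217833156.1
hit_b = (184.9, 4.2e-43)   # XP_038885822.1

# Assumption: identical search space m * n for both hits, so it cancels out.
predicted_ratio = 2.0 ** (hit_b[0] - hit_a[0])   # 2^(S_b - S_a)
observed_ratio = hit_a[1] / hit_b[1]             # E_a / E_b

print(f"predicted E-value ratio: {predicted_ratio:.3e}")
print(f"observed E-value ratio:  {observed_ratio:.3e}")
```

Both ratios come out at roughly 1.2e-14; the small residual difference is consistent with rounding in the reported scores and E-values.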
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term  IPR Description   Source   Source Term      Source Description                    Alignment
None      No IPR available  PANTHER  PTHR24559        TRANSPOSON TY3-I GAG-POL POLYPROTEIN  coord: 4..95
None      No IPR available  PANTHER  PTHR24559:SF334  SUBFAMILY NOT NAMED                   coord: 4..95
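
The "coord: 4..95" field locates the PANTHER match on the protein. As a rough illustration, the matched segment can be sliced out of the protein sequence; this sketch assumes the coordinates are 1-based and inclusive residue positions, which the page does not state.

```python
# Protein sequence copied from this page.
PROTEIN = (
    "MDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITPRTFLPGQRVLLFNSR"
    "LRLFPGKLRTRWSGPFIIVKVSPHGAVELQGNDGTTFKVNGQRLKHYIGDEERGFENLAF"
    "IA"
)

# PANTHER match coordinates (assumed 1-based, inclusive).
match_start, match_end = 4, 95

# Convert to Python's 0-based, end-exclusive slice.
segment = PROTEIN[match_start - 1:match_end]
coverage = len(segment) / len(PROTEIN)

print(segment)
print(f"{len(segment)} of {len(PROTEIN)} residues ({coverage:.1%})")
```

Under that assumption the match spans 92 of the 122 residues, about three quarters of the predicted protein.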

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC11G210560.1  CaUC11G210560.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0071897      DNA biosynthetic process
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003887      DNA-directed DNA polymerase activity
molecular_function  GO:0003676      nucleic acid binding