ClCG08G001967 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG08G001967
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CG_Chr08: 4080358 .. 4080771 (+)
RNA-Seq Expression: ClCG08G001967
Synteny: ClCG08G001967
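
The Location field can be checked against the sequences listed for this feature. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, not stated on the page); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr08: 4080358 .. 4080771 (+)")
length = span_length(start, end)
print(seqid, strand, length)   # CG_Chr08 + 414
print(length // 3 - 1)         # 137 residues once the stop codon is excluded
```

Under that assumption the gene spans 414 bp, which corresponds to 138 codons: 137 residues plus the terminal stop codon, matching the protein sequence given for this feature.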
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATTAATAAGATTCAAAATTCAAAATTTTGATGGAAAGGGTGATTTTGGTCTTTGGAAGGCCAAAGTCAAAGCAGTTCTTGGTCAACAAAAGGCTCACAAGGCTCTTTTAGATTCATCTACTCTACCAAACATAGCCAAAGCTCAAGAAAAATTGGAAAGTCTTCTTGCAACCAAAGATCTTCCAAATAAAATGTTTTTGGGAGAAACTTTTTTCACATTTAAAATGGATGCCTCCAAGACTTACATAGAAAACTTGGATGAATACAAAAAGATAGTTTCAGAATTTAAAAGTCTTGGAGACAAATTGGGTGACAAAAATGAAGCCTATGTTCTATTAAACTCACTATTGGATACCTACAAGGATGTGAAGAATGCTTTGAAAATATGGCAAAGATTCGATTACAACTGA

mRNA sequence

ATGTCATTAATAAGATTCAAAATTCAAAATTTTGATGGAAAGGGTGATTTTGGTCTTTGGAAGGCCAAAGTCAAAGCAGTTCTTGGTCAACAAAAGGCTCACAAGGCTCTTTTAGATTCATCTACTCTACCAAACATAGCCAAAGCTCAAGAAAAATTGGAAAGTCTTCTTGCAACCAAAGATCTTCCAAATAAAATGTTTTTGGGAGAAACTTTTTTCACATTTAAAATGGATGCCTCCAAGACTTACATAGAAAACTTGGATGAATACAAAAAGATAGTTTCAGAATTTAAAAGTCTTGGAGACAAATTGGGTGACAAAAATGAAGCCTATGTTCTATTAAACTCACTATTGGATACCTACAAGGATGTGAAGAATGCTTTGAAAATATGGCAAAGATTCGATTACAACTGA

Coding sequence (CDS)

ATGTCATTAATAAGATTCAAAATTCAAAATTTTGATGGAAAGGGTGATTTTGGTCTTTGGAAGGCCAAAGTCAAAGCAGTTCTTGGTCAACAAAAGGCTCACAAGGCTCTTTTAGATTCATCTACTCTACCAAACATAGCCAAAGCTCAAGAAAAATTGGAAAGTCTTCTTGCAACCAAAGATCTTCCAAATAAAATGTTTTTGGGAGAAACTTTTTTCACATTTAAAATGGATGCCTCCAAGACTTACATAGAAAACTTGGATGAATACAAAAAGATAGTTTCAGAATTTAAAAGTCTTGGAGACAAATTGGGTGACAAAAATGAAGCCTATGTTCTATTAAACTCACTATTGGATACCTACAAGGATGTGAAGAATGCTTTGAAAATATGGCAAAGATTCGATTACAACTGA

Protein sequence

MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQEKLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALKIWQRFDYN
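
The protein sequence follows from the CDS by standard translation. As a minimal sketch, assuming the standard nuclear genetic code and a reading frame starting at the first base, the function below (a hypothetical helper, not part of the database) translates a CDS and stops at the first stop codon. Short fragments from the start and end of the CDS above are used as inputs.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCATTAATAAGATTC"))  # MSLIRF (first six codons of the CDS)
print(translate("GATTACAACTGA"))        # DYN (last three residues, then the TGA stop)
```

Passing the full CDS string to `translate` in the same way should give the protein sequence shown above.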
Homology
BLAST of ClCG08G001967 vs. NCBI nr
Match: XP_038896323.1 (uncharacterized protein LOC120084587 [Benincasa hispida])

HSP 1 Score: 174.5 bits (441), Expect = 6.4e-40
Identity = 90/148 (60.81%), Positives = 109/148 (73.65%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE--------- 60
           MS+ RF+++ FDGKGDFGLWKAK+KA+L QQKAH+ALL+ STL     AQE         
Sbjct: 1   MSIARFEVEKFDGKGDFGLWKAKIKAILDQQKAHRALLNPSTLSATMTAQEKEDWELATY 60

Query: 61  ----------KLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLG 120
                     KLE L ATKDLP+KM+L E FFTFKMD+SKT   N DE+KKIV+EFK+LG
Sbjct: 61  DQETAYRIWTKLEELYATKDLPSKMYLSEKFFTFKMDSSKTLTNNFDEFKKIVAEFKTLG 120

Query: 121 DKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           +KL DKNEAYVL NSL ++YK++KNALK
Sbjct: 121 EKLSDKNEAYVLPNSLPESYKEIKNALK 148
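
The percentages in each HSP header are the match counts divided by the alignment length. As a rough sketch, the hypothetical helper below pulls every `name = matches/length` pair out of a header line and recomputes the percentage; the regular expression is an assumption about the line layout shown on this page, not a general BLAST parser.

```python
import re

def hsp_fractions(header):
    """Return {field: (matches, length, percent)} for each 'name = a/b' pair in an HSP header."""
    result = {}
    for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        matches, length = int(matches), int(length)
        result[name] = (matches, length, round(100 * matches / length, 2))
    return result

header = "Identity = 90/148 (60.81%), Positives = 109/148 (73.65%), Query Frame = 0"
for name, (matches, length, pct) in hsp_fractions(header).items():
    print(f"{name}: {matches}/{length} = {pct}%")
# Identity: 90/148 = 60.81%
# Positives: 109/148 = 73.65%
```

Here 148 is the alignment length including gap columns, which is why it exceeds the 137-residue query length.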

BLAST of ClCG08G001967 vs. NCBI nr
Match: XP_038882247.1 (uncharacterized protein LOC120073473 [Benincasa hispida])

HSP 1 Score: 128.6 bits (322), Expect = 4.0e-26
Identity = 66/146 (45.21%), Positives = 94/146 (64.38%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE--------- 60
           M+  R++I+ FDGK DF LWK K+K VLGQQKA  A+ + +  P    A E         
Sbjct: 1   MATTRYEIEKFDGKTDFELWKVKIKEVLGQQKALLAITNPAKYPETLTAAEKKTIEIVDQ 60

Query: 61  --------KLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDK 120
                   KL  +   KDL NK FL E FFT KMD +K+  +NL+E+K++  +F+S+GD 
Sbjct: 61  PTAYDLWKKLNEIYLNKDLYNKAFLRERFFTHKMDVAKSLTDNLNEFKRLSLKFRSIGDN 120

Query: 121 LGDKNEAYVLLNSLLDTYKDVKNALK 130
           +G++NEA++LLNSLL+++KDVK A+K
Sbjct: 121 IGEENEAFILLNSLLESFKDVKTAMK 146

BLAST of ClCG08G001967 vs. NCBI nr
Match: XP_038885928.1 (uncharacterized protein LOC120076236 [Benincasa hispida])

HSP 1 Score: 123.2 bits (308), Expect = 1.7e-24
Identity = 68/164 (41.46%), Positives = 97/164 (59.15%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIA-KAQE-------- 60
           M+  R++I+ F+ K DF LWKAK+K VL +QKA  A+ D +  P I  KA++        
Sbjct: 1   MTTTRYEIEKFEAKTDFELWKAKIKVVLRKQKALLAITDPAKYPKILFKAEKETIESNAY 60

Query: 61  --------------------------KLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                     KL  +   KDLPNK FL E FFT+KMD +K+  +
Sbjct: 61  GTIVLNVIDSVLRQIVDRPTAYALWNKLNDIYLNKDLPNKAFLRERFFTYKMDPAKSLTD 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           NL+E+K + S+F+S+GD +G++NEA++LLNSL +T+KDVK ALK
Sbjct: 121 NLNEFKSLSSDFRSIGDNIGEENEAFILLNSLPETFKDVKTALK 164

BLAST of ClCG08G001967 vs. NCBI nr
Match: XP_038904517.1 (uncharacterized protein LOC120090894 [Benincasa hispida])

HSP 1 Score: 122.5 bits (306), Expect = 2.9e-24
Identity = 63/146 (43.15%), Positives = 93/146 (63.70%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPN------------IAK 60
           M+  R++I+ F+G  DFGLWK K+KAVLGQQKA  A+ D +  P             + K
Sbjct: 1   MATTRYEIEKFNGNTDFGLWKVKIKAVLGQQKALLAITDPAKYPETLTDVEKETIEIVDK 60

Query: 61  AQ-----EKLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDK 120
           A       KL  +   KDLPNK F  + F T+KMD  K+  +NL+++K++ SEFKS+GD 
Sbjct: 61  ATSDALWNKLNEIYLHKDLPNKAFWRKRFLTYKMDVVKSLTDNLNKFKRLSSEFKSIGDN 120

Query: 121 LGDKNEAYVLLNSLLDTYKDVKNALK 130
           +G++N+A++LLNSL +++KDV   +K
Sbjct: 121 IGEENDAFILLNSLPESFKDVNTTMK 146

BLAST of ClCG08G001967 vs. NCBI nr
Match: XP_038887098.1 (uncharacterized protein LOC120077280 [Benincasa hispida])

HSP 1 Score: 121.3 bits (303), Expect = 6.4e-24
Identity = 68/164 (41.46%), Positives = 93/164 (56.71%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE--------- 60
           M+  +++I+ FD K DF L KAK+KAVLGQQKA  A+ D S  P      E         
Sbjct: 1   MATTKYEIEKFDEKIDFELRKAKIKAVLGQQKALLAITDPSKYPETLSEAEKETIESNAY 60

Query: 61  --------------------------KLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                     KL  +   KD PNK FL E FFT+KMD +K+  +
Sbjct: 61  GTIILNVTDSVLRQIMDQPTVYALWNKLNEIYLNKDFPNKNFLRERFFTYKMDPTKSLTD 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           NL+E+K++ SEF+S+GD +G++NEA++L NSL +T+KDVK ALK
Sbjct: 121 NLNEFKRLSSEFRSIGDNIGEENEAFILFNSLPETFKDVKTALK 164

BLAST of ClCG08G001967 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 4.6e-09
Identity = 37/163 (22.70%), Positives = 77/163 (47.24%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE--------- 60
           MS +++++  F+G   F  W+ +++ +L QQ  HK L   S  P+  KA++         
Sbjct: 1   MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAA 60

Query: 61  --------------------------KLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                     +LESL  +K L NK++L +  +   M     ++ 
Sbjct: 61  SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLS 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNAL 129
           +L+ +  ++++  +LG K+ ++++A +LLNSL  +Y ++   +
Sbjct: 121 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTI 163

BLAST of ClCG08G001967 vs. ExPASy TrEMBL
Match: A0A5A7U6R2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold333G00930 PE=4 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 2.6e-23
Identity = 67/164 (40.85%), Positives = 91/164 (55.49%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ--------- 60
           M+  +F+I+ FDG GDF LW  ++ A+LG QKA KAL D   LP  + K++         
Sbjct: 1   MTTTKFEIEKFDGNGDFTLWTKRITAILGSQKALKALEDPKELPATLTKSERETLEEVAY 60

Query: 61  -------------------------EKLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                    EKL+SL   KDLPNKMF+ E  F+FK + +K   E
Sbjct: 61  STLIMNITDNVLRQVIEETTAFATWEKLKSLYEKKDLPNKMFIKEKLFSFKKNQNKNLDE 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           NLDE+KK+ +     G+KLG +NEA +L+NS+ DTYK+VK  LK
Sbjct: 121 NLDEFKKLTNALNQTGEKLGAENEAAILINSIHDTYKEVKTGLK 164

BLAST of ClCG08G001967 vs. ExPASy TrEMBL
Match: A0A5D3DNU1 (Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004440 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 1.0e-22
Identity = 65/164 (39.63%), Positives = 97/164 (59.15%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ--------- 60
           M+  RF++  F+G GDF LW+ K++A+L Q K  K +LD   LP NI +++         
Sbjct: 1   MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAK-ILDEERLPDNITESEKRDMDEMAY 60

Query: 61  -------------------------EKLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                    +KLESL  TK LPNK+++ E FF +KMD SK+  E
Sbjct: 61  STILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEKFFGYKMDQSKSLEE 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           NLDE++KIV +  ++G+K+ D+N+A +LLNSL +TY++VK A+K
Sbjct: 121 NLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIK 163

BLAST of ClCG08G001967 vs. ExPASy TrEMBL
Match: A0A5C7HX22 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_012728 PE=4 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.7e-22
Identity = 64/130 (49.23%), Positives = 87/130 (66.92%), Query Frame = 0

Query: 9   QNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPN-------IAKAQEKLESLLATKD 68
           + FDG GDFG+W+ KVKA+L QQK  +A+ D   LP+            +KLESL  TK 
Sbjct: 24  KEFDGSGDFGIWRRKVKALLFQQKILEAIEDLEKLPDKLNDEKTACDVWKKLESLYLTKS 83

Query: 69  LPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLG--DKLGDKNEAYVLLNSLLD 128
           L NK++L E  F+FKMDASK   +NLDE+KK++ E  + G  +KL D+NEA +LLNSL +
Sbjct: 84  LTNKIYLKERLFSFKMDASKCLGQNLDEFKKMIIELANAGVYEKLSDENEAIILLNSLPE 143

Query: 129 TYKDVKNALK 130
           ++ DVK A+K
Sbjct: 144 SFGDVKAAIK 153

BLAST of ClCG08G001967 vs. ExPASy TrEMBL
Match: A0A5A7U2U7 (Retrotransposon protein, putative, Ty1-copia sub-class OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold385G00590 PE=4 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.7e-22
Identity = 65/164 (39.63%), Positives = 96/164 (58.54%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ--------- 60
           M+  RF++  F+G GDF LW+ K++A+L Q K  K +LD   LP NI +++         
Sbjct: 1   MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAK-ILDEERLPDNITESEKRDMDEMAY 60

Query: 61  -------------------------EKLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                    +KLESL  TK LPNK+++ E FF +KMD SK   E
Sbjct: 61  WTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEKFFGYKMDQSKILEE 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           NLDE++KIV +  ++G+K+ D+N+A +LLNSL +TY++VK A+K
Sbjct: 121 NLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIK 163

BLAST of ClCG08G001967 vs. ExPASy TrEMBL
Match: A0A5A7UB25 (Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold560G00190 PE=4 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 1.4e-21
Identity = 64/164 (39.02%), Positives = 96/164 (58.54%), Query Frame = 0

Query: 1   MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ--------- 60
           M+  RF++  F+G GDF LW+ K++A+L Q K  K +LD   LP NI +++         
Sbjct: 1   MASTRFEVSKFNGHGDFSLWRKKIRAILVQHKVAK-ILDEERLPDNITESEKRDMDEMAY 60

Query: 61  -------------------------EKLESLLATKDLPNKMFLGETFFTFKMDASKTYIE 120
                                    +KLESL  TK L NK+++ E FF +KMD SK+  E
Sbjct: 61  STILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLLNKIYIKEKFFGYKMDQSKSLEE 120

Query: 121 NLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK 130
           NLDE++KIV +  ++G+K+ D+N+A +LLNSL +TY++VK A+K
Sbjct: 121 NLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIK 163

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038896323.1  6.4e-40  60.81         uncharacterized protein LOC120084587 [Benincasa hispida]
XP_038882247.1  4.0e-26  45.21         uncharacterized protein LOC120073473 [Benincasa hispida]
XP_038885928.1  1.7e-24  41.46         uncharacterized protein LOC120076236 [Benincasa hispida]
XP_038904517.1  2.9e-24  43.15         uncharacterized protein LOC120090894 [Benincasa hispida]
XP_038887098.1  6.4e-24  41.46         uncharacterized protein LOC120077280 [Benincasa hispida]

ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
P10978          4.6e-09  22.70         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5A7U6R2      2.6e-23  40.85         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3DNU1      1.0e-22  39.63         Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A5C7HX22      1.7e-22  49.23         Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_012728 PE=4 SV=1
A0A5A7U2U7      1.7e-22  39.63         Retrotransposon protein, putative, Ty1-copia sub-class OS=Cucumis melo var. maku...
A0A5A7UB25      1.4e-21  39.02         Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source  Source Term  Source Description  Alignment
None      No IPR available  PFAM    PF14223      Retrotran_gag_2     coord: 48..130; e-value: 7.6E-12; score: 45.2

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG08G001967.1  ClCG08G001967.1  mRNA