Bhi01G000212 (gene) Wax gourd (B227) v1

Overview
Name: Bhi01G000212
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: chr1: 5622608 .. 5623337 (+)
RNA-Seq Expression: Bhi01G000212
Synteny: Bhi01G000212
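
The Location field gives a chromosome, a start and end coordinate, and a strand. A minimal Python sketch of how the feature length follows from it, assuming the coordinates are 1-based and inclusive; the helper names `parse_location` and `span_length` are hypothetical and not part of any database API:

```python
import re

def parse_location(location: str):
    """Split 'chrom: start .. end (strand)' into its four parts (illustrative helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("chr1: 5622608 .. 5623337 (+)")
print(chrom, strand, span_length(start, end))  # chr1 + 730
```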
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCGAGTACTTGATTTAAAAGGAAAATTGGAGACAATGAAAAAAGGCAATCTGAAGTTAGAGGAGTATTTTCACAAGATTAAAAACTTAGTTGATTCATTAAAGGTTGTCAGTCAATCTATCTCTTATGAAGATCATGTTATGCATATTCTAGTCAGTTTGGGACCTGGGTATGACTCTACTGTTTTTATCATCACAGACAAGGATGAATACCCTTCTTTACAAAGGGATAATTCTCAGTTCCTTGCTCAAGAAAATCGTGTTGAGAGACATAGCCTAGTGAATGTTGATGGTTCTACCCCTTTAGTCAATCTCACCACACAAACTTCATCCAAACAACCTTATTCTTCTCCTTTTCAACCCAATAATAATAATACCGAGTGTAACAGAAGAAGTAAGAATTCGATTGGCAAGCAGAACATAAGATTTTGGAACAATAATGGTAAACCTCAATGCCAAGTTTGTGGCAAGTTTGGACACACTACTCTTAAGCGTTATCTTCTTCTTGAGAAATGGTTTCATAGACCAAACTCAAGTAATAATACATAGCATACTTCTTCCAATCATCAAGGACAACATATAGGGAATTCCTGTGTGAATATTCTGTCTATACAAAATGATCTCAATAGGGAAAACCAGTGGTTTCGGAAACTCTAGAGCCACTAGTCACGTCACAAATGATGCATCTAATCTCACTTTTGGTACTGAATATCTTGGTGACAGTTAG

mRNA sequence

ATGGCTCGAGTACTTGATTTAAAAGGAAAATTGGAGACAATGAAAAAAGGCAATCTGAAGTTAGAGGAGTATTTTCACAAGATTAAAAACTTAGTTGATTCATTAAAGGTTGTCAGTCAATCTATCTCTTATGAAGATCATGTTATGCATATTCTAGTCAGTTTGGGACCTGGGTATGACTCTACTGTTTTTATCATCACAGACAAGGATGAATACCCTTCTTTACAAAGGGATAATTCTCAGTTCCTTGCTCAAGAAAATCGTGTTGAGAGACATAGCCTAGTGAATGTTGATGGTTCTACCCCTTTAGTCAATCTCACCACACAAACTTCATCCAAACAACCTTATTCTTCTCCTTTTCAACCCAATAATAATAATACCGAGTGTAACAGAAGAAGTAAGAATTCGATTGGCAAGCAGAACATAAGATTTTGGAACAATAATGGGAAAACCAGTGGTTTCGGAAACTCTAGAGCCACTAGTCACGTCACAAATGATGCATCTAATCTCACTTTTGGTACTGAATATCTTGGTGACAGTTAG

Coding sequence (CDS)

ATGGCTCGAGTACTTGATTTAAAAGGAAAATTGGAGACAATGAAAAAAGGCAATCTGAAGTTAGAGGAGTATTTTCACAAGATTAAAAACTTAGTTGATTCATTAAAGGTTGTCAGTCAATCTATCTCTTATGAAGATCATGTTATGCATATTCTAGTCAGTTTGGGACCTGGGTATGACTCTACTGTTTTTATCATCACAGACAAGGATGAATACCCTTCTTTACAAAGGGATAATTCTCAGTTCCTTGCTCAAGAAAATCGTGTTGAGAGACATAGCCTAGTGAATGTTGATGGTTCTACCCCTTTAGTCAATCTCACCACACAAACTTCATCCAAACAACCTTATTCTTCTCCTTTTCAACCCAATAATAATAATACCGAGTGTAACAGAAGAAGTAAGAATTCGATTGGCAAGCAGAACATAAGATTTTGGAACAATAATGGGAAAACCAGTGGTTTCGGAAACTCTAGAGCCACTAGTCACGTCACAAATGATGCATCTAATCTCACTTTTGGTACTGAATATCTTGGTGACAGTTAG

Protein sequence

MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYDSTVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSSPFQPNNNNTECNRRSKNSIGKQNIRFWNNNGKTSGFGNSRATSHVTNDASNLTFGTEYLGDS
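
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, and only the first ten codons and the last four codons of the CDS are used to keep the example short:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTCGAGTACTTGATTTAAAAGGAAAA"))  # MARVLDLKGK
print(translate("GGTGACAGTTAG"))                    # GDS
```

The two outputs correspond to the start and end of the protein sequence listed above.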
Homology
BLAST of Bhi01G000212 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 52.8 bits (125), Expect = 3.4e-07
Identity = 34/125 (27.20%), Positives = 63/125 (50.40%), Query Frame = 0

Query: 2   ARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYDS 61
           AR L L  +L T   G++++ +Y+ K+K L DSL+ V   ++  + VM++L  L P +D+
Sbjct: 112 ARALRLDSELRTKDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDN 171

Query: 62  TVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSSPFQ 121
            + +I  +  +PS     +    +E+R++R    N        + T    S+ P  + FQ
Sbjct: 172 IINVIKHRQPFPSFDDAATMLQEEEDRLKRAIKPNPTHVDHSSSSTVLACSEAPPVTNFQ 231

Query: 122 PNNNN 127
            +  N
Sbjct: 232 RSGGN 236
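
Each HSP header reports identical and positive positions as counts over the alignment length, with the percentage in parentheses. A minimal sketch that recomputes those percentages from the counts; the function name `hsp_fractions` and the regular expression are illustrative assumptions, not part of any BLAST tool:

```python
import re

def hsp_fractions(stats_line: str) -> dict:
    """Return {label: percentage} for every 'Label = hits/length' pair in the line."""
    result = {}
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(hits) / int(length), 2)
    return result

line = "Identity = 34/125 (27.20%), Positives = 63/125 (50.40%), Query Frame = 0"
print(hsp_fractions(line))  # {'Identity': 27.2, 'Positives': 50.4}
```

Fields without a `hits/length` pair, such as the query frame, are ignored by the pattern.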

BLAST of Bhi01G000212 vs. NCBI nr
Match: XP_022154487.1 (uncharacterized protein LOC111021757 [Momordica charantia])

HSP 1 Score: 111.7 bits (278), Expect = 6.7e-21
Identity = 72/153 (47.06%), Positives = 93/153 (60.78%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE  KKGNL L++YF KIKNLVDSL +  + +S EDH+MHIL  LGP +D
Sbjct: 135 LARVMQLKLKLENFKKGNLSLKDYFLKIKNLVDSLAIAGKKLSTEDHIMHILAGLGPEFD 194

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQ---PYS 120
           + + +IT ++   +LQ   S  L QE R ER +L+N DGS P VNLT   SSK+     S
Sbjct: 195 AIISVITARNMPQTLQEVCSLLLQQEGRNER-NLINSDGSLPSVNLTLNDSSKKNNLHQS 254

Query: 121 SPFQPNNNNTECNRRSKNSIGKQNIRFWNNNGK 151
             F P+ +N     R  N+    N R W  N K
Sbjct: 255 KCFNPHQSNYSQRGRGTNN-RSSNRRNWTGNNK 285

BLAST of Bhi01G000212 vs. NCBI nr
Match: XP_022136882.1 (dr1-associated corepressor homolog isoform X1 [Momordica charantia])

HSP 1 Score: 108.2 bits (269), Expect = 7.4e-20
Identity = 67/155 (43.23%), Positives = 92/155 (59.35%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE +KKGNL L++YF K+K LVDSL    + ++ EDH+MHIL  L   ++
Sbjct: 45  LARVMQLKSKLENIKKGNLPLKDYFQKVKALVDSLAAAGKKVTVEDHIMHILTGLRSEFE 104

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSS-- 120
           STV +I+ + +  +LQ   S  L+ E R ER+S +N DG+ P VNLT QT +     S  
Sbjct: 105 STVSVISARTQTQTLQEVYSLLLSHEGRNERNS-INTDGTLPSVNLTQQTKNSNSAQSID 164

Query: 121 ---PFQPNNNNTECNRRSKNSIGKQNIRFWNNNGK 151
              P+  NN       RSKNS      R WN+N +
Sbjct: 165 GQRPYMQNN-------RSKNSGNPNFRRNWNSNNR 191

BLAST of Bhi01G000212 vs. NCBI nr
Match: XP_022136883.1 (dr1-associated corepressor homolog isoform X2 [Momordica charantia])

HSP 1 Score: 108.2 bits (269), Expect = 7.4e-20
Identity = 67/155 (43.23%), Positives = 92/155 (59.35%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE +KKGNL L++YF K+K LVDSL    + ++ EDH+MHIL  L   ++
Sbjct: 45  LARVMQLKSKLENIKKGNLPLKDYFQKVKALVDSLAAAGKKVTVEDHIMHILTGLRSEFE 104

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSS-- 120
           STV +I+ + +  +LQ   S  L+ E R ER+S +N DG+ P VNLT QT +     S  
Sbjct: 105 STVSVISARTQTQTLQEVYSLLLSHEGRNERNS-INTDGTLPSVNLTQQTKNSNSAQSID 164

Query: 121 ---PFQPNNNNTECNRRSKNSIGKQNIRFWNNNGK 151
              P+  NN       RSKNS      R WN+N +
Sbjct: 165 GQRPYMQNN-------RSKNSGNPNFRRNWNSNNR 191

BLAST of Bhi01G000212 vs. NCBI nr
Match: XP_022156747.1 (uncharacterized protein LOC111023586 [Momordica charantia])

HSP 1 Score: 85.1 bits (209), Expect = 6.7e-13
Identity = 64/179 (35.75%), Positives = 94/179 (52.51%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE MKKG++ L+ YF KIKNLVDSL    + +  +DH+MHIL  LGP +D
Sbjct: 112 LARVMQLKSKLENMKKGSMNLKNYFLKIKNLVDSLATAGKRLPTDDHIMHILARLGPEFD 171

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSSPF 120
           S V +I+ +    S+Q  +S   +              G  P V  +T  SS    S+P 
Sbjct: 172 SIVSVISTRKSPQSIQEPSSNGFSH-------------GFPPQVQSSTGFSSS---STPA 231

Query: 121 QPNNNNTECNRRSKNSIGKQNIRFWNNNGKTSGFGNSRATSHVTNDASNLTFGTEYLGD 180
           Q N      +     ++   N    + N   + + +S AT+HVTND  N + G++Y G+
Sbjct: 232 QSNFGVFGGSTPQMQAMMVAN----DFNRDVTWYPDSGATNHVTNDFGNFSLGSKYHGN 270

BLAST of Bhi01G000212 vs. NCBI nr
Match: KAA0046195.1 (putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa] >TYK14162.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa])

HSP 1 Score: 82.8 bits (203), Expect = 3.3e-12
Identity = 50/132 (37.88%), Positives = 78/132 (59.09%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +A+ +  K KL  MKKG + L+EYF KI+  VD+L  +++ IS +DH+++IL  LG  Y 
Sbjct: 112 LAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQ 171

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQT---SSKQPYS 120
           S + II+ + + PS+Q + S  L QE+++E  S +  + S P VN+TT T   SS +  S
Sbjct: 172 SIISIISARTDSPSVQDNMSLLLTQESQIE--SKITSEVSLPTVNMTTHTRDISSLEKES 231

Query: 121 SPFQPNNNNTEC 130
                  +N  C
Sbjct: 232 EVTHRGGSNNLC 241

BLAST of Bhi01G000212 vs. ExPASy TrEMBL
Match: A0A6J1DLT9 (uncharacterized protein LOC111021757 OS=Momordica charantia OX=3673 GN=LOC111021757 PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.2e-21
Identity = 72/153 (47.06%), Positives = 93/153 (60.78%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE  KKGNL L++YF KIKNLVDSL +  + +S EDH+MHIL  LGP +D
Sbjct: 135 LARVMQLKLKLENFKKGNLSLKDYFLKIKNLVDSLAIAGKKLSTEDHIMHILAGLGPEFD 194

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQ---PYS 120
           + + +IT ++   +LQ   S  L QE R ER +L+N DGS P VNLT   SSK+     S
Sbjct: 195 AIISVITARNMPQTLQEVCSLLLQQEGRNER-NLINSDGSLPSVNLTLNDSSKKNNLHQS 254

Query: 121 SPFQPNNNNTECNRRSKNSIGKQNIRFWNNNGK 151
             F P+ +N     R  N+    N R W  N K
Sbjct: 255 KCFNPHQSNYSQRGRGTNN-RSSNRRNWTGNNK 285

BLAST of Bhi01G000212 vs. ExPASy TrEMBL
Match: A0A6J1C8R2 (dr1-associated corepressor homolog isoform X2 OS=Momordica charantia OX=3673 GN=LOC111008464 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 3.6e-20
Identity = 67/155 (43.23%), Positives = 92/155 (59.35%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE +KKGNL L++YF K+K LVDSL    + ++ EDH+MHIL  L   ++
Sbjct: 45  LARVMQLKSKLENIKKGNLPLKDYFQKVKALVDSLAAAGKKVTVEDHIMHILTGLRSEFE 104

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSS-- 120
           STV +I+ + +  +LQ   S  L+ E R ER+S +N DG+ P VNLT QT +     S  
Sbjct: 105 STVSVISARTQTQTLQEVYSLLLSHEGRNERNS-INTDGTLPSVNLTQQTKNSNSAQSID 164

Query: 121 ---PFQPNNNNTECNRRSKNSIGKQNIRFWNNNGK 151
              P+  NN       RSKNS      R WN+N +
Sbjct: 165 GQRPYMQNN-------RSKNSGNPNFRRNWNSNNR 191

BLAST of Bhi01G000212 vs. ExPASy TrEMBL
Match: A0A6J1C6N9 (dr1-associated corepressor homolog isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008464 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 3.6e-20
Identity = 67/155 (43.23%), Positives = 92/155 (59.35%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE +KKGNL L++YF K+K LVDSL    + ++ EDH+MHIL  L   ++
Sbjct: 45  LARVMQLKSKLENIKKGNLPLKDYFQKVKALVDSLAAAGKKVTVEDHIMHILTGLRSEFE 104

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSS-- 120
           STV +I+ + +  +LQ   S  L+ E R ER+S +N DG+ P VNLT QT +     S  
Sbjct: 105 STVSVISARTQTQTLQEVYSLLLSHEGRNERNS-INTDGTLPSVNLTQQTKNSNSAQSID 164

Query: 121 ---PFQPNNNNTECNRRSKNSIGKQNIRFWNNNGK 151
              P+  NN       RSKNS      R WN+N +
Sbjct: 165 GQRPYMQNN-------RSKNSGNPNFRRNWNSNNR 191

BLAST of Bhi01G000212 vs. ExPASy TrEMBL
Match: A0A6J1DSS1 (uncharacterized protein LOC111023586 OS=Momordica charantia OX=3673 GN=LOC111023586 PE=4 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 3.2e-13
Identity = 64/179 (35.75%), Positives = 94/179 (52.51%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +ARV+ LK KLE MKKG++ L+ YF KIKNLVDSL    + +  +DH+MHIL  LGP +D
Sbjct: 112 LARVMQLKSKLENMKKGSMNLKNYFLKIKNLVDSLATAGKRLPTDDHIMHILARLGPEFD 171

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQTSSKQPYSSPF 120
           S V +I+ +    S+Q  +S   +              G  P V  +T  SS    S+P 
Sbjct: 172 SIVSVISTRKSPQSIQEPSSNGFSH-------------GFPPQVQSSTGFSSS---STPA 231

Query: 121 QPNNNNTECNRRSKNSIGKQNIRFWNNNGKTSGFGNSRATSHVTNDASNLTFGTEYLGD 180
           Q N      +     ++   N    + N   + + +S AT+HVTND  N + G++Y G+
Sbjct: 232 QSNFGVFGGSTPQMQAMMVAN----DFNRDVTWYPDSGATNHVTNDFGNFSLGSKYHGN 270

BLAST of Bhi01G000212 vs. ExPASy TrEMBL
Match: A0A5D3CRZ7 (Putative Ty1-copia-like retrotransposon OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold688G00160 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 1.6e-12
Identity = 50/132 (37.88%), Positives = 78/132 (59.09%), Query Frame = 0

Query: 1   MARVLDLKGKLETMKKGNLKLEEYFHKIKNLVDSLKVVSQSISYEDHVMHILVSLGPGYD 60
           +A+ +  K KL  MKKG + L+EYF KI+  VD+L  +++ IS +DH+++IL  LG  Y 
Sbjct: 112 LAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQ 171

Query: 61  STVFIITDKDEYPSLQRDNSQFLAQENRVERHSLVNVDGSTPLVNLTTQT---SSKQPYS 120
           S + II+ + + PS+Q + S  L QE+++E  S +  + S P VN+TT T   SS +  S
Sbjct: 172 SIISIISARTDSPSVQDNMSLLLTQESQIE--SKITSEVSLPTVNMTTHTRDISSLEKES 231

Query: 121 SPFQPNNNNTEC 130
                  +N  C
Sbjct: 232 EVTHRGGSNNLC 241

The following BLAST results are available for this feature:

TAIR 10
Match Name       E-value   Identity (%)   Description
AT1G34070.1      3.4e-07   27.20          CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...

NCBI nr
Match Name       E-value   Identity (%)   Description
XP_022154487.1   6.7e-21   47.06          uncharacterized protein LOC111021757 [Momordica charantia]
XP_022136882.1   7.4e-20   43.23          dr1-associated corepressor homolog isoform X1 [Momordica charantia]
XP_022136883.1   7.4e-20   43.23          dr1-associated corepressor homolog isoform X2 [Momordica charantia]
XP_022156747.1   6.7e-13   35.75          uncharacterized protein LOC111023586 [Momordica charantia]
KAA0046195.1     3.3e-12   37.88          putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa] >TYK14162.1 p...

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A6J1DLT9       3.2e-21   47.06          uncharacterized protein LOC111021757 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A6J1C8R2       3.6e-20   43.23          dr1-associated corepressor homolog isoform X2 OS=Momordica charantia OX=3673 GN=...
A0A6J1C6N9       3.6e-20   43.23          dr1-associated corepressor homolog isoform X1 OS=Momordica charantia OX=3673 GN=...
A0A6J1DSS1       3.2e-13   35.75          uncharacterized protein LOC111023586 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A5D3CRZ7       1.6e-12   37.88          Putative Ty1-copia-like retrotransposon OS=Cucumis melo var. makuwa OX=1194695 G...

InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term   IPR Description    Source        Source Term      Source Description    Alignment
None       No IPR available   PFAM          PF14223          Retrotran_gag_2       coord: 7..89; e-value: 4.7E-8; score: 32.9
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction   coord: 109..136
None       No IPR available   PANTHER       PTHR34222:SF48   SUBFAMILY NOT NAMED   coord: 3..171
None       No IPR available   PANTHER       PTHR34222        FAMILY NOT NAMED      coord: 3..171

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Bhi01M000212   Bhi01M000212   mRNA