CmaCh16G010070 (gene) Cucurbita maxima (Rimu)

Name: CmaCh16G010070
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cma_Chr16 : 7756537 .. 7758349 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTAAACAAATAGGAGGCATTACATCGAAGGGTGAAGAAGAAGTACTCTACACAAGTGAAAGCCGGAGCAATAATAGGCCGTCTACAAAACGCGGATACAATGGTGACAAAACAAGAAGTCACCAAGGAATTGTACAACTAGGGAGAGCTTATAAGAACGGTAACAATAACTCTCAAGGGAAAAGATTTGAGGGCATTTACTACAATTGCGGGAAGAAGGGCCACATGTCCAAAGATTGTTGGTCTAAGAAAAAATCTGTCGAAAGCAATGTGACATCCTCCAACATGGAGATGGAGGAGGAATGGGATGCAGAGGTACTCTATGCTATAGAAGAAGATGAGCTAGCACTCATGGTGATGATGGGAGACCATATCGATTATGAGAATGATTGGATCATTGATTTAGGATACTCAAACCACATGATTGATGATCAAAGTGGTGCAGTGGAAGAGGGGTGGCCCTTAGAGAAGAGTATTACCAAGCCTTGAGCAAATTGAAGAAATTCTTTTGACGGAAGAGCAAACTGAAGAGATTCTTCCACAGAAGACGGGGAGGAAACTGTACACATTTGCTTCAGTGCTAATGTGGCTGAAGATTCAAGTGACACTAGTCTTGGTGAGTAATTGGTGAGTAAGAAGTGACTCAACCAAGCGAACCTAGTAAGAAATAATGAGCACCTCAACCACTAAGACAATTAGAAATGATCCAAAAGCCAAGTCTAGAGTATGTCAACACAACTATTGTAGAAGATGAAGTTAATATAAGCTAGAGATATATGAGGATGTATCACAAAACTCGGTTTGGCAGAAAGCGATTGAGGAAGAAATTATAGCCGTGGAGCAAAATCAAACTTGAGAACTAGTGCCAAGATCAAGAGATGTTAAATCTATCTTTTGCAAGTGGATTTACAAAATAAACTGTACCCCGAATGGATCAATTGTGAGATACAAAACTCAGACTGTAGATCCAGGGTTCTCTCAACAATATGAACTAGACTATGATGAAACGTTTAGTCTAGTGGCAAAGATCATTATCGTACAAGTTTCTCCAGTACTTACGGTAAATAAAGATTGAAAATTATGGTAGATGGATATGAATAATGCTTTATTGCATGGAGAGTTAAACAGAGAGTTCTACATGGACTAACTGGAGAAATTCGAAAATGAAGTTGGAGTCATTGATCAATATATGTAAAATCCGAAGAAGCCTCATTTGGATGCGGCTCGACGAACCTTGAGATATGTCAAAGGTACAATCAATTACGATCGTTTATACAAAAGAAGCGAAGACTAGAAGCTAGCTGGATATAGTGATGTCGACTATGCAGGAGACCACGATACCCGAAGATCAATCACTGGGTATGTGTTCAAGCTTGGTTTGAGAACAATTTTTTGGTGTAACAAAAGACAACTAACAATATCATTGTCAACTAGAGAAGCAGAGTATAGAGCAGCAGCTGGAGCAGCTCAGGAAAATACATGGTTAAAACTTTTGATGGAAGATTGGCACCAGAAAATTGAGTATCTAATACTTCATTACAACAATCAATCTGTGATTCGCCTAGCAGAAAATTCGGTGTTTCATGCTAGAACAAAACATGTAGAGGTACACTATCATTTCATTAGAGAGAAAGTCTTGAAGGAACAAATGGAGATGCAGCAGATCAAGACAGATGATCAAGTGACAGACTTGTTTACAAAAGGGCTGAATACTGCTAAACATGAGAGTTTTCGCTGTCAGCTCAACATGGTGCAACGAATGAGGACTAGTGCTGAGGGGAGTCTTAAAATATCATCACTAACCCAATAG

mRNA sequence

ATGGCTAAACAAATAGGAGGCATTACATCGAAGGGTGAAGAAGAAGTACTCTACACAAGTGAAAGCCGGAGCAATAATAGGCCGTCTACAAAACGCGGATACAATGGTGACAAAACAAGAAGTCACCAAGGAATTGTACAACTAGGGAGAGCTTATAAGAACGGTAACAATAACTCTCAAGGGAAAAGATTTGAGGGCATTTACTACAATTGCGGGAAGAAGGGCCACATGTCCAAAGATTGTTGGTCTAAGAAAAAATCTGTCGAAAGCAATGTGACATCCTCCAACATGGAGATGGAGGAGGAATGGGATGCAGAGGTACTCTATGCTATAGAAGAAGATGAGCTAGCACTCATGGTGATGATGGGAGACCATATCGATTATGAGAATGATTGGATCATTGATTTAGGATACTCAAACCACATGATTGATGATCAAAGTGGAGACCACGATACCCGAAGATCAATCACTGGGTATGTGTTCAAGCTTGGTTTGAGAACAATTTTTTGGTGTAACAAAAGACAACTAACAATATCATTGTCAACTAGAGAAGCAGAGTATAGAGCAGCAGCTGGAGCAGCTCAGGAAAATACATGGTTAAAACTTTTGATGGAAGATTGGCACCAGAAAATTGAGTATCTAATACTTCATTACAACAATCAATCTGTGATTCGCCTAGCAGAAAATTCGGTGTTTCATGCTAGAACAAAACATGTAGAGGTACACTATCATTTCATTAGAGAGAAAGTCTTGAAGGAACAAATGGAGATGCAGCAGATCAAGACAGATGATCAAGTGACAGACTTGTTTACAAAAGGGCTGAATACTGCTAAACATGAGAGTTTTCGCTGTCAGCTCAACATGGTGCAACGAATGAGGACTAGTGCTGAGGGGAGTCTTAAAATATCATCACTAACCCAATAG

Coding sequence (CDS)

ATGGCTAAACAAATAGGAGGCATTACATCGAAGGGTGAAGAAGAAGTACTCTACACAAGTGAAAGCCGGAGCAATAATAGGCCGTCTACAAAACGCGGATACAATGGTGACAAAACAAGAAGTCACCAAGGAATTGTACAACTAGGGAGAGCTTATAAGAACGGTAACAATAACTCTCAAGGGAAAAGATTTGAGGGCATTTACTACAATTGCGGGAAGAAGGGCCACATGTCCAAAGATTGTTGGTCTAAGAAAAAATCTGTCGAAAGCAATGTGACATCCTCCAACATGGAGATGGAGGAGGAATGGGATGCAGAGGTACTCTATGCTATAGAAGAAGATGAGCTAGCACTCATGGTGATGATGGGAGACCATATCGATTATGAGAATGATTGGATCATTGATTTAGGATACTCAAACCACATGATTGATGATCAAAGTGGAGACCACGATACCCGAAGATCAATCACTGGGTATGTGTTCAAGCTTGGTTTGAGAACAATTTTTTGGTGTAACAAAAGACAACTAACAATATCATTGTCAACTAGAGAAGCAGAGTATAGAGCAGCAGCTGGAGCAGCTCAGGAAAATACATGGTTAAAACTTTTGATGGAAGATTGGCACCAGAAAATTGAGTATCTAATACTTCATTACAACAATCAATCTGTGATTCGCCTAGCAGAAAATTCGGTGTTTCATGCTAGAACAAAACATGTAGAGGTACACTATCATTTCATTAGAGAGAAAGTCTTGAAGGAACAAATGGAGATGCAGCAGATCAAGACAGATGATCAAGTGACAGACTTGTTTACAAAAGGGCTGAATACTGCTAAACATGAGAGTTTTCGCTGTCAGCTCAACATGGTGCAACGAATGAGGACTAGTGCTGAGGGGAGTCTTAAAATATCATCACTAACCCAATAG

Protein sequence

MAKQIGGITSKGEEEVLYTSESRSNNRPSTKRGYNGDKTRSHQGIVQLGRAYKNGNNNSQGKRFEGIYYNCGKKGHMSKDCWSKKKSVESNVTSSNMEMEEEWDAEVLYAIEEDELALMVMMGDHIDYENDWIIDLGYSNHMIDDQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLMEDWHQKIEYLILHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTDDQVTDLFTKGLNTAKHESFRCQLNMVQRMRTSAEGSLKISSLTQ
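The protein sequence is the translation of the CDS. As a rough check, the sketch below translates the first and last few codons of the CDS with the standard genetic code. The choice of the standard code (NCBI translation table 1) is an assumption, and `translate` is a hypothetical helper, not part of the database.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed: NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 21 nt of the CDS listed in this record.
cds_start = "ATGGCTAAACAAATAGGAGGCATTACATCG"
cds_end = "ATATCATCACTAACCCAATAG"
print(translate(cds_start))  # begins like the protein sequence
print(translate(cds_end))    # ends like the protein sequence
```

The two fragments translate to MAKQIGGITS and ISSLTQ, matching the first ten and last six residues of the protein sequence.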
BLAST of CmaCh16G010070 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 7.9e-22
Identity = 59/138 (42.75%), Positives = 89/138 (64.49%), Query Frame = 1

Query: 145  DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
            D +GD D R+S TGY+F      I W +K Q  ++LST EAEY AA    +E  WLK  +
Sbjct: 1182 DMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFL 1241

Query: 205  ED--WHQKIEYLILHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKT 264
            ++   HQK EY +++ ++QS I L++NS++HARTKH++V YH+IRE V  E +++ +I T
Sbjct: 1242 QELGLHQK-EY-VVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKIST 1301

Query: 265  DDQVTDLFTKGLNTAKHE 281
            ++   D+ TK +   K E
Sbjct: 1302 NENPADMLTKVVPRNKFE 1317
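The percentages in each HSP summary line are the matched count divided by the alignment length. A minimal sketch that recomputes them from the summary line of the hit above is shown here. The helper name `hsp_percentages` is hypothetical, and labels are matched loosely so the exact spelling of the label does not matter.

```python
import re

# HSP summary line in the format used in this record.
hsp_line = "Identity = 59/138 (42.75%), Positives = 89/138 (64.49%), Query Frame = 1"

def hsp_percentages(line):
    """Return {label: percent} recomputed from each 'label = matched/length' pair."""
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result

print(hsp_percentages(hsp_line))
```

For this hit the recomputed values are 42.75 and 64.49, which agree with the percentages printed in the summary line.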

BLAST of CmaCh16G010070 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 102.4 bits (254), Expect = 8.7e-21
Identity = 62/169 (36.69%), Positives = 97/169 (57.40%), Query Frame = 1

Query: 135  DLGYSNHMI----DDQSGDHDTRRSITGYVFKL-GLRTIFWCNKRQLTISLSTREAEYRA 194
            +L + N +I     D +G    R+S TGY+FK+     I W  KRQ +++ S+ EAEY A
Sbjct: 1241 NLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMA 1300

Query: 195  AAGAAQENTWLKLLMEDWHQKIEYLILHY-NNQSVIRLAENSVFHARTKHVEVHYHFIRE 254
               A +E  WLK L+   + K+E  I  Y +NQ  I +A N   H R KH+++ YHF RE
Sbjct: 1301 LFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFARE 1360

Query: 255  KVLKEQMEMQQIKTDDQVTDLFTKGLNTAKHESFRCQLNMVQRMRTSAE 298
            +V    + ++ I T++Q+ D+FTK L  A+    R +L ++Q  +++AE
Sbjct: 1361 QVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQDDQSNAE 1409

BLAST of CmaCh16G010070 vs. TrEMBL
Match: A5AKW8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027864 PE=4 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.2e-47
Identity = 101/155 (65.16%), Positives = 120/155 (77.42%), Query Frame = 1

Query: 145  DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
            D +GDHDTR S TGYVF LG   I WC+KRQ T+SLST EAEYRAAA A QE+ WL  LM
Sbjct: 1144 DYAGDHDTRXSTTGYVFMLGSGAISWCSKRQPTVSLSTTEAEYRAAAMATQESMWLIRLM 1203

Query: 205  EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
             D HQ ++Y + L+ +NQS +RLAEN VFHARTKHVEVHYHFIREKVLKE++E+ QIK++
Sbjct: 1204 NDLHQLVDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLKEEVELNQIKSE 1263

Query: 265  DQVTDLFTKGLNTAKHESFRCQLNMVQRMRTSAEG 299
            DQV DLFTKGL+ +K ESF  QL MV+ +    EG
Sbjct: 1264 DQVADLFTKGLSGSKFESFCHQLGMVKILEADVEG 1298

BLAST of CmaCh16G010070 vs. TrEMBL
Match: A5BGK7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007301 PE=4 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 2.9e-47
Identity = 101/155 (65.16%), Positives = 121/155 (78.06%), Query Frame = 1

Query: 145 DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
           D +GDHDTRRS TGYVF LG   I WC+KRQ T+SL T EAEYRAAA AAQE+TWL  LM
Sbjct: 817 DYAGDHDTRRSTTGYVFMLGSGAISWCSKRQPTVSLLTTEAEYRAAAMAAQESTWLIRLM 876

Query: 205 EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
            D HQ ++Y + L+ +NQS +RLAEN VFHARTKHVEVHYHFIREKVL+E++E++QIK+ 
Sbjct: 877 NDLHQLVDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLEEEVELKQIKSK 936

Query: 265 DQVTDLFTKGLNTAKHESFRCQLNMVQRMRTSAEG 299
           DQV DLFTKGL+ +K E F  QL MV+ +    EG
Sbjct: 937 DQVADLFTKGLSGSKFECFCHQLGMVKILEADVEG 971

BLAST of CmaCh16G010070 vs. TrEMBL
Match: A5BJX0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_003097 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 3.8e-47
Identity = 122/251 (48.61%), Positives = 156/251 (62.15%), Query Frame = 1

Query: 60   QGKRFEGIYYNCGKKG------HMSKDCWSKKKSVESNVTSSNMEMEEEWDAEVLYAIEE 119
            Q K F G+  +C  +G        +KD   K   +E  V S  M+  ++   EV+  I  
Sbjct: 810  QLKHFLGLEVDCTHEGIFLCQQKCAKDLLKKFGMLEFGVMSRYMQNPKKPHLEVVRRILR 869

Query: 120  D-----ELALMVMMGDHIDYENDWIIDLGYSNHMIDDQSGDHDTRRSITGYVFKLGLRTI 179
                  +  L+   G+           +GY +    D +GDHDTRRS TGYVF LG R I
Sbjct: 870  HVKSTIDYGLLYKKGEDCKL-------VGYCDA---DYTGDHDTRRSTTGYVFMLGSRAI 929

Query: 180  FWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLMEDWHQKIEYLI-LHYNNQSVIRLA 239
             WC+KRQ T+SLST EAEYRAAA A QE+TWL  LM D HQ ++Y + L+ +NQ  + LA
Sbjct: 930  SWCSKRQPTVSLSTTEAEYRAAAMATQESTWLIXLMNDLHQLVDYAVPLYCDNQLAVHLA 989

Query: 240  ENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTDDQVTDLFTKGLNTAKHESFRCQLN 299
            EN VFHARTKHVEVHYHFIREKVL+E++E++QIK+ DQV DLFTKGL+ +K ESF  QL 
Sbjct: 990  ENPVFHARTKHVEVHYHFIREKVLEEEVELKQIKSGDQVADLFTKGLSGSKFESFCHQLG 1049

BLAST of CmaCh16G010070 vs. TrEMBL
Match: A0A151SZC9_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_015612 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 6.9e-41
Identity = 88/148 (59.46%), Positives = 109/148 (73.65%), Query Frame = 1

Query: 145 DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
           D +GDHDTRRS TGY+F +G   I WC+KRQ T+SLS+ E EYRA A AAQE +WL  L+
Sbjct: 214 DYTGDHDTRRSTTGYMFTMGSGAISWCSKRQSTVSLSSTEVEYRALAMAAQECSWLMQLL 273

Query: 205 EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
           +D  + ++Y + LH NNQS IRL EN VFHARTKHVEVHYHF+REKVL+  +EM+ I T+
Sbjct: 274 QDLRKLVDYPVTLHCNNQSAIRLVENPVFHARTKHVEVHYHFVREKVLQGDIEMKYINTE 333

Query: 265 DQVTDLFTKGLNTAKHESFRCQLNMVQR 292
            QV D+FTKGL+  K E+F  Q  M  R
Sbjct: 334 GQVADIFTKGLSATKFENFIKQFGMTTR 361

BLAST of CmaCh16G010070 vs. TrEMBL
Match: I1IU64_BRADI (Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 2.0e-40
Identity = 90/148 (60.81%), Positives = 110/148 (74.32%), Query Frame = 1

Query: 145  DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
            D +GD DTRRS TGY+F LG   I WC+KRQ T++LS+ EAEYR+AA AAQE+TWLK LM
Sbjct: 1260 DYAGDCDTRRSTTGYLFNLGSGAITWCSKRQPTVALSSTEAEYRSAAAAAQESTWLKQLM 1319

Query: 205  EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
            ED HQ  +  + +  +N S IRLAEN VFHARTKH+EVHYH+IREKVLK ++EM   KT+
Sbjct: 1320 EDLHQTPKDQVWIFCDNLSTIRLAENPVFHARTKHIEVHYHYIREKVLKGEIEMVPTKTE 1379

Query: 265  DQVTDLFTKGLNTAKHESFRCQLNMVQR 292
            DQ  D+ TK LN +K E FR  L MV +
Sbjct: 1380 DQTADILTKSLNKSKFEKFREALGMVTK 1407

BLAST of CmaCh16G010070 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 75.1 bits (183), Expect = 8.4e-14
Identity = 46/126 (36.51%), Positives = 68/126 (53.97%), Query Frame = 1

Query: 151 DTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLMEDWHQK 210
           DTRRS  GY   LG   I W +K+Q  +S S+ EAEYRA + A  E  WL     +    
Sbjct: 455 DTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLP 514

Query: 211 I-EYLILHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTDDQVTDL 270
           + +  +L  +N + I +A N+VFH RTKH+E   H +RE+ + +       +  D+  D 
Sbjct: 515 LSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVYQATLSYSFQAYDE-QDG 574

Query: 271 FTKGLN 276
           FT+ L+
Sbjct: 575 FTEYLS 579

BLAST of CmaCh16G010070 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 52.4 bits (124), Expect = 5.8e-07
Identity = 28/55 (50.91%), Positives = 32/55 (58.18%), Query Frame = 1

Query: 145 DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTW 200
           D +G   TRRS TG+   LG   I W  KRQ T+S S+ E EYRA A  A E TW
Sbjct: 171 DWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmaCh16G010070 vs. NCBI nr
Match: gi|147794801|emb|CAN71427.1| (hypothetical protein VITISV_027864 [Vitis vinifera])

HSP 1 Score: 197.6 bits (501), Expect = 3.2e-47
Identity = 101/155 (65.16%), Positives = 120/155 (77.42%), Query Frame = 1

Query: 145  DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
            D +GDHDTR S TGYVF LG   I WC+KRQ T+SLST EAEYRAAA A QE+ WL  LM
Sbjct: 1144 DYAGDHDTRXSTTGYVFMLGSGAISWCSKRQPTVSLSTTEAEYRAAAMATQESMWLIRLM 1203

Query: 205  EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
             D HQ ++Y + L+ +NQS +RLAEN VFHARTKHVEVHYHFIREKVLKE++E+ QIK++
Sbjct: 1204 NDLHQLVDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLKEEVELNQIKSE 1263

Query: 265  DQVTDLFTKGLNTAKHESFRCQLNMVQRMRTSAEG 299
            DQV DLFTKGL+ +K ESF  QL MV+ +    EG
Sbjct: 1264 DQVADLFTKGLSGSKFESFCHQLGMVKILEADVEG 1298

BLAST of CmaCh16G010070 vs. NCBI nr
Match: gi|147798853|emb|CAN61340.1| (hypothetical protein VITISV_007301 [Vitis vinifera])

HSP 1 Score: 197.2 bits (500), Expect = 4.2e-47
Identity = 101/155 (65.16%), Positives = 121/155 (78.06%), Query Frame = 1

Query: 145 DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
           D +GDHDTRRS TGYVF LG   I WC+KRQ T+SL T EAEYRAAA AAQE+TWL  LM
Sbjct: 817 DYAGDHDTRRSTTGYVFMLGSGAISWCSKRQPTVSLLTTEAEYRAAAMAAQESTWLIRLM 876

Query: 205 EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
            D HQ ++Y + L+ +NQS +RLAEN VFHARTKHVEVHYHFIREKVL+E++E++QIK+ 
Sbjct: 877 NDLHQLVDYAVPLYCDNQSAVRLAENPVFHARTKHVEVHYHFIREKVLEEEVELKQIKSK 936

Query: 265 DQVTDLFTKGLNTAKHESFRCQLNMVQRMRTSAEG 299
           DQV DLFTKGL+ +K E F  QL MV+ +    EG
Sbjct: 937 DQVADLFTKGLSGSKFECFCHQLGMVKILEADVEG 971

BLAST of CmaCh16G010070 vs. NCBI nr
Match: gi|147783961|emb|CAN63563.1| (hypothetical protein VITISV_003097 [Vitis vinifera])

HSP 1 Score: 196.8 bits (499), Expect = 5.4e-47
Identity = 122/251 (48.61%), Positives = 156/251 (62.15%), Query Frame = 1

Query: 60   QGKRFEGIYYNCGKKG------HMSKDCWSKKKSVESNVTSSNMEMEEEWDAEVLYAIEE 119
            Q K F G+  +C  +G        +KD   K   +E  V S  M+  ++   EV+  I  
Sbjct: 810  QLKHFLGLEVDCTHEGIFLCQQKCAKDLLKKFGMLEFGVMSRYMQNPKKPHLEVVRRILR 869

Query: 120  D-----ELALMVMMGDHIDYENDWIIDLGYSNHMIDDQSGDHDTRRSITGYVFKLGLRTI 179
                  +  L+   G+           +GY +    D +GDHDTRRS TGYVF LG R I
Sbjct: 870  HVKSTIDYGLLYKKGEDCKL-------VGYCDA---DYTGDHDTRRSTTGYVFMLGSRAI 929

Query: 180  FWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLMEDWHQKIEYLI-LHYNNQSVIRLA 239
             WC+KRQ T+SLST EAEYRAAA A QE+TWL  LM D HQ ++Y + L+ +NQ  + LA
Sbjct: 930  SWCSKRQPTVSLSTTEAEYRAAAMATQESTWLIXLMNDLHQLVDYAVPLYCDNQLAVHLA 989

Query: 240  ENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTDDQVTDLFTKGLNTAKHESFRCQLN 299
            EN VFHARTKHVEVHYHFIREKVL+E++E++QIK+ DQV DLFTKGL+ +K ESF  QL 
Sbjct: 990  ENPVFHARTKHVEVHYHFIREKVLEEEVELKQIKSGDQVADLFTKGLSGSKFESFCHQLG 1049

BLAST of CmaCh16G010070 vs. NCBI nr
Match: gi|658020190|ref|XP_008345472.1| (PREDICTED: uncharacterized protein LOC103408393 [Malus domestica])

HSP 1 Score: 185.7 bits (470), Expect = 1.3e-43
Identity = 98/144 (68.06%), Positives = 112/144 (77.78%), Query Frame = 1

Query: 145 DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
           D +GDHDT RS TGYVFKLG  TI  C+KRQ T+SLST EAEYRAAA  AQEN WL  LM
Sbjct: 252 DYAGDHDTXRSTTGYVFKLGSGTISXCSKRQPTVSLSTTEAEYRAAAMXAQENAWLVQLM 311

Query: 205 EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
            D HQ ++Y + L+ +NQS IRLAEN VFHARTKHVEVHYHFIREKVL+E +EM+Q+KT+
Sbjct: 312 SDLHQPVDYSVPLYCDNQSAIRLAENXVFHARTKHVEVHYHFIREKVLQEXIEMRQVKTN 371

Query: 265 DQVTDLFTKGLNTAK---HESFRC 285
           DQV DLFTK L+T K   H S  C
Sbjct: 372 DQVADLFTKSLSTDKSPVHFSVGC 395

BLAST of CmaCh16G010070 vs. NCBI nr
Match: gi|1012348973|gb|KYP60164.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 176.0 bits (445), Expect = 9.9e-41
Identity = 88/148 (59.46%), Positives = 109/148 (73.65%), Query Frame = 1

Query: 145 DQSGDHDTRRSITGYVFKLGLRTIFWCNKRQLTISLSTREAEYRAAAGAAQENTWLKLLM 204
           D +GDHDTRRS TGY+F +G   I WC+KRQ T+SLS+ E EYRA A AAQE +WL  L+
Sbjct: 214 DYTGDHDTRRSTTGYMFTMGSGAISWCSKRQSTVSLSSTEVEYRALAMAAQECSWLMQLL 273

Query: 205 EDWHQKIEYLI-LHYNNQSVIRLAENSVFHARTKHVEVHYHFIREKVLKEQMEMQQIKTD 264
           +D  + ++Y + LH NNQS IRL EN VFHARTKHVEVHYHF+REKVL+  +EM+ I T+
Sbjct: 274 QDLRKLVDYPVTLHCNNQSAIRLVENPVFHARTKHVEVHYHFVREKVLQGDIEMKYINTE 333

Query: 265 DQVTDLFTKGLNTAKHESFRCQLNMVQR 292
            QV D+FTKGL+  K E+F  Q  M  R
Sbjct: 334 GQVADIFTKGLSATKFENFIKQFGMTTR 361

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name     E-value    Identity    Description
POLX_TOBAC     7.9e-22    42.75       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME    8.7e-21    36.69       Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3

vs. TrEMBL
Match Name          E-value    Identity    Description
A5AKW8_VITVI        2.2e-47    65.16       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027864 PE=4 SV=1
A5BGK7_VITVI        2.9e-47    65.16       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007301 PE=4 SV=1
A5BJX0_VITVI        3.8e-47    48.61       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_003097 PE=4 SV=1
A0A151SZC9_CAJCA    6.9e-41    59.46       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
I1IU64_BRADI        2.0e-40    60.81       Uncharacterized protein OS=Brachypodium distachyon PE=4 SV=1

vs. TAIR10
Match Name     E-value    Identity    Description
AT4G23160.1    8.4e-14    36.51       cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1    5.8e-07    50.91       ATMG00810.1 DNA/RNA polymerases superfamily protein

vs. NCBI nr
Match Name                          E-value    Identity    Description
gi|147794801|emb|CAN71427.1|        3.2e-47    65.16       hypothetical protein VITISV_027864 [Vitis vinifera]
gi|147798853|emb|CAN61340.1|        4.2e-47    65.16       hypothetical protein VITISV_007301 [Vitis vinifera]
gi|147783961|emb|CAN63563.1|        5.4e-47    48.61       hypothetical protein VITISV_003097 [Vitis vinifera]
gi|658020190|ref|XP_008345472.1|    1.3e-43    68.06       PREDICTED: uncharacterized protein LOC103408393 [Malus domestica]
gi|1012348973|gb|KYP60164.1|        9.9e-41    59.46       Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR001878    Znf_CCHC
Vocabulary: Molecular Function
Term          Definition
GO:0003676    nucleic acid binding
GO:0008270    zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh16G010070.1    CmaCh16G010070.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term     IPR Description           Source     Source Term         Source Description                     Alignment
IPR001878    Zinc finger, CCHC-type    GENE3D     G3DSA:4.10.60.10                                           coord: 69..84, score: 1.
IPR001878    Zinc finger, CCHC-type    PFAM       PF00098             zf-CCHC                                coord: 69..81, score: 1.
IPR001878    Zinc finger, CCHC-type    PROFILE    PS50158             ZF_CCHC                                coord: 69..81, score: 9
IPR001878    Zinc finger, CCHC-type    unknown    SSF57756            Retrovirus zinc finger-like domains    coord: 49..89, score: 8.7
None         No IPR available          PANTHER    PTHR11439           GAG-POL-RELATED RETROTRANSPOSON        coord: 145..214, score: 8.0
None         No IPR available          PANTHER    PTHR11439:SF164     SUBFAMILY NOT NAMED                    coord: 145..214, score: 8.0
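The "coord" values in the InterPro table give residue ranges on the protein sequence. The sketch below extracts the residues behind one such range, assuming the coordinates are 1-based and inclusive, which the table itself does not state. `subsequence` is a hypothetical helper, and only the first 90 residues of the protein are copied in, which is enough to cover the zinc-finger hits.

```python
# First 90 residues of the protein sequence from this record.
protein_prefix = (
    "MAKQIGGITSKGEEEVLYTSESRSNNRPSTKRGYNGDKTRSHQGIVQLGR"
    "AYKNGNNNSQGKRFEGIYYNCGKKGHMSKDCWSKKKSVES"
)

def subsequence(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# PF00098 (zf-CCHC) hit, listed as coord: 69..81.
zf_cchc = subsequence(protein_prefix, 69, 81)
print(zf_cchc, len(zf_cchc))
```

With these assumptions the 69..81 range corresponds to the 13 residues YNCGKKGHMSKDC.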

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None