CmaCh12G009200 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh12G009200
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Integrase catalytic domain-containing protein
Location: Cma_Chr12: 7072178 .. 7073309 (-)
RNA-Seq Expression: CmaCh12G009200
Synteny: CmaCh12G009200
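The Location field packs a chromosome name, a start..end range, and a strand into one string. The following is a minimal sketch of how such a string could be split into its parts. The function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Cma_Chr12: 7072178 .. 7073309 (-)' into fields.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span on the chromosome (introns included).
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr12: 7072178 .. 7073309 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 1132 bases on the minus strand of Cma_Chr12.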
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTGGGAGATTCTGGGGAGAGGTAGTAATGACGGCCGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTTGACGGGAAGACGTCATATGAGGCCTGGTACAACAAAAAACCAGCGGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGTAACACATTCCCACCTCGCCAAGCTCTCTCCTAGGGGGCTAAAGGACGTCTTCATCGGCTATGAACTCGGGAGCAAGGCGTACAAACTATATGATCCTGTAGGGGGGAGAGCTCACGTGTCTCCCGACGTCGTCTTCGACGAAAGCACTGTCTAAGCAGTGGAATGACGTGATCGAGGCAGACCATAATCCAAATAAATTCATGGTGAAGTACCTCATCACCGAGTCGGAAGAAGGAGGAGCCCAGCATCAGCAGCCGTCACCGCCGCCAGCAGGTGCAACCCCTGAACCAGTAGAATTTACAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCCAGGTACCGAAGGATGGACGACCTAGTAGGAGGAGGTGAACCACCTGGACTGGCAGCGCACAAACTCAAAGAAATGGCCGAACTACATGCCATCAGTGCAGATGAACCGAACACCTTCGTCGAAGTAGAAAAGAACCCGTGCTGGCTGAAGGCAATGCAGGAGGAGATGACATCCATCACCGAGAACCAGACATGGAGTCTGGAGGATATACCGCCGGGACACCAAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAAAGAAAAAGGAGAAGTTGTGAAGCACAAGGCCCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTTGAAGAGGTATTTGCGCCAGTGGCAAGGTTAGAATCCGTTCGTTTCTTGCTGGCAATTACGGCACATTACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAGGAGACCGTCTATATCTGACAATCACCTGGCTTCCTGGACAACGACAACCCCAATAAAGTACTGCGCCTGCACAAGGCACTCTACGGGCTTCGACAAGCTCCACGAGCCTGGAACCCAAAGCTCGACAGTACCTTACTGTCACTGA

mRNA sequence

ATGCCTGGGAGATTCTGGGGAGAGGTAGTAATGACGGCCGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTTGACGGGAAGACGTCATATGAGGCCTGGTACAACAAAAAACCAGCGGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGGGGGAGAGCTCACGTGTCTCCCGACGTCGTCTTCGACGAAAGCACTGTCTAAGCAGTGGAATGACGTGATCGAGGCAGACCATAATCCAAATAAATTCATGGTGAAGTACCTCATCACCGAGTCGGAAGAAGGAGGAGCCCAGCATCAGCAGCCGTCACCGCCGCCAGCAGGTGCAACCCCTGAACCAGTAGAATTTACAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCCAGGTACCGAAGGATGGACGACCTAGTAGGAGGAGGTGAACCACCTGGACTGGCAGCGCACAAACTCAAAGAAATGGCCGAACTACATGCCATCAGTGCAGATGAACCGAACACCTTCGTCGAAGTAGAAAAGAACCCGTGCTGGCTGAAGGCAATGCAGGAGGAGATGACATCCATCACCGAGAACCAGACATGGAGTCTGGAGGATATACCGCCGGGACACCAAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAAAGAAAAAGGAGAAGTTGTGAAGCACAAGGCCCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTTGAAGAGGTATTTGCGCCAGTGGCAAGGTTAGAATCCGTTCGTTTCTTGCTGGCAATTACGGCACATTACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAGGAGACCGTCTATATCTGACAATCACCTGGCTTCCTGGACAACGACAACCCCAATAAAGTACTGCGCCTGCACAAGGCACTCTACGGGCTTCGACAAGCTCCACGAGCCTGGAACCCAAAGCTCGACAGTACCTTACTGTCACTGA
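The gene sequence differs from the mRNA sequence only by the intron that splicing removes. One rough way to locate such a segment is to take the longest common prefix and suffix of the two sequences and report what is left in the middle of the genomic one. The sketch below assumes a single intron and otherwise identical sequences; the function name `single_insertion` is hypothetical, and the reported boundaries can shift by a base or more when the intron ends happen to repeat the flanking exon bases.

```python
def single_insertion(genomic, spliced):
    """Return (start, end, segment) for the one block present in `genomic`
    but absent from `spliced`, using 0-based half-open coordinates.

    Returns None when the difference cannot be explained by one block.
    """
    if len(genomic) < len(spliced):
        raise ValueError("genomic sequence is shorter than the spliced one")

    # Longest common prefix.
    p = 0
    while p < len(spliced) and genomic[p] == spliced[p]:
        p += 1

    # Longest common suffix that does not overlap the prefix in `spliced`.
    s = 0
    while s < len(spliced) - p and genomic[-1 - s] == spliced[-1 - s]:
        s += 1

    # Prefix and suffix together must account for the whole spliced sequence.
    if p + s != len(spliced):
        return None
    end = len(genomic) - s
    return p, end, genomic[p:end]

# Toy sequences, not taken from this record.
exon1, intron, exon2 = "ATGAAC", "GTAACACCTGTAG", "CCCTGA"
print(single_insertion(exon1 + intron + exon2, exon1 + exon2))
```

Applied to the gene and mRNA sequences of this feature, the same call would return the candidate intron segment, subject to the boundary ambiguity noted above.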

Coding sequence (CDS)

ATGCCTGGGAGATTCTGGGGAGAGGTAGTAATGACGGCCGTCTATCTCCTCAATCGGTCACCAACCCGAAGCCTTGACGGGAAGACGTCATATGAGGCCTGGTACAACAAAAAACCAGCGGTACATCATTTTCGCGTGTTCGGCTGCGTCGCATACATGAAGGGGGGAGAGCTCACGTGTCTCCCGACGTCGTCTTCGACGAAAGCACTGTCTAAGCAGTGGAATGACGTGATCGAGGCAGACCATAATCCAAATAAATTCATGGTGAAGTACCTCATCACCGAGTCGGAAGAAGGAGGAGCCCAGCATCAGCAGCCGTCACCGCCGCCAGCAGGTGCAACCCCTGAACCAGTAGAATTTACAACACCACGGACTGCGGATTCGACGCTGGATGCCGATCACGATACTGATCTGGAGGCCAGGTACCGAAGGATGGACGACCTAGTAGGAGGAGGTGAACCACCTGGACTGGCAGCGCACAAACTCAAAGAAATGGCCGAACTACATGCCATCAGTGCAGATGAACCGAACACCTTCGTCGAAGTAGAAAAGAACCCGTGCTGGCTGAAGGCAATGCAGGAGGAGATGACATCCATCACCGAGAACCAGACATGGAGTCTGGAGGATATACCGCCGGGACACCAAGCCATAGGGCTCAAATGGGTCTTCAAACTGAAGCGCAAAGAAAAAGGAGAAGTTGTGAAGCACAAGGCCCGTCTGGTGGCGAAGGGCTACGTCCAGAAGCAAGGAGTGGACTTTGAAGAGGTATTTGCGCCAGTGGCAAGGTTAGAATCCGTTCGTTTCTTGCTGGCAATTACGGCACATTACTCTTGGGAGGTTCACCATATGGACGTAAAGTCTGCTTTCCTTAACGGAGAGTTGAGGAGACCGTCTATATCTGACAATCACCTGGCTTCCTGGACAACGACAACCCCAATAAAGTACTGCGCCTGCACAAGGCACTCTACGGGCTTCGACAAGCTCCACGAGCCTGGAACCCAAAGCTCGACAGTACCTTACTGTCACTGA

Protein sequence

MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPPAGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTTPIKYCACTRHSTGFDKLHEPGTQSSTVPYCH
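The protein sequence is the translation of the coding sequence. As a minimal sketch, assuming the standard genetic code and reading frame 0, the translation can be reproduced without external libraries. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 18 bases of the CDS listed above.
print(translate("ATGCCTGGGAGATTCTGG"))
```

The first six codons of the CDS translate to MPGRFW, which matches the start of the protein sequence shown above.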
Homology
BLAST of CmaCh12G009200 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 6.7e-33
Identity = 101/334 (30.24%), Positives = 163/334 (48.80%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAY-------- 60
           +P  FWGE V TA YL+NRSP+  L  +     W NK+ +  H +VFGC A+        
Sbjct: 605 LPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQR 664

Query: 61  --MKGGELTCLPTSSSTKALS-KQWNDV-------IEADHNPNKFMVKYLITESEEGGAQ 120
             +    + C+      +    + W+ V        +     ++      ++E  + G  
Sbjct: 665 TKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGII 724

Query: 121 HQQPSPPPAGATPEPVEFTTPRTA-----------------DSTLDADHDTDLEARYRRM 180
               + P     P   E TT   +                 +   + +H T  E +++ +
Sbjct: 725 PNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPL 784

Query: 181 DDLVGGGEPPGLAAHKLKEMAELHAISAD-EPNTFVEV----EKNPCWLKAMQEEMTSIT 240
                  E P + + +     E   IS D EP +  EV    EKN   +KAMQEEM S+ 
Sbjct: 785 ----RRSERPRVESRRYPS-TEYVLISDDREPESLKEVLSHPEKNQL-MKAMQEEMESLQ 844

Query: 241 ENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPV 295
           +N T+ L ++P G + +  KWVFKLK+    ++V++KARLV KG+ QK+G+DF+E+F+PV
Sbjct: 845 KNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPV 904
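Each HSP header reports identity and positive counts as a fraction of the alignment length, followed by the percentage. The sketch below, with a hypothetical helper named `hsp_percentages`, recomputes those percentages from the raw counts as a consistency check. It also tolerates the misspelled label "Postives" that some exports of this report carry.

```python
import re

# Matches "Identity = 101/334" and "Positives = 163/334"
# (also the misspelling "Postives").
_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return the percentages implied by the counts in an HSP statistics line."""
    result = {}
    for name, matched, total in _STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(total), 2)
    return result

line = "Identity = 101/334 (30.24%), Positives = 163/334 (48.80%), Query Frame = 0"
print(hsp_percentages(line))
```

For the P10978 hit, 101 identical and 163 positive positions over 334 aligned columns give 30.24% and 48.80%, in agreement with the reported values.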

BLAST of CmaCh12G009200 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 133.7 bits (335), Expect = 4.1e-30
Identity = 109/405 (26.91%), Positives = 166/405 (40.99%), Query Frame = 0

Query: 5    FWGEVVMTAVYLLNRSPTRSL--DGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTCLP 64
            FWGE V+TA YL+NR P+R+L    KT YE W+NKKP + H RVFG   Y+         
Sbjct: 609  FWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQGKF 668

Query: 65   TSSSTKAL--------SKQWND-----------VIEADHNPNKFMVKY---LITESEEGG 124
               S K++         K W+            V++  +  N   VK+    + +S+E  
Sbjct: 669  DDKSFKSIFVGYEPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESE 728

Query: 125  AQH--------QQPSPPPAGATPEPVEFTTPRTADSTLDADHDTDL-------------- 184
             ++         Q   P      + ++F          +  +D+                
Sbjct: 729  NKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECD 788

Query: 185  -----------------EARYRRMDDLV----GGGEP----PGLAAHKLKEMA------- 244
                             E++ R+ DD +    G G P        A  LKE+        
Sbjct: 789  NIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKN 848

Query: 245  ---------------------------------ELHAISADEPNTFVEV---EKNPCWLK 296
                                               H I  D PN+F E+   +    W +
Sbjct: 849  DGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEE 908

BLAST of CmaCh12G009200 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 2.9e-20
Identity = 72/249 (28.92%), Positives = 108/249 (43.37%), Query Frame = 0

Query: 55   GGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPP---- 114
            G + T  PT + T+  S Q      + +NP       L  +S    AQ    SP P    
Sbjct: 838  GPQPTTQPTQTQTQTHSSQ----NTSQNNPTNESPSQL-AQSLSTPAQSSSSSPSPTTSA 897

Query: 115  ----AGATPEPVEFTTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMA 174
                   TP  +    P      ++ ++   L          +G     G+     K   
Sbjct: 898  SSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTH------SMGTRAKAGIIKPNPKYSL 957

Query: 175  ELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAI-GLKWVFKL 234
             +   +  EP T ++  K+  W  AM  E+ +   N TW L   PP H  I G +W+F  
Sbjct: 958  AVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTK 1017

Query: 235  KRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDV 294
            K    G + ++KARLVAKGY Q+ G+D+ E F+PV +  S+R +L +    SW +  +DV
Sbjct: 1018 KYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDV 1075

BLAST of CmaCh12G009200 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.5e-19
Identity = 47/121 (38.84%), Positives = 70/121 (57.85%), Query Frame = 0

Query: 175  EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSL-EDIPPGHQAIGLKWVFKLKRKEKGEV 234
            EP T ++  K+  W +AM  E+ +   N TW L    PP    +G +W+F  K    G +
Sbjct: 938  EPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSL 997

Query: 235  VKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGE 294
             ++KARLVAKGY Q+ G+D+ E F+PV +  S+R +L +    SW +  +DV +AFL G 
Sbjct: 998  NRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGT 1057

BLAST of CmaCh12G009200 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 9.8e-16
Identity = 40/98 (40.82%), Positives = 62/98 (63.27%), Query Frame = 0

Query: 175 EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVV 234
           EP + +   K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K    G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 235 KHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI 273
           + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNV 124

BLAST of CmaCh12G009200 vs. ExPASy TrEMBL
Match: Q7XPB1 (OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica OX=39947 GN=OSJNBb0026E15.10 PE=4 SV=2)

HSP 1 Score: 304.7 bits (779), Expect = 4.9e-79
Identity = 166/344 (48.26%), Positives = 211/344 (61.34%), Query Frame = 0

Query: 1    MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------ 60
            +PGRFWGE + TAV+LLNRSPT+SLD +T YEAWY + PAVH  R FGCV ++K      
Sbjct: 711  VPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVKITKPGL 770

Query: 61   ------GGELTCLPTSSSTKA----------------------LSKQWNDVI-EADHNPN 120
                     +  L     +KA                      ++  W  V  +      
Sbjct: 771  KKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTPDGAPQLE 830

Query: 121  KFMVKYLITESEEGGAQHQQPSP---------------PPAGATPEPVEFTTPRTADSTL 180
             F V+ ++T +  G A    P+P               PP+  +PE VEF TP T DS L
Sbjct: 831  PFTVEQVVT-TTIGTAPASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPTQDSIL 890

Query: 181  DADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLK 240
            DAD D D+  RYR +D+L+G   PPG A   L+++ ELH +SADEP +  E E +P W  
Sbjct: 891  DADADDDVVPRYRLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEADPSWRG 950

Query: 241  AMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQG 295
            AMQ+E+ +I +N TWSL D+P GH+AIGLKWV+KLKR E+G +V++KARLVAKGYVQ+QG
Sbjct: 951  AMQDELNAIVDNDTWSLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGYVQRQG 1010

BLAST of CmaCh12G009200 vs. ExPASy TrEMBL
Match: Q2QSF4 (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica OX=39947 GN=LOC_Os12g24300 PE=4 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 2.8e-74
Identity = 163/340 (47.94%), Positives = 210/340 (61.76%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------- 60
           MPG FWGE V TAV+LLNRSPT+ L  KT Y AWY ++PAVH  R FGC+ ++       
Sbjct: 81  MPGLFWGEAVSTAVFLLNRSPTKFLANKTPYRAWYGERPAVHFLRTFGCIGHVNNDKPGL 140

Query: 61  -----KGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEE------GGAQ 120
                +   +  L     +KA  + ++ V   D  P  F V++ ++ ++E          
Sbjct: 141 KKLDDRSAPIVLLGYEQGSKAY-RLYDPV--GDREP--FTVEHAVSTTKEEVVTPASSPP 200

Query: 121 HQQPSPPPAGATPE--------PVEFTTPRTADSTLDADHDTD-LEARYRRMDDLVGGGE 180
             Q  P P   TPE         VE  +P + DS LD D D +  E RYR +D+++G   
Sbjct: 201 RSQSPPTPVSPTPEQGMQATEAEVEHASPPSHDSILDTDADPEKREPRYRTLDNILGPAS 260

Query: 181 PPGLAAHKLKEMAELHAISADEPNTFVEVEK--NPCWLKAMQEEMTSITENQTWSLEDIP 240
           P G+A  +L E  ELHA+SA+EP++  E E   +P W  AM++EM SI EN TWSL D+P
Sbjct: 261 PSGMAT-RLLEQLELHAVSAEEPSSLAEAEAEVDPNWKGAMEDEMKSIEENNTWSLTDLP 320

Query: 241 PGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLA 300
            GH+AIGLKWV+KLKR E+G VV HKARLVAKGYVQ+QGVDF+EVF PVARLESV  LLA
Sbjct: 321 AGHRAIGLKWVYKLKRDEQGAVVCHKARLVAKGYVQRQGVDFDEVFTPVARLESVHLLLA 380

Query: 301 ITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTT 312
           + AH +WEVHHMDVKSAFLN EL+     +  L +W+  T
Sbjct: 381 VAAHQAWEVHHMDVKSAFLNSELQ-----EMQLRTWSLHT 409

BLAST of CmaCh12G009200 vs. ExPASy TrEMBL
Match: A0A7I8IFL9 (Hypothetical protein OS=Spirodela intermedia OX=51605 GN=SI7747_02003168 PE=4 SV=1)

HSP 1 Score: 287.3 bits (734), Expect = 8.2e-74
Identity = 160/344 (46.51%), Positives = 214/344 (62.21%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------- 60
           +PG FWGE V TAVY+LNR PT+S+DG T +E W+ KKPAVHH +VFGC+AY+       
Sbjct: 134 LPGWFWGEAVATAVYILNRCPTKSVDGMTPFEVWHGKKPAVHHLKVFGCIAYVLNTTPHL 193

Query: 61  -----KGGELTCLPTSSSTKALSKQ---------WNDVI------EADHNPNKFMVKYLI 120
                +G ++  +     +KA               DV+      +A  + + F ++Y +
Sbjct: 194 KKLEDRGRKMIFIGYEYGSKAYRAYDPTARRVHVTRDVVFDENARDAGSHDDMFTLEYAV 253

Query: 121 -----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLD 180
                       E  +  A   +P  PP+     GA P     E VEF +P +     LD
Sbjct: 254 GNQVPPELDGAIEVLDDDAPGPEPMSPPSVHYSGGAAPIEDGGEEVEFASPPSVHIDHLD 313

Query: 181 ADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKA 240
           A+HD D   R+R++D++VG   P GLA+  L    ELHA+S+DEP +FVE E +P W KA
Sbjct: 314 ANHD-DAPLRFRKIDNIVGLASPRGLASRAL-IAKELHAVSSDEPVSFVEAEGHPSWRKA 373

Query: 241 MQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGV 296
           M+EEM SI EN+TWSL D+P G +AIGLKWV+K+KR E G V K+KARLV KGY Q+QG+
Sbjct: 374 MEEEMASIEENRTWSLVDLPHGRRAIGLKWVYKVKRDENGAVAKYKARLVVKGYAQRQGI 433

BLAST of CmaCh12G009200 vs. ExPASy TrEMBL
Match: A0A3L6TJD2 (Integrase catalytic domain-containing protein OS=Panicum miliaceum OX=4540 GN=C2845_PM01G10330 PE=4 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 1.1e-70
Identity = 171/369 (46.34%), Positives = 209/369 (56.64%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVA-------YM 60
           +PG FWGE V TAV++LNRSPTRSLDGKT YEA +  +PAV   R FGC+A       Y+
Sbjct: 166 LPGTFWGEAVTTAVFILNRSPTRSLDGKTPYEA-HGDRPAVSFLRTFGCIAHVRNTKPYL 225

Query: 61  KGGELTCLPT-----SSSTKALS--------------------KQWNDVIEA---DHNPN 120
           K  E    P       + +KA                       QW+   EA   D   N
Sbjct: 226 KKLEDRSTPMIFVGYEAGSKAYRVYNPVDGRVLVTRDVVFDEVAQWDWGAEAGGEDSGGN 285

Query: 121 -KFMVKY-LITESEEGG---------------------------------AQHQQPSPPP 180
            +F V +    ES  GG                                   H  PSP P
Sbjct: 286 GEFTVVFPANVESVVGGEPAGRSPAAMNSPPSATTRSTDAVPTTPSSPMVTAHTPPSPAP 345

Query: 181 ----AGATPEPVEFTT-PRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEM 240
                  +PEP+EF T P      LDADHD D+  R+RR+D+L+G G  PGLA  ++   
Sbjct: 346 TLNVTHVSPEPIEFATPPSNFHDDLDADHDDDVPVRFRRLDELLGPGTLPGLAP-RVSGD 405

Query: 241 AELHAISADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKL 295
            EL   +A+EP +F E EK+ CW +AM+EEM SI EN+TWSL ++P GH+ IGLKWVFK+
Sbjct: 406 GELLFTTAEEPTSFKEAEKHQCWQRAMEEEMKSIEENKTWSLIELPAGHKPIGLKWVFKV 465

BLAST of CmaCh12G009200 vs. ExPASy TrEMBL
Match: A0A7I8IJM7 (Hypothetical protein OS=Spirodela intermedia OX=51605 GN=SI7747_04004410 PE=4 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 1.4e-70
Identity = 159/354 (44.92%), Positives = 214/354 (60.45%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------- 60
           +PG FWGE V TAVY+LNR PT+S+DG T +E W+ KKPAVHH +VFGC+AY+       
Sbjct: 155 LPGWFWGEAVATAVYILNRCPTKSVDGMTPFEVWHGKKPAVHHLKVFGCIAYVLNTTPHL 214

Query: 61  -----KGGELTCLPTSSSTKAL--------------------SKQWN-----DVIEADHN 120
                +G ++  +     +KA                     + QW+     +  +A  +
Sbjct: 215 KKLEDRGRKMIFIGYEYGSKAYRAYDPTARRVHVTRDVVFDENAQWDWGSGAEQGDAGSH 274

Query: 121 PNKFMVKYLI-----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTT 180
            + F ++Y +            E  +  A   +P  PP+     GA P     E VEF +
Sbjct: 275 DDMFTLEYAVGNQVPPELDGAIEVLDDDAPGPEPMSPPSVHYSGGAAPIEDGGEEVEFAS 334

Query: 181 PRTAD-STLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVE 240
           P +     LDA+HD D   R+R++D++VG   P GLA+  L    ELHA+S+DEP +FVE
Sbjct: 335 PPSVHIDHLDANHD-DAPLRFRKIDNIVGLASPRGLASRAL-IAKELHAVSSDEPVSFVE 394

Query: 241 VEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLV 296
            E +P W KAM+EEM SI EN+T SL D+P G +AIGLKWV+K+KR E   VVK+KARLV
Sbjct: 395 AEGHPSWRKAMEEEMASIEENRTGSLVDLPHGRRAIGLKWVYKVKRDENRAVVKYKARLV 454

BLAST of CmaCh12G009200 vs. NCBI nr
Match: XP_023522344.1 (uncharacterized protein LOC111786267, partial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 365.5 bits (937), Expect = 4.9e-97
Identity = 210/359 (58.50%), Positives = 226/359 (62.95%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMKGGELTC 60
           MPGRFWGE VMTAVYLLNRSPTRSLDGKT YEAW   +  V    VF             
Sbjct: 1   MPGRFWGEAVMTAVYLLNRSPTRSLDGKTPYEAW--GRAHVSRDVVF------------- 60

Query: 61  LPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPPAGATPEPVEF 120
                  ++   QWNDVIEADHNPN+F V+YL+TE EE GAQHQ+PSPPPAGA PEPVEF
Sbjct: 61  ------DESTFWQWNDVIEADHNPNQFTVEYLVTEPEE-GAQHQEPSPPPAGAPPEPVEF 120

Query: 121 TTPRTADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFV 180
            TPRTA+STLDADHDT LEARYRR+DDLVGGGEPPGLAA +LKE+AELHAISADEPNTF 
Sbjct: 121 ATPRTANSTLDADHDTYLEARYRRIDDLVGGGEPPGLAARELKEVAELHAISADEPNTFA 180

Query: 181 EVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARL 240
           E EKNPCW                                    LKR +K EVVK+KARL
Sbjct: 181 EAEKNPCW------------------------------------LKRNKKREVVKYKARL 240

Query: 241 VAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNGEL------ 300
           V KGYVQK GVDFEEVFAPV RLESVRFLL+I AHYSWEVHHMDVKSAFLN EL      
Sbjct: 241 VVKGYVQKLGVDFEEVFAPVVRLESVRFLLSIAAHYSWEVHHMDVKSAFLNEELKETVYV 300

Query: 301 RRP----------------------------------------SISDNHLASWTTTTPI 314
           R+P                                        S+SDNHLASWTTTT I
Sbjct: 301 RQPPGFLDNDNQNKVLRLHKALYGLRQAPLAWNAKLDSTLLSXSMSDNHLASWTTTTRI 301

BLAST of CmaCh12G009200 vs. NCBI nr
Match: CAE03692.2 (OSJNBb0026E15.10 [Oryza sativa Japonica Group])

HSP 1 Score: 304.7 bits (779), Expect = 1.0e-78
Identity = 166/344 (48.26%), Positives = 211/344 (61.34%), Query Frame = 0

Query: 1    MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------ 60
            +PGRFWGE + TAV+LLNRSPT+SLD +T YEAWY + PAVH  R FGCV ++K      
Sbjct: 711  VPGRFWGEAMSTAVFLLNRSPTKSLDNQTPYEAWYGQWPAVHFLRTFGCVGHVKITKPGL 770

Query: 61   ------GGELTCLPTSSSTKA----------------------LSKQWNDVI-EADHNPN 120
                     +  L     +KA                      ++  W  V  +      
Sbjct: 771  KKLDDRSAPMVLLGYEQGSKAYRLYDPVSERVHVSRDVVFDEDIAWDWGPVTPDGAPQLE 830

Query: 121  KFMVKYLITESEEGGAQHQQPSP---------------PPAGATPEPVEFTTPRTADSTL 180
             F V+ ++T +  G A    P+P               PP+  +PE VEF TP T DS L
Sbjct: 831  PFTVEQVVT-TTIGTAPASSPTPPSPPSPAPSAPTTPAPPSPPSPEAVEFVTPPTQDSIL 890

Query: 181  DADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLK 240
            DAD D D+  RYR +D+L+G   PPG A   L+++ ELH +SADEP +  E E +P W  
Sbjct: 891  DADADDDVVPRYRLVDNLLGNASPPGHAPRVLEQL-ELHVVSADEPASLAEAEADPSWRG 950

Query: 241  AMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQG 295
            AMQ+E+ +I +N TWSL D+P GH+AIGLKWV+KLKR E+G +V++KARLVAKGYVQ+QG
Sbjct: 951  AMQDELNAIVDNDTWSLTDLPHGHRAIGLKWVYKLKRDEQGAIVRYKARLVAKGYVQRQG 1010

BLAST of CmaCh12G009200 vs. NCBI nr
Match: XP_023521510.1 (uncharacterized protein LOC111785335 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 298.9 bits (764), Expect = 5.6e-77
Identity = 185/395 (46.84%), Positives = 211/395 (53.42%), Query Frame = 0

Query: 17  LNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYMK------------GGELTCLPTS 76
           +NRSPTR L GKTSYEAWYNKK AVHHFRVF C+AYMK            G ++  +   
Sbjct: 41  VNRSPTRRLYGKTSYEAWYNKKLAVHHFRVFDCIAYMKVVHPHLAKLDPRGLKVVFIGYE 100

Query: 77  SSTKALSKQWNDVIEADHNPNKFMVKYLITESEEGGAQHQQPSPPPAGATPEPVEFTTPR 136
             +K      NDVIE D NPN+F V+YLITE  EGGAQH++ SP  A  TP+PVEF TPR
Sbjct: 101 PRSK------NDVIEVDQNPNQFTVEYLITEPREGGAQHRELSPTSAAVTPKPVEFATPR 160

Query: 137 TADSTLDADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEK 196
           TADSTLD DHD DL ARYRRMDDLVGGGEPPGLA  +L+E+ ELHAIS DEPNTF + E+
Sbjct: 161 TADSTLDVDHDDDLVARYRRMDDLVGGGEPPGLATRELEEVTELHAISTDEPNTFAKAER 220

Query: 197 NPCWLKAMQEEMTS-------------------------------------ITENQTWS- 256
           NPC LK    ++ S                                     ITE +    
Sbjct: 221 NPCCLKVHHMDVKSAFLNGELKETIDVQQPPGFLDNDNPGKNPNQFTVEYLITEPREGGA 280

Query: 257 -------------------------------------------LEDI-----PPGHQAIG 296
                                                      ++D+     PPG     
Sbjct: 281 QHRELSPTSAAVTPKPVEFATPRTADSTLDVDHDDDLVARYRRMDDLVGGGEPPGLATRE 340

BLAST of CmaCh12G009200 vs. NCBI nr
Match: ABA97666.1 (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 288.9 bits (738), Expect = 5.8e-74
Identity = 163/340 (47.94%), Positives = 210/340 (61.76%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------- 60
           MPG FWGE V TAV+LLNRSPT+ L  KT Y AWY ++PAVH  R FGC+ ++       
Sbjct: 81  MPGLFWGEAVSTAVFLLNRSPTKFLANKTPYRAWYGERPAVHFLRTFGCIGHVNNDKPGL 140

Query: 61  -----KGGELTCLPTSSSTKALSKQWNDVIEADHNPNKFMVKYLITESEE------GGAQ 120
                +   +  L     +KA  + ++ V   D  P  F V++ ++ ++E          
Sbjct: 141 KKLDDRSAPIVLLGYEQGSKAY-RLYDPV--GDREP--FTVEHAVSTTKEEVVTPASSPP 200

Query: 121 HQQPSPPPAGATPE--------PVEFTTPRTADSTLDADHDTD-LEARYRRMDDLVGGGE 180
             Q  P P   TPE         VE  +P + DS LD D D +  E RYR +D+++G   
Sbjct: 201 RSQSPPTPVSPTPEQGMQATEAEVEHASPPSHDSILDTDADPEKREPRYRTLDNILGPAS 260

Query: 181 PPGLAAHKLKEMAELHAISADEPNTFVEVEK--NPCWLKAMQEEMTSITENQTWSLEDIP 240
           P G+A  +L E  ELHA+SA+EP++  E E   +P W  AM++EM SI EN TWSL D+P
Sbjct: 261 PSGMAT-RLLEQLELHAVSAEEPSSLAEAEAEVDPNWKGAMEDEMKSIEENNTWSLTDLP 320

Query: 241 PGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLA 300
            GH+AIGLKWV+KLKR E+G VV HKARLVAKGYVQ+QGVDF+EVF PVARLESV  LLA
Sbjct: 321 AGHRAIGLKWVYKLKRDEQGAVVCHKARLVAKGYVQRQGVDFDEVFTPVARLESVHLLLA 380

Query: 301 ITAHYSWEVHHMDVKSAFLNGELRRPSISDNHLASWTTTT 312
           + AH +WEVHHMDVKSAFLN EL+     +  L +W+  T
Sbjct: 381 VAAHQAWEVHHMDVKSAFLNSELQ-----EMQLRTWSLHT 409

BLAST of CmaCh12G009200 vs. NCBI nr
Match: CAA2616957.1 (unnamed protein product [Spirodela intermedia])

HSP 1 Score: 287.3 bits (734), Expect = 1.7e-73
Identity = 160/344 (46.51%), Positives = 214/344 (62.21%), Query Frame = 0

Query: 1   MPGRFWGEVVMTAVYLLNRSPTRSLDGKTSYEAWYNKKPAVHHFRVFGCVAYM------- 60
           +PG FWGE V TAVY+LNR PT+S+DG T +E W+ KKPAVHH +VFGC+AY+       
Sbjct: 134 LPGWFWGEAVATAVYILNRCPTKSVDGMTPFEVWHGKKPAVHHLKVFGCIAYVLNTTPHL 193

Query: 61  -----KGGELTCLPTSSSTKALSKQ---------WNDVI------EADHNPNKFMVKYLI 120
                +G ++  +     +KA               DV+      +A  + + F ++Y +
Sbjct: 194 KKLEDRGRKMIFIGYEYGSKAYRAYDPTARRVHVTRDVVFDENARDAGSHDDMFTLEYAV 253

Query: 121 -----------TESEEGGAQHQQPSPPPA-----GATP-----EPVEFTTPRTAD-STLD 180
                       E  +  A   +P  PP+     GA P     E VEF +P +     LD
Sbjct: 254 GNQVPPELDGAIEVLDDDAPGPEPMSPPSVHYSGGAAPIEDGGEEVEFASPPSVHIDHLD 313

Query: 181 ADHDTDLEARYRRMDDLVGGGEPPGLAAHKLKEMAELHAISADEPNTFVEVEKNPCWLKA 240
           A+HD D   R+R++D++VG   P GLA+  L    ELHA+S+DEP +FVE E +P W KA
Sbjct: 314 ANHD-DAPLRFRKIDNIVGLASPRGLASRAL-IAKELHAVSSDEPVSFVEAEGHPSWRKA 373

Query: 241 MQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVVKHKARLVAKGYVQKQGV 296
           M+EEM SI EN+TWSL D+P G +AIGLKWV+K+KR E G V K+KARLV KGY Q+QG+
Sbjct: 374 MEEEMASIEENRTWSLVDLPHGRRAIGLKWVYKVKRDENGAVAKYKARLVVKGYAQRQGI 433

BLAST of CmaCh12G009200 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 122.9 bits (307), Expect = 5.1e-28
Identity = 54/122 (44.26%), Positives = 85/122 (69.67%), Query Frame = 0

Query: 173 ADEPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGE 232
           A EP+T+ E ++   W  AM +E+ ++    TW +  +PP  + IG KWV+K+K    G 
Sbjct: 83  AKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGT 142

Query: 233 VVKHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAITAHYSWEVHHMDVKSAFLNG 292
           + ++KARLVAKGY Q++G+DF E F+PV +L SV+ +LAI+A Y++ +H +D+ +AFLNG
Sbjct: 143 IERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNG 202

Query: 293 EL 295
           +L
Sbjct: 203 DL 204

BLAST of CmaCh12G009200 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 85.9 bits (211), Expect = 6.9e-17
Identity = 40/98 (40.82%), Positives = 62/98 (63.27%), Query Frame = 0

Query: 175 EPNTFVEVEKNPCWLKAMQEEMTSITENQTWSLEDIPPGHQAIGLKWVFKLKRKEKGEVV 234
           EP + +   K+P W +AMQEE+ +++ N+TW L   P     +G KWVFK K    G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 235 KHKARLVAKGYVQKQGVDFEEVFAPVARLESVRFLLAI 273
           + KARLVAKG+ Q++G+ F E ++PV R  ++R +L +
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNV 124

The following BLAST results are available for this feature:
BLAST of CmaCh12G009200 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 6.7e-33 | 30.24 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 4.1e-30 | 26.91 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q94HW2 | 2.9e-20 | 28.92 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 1.5e-19 | 38.84 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P92520 | 9.8e-16 | 40.82 | Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ...

BLAST of CmaCh12G009200 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
Q7XPB1 | 4.9e-79 | 48.26 | OSJNBb0026E15.10 protein OS=Oryza sativa subsp. japonica OX=39947 GN=OSJNBb0026E...
Q2QSF4 | 2.8e-74 | 47.94 | Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ...
A0A7I8IFL9 | 8.2e-74 | 46.51 | Hypothetical protein OS=Spirodela intermedia OX=51605 GN=SI7747_02003168 PE=4 SV...
A0A3L6TJD2 | 1.1e-70 | 46.34 | Integrase catalytic domain-containing protein OS=Panicum miliaceum OX=4540 GN=C2...
A0A7I8IJM7 | 1.4e-70 | 44.92 | Hypothetical protein OS=Spirodela intermedia OX=51605 GN=SI7747_04004410 PE=4 SV...

BLAST of CmaCh12G009200 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023522344.1 | 4.9e-97 | 58.50 | uncharacterized protein LOC111786267, partial [Cucurbita pepo subsp. pepo]
CAE03692.2 | 1.0e-78 | 48.26 | OSJNBb0026E15.10 [Oryza sativa Japonica Group]
XP_023521510.1 | 5.6e-77 | 46.84 | uncharacterized protein LOC111785335 [Cucurbita pepo subsp. pepo]
ABA97666.1 | 5.8e-74 | 47.94 | retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]
CAA2616957.1 | 1.7e-73 | 46.51 | unnamed protein product [Spirodela intermedia]

BLAST of CmaCh12G009200 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 5.1e-28 | 44.26 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1 | 6.9e-17 | 40.82 | Reverse transcriptase (RNA-dependent DNA polymerase)
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 202..295; e-value: 1.2E-35; score: 123.3
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 97..134
None | No IPR available | PANTHER | PTHR45895 | FAMILY NOT NAMED | coord: 5..53; coord: 170..294
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 2..39
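The Alignment column gives each domain hit as a coordinate range on the protein. The sketch below shows one way to turn such a range into a length and a subsequence. It assumes the coordinates are 1-based, inclusive residue positions, which is the usual InterPro convention but is not stated in this record. The helper names `parse_coord` and `slice_region` are hypothetical.

```python
import re

def parse_coord(text):
    """Parse 'coord: 202..295' (or just '202..295') into (start, end)."""
    m = re.search(r"(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in {text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    if start < 1 or end < start:
        raise ValueError(f"invalid range {start}..{end}")
    return start, end

def slice_region(protein, text):
    """Return the residues covered by a 1-based inclusive coordinate range."""
    start, end = parse_coord(text)
    if end > len(protein):
        raise ValueError("range extends past the end of the sequence")
    return protein[start - 1:end]

start, end = parse_coord("coord: 202..295")
print(end - start + 1)                          # length of the PF07727 hit
print(slice_region("MPGRFWGEVV", "coord: 2..4"))
```

Under that assumption the PF07727 (RVT_2) hit covers 94 residues, and passing the full protein sequence to `slice_region` would return the matched segment.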

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh12G009200.1 | CmaCh12G009200.1 | mRNA