CmoCh02G015640 (gene) Cucurbita moschata (Rifu)

Name: CmoCh02G015640
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cmo_Chr02 : 9041144 .. 9042116 (-)
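
The location field can be split into a sequence ID, a coordinate pair, and a strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3); the function name `parse_location` and the regular expression are illustrative and not part of the database.

```python
import re

def parse_location(text):
    """Parse a location such as 'Cmo_Chr02 : 9041144 .. 9042116 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr02 : 9041144 .. 9042116 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # prints: Cmo_Chr02 - 973
```

The minus strand flag means the feature lies on the reverse strand of Cmo_Chr02.
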
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGTAGCTGGTTTTTCTTCTTCTGCACCATTGTTACCAATCTTTAATGGTGAGAAATATGAGTGGTGGAGCATCAAGATGAAGACCTTGCTCAGATCGCAGGAGCTATGGGACTTGGTGGAGCACGGGTTTGTTGATCTTTTAGAACCCACAATAGAAGAAAAGGAGAGACTAAGAGAAACCAAGAAAAACGATGCCAAGGCTTTATTCATTATTCAGCAAGCAGTTCATGAGACTATCTTTTCACGAATTGCAGCAGCAACCACATCAAAGCAAGCATGGTCAATTCTGCAGAAAGAGTTTCAGGGAGATTCAAAAGTCATAATAGTGAAATTGCAGTCTCTAAGACGTGATTTTGAAACTCTGCTCATGACGAATGGCGAATCAATTGCTGACTTTTTGTCCAGAACAATGGCAATAGTCAGTCAGATGCGCACCTATGGAGAGAAAATTTCAGACGAAACAATTGTTGCAAAGGTGTTGAGAAGCTTAACTCCAAAGTTTGACCATGTGGTGGCTGCCATAGAAGAAGCCAAGGATCTATCCATACTCTCCGTTGATGAACTGATGGGCTCGCTTCAGGCTCATGAGGCAAGAATCAACAGAGCATCAGAAAGGAACGAAGAAAAGGCACTACAAGTGAAGGAGACAACCAATAACGAAAGAGAAAATATTCATTTAGCAGGTAGAAGTCGTGGAAGAGGAGGATTTCGCAACTTCCATGGTAGTCGTGATAACAGTTGGAGAAGTGATGGACAGAGACAATTCAATGAACAAAGGAATGTCATACAATGTTACCATTGTAGAAGGTATGGGCACACAAAATCTAATTGTTGGTATAAAAATCAACGAATGAATTTTGCAGCAGAGAATGAAGAAGAAGAAGAAAAGTTGTTTGTGGCGTGCATGGATACTAATCCGGAAAAAGGTAGCTTATGGTTTGTTGATAGCGGATGCTCGAACCATATGA

mRNA sequence

ATGGCAGTAGCTGGTTTTTCTTCTTCTGCACCATTGTTACCAATCTTTAATGGTGAGAAATATGAGTGGTGGAGCATCAAGATGAAGACCTTGCTCAGATCGCAGGAGCTATGGGACTTGGTGGAGCACGGGTTTGTTGATCTTTTAGAACCCACAATAGAAGAAAAGGAGAGACTAAGAGAAACCAAGAAAAACGATGCCAAGGCTTTATTCATTATTCAGCAAGCAGTTCATGAGACTATCTTTTCACGAATTGCAGCAGCAACCACATCAAAGCAAGCATGGTCAATTCTGCAGAAAGAGTTTCAGGGAGATTCAAAAGTCATAATAGTGAAATTGCAGTCTCTAAGACGTGATTTTGAAACTCTGCTCATGACGAATGGCGAATCAATTGCTGACTTTTTGTCCAGAACAATGGCAATAGTCAGTCAGATGCGCACCTATGGAGAGAAAATTTCAGACGAAACAATTGTTGCAAAGGTGTTGAGAAGCTTAACTCCAAAGTTTGACCATGTGGTGGCTGCCATAGAAGAAGCCAAGGATCTATCCATACTCTCCGTTGATGAACTGATGGGCTCGCTTCAGGCTCATGAGGCAAGAATCAACAGAGCATCAGAAAGGAACGAAGAAAAGGCACTACAAGTGAAGGAGACAACCAATAACGAAAGAGAAAATATTCATTTAGCAGGTAGAAGTCGTGGAAGAGGAGGATTTCGCAACTTCCATGGTAGTCGTGATAACAGTTGGAGAAGTGATGGACAGAGACAATTCAATGAACAAAGGAATAGAATGAAGAAGAAGAAGAAAAGTTGTTTGTGGCGTGCATGGATACTAATCCGGAAAAAGGTAGCTTATGGTTTGTTGATAGCGGATGCTCGAACCATATGA
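
The gene and mRNA sequences above differ only by the intron. One way to locate it is sketched below, assuming a single intron and identical exon sequence in both records; the helper `locate_single_intron` is hypothetical, and the two excerpts are short stretches copied from the sequences above around the point where they diverge. When the bases flanking a splice site repeat, the boundary found this way may be shifted by a few positions.

```python
def locate_single_intron(gene, mrna):
    """Return (start, end) such that gene[start:end] is absent from mrna."""
    if len(gene) < len(mrna):
        raise ValueError("gene sequence is shorter than the mRNA")
    # Longest common prefix of the two sequences.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Longest common suffix, capped so it cannot overlap the prefix.
    suffix = 0
    while (suffix < len(mrna) - prefix
           and gene[len(gene) - 1 - suffix] == mrna[len(mrna) - 1 - suffix]):
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences differ by more than one contiguous insertion")
    return prefix, len(gene) - suffix

# Excerpts copied from the gene and mRNA sequences of this record.
gene_excerpt = ("CAAAGGAAT"
                "GTCATACAATGTTACCATTGTAGAAGGTATGGGCACACAAAATCTAATTG"
                "TTGGTATAAAAATCAACGAATGAATTTTGCAGCAG"
                "AGAATGAAGAAG")
mrna_excerpt = "CAAAGGAATAGAATGAAGAAG"

start, end = locate_single_intron(gene_excerpt, mrna_excerpt)
intron = gene_excerpt[start:end]
print(intron[:2], intron[-2:])  # prints: GT AG
```

The recovered slice begins with GT and ends with AG, the canonical splice-site dinucleotides.
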

Coding sequence (CDS)

ATGGCAGTAGCTGGTTTTTCTTCTTCTGCACCATTGTTACCAATCTTTAATGGTGAGAAATATGAGTGGTGGAGCATCAAGATGAAGACCTTGCTCAGATCGCAGGAGCTATGGGACTTGGTGGAGCACGGGTTTGTTGATCTTTTAGAACCCACAATAGAAGAAAAGGAGAGACTAAGAGAAACCAAGAAAAACGATGCCAAGGCTTTATTCATTATTCAGCAAGCAGTTCATGAGACTATCTTTTCACGAATTGCAGCAGCAACCACATCAAAGCAAGCATGGTCAATTCTGCAGAAAGAGTTTCAGGGAGATTCAAAAGTCATAATAGTGAAATTGCAGTCTCTAAGACGTGATTTTGAAACTCTGCTCATGACGAATGGCGAATCAATTGCTGACTTTTTGTCCAGAACAATGGCAATAGTCAGTCAGATGCGCACCTATGGAGAGAAAATTTCAGACGAAACAATTGTTGCAAAGGTGTTGAGAAGCTTAACTCCAAAGTTTGACCATGTGGTGGCTGCCATAGAAGAAGCCAAGGATCTATCCATACTCTCCGTTGATGAACTGATGGGCTCGCTTCAGGCTCATGAGGCAAGAATCAACAGAGCATCAGAAAGGAACGAAGAAAAGGCACTACAAGTGAAGGAGACAACCAATAACGAAAGAGAAAATATTCATTTAGCAGGTAGAAGTCGTGGAAGAGGAGGATTTCGCAACTTCCATGGTAGTCGTGATAACAGTTGGAGAAGTGATGGACAGAGACAATTCAATGAACAAAGGAATAGAATGAAGAAGAAGAAGAAAAGTTGTTTGTGGCGTGCATGGATACTAATCCGGAAAAAGGTAGCTTATGGTTTGTTGATAGCGGATGCTCGAACCATATGA
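
The coding sequence can be checked against the BLAST query lines of this record by translating it. This is a minimal sketch, assuming the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, and only the first 30 and last 9 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

print(translate("ATGGCAGTAGCTGGTTTTTCTTCTTCTGCA"))  # prints: MAVAGFSSSA
print(translate("ACCATATGA"))                       # prints: TI*
```

The first ten residues match the start of the BLAST query (MAVAGFSSSA), and the final codon TGA is a stop.
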
BLAST of CmoCh02G015640 vs. TrEMBL
Match: A0A0V0IV83_SOLCH (Putative ovule protein (Fragment) OS=Solanum chacoense PE=4 SV=1)

HSP 1 Score: 317.8 bits (813), Expect = 1.4e-83
Identity = 176/266 (66.17%), Positives = 206/266 (77.44%), Query Frame = 1

Query: 1   MAVAGFSSSA--PLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKER 60
           MA  G S S   PL+P+F GE YE+WSI+MKT+L+SQ+LWDLVE G+ D      +E+ R
Sbjct: 1   MATNGSSLSVAQPLIPVFKGESYEFWSIRMKTILKSQDLWDLVERGYTD-----PDEENR 60

Query: 61  LRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRR 120
           LR+ KK DAKAL  IQQAVH++IFSRIA ATTSKQAWSILQK FQGDSKVI+V+LQSLRR
Sbjct: 61  LRDNKKKDAKALVFIQQAVHDSIFSRIAXATTSKQAWSILQKXFQGDSKVIVVRLQSLRR 120

Query: 121 DFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEE 180
           DFETL+M +GESIA FLSR M IVSQ+R+YGEK++D+ IV KVLRSL PKFDHVVAAIEE
Sbjct: 121 DFETLMMKSGESIASFLSRAMTIVSQIRSYGEKVTDQIIVEKVLRSLNPKFDHVVAAIEE 180

Query: 181 AKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGF 240
           +KDLS+ S DELMGSLQAHEAR NR+ E+NEEKA QVK+ T    +N   A R RGRGGF
Sbjct: 181 SKDLSVFSFDELMGSLQAHEARRNRSVEKNEEKAFQVKDATTKYGDNNGPASRGRGRGGF 240

Query: 241 RNFHGS--RDNSWRSDGQRQFNEQRN 263
           R   G        R++G RQ NEQ N
Sbjct: 241 RGGRGRGFGRGRGRNNGHRQSNEQGN 261
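
The percentages on each HSP statistics line follow from the reported counts divided by the alignment length. A small sketch of that arithmetic is shown here; the function name `hsp_fractions` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

def hsp_fractions(stats_line):
    """Map each 'Label = matches/length' pair to a percentage (two decimals)."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", stats_line)
    return {label: round(100 * int(n) / int(d), 2) for label, n, d in pairs}

line = "Identity = 176/266 (66.17%), Positives = 206/266 (77.44%), Query Frame = 1"
print(hsp_fractions(line))  # prints: {'Identity': 66.17, 'Positives': 77.44}
```
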

BLAST of CmoCh02G015640 vs. TrEMBL
Match: A5BT67_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_020406 PE=4 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 2.3e-73
Identity = 153/232 (65.95%), Positives = 181/232 (78.02%), Query Frame = 1

Query: 7   SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKND 66
           S S P +PIF GE YE+WSIKMKTL +SQ+LWDLVE+G+     P  +E+ RL+E  K D
Sbjct: 10  SVSQPAIPIFKGECYEFWSIKMKTLFKSQDLWDLVENGY-----PYPDEEARLKENTKKD 69

Query: 67  AKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMT 126
           +KALF IQQAVHE+IFS+IA ATT+K+AW+ L+  FQG SKVI VKLQSLRRDFETL M 
Sbjct: 70  SKALFFIQQAVHESIFSKIAVATTTKEAWTTLKTAFQGSSKVITVKLQSLRRDFETLHMK 129

Query: 127 NGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILS 186
           NGES+ DFLSR  AIV+QMR+YGE I D+TIVAKVLRSLTPKFDHVVAAIEE+K LS  S
Sbjct: 130 NGESVQDFLSRVAAIVNQMRSYGEDILDQTIVAKVLRSLTPKFDHVVAAIEESKGLSTYS 189

Query: 187 VDELMGSLQAHEARINRASERNEEKALQVK-ETTNNERENIHLAGRSRGRGG 238
            DELMGSLQ+HE R++R  E+NEEKA   K ET++ +       GR RGRGG
Sbjct: 190 FDELMGSLQSHEVRLSRIEEKNEEKAFYTKGETSDQKNGGREATGRGRGRGG 236

BLAST of CmoCh02G015640 vs. TrEMBL
Match: A5AWP3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_020777 PE=4 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 3.6e-71
Identity = 161/290 (55.52%), Positives = 202/290 (69.66%), Query Frame = 1

Query: 7   SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKND 66
           S S P +PIF GE YE+WSIKMKTL +SQ+LWDLVE+G+     P  +E+ RL+E  K D
Sbjct: 10  SVSQPAIPIFKGECYEFWSIKMKTLFKSQDLWDLVENGY-----PYPDEEARLKENTKKD 69

Query: 67  AKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMT 126
           +KALF IQQAVHE+IFS+IAA TT+K+AW+ L+  FQG SKVI VKLQSLRRDFETL M 
Sbjct: 70  SKALFFIQQAVHESIFSKIAAXTTAKEAWTTLKTAFQGSSKVITVKLQSLRRDFETLHMK 129

Query: 127 NGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILS 186
           NGES+ DFLSR  AIV+QMR+YGE I D+T+VAKVLRSLTPKFDHVVAAIEE+KDLS  S
Sbjct: 130 NGESVQDFLSRVAAIVNQMRSYGEDILDQTVVAKVLRSLTPKFDHVVAAIEESKDLSTYS 189

Query: 187 VDELMGSLQAHEARINRASERNEEKALQVK-ETTNNERENIHLAGRSRGRGGFRNFHGSR 246
            DELMGSLQ+HE R++R  E+NEEK    K ET++ +       GR  GRGG    HG R
Sbjct: 190 FDELMGSLQSHEVRLSRTEEKNEEKXFYTKGETSDQKNGGREATGRGCGRGG---AHG-R 249

Query: 247 DNSWRSDGQRQFNEQRNRMKKKKKSCLWRAWILIRKKVAYGLLIADARTI 296
               R  G  Q    +   ++K+ + + +    ++  +AY   +  +  I
Sbjct: 250 GGRGRGRGDAQXECWKKERQEKQANYVEQEEDQVKLFMAYNEEVVSSNNI 290

BLAST of CmoCh02G015640 vs. TrEMBL
Match: M0SED6_MUSAM (Uncharacterized protein OS=Musa acuminata subsp. malaccensis PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 1.3e-68
Identity = 140/216 (64.81%), Positives = 169/216 (78.24%), Query Frame = 1

Query: 1   MAVAGFSSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLR 60
           MA  G S S PL+PIF+G+ YE+WSIKMKTL +SQ+LWDL+E+ + D      +++ +LR
Sbjct: 1   MAFNGNSMSQPLIPIFSGKSYEFWSIKMKTLFKSQDLWDLIENEYADP-----DDEIKLR 60

Query: 61  ETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDF 120
           E +K D+KALF IQQAVHETIF RIAAATTSKQAW ILQ EFQG S+VI VKLQ+L  +F
Sbjct: 61  ENRKKDSKALFFIQQAVHETIFLRIAAATTSKQAWLILQNEFQGSSRVITVKLQTLHHEF 120

Query: 121 ETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAK 180
           E L M + ES+ DFLSR   IVSQM++Y E +SD  IV KVLR+LTPKFDHVV AIEE+K
Sbjct: 121 EILFMKSNESVQDFLSRVTEIVSQMKSYSEHLSDHIIVVKVLRNLTPKFDHVVTAIEESK 180

Query: 181 DLSILSVDELMGSLQAHEARINRASERNEEKALQVK 217
           DLS  S DELMGSLQA+E R+NR+ E+NEEK  QVK
Sbjct: 181 DLSTYSFDELMGSLQAYEVRLNRSLEKNEEKVFQVK 211

BLAST of CmoCh02G015640 vs. TrEMBL
Match: A5AHH2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032906 PE=4 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 2.9e-68
Identity = 151/257 (58.75%), Positives = 187/257 (72.76%), Query Frame = 1

Query: 15  IFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQ 74
           +  GE YE+WSIKMKTL +SQ+LWDLVE+G+     P  +E+ RL+E  K D+KALF IQ
Sbjct: 132 VVEGECYEFWSIKMKTLFKSQDLWDLVENGY-----PYPDEEARLKENTKKDSKALFFIQ 191

Query: 75  QAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADF 134
           QA+HE+IFS+IA ATT+K+AW+ L+  FQG SKVI VKLQSLRRDFETL M NGES  DF
Sbjct: 192 QAIHESIFSKIAVATTAKEAWTTLETAFQGSSKVITVKLQSLRRDFETLHMKNGESXQDF 251

Query: 135 LSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSL 194
           LSR  AIV+QMR+YGE I D+T+VAKVLRSLTPKFDHVVA IEE+KDLS  S DELMGSL
Sbjct: 252 LSRVAAIVNQMRSYGEDILDQTVVAKVLRSLTPKFDHVVAXIEESKDLSTYSFDELMGSL 311

Query: 195 QAHEARINRASERNEEKALQVK-ETTNNERENIHLAGRSRGRGGFRNFHGSRDNSWRSDG 254
           Q+HE R++   ++NEEK    K ET++ +       GR RGRGG    HG R    R  G
Sbjct: 312 QSHEVRLSXTEDKNEEKXFYTKGETSDXKNGGREXTGRGRGRGG---AHG-RGGRGRGRG 371

Query: 255 QRQFNEQRNRMKKKKKS 271
             Q +++++  K + KS
Sbjct: 372 DAQGDQRQSTEKSRNKS 379

BLAST of CmoCh02G015640 vs. TAIR10
Match: AT3G21000.1 (AT3G21000.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 67.0 bits (162), Expect = 2.2e-11
Identity = 52/184 (28.26%), Positives = 94/184 (51.09%), Query Frame = 1

Query: 21  YEWWSIKMKTLLRSQELWDLVEHGFVD------LLEPTI--EEKERLRETKKNDAKALFI 80
           YE W+   K+ L  Q LWD+V +G          L  TI  EE  + R+    DAKAL I
Sbjct: 17  YEIWAPITKSTLIEQGLWDVVVNGVPQDPSKNPELAATIQPEELSKWRDFVVKDAKALQI 76

Query: 81  IQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVII-----VKLQSLRRDFETLLMTN 140
           +Q ++ +++F +  +A+++K  W +L+K   G+ +  I     V ++ L +  E L M +
Sbjct: 77  LQSSLTDSVFRKTLSASSAKDVWDLLRK---GNEQATIRRLEQVTIRRLEKQLEDLKMVD 136

Query: 141 GESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSV 192
            ES + +L + + I+ ++     + SD  I   V  +L+  FD + + +EE  D+  ++ 
Sbjct: 137 KESGSSYLDKALEILERLGRAKLEKSDYEICKNVFTTLSGSFDGLDSMLEELIDVHKMTS 196

BLAST of CmoCh02G015640 vs. TAIR10
Match: AT1G48720.1 (AT1G48720.1 unknown protein)

HSP 1 Score: 63.2 bits (152), Expect = 3.2e-10
Identity = 32/93 (34.41%), Positives = 57/93 (61.29%), Query Frame = 1

Query: 7  SSSAPL-LPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIE------EKERL 66
          S++ P  +P+     Y+ WS++MK +L + ++W++VE GF+   EP  E      +K+ L
Sbjct: 3  SNNVPFQVPVLTKSNYDNWSLRMKAILGAHDVWEIVEKGFI---EPENEGSLSQTQKDGL 62

Query: 67 RETKKNDAKALFIIQQAVHETIFSRIAAATTSK 93
          R+++K D KAL +I Q + E  F ++  AT++K
Sbjct: 63 RDSRKRDKKALCLIYQGLDEDTFEKVVEATSAK 92

BLAST of CmoCh02G015640 vs. NCBI nr
Match: gi|147811646|emb|CAN72676.1| (hypothetical protein VITISV_020406 [Vitis vinifera])

HSP 1 Score: 283.9 bits (725), Expect = 3.3e-73
Identity = 153/232 (65.95%), Positives = 181/232 (78.02%), Query Frame = 1

Query: 7   SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKND 66
           S S P +PIF GE YE+WSIKMKTL +SQ+LWDLVE+G+     P  +E+ RL+E  K D
Sbjct: 10  SVSQPAIPIFKGECYEFWSIKMKTLFKSQDLWDLVENGY-----PYPDEEARLKENTKKD 69

Query: 67  AKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMT 126
           +KALF IQQAVHE+IFS+IA ATT+K+AW+ L+  FQG SKVI VKLQSLRRDFETL M 
Sbjct: 70  SKALFFIQQAVHESIFSKIAVATTTKEAWTTLKTAFQGSSKVITVKLQSLRRDFETLHMK 129

Query: 127 NGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILS 186
           NGES+ DFLSR  AIV+QMR+YGE I D+TIVAKVLRSLTPKFDHVVAAIEE+K LS  S
Sbjct: 130 NGESVQDFLSRVAAIVNQMRSYGEDILDQTIVAKVLRSLTPKFDHVVAAIEESKGLSTYS 189

Query: 187 VDELMGSLQAHEARINRASERNEEKALQVK-ETTNNERENIHLAGRSRGRGG 238
            DELMGSLQ+HE R++R  E+NEEKA   K ET++ +       GR RGRGG
Sbjct: 190 FDELMGSLQSHEVRLSRIEEKNEEKAFYTKGETSDQKNGGREATGRGRGRGG 236

BLAST of CmoCh02G015640 vs. NCBI nr
Match: gi|659126980|ref|XP_008463459.1| (PREDICTED: uncharacterized protein LOC103501626 [Cucumis melo])

HSP 1 Score: 280.4 bits (716), Expect = 3.6e-72
Identity = 153/250 (61.20%), Positives = 194/250 (77.60%), Query Frame = 1

Query: 8   SSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDA 67
           ++ P++PIF GE YE+WSI MKTLLRSQ+LWDLVE G+ D      +++++L E KK D+
Sbjct: 10  TAQPIIPIFKGEGYEFWSIHMKTLLRSQDLWDLVEQGYADP-----DDEDKLWENKKKDS 69

Query: 68  KALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTN 127
           KAL IIQQ VH+++FSRI AAT+SKQAW ILQK FQGDS+V++VKLQSLRRDFETL M N
Sbjct: 70  KALVIIQQVVHDSVFSRIVAATSSKQAWLILQKAFQGDSRVLMVKLQSLRRDFETLTMKN 129

Query: 128 GESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSV 187
           GESIADFLSR   I+SQMRTY E+I+++TIV KVLRSLT KFDHVVAAIEE+K+LS  + 
Sbjct: 130 GESIADFLSRATTIISQMRTYDERITNQTIVEKVLRSLTLKFDHVVAAIEESKNLSTFTF 189

Query: 188 DELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFR-NFHGSRD 247
            EL+GSLQAHE+RINR+ ERN+EK  QV++      E+  +  R RGR G+R   HG+  
Sbjct: 190 IELIGSLQAHESRINRSMERNKEKMFQVRDVVPKYNESDCVMTRGRGRRGYRGRGHGAGK 249

Query: 248 NSWRSDGQRQ 257
              +++ QRQ
Sbjct: 250 GCNQNEEQRQ 254

BLAST of CmoCh02G015640 vs. NCBI nr
Match: gi|147789988|emb|CAN71759.1| (hypothetical protein VITISV_020777 [Vitis vinifera])

HSP 1 Score: 276.6 bits (706), Expect = 5.2e-71
Identity = 161/290 (55.52%), Positives = 202/290 (69.66%), Query Frame = 1

Query: 7   SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKND 66
           S S P +PIF GE YE+WSIKMKTL +SQ+LWDLVE+G+     P  +E+ RL+E  K D
Sbjct: 10  SVSQPAIPIFKGECYEFWSIKMKTLFKSQDLWDLVENGY-----PYPDEEARLKENTKKD 69

Query: 67  AKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMT 126
           +KALF IQQAVHE+IFS+IAA TT+K+AW+ L+  FQG SKVI VKLQSLRRDFETL M 
Sbjct: 70  SKALFFIQQAVHESIFSKIAAXTTAKEAWTTLKTAFQGSSKVITVKLQSLRRDFETLHMK 129

Query: 127 NGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILS 186
           NGES+ DFLSR  AIV+QMR+YGE I D+T+VAKVLRSLTPKFDHVVAAIEE+KDLS  S
Sbjct: 130 NGESVQDFLSRVAAIVNQMRSYGEDILDQTVVAKVLRSLTPKFDHVVAAIEESKDLSTYS 189

Query: 187 VDELMGSLQAHEARINRASERNEEKALQVK-ETTNNERENIHLAGRSRGRGGFRNFHGSR 246
            DELMGSLQ+HE R++R  E+NEEK    K ET++ +       GR  GRGG    HG R
Sbjct: 190 FDELMGSLQSHEVRLSRTEEKNEEKXFYTKGETSDQKNGGREATGRGCGRGG---AHG-R 249

Query: 247 DNSWRSDGQRQFNEQRNRMKKKKKSCLWRAWILIRKKVAYGLLIADARTI 296
               R  G  Q    +   ++K+ + + +    ++  +AY   +  +  I
Sbjct: 250 GGRGRGRGDAQXECWKKERQEKQANYVEQEEDQVKLFMAYNEEVVSSNNI 290

BLAST of CmoCh02G015640 vs. NCBI nr
Match: gi|449456267|ref|XP_004145871.1| (PREDICTED: uncharacterized protein LOC101208246 [Cucumis sativus])

HSP 1 Score: 269.6 bits (688), Expect = 6.3e-69
Identity = 148/254 (58.27%), Positives = 191/254 (75.20%), Query Frame = 1

Query: 8   SSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDA 67
           ++ PL+ IF GE YE+WS++MKTLLRSQ+LWDLVEH + D      +++ +LRE ++ D+
Sbjct: 10  TTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADP-----DDEGKLREKRRKDS 69

Query: 68  KALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTN 127
           KAL IIQQAVH++ FSRI   TTSK+AW ILQK F+GD +V++VKLQSLR++FETL+M N
Sbjct: 70  KALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRKEFETLMMKN 129

Query: 128 GESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSV 187
            ESIA+FLSR   I+SQM+TYGE I+D+TIV KVLRSLT KFD VVAAIEE+KDLS  + 
Sbjct: 130 RESIANFLSRATTIISQMQTYGETITDQTIVEKVLRSLTLKFDQVVAAIEESKDLSTFTF 189

Query: 188 DELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFR-NFHGSRD 247
            ELMGSLQAHE+RINR+ ERNEEKA QVK+       +  +  R RGRGG+R    G+  
Sbjct: 190 IELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGRGTEK 249

Query: 248 NSWRSDGQRQFNEQ 261
              +++ + QF  Q
Sbjct: 250 GCKQNEEKGQFRVQ 258

BLAST of CmoCh02G015640 vs. NCBI nr
Match: gi|147771031|emb|CAN60238.1| (hypothetical protein VITISV_032906 [Vitis vinifera])

HSP 1 Score: 266.9 bits (681), Expect = 4.1e-68
Identity = 151/257 (58.75%), Positives = 187/257 (72.76%), Query Frame = 1

Query: 15  IFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQ 74
           +  GE YE+WSIKMKTL +SQ+LWDLVE+G+     P  +E+ RL+E  K D+KALF IQ
Sbjct: 132 VVEGECYEFWSIKMKTLFKSQDLWDLVENGY-----PYPDEEARLKENTKKDSKALFFIQ 191

Query: 75  QAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADF 134
           QA+HE+IFS+IA ATT+K+AW+ L+  FQG SKVI VKLQSLRRDFETL M NGES  DF
Sbjct: 192 QAIHESIFSKIAVATTAKEAWTTLETAFQGSSKVITVKLQSLRRDFETLHMKNGESXQDF 251

Query: 135 LSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSL 194
           LSR  AIV+QMR+YGE I D+T+VAKVLRSLTPKFDHVVA IEE+KDLS  S DELMGSL
Sbjct: 252 LSRVAAIVNQMRSYGEDILDQTVVAKVLRSLTPKFDHVVAXIEESKDLSTYSFDELMGSL 311

Query: 195 QAHEARINRASERNEEKALQVK-ETTNNERENIHLAGRSRGRGGFRNFHGSRDNSWRSDG 254
           Q+HE R++   ++NEEK    K ET++ +       GR RGRGG    HG R    R  G
Sbjct: 312 QSHEVRLSXTEDKNEEKXFYTKGETSDXKNGGREXTGRGRGRGG---AHG-RGGRGRGRG 371

Query: 255 QRQFNEQRNRMKKKKKS 271
             Q +++++  K + KS
Sbjct: 372 DAQGDQRQSTEKSRNKS 379

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0V0IV83_SOLCH  1.4e-83  66.17         Putative ovule protein (Fragment) OS=Solanum chacoense PE=4 SV=1
A5BT67_VITVI      2.3e-73  65.95         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_020406 PE=4 SV=1
A5AWP3_VITVI      3.6e-71  55.52         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_020777 PE=4 SV=1
M0SED6_MUSAM      1.3e-68  64.81         Uncharacterized protein OS=Musa acuminata subsp. malaccensis PE=4 SV=1
A5AHH2_VITVI      2.9e-68  58.75         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032906 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT3G21000.1  2.2e-11  28.26         Gag-Pol-related retrotransposon family protein
AT1G48720.1  3.2e-10  34.41         unknown protein

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|147811646|emb|CAN72676.1|      3.3e-73  65.95         hypothetical protein VITISV_020406 [Vitis vinifera]
gi|659126980|ref|XP_008463459.1|  3.6e-72  61.20         PREDICTED: uncharacterized protein LOC103501626 [Cucumis melo]
gi|147789988|emb|CAN71759.1|      5.2e-71  55.52         hypothetical protein VITISV_020777 [Vitis vinifera]
gi|449456267|ref|XP_004145871.1|  6.3e-69  58.27         PREDICTED: uncharacterized protein LOC101208246 [Cucumis sativus]
gi|147771031|emb|CAN60238.1|      4.1e-68  58.75         hypothetical protein VITISV_032906 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR025314  DUF4219
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh02G015640.1  CmoCh02G015640.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                     Source   Source Term  Source Description               Alignment
IPR025314  Domain of unknown function DUF4219  PFAM     PF13961      DUF4219                          coord: 16..42; score: 4.
None       No IPR available                    unknown  Coil         Coil                             coord: 187..221; scor
None       No IPR available                    PANTHER  PTHR11439    GAG-POL-RELATED RETROTRANSPOSON  coord: 14..245; score: 1.9
None       No IPR available                    PFAM     PF14223      Retrotran_gag_2                  coord: 63..200; score: 4.7