CmoCh20G010830.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh20G010830.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Retrotransposon protein, putative, Ty1-copia subclass
Location: Cmo_Chr20 : 10253868 .. 10254725 (+)
Sequence length: 858
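As a quick consistency check, the Location coordinates and the Sequence length agree if the interval is read as 1-based and inclusive. That reading is an assumption (the record does not state its coordinate convention), and `span_length` is a hypothetical helper name used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Location field of this record.
length = span_length(10253868, 10254725)
print(length)            # 858
print(length % 3 == 0)   # True: a whole number of codons
print(length // 3)       # 286 codons, counting the stop codon
```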
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTCGACCTTTGGCCCAATCCCTCTCCATCAGGCTGTCACAATCCGTCTCACTAAAAACAACTTCATCATATGGCGCGCCCAACTCCTCCCCTACCTACGGAGTACGAAGCTTATGGGCTACCTCGATGGCACCATTGTTGCACCTGCCAAGCTGGTCCCTTCCTCAACCACTGATGGTGTTGATATGGTCTCTAATCCGGCCTATGATCGGTGGTATGTTCAGGATCAACAGGTCCTTAGTGGCCTTCTCTCCTCCATGTCTGAGGAGATTCTTCATGATATGGTTGCCGCTACTACTTCCAAGGAGGCGTGGGATACACTGCAGCAGATGTTTTCGTCGTTAACTCGTGCTCGCACTGTTCAGATCCGTGTTGAGCTCACCACGTCCAAGAAACGCGATATGTCTGCTGCAGATTATTTTCGCAAGATCAAAGGGCTAGCCACCGAGCTGGCTGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTCTTCGCTGGTCTTGGCCCTGACTATGATCCCTTCGTCACCTCGATGACTACCAAGAGTGAATCCCTCACGCTTGATGATGTGTTTGCACATCTAATGGCGTTTGAAGGTCATCAACTACAACACCAGGCTGTACTTCAATTAAATTCTGGATCTTCAGTTGATTATGCTGGTTGTGGTGGTCTGCAGAAGAACTGTGAGTGTAGGGATCGTGGTCGTGGCCGTTCTCAAGGTGGTGTGCCCTCTCGTCTTACTGGTAATCGTCGTGACCCTTCTGCTCGTCCTACTTGCCAAATCTGCGGCAAATTGGGGCATACTGCCGTTCGCTGCTGGCATAGGATGGATGAGTCCTATTAA

mRNA sequence

ATGGCGTCGACCTTTGGCCCAATCCCTCTCCATCAGGCTGTCACAATCCGTCTCACTAAAAACAACTTCATCATATGGCGCGCCCAACTCCTCCCCTACCTACGGAGTACGAAGCTTATGGGCTACCTCGATGGCACCATTGTTGCACCTGCCAAGCTGGTCCCTTCCTCAACCACTGATGGTGTTGATATGGTCTCTAATCCGGCCTATGATCGGTGGTATGTTCAGGATCAACAGGTCCTTAGTGGCCTTCTCTCCTCCATGTCTGAGGAGATTCTTCATGATATGGTTGCCGCTACTACTTCCAAGGAGGCGTGGGATACACTGCAGCAGATGTTTTCGTCGTTAACTCGTGCTCGCACTGTTCAGATCCGTGTTGAGCTCACCACGTCCAAGAAACGCGATATGTCTGCTGCAGATTATTTTCGCAAGATCAAAGGGCTAGCCACCGAGCTGGCTGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTCTTCGCTGGTCTTGGCCCTGACTATGATCCCTTCGTCACCTCGATGACTACCAAGAGTGAATCCCTCACGCTTGATGATGTGTTTGCACATCTAATGGCGTTTGAAGGTCATCAACTACAACACCAGGCTGTACTTCAATTAAATTCTGGATCTTCAGTTGATTATGCTGGTTGTGGTGGTCTGCAGAAGAACTGTGAGTGTAGGGATCGTGGTCGTGGCCGTTCTCAAGGTGGTGTGCCCTCTCGTCTTACTGGTAATCGTCGTGACCCTTCTGCTCGTCCTACTTGCCAAATCTGCGGCAAATTGGGGCATACTGCCGTTCGCTGCTGGCATAGGATGGATGAGTCCTATTAA

Coding sequence (CDS)

ATGGCGTCGACCTTTGGCCCAATCCCTCTCCATCAGGCTGTCACAATCCGTCTCACTAAAAACAACTTCATCATATGGCGCGCCCAACTCCTCCCCTACCTACGGAGTACGAAGCTTATGGGCTACCTCGATGGCACCATTGTTGCACCTGCCAAGCTGGTCCCTTCCTCAACCACTGATGGTGTTGATATGGTCTCTAATCCGGCCTATGATCGGTGGTATGTTCAGGATCAACAGGTCCTTAGTGGCCTTCTCTCCTCCATGTCTGAGGAGATTCTTCATGATATGGTTGCCGCTACTACTTCCAAGGAGGCGTGGGATACACTGCAGCAGATGTTTTCGTCGTTAACTCGTGCTCGCACTGTTCAGATCCGTGTTGAGCTCACCACGTCCAAGAAACGCGATATGTCTGCTGCAGATTATTTTCGCAAGATCAAAGGGCTAGCCACCGAGCTGGCTGCCGCCGGCTCTGCCTTGCAGGATGATGATGTGATCGCGTATCTCTTCGCTGGTCTTGGCCCTGACTATGATCCCTTCGTCACCTCGATGACTACCAAGAGTGAATCCCTCACGCTTGATGATGTGTTTGCACATCTAATGGCGTTTGAAGGTCATCAACTACAACACCAGGCTGTACTTCAATTAAATTCTGGATCTTCAGTTGATTATGCTGGTTGTGGTGGTCTGCAGAAGAACTGTGAGTGTAGGGATCGTGGTCGTGGCCGTTCTCAAGGTGGTGTGCCCTCTCGTCTTACTGGTAATCGTCGTGACCCTTCTGCTCGTCCTACTTGCCAAATCTGCGGCAAATTGGGGCATACTGCCGTTCGCTGCTGGCATAGGATGGATGAGTCCTATTAA
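The query residues shown in the BLAST alignments of this record can be related back to the CDS by translating it in frame 1. The sketch below assumes the standard genetic code and, to stay short, translates only the first 27 and last 15 nucleotides of the CDS; `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGCGTCGACCTTTGGCCCAATCCCT"  # first 27 nt of the CDS
cds_end = "GATGAGTCCTATTAA"                # last 15 nt of the CDS

print(translate(cds_start))  # MASTFGPIP
print(translate(cds_end))    # DESY*
```

The two fragments reproduce the start (MASTFGPIP) and end (DESY) of the query sequence seen in the alignments.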
BLAST of CmoCh20G010830.1 vs. TrEMBL
Match: A0A096R8L4_MAIZE (Uncharacterized protein OS=Zea mays PE=4 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 4.3e-61
Identity = 124/188 (65.96%), Positives = 153/188 (81.38%), Query Frame = 1

Query: 7   PIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGVDMVS 66
           PIP+HQ VTIRLTK+NF++WRAQLLPYLRS KL+G+LDGT  A A  + +  T     V 
Sbjct: 3   PIPIHQVVTIRLTKSNFLLWRAQLLPYLRSWKLIGHLDGTQAARALTIAAGATQ----VP 62

Query: 67  NPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRARTVQIRV 126
           NPAY+RWY QDQQ+LSGLLSSMSEE+L D+V AT+S+E WD LQ+ F+S T+ARTVQIRV
Sbjct: 63  NPAYERWYNQDQQLLSGLLSSMSEEVLWDVVHATSSREVWDFLQKNFASSTKARTVQIRV 122

Query: 127 ELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTSMTTK 186
           EL T+KKRDM  AD+FRKI  L TELAAA +  +D++V+AYLFA L  DYDPF+TSMTTK
Sbjct: 123 ELATAKKRDMYVADFFRKITRLDTELAAADAPFRDEEVLAYLFASLPADYDPFITSMTTK 182

Query: 187 SESLTLDD 195
           +E+L+LD+
Sbjct: 183 NEALSLDN 186
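The identity and positives percentages in an HSP header are simple ratios over the alignment length, and the counts themselves can be read off the midline between the Query and Sbjct rows: a letter marks an identical residue and `+` marks a conservative substitution. The following is a minimal sketch of that bookkeeping; `hsp_stats` and `percent` are hypothetical helper names.

```python
def hsp_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from a BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, length: int) -> str:
    """Format count/length as a percentage with two decimals."""
    return f"{100 * count / length:.2f}"


# Header values of the first TrEMBL hit (A0A096R8L4_MAIZE).
print(percent(124, 188))  # 65.96
print(percent(153, 188))  # 81.38

# Midline of the last alignment block of that hit (SESLTLDD vs NEALSLDN).
print(hsp_stats("+E+L+LD+"))  # (4, 8)
```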

BLAST of CmoCh20G010830.1 vs. TrEMBL
Match: C0PCZ1_MAIZE (Uncharacterized protein OS=Zea mays PE=2 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 3.6e-60
Identity = 133/302 (44.04%), Positives = 193/302 (63.91%), Query Frame = 1

Query: 1   MASTFGPIP---LHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLV--- 60
           +A+ F  +P     Q V+++L   N+++W AQ+LPYLRS  L G++DG++ AP + V   
Sbjct: 10  VAAVFSSMPTPAFSQMVSMKLNHENYLLWVAQVLPYLRSQGLSGHIDGSLPAPRQTVAVD 69

Query: 61  PSSTTDGVDMVSNPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFS 120
           P+  + G  +  NP +  WY QDQ VLS + SS+SEE+L  +V ATT++ AW TL++M++
Sbjct: 70  PAEGSGGRTIAINPEFTSWYHQDQLVLSVINSSLSEEVLATVVDATTARGAWSTLERMYA 129

Query: 121 SLTRARTVQIRVELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGP 180
           S +RAR +QIR++L T +K D++AA+YFRK+K LA  LAA G  L+D++ I+YL  GL  
Sbjct: 130 SSSRARIMQIRMQLATIQKGDLTAAEYFRKVKRLADTLAAVGKRLEDEEFISYLLRGLPA 189

Query: 181 DYDPFVTSMTTKSESLTLDDVFAHLMAFEGHQLQHQAVLQL---NSGSSVDYAGCGGLQK 240
           DYD  VTS+TT+ ++ T+ DV+AHL++FE  Q  H AV Q+   N+ + V   G G   +
Sbjct: 190 DYDSLVTSITTRPDTYTISDVYAHLLSFETRQEYHTAVGQISSVNNANRVPSRGGGAFGQ 249

Query: 241 NCECRDRGRGRSQGGVPSRLTGNRRDPSARP--------TCQICGKLGHTAVRCWHRMDE 286
           N     RG GR +GG      GN R P+  P        TCQICGK  H A++CWHR D+
Sbjct: 250 N----GRG-GRGRGGRSQPGRGNNRPPARPPSNNSSSSGTCQICGKGNHNALQCWHRFDQ 306

BLAST of CmoCh20G010830.1 vs. TrEMBL
Match: B8A366_MAIZE (Uncharacterized protein OS=Zea mays PE=2 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 1.1e-59
Identity = 132/302 (43.71%), Positives = 192/302 (63.58%), Query Frame = 1

Query: 1   MASTFGPIP---LHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLV--- 60
           +A+ F  +P     Q V+++L   N+++W AQ+LPYLRS  L G++DG++ AP + V   
Sbjct: 10  VAAVFSSMPTPAFSQMVSMKLNHENYLLWVAQVLPYLRSQGLSGHIDGSLPAPRQTVAVD 69

Query: 61  PSSTTDGVDMVSNPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFS 120
           P+  + G  +  NP +  WY QDQ VLS + SS+SEE+L  +V ATT++ AW TL++M++
Sbjct: 70  PAEGSGGRTIAINPEFTSWYHQDQLVLSVINSSLSEEVLATVVDATTARGAWSTLERMYA 129

Query: 121 SLTRARTVQIRVELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGP 180
           S +R R +QIR++L T +K D++AA+YFRK+K LA  LAA G  L+D++ I+YL  GL  
Sbjct: 130 SSSRVRIMQIRMQLATIQKGDLTAAEYFRKVKRLADTLAAVGKRLEDEEFISYLLRGLPA 189

Query: 181 DYDPFVTSMTTKSESLTLDDVFAHLMAFEGHQLQHQAVLQL---NSGSSVDYAGCGGLQK 240
           DYD  VTS+TT+ ++ T+ DV+AHL++FE  Q  H AV Q+   N+ + V   G G   +
Sbjct: 190 DYDSLVTSITTRPDTYTISDVYAHLLSFETRQEYHTAVGQISSVNNANRVPSRGGGAFGQ 249

Query: 241 NCECRDRGRGRSQGGVPSRLTGNRRDPSARP--------TCQICGKLGHTAVRCWHRMDE 286
           N     RG GR +GG      GN R P+  P        TCQICGK  H A++CWHR D+
Sbjct: 250 N----GRG-GRGRGGRSQPGRGNNRPPARPPSNNSSSSGTCQICGKGNHNALQCWHRFDQ 306

BLAST of CmoCh20G010830.1 vs. TrEMBL
Match: V7B6D9_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G110300g PE=4 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 7.3e-53
Identity = 115/285 (40.35%), Positives = 180/285 (63.16%), Query Frame = 1

Query: 10  LHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGVDMVSNPA 69
           + Q + ++LT+ N+++W AQ+LPYLRS  L+G++DG+++ P + +  S            
Sbjct: 6   ISQVINVKLTQENYLLWSAQILPYLRSQGLVGFVDGSMLPPNQRLRLS------------ 65

Query: 70  YDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRARTVQIRVELT 129
                 QDQ VLS + SS++EE+L  +V  TT++EAWDTL++ F+S +RAR +QIR+EL+
Sbjct: 66  ------QDQLVLSLINSSVTEEVLSTVVGITTAREAWDTLERQFASTSRARAMQIRMELS 125

Query: 130 TSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTSMTTKSES 189
           T +K+DM+ ADYFRK+K L   LAA G  ++D+++IAY+  GLGPDYDPFVTS+TT+++ 
Sbjct: 126 TIQKKDMTIADYFRKVKRLRDTLAAIGKRVEDEELIAYMLQGLGPDYDPFVTSITTRTDV 185

Query: 190 LTLDDVFAHLMAFEGHQLQHQAVLQLNSGSSVDYAGCGGLQKNCECRDRGRGRSQGGVPS 249
            T+ DV+AH++++E   L++  + Q +S ++V+     G   N     RGRGR   G   
Sbjct: 186 YTVSDVYAHMLSYEMRHLRNGTLEQFSSANNVNRMSNRG-GTNGRRGGRGRGRQSNGGRG 245

Query: 250 RLTGNRRDPSARPT---------CQICGKLGHTAVRCWHRMDESY 286
           +    R +   +P+          QICGK  H A++CWH  D+ Y
Sbjct: 246 QTWHTRNNFGRQPSKAQSNSDKVYQICGKPNHDALQCWHIFDQEY 271

BLAST of CmoCh20G010830.1 vs. TrEMBL
Match: V7CSV3_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G323600g PE=4 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 9.6e-53
Identity = 113/292 (38.70%), Positives = 180/292 (61.64%), Query Frame = 1

Query: 3   STFGPIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGV 62
           +T     + Q + ++LT+ N+++W AQ+LPYLRS  L+G++DG++  P + +   T + +
Sbjct: 8   NTMSSPSISQVINVKLTQENYLLWSAQILPYLRSQGLVGFVDGSMPPPNQTI---TVESI 67

Query: 63  DMVSNPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRARTV 122
                     WY QDQ VLS + SS++EE+L  +V  TT++EAWD L++ F+S +RAR +
Sbjct: 68  ----------WYPQDQLVLSLINSSVTEEVLSTVVGITTAREAWDMLERQFASTSRARAM 127

Query: 123 QIRVELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTS 182
           Q  +EL+T +K+DM+ ADYFRK+K L   LAA G  ++D+++IAY+  GLG DYDP VTS
Sbjct: 128 QNHMELSTIQKKDMTIADYFRKVKRLGDTLAAIGKRVEDEELIAYMLQGLGLDYDPLVTS 187

Query: 183 MTTKSESLTLDDVFAHLMAFEGHQLQHQAVLQLNSGSSVDYAGCGGLQKNCECRDRGRGR 242
           +TT+++  T+ DV+AH++++E   L++  + Q    ++V+     G       R  GRGR
Sbjct: 188 ITTRTDVYTVSDVYAHMLSYEMRHLRNGTLEQFPFANNVNSMSNRG---GTNGRRGGRGR 247

Query: 243 SQGGVPSRLTGNRRDPSARPT---------CQICGKLGHTAVRCWHRMDESY 286
              G   +    R +   +P+         CQICGK  H A++CWHR D++Y
Sbjct: 248 QSNGGRGQTWHTRNNSGRQPSKAQSNSDKVCQICGKPNHDALQCWHRFDQAY 283

BLAST of CmoCh20G010830.1 vs. TAIR10
Match: AT1G34070.1 (AT1G34070.1 Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 72.0 bits (175), Expect = 6.6e-13
Identity = 51/197 (25.89%), Positives = 91/197 (46.19%), Query Frame = 1

Query: 8   IPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGVDMVSN 67
           I  H  V + + ++N+  WR   L +  S  +MG++DGT      L+P++  D       
Sbjct: 16  IKSHIPVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGT------LLPTNANDV------ 75

Query: 68  PAYDRWYVQDQQVLSGLLSSMS-EEILHDMVAATTSKEAWDTLQQMFSSLTRARTVQIRV 127
                W  +D  V   L  +++ ++     V ++TS++ W  ++  F +   AR +++  
Sbjct: 76  ----NWQKRDGIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDS 135

Query: 128 ELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTSMTTK 187
           EL T    DM  ADY+RK+K LA  L      + D +++ Y+  GL P +D  +  +  +
Sbjct: 136 ELRTKDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHR 195

Query: 188 SESLTLDDVFAHLMAFE 204
               + DD    L   E
Sbjct: 196 QPFPSFDDAATMLQEEE 196

BLAST of CmoCh20G010830.1 vs. NCBI nr
Match: gi|670388504|ref|XP_008674717.1| (PREDICTED: uncharacterized protein LOC103650906 isoform X1 [Zea mays])

HSP 1 Score: 334.0 bits (855), Expect = 2.6e-88
Identity = 164/284 (57.75%), Positives = 217/284 (76.41%), Query Frame = 1

Query: 2   ASTFGPIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDG 61
           +S+   IP+HQ VTIRLTK NF++WRAQLLP+LR  +LMG+LDG+ +APA+ + SST   
Sbjct: 4   SSSAAHIPIHQVVTIRLTKTNFLLWRAQLLPHLRGAQLMGFLDGSNLAPAQQIASSTDAN 63

Query: 62  VDMVSNPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRART 121
             ++ NPAY+ W+ QDQQ+LSGLLSSM+E++L D++  ++SKEAWD L  M+SS +RAR 
Sbjct: 64  ARVIPNPAYEWWFNQDQQILSGLLSSMTEDVLGDIITVSSSKEAWDILNSMYSSASRARI 123

Query: 122 VQIRVELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVT 181
           VQIR++L T KKRD+SAADYF KIK  A++LAAA + L++D+ +AYL AGLGP+YD FVT
Sbjct: 124 VQIRMDLATIKKRDLSAADYFCKIKSYASDLAAAAAPLREDEKVAYLLAGLGPEYDSFVT 183

Query: 182 SMTTKSESLTLDDVFAHLMAFEGHQLQHQAVLQLNSGSSVDYAGCGGLQKNCECRDRGRG 241
           +MTTKSE+LT DD++AH +++E  QLQHQA  +LN G+  +YAG GG Q     R RGRG
Sbjct: 184 AMTTKSEALTFDDIYAHFLSYEARQLQHQAETRLNVGTMANYAGRGGSQH----RGRGRG 243

Query: 242 RSQGGVPSRLTGNRRDPSARPTCQICGKLGHTAVRCWHRMDESY 286
            S+G       G+   P+ R  CQICGKLGHTA++CWHRMD+SY
Sbjct: 244 NSRGQSSFHGPGH-TGPTTRDPCQICGKLGHTALKCWHRMDKSY 282

BLAST of CmoCh20G010830.1 vs. NCBI nr
Match: gi|670388508|ref|XP_008674719.1| (PREDICTED: uncharacterized protein LOC103650906 isoform X2 [Zea mays])

HSP 1 Score: 334.0 bits (855), Expect = 2.6e-88
Identity = 164/284 (57.75%), Positives = 217/284 (76.41%), Query Frame = 1

Query: 2   ASTFGPIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDG 61
           +S+   IP+HQ VTIRLTK NF++WRAQLLP+LR  +LMG+LDG+ +APA+ + SST   
Sbjct: 4   SSSAAHIPIHQVVTIRLTKTNFLLWRAQLLPHLRGAQLMGFLDGSNLAPAQQIASSTDAN 63

Query: 62  VDMVSNPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRART 121
             ++ NPAY+ W+ QDQQ+LSGLLSSM+E++L D++  ++SKEAWD L  M+SS +RAR 
Sbjct: 64  ARVIPNPAYEWWFNQDQQILSGLLSSMTEDVLGDIITVSSSKEAWDILNSMYSSASRARI 123

Query: 122 VQIRVELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVT 181
           VQIR++L T KKRD+SAADYF KIK  A++LAAA + L++D+ +AYL AGLGP+YD FVT
Sbjct: 124 VQIRMDLATIKKRDLSAADYFCKIKSYASDLAAAAAPLREDEKVAYLLAGLGPEYDSFVT 183

Query: 182 SMTTKSESLTLDDVFAHLMAFEGHQLQHQAVLQLNSGSSVDYAGCGGLQKNCECRDRGRG 241
           +MTTKSE+LT DD++AH +++E  QLQHQA  +LN G+  +YAG GG Q     R RGRG
Sbjct: 184 AMTTKSEALTFDDIYAHFLSYEARQLQHQAETRLNVGTMANYAGRGGSQH----RGRGRG 243

Query: 242 RSQGGVPSRLTGNRRDPSARPTCQICGKLGHTAVRCWHRMDESY 286
            S+G       G+   P+ R  CQICGKLGHTA++CWHRMD+SY
Sbjct: 244 NSRGQSSFHGPGH-TGPTTRDPCQICGKLGHTALKCWHRMDKSY 282

BLAST of CmoCh20G010830.1 vs. NCBI nr
Match: gi|670370937|ref|XP_008666719.1| (PREDICTED: uncharacterized protein LOC103645443 [Zea mays])

HSP 1 Score: 306.2 bits (783), Expect = 5.9e-80
Identity = 157/240 (65.42%), Positives = 191/240 (79.58%), Query Frame = 1

Query: 7   PIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGVDMVS 66
           PIPLH  VTIRLTK+N+++WRAQL+P+LRS KLMGYLDGT+ APA+L+ SST  G   V+
Sbjct: 14  PIPLHHVVTIRLTKSNYLLWRAQLVPFLRSAKLMGYLDGTLSAPAQLIASSTATGAAQVA 73

Query: 67  NPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRARTVQIRV 126
           NPAY RWY QDQQ+LSGLLSSM+E++L D+VAAT+SKE WD+LQ+ FSS TRA TVQIRV
Sbjct: 74  NPAYIRWYDQDQQLLSGLLSSMTEDVLRDVVAATSSKEVWDSLQKKFSSSTRACTVQIRV 133

Query: 127 ELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTSMTTK 186
           EL T+KKRD +AADY+ KI GLA ELAAAG+ LQDD+V++YL  GL  +YD FVTS+T+K
Sbjct: 134 ELATAKKRDSTAADYYHKITGLANELAAAGAPLQDDEVLSYLLVGLPAEYDLFVTSITSK 193

Query: 187 SESLTLDDVFAHLMAFEGHQLQHQAVLQLNSGSSVDYAGCGGLQKNCECRDRGRGRSQGG 246
           ++ LTLDDVFAHLMAFE  QLQHQ  +QLN+  S +Y G GG         RGRGR+  G
Sbjct: 194 TDPLTLDDVFAHLMAFEARQLQHQTEIQLNTDMSANYVGRGGR------GHRGRGRNTRG 247

BLAST of CmoCh20G010830.1 vs. NCBI nr
Match: gi|670447915|ref|XP_008664171.1| (PREDICTED: uncharacterized protein LOC103642757 [Zea mays])

HSP 1 Score: 270.8 bits (691), Expect = 2.8e-69
Identity = 135/211 (63.98%), Positives = 169/211 (80.09%), Query Frame = 1

Query: 7   PIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGVDMVS 66
           P+P+ QAVTIRLTK N+++WRAQ LPYLRS+KLMG+LDG+  APA  V +ST DG   + 
Sbjct: 4   PVPISQAVTIRLTKANYLLWRAQALPYLRSSKLMGFLDGSKPAPATTVVASTVDGATPIP 63

Query: 67  NPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRARTVQIRV 126
           NP YDRW+ QDQQ+LSGLLS+M+E++L D+V A TSKE WD+LQ+ F+S T+ARTVQIRV
Sbjct: 64  NPEYDRWFDQDQQLLSGLLSTMTEDVLRDVVLAKTSKEVWDSLQKKFASSTKARTVQIRV 123

Query: 127 ELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTSMTTK 186
           EL T KKRD+SAAD+F KI GLA ELAAA + L D++V+AYL AGL  +YDPFVTSMTTK
Sbjct: 124 ELATLKKRDLSAADFFHKIMGLANELAAADAPLCDEEVLAYLLAGLPVEYDPFVTSMTTK 183

Query: 187 SESLTLDDVFAHLMAFEGHQLQHQAVLQLNS 218
           SE+ +LDDVFAHL+AFE  +L  +     +S
Sbjct: 184 SEAFSLDDVFAHLVAFEARRLHTRRTCNCSS 214

BLAST of CmoCh20G010830.1 vs. NCBI nr
Match: gi|413943632|gb|AFW76281.1| (hypothetical protein ZEAMMB73_463302 [Zea mays])

HSP 1 Score: 243.0 bits (619), Expect = 6.1e-61
Identity = 124/188 (65.96%), Positives = 153/188 (81.38%), Query Frame = 1

Query: 7   PIPLHQAVTIRLTKNNFIIWRAQLLPYLRSTKLMGYLDGTIVAPAKLVPSSTTDGVDMVS 66
           PIP+HQ VTIRLTK+NF++WRAQLLPYLRS KL+G+LDGT  A A  + +  T     V 
Sbjct: 3   PIPIHQVVTIRLTKSNFLLWRAQLLPYLRSWKLIGHLDGTQAARALTIAAGATQ----VP 62

Query: 67  NPAYDRWYVQDQQVLSGLLSSMSEEILHDMVAATTSKEAWDTLQQMFSSLTRARTVQIRV 126
           NPAY+RWY QDQQ+LSGLLSSMSEE+L D+V AT+S+E WD LQ+ F+S T+ARTVQIRV
Sbjct: 63  NPAYERWYNQDQQLLSGLLSSMSEEVLWDVVHATSSREVWDFLQKNFASSTKARTVQIRV 122

Query: 127 ELTTSKKRDMSAADYFRKIKGLATELAAAGSALQDDDVIAYLFAGLGPDYDPFVTSMTTK 186
           EL T+KKRDM  AD+FRKI  L TELAAA +  +D++V+AYLFA L  DYDPF+TSMTTK
Sbjct: 123 ELATAKKRDMYVADFFRKITRLDTELAAADAPFRDEEVLAYLFASLPADYDPFITSMTTK 182

Query: 187 SESLTLDD 195
           +E+L+LD+
Sbjct: 183 NEALSLDN 186

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name	E-value	Identity	Description
A0A096R8L4_MAIZE	4.3e-61	65.96	Uncharacterized protein OS=Zea mays PE=4 SV=1
C0PCZ1_MAIZE	3.6e-60	44.04	Uncharacterized protein OS=Zea mays PE=2 SV=1
B8A366_MAIZE	1.1e-59	43.71	Uncharacterized protein OS=Zea mays PE=2 SV=1
V7B6D9_PHAVU	7.3e-53	40.35	Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G110300g PE=4 SV=1
V7CSV3_PHAVU	9.6e-53	38.70	Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G323600g PE=4 SV=1

vs. TAIR10
Match Name	E-value	Identity	Description
AT1G34070.1	6.6e-13	25.89	Retrotransposon gag protein (InterPro:IPR005162)

vs. NCBI nr
Match Name	E-value	Identity	Description
gi|670388504|ref|XP_008674717.1|	2.6e-88	57.75	PREDICTED: uncharacterized protein LOC103650906 isoform X1 [Zea mays]
gi|670388508|ref|XP_008674719.1|	2.6e-88	57.75	PREDICTED: uncharacterized protein LOC103650906 isoform X2 [Zea mays]
gi|670370937|ref|XP_008666719.1|	5.9e-80	65.42	PREDICTED: uncharacterized protein LOC103645443 [Zea mays]
gi|670447915|ref|XP_008664171.1|	2.8e-69	63.98	PREDICTED: uncharacterized protein LOC103642757 [Zea mays]
gi|413943632|gb|AFW76281.1|	6.1e-61	65.96	hypothetical protein ZEAMMB73_463302 [Zea mays]
The following terms have been associated with this mRNA:
GO Assignments
This mRNA is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0003674	molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CmoCh20G010830	CmoCh20G010830	gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
CmoCh20G010830.1	CmoCh20G010830.1-protein	polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmoCh20G010830.1.exon.1	CmoCh20G010830.1.exon.1	exon


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmoCh20G010830.1.CDS.1	CmoCh20G010830.1.CDS.1	CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	PANTHER	PTHR11439	GAG-POL-RELATED RETROTRANSPOSON	coord: 12..285; score: 1.3
None	No IPR available	PANTHER	PTHR11439:SF185	SUBFAMILY NOT NAMED	coord: 12..285; score: 1.3
None	No IPR available	PFAM	PF14223	Retrotran_gag_2	coord: 73..203; score: 8.2