CmoCh09G003520.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh09G003520.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: (Retrovirus-related Pol polyprotein from transposon TNT 1-94) (3.1.13.-)
Location: Cmo_Chr09 : 1524593 .. 1525871 (-)
Sequence length: 948
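
The location and sequence length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the listed sequence length refers to the spliced mRNA; neither is stated on the page. The function name `span_length` is purely illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1

# Coordinates and length copied from the record above.
genomic_span = span_length(1524593, 1525871)
sequence_length = 948  # assumed to be the spliced mRNA length

# Bases inside the genomic span that are absent from the spliced sequence.
non_exonic = genomic_span - sequence_length
print(genomic_span, non_exonic)
```

Under these assumptions, the difference between the two numbers is the amount of sequence removed by splicing.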
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGAATCAAGTTTTTCAGCCGTTGCACCACCAGTCTTCGATGGAGACAATTATCAAATGTGGGCAGTTCGTATGGAGACTTATTTGGAGGCCTTGGATCTTTGGGAAGCAATAGAAGAGGATTACGAGGTCCCTCCGCTTCCAGCAAATCCTACTGTAGCACAAATCAAATTACAGAAGGAAAAGAAAACAAGGAAATCAAAGGCAAAAGCTTGCCTATTTGCCACTGTATCTAAAATGATTTCATGCGAATAATGTCCCTCAAAACAGCAAAAGAAATCTGGGATTATCTCAAGGCTGAATATGAAGGAGATGAGAGGATTCGTGGAATGAAAGTCCTGAATTTGATCAAGGATTTCGAGTTGCAGAAGATGAAGGAGTCAGAGTCGGTAAAAGAGTACTCAGACAAACTTCTCAGCATCGCCAACAAGGTGAGATTGCTTGGTTCTGTATTAAATGATTCCAGGATCGTTGAAAAGCTGCTAGTCACTCCTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAAGACTTGTCAAAGACTTCTCTTAGAGAGCTCTTGAATGCTTTAAAAGCGCAAGAGCAAAGGAGGTCTATGAGACAAGAAGGGGTGATTGAAGGCGCCTTACTTGTTGAGCATCAAGACAGCCGCAGGTATAAAAACAACTAAACTTTCAAAAATCAATTGACGTATGGAGATTCATCTGCCAATTATCAAAAGACAAAAGGAAGAGGTTTCAAAAAATCCTATTCACCTTGCCGCCATTGTGAGAAGAAAGGTCATCCACCATACAAGTGTTGGAGAAGACCTGACGCCTTCTGCTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAAAGCCAAAGATCCGGTGAAAGAAGTAGATGCACAGGTCGTTGATCAAGAAGAAGAAGAAGAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCGAGAGCTGGTTGATTGATAGTGGGTGCACAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACCGAAGATAAAAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAAGGGAAAAGGCACAGTAGCTATAACAAGCTATGAAGGTACAAAATTTATTCCAGATGTTTTATTTGTGCCTAAAATTGATCAAAATCTCTTGAGCGTTGGTCAGTTACTTAATAAAGGCTATAAAGTATTGTTTGAGAATAAGCAGTGCTGGATCAAAGATGCTAG

mRNA sequence

ATGGGAGAATCAAGTTTTTCAGCCGTTGCACCACCAGTCTTCGATGGAGACAATTATCAAATGTGGGCAGTTCGTATGGAGACTTATTTGGAGGCCTTGGATCTTTGGGAAGCAATAGAAGAGGATTACGAGGTCCCTCCGCTTCCAGCAAATCCTACTGTAGCACAAATCAAATTACAGAAGGAAAAGAAAACAAGGAAATCAAAGGCAAAAGCTTGCCTATTTGCCACTGCTGAATATGAAGGAGATGAGAGGATTCGTGGAATGAAAGTCCTGAATTTGATCAAGGATTTCGAGTTGCAGAAGATGAAGGAGTCAGAGTCGGTAAAAGAGTACTCAGACAAACTTCTCAGCATCGCCAACAAGGTGAGATTGCTTGGTTCTGTATTAAATGATTCCAGGATCGTTGAAAAGCTGCTAGTCACTCCTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAAGACTTGTCAAAGACTTCTCTTAGAGAGCTCTTGAATGCTTTAAAAGCGCAAGAGCAAAGGAGGTCTATGAGACAAGAAGGGGTGATTGAAGGCGCCTTACTTGTTGAGCATCAAGACAGCCGCAGACCTGACGCCTTCTGCTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAAAGCCAAAGATCCGGTGAAAGAAGTAGATGCACAGGTCGTTGATCAAGAAGAAGAAGAAGAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCGAGAGCTGGTTGATTGATAGTGGGTGCACAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACCGAAGATAAAAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAAGGGAAAAGGCACAGTAGCTATAACAAGCTATGAAGTGCTGGATCAAAGATGCTAG

Coding sequence (CDS)

ATGGGAGAATCAAGTTTTTCAGCCGTTGCACCACCAGTCTTCGATGGAGACAATTATCAAATGTGGGCAGTTCGTATGGAGACTTATTTGGAGGCCTTGGATCTTTGGGAAGCAATAGAAGAGGATTACGAGGTCCCTCCGCTTCCAGCAAATCCTACTGTAGCACAAATCAAATTACAGAAGGAAAAGAAAACAAGGAAATCAAAGGCAAAAGCTTGCCTATTTGCCACTGCTGAATATGAAGGAGATGAGAGGATTCGTGGAATGAAAGTCCTGAATTTGATCAAGGATTTCGAGTTGCAGAAGATGAAGGAGTCAGAGTCGGTAAAAGAGTACTCAGACAAACTTCTCAGCATCGCCAACAAGGTGAGATTGCTTGGTTCTGTATTAAATGATTCCAGGATCGTTGAAAAGCTGCTAGTCACTCCTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAAGACTTGTCAAAGACTTCTCTTAGAGAGCTCTTGAATGCTTTAAAAGCGCAAGAGCAAAGGAGGTCTATGAGACAAGAAGGGGTGATTGAAGGCGCCTTACTTGTTGAGCATCAAGACAGCCGCAGACCTGACGCCTTCTGCTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAAAGCCAAAGATCCGGTGAAAGAAGTAGATGCACAGGTCGTTGATCAAGAAGAAGAAGAAGAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCGAGAGCTGGTTGATTGATAGTGGGTGCACAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACCGAAGATAAAAGAGTGAGGATTGGCAATGGTGAACACTTGGAAGTCAAGGGAAAAGGCACAGTAGCTATAACAAGCTATGAAGTGCTGGATCAAAGATGCTAG
BLAST of CmoCh09G003520.1 vs. TrEMBL
Match: A0A151TNI1_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_022187 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 2.2e-98
Identity = 204/363 (56.20%), Positives = 249/363 (68.60%), Query Frame = 1

Query: 3   ESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQKE 62
           ES+FS VAPPVFDGDNY  WAV+ME YLEALDLWEA+E +YEV  L  NPTVAQIK+ KE
Sbjct: 4   ESNFSQVAPPVFDGDNYDRWAVKMEAYLEALDLWEAVEAEYEVLLLLNNPTVAQIKMHKE 63

Query: 63  KKTRKSKAKACLFATA-------------------EYEGDERIRGMKVLNLIKDFELQKM 122
           +KTRK+KAK CLFA                     EY GDERIR M+VLNL+++FELQ+M
Sbjct: 64  RKTRKAKAKTCLFAGVSQTIFTRIMTLKSAKEIWDEYAGDERIRSMQVLNLMREFELQRM 123

Query: 123 KESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKT 182
           KESE +KEYSDKLL IANK+RLLGS   DSRIVEK+LVT PEK+EA+I +LENT+DLSK 
Sbjct: 124 KESEKIKEYSDKLLGIANKIRLLGSNFPDSRIVEKILVTVPEKYEASIASLENTRDLSKI 183

Query: 183 SLRELLNALKAQEQRRSMRQEGVIEGALLVEHQDSR------------------------ 242
           +  E+L+A +AQEQR  MR++  +EGALLV+ Q ++                        
Sbjct: 184 TFAEVLHAFQAQEQRSLMREDHAVEGALLVKSQQAKNYKKNYPASSYDKGKGGKKSYPPC 243

Query: 243 ---------------RPDAFCSKCNQLGHEAVICKAKDPVKEVDAQVVDQEEEEEEEDQL 302
                          RPDA C+KCNQ+GHEA+ICK+K+  +E +A+  DQ    EEEDQL
Sbjct: 244 QHCGKMGHAPFKCWQRPDAKCNKCNQMGHEAIICKSKNQQQEEEAKPADQ----EEEDQL 303

Query: 303 FMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVA 308
           F+ T   S ESSESWLIDS CTNHMT++K  F +LR T   +VRIGNG+H+ VKGKGT+A
Sbjct: 304 FVATCFLSSESSESWLIDSSCTNHMTFNKALFRDLRPTNVTKVRIGNGDHISVKGKGTIA 362
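
The percentages in the HSP header above follow directly from the match counts and the alignment length. A minimal sketch, using the counts reported for this first hit; the helper `percent` and the dictionary layout are illustrative, not part of any BLAST tool:

```python
def percent(matches: int, aligned: int) -> str:
    """Format matches/aligned as a percentage with two decimals."""
    return f"{100 * matches / aligned:.2f}"

# Counts copied from the HSP header: Identity = 204/363, Positives = 249/363.
hsp = {"identical": 204, "positive": 249, "aligned": 363}

identity_pct = percent(hsp["identical"], hsp["aligned"])
positives_pct = percent(hsp["positive"], hsp["aligned"])
print(identity_pct, positives_pct)
```

The same calculation applies to every other HSP header on this page.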

BLAST of CmoCh09G003520.1 vs. TrEMBL
Match: A0A151SFK6_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_024452 PE=4 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 4.0e-92
Identity = 194/358 (54.19%), Positives = 243/358 (67.88%), Query Frame = 1

Query: 3   ESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQKE 62
           E+SFS    PVFDG+NY +WAVRME+YLEA DLWEA+EEDY+VPPLP NPT+AQIK  KE
Sbjct: 4   ETSFSQATLPVFDGENYDLWAVRMESYLEAFDLWEAVEEDYDVPPLPDNPTMAQIKNHKE 63

Query: 63  KKTRKSKAKACLFA-----------------------TAEYEGDERIRGMKVLNLIKDFE 122
           KKTRK+KAK+CLFA                        AEY GDERIR M+VLNL+++FE
Sbjct: 64  KKTRKAKAKSCLFAGVSTTIFTRIMTLKSAKAIWDYLKAEYVGDERIRSMQVLNLMREFE 123

Query: 123 LQKMKESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKD 182
           LQ+MKESE++KEYSDKLLSIANK+RLLGS   DSRIVEK+LVT PE++EA+IT+LEN+K+
Sbjct: 124 LQRMKESETIKEYSDKLLSIANKIRLLGSDFADSRIVEKILVTVPERYEASITSLENSKN 183

Query: 183 LSKTSLRELLNALKAQEQRRSMRQEGVIEGALLVEH------------------------ 242
           LSK +L E+L+AL+AQEQRR MR++  +EGAL  +H                        
Sbjct: 184 LSKITLAEVLHALQAQEQRRLMREDHAVEGALSAKHHVAGYNKKNFFKKNPSTSNENTTN 243

Query: 243 -----QDSRRPDAFCSKCNQLGHEAVICKAKDPVKEVDAQVVDQEEEEEEEDQLFMVTSS 302
                +  ++    C  C ++GH    C      K  DA+  D +  +E EDQLF+ T  
Sbjct: 244 NQNKGKGKKKSYPPCQHCGKMGHPPFRC-----WKRPDAKCTDAQNAQENEDQLFVATCF 303

Query: 303 SSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSY 309
           S+  SSESWLIDSGCTNHMTYD+E F+ L +TE K VRIGNGE + VKGKG++ ITS+
Sbjct: 304 STNSSSESWLIDSGCTNHMTYDRELFKCLDNTEVKWVRIGNGEQIPVKGKGSIVITSH 356

BLAST of CmoCh09G003520.1 vs. TrEMBL
Match: A0A151UAW8_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_020663 PE=4 SV=1)

HSP 1 Score: 334.0 bits (855), Expect = 2.0e-88
Identity = 192/350 (54.86%), Positives = 241/350 (68.86%), Query Frame = 1

Query: 3   ESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQKE 62
           ES+FS +APP+F+ +NY +WAV+ME YLEALDLWEAIEEDYEV PLP NPTVAQI+  KE
Sbjct: 4   ESNFSQIAPPIFNEENYDLWAVKMEAYLEALDLWEAIEEDYEVLPLPDNPTVAQIRSHKE 63

Query: 63  KKTRKSKAKACLFATA-----------------------EYEGDERIRGMKVLNLIKDFE 122
           +K RK+KAK+CLFA                         EY GDERIR M+VLNL+++FE
Sbjct: 64  RKIRKAKAKSCLFAGVSQTIFTRVMALKTAKSIWDYLKEEYAGDERIRSMQVLNLMREFE 123

Query: 123 LQKMKESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKD 182
           LQ+MKESE++KEYSDKLL IANKVRLLGS   DSRIVEK+LVT PE++EA+  +LENTK+
Sbjct: 124 LQRMKESETIKEYSDKLLGIANKVRLLGSDFADSRIVEKILVTVPERYEASKASLENTKN 183

Query: 183 LSKTSLRELLN-----ALKA------QEQRRSMRQEGVIEGALLVEHQDS---------- 242
           LSK +L E+ N     AL A      Q +++  ++           +Q            
Sbjct: 184 LSKITLAEVQNHVVEGALPAKHHEVEQSKKKHFKKNQASSNQSPTSNQMKGKGGHPPFKC 243

Query: 243 -RRPDAFCSKCNQLGHEAVICKAKDPVKEVDAQVVDQEEEEEEEDQLFMVTSSSSKESSE 302
            RRP+A CSKCNQ+GHEA+ICK K+   +  AQ+ DQ    EEED+LF+ T   S ES+E
Sbjct: 244 WRRPEAKCSKCNQMGHEAIICKNKN-CHDEGAQIADQ----EEEDRLFVATCFLSSESNE 303

Query: 303 SWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITS 308
           SWLIDSGCTNHMT+D+  F++LR T   +VRIGNG+++ VKGKGTVAITS
Sbjct: 304 SWLIDSGCTNHMTFDEALFKDLRPTNITKVRIGNGDYITVKGKGTVAITS 348

BLAST of CmoCh09G003520.1 vs. TrEMBL
Match: M1B1W5_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400013513 PE=4 SV=1)

HSP 1 Score: 324.3 bits (830), Expect = 1.6e-85
Identity = 188/353 (53.26%), Positives = 230/353 (65.16%), Query Frame = 1

Query: 26  METYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQKEKKTRKSKAKACLFAT-------- 85
           METYL+A+D+WEA+EEDYEVP  P NPT++QIK  K++KTRKSKAKACLF+         
Sbjct: 1   METYLDAMDMWEAVEEDYEVPSFPNNPTISQIKNHKDRKTRKSKAKACLFSAISSSIFTR 60

Query: 86  ---------------AEYEGDERIRGMKVLNLIKDFELQKMKESESVKEYSDKLLSIANK 145
                           EYEGDE IRGM+VLNLI+DFE+QKMKE+E++K+YS++LL+IAN+
Sbjct: 61  IMSLKSAKAIWDYLKVEYEGDEMIRGMQVLNLIRDFEIQKMKETETIKDYSERLLNIANR 120

Query: 146 VRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKTSLRELLNALKAQEQRRSMR 205
           VRLLGSVLNDSRIVEK+LVT PE+FEATI TLENTKDLSK +  ELLNAL+A EQRR MR
Sbjct: 121 VRLLGSVLNDSRIVEKILVTVPERFEATINTLENTKDLSKITFAELLNALQAHEQRRVMR 180

Query: 206 Q---------------------------EGVIEGAL------------LVEHQDS----- 265
           +                           EG+                 L +H        
Sbjct: 181 EEGNTEGALFAKPQDGGKNKKKKNKKNGEGISASTTKGKSGHSKKSYPLCKHNKKGHPSY 240

Query: 266 ---RRPDAFCSKCNQLGHEAVICKAKDPVKEVDAQVVDQEEEEEEEDQLFMVTSSSSKES 309
              RRPDA CSKCNQLGHE VICK     +E +AQ+VD +EE    DQLF+ T  SS+ S
Sbjct: 241 KCWRRPDARCSKCNQLGHEVVICKNNGQQQEAEAQIVDGKEE----DQLFVATCFSSRSS 300

BLAST of CmoCh09G003520.1 vs. TrEMBL
Match: A0A151RSU4_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_032829 PE=4 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 1.0e-84
Identity = 183/344 (53.20%), Positives = 227/344 (65.99%), Query Frame = 1

Query: 26  METYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQKEKKTRKSKAKACLFATA------- 85
           ME YLEALDLW+A+E +YEV PLP NP VAQIK+ KE+KTRK+KAK CLFA         
Sbjct: 1   MEAYLEALDLWKAVEAEYEVLPLPNNPIVAQIKMHKERKTRKAKAKTCLFAGVSQAIFTR 60

Query: 86  ----------------EYEGDERIRGMKVLNLIKDFELQKMKESESVKEYSDKLLSIANK 145
                           EY GDERIR MKVLNL+++FELQ+MKE E +KEYSDKLL IANK
Sbjct: 61  IKTLKSAKEIWDYLKEEYAGDERIRSMKVLNLMREFELQRMKEYEKIKEYSDKLLGIANK 120

Query: 146 VRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKTSLRELLNALKAQEQRRSMR 205
           +RLLGS   DS IVEK+LVT  EK+EA+  +LENT+DLSK +  E+L+A +AQEQR  MR
Sbjct: 121 IRLLGSNFPDSIIVEKILVTVSEKYEASTASLENTRDLSKITFAEVLHAFQAQEQRSLMR 180

Query: 206 QEGVIEGALLVEHQDSR---------------------------------------RPDA 265
           ++  +EGALLV+ Q ++                                       RPDA
Sbjct: 181 EDHAVEGALLVKSQQAKNYKKNYPASSYDKGKGGKKSYPPCQHCGKMGHAPFRCWQRPDA 240

Query: 266 FCSKCNQLGHEAVICKAKDPVKEVDAQVVDQEEEEEEEDQLFMVTSSSSKESSESWLIDS 308
            C+KCNQ+GHEA+ICK+K+  +E +A+  DQ    +EEDQLF+ T   S ESSESWLIDS
Sbjct: 241 KCNKCNQMGHEAIICKSKNQQQEEEAKAADQ----KEEDQLFVATCFLSSESSESWLIDS 300

BLAST of CmoCh09G003520.1 vs. NCBI nr
Match: gi|823251370|ref|XP_012458313.1| (PREDICTED: uncharacterized protein LOC105779111 [Gossypium raimondii])

HSP 1 Score: 444.9 bits (1143), Expect = 1.2e-121
Identity = 252/336 (75.00%), Positives = 272/336 (80.95%), Query Frame = 1

Query: 2   GESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQK 61
           G SSFS VAPPVFDGDNYQMWAVRMETYLEALDLWE +EEDYEVPPLPANPTV QIK QK
Sbjct: 3   GGSSFSVVAPPVFDGDNYQMWAVRMETYLEALDLWELVEEDYEVPPLPANPTVPQIKAQK 62

Query: 62  EKKTRKSKAKACLFAT---------AEYEGDERIRGMKVLNLIKDFELQKMKESESVKEY 121
           EKKTRKSKAKACLFA          AEYEGDERIRGMKVLNLI+DFELQKMKESESVKEY
Sbjct: 63  EKKTRKSKAKACLFAAVKGNLDYLKAEYEGDERIRGMKVLNLIRDFELQKMKESESVKEY 122

Query: 122 SDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKTSLRELLNAL 181
           SD+LLSIANKVRLLGS LNDSRIVEKLLVT PEKFEATITTLENT DLSK SL ELLNAL
Sbjct: 123 SDRLLSIANKVRLLGSELNDSRIVEKLLVTVPEKFEATITTLENTNDLSKISLAELLNAL 182

Query: 182 KAQEQRRSMRQEGVIEGALLVEHQDSRRPDAFCSKCNQLGHEAVICKAKDPV-------- 241
           +AQEQRRSMRQEGVIEGAL V+HQD+ R +   S+  +  H+A+I +AK+ V        
Sbjct: 183 QAQEQRRSMRQEGVIEGALPVKHQDNNRYNLRTSQRVEKIHQAIIRRAKEEVSRNPTHLV 242

Query: 242 -----------KEVDAQVVDQEEEEEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDK 301
                      +EVDAQV DQEEE    DQLF++T  S +ESSESWLI+SGCTNHMTYDK
Sbjct: 243 TIVRRKVKGQVQEVDAQVADQEEE----DQLFVLTCFSGRESSESWLINSGCTNHMTYDK 302

Query: 302 ESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITSYE 310
               ELR+TE KRVRIGNGE+LEVKGKGTVAITSYE
Sbjct: 303 ----ELRNTEVKRVRIGNGEYLEVKGKGTVAITSYE 330

BLAST of CmoCh09G003520.1 vs. NCBI nr
Match: gi|971545142|ref|XP_015162830.1| (PREDICTED: uncharacterized protein LOC107060037 [Solanum tuberosum])

HSP 1 Score: 410.6 bits (1054), Expect = 2.5e-111
Identity = 219/334 (65.57%), Positives = 262/334 (78.44%), Query Frame = 1

Query: 2   GESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQK 61
           GE+SFS++APP F+G++YQ+WAVRM+TYLEALDLWEA+E+DYE+ PLP NPT+AQIK  K
Sbjct: 3   GEASFSSMAPPAFNGESYQIWAVRMQTYLEALDLWEAVEDDYEIVPLPNNPTIAQIKNHK 62

Query: 62  EKKTRKSKAKACLFATA-----------------------EYEGDERIRGMKVLNLIKDF 121
           E+KT KSKAKACL+                          EYEGDERIRGM+VLNL+++F
Sbjct: 63  ERKTMKSKAKACLYIAVSSTIFTRIMSLKSAKEVWDYLKTEYEGDERIRGMQVLNLVREF 122

Query: 122 ELQKMKESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTK 181
           ELQ+MKESE++KEYSD+LL+IAN+VRLLGS  NDSRIVEK+LVT PEKFEAT+TTLENTK
Sbjct: 123 ELQRMKESETIKEYSDRLLNIANRVRLLGSTFNDSRIVEKILVTVPEKFEATVTTLENTK 182

Query: 182 DLSKTSLRELLNALKAQEQRRSMRQEGVIEGALLVEHQ-----DSRRPDAFCSKCNQLGH 241
           DLSK +L ELL+A +AQEQRR MRQEG++EGALLV+HQ       +RPDA CSKCNQ GH
Sbjct: 183 DLSKITLAELLSAFQAQEQRRVMRQEGIVEGALLVKHQYDGRNKKKRPDAKCSKCNQQGH 242

Query: 242 EAVICKAKDPVKEVDAQVVDQEEEEEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDK 301
           EAVICKAK    EV+AQV D+     EED LF+ T  +S +SSESWLIDSGC+NHMT DK
Sbjct: 243 EAVICKAKIQQPEVEAQVADR-----EEDHLFVATCFTSLDSSESWLIDSGCSNHMTSDK 302

Query: 302 ESFEELRDTEDKRVRIGNGEHLEVKGKGTVAITS 308
             F+EL +TE K+VRIGN E L VKGKG +AITS
Sbjct: 303 MLFKELWNTETKKVRIGNSECLAVKGKGAIAITS 331

BLAST of CmoCh09G003520.1 vs. NCBI nr
Match: gi|823127429|ref|XP_012435705.1| (PREDICTED: uncharacterized protein LOC105762447 [Gossypium raimondii])

HSP 1 Score: 374.8 bits (961), Expect = 1.5e-100
Identity = 204/260 (78.46%), Positives = 214/260 (82.31%), Query Frame = 1

Query: 2   GESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQK 61
           GESSFS  APPVFDGDNYQMWAVRMETYLEALDLWEA+EEDYEVPPL ANP+VAQIK QK
Sbjct: 3   GESSFSVAAPPVFDGDNYQMWAVRMETYLEALDLWEAVEEDYEVPPLLANPSVAQIKAQK 62

Query: 62  EKKTRKSKAKACLFAT-----------------------AEYEGDERIRGMKVLNLIKDF 121
           E K RKSK KACLFA                        AEY GDERIRGMKVLNLI+DF
Sbjct: 63  ENKIRKSKEKACLFAAVSQMIFTRIMSLKSAKEIWDYLKAEYAGDERIRGMKVLNLIRDF 122

Query: 122 ELQKMKESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTK 181
           ELQKMKESESVKEYSD+LLSIANKVRLLGS LNDSRIVE LLVT PEKFEATITTLENTK
Sbjct: 123 ELQKMKESESVKEYSDRLLSIANKVRLLGSELNDSRIVENLLVTIPEKFEATITTLENTK 182

Query: 182 DLSKTSLRELLNALKAQEQRRSMRQEGVIEGALLVEHQDSRRPDAFCSKCNQLGHEAVIC 239
           DLSK  L +LLNAL+ QEQRRSMRQEGVIEGAL V+HQD+ RPDA CSKCNQLGHEAVIC
Sbjct: 183 DLSKICLVKLLNALQEQEQRRSMRQEGVIEGALPVKHQDNNRPDAKCSKCNQLGHEAVIC 242

BLAST of CmoCh09G003520.1 vs. NCBI nr
Match: gi|1012357371|gb|KYP68556.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 367.1 bits (941), Expect = 3.1e-98
Identity = 204/363 (56.20%), Positives = 249/363 (68.60%), Query Frame = 1

Query: 3   ESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQKE 62
           ES+FS VAPPVFDGDNY  WAV+ME YLEALDLWEA+E +YEV  L  NPTVAQIK+ KE
Sbjct: 4   ESNFSQVAPPVFDGDNYDRWAVKMEAYLEALDLWEAVEAEYEVLLLLNNPTVAQIKMHKE 63

Query: 63  KKTRKSKAKACLFATA-------------------EYEGDERIRGMKVLNLIKDFELQKM 122
           +KTRK+KAK CLFA                     EY GDERIR M+VLNL+++FELQ+M
Sbjct: 64  RKTRKAKAKTCLFAGVSQTIFTRIMTLKSAKEIWDEYAGDERIRSMQVLNLMREFELQRM 123

Query: 123 KESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTKDLSKT 182
           KESE +KEYSDKLL IANK+RLLGS   DSRIVEK+LVT PEK+EA+I +LENT+DLSK 
Sbjct: 124 KESEKIKEYSDKLLGIANKIRLLGSNFPDSRIVEKILVTVPEKYEASIASLENTRDLSKI 183

Query: 183 SLRELLNALKAQEQRRSMRQEGVIEGALLVEHQDSR------------------------ 242
           +  E+L+A +AQEQR  MR++  +EGALLV+ Q ++                        
Sbjct: 184 TFAEVLHAFQAQEQRSLMREDHAVEGALLVKSQQAKNYKKNYPASSYDKGKGGKKSYPPC 243

Query: 243 ---------------RPDAFCSKCNQLGHEAVICKAKDPVKEVDAQVVDQEEEEEEEDQL 302
                          RPDA C+KCNQ+GHEA+ICK+K+  +E +A+  DQ    EEEDQL
Sbjct: 244 QHCGKMGHAPFKCWQRPDAKCNKCNQMGHEAIICKSKNQQQEEEAKPADQ----EEEDQL 303

Query: 303 FMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVKGKGTVA 308
           F+ T   S ESSESWLIDS CTNHMT++K  F +LR T   +VRIGNG+H+ VKGKGT+A
Sbjct: 304 FVATCFLSSESSESWLIDSSCTNHMTFNKALFRDLRPTNVTKVRIGNGDHISVKGKGTIA 362

BLAST of CmoCh09G003520.1 vs. NCBI nr
Match: gi|697187607|ref|XP_009602835.1| (PREDICTED: uncharacterized protein LOC104097920 [Nicotiana tomentosiformis])

HSP 1 Score: 364.0 bits (933), Expect = 2.6e-97
Identity = 203/353 (57.51%), Positives = 246/353 (69.69%), Query Frame = 1

Query: 2   GESSFSAVAPPVFDGDNYQMWAVRMETYLEALDLWEAIEEDYEVPPLPANPTVAQIKLQK 61
           GE+SFS++APP FDG++YQ+WAVRM+TYLEALDLWEA+E+DYE+ PLP NPT+AQIK  K
Sbjct: 3   GETSFSSIAPPTFDGESYQIWAVRMQTYLEALDLWEAVEDDYEIAPLPNNPTMAQIKNYK 62

Query: 62  EKKTRKSKAKACLFATA-----------------------EYEGDERIRGMKVLNLIKDF 121
           E+KTRKSKAKACLFA                         EYEGDE++RGM+VLNL+++F
Sbjct: 63  ERKTRKSKAKACLFAAVSSTIFTRIMSLKSAKEVWDYLKTEYEGDEKVRGMQVLNLVREF 122

Query: 122 ELQKMKESESVKEYSDKLLSIANKVRLLGSVLNDSRIVEKLLVTPPEKFEATITTLENTK 181
           ELQ+MK+SE++KEYSD+LL+IAN+VRLLGS LNDSRIVEK+LVT PE+FEATITTLENTK
Sbjct: 123 ELQRMKDSETIKEYSDRLLNIANRVRLLGSTLNDSRIVEKILVTVPERFEATITTLENTK 182

Query: 182 DLSKTSLRELLNALKAQEQRRSMRQEGVIEGALLVEHQDSRR------------------ 241
           D+SK +L ELL+A +AQEQRR MRQEG +EGAL V+HQDSRR                  
Sbjct: 183 DMSKITLAELLSAFQAQEQRRVMRQEGAVEGALPVKHQDSRRNQKKQNKKTQGANNENVG 242

Query: 242 ---------------PDAFC------------------SKCNQLGHEAVICKAKDPVKEV 281
                          P   C                  SKCNQ GHEAVICK+K    EV
Sbjct: 243 NNNKIKTGSKKTNFPPCQHCGKKGHPPFKCWRRPDAKCSKCNQQGHEAVICKSKIQQHEV 302

The following BLAST results are available for this feature:
Match Name        E-value  Identity  Description
A0A151TNI1_CAJCA  2.2e-98  56.20     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A0A151SFK6_CAJCA  4.0e-92  54.19     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A0A151UAW8_CAJCA  2.0e-88  54.86     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
M1B1W5_SOLTU      1.6e-85  53.26     Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400013513 PE=4 SV=1
A0A151RSU4_CAJCA  1.0e-84  53.20     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
Match Name                        E-value   Identity  Description
gi|823251370|ref|XP_012458313.1|  1.2e-121  75.00     PREDICTED: uncharacterized protein LOC105779111 [Gossypium raimondii]
gi|971545142|ref|XP_015162830.1|  2.5e-111  65.57     PREDICTED: uncharacterized protein LOC107060037 [Solanum tuberosum]
gi|823127429|ref|XP_012435705.1|  1.5e-100  78.46     PREDICTED: uncharacterized protein LOC105762447 [Gossypium raimondii]
gi|1012357371|gb|KYP68556.1|      3.1e-98   56.20     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|697187607|ref|XP_009602835.1|  2.6e-97   57.51     PREDICTED: uncharacterized protein LOC104097920 [Nicotiana tomentosiformis]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR025314  DUF4219
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmoCh09G003520  CmoCh09G003520  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CmoCh09G003520.1  CmoCh09G003520.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CmoCh09G003520.1.CDS.4  CmoCh09G003520.1.CDS.4  CDS
CmoCh09G003520.1.CDS.3  CmoCh09G003520.1.CDS.3  CDS
CmoCh09G003520.1.CDS.2  CmoCh09G003520.1.CDS.2  CDS
CmoCh09G003520.1.CDS.1  CmoCh09G003520.1.CDS.1  CDS


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmoCh09G003520.1.exon.4  CmoCh09G003520.1.exon.4  exon
CmoCh09G003520.1.exon.3  CmoCh09G003520.1.exon.3  exon
CmoCh09G003520.1.exon.2  CmoCh09G003520.1.exon.2  exon
CmoCh09G003520.1.exon.1  CmoCh09G003520.1.exon.1  exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                     Source   Source Term  Source Description               Alignment
IPR025314  Domain of unknown function DUF4219  PFAM     PF13961      DUF4219                          coord: 14..40, score: 6.0
None       No IPR available                    PANTHER  PTHR11439    GAG-POL-RELATED RETROTRANSPOSON  coord: 7..306, score: 4.6
None       No IPR available                    PFAM     PF14223      Retrotran_gag_2                  coord: 79..181, score: 1.8