Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGACGGTAAAGTACGATATTGAGAAATTCAATGGGACTAATTCCTCGTTGTGGAAGATGAAGATGAAGGCTATCTTGAGAAAAGATAATTGCTTTGCAGCCATCAGTGTGAGGCTGGTAGATGTCACAGATGATAAGTGGAATGAGATAAACGGGAATGTTGTTGCAAATATTCATCTGGCTCTAGCAGATAAAGTGTTGTCAAGCATAGAGGAGAAGAAAACTGCGAAGGAGATTTGGGATCATCTCGAAATGTTGTACGAGGCCAGGTCACTTCAGAACAAGATTTTCCTTAAGAGAAGATTGTATACTCTTCGGATGTCAAAATCTACTCAAATAACAGAGCACATCAACATGTTGAATATGTTATTTTCTCAACTCATATCATTGGGTTGTAAAATATGTCAAATGAACGTGTTGAACTTCTACTTCAAAGTCTTGATCAACTTGTCATCGACCTGACAAATAGTATTCTCACCAACTATCTAAACTTTGAAGATATTGCAGCTGCTATCTTAGAAAAGGAAAATCGACGCAAGAACAAAGTAGATAAGTTGGCGAGTTCACAACAAGCAGAGGCTATGGTGGTGACAAGAGGTAGATCAATGGAATATGGCTCAGGTGTGAGCCAAAATCAAGGAAGGGCAAAATCAAGAAGTAAGAAGAACATGAAGTGCTACCACTATGGCAAGAAGGGTCATCTGAAGCAGTATTGTTGGAGTTAAATAAGAAAGATTTCAATCCTCAGGGAAACGTAGCAAGCACCTCAAATGAAGGTGATGTCTTGTGTTGTGAAGCAGCGACAACTGTTGAAGGTATAAAGAGTTTAACTGACGTGTGGTTCATCGACTCAAGAGCCACTTATCACATGATCTCTTGACGAGAATGGTTCGATCATTATAAACCTATCTCTGGAGGATCTGTGTTCAGCTATGATGATCATGCCTTGAAGATTGTCGGTATTGGAACTATCAAGTTGAAGTTCCATGACAATACAGTTCGCACAATTCAACAAGTGCGACATGTAGAAGGCCTGACAAGGAACTTGCTCTTGGTAGGTCAACTAGATGACCTTGATTGCAAGGTGATCGTGGAGAAAGCTATCATGAAGGTGATGTGAGGTGCGCTTACACTTATGAAGGGAAAAGGGTGGCTACAAACTTGTACATGTTGGAGGGAAAGACTTTGCAAGAAGGAGAAGCATCAGTTGCCACAAAAAGTCCAAGTGAAAAGCTATCGATGATCTGGCATCAGAAACATGGACACATGTCCGAACAAGGAATGAAAGTTCTTGCAGAGGAGAATCTATTCCCAAGGCTCACCAAGGTATCTCTACCCTTTTGTGAGCATTGTGTTACAAGTAAGCAGCACAGGTTAAAGTTCAACACATCAAATTCTAGAAGTAAAGAGATTCTAGAATTGGTTCACTCTGATGTATGGCAAGTACTGATTACATCTCTAGGAGGAGTAGGGTACTTCGTGTCCTTTATAGATGACTACTCCGTAAGATATTTGGTGTATCCTATCAAGAGGAAGGCAGATATGTGTTCAGTCTTCAAAGTGTTCAAGCCACAAGTAGAACTTCAATCTGGCAAGAAGATGAAGTGTTTGCGGACAGATAATGGAGGGGAATATACAAGCAATGAGTTTGCGGATTTCTGCAATCAGGAAGGCATCCAGAGGCAGTTCATTACGGCTTACACTCCTCAGCAAAATGGAGTAGCAGAGGATGAATAGAACATTGTTGGAGAGAACAAGAGCAATGTTAGCAGCTGCAGGCTTAGACAAAGCTTTCTGAGTAGAAGCAGTTAATACCGCCTGTTATGTAGTGAATCGTGCTTCATCAACCGCAATTGAGTTGAAGACACCGATGCAGATATGGACAGGCAAGCCAGCTAATTATTCAAAGATGTCTTTTAGTTATGTATCTATTCCCTCCAGCTTTAGTAATTTAATCATGAACTACATTTATCTCTTATAGCCCTTTTATGGAGGATTAA
ATTTCTATCGAAAAGAGGACAACATAAGACTAAAGGCCACCGTATTTTCATCAACTAAATTATTTTTGTTAAAGATAACAAATTGATTTGGCCAATTAAGGAGTTAGCATATTTATTCAATATTGAAAGGGTTGAATATTGTATTTGCAATTTGTTAAATAATATTTCCCAAGTATGCTCAAGAAGTCTCTAGAAACTACTGGGTTAAGTAAGCTTGCTCAAGAAACTTCATTGGAAAATTGAAGAAATTATAAGCTACTAGAGTGCAATATGTGGCAAAATTATGGCCACTAGATCTTCATGTACCCAAGGTTAATATGTGGCATAATTATAACCATAGAATTTTGTTAGATGAGCAAGTACGGATGGGCTATAAATAGGAGGGCACCCATGCTTGGCAAACCATCCCAAAAAAATCCCCAATTCCAAGTGAACTAAAGTGAATGTGAGGTTTGGTGAGATAAAGATAGAATGTTTCTTCTTATGAGAGATTTTGTTCTCCATAAGAAAGAAAGTTTGTAACTCCTAATTGAGTGGATTTTTTCTTCCTTGCCAGTGGTTTTTGCCCTAATTTGTTTAGGGGTTTTCCACGTAAAATCATTGTGTCATTTTCTTCTTTCTTTTTACTTCTTCTCAAATCCATTTTAGCTGCGTCTTACTCTATCCGATTCCAATCTTTTTTATAACATATGTGATTAAACCAAAGGAAAGGAAGACTTTAGATGTTAAAGTTCCCCTTAAGTACCCTAGACATAGGACCTTATCCCATGGTTGCGATTGATTTATCAAACTAAGACTTTTTATTAATAGAAATCTTAATAAGGGGCTTAGAACTTGGAGATCTCTTGAACATAACTAGGTTAAGACGCATAATCGTTCGATAACATGGAGGGACGTCTAGCGCCTATTGGTATTCCTGTGCTATTTTCGATTAATTTGAAGTAAAAGGATTAGATTTTCGTTCTTAAGTGTTTCTAAATCAAGAAATTATATATTATTTTACAAGGGAATAATCAAAATAAACTTTGGAGCTTCCGGGAACTAAACAAAATTTCCCGTAATCACAAAACATGAAGATTCTAGGAATAATAAAAGAAATGGAGGTGCAGGGAATCGAACCCTGTACCTCTCGCATGCAAAGCGAGCGCTCTACCATATGAGCTACACCCCCTAAATTTATTAATAAAAATTTCAAGTTGTAATGGATTATATAATTGTGTACAAAGCATTCCATTTTTTCTTTTTCTTCTTTTAATTTCATTCTTAGAGTTCGTAAATAAATATACATTTTATTTAGGAAATCAATACAATCAAACAAGTAACCAAGATCTAACTGGACTAAAATAACCAACAAAGTTCGGAAACCAAATACCAATCAGCCACTAAGCCATTTTCCATAGCCATTCTTACAATATCATGGGCCACCAAATTGGTCTCTCTCTAGATGTGCCTCAACCATGCAATCTGCCATCCCACAACAAAGCTTCTAATCTGGTCAACAACATATTCTACATCTTCCCTTCAACGATTCGGATTGCATGAAGCGAATGAGCTTCAATATGAATAGGTAATGTCCGCCATCGCAGCTTTGTCTGTATTCCTCTTCCAAGAATTATCTTGTGTTAGATAATGTTATTTGAATGATAATTAGATTCGAATCAAATATTTTTTGAATTAGATTTCTCATTCCTAGGATAGGAATTATAGTAATTATAGTGGCATAAATAGGTTATTAGGTTTGTATGAAATAATTCAATTCAATGAATTCTAATTGTTATTTTCTCGATTTTCTCTCCTCCCCTAAAATTAACATCGTATAAGAACATTTATTCTTGTAATTTGTTTGTTTTGTTAAAATCATTGGTTTATTTGATGCATTAATCGTTGGTTTAGCAGTGGAAATTTACGTATTCAAGACCTTAGGTTTTTCAGTGGAAATTTACGTGTTTGAGGTCTCTGAATTTTGATTTTGAACTGATATTCTTATTATTTAACAATGT
TGTAGATGTTGTTTGAGACCGTCATATTACAATGTTGAATCTCATTGGATTTGCAACTACTGATTGGATATTCGTAGATCAATCTTTTGTTCAATTTGCAATATTAGTATTGTGGCGGGGTGGATTTGCTTGGTTTTTGCATGGAATTTGCATGGTAATAATTCGTGATTTGCAATATGGTTGACAATTTTATAGGAAAAGATTGAATTGTAGTAGTTTCTTGGATTGTGGTTTTATAGTTTTCGTGGCTTGTGATTCTATAGTTTTCGTGGCTTTTGAGAAAATCAATTGTTTCCAAATTGAACATAGTTGATTTGTGATTTGGATTGTGAGTTGTTTTGGTGAGTTTGCAGTGGTGTGGACGACAATATGGAGGAATAGGAAGAGGGAGGTACAATTGTTTGCAATATTCTCTGGATCTTGATGGATTATATTTGATGGCGATGAAGAGAAATTAAGTTTGTGTTTTTCTAAAAATTACGGTTTGCTTGAAAATCACGATTTCAGTTTATTGGAACGTGAGAACTCCTCTGCAGCCTTTGAAACATGTTGTTCTTAAAGGGGAGCTAAAGGAGGCGAGGGCCGAGAACGAGGTCCTCAAGTCCAAGTTGGAGGCTGAGTGCAAGAGTGCCGAAAAGAAGGCGGAGCACCAGTAG
mRNA sequence
ATGACGGTAAAGTACGATATTGAGAAATTCAATGGGACTAATTCCTCGTTGTGGAAGATGAAGATGAAGGCTATCTTGAGAAAAGATAATTGCTTTGCAGCCATCAGTGTGAGGCTGGTAGATGTCACAGATGATAAGTGGAATGAGATAAACGGGAATGTTGTTGCAAATATTCATCTGGCTCTAGCAGATAAAGTGTTGTCAAGCATAGAGGAGAAGAAAACTGCGAAGGAGATTTGGGATCATCTCGAAATGTTGTACGAGGCCAGGTCACTTCAGAACAAGATTTTCCTTAAGAGAAGATTGTATACTCTTCGGATGTCAAAATCTACTCAAATAACAGAGCACATCAACATTCTTGATCAACTTGTCATCGACCTGACAAATAGTATTCTCACCAACTATCTAAACTTTGAAGATATTGCAGCTGCTATCTTAGAAAAGGAAAATCGACGCAAGAACAAAGTAGATAAGTTGGCGAGTTCACAACAAGCAGAGGCTATGGTGGTGACAAGAGGTAGATCAATGGAATATGGCTCAGAAGGGTCATCTGAAGCAGTATTGTTGGAGTTAAATAAGAAAGATTTCAATCCTCAGGGAAACGTAGCAAGCACCTCAAATGAAGGTGATGTCTTGTGTTGTGAAGCAGCGACAACTGTTGAAGGATCTGTGTTCAGCTATGATGATCATGCCTTGAAGATTGTCGGTATTGGAACTATCAAGTTGAAGTTCCATGACAATACAGTTCGCACAATTCAACAAGTGCGACATGTAGAAGGCCTGACAAGGAACTTGCTCTTGGTAGGGAAAAGGGTGGCTACAAACTTGTACATGTTGGAGGGAAAGACTTTGCAAGAAGGAGAAGCATCAGTTGCCACAAAAAGTCCAAGTGAAAAGCTATCGATGATCTGGCATCAGAAACATGGACACATGTCCGAACAAGGAATGAAAGTTCTTGCAGAGGAGAATCTATTCCCAAGGCTCACCAAGCCTTTGAAACATGTTGTTCTTAAAGGGGAGCTAAAGGAGGCGAGGGCCGAGAACGAGGTCCTCAAGTCCAAGTTGGAGGCTGAGTGCAAGAGTGCCGAAAAGAAGGCGGAGCACCAGTAG
Coding sequence (CDS)
ATGACGGTAAAGTACGATATTGAGAAATTCAATGGGACTAATTCCTCGTTGTGGAAGATGAAGATGAAGGCTATCTTGAGAAAAGATAATTGCTTTGCAGCCATCAGTGTGAGGCTGGTAGATGTCACAGATGATAAGTGGAATGAGATAAACGGGAATGTTGTTGCAAATATTCATCTGGCTCTAGCAGATAAAGTGTTGTCAAGCATAGAGGAGAAGAAAACTGCGAAGGAGATTTGGGATCATCTCGAAATGTTGTACGAGGCCAGGTCACTTCAGAACAAGATTTTCCTTAAGAGAAGATTGTATACTCTTCGGATGTCAAAATCTACTCAAATAACAGAGCACATCAACATTCTTGATCAACTTGTCATCGACCTGACAAATAGTATTCTCACCAACTATCTAAACTTTGAAGATATTGCAGCTGCTATCTTAGAAAAGGAAAATCGACGCAAGAACAAAGTAGATAAGTTGGCGAGTTCACAACAAGCAGAGGCTATGGTGGTGACAAGAGGTAGATCAATGGAATATGGCTCAGAAGGGTCATCTGAAGCAGTATTGTTGGAGTTAAATAAGAAAGATTTCAATCCTCAGGGAAACGTAGCAAGCACCTCAAATGAAGGTGATGTCTTGTGTTGTGAAGCAGCGACAACTGTTGAAGGATCTGTGTTCAGCTATGATGATCATGCCTTGAAGATTGTCGGTATTGGAACTATCAAGTTGAAGTTCCATGACAATACAGTTCGCACAATTCAACAAGTGCGACATGTAGAAGGCCTGACAAGGAACTTGCTCTTGGTAGGGAAAAGGGTGGCTACAAACTTGTACATGTTGGAGGGAAAGACTTTGCAAGAAGGAGAAGCATCAGTTGCCACAAAAAGTCCAAGTGAAAAGCTATCGATGATCTGGCATCAGAAACATGGACACATGTCCGAACAAGGAATGAAAGTTCTTGCAGAGGAGAATCTATTCCCAAGGCTCACCAAGCCTTTGAAACATGTTGTTCTTAAAGGGGAGCTAAAGGAGGCGAGGGCCGAGAACGAGGTCCTCAAGTCCAAGTTGGAGGCTGAGTGCAAGAGTGCCGAAAAGAAGGCGGAGCACCAGTAG
Protein sequence
MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDDKWNEINGNVVANIHLALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINILDQLVIDLTNSILTNYLNFEDIAAAILEKENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEGSSEAVLLELNKKDFNPQGNVASTSNEGDVLCCEAATTVEGSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLLLVGKRVATNLYMLEGKTLQEGEASVATKSPSEKLSMIWHQKHGHMSEQGMKVLAEENLFPRLTKPLKHVVLKGELKEARAENEVLKSKLEAECKSAEKKAEHQ
Homology
BLAST of Moc09g13320.1 vs. NCBI nr
Match:
KAA0026163.1 (Gag-Pol [Cucumis melo var. makuwa] >TYK11620.1 Gag-Pol [Cucumis melo var. makuwa])
HSP 1 Score: 374.8 bits (961), Expect = 8.7e-100
Identity = 222/427 (51.99%), Positives = 263/427 (61.59%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K++IEKFNGTN SLW +KMK +LR DNC AI ++TDD KWNE++GN + NIH
Sbjct: 1 MAAKFEIEKFNGTNFSLWTLKMKVVLRNDNCLEAIDKGPAEITDDNKWNEMDGNAMENIH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LALAD VLSSI+EKK AKEIWDHL LYE +SL NKIFLKR+LYTLRMS+ST +TEH+N
Sbjct: 61 LALADNVLSSIKEKKIAKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQLVI+L N+IL +YL+F+D+ +A+LE+
Sbjct: 121 LNTLFSQLALLGYKIEPNERAELLLQSLLDSYDQLVINLKNNILIDYLSFDDVVSAVLEE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGS---EGSSEAVLLELNKKDFNPQGNVAST 240
ENRRKNK DKL + QQAEA+ VTRGR +E + EG S ++KD NPQGNVAST
Sbjct: 181 ENRRKNKEDKLITLQQAEALTVTRGRPLELAAKIKEGQS-------HEKDSNPQGNVAST 240
Query: 241 SNEGDVLCCEAATTVE----------------------------------GSVFSYDDHA 300
NEG L CEA TT E GS+FS +DHA
Sbjct: 241 INEGGALSCEAVTTTEGKKRMADGWFFDSGATYHMTSRREWFHNYKPISGGSLFSCNDHA 300
Query: 301 LKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLLLV----------------------- 331
LKIV IGTIKLK HDN V TIQQVRHVE L +NLLL+
Sbjct: 301 LKIVDIGTIKLKLHDNIVCTIQQVRHVENLKKNLLLLGQLDDLDCKVVVEKRLTKLIKGA 360
BLAST of Moc09g13320.1 vs. NCBI nr
Match:
BAD34493.1 (Gag-Pol [Ipomoea batatas])
HSP 1 Score: 359.4 bits (921), Expect = 3.8e-95
Identity = 217/444 (48.87%), Positives = 271/444 (61.04%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K++IEKFNG N SLWK+K+KAILRKDNC AAIS R VD TDD KW+E+N + +A+++
Sbjct: 1 MAAKFEIEKFNGKNFSLWKLKVKAILRKDNCLAAISERPVDFTDDKKWSEMNEDAMADLY 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
L++AD VLSSIEEKKTA EIWDHL LYEA+SL NKIFLKR+LYTLRMS+ST +TEH+N
Sbjct: 61 LSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQL+I+LTN+ILT+YL F+D+AAA+LE+
Sbjct: 121 LNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEG------SSEAVLLELN-------KKD 240
E+RRKNK D+ + QQAEA+ V RGRS E G SS+ L N KKD
Sbjct: 181 ESRRKNKEDRQVNLQQAEALTVMRGRSTERGQSSGRGRSKSSKKNLTCYNCGKKGHLKKD 240
Query: 241 -------FNPQGNVASTSNEGDVLCCEAATTVE--------------------------- 300
NPQGNVASTS++G LCCEA+ E
Sbjct: 241 CWNLAQNSNPQGNVASTSDDGSALCCEASIAREGRKRFADIWLIDSGATYHMTSRKEWFH 300
Query: 301 -------GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLL-------- 331
GSV+S DDHAL+I+GIGTIKLK +D TV+T+Q VRHV+GL +NLL
Sbjct: 301 HYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLSYGILDNS 360
BLAST of Moc09g13320.1 vs. NCBI nr
Match:
KAE8686521.1 (Protein STRUBBELIG-RECEPTOR FAMILY 1 [Hibiscus syriacus])
HSP 1 Score: 352.1 bits (902), Expect = 6.0e-93
Identity = 214/447 (47.87%), Positives = 265/447 (59.28%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
MT K+DIEKFNG N SLWK+KMKAILRKD C AAI+ R VD TDD KWNE++GN +AN H
Sbjct: 1 MTTKFDIEKFNGRNFSLWKLKMKAILRKDGCLAAINERPVDFTDDNKWNEMDGNAMANFH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LAL D+VLSSIEEKKTAKEIWDHL LYEA SL NKIFLKR+LYTLRM +ST +TEH+N
Sbjct: 61 LALTDEVLSSIEEKKTAKEIWDHLTKLYEATSLHNKIFLKRKLYTLRMPESTSVTEHLNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQL+I+LTNS +T+ L F+D+AAA+L++
Sbjct: 121 LNTLFSQLTSLSCKIGEQERAELLLQSLPDSYDQLIINLTNSNVTS-LVFDDVAAAVLQE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSME------------------------YGSEGSS 240
ENRRKNK D+ + QQAEA+ TRGRS E G +G
Sbjct: 181 ENRRKNKEDRQVNLQQAEALTTTRGRSTERGQSSSHKYGRSKSRSKKNLKCYHCGKKGHL 240
Query: 241 EAVLLELNKKDFNPQGNVASTSNEGDVLCCEAATTVE----------------------- 300
+ LNK NPQGN A+TS++GD LCCEA+TTV+
Sbjct: 241 KKDCWSLNKNS-NPQGNTANTSDDGDALCCEASTTVKGRKRFADIWLIDSGATYYMTSRR 300
Query: 301 -----------GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLL---- 330
GSV+S +DHAL+I+G+GTIKLK +D T++ ++ VRHV+GL +NLL
Sbjct: 301 EWFHHYEPVSGGSVYSCNDHALEIIGVGTIKLKMYDGTIKVVRDVRHVKGLKKNLLSYGL 360
BLAST of Moc09g13320.1 vs. NCBI nr
Match:
KAE8702857.1 (Detected protein of unknown function [Hibiscus syriacus])
HSP 1 Score: 347.4 bits (890), Expect = 1.5e-91
Identity = 201/385 (52.21%), Positives = 252/385 (65.45%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K+DIEKFN N SLWK+KMKAILRKD C AAIS R VD DD KWNE++GN ++N H
Sbjct: 1 MATKFDIEKFNRRNFSLWKLKMKAILRKDGCLAAISERPVDFIDDNKWNEMDGNAMSNFH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LALAD+VLSSIEEKKTAKEIWDHL LYE SL NKIFLKR+LYTLRMS+ST +TEH+N
Sbjct: 61 LALADEVLSSIEEKKTAKEIWDHLTKLYEVTSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQL+I+LTNS +T+ + F+D+AAA+L++
Sbjct: 121 LNTLFSQLTSLRCTIGEQERVELLLQSLPDSYDQLIINLTNSNVTSIV-FDDVAAAVLQE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEGSSEAVLLELNKKDFNPQGNVASTSNE 240
ENRRKNK D+ + QQAEA+ RGRS E G S + GN A++S++
Sbjct: 181 ENRRKNKEDRQVNLQQAEALTTMRGRSAERGQSSS-------------HKHGNTANSSDD 240
Query: 241 GDVLCCEAATTVE-GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLL- 300
GD L CEA+TTVE GSV+S +DHAL+I+G+GTIKLK +D T++ ++ VRHV+GL +NLL
Sbjct: 241 GDTLLCEASTTVEGGSVYSCNDHALEIIGVGTIKLKMYDGTIKVVRDVRHVKGLKKNLLS 300
Query: 301 ---------------------------LVGKRVATNLYMLEGKTLQEGEASVATKSPSEK 325
L G+++A NLYML+G+TL E EASVA S S
Sbjct: 301 YGLLDNNASKIETRKGIMKVFHGALVVLKGEKIAANLYMLKGETLLEAEASVA--SCSSD 360
BLAST of Moc09g13320.1 vs. NCBI nr
Match:
KAA0044949.1 (hypothetical protein E6C27_scaffold74G002510 [Cucumis melo var. makuwa])
HSP 1 Score: 345.9 bits (886), Expect = 4.3e-91
Identity = 206/386 (53.37%), Positives = 245/386 (63.47%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K+ IEKFNGTN SLWK+KMKAI RKDNC AI R ++TDD KWNE++GN +ANIH
Sbjct: 1 MMTKFKIEKFNGTNFSLWKLKMKAISRKDNCLEAIDKRPAEITDDNKWNEMDGNAMANIH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LAL D VLSSIEEKK AKEIWDHL LYEA+SL NKIFLKR+LYTLRMS ST +TEH+N
Sbjct: 61 LALVDNVLSSIEEKKIAKEIWDHLIKLYEAKSLHNKIFLKRKLYTLRMSGSTLMTEHMNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQLVI+LTN+ILT+YL+F+D+AA +LE+
Sbjct: 121 LNTLFSQLTLLGYKIEPNKHAELLLQSLPDSYDQLVINLTNNILTDYLSFDDVAATVLEE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEGSS------------------------ 240
ENR KNK DKL SSQQAEA+ VTR R +E S GS
Sbjct: 181 ENRCKNKEDKLVSSQQAEALTVTRVRPLECNSSGSKNQGRSKSQSKKKMKCYNCGKKGHV 240
Query: 241 EAVLLELNKKDFNPQGNVASTSNEGDVLCCEAATTVEGSVFSYDDHALKIVGIGTIKLKF 300
+ +LNKKDFNPQGNVAST NEG L CEA TT+EG + DD K+V
Sbjct: 241 KKNCWDLNKKDFNPQGNVASTINEGGALSCEAVTTMEGQL---DDLDCKVV--------V 300
Query: 301 HDNTVRTIQQVRHVEGLTRNLLLVGKRVATNLYMLEGKTLQEGEASVATKSPSEKLSMIW 331
++ IQ +L+ ++V NLYMLEG+TLQEGEASVA+ S E LSM+W
Sbjct: 301 EKRLMKVIQGAL--------VLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMW 360
BLAST of Moc09g13320.1 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 95.5 bits (236), Expect = 1.3e-18
Identity = 111/451 (24.61%), Positives = 180/451 (39.91%), Query Frame = 0
Query: 3 VKYDIEKFNGTNS-SLWKMKMKAILRKDNCFAAISV---RLVDVTDDKWNEINGNVVANI 62
VKY++ KFNG N S W+ +M+ +L + + V + + + W +++ + I
Sbjct: 4 VKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAI 63
Query: 63 HLALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHIN 122
L L+D V+++I ++ TA+ IW LE LY +++L NK++LK++LY L MS+ T H+N
Sbjct: 64 RLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLN 123
Query: 123 ILDQLV----------------IDLTNSILTNY-------------LNFEDIAAAILEKE 182
+ + L+ I L NS+ ++Y + +D+ +A+L E
Sbjct: 124 VFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLNE 183
Query: 183 NRRKNKVDKLASSQQAEAMVVT-RGRSME-----YGSEG--------SSEAVLLELN--- 242
RK Q +A++ RGRS + YG G S V N
Sbjct: 184 KMRKK------PENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQ 243
Query: 243 ----KKDF-NPQGNVASTSNEG-------------------------------------D 302
K+D NP+ TS + D
Sbjct: 244 PGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVD 303
Query: 303 VLCCEAATTVE-----------GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVE 325
AT V G+V + KI GIG I +K + ++ VRHV
Sbjct: 304 TAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVP 363
BLAST of Moc09g13320.1 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 68.6 bits (166), Expect = 1.7e-10
Identity = 33/124 (26.61%), Positives = 71/124 (57.26%), Query Frame = 0
Query: 4 KYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDDKWNEINGNVVANIHLALA 63
K +I+ F+G ++WK +++A+L + + + + + DD W + + I L+
Sbjct: 5 KRNIKPFDGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEVDDSWKKAERCAKSTIIEYLS 64
Query: 64 DKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINILDQL 123
D L+ TA++I ++L+ +YE +SL +++ L++RL +L++S + H +I D+L
Sbjct: 65 DSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMSLLSHFHIFDEL 124
Query: 124 VIDL 128
+ +L
Sbjct: 125 ISEL 128
BLAST of Moc09g13320.1 vs. ExPASy TrEMBL
Match:
A0A5A7SNG9 (Gag-Pol OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold263G00680 PE=4 SV=1)
HSP 1 Score: 374.8 bits (961), Expect = 4.2e-100
Identity = 222/427 (51.99%), Positives = 263/427 (61.59%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K++IEKFNGTN SLW +KMK +LR DNC AI ++TDD KWNE++GN + NIH
Sbjct: 1 MAAKFEIEKFNGTNFSLWTLKMKVVLRNDNCLEAIDKGPAEITDDNKWNEMDGNAMENIH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LALAD VLSSI+EKK AKEIWDHL LYE +SL NKIFLKR+LYTLRMS+ST +TEH+N
Sbjct: 61 LALADNVLSSIKEKKIAKEIWDHLTKLYETKSLHNKIFLKRKLYTLRMSESTSMTEHMNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQLVI+L N+IL +YL+F+D+ +A+LE+
Sbjct: 121 LNTLFSQLALLGYKIEPNERAELLLQSLLDSYDQLVINLKNNILIDYLSFDDVVSAVLEE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGS---EGSSEAVLLELNKKDFNPQGNVAST 240
ENRRKNK DKL + QQAEA+ VTRGR +E + EG S ++KD NPQGNVAST
Sbjct: 181 ENRRKNKEDKLITLQQAEALTVTRGRPLELAAKIKEGQS-------HEKDSNPQGNVAST 240
Query: 241 SNEGDVLCCEAATTVE----------------------------------GSVFSYDDHA 300
NEG L CEA TT E GS+FS +DHA
Sbjct: 241 INEGGALSCEAVTTTEGKKRMADGWFFDSGATYHMTSRREWFHNYKPISGGSLFSCNDHA 300
Query: 301 LKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLLLV----------------------- 331
LKIV IGTIKLK HDN V TIQQVRHVE L +NLLL+
Sbjct: 301 LKIVDIGTIKLKLHDNIVCTIQQVRHVENLKKNLLLLGQLDDLDCKVVVEKRLTKLIKGA 360
BLAST of Moc09g13320.1 vs. ExPASy TrEMBL
Match:
Q6BCY1 (Gag-Pol OS=Ipomoea batatas OX=4120 GN=Rtsp-1AA PE=4 SV=1)
HSP 1 Score: 359.4 bits (921), Expect = 1.8e-95
Identity = 217/444 (48.87%), Positives = 271/444 (61.04%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K++IEKFNG N SLWK+K+KAILRKDNC AAIS R VD TDD KW+E+N + +A+++
Sbjct: 1 MAAKFEIEKFNGKNFSLWKLKVKAILRKDNCLAAISERPVDFTDDKKWSEMNEDAMADLY 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
L++AD VLSSIEEKKTA EIWDHL LYEA+SL NKIFLKR+LYTLRMS+ST +TEH+N
Sbjct: 61 LSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQL+I+LTN+ILT+YL F+D+AAA+LE+
Sbjct: 121 LNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEG------SSEAVLLELN-------KKD 240
E+RRKNK D+ + QQAEA+ V RGRS E G SS+ L N KKD
Sbjct: 181 ESRRKNKEDRQVNLQQAEALTVMRGRSTERGQSSGRGRSKSSKKNLTCYNCGKKGHLKKD 240
Query: 241 -------FNPQGNVASTSNEGDVLCCEAATTVE--------------------------- 300
NPQGNVASTS++G LCCEA+ E
Sbjct: 241 CWNLAQNSNPQGNVASTSDDGSALCCEASIAREGRKRFADIWLIDSGATYHMTSRKEWFH 300
Query: 301 -------GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLL-------- 331
GSV+S DDHAL+I+GIGTIKLK +D TV+T+Q VRHV+GL +NLL
Sbjct: 301 HYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLSYGILDNS 360
BLAST of Moc09g13320.1 vs. ExPASy TrEMBL
Match:
A0A6A2Z3I3 (Protein STRUBBELIG-RECEPTOR FAMILY 1 OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00111059pilonHSYRG00161 PE=4 SV=1)
HSP 1 Score: 352.1 bits (902), Expect = 2.9e-93
Identity = 214/447 (47.87%), Positives = 265/447 (59.28%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
MT K+DIEKFNG N SLWK+KMKAILRKD C AAI+ R VD TDD KWNE++GN +AN H
Sbjct: 1 MTTKFDIEKFNGRNFSLWKLKMKAILRKDGCLAAINERPVDFTDDNKWNEMDGNAMANFH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LAL D+VLSSIEEKKTAKEIWDHL LYEA SL NKIFLKR+LYTLRM +ST +TEH+N
Sbjct: 61 LALTDEVLSSIEEKKTAKEIWDHLTKLYEATSLHNKIFLKRKLYTLRMPESTSVTEHLNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQL+I+LTNS +T+ L F+D+AAA+L++
Sbjct: 121 LNTLFSQLTSLSCKIGEQERAELLLQSLPDSYDQLIINLTNSNVTS-LVFDDVAAAVLQE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSME------------------------YGSEGSS 240
ENRRKNK D+ + QQAEA+ TRGRS E G +G
Sbjct: 181 ENRRKNKEDRQVNLQQAEALTTTRGRSTERGQSSSHKYGRSKSRSKKNLKCYHCGKKGHL 240
Query: 241 EAVLLELNKKDFNPQGNVASTSNEGDVLCCEAATTVE----------------------- 300
+ LNK NPQGN A+TS++GD LCCEA+TTV+
Sbjct: 241 KKDCWSLNKNS-NPQGNTANTSDDGDALCCEASTTVKGRKRFADIWLIDSGATYYMTSRR 300
Query: 301 -----------GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLL---- 330
GSV+S +DHAL+I+G+GTIKLK +D T++ ++ VRHV+GL +NLL
Sbjct: 301 EWFHHYEPVSGGSVYSCNDHALEIIGVGTIKLKMYDGTIKVVRDVRHVKGLKKNLLSYGL 360
BLAST of Moc09g13320.1 vs. ExPASy TrEMBL
Match:
A0A6A3AGK4 (Integrase catalytic domain-containing protein OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig00110480pilonHSYRG00021 PE=3 SV=1)
HSP 1 Score: 347.4 bits (890), Expect = 7.2e-92
Identity = 201/385 (52.21%), Positives = 252/385 (65.45%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K+DIEKFN N SLWK+KMKAILRKD C AAIS R VD DD KWNE++GN ++N H
Sbjct: 1 MATKFDIEKFNRRNFSLWKLKMKAILRKDGCLAAISERPVDFIDDNKWNEMDGNAMSNFH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LALAD+VLSSIEEKKTAKEIWDHL LYE SL NKIFLKR+LYTLRMS+ST +TEH+N
Sbjct: 61 LALADEVLSSIEEKKTAKEIWDHLTKLYEVTSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQL+I+LTNS +T+ + F+D+AAA+L++
Sbjct: 121 LNTLFSQLTSLRCTIGEQERVELLLQSLPDSYDQLIINLTNSNVTSIV-FDDVAAAVLQE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEGSSEAVLLELNKKDFNPQGNVASTSNE 240
ENRRKNK D+ + QQAEA+ RGRS E G S + GN A++S++
Sbjct: 181 ENRRKNKEDRQVNLQQAEALTTMRGRSAERGQSSS-------------HKHGNTANSSDD 240
Query: 241 GDVLCCEAATTVE-GSVFSYDDHALKIVGIGTIKLKFHDNTVRTIQQVRHVEGLTRNLL- 300
GD L CEA+TTVE GSV+S +DHAL+I+G+GTIKLK +D T++ ++ VRHV+GL +NLL
Sbjct: 241 GDTLLCEASTTVEGGSVYSCNDHALEIIGVGTIKLKMYDGTIKVVRDVRHVKGLKKNLLS 300
Query: 301 ---------------------------LVGKRVATNLYMLEGKTLQEGEASVATKSPSEK 325
L G+++A NLYML+G+TL E EASVA S S
Sbjct: 301 YGLLDNNASKIETRKGIMKVFHGALVVLKGEKIAANLYMLKGETLLEAEASVA--SCSSD 360
BLAST of Moc09g13320.1 vs. ExPASy TrEMBL
Match:
A0A5A7TUN0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002510 PE=4 SV=1)
HSP 1 Score: 345.9 bits (886), Expect = 2.1e-91
Identity = 206/386 (53.37%), Positives = 245/386 (63.47%), Query Frame = 0
Query: 1 MTVKYDIEKFNGTNSSLWKMKMKAILRKDNCFAAISVRLVDVTDD-KWNEINGNVVANIH 60
M K+ IEKFNGTN SLWK+KMKAI RKDNC AI R ++TDD KWNE++GN +ANIH
Sbjct: 1 MMTKFKIEKFNGTNFSLWKLKMKAISRKDNCLEAIDKRPAEITDDNKWNEMDGNAMANIH 60
Query: 61 LALADKVLSSIEEKKTAKEIWDHLEMLYEARSLQNKIFLKRRLYTLRMSKSTQITEHINI 120
LAL D VLSSIEEKK AKEIWDHL LYEA+SL NKIFLKR+LYTLRMS ST +TEH+N
Sbjct: 61 LALVDNVLSSIEEKKIAKEIWDHLIKLYEAKSLHNKIFLKRKLYTLRMSGSTLMTEHMNT 120
Query: 121 L-------------------------------DQLVIDLTNSILTNYLNFEDIAAAILEK 180
L DQLVI+LTN+ILT+YL+F+D+AA +LE+
Sbjct: 121 LNTLFSQLTLLGYKIEPNKHAELLLQSLPDSYDQLVINLTNNILTDYLSFDDVAATVLEE 180
Query: 181 ENRRKNKVDKLASSQQAEAMVVTRGRSMEYGSEGSS------------------------ 240
ENR KNK DKL SSQQAEA+ VTR R +E S GS
Sbjct: 181 ENRCKNKEDKLVSSQQAEALTVTRVRPLECNSSGSKNQGRSKSQSKKKMKCYNCGKKGHV 240
Query: 241 EAVLLELNKKDFNPQGNVASTSNEGDVLCCEAATTVEGSVFSYDDHALKIVGIGTIKLKF 300
+ +LNKKDFNPQGNVAST NEG L CEA TT+EG + DD K+V
Sbjct: 241 KKNCWDLNKKDFNPQGNVASTINEGGALSCEAVTTMEGQL---DDLDCKVV--------V 300
Query: 301 HDNTVRTIQQVRHVEGLTRNLLLVGKRVATNLYMLEGKTLQEGEASVATKSPSEKLSMIW 331
++ IQ +L+ ++V NLYMLEG+TLQEGEASVA+ S E LSM+W
Sbjct: 301 EKRLMKVIQGAL--------VLMKRRKVDANLYMLEGETLQEGEASVASSSSGENLSMMW 360
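The percentages in each HSP header are the number of identical (or positively scoring) positions divided by the alignment length, so they can be recomputed from the fractions. The sketch below shows one way to do that; the `parse_hsp_stats` helper and the regular expression are assumptions made for this example, not part of any BLAST tool, and the pattern only targets the header layout used on this page.

```python
import re

# Matches the statistics line of an HSP as laid out on this page.
# "Posi?tives" accepts both the standard spelling and the "Postives" variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts and recompute percentages for one HSP line (hypothetical helper)."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        # Recomputed from the fractions, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 222/427 (51.99%), Positives = 263/427 (61.59%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 51.99 61.59
```

Because the alignment length includes gap columns, a hit with long gaps can show a modest percentage even when the aligned blocks themselves are highly similar.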
The following BLAST results are available for this feature:
BLAST of Moc09g13320.1 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0026163.1 | 8.7e-100 | 51.99 | Gag-Pol [Cucumis melo var. makuwa] >TYK11620.1 Gag-Pol [Cucumis melo var. makuwa...
BAD34493.1 | 3.8e-95 | 48.87 | Gag-Pol [Ipomoea batatas]
KAE8686521.1 | 6.0e-93 | 47.87 | Protein STRUBBELIG-RECEPTOR FAMILY 1 [Hibiscus syriacus]
KAE8702857.1 | 1.5e-91 | 52.21 | Detected protein of unknown function [Hibiscus syriacus]
KAA0044949.1 | 4.3e-91 | 53.37 | hypothetical protein E6C27_scaffold74G002510 [Cucumis melo var. makuwa]
BLAST of Moc09g13320.1 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 1.3e-18 | 24.61 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 1.7e-10 | 26.61 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
BLAST of Moc09g13320.1 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7SNG9 | 4.2e-100 | 51.99 | Gag-Pol OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold263G00680 PE=4 S...
Q6BCY1 | 1.8e-95 | 48.87 | Gag-Pol OS=Ipomoea batatas OX=4120 GN=Rtsp-1AA PE=4 SV=1
A0A6A2Z3I3 | 2.9e-93 | 47.87 | Protein STRUBBELIG-RECEPTOR FAMILY 1 OS=Hibiscus syriacus OX=106335 GN=F3Y22_tig...
A0A6A3AGK4 | 7.2e-92 | 52.21 | Integrase catalytic domain-containing protein OS=Hibiscus syriacus OX=106335 GN=...
A0A5A7TUN0 | 2.1e-91 | 53.37 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
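The summary rows above are pipe-delimited, so they are straightforward to load for sorting or filtering. The following is a minimal sketch under stated assumptions: the `Hit` record and `parse_hit_row` helper are hypothetical names, and the parser assumes the first four fields are match name, E-value, percent identity and description, ignoring any trailing columns.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    match_name: str
    e_value: float
    identity_pct: float
    description: str


def parse_hit_row(row: str) -> Hit:
    """Parse one pipe-delimited summary row (hypothetical helper)."""
    fields = [f.strip() for f in row.split("|")]
    # Only the first four columns carry data; extra trailing columns are ignored.
    name, e_value, identity, description = fields[:4]
    return Hit(name, float(e_value), float(identity), description)


rows = [
    "P04146 | 1.7e-10 | 26.61 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3",
    "P10978 | 1.3e-18 | 24.61 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...",
]

# Sort so the most significant hit (smallest E-value) comes first.
hits = sorted((parse_hit_row(r) for r in rows), key=lambda h: h.e_value)
for hit in hits:
    print(hit.match_name, hit.e_value, hit.identity_pct)
```

Sorting by E-value rather than percent identity keeps the ordering used in the tables, where a longer alignment with lower identity can still be the more significant hit.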