Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCATCCAAGGAAGCTCCATCCACCATGGAAGAAAAGTTGGTAAAAATGGACAAGAAACTTAACATGCTGACAAAAATGGTCGAAAAGAGGGACTATAAGATTGCTTCCCTTATGAACCAGATCGAAATTCAAGATGTTGCAAATGTCGCTGAGTCAAGCCAAAATCATGTTGTGAAAGTCAGCGACAAAAGAAAGAACATCATGCAGGAAAAACAACATGAACATTCTGTTTCAATTGCTTCACTGTCTGTCCAGCAGCTCCAAGATATGATCACAAATTCGATCAGAGCTCAATATGGCGGACCTTCTCAAAACTCCTTCACGTACTCCAAACCGTATACCAAGAGGATCGACAACTTGATAATGCCTATGGGATATCAGCCTCCAAAGTTCCAACAATTTGACGAAAAGGGCAATCCTAAGCAACATATTGCTCATTTCGTTGAGACATGTGAAAATGCTGGCACTCGAGGAGATCTACTCGTCAAAGAATTCGTTCGAAAACTAAAAGGTAATGCTTTTGACTGGTATACAGATTTAGAGCCTGAATCCATCAACGGTTGGGAGCAATTGGAAAGAGAGTTCTTAAATCGCTTCTACAGCACAAGGTGAACCGTAAGCATGATGGAACTCACTAACACTAAGCAACAAAAAGATGAATCTGTTGTTGATTACATCAACCGGTGGAGAGCTTTGAGTCTCGACTGCAAAGATCGACTAATGGAATTATCTGCTATCGAGTTGTGCATTCAAGGTATGCATTGGGGACTTCTCTATATCCTTCAAGGAATAAAACCCCGCACGTTTGAAGAATTAGCAACCCGCGGCCACGACATGGAGCTAAGTATTGCTAATCGAGGAGATAAAGATCTTTTAGTCCCTAACTTAAAGAGAATAAGCAAGAATACTGAGATGACTCTTGAAGAATCAATGGTTATCAACACAACTTCTCTCAAGTCTTCTTCAAAAAGAAAGGATAAGAAGGTCGAAAAACGACAAGAGAATGAAAGGCGTCGTCTAACTCTAAAGGATAGACAAGAAAAAGTTTATCCTTTTCCTGATTCTGATATCTCTGACATGTTAGAACAACTACTAAAAATGCAATTGATCGAACTCCCTGAATGCAAACGACCAGAAGACATGGGAAAAGTAAACGATCCAAACTACTGCAAATACCATAGAGTTGTTGGCCATCCAGTGGGGAAATGCTTCGTCTTAAAGGAACTAATTCTAAAGTTAGCTCAAGAAGGAAAAATCGAGCTGAACATTGATGAATTAGCTCAAGCAAATCATATTGGAGTAGCGACAAACACATGCAATCAAATATGCTCAAAGACATTTCATGGTCAAAATGTGGAAGAACTTGCAACTACACATTACATAAATGTGAAAGAAGTCGACGATTCAAAAGAAATCAAACAAAAGACTTCTGTCTTTGATCGCATCAAGCCTTCAACTACTCGAGTTTCAGTCTTCCAAAGAATGAGTATGGCCGCGATGAAAGAAGAAAATCAATGTTTGACTCGACCTTCAGTCTTCCAAAGGTTAAGTGTCTCCACATCAAGAAAAAATCAAACTTCAACATATACTTTGGATTGTCTTAAAGTTGGTGCAAATGATCGAAACAAAGGAAGAATGAAGATCTTGGAGATGAAAATATTTGATGGCAAAAGGATTCACAACCTTGCTTCACGTATGAAGAGAAAATTATCTCTTGTCATAAGTACGGAAAGTTGCTTGAAGGTGAAGCAAAATCTTGTCATTTCAGCCAATCCTACAAATGAAGGATATGGTCAAAATCATGACATTTGAAATGTAACGCTCCTTGCCGCAAGAGCCTAAACTTCACGGTACTCCTAGCCCACATGAGTTTAAAAGGTGAATGGCCCAAAAAAAAAAATTTGCAAATGAACTATGTTATGACTTGATCCCTTTTGTTATAGAGTACGTAGGCAGCTTAAATTAAAACTTTAAGTTCAGTCCAGGTTAAAAAAAATATTCAAAGTCCAAAAAAAAACCAAGTTAAAAAAAAAAATGGCAACGCCACAATGACGTTGCAGGTGTTTGACTTTATCGCATGCTAGTTTTACAGTAGAATGAAGGATGATCACCGGGGGCATAAACCATCAAAGTAGTTTTGTTTGCTCCTCATGATTGGTATCATCCCGCAAACTGGATCTCTGCTAGATCGTTGACGAAAAAGTTACACTCGTTCGATCCTTGACGAAGAAATTGCACCCGCTTGATCATTGGCGGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCTGTTTGATCGTTGGCGCTGCACCCGTTGGATTATTGGCGAAGACGTTGCACTTGCTCGATCCTTGGCGAAGAAATTGCACTCGCTCGATCCTTGGCAAAGAAATTGCACTCGCTCAATCGTCGGCGAAGAGGCTGCACTCGCTCGATCCTTTGGCGAAGAAGTTACACCCGCTCGATCATTGGTAGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCCGTTTGATCGTTGGCGCTGCACCCGTTGGATCATTGGCGAAGACGTTGCACTCGCTCGATCGTTGGCGAAGAAGTTGCACCCACTCGATTGTTGGCGAAGACGTTGCACCCCCTCGATCGTTAGCAAACAAGTTGAACCCCCTCGATCGTTGGCGAAGAAGTTGCACACGCTCGATCTTTCGCGAAGACGTTGCACCCCATCGATCATTGGCAAACAAGTTGAACCCCCTCGATCGTTGGTGAAGAAGTTGCACCTGCTCGATCATTGGCAGACAAGCTGCACCCATTGGATCGTTGGCGAAGAGGTTGCACCCGCTTGATCGTTGGCGAAGAAGTTGCACCCGCTCGATCGTTGGTGAAGATGAAATGACTGCAAGGAAGGTGACCACCGCTGACCTAGGAACACCTCAACCAGCGGAAGAGATGTTAACCTACTCAGGAATGAAATCACTGCAAGGAAGGTGATCACTGCTGACCTAGGAACACCTCAACCAGTGGAAGAGACGTTAACCTACTCAGCAGTGAAATCACTGCAAGAAAGGTGACCATGCTGACCTAGGAACACCTCAACCAGTGGAAGAAGCGTTAACCTACTCAGCAATGAAATCACTGTAGGGAAGGTGACCACTGTGACCTAGGAACACTTCAACCAGTGGAAAAAAAAGCGTTAACCTACTCAGCAGTGAAATCACTGCAAGGAAGGTGATCATGCTGACCTAGGGACACTTCAACCAGCGGAAGCAGCGTTAACCTACTCAGCAATGAAATCACTGTAGGGAAGGTGACCACTGTGACCTAGGAACACCTCAACTAGCAGAAGAGGCATTAACCTACTCAACAATGAAATCACTACAAGGAAGGTGACCACTGCTTAATGTTTGTCAAAGATAATGGACCGCTGCAAGGAAAAAGAATGTTGTTAGAAGCCGTACATTTGAAAAAAAAAAAAAAAGTGGAAGCCAAAGTTATATATATAAAAAAAAAGGTGCTGGCATTGAAAAGAAAAAGAAAAAAAAAAGAAGTATTGTTGGCGAAGTCTTAAAAGAAATTGCCAACTAAAAAAAAGAAAAAAGTAAAGAAAAATAAGATGAAAAAAAAAATAGCAACACCAAGAAGGCAGCCGTATTGGCCGCCGCCTGCATATGAAGAAGAGAGAGAAAATCAATCTCTCTTCTTCCACATTCAGCTTTACTAGAGGACCTCAAACCTCCATCATCGAACCAAGCGAGCCACCAGACCACGATATGTGGTGGCATAGCAAAGGTGACTGCTCAGACTTTGCCGACGACAATCTTTAGGAGCGTTGAAACCCACACCGGTAGACTGACCCAAGCCATCTCTTTCTTGAGCAATATTGCAAAACGGTGGAAATCACACCGACGAACCAGGCGGCGATGTTGCAACCACGAATGCCACGCCATCTACTTTCGCCCCGACGCCTCACTTGTTCACAGTGTCTTCCATCATATATAATTTGGAGAATATAATTCTTGGATTTCATTGATTTCAAATCAATTATTTTTTTGGGAATATGAGATTCTTTCTTTTTTCCTACAATTCAAATCGGGAAGGATTTGCTTGATTGCAAATCAAAATTTTTGGTACTGCCTTATTTGACAGCAACCCAATTCAATGTCTTCAAGAAAAAGCTCCAATTCCTGTTTTACTCTAAACTCGGATGAATCAGGGGCCATTTACGTAAATAAGCTTGGAATCAAATCTCTCCAGCGGTCCAAATCCAAATAAAATCAATTTGGAAGCAAATGTCACCAAATTCCTCTTGCTTTTCGTTGAAATGTTAGAGTTTAAATCTTACCCCAAATCAAAATTAGGGTAAATTTTAAGCATTATTCTGAAGCAAAATTTGAATATATTATTTGTCAAATTCAAATCAAAGTCTTGGCCACATTATGGGTTTTAATTTTGGTAACCAAATAAAATGGTGAATCAAAGTTAAATGTCTCCATAGTTGGAAAATATTATTTCAAATTTTACGAGGAAAATGTTGGTTTGACATATTATTTATGGAGTACACAAATCAAGTCCGTAAATCAAGTTTAAATATCTCCAAAATCAAAGTCAAAGTCAAATAAGGATGACAATTTTAAATGTCATACTTTATCCCATTCTACTCCAAAATTCAAAGTATGAGTAGAAATTCAGTCAGAGAATAAAAAAATCAAAATTTGGTTCTAAAATTGATATGTACATTGGATCAAAATTATAAACCATGCTCTATATTTTTTTTTGTTAAATGTCACATATTACTTTTTCAAGATTCACACACTCATATACGTGTATAAATCTTGAAAAGGGGCCATCTGTTGAAAAAGAAATTTCGGCCAATTACTAAATGCCACGTCACAATTTAAGGACAAATTTACATGTGCCCTGAATTAATTTAAAATTGGTGAATCAAACTATGACACGTGTCTCTTTTATAGCCGTCCGATGAAATCAACTAATTGGATCAAATCCATAAAGAGACTGGGCCCAAGCCTATCGCCCATTAAGCTCATGGCCCATATAAGACCCACCAAAACTCTATAAATAGAGGGGATCACCCTCATTCGGGAAGGGGGAGCAAGAAGAAAGAAGAAGAGCCAAAAGAAAGGGAGCAAGAAGAAAGAAAAAGAGCCAGAAGAAAGGGGGAAGAAGCCAAGAGGAAGAAGAATTGGAAGAGGTCTAGAAGAATTGGAGATTAAGAGCACTCATAGTCGTCTACAATCCTTCGGAGTCAAGTGAAGCATCCAAAGCCAACAAGATTGGAAGAATTAGAGACTTGAAGAGTACGAGACTTCAAGAACATTGAATATTCTCTCCCAAGCAACAATACTTGAAGATTCCATCAAACTGCAAAGTCTTCATCAAAACTGACTGAAGAACTTCAACAAGCAGTGCAAAGCCGAAGAACTTCACAAGCTACAAAGCCGAAGAACTTCAACAAGCTGCAAAGTCGAAGCACTCCACAAGCTGCAAAGTCGAAACACTTCACAAGCTGCAAAGTCAAAGAACTTCACCAGCGCGAAGCCGAAGACTGTATAAAGTCTTCATCAAGCTACAAGCCAAAGAACTTTATCAACAAGTCAAAGTCTTCAAACAAACTCAAAGTTGAAGACTTCGAAGATCCTTCGTCAAGCGAGAAGCTGAAGGCATCAACCATTCAATAA
mRNA sequence
ATGGCATCCAAGGAAGCTCCATCCACCATGGAAGAAAAGTTGGTAAAAATGGACAAGAAACTTAACATGCTGACAAAAATGGTCGAAAAGAGGGACTATAAGATTGCTTCCCTTATGAACCAGATCGAAATTCAAGATGTTGCAAATGTCGCTGAGTCAAGCCAAAATCATGTTGTGAAAGTCAGCGACAAAAGAAAGAACATCATGCAGGAAAAACAACATGAACATTCTGTTTCAATTGCTTCACTGTCTGTCCAGCAGCTCCAAGATATGATCACAAATTCGATCAGAGCTCAATATGGCGGACCTTCTCAAAACTCCTTCACGTACTCCAAACCGTATACCAAGAGGATCGACAACTTGATAATGCCTATGGGATATCAGCCTCCAAAGTTCCAACAATTTGACGAAAAGGGCAATCCTAAGCAACATATTGCTCATTTCGTTGAGACATGTGAAAATGCTGGCACTCGAGGAGATCTACTCGTCAAAGAATTCGTTCGAAAACTAAAAGGTAATGCTTTTGACTGGTATACAGATTTAGAGCCTGAATCCATCAACGGTTGGGAGCAATTGGAAAGAGAGTTCTTAAATCGCTTCTACAGCACAAGTCTCGACTGCAAAGATCGACTAATGGAATTATCTGCTATCGAGTTGTGCATTCAAGGTATGCATTGGGGACTTCTCTATATCCTTCAAGGAATAAAACCCCGCACGTTTGAAGAATTAGCAACCCGCGGCCACGACATGGAGCTAAGTATTGCTAATCGAGGAGATAAAGATCTTTTAGTCCCTAACTTAAAGAGAATAAGCAAGAATACTGAGATGACTCTTGAAGAATCAATGGTTATCAACACAACTTCTCTCAAGTCTTCTTCAAAAAGAAAGGATAAGAAGGTCGAAAAACGACAAGAGAATGAAAGGCGTCGTCTAACTCTAAAGGATAGACAAGAAAAAGTTTATCCTTTTCCTGATTCTGATATCTCTGACATGTTAGAACAACTACTAAAAATGCAATTGATCGAACTCCCTGAATGCAAACGACCAGAAGACATGGGAAAAGTAAACGATCCAAACTACTGCAAATACCATAGAGTTGTTGGCCATCCAGTGGGGAAATGCTTCGTCTTAAAGGAACTAATTCTAAAGTTAGCTCAAGAAGGAAAAATCGAGCTGAACATTGATGAATTAGCTCAAGCAAATCATATTGGAGTAGCGACAAACACATGCAATCAAATATGCTCAAAGACATTTCATGGTCAAAATGTGGAAGAACTTGCAACTACACATTACATAAATGTGAAAGAAGTCGACGATTCAAAAGAAATCAAACAAAAGACTTCTGTCTTTGATCGCATCAAGCCTTCAACTACTCGAGTTTCAGTCTTCCAAAGAATGAGTATGGCCGCGATGAAAGAAGAAAATCAATGTTTGACTCGACCTTCAGTCTTCCAAAGGTTAAGTGTCTCCACATCAAGAAAAAATCAAACTTCAACATATACTTTGGATTGTCTTAAAGTTGGTGCAAATGATCGAAACAAAGGAAGAATGAAGATCTTGGAGATGAAAATATTTGATGGCAAAAGGATTCACAACCTTGCTTCACGTATGAAGAGAAAATTATCTCTTGTCATAAAGCCTAAACTTCACGGTACTCCTAGCCCACATGAGTTTAAAAGAAATTGCACCCGCTTGATCATTGGCGGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCTGTTTGATCGTTGGCGCTGCACCCGTTGGATTATTGGCGAAGACGTTGCACTTGCTCGATCCTTGGCGAAGAAATTGCACTCGCTCGATCCTTGGCAAAGAAATTGCACTCGCTCAATCGTCGGCGAAGAGGCTGCACTCGCTCGATCCTTTGGCGAAGAAGTTACACCCGCTCGATCATTGGTAGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCCGTTTGATCGTTGGCGCTGCACCCGTTGGATCATTGGCGAAGACGTTGCACTCGCTCGATCGTTGGCGAAGAAGTTGCACCCACTCGATTGTTGGCGAAGACGTTGCACCCCCTCGATCACAAGCTGCACCCATTGGATCGTTGGCGAAGAGGTTGCACCCGCTTGATCGTTGGCGAAGAAGTTGCACCCGCTCGATCGTTGGTGAAGATGAAATGACTGCAAGGAAGGTGACCACCGCTGACCTAGGAACACCTCAACCAGCGGAAGAGATGTTAACCTACTCAGGAATGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACACTTCAACCAGTGGAAAAAAAAGCGTTAACCTACTCAGCAGTGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACACCTCAACTAGCAGAAGAGGCATTAACCTACTCAACAATGAAATCACTACAAGGAAGATTCCATCAAACTGCAAAGTCTTCATCAAAACTGACTGAAGAACTTCAACAAGCAGTGCAAAGCCGAAGAACTTCACAAGCTACAAAGCCGAAGAACTTCAACAAGCTGCAAAGTCGAAGCACTCCACAAGCTGCAAAGTCGAAACACTTCACAAGCTGCAAAGTCAAAGAACTTCACCAGCGCGAAGCCGAAGACTGTATAAAGTCTTCATCAAGCTACAAGCCAAAGAACTTTATCAACAAGTCAAAGTCTTCAAACAAACTCAAAGTTGAAGACTTCGAAGATCCTTCGTCAAGCGAGAAGCTGAAGGCATCAACCATTCAATAA
Coding sequence (CDS)
ATGGCATCCAAGGAAGCTCCATCCACCATGGAAGAAAAGTTGGTAAAAATGGACAAGAAACTTAACATGCTGACAAAAATGGTCGAAAAGAGGGACTATAAGATTGCTTCCCTTATGAACCAGATCGAAATTCAAGATGTTGCAAATGTCGCTGAGTCAAGCCAAAATCATGTTGTGAAAGTCAGCGACAAAAGAAAGAACATCATGCAGGAAAAACAACATGAACATTCTGTTTCAATTGCTTCACTGTCTGTCCAGCAGCTCCAAGATATGATCACAAATTCGATCAGAGCTCAATATGGCGGACCTTCTCAAAACTCCTTCACGTACTCCAAACCGTATACCAAGAGGATCGACAACTTGATAATGCCTATGGGATATCAGCCTCCAAAGTTCCAACAATTTGACGAAAAGGGCAATCCTAAGCAACATATTGCTCATTTCGTTGAGACATGTGAAAATGCTGGCACTCGAGGAGATCTACTCGTCAAAGAATTCGTTCGAAAACTAAAAGGTAATGCTTTTGACTGGTATACAGATTTAGAGCCTGAATCCATCAACGGTTGGGAGCAATTGGAAAGAGAGTTCTTAAATCGCTTCTACAGCACAAGTCTCGACTGCAAAGATCGACTAATGGAATTATCTGCTATCGAGTTGTGCATTCAAGGTATGCATTGGGGACTTCTCTATATCCTTCAAGGAATAAAACCCCGCACGTTTGAAGAATTAGCAACCCGCGGCCACGACATGGAGCTAAGTATTGCTAATCGAGGAGATAAAGATCTTTTAGTCCCTAACTTAAAGAGAATAAGCAAGAATACTGAGATGACTCTTGAAGAATCAATGGTTATCAACACAACTTCTCTCAAGTCTTCTTCAAAAAGAAAGGATAAGAAGGTCGAAAAACGACAAGAGAATGAAAGGCGTCGTCTAACTCTAAAGGATAGACAAGAAAAAGTTTATCCTTTTCCTGATTCTGATATCTCTGACATGTTAGAACAACTACTAAAAATGCAATTGATCGAACTCCCTGAATGCAAACGACCAGAAGACATGGGAAAAGTAAACGATCCAAACTACTGCAAATACCATAGAGTTGTTGGCCATCCAGTGGGGAAATGCTTCGTCTTAAAGGAACTAATTCTAAAGTTAGCTCAAGAAGGAAAAATCGAGCTGAACATTGATGAATTAGCTCAAGCAAATCATATTGGAGTAGCGACAAACACATGCAATCAAATATGCTCAAAGACATTTCATGGTCAAAATGTGGAAGAACTTGCAACTACACATTACATAAATGTGAAAGAAGTCGACGATTCAAAAGAAATCAAACAAAAGACTTCTGTCTTTGATCGCATCAAGCCTTCAACTACTCGAGTTTCAGTCTTCCAAAGAATGAGTATGGCCGCGATGAAAGAAGAAAATCAATGTTTGACTCGACCTTCAGTCTTCCAAAGGTTAAGTGTCTCCACATCAAGAAAAAATCAAACTTCAACATATACTTTGGATTGTCTTAAAGTTGGTGCAAATGATCGAAACAAAGGAAGAATGAAGATCTTGGAGATGAAAATATTTGATGGCAAAAGGATTCACAACCTTGCTTCACGTATGAAGAGAAAATTATCTCTTGTCATAAAGCCTAAACTTCACGGTACTCCTAGCCCACATGAGTTTAAAAGAAATTGCACCCGCTTGATCATTGGCGGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCTGTTTGATCGTTGGCGCTGCACCCGTTGGATTATTGGCGAAGACGTTGCACTTGCTCGATCCTTGGCGAAGAAATTGCACTCGCTCGATCCTTGGCAAAGAAATTGCACTCGCTCAATCGTCGGCGAAGAGGCTGCACTCGCTCGATCCTTTGGCGAAGAAGTTACACCCGCTCGATCATTGGTAGAGAAGCTGCACCCATTGGATCATTGGCAAAGAAGTTGCACCCGTTTGATCGTTGGCGCTGCACCCGTTGGATCATTGGCGAAGACGTTGCACTCGCTCGATCGTTGGCGAAGAAGTTGCACCCACTCGATTGTTGGCGAAGACGTTGCACCCCCTCGATCACAAGCTGCACCCATTGGATCGTTGGCGAAGAGGTTGCACCCGCTTGATCGTTGGCGAAGAAGTTGCACCCGCTCGATCGTTGGTGAAGATGAAATGACTGCAAGGAAGGTGACCACCGCTGACCTAGGAACACCTCAACCAGCGGAAGAGATGTTAACCTACTCAGGAATGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACACTTCAACCAGTGGAAAAAAAAGCGTTAACCTACTCAGCAGTGAAATCACTGCAAGGAAGGGAAGGTGACCACTGTGACCTAGGAACACCTCAACTAGCAGAAGAGGCATTAACCTACTCAACAATGAAATCACTACAAGGAAGATTCCATCAAACTGCAAAGTCTTCATCAAAACTGACTGAAGAACTTCAACAAGCAGTGCAAAGCCGAAGAACTTCACAAGCTACAAAGCCGAAGAACTTCAACAAGCTGCAAAGTCGAAGCACTCCACAAGCTGCAAAGTCGAAACACTTCACAAGCTGCAAAGTCAAAGAACTTCACCAGCGCGAAGCCGAAGACTGTATAAAGTCTTCATCAAGCTACAAGCCAAAGAACTTTATCAACAAGTCAAAGTCTTCAAACAAACTCAAAGTTGAAGACTTCGAAGATCCTTCGTCAAGCGAGAAGCTGAAGGCATCAACCATTCAATAA
Protein sequence
MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWEQLEREFLNRFYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKRISKNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQENERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGVATNTCNQICSKTFHGQNVEELATTHYINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQCLTRPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIFDGKRIHNLASRMKRKLSLVIKPKLHGTPSPHEFKRNCTRLIIGGEAAPIGSLAKKLHLFDRWRCTRWIIGEDVALARSLAKKLHSLDPWQRNCTRSIVGEEAALARSFGEEVTPARSLVEKLHPLDHWQRSCTRLIVGAAPVGSLAKTLHSLDRWRRSCTHSIVGEDVAPPRSQAAPIGSLAKRLHPLDRWRRSCTRSIVGEDEMTARKVTTADLGTPQPAEEMLTYSGMKSLQGREGDHCDLGTLQPVEKKALTYSAVKSLQGREGDHCDLGTPQLAEEALTYSTMKSLQGRFHQTAKSSSKLTEELQQAVQSRRTSQATKPKNFNKLQSRSTPQAAKSKHFTSCKVKELHQREAEDCIKSSSSYKPKNFINKSKSSNKLKVEDFEDPSSSEKLKASTIQ
Homology
BLAST of Moc07g03790 vs. NCBI nr
Match:
KAA0056121.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])
HSP 1 Score: 617.5 bits (1591), Expect = 1.9e-172
Identity = 354/625 (56.64%), Postives = 433/625 (69.28%), Query Frame = 0
Query: 9 TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNI 68
T E ++ +++KK+NML K+VE+RDY+IA L N IE +D AESS H VK +DK K +
Sbjct: 91 TSENRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRD---AAESSHKHTVKNTDKGKAV 150
Query: 69 MQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGYQ 128
MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q Y KPYTKRIDNL MP GYQ
Sbjct: 151 MQESQPQNSTSIASLSVQQLQEMIASSIKMQYGGPAQTFSLYFKPYTKRIDNLRMPNGYQ 210
Query: 129 PPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESING 188
PPKFQQFD KGNPKQH+AHF++TCE AGTRGDLLVK+FVR LKGNA DWY DLEPESI+
Sbjct: 211 PPKFQQFDGKGNPKQHVAHFIKTCETAGTRGDLLVKQFVRTLKGNACDWYIDLEPESIDN 270
Query: 189 WEQLEREFLNRFYST------------------------------SLDCKDRLMELSAIE 248
WEQLER+FLNRFYST SLDCKDRL ELSA+E
Sbjct: 271 WEQLERDFLNRFYSTRHIVSMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVE 330
Query: 249 LCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS 308
+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIANRG KD L+P L
Sbjct: 331 MCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIANRGAKDFLIPKSRSDKNELDDTK 390
Query: 309 KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISD 368
K ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPFPDSD++D
Sbjct: 391 KIANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFPDSDVAD 450
Query: 369 MLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI 428
MLEQLL+ QLI+LPECKRPE GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KI
Sbjct: 451 MLEQLLENQLIQLPECKRPEQAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKI 510
Query: 429 ELNIDELAQANH-IGVATN----------------TCNQICSKTFHGQNVEEL------- 488
EL+IDE+AQ NH I + +N T + ++F + EE+
Sbjct: 511 ELDIDEVAQTNHAIEMTSNPIKGKDEDFLQLRRSITLAEFLPRSFLEDDPEEILEVTACH 570
Query: 489 ------ATTHYINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQC--- 548
+Y + KEV++S EI Q+TSVFDRIKPSTTR SVFQR+S+A +EENQC
Sbjct: 571 AASIVEVDNNYGSSKEVNNSNEINQRTSVFDRIKPSTTRSSVFQRLSVATKEEENQCPTF 630
BLAST of Moc07g03790 vs. NCBI nr
Match:
TYK03695.1 (retrotransposon gag protein [Cucumis melo var. makuwa])
HSP 1 Score: 587.0 bits (1512), Expect = 2.8e-163
Identity = 353/709 (49.79%), Postives = 428/709 (60.37%), Query Frame = 0
Query: 9 TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNI 68
T E ++ +++KK+NML K+VE+RDY+IA L N IE +D AESS H VK +DK K +
Sbjct: 91 TSENRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRD---AAESSHKHTVKNTDKGKAV 150
Query: 69 MQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGYQ 128
MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q YSKPYTKRIDNL MP GYQ
Sbjct: 151 MQESQPQNSTSIASLSVQQLQEMIASSIKTQYGGPAQTFSLYSKPYTKRIDNLRMPNGYQ 210
Query: 129 PPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESING 188
PPKFQQFD KGNPKQH+AHF+ETCE AGTRGDLLVK+FVR LKGNAFD Y DLEPESI+
Sbjct: 211 PPKFQQFDGKGNPKQHVAHFIETCETAGTRGDLLVKQFVRTLKGNAFDLYMDLEPESIDN 270
Query: 189 WEQLEREFLNRFYST------------------------------SLDCKDRLMELSAIE 248
WEQLER+FLNRFYST SLDCKDRL ELSA+E
Sbjct: 271 WEQLERDFLNRFYSTRRIVSMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVE 330
Query: 249 LCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS 308
+C QGMHWGLLYILQGIKPRTFEELATR HDMELSI NRG KD L+P L
Sbjct: 331 MCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIPNRGAKDFLIPKSRSDKNELNDTK 390
Query: 309 KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISD 368
K ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPF DSD++D
Sbjct: 391 KIANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFSDSDVAD 450
Query: 369 MLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI 428
MLEQLL+ QLI+LP+CKRP+ KV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KI
Sbjct: 451 MLEQLLENQLIQLPKCKRPKQAEKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKI 510
Query: 429 ELNIDELAQANHIGVATN------------------------------------------ 488
ELNIDE+AQ NH+ +
Sbjct: 511 ELNIDEVAQTNHVAIEMTSNVPPLTQLDDQRKSLIQFGTSILFQQRIVTINSQNKEAHGK 570
Query: 489 ------------------------------------------------------------ 548
Sbjct: 571 DDDEGWITVTRQKGRQPNSIQKESQFHQKYAKGSISHKKKGRRNKKMWNPKPIKGKDEDF 630
BLAST of Moc07g03790 vs. NCBI nr
Match:
KAA0032121.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])
HSP 1 Score: 555.4 bits (1430), Expect = 9.0e-154
Identity = 332/635 (52.28%), Postives = 423/635 (66.61%), Query Frame = 0
Query: 9 TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNI 68
T E + +M++K+N L K+VE+RD++IA+L +Q++ +ESSQ VVK +DK KN+
Sbjct: 90 TAEATMAEMERKINFLMKVVEERDHEIAALKDQMK---ACETSESSQTPVVKATDKGKNV 149
Query: 69 MQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGY 128
++E Q + SVS+ASLSVQQLQDMI NSIRAQYGGP Q SF YSKPYTKRIDNL MP+GY
Sbjct: 150 VEENQPQQQSVSVASLSVQQLQDMIANSIRAQYGGPPQTSFMYSKPYTKRIDNLRMPLGY 209
Query: 129 QPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDW----YTDLEP 188
QP KFQQFD KGNPKQHI HFVETCENAG+RGD LV++FVR LKGNAF+ + +E
Sbjct: 210 QPLKFQQFDGKGNPKQHIVHFVETCENAGSRGDQLVRQFVRSLKGNAFECTRRVVSMMEL 269
Query: 189 ESINGWEQLEREFLNRFYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEEL 248
+ + +++NR+ + SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRTFEEL
Sbjct: 270 TNTQRKGEPVIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRTFEEL 329
Query: 249 ATRGHDMELSIANRGDKDLLVP----------NLKRISKNTEMTLEESMVINTTSLKSSS 308
ATR HDM+LSIANRG KD LV + K+I+ N L ESM++ T LKS S
Sbjct: 330 ATRAHDMKLSIANRGVKDFLVQRTRSDKNEINDTKKIANN---VLNESMLVQETPLKSFS 389
Query: 309 KRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDM 368
KRK+ K ++ + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE +
Sbjct: 390 KRKETKHKRNHDGDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPEQV 449
Query: 369 GKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGV------- 428
GKV+DPNYCKYHRV+ H V KCFVLKELI KLA+E KIEL+IDE+AQ NH+ V
Sbjct: 450 GKVDDPNYCKYHRVISHLVEKCFVLKELIRKLARENKIELDIDEVAQTNHVAVNMTSSVP 509
Query: 429 ---------------------------ATNTCN----------------QICSKTF---H 488
T T N + ++F H
Sbjct: 510 LSILLYDQRKSLIQFGTFEPILVRFQQKTMTSNSQNKEEPSEDEGEEWIEFLPRSFLEDH 569
Query: 489 GQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMA 548
+ + E+ H Y + +E+D+S EIKQ+TSVFD IKP TTR SVFQR+SMA
Sbjct: 570 PEEILEVTACHTTSIVEVDNNYDSYEEMDNSNEIKQRTSVFDCIKPLTTRSSVFQRLSMA 629
Query: 549 AMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF 556
KEENQC T + S F+RLS+S S+K++ STYT D LK+ ND+ + MK L+ K F
Sbjct: 630 TKKEENQCPTFTYAQTSAFKRLSISISKKHRPSTYTFDRLKM-TNDQQQREMKTLKAKPF 689
BLAST of Moc07g03790 vs. NCBI nr
Match:
KAA0033746.1 (retrotransposon gag protein [Cucumis melo var. makuwa])
HSP 1 Score: 553.9 bits (1426), Expect = 2.6e-153
Identity = 344/701 (49.07%), Postives = 423/701 (60.34%), Query Frame = 0
Query: 1 MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVK 60
M A +TMEE M++K+N L K E+RD++IA+L +Q++ ESSQ VVK
Sbjct: 86 MVDVTAEATMEE----MERKINFLMKNFEERDHEIAALKDQMK---ACKTGESSQTPVVK 145
Query: 61 VSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRID 120
+DK KN++QE Q + SVS+ASLSVQQLQDMI NSIRAQYGGP Q SF YSK YTKRID
Sbjct: 146 ATDKGKNVVQENQPQQQSVSVASLSVQQLQDMIANSIRAQYGGPPQTSFMYSKSYTKRID 205
Query: 121 NLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYT 180
NL MP+GYQPPKFQQFD KGNPKQHIAHFVETCENAG+RGD LV++FVR LKGNAF+WYT
Sbjct: 206 NLRMPLGYQPPKFQQFDGKGNPKQHIAHFVETCENAGSRGDQLVRQFVRSLKGNAFEWYT 265
Query: 181 DLEPESINGWEQLEREFLNRFYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRT 240
DLEPE + +++NR+ + SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRT
Sbjct: 266 DLEPEG-----EPVIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRT 325
Query: 241 FEELATRGHDMELSIANRGDKDLLVPNLKRISKN--------TEMTLEESMVINTTSLKS 300
FEEL+TR HDMELSIAN G KD LV KR KN L ESM++ T LKS
Sbjct: 326 FEELSTRAHDMELSIANIGAKDFLVQRTKRSDKNEINDTKKIANNILNESMLVQETPLKS 385
Query: 301 SSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPE 360
SKRK+ K E+ + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE
Sbjct: 386 FSKRKETKHERNYDGDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPE 445
Query: 361 DMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGV----- 420
GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KIEL+IDE+AQ NH+ V
Sbjct: 446 QAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLARENKIELDIDEVAQTNHVAVNMTSS 505
Query: 421 -----------------------------ATNTCN------------------------- 480
T T N
Sbjct: 506 VLPSILLYDQRESLIQFGTFEPILVRFQQKTMTSNSQNKEETSEDEGEEWIGVTHKKERQ 565
Query: 481 ------------------------------------------------------QICSKT 540
+ ++
Sbjct: 566 IGSVQTNSNFHQKHSKGNISHKKKGRRNKKMWKPKPIKGKDKDFFQPRRSINLAEFLPRS 625
Query: 541 F---HGQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQ 549
F H + + E+ H Y + +EVD+S EIKQ+T VF RIKP T R SVFQ
Sbjct: 626 FLEDHPEKILEVTACHTTSIVEVDNNYDSYEEVDNSNEIKQRTFVFHRIKPLTIRSSVFQ 685
BLAST of Moc07g03790 vs. NCBI nr
Match:
XP_031742032.1 (uncharacterized protein LOC116404025 [Cucumis sativus])
HSP 1 Score: 550.8 bits (1418), Expect = 2.2e-152
Identity = 290/444 (65.32%), Postives = 346/444 (77.93%), Query Frame = 0
Query: 1 MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVK 60
M+ A +E + +M++K+N+L K+V++RD++IA+L Q++ ++ AESSQ VVK
Sbjct: 82 MSVMMADVAVETAMAEMERKINLLMKVVDERDHEIAALKEQMQTRE---TAESSQTPVVK 141
Query: 61 VSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRID 120
V DK KN++QE Q + S S+ASLSVQQLQDMIT+SIRAQYGGPSQ SF YSKPYTKRID
Sbjct: 142 VDDKGKNVVQENQPQQQSTSVASLSVQQLQDMITSSIRAQYGGPSQTSFMYSKPYTKRID 201
Query: 121 NLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYT 180
NL MP+GYQPPKFQQFD KGNPKQH+AHFVETCENAG+RGD LV++FVR LKGNAF+WYT
Sbjct: 202 NLRMPLGYQPPKFQQFDGKGNPKQHVAHFVETCENAGSRGDQLVRQFVRSLKGNAFEWYT 261
Query: 181 DLEPESINGWEQLEREFLNRFYST------------------------------SLDCKD 240
DLEPESI WEQLE+EFLNRFYST SLDCKD
Sbjct: 262 DLEPESIESWEQLEKEFLNRFYSTRRTVSMMELTNTKQRKGEPVIDYINRWRALSLDCKD 321
Query: 241 RLMELSAIELCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVPNLKR 300
RL ELSA+E+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIA+RG KD LVP +K+
Sbjct: 322 RLTELSAVEMCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIASRGTKDFLVPEVKK 381
Query: 301 ISKN-------TEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVY 360
K + TL+ESMV+NTT LK SK K+ +VEK+ + +ERRRLTLK+RQEKVY
Sbjct: 382 DKKEMKGAEKIVKSTLKESMVVNTTPLK-FSKGKEARVEKKDDGSERRRLTLKERQEKVY 441
Query: 361 PFPDSDISDMLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELI 406
PFPDSDI+DMLEQLL+ QLI+LPECKRPE GKV+DPNYCKYHRV+ HPV KCFVLKELI
Sbjct: 442 PFPDSDIADMLEQLLEKQLIQLPECKRPEQAGKVDDPNYCKYHRVISHPVEKCFVLKELI 501
BLAST of Moc07g03790 vs. ExPASy TrEMBL
Match:
A0A5A7URH1 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold17350G00010 PE=4 SV=1)
HSP 1 Score: 617.5 bits (1591), Expect = 9.3e-173
Identity = 354/625 (56.64%), Postives = 433/625 (69.28%), Query Frame = 0
Query: 9 TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNI 68
T E ++ +++KK+NML K+VE+RDY+IA L N IE +D AESS H VK +DK K +
Sbjct: 91 TSENRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRD---AAESSHKHTVKNTDKGKAV 150
Query: 69 MQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGYQ 128
MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q Y KPYTKRIDNL MP GYQ
Sbjct: 151 MQESQPQNSTSIASLSVQQLQEMIASSIKMQYGGPAQTFSLYFKPYTKRIDNLRMPNGYQ 210
Query: 129 PPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESING 188
PPKFQQFD KGNPKQH+AHF++TCE AGTRGDLLVK+FVR LKGNA DWY DLEPESI+
Sbjct: 211 PPKFQQFDGKGNPKQHVAHFIKTCETAGTRGDLLVKQFVRTLKGNACDWYIDLEPESIDN 270
Query: 189 WEQLEREFLNRFYST------------------------------SLDCKDRLMELSAIE 248
WEQLER+FLNRFYST SLDCKDRL ELSA+E
Sbjct: 271 WEQLERDFLNRFYSTRHIVSMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVE 330
Query: 249 LCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS 308
+C QGMHWGLLYILQGIKPRTFEELATR HDMELSIANRG KD L+P L
Sbjct: 331 MCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIANRGAKDFLIPKSRSDKNELDDTK 390
Query: 309 KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISD 368
K ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPFPDSD++D
Sbjct: 391 KIANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFPDSDVAD 450
Query: 369 MLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI 428
MLEQLL+ QLI+LPECKRPE GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KI
Sbjct: 451 MLEQLLENQLIQLPECKRPEQAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKI 510
Query: 429 ELNIDELAQANH-IGVATN----------------TCNQICSKTFHGQNVEEL------- 488
EL+IDE+AQ NH I + +N T + ++F + EE+
Sbjct: 511 ELDIDEVAQTNHAIEMTSNPIKGKDEDFLQLRRSITLAEFLPRSFLEDDPEEILEVTACH 570
Query: 489 ------ATTHYINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMAAMKEENQC--- 548
+Y + KEV++S EI Q+TSVFDRIKPSTTR SVFQR+S+A +EENQC
Sbjct: 571 AASIVEVDNNYGSSKEVNNSNEINQRTSVFDRIKPSTTRSSVFQRLSVATKEEENQCPTF 630
BLAST of Moc07g03790 vs. ExPASy TrEMBL
Match:
A0A5D3BX77 (Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold863G00570 PE=4 SV=1)
HSP 1 Score: 587.0 bits (1512), Expect = 1.3e-163
Identity = 353/709 (49.79%), Postives = 428/709 (60.37%), Query Frame = 0
Query: 9 TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNI 68
T E ++ +++KK+NML K+VE+RDY+IA L N IE +D AESS H VK +DK K +
Sbjct: 91 TSENRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRD---AAESSHKHTVKNTDKGKAV 150
Query: 69 MQEKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGYQ 128
MQE Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q YSKPYTKRIDNL MP GYQ
Sbjct: 151 MQESQPQNSTSIASLSVQQLQEMIASSIKTQYGGPAQTFSLYSKPYTKRIDNLRMPNGYQ 210
Query: 129 PPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESING 188
PPKFQQFD KGNPKQH+AHF+ETCE AGTRGDLLVK+FVR LKGNAFD Y DLEPESI+
Sbjct: 211 PPKFQQFDGKGNPKQHVAHFIETCETAGTRGDLLVKQFVRTLKGNAFDLYMDLEPESIDN 270
Query: 189 WEQLEREFLNRFYST------------------------------SLDCKDRLMELSAIE 248
WEQLER+FLNRFYST SLDCKDRL ELSA+E
Sbjct: 271 WEQLERDFLNRFYSTRRIVSMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVE 330
Query: 249 LCIQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRIS 308
+C QGMHWGLLYILQGIKPRTFEELATR HDMELSI NRG KD L+P L
Sbjct: 331 MCTQGMHWGLLYILQGIKPRTFEELATRAHDMELSIPNRGAKDFLIPKSRSDKNELNDTK 390
Query: 309 KNTEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISD 368
K ++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPF DSD++D
Sbjct: 391 KIANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFSDSDVAD 450
Query: 369 MLEQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKI 428
MLEQLL+ QLI+LP+CKRP+ KV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KI
Sbjct: 451 MLEQLLENQLIQLPKCKRPKQAEKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKI 510
Query: 429 ELNIDELAQANHIGVATN------------------------------------------ 488
ELNIDE+AQ NH+ +
Sbjct: 511 ELNIDEVAQTNHVAIEMTSNVPPLTQLDDQRKSLIQFGTSILFQQRIVTINSQNKEAHGK 570
Query: 489 ------------------------------------------------------------ 548
Sbjct: 571 DDDEGWITVTRQKGRQPNSIQKESQFHQKYAKGSISHKKKGRRNKKMWNPKPIKGKDEDF 630
BLAST of Moc07g03790 vs. ExPASy TrEMBL
Match:
A0A5A7SRE2 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold452G00210 PE=4 SV=1)
HSP 1 Score: 555.4 bits (1430), Expect = 4.3e-154
Identity = 332/635 (52.28%), Postives = 423/635 (66.61%), Query Frame = 0
Query: 9 TMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNI 68
T E + +M++K+N L K+VE+RD++IA+L +Q++ +ESSQ VVK +DK KN+
Sbjct: 90 TAEATMAEMERKINFLMKVVEERDHEIAALKDQMK---ACETSESSQTPVVKATDKGKNV 149
Query: 69 MQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGY 128
++E Q + SVS+ASLSVQQLQDMI NSIRAQYGGP Q SF YSKPYTKRIDNL MP+GY
Sbjct: 150 VEENQPQQQSVSVASLSVQQLQDMIANSIRAQYGGPPQTSFMYSKPYTKRIDNLRMPLGY 209
Query: 129 QPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDW----YTDLEP 188
QP KFQQFD KGNPKQHI HFVETCENAG+RGD LV++FVR LKGNAF+ + +E
Sbjct: 210 QPLKFQQFDGKGNPKQHIVHFVETCENAGSRGDQLVRQFVRSLKGNAFECTRRVVSMMEL 269
Query: 189 ESINGWEQLEREFLNRFYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRTFEEL 248
+ + +++NR+ + SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRTFEEL
Sbjct: 270 TNTQRKGEPVIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRTFEEL 329
Query: 249 ATRGHDMELSIANRGDKDLLVP----------NLKRISKNTEMTLEESMVINTTSLKSSS 308
ATR HDM+LSIANRG KD LV + K+I+ N L ESM++ T LKS S
Sbjct: 330 ATRAHDMKLSIANRGVKDFLVQRTRSDKNEINDTKKIANN---VLNESMLVQETPLKSFS 389
Query: 309 KRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPEDM 368
KRK+ K ++ + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE +
Sbjct: 390 KRKETKHKRNHDGDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPEQV 449
Query: 369 GKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGV------- 428
GKV+DPNYCKYHRV+ H V KCFVLKELI KLA+E KIEL+IDE+AQ NH+ V
Sbjct: 450 GKVDDPNYCKYHRVISHLVEKCFVLKELIRKLARENKIELDIDEVAQTNHVAVNMTSSVP 509
Query: 429 ---------------------------ATNTCN----------------QICSKTF---H 488
T T N + ++F H
Sbjct: 510 LSILLYDQRKSLIQFGTFEPILVRFQQKTMTSNSQNKEEPSEDEGEEWIEFLPRSFLEDH 569
Query: 489 GQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQRMSMA 548
+ + E+ H Y + +E+D+S EIKQ+TSVFD IKP TTR SVFQR+SMA
Sbjct: 570 PEEILEVTACHTTSIVEVDNNYDSYEEMDNSNEIKQRTSVFDCIKPLTTRSSVFQRLSMA 629
Query: 549 AMKEENQCLT----RPSVFQRLSVSTSRKNQTSTYTLDCLKVGANDRNKGRMKILEMKIF 556
KEENQC T + S F+RLS+S S+K++ STYT D LK+ ND+ + MK L+ K F
Sbjct: 630 TKKEENQCPTFTYAQTSAFKRLSISISKKHRPSTYTFDRLKM-TNDQQQREMKTLKAKPF 689
BLAST of Moc07g03790 vs. ExPASy TrEMBL
Match:
A0A5A7SUW1 (Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold239G002250 PE=4 SV=1)
HSP 1 Score: 553.9 bits (1426), Expect = 1.3e-153
Identity = 344/701 (49.07%), Postives = 423/701 (60.34%), Query Frame = 0
Query: 1 MASKEAPSTMEEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVK 60
M A +TMEE M++K+N L K E+RD++IA+L +Q++ ESSQ VVK
Sbjct: 86 MVDVTAEATMEE----MERKINFLMKNFEERDHEIAALKDQMK---ACKTGESSQTPVVK 145
Query: 61 VSDKRKNIMQEKQ-HEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRID 120
+DK KN++QE Q + SVS+ASLSVQQLQDMI NSIRAQYGGP Q SF YSK YTKRID
Sbjct: 146 ATDKGKNVVQENQPQQQSVSVASLSVQQLQDMIANSIRAQYGGPPQTSFMYSKSYTKRID 205
Query: 121 NLIMPMGYQPPKFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYT 180
NL MP+GYQPPKFQQFD KGNPKQHIAHFVETCENAG+RGD LV++FVR LKGNAF+WYT
Sbjct: 206 NLRMPLGYQPPKFQQFDGKGNPKQHIAHFVETCENAGSRGDQLVRQFVRSLKGNAFEWYT 265
Query: 181 DLEPESINGWEQLEREFLNRFYSTSLDCKDRLMELSAIELCIQGMHWGLLYILQGIKPRT 240
DLEPE + +++NR+ + SLDCKD+L ELSA+E+C QGMHW LLYILQGIKPRT
Sbjct: 266 DLEPEG-----EPVIDYINRWRALSLDCKDKLTELSAVEMCTQGMHWELLYILQGIKPRT 325
Query: 241 FEELATRGHDMELSIANRGDKDLLVPNLKRISKN--------TEMTLEESMVINTTSLKS 300
FEEL+TR HDMELSIAN G KD LV KR KN L ESM++ T LKS
Sbjct: 326 FEELSTRAHDMELSIANIGAKDFLVQRTKRSDKNEINDTKKIANNILNESMLVQETPLKS 385
Query: 301 SSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDMLEQLLKMQLIELPECKRPE 360
SKRK+ K E+ + +E+RR TL++RQ+KVYPFPDSD++DMLEQL++ QLI+LPECKRPE
Sbjct: 386 FSKRKETKHERNYDGDEKRRPTLRERQKKVYPFPDSDVADMLEQLIEKQLIQLPECKRPE 445
Query: 361 DMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIELNIDELAQANHIGV----- 420
GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KIEL+IDE+AQ NH+ V
Sbjct: 446 QAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLARENKIELDIDEVAQTNHVAVNMTSS 505
Query: 421 -----------------------------ATNTCN------------------------- 480
T T N
Sbjct: 506 VLPSILLYDQRESLIQFGTFEPILVRFQQKTMTSNSQNKEETSEDEGEEWIGVTHKKERQ 565
Query: 481 ------------------------------------------------------QICSKT 540
+ ++
Sbjct: 566 IGSVQTNSNFHQKHSKGNISHKKKGRRNKKMWKPKPIKGKDKDFFQPRRSINLAEFLPRS 625
Query: 541 F---HGQNVEELATTH----------YINVKEVDDSKEIKQKTSVFDRIKPSTTRVSVFQ 549
F H + + E+ H Y + +EVD+S EIKQ+T VF RIKP T R SVFQ
Sbjct: 626 FLEDHPEKILEVTACHTTSIVEVDNNYDSYEEVDNSNEIKQRTFVFHRIKPLTIRSSVFQ 685
BLAST of Moc07g03790 vs. ExPASy TrEMBL
Match:
A0A5D3D4X3 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001130 PE=4 SV=1)
HSP 1 Score: 547.7 bits (1410), Expect = 9.1e-152
Identity = 283/433 (65.36%), Postives = 334/433 (77.14%), Query Frame = 0
Query: 11 EEKLVKMDKKLNMLTKMVEKRDYKIASLMNQIEIQDVANVAESSQNHVVKVSDKRKNIMQ 70
E+++ +++KK+NML K+VE+RDY+IA L N IE +D AESS H VK +DK K +MQ
Sbjct: 61 EDRMAELEKKVNMLMKVVEERDYEIAFLKNHIESRD---AAESSHKHTVKNTDKGKAVMQ 120
Query: 71 EKQHEHSVSIASLSVQQLQDMITNSIRAQYGGPSQNSFTYSKPYTKRIDNLIMPMGYQPP 130
E Q ++S SIASLSVQQLQ+MI +SI+ QYGGP+Q YSKPYTKRIDNL MP GYQPP
Sbjct: 121 ESQPQNSTSIASLSVQQLQEMIASSIKTQYGGPAQTFSLYSKPYTKRIDNLRMPNGYQPP 180
Query: 131 KFQQFDEKGNPKQHIAHFVETCENAGTRGDLLVKEFVRKLKGNAFDWYTDLEPESINGWE 190
KFQQFD KGNPKQH+AHF+ETCE AGTRGDLLVK+FVR LKGNAFDWY DLEPESI+ WE
Sbjct: 181 KFQQFDGKGNPKQHVAHFIETCETAGTRGDLLVKQFVRTLKGNAFDWYIDLEPESIDNWE 240
Query: 191 QLEREFLNRFYST------------------------------SLDCKDRLMELSAIELC 250
QLER+FLNRFYST SLDCKDRL ELSA+E+C
Sbjct: 241 QLERDFLNRFYSTRRIVSMMELTNTRQQKGELVIDYINRWRALSLDCKDRLTELSAVEMC 300
Query: 251 IQGMHWGLLYILQGIKPRTFEELATRGHDMELSIANRGDKDLLVP-------NLKRISKN 310
QGMHWGLLYILQGIKPRTFEELATR HDMELSIANRG KD L+P L K
Sbjct: 301 TQGMHWGLLYILQGIKPRTFEELATRAHDMELSIANRGAKDFLIPKSRSDKNELDDTKKI 360
Query: 311 TEMTLEESMVINTTSLKSSSKRKDKKVEKRQE-NERRRLTLKDRQEKVYPFPDSDISDML 370
++ESMV++ T LKS SKRK+ K+E++ + +E+R+ TLK+RQEKVYPFPDSD++DML
Sbjct: 361 ANSVIKESMVVHATPLKSFSKRKETKIERKHDGDEKRQSTLKERQEKVYPFPDSDVADML 420
Query: 371 EQLLKMQLIELPECKRPEDMGKVNDPNYCKYHRVVGHPVGKCFVLKELILKLAQEGKIEL 406
EQLL+ QLI+LPECKRPE GKV+DPNYCKYHRV+ HPV KCFVLKELILKLA+E KIEL
Sbjct: 421 EQLLENQLIQLPECKRPEQAGKVDDPNYCKYHRVISHPVEKCFVLKELILKLAREKKIEL 480
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAA0056121.1 | 1.9e-172 | 56.64 | ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | [more] |
TYK03695.1 | 2.8e-163 | 49.79 | retrotransposon gag protein [Cucumis melo var. makuwa] | [more] |
KAA0032121.1 | 9.0e-154 | 52.28 | ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | [more] |
KAA0033746.1 | 2.6e-153 | 49.07 | retrotransposon gag protein [Cucumis melo var. makuwa] | [more] |
XP_031742032.1 | 2.2e-152 | 65.32 | uncharacterized protein LOC116404025 [Cucumis sativus] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7URH1 | 9.3e-173 | 56.64 | Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... | [more] |
A0A5D3BX77 | 1.3e-163 | 49.79 | Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf... | [more] |
A0A5A7SRE2 | 4.3e-154 | 52.28 | Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... | [more] |
A0A5A7SUW1 | 1.3e-153 | 49.07 | Retrotransposon gag protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaf... | [more] |
A0A5D3D4X3 | 9.1e-152 | 65.36 | Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... | [more] |
Match Name | E-value | Identity | Description | |