Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTCTATCTTTAGCACCATGGTGTTATAGACGCAAAAGATGTCTGCACACGTCTAGATGGCGTCGTGATGCTGCCTCCTCATCCAAAATGACCTTGCAGCCTGCTTCTTCAATTCTACTTCATTCTTGCCTCGTCCCGACCCGTTCTGATGTTGTTTTAGCTCCTAAATCCCATCACACCCAATAAATCTGACCACAAGTGTGTGAGTTCAGTTGAAGATAGCAAAAAGTAGAGTTTGTAAAATGTACCTCGTGCTCCAACTTCTCCTCTTATTTATAGAGAGAGCCTGCATATGTTGGTTCGAATGTATACTCAGTTGAATTTGATCGACTCTCGACCTGGTCGAGTTGGACCTAGTATAATCAGAAAATTTGACTCTGACTACTTGACTTACGTTGCTTTCATAGCCTCATTTCAAGTATACGATAATTTACTCCTAATCAGAATTATAATAAAATTTTTAAAAACTCAAAAGACCACGGAAAACTAAAATTGAGAGCAGCAAGGACAAAAATTGTAATTCATCATATACTTTAAATGAAAAGTATTGTCGGTCTAAGTAATTCTTCCCATTGTGAATCCATTTGGCAATAATACTGGATAAGATTTGTGAAGGATTGATTCAACATTTATTATTATTATTATTATTTTTATTTTTGTTGTTATTGTTATAAATTAAATCTTAAATATTCTGAATAGTCTTTCTGGCCGTCGTTTCATCAATAACTGCGAACACACAAGCAAAAAACTTTTATCCCTATAAATAGTTTGCTCATCGGTGGTATATTAAATAAATAAATAAAACAAGTTGCAAAAGCATGTACAATGGATAAACATGAGCATACGTAACATGTTCATCTCTGCGTCTTAATTTTTATATTAAATTCAGATTACGAAATTCTTCGTAATCCATTTTAATGTTTTGTTTGTTGAGAAAAAGTACTTCTTCGTTGAATTTTTTTAATACAGAAAATATAGATTTTTTTTTTGTTGGGTGAGAATTCAAATCCACTATTTTTCTCCGAGGAATATAGCAAAGAATCTTTTCGTGAGTAGATTTATATTTTATAAATGAAACAACTTTATAAGAATGAGTACTTCATTTAACGAAAGACACAATTCTCTGTAAGAAGATGTGACTATTTGAAAGTCACAAACATTATATATTAACGGATATTTTCTTATTATTTTTTTCCCTTTCAATTTTGGATAGAAAGAACTTTGATGGGTTTCTCATTCTCTTATTTTCTCTGTACAACTGTTTATATACTAGTCTTCGTTGAGTCCCGTGGCTAAACCATTGCATTCACATATTAATAATGATAATTGTTCGTGACATCTTGAGCATTATAGAGGGAACAAAATTTATCTCAAGAGAACAGTGTTACCATTTGTTTGGGGTTGTTTGGGGGTAAACGTACCCGGGACACGTGGAGGACCCTTATTCGTGTTTGAAGAAATTAGATAAAAAGGATTGCTTAGGTCCACCTTGGCCCCGCCGAGGTGACCGGGTTTGCCCCTTAAAAGATGAAGTATGGGCACCAAAGTTCGACCTGACTGGGGTCCGACCTGCTTGGAACCCGACAGGTCCGATATGAGAAAAGACATGTAACCGCCGGTCATGTGTGCTGTCGGATCCATCACCTATAAATAGAGGGCTTCATTTCACGCTCAGGTATCGAATCTCACCTCGAACTAAATAAGGAGTTCGATCTATACTGACTTGAGCGTCGGAGTGCTTACCCTCTTGTGCAGGTCCACTCTAGTGTTCAGGTCGGAACCGGAGATCGGGTTCGAGCTCGATTCGTGGAGAACCGTTGTGCAAATTCCTGCATAAACATTTGGCGCCGTCTGTGGGGAAGACATCTTAAGTCATCCCGATCTAAAAAAAAAATATACGCAAAAATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGTGAATGCTGATAACGGCCCTCGGCGAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCAAAACCCTTAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACCAGACAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATTATATGCGCCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTATAGCCCTGGGGGCACCCGATGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGAAGGAGTGGATTACAACTTGCGGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGGTGAAACACAGGTTCGATGAGCAGGTCGAGGCACTCAAAGCCAGGTGCGAGAAGAAGGAGAGCTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCAGCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTATTTAAAGGCATCATGGATTTTCAAGCGGCAACGGATGCAATAAAATACCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGTCGGCTAGGTCGATATCAACCTATTCTCAGCTGAGAAAGGAGTTCATAAGCCAATTCTCTTCTCGGCACTACGATAGGAAAACAACGACTCACCTCGCCACCATCAGACAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGCTCCATGAGGAGCAGCTGAAGGTTGCACACTGCTCCGATGATTCGGCCATGTACTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAGTCATTGATGGACAGGAGCTACTCCGAACCAAAACTGGCCGACCTGAGAAGCGGATCGACCAGAAGAAATTGAACTAGGAGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGACCATCCTTCTCCAGTAGCCGAATTGAGTATCGCAGGTCGGACGGTGGCCCCAACCGAAGCCGACCTTACGAACGTTATACCCCGACCATCATCCCAATCTCTGAAATACTTACAAACATTGAGGAGACTGAGATGGAAAAGCTCCTCAAGCGACCCGAGAAGCTCCGGGGAGACCCAGAAAAACGTAACAAAGACAAGGACTGCCGTTTTCATCGCGATCACGGCCACAATACGTCAAGTTGCTGGGAATTTAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACTTCAAAAAATTTGTGGGCAAACCAAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGTCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGGGCCCGAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGAGTTAGCTCGCGAGGCCAGACGCGAGGTATGTATCATCAGGGAGCAGAAGCCCACTTGCTCCATCAGTTTCGGCGATGCCGATCTAGAGGGGGTCCATTTGCCCCATAATGACGCGCTTGTGATCGCTCCTCTCATCGACCACGTCCTGGTCCGAAGAGTACTGATCGATGGAGGCGCATCTGCCAACATCTTGTGTCTCCCAACATATCTTGCCTTGGGATGGACCAGGTCACAGTTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATTGATCTGCCGGTAACTATCGGGCAAGATGCTACCCAGGTAAAGCAAATGGCCGAGTTCATAGTGATCGACGGCAGATCGGCCTATAACGCCATTTTCGGGAGACCCATAATCCACTCATTTCGGGCCGTCCCCTCCACACTGCATCAAGTCTTGAAGTACTCAACCCCGAATGGGGTGGGCACGGTCCCAGGTGAGCAAAAAATCTCACGGGAGTGTTATGCATCCGCGCTTAAAGGGTCGGCAGTATGCGCCTTGGAAGAACAAACCAATCGTGGCAAACTGCAGGAGTCAGAGGCCGACCTGCCAAATAAAGGCAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAAGTAAGCATAGGAACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATGCCTGGTATGGACCCGAAGATCATGGTGCATCGCCTCAACATAGACCCATCATTCCGACCTGTAAAGCAAAAAAGAAGACCTGTAAATAAAGAGAGGAGTGATGTAATTGTTGAGGAAGAAAACAAACTCTTAAAAGCTGAATATATACGAGAAATTCTGTATCCTGAGTGGCTCTCTAACGTTGTATTAGTTAAAAAATCCAACGGGAAGTGGAGGATGTGCGTGGACTTCACAAATTTAAATAAGGCATGCCCAAAAGATTGCTTCCCCCTTCCAAGGATCGACCAGCTCGTGGACGCGACAGCCGGGCACGAGTTACTCACCTTCATGGATGCCTACTCTGGATACAACCAAATTAAAATGCATTTGCCAGACCAAGATGACACCGCGTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTGATGCCTTTTGGCCTAAAGAATGCGGGTGCGACCTACCAAAGGATGGTGAACAAAATGTTTACCAAGCATATCGGTCGGAATATGGAAGTGTATGTAGACGACATGCTTGTCAAGAGCAAGTAGTCAAAGTCGCATCTTTCCGATCTGACCGAAGCCTTTGGAGTGCTACGCAAGTACCAAATGAAGCTCAATCCGACCAAGTGCGCCTTCGGAGTCTCCTCGGGAAAATTCCTTGGTTTTATGGTAAATAACCCTGGGATTGAGGCTAACCCAGAGAAGATAAAGGCCGTGATCGAGATGGAGGCACCCAAGACTGTAAAGCAGCTTCAGTGCCTCAATGGCAGGATTGCAGCCTTGAATAGATTCGTGTCAAGATCGACGAACAAATGTCTTCCGTTCTTCAAAGTTCTCAGGAAGAAGGGACCGTTTGAATGGACGGAAGAGTGTGAGCAAGCGCTAGGGCAGTTAAAAGACTATCTCTGCTCGGCACCTCTACTCGCAAAGCCATTACTAGGGGAAAAACTCTATTTGTACCTGGCGGTATCTGCTAGCGCCGTCAGCTCGGCACTGATCAAGCAGGAGGGTGCGAGCCAAAGCCCCGTTTATTACACCAGTAAGGCCATGACCGAGGTTGAGACCAGATATCCTCAAATGGAGAAGCTGGCCCTCGCTTTAGTCACCTCGGCCCGAAAACTCAGGCCGTACTTCCAAGCGCGCACAGTCGTAGTACTTACAAACTTGCCCCTGAAGAATATCTTCCTTAAGCCTGAAACATCAGGGCGTTTGATGAAGTGGGCATTGGAATTGAGCGAGTACGACATCCAGTTCGAGCCCAGAACTGCGATGAAAGAACAAGCAGTGGCCGATTTTATCGCAGAGCTCACTCCACCACCTCAGTTGGTCAAGTCCGACCTCCCGTGGACGATCTTCGTTGATGGATCTTCTAACGAGAGGGGGTGCGGGGCAGGAATCGTCTTGCTCGCACCAGGAGGTGAGCGATTCAAATATGCTCTGCGGTTCAACTTTCGGACGTCAAATAATGAGGCCGAGTACGAAGCACTCCTAGCAGGCCTGCACGTTGCCAAAGGACTGGGGGCTAATCACATAAAGGTCTTTAGTGACTCCCAGCTAATTGTAAATCAGATCAAGGAGGAGTACCAAGCGAAGGACCCCCGGATGGAAAAATATCTGAGCAAGGTCAGATCGCACCTCGCCCAGTTCGGGACTTACGAGGTAAGTCAAGTTCCAAGATCTGAGAACTCTAATGCAGATGCCTTGGCCAAATTGGCATCAGCATATGAGACCGACCTGGCTAGATCAGTCCTGGTCGAGATCTTGGACACTCATTCAATCTTGGAGCCAGATGTAATGGAGGTTAATACTCCATCACCTACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGTGCGAAGAGCAGCTCGGTTCACACTTCGAGAAGGAATGTTGTACCGACGTGGCTTCTCCCTACCTCTCCTCAAGTGTGTGACTCCCGAAGAAGGCCTTTACATTCTTAGGGAAGTCCATGAAGGGGTGTGTGGAAACCACTATGGCGCCAGGTCGTTGTCGGCCAAGGTGGTTCGACAAGGGTACTATTGGCCCAGTGTCGAGCAGGATGCAAAGCAGTTCGTGAAAGCTTGTGACAACTGCCAGCGTTTCGCAAACATTATCCATCAACCTCCCGAACTGCTCACCCCCATCTCGGCCCCATGGCCATTCGCGCAGTGGGGGGTAGACATCATAGGACATTTTCCTCTGGGCAAAGGACATACAAAGTTCGCCGTTGTCGCTGTGGACTACTTCACTAAGTGGGCTGAAGCCGAGGCACTATCCCACATCACAGAGTCCAGAGTCACGTGTGTGCCGCTTTGGCATACCAAATGATATCGTGACAGACAACGGAAAACAATTTGACAATGCAAAGTTCAAGGACTTTTGCAGAAAACTTGGTATAAGCCACATCAGTTCGTCCCTTGCGCATCCAAAAGCGAATGGACAAGTTAAAGCAGTAAACAAGATCATAAAGCGAGGACTCAAGCTAAGGTTGGACTCCAGAAAGGGAAGATGGGCCGGGGAGCTACCTGAGGTTCTGTGGTCATATCGAACCACCCCAAGGGAGTCAACTGGTGAAACTCCGTTCTCGCTAGCCTTTGGTTCCGAAGCTGTTGTACCAGTCGAGATCGGCATACCAACAAACAGGGTAGAACAGTACGAGCCAACAAAGAACGAGGAAGAGCTACTTCTTAACCTGGACTTATTGGAAGGGAAAAAGGAAATGGCTCAGCTGCGCTTAGCAGAGTATCAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTAGTCTTAAGAAAAATTCAGAGTCATGTCGGCACCCTGGACCCAAGTTGGGAGGGACCATTCGAAGTCAAAGGCATAGTCCGACCTGGAACTTATATGCTGGCTGACCTAGAAGGAAGAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCTTGAAATGTCAAAATAGTCCCAAAATGGGCTTGTAAAAATTTTCAAGGGGCGGAACTTTCAATAAATGAGGTTATTTAATTTCATAACTCCGAGTTCGATTAGAAATTAAATGGGGGCCACAGACTCCTATAAGACTTAAAATCGGCAAAGTAAGGGCAAAAATTGATCAAATTCGATCCTCAAAACCCACGGGTTCGAGGTGCGATGTCAAAATCAATTACGAACCAAAATTCAATCCTTTGAACGCTTAAGTTAAAGGTGCGATGTTGAAGGTTTAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
mRNA sequence
ATGCTTCTATCTTTAGCACCATGGTGTTATAGACGCAAAAGATGTCTGCACACGTCTAGATGGCGTCGTGATGCTGCCTCCTCATCCAAAATGACCTTGCAGCCTGCTTCTTCAATTCTACTTCATTCTTGCCTCGTCCCGACCCGTTCTGATGTTGTCCACTCTAGTGTTCAGGTCGGAACCGGAGATCGGGTTCGAGCTCGATTCGTGGAGAACCGTTGTGCAAATTCCTGCATAAACATTTGGCGCCGTCTGTGGGGAAGACATCTTAAGTCATCCCGATCTAAAAAAAAAATATACGCAAAAATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGTGAATGCTGATAACGGCCCTCGGCGAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCAAAACCCTTAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACCAGACAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATTATATGCGCCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTATAGCCCTGGGGGCACCCGATGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGAAGGAGTGGATTACAACTTGCGGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGGTGAAACACAGGTTCGATGAGCAGGTCGAGGCACTCAAAGCCAGGTGCGAGAAGAAGGAGAGCTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCAGCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTATTTAAAGGCATCATGGATTTTCAAGCGGCAACGGATGCAATAAAATACCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGTCGGCTAGGTCGATATCAACCTATTCTCAGCTGAGAAAGGAGTTCATAAGCCAATTCTCTTCTCGGCACTACGATAGGAAAACAACGACTCACCTCGCCACCATCAGACAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGCTCCATGAGGAGCAGCTGAAGGTTGCACACTGCTCCGATGATTCGGCCATGTACTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAGAGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGACCATCCTTCTCCAGTAGCCGAATTGAGTATCGCAGGTCGGACGGTGGCCCCAACCGAAGCCGACCTTACGAACGTTATACCCCGACCATCATCCCAATCTCTGAAATACTTACAAACATTGAGGAGACTGAGATGGAAAAGCTCCTCAAGCGACCCGAGAAGCTCCGGGGAGACCCAGAAAAACGTAACAAAGACAAGGACTGCCGTTTTCATCGCGATCACGGCCACAATACGTCAAGTTGCTGGGAATTTAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACTTCAAAAAATTTGTGGGCAAACCAAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGTCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGGGCCCGAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGAGTTAGCTCGCGAGGCCAGACGCGAGGTATGTATCATCAGGGAGCAGAAGCCCACTTGCTCCATCAGTTTCGGCGATGCCGATCTAGAGGGGGTCCATTTGCCCCATAATGACGCGCTTGTGATCGCTCCTCTCATCGACCACGTCCTGGTCCGAAGAGTACTGATCGATGGAGGCGCATCTGCCAACATCTTGTGTCTCCCAACATATCTTGCCTTGGGATGGACCAGGTCACAGTTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATTGATCTGCCGGTAACTATCGGGCAAGATGCTACCCAGACCGACCTGGCTAGATCAGTCCTGGTCGAGATCTTGGACACTCATTCAATCTTGGAGCCAGATGTAATGGAGGTTAATACTCCATCACCTACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGTGCGAAGAGCAGCTCGGTTCACACTTCGAGAAGGAATGTTGTTTAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
Coding sequence (CDS)
ATGCTTCTATCTTTAGCACCATGGTGTTATAGACGCAAAAGATGTCTGCACACGTCTAGATGGCGTCGTGATGCTGCCTCCTCATCCAAAATGACCTTGCAGCCTGCTTCTTCAATTCTACTTCATTCTTGCCTCGTCCCGACCCGTTCTGATGTTGTCCACTCTAGTGTTCAGGTCGGAACCGGAGATCGGGTTCGAGCTCGATTCGTGGAGAACCGTTGTGCAAATTCCTGCATAAACATTTGGCGCCGTCTGTGGGGAAGACATCTTAAGTCATCCCGATCTAAAAAAAAAATATACGCAAAAATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGTGAATGCTGATAACGGCCCTCGGCGAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCAAAACCCTTAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACCAGACAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATTATATGCGCCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTATAGCCCTGGGGGCACCCGATGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGAAGGAGTGGATTACAACTTGCGGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGGTGAAACACAGGTTCGATGAGCAGGTCGAGGCACTCAAAGCCAGGTGCGAGAAGAAGGAGAGCTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCAGCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTATTTAAAGGCATCATGGATTTTCAAGCGGCAACGGATGCAATAAAATACCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGTCGGCTAGGTCGATATCAACCTATTCTCAGCTGAGAAAGGAGTTCATAAGCCAATTCTCTTCTCGGCACTACGATAGGAAAACAACGACTCACCTCGCCACCATCAGACAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGCTCCATGAGGAGCAGCTGAAGGTTGCACACTGCTCCGATGATTCGGCCATGTACTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAGAGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGACCATCCTTCTCCAGTAGCCGAATTGAGTATCGCAGGTCGGACGGTGGCCCCAACCGAAGCCGACCTTACGAACGTTATACCCCGACCATCATCCCAATCTCTGAAATACTTACAAACATTGAGGAGACTGAGATGGAAAAGCTCCTCAAGCGACCCGAGAAGCTCCGGGGAGACCCAGAAAAACGTAACAAAGACAAGGACTGCCGTTTTCATCGCGATCACGGCCACAATACGTCAAGTTGCTGGGAATTTAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACTTCAAAAAATTTGTGGGCAAACCAAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGTCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGGGCCCGAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGAGTTAGCTCGCGAGGCCAGACGCGAGGTATGTATCATCAGGGAGCAGAAGCCCACTTGCTCCATCAGTTTCGGCGATGCCGATCTAGAGGGGGTCCATTTGCCCCATAATGACGCGCTTGTGATCGCTCCTCTCATCGACCACGTCCTGGTCCGAAGAGTACTGATCGATGGAGGCGCATCTGCCAACATCTTGTGTCTCCCAACATATCTTGCCTTGGGATGGACCAGGTCACAGTTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATTGATCTGCCGGTAACTATCGGGCAAGATGCTACCCAGACCGACCTGGCTAGATCAGTCCTGGTCGAGATCTTGGACACTCATTCAATCTTGGAGCCAGATGTAATGGAGGTTAATACTCCATCACCTACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGTGCGAAGAGCAGCTCGGTTCACACTTCGAGAAGGAATGTTGTTTAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
Protein sequence
MLLSLAPWCYRRKRCLHTSRWRRDAASSSKMTLQPASSILLHSCLVPTRSDVVHSSVQVGTGDRVRARFVENRCANSCINIWRRLWGRHLKSSRSKKKIYAKMVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMRHRLRTVEEMYAEATRANRTASPSIALGAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKKETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQTDLARSVLVEILDTHSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLREGMLFKKFNPLNLRVRGAM
Homology
BLAST of Moc08g14960 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 785.0 bits (2026), Expect = 6.5e-223
Identity = 410/530 (77.36%), Postives = 447/530 (84.34%), Query Frame = 0
Query: 292 LKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDI 351
+KA+S P P VITREEFD ++ + D QVEALKA+CE+KE +DGDLGESPFTSD+
Sbjct: 2 VKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV 61
Query: 352 MEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWY 411
+EAPIPPKFK P++KPYDGSKDPKDYVEVF+ +MDFQAA+DAIK RAF+IALTGSARLWY
Sbjct: 62 LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWY 121
Query: 412 RRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKV 471
RRL A SISTYSQLR+EF++ FSSRHYD+KT THLATIRQKEGETLREYVTR EEQLKV
Sbjct: 122 RRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV 181
Query: 472 AHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK----------ETRRP--- 531
AHCSDDSAM YFLTGLADE LTVKLGEEAPATFAEVLQKAKK +T RP
Sbjct: 182 AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERK 241
Query: 532 -------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTN 591
D KSKDKG SFSS R EYRR++ GP RSRPYER+TPT IPISEILTN
Sbjct: 242 IGRGRSGKDIENADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 301
Query: 592 IEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKK 651
IEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS WE KRQIE+LIQDGYFKK
Sbjct: 302 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 361
Query: 652 FVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVC 711
FVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG KRKELAR ARREVC
Sbjct: 362 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 421
Query: 712 IIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYL 771
IIREQ+PTC I+F ADLE VHLPHNDALVIAPLIDHV+V RVL+DGG SANIL LPTYL
Sbjct: 422 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 481
Query: 772 ALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSV 795
ALGWTRSQLKKSPTPLVGFSGESV PEG IDLPVT+GQD TQ T +A V
Sbjct: 482 ALGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g14960 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 773.5 bits (1996), Expect = 2.0e-219
Identity = 416/532 (78.20%), Postives = 443/532 (83.27%), Query Frame = 0
Query: 293 KAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIM 352
KA+S Y P+ P VITREEFD +K +FD QVEALKARCEKKESSFDDGDLGE F+SDI+
Sbjct: 64 KAESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDIL 123
Query: 353 EAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYR 412
EA IPPKFKTP+MKPYDGSKDPKDYVEVF+ +MDFQAATDAIK AFQIALTGSARLWYR
Sbjct: 124 EALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYR 183
Query: 413 RLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVA 472
RL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTR EEQLKVA
Sbjct: 184 RLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVA 243
Query: 473 HCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK----------ETRRP---- 532
HCSDDSAM YFLTGLADETLTVKL EEAPATFAEVLQK KK +T RP
Sbjct: 244 HCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNI 303
Query: 533 ------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNI 592
D KS+DKGPS SSSR++YRRS+ N+SRPYE YTPT IPI EILTNI
Sbjct: 304 DQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNI 363
Query: 593 EETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKF 652
EET MEKLLKRPEKLRGDPEKRN DK CRFHRDHGHNTS+ WE KRQIEDLIQDGYFKKF
Sbjct: 364 EETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKF 423
Query: 653 VGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCI 712
VGKPRSNSVEKKEERKR RTPPRRDDRPAVI NK+KELAREARREVCI
Sbjct: 424 VGKPRSNSVEKKEERKRLRTPPRRDDRPAVI-------------NKKKELAREARREVCI 483
Query: 713 IREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLA 772
IREQ+PT SI+F ADLEGVHLPHNDALVIAPLID VLVRR+L+DGGASANIL L TYLA
Sbjct: 484 IREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLA 543
Query: 773 LGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE 798
LGWTRSQLKKSPTPLVGFSGES+S EGCIDLPV+I QD TQ T +A V+++
Sbjct: 544 LGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVID 581
BLAST of Moc08g14960 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 48.5 bits (114), Expect = 3.3e-01
Identity = 46/150 (30.67%), Postives = 69/150 (46.00%), Query Frame = 0
Query: 103 MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHP 162
MV PANS NT ++R + A++G +R++GA +VE Q + RSAR LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 163 KPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMRHRLRTVEEMYAEATRANRTA 222
KP KA + R + S +++ ++ R E + + +
Sbjct: 61 KPSKAESSYNPITPGVITREE---FDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSF 120
Query: 223 SPSI--ALGAPDEKGAPSIQPGDREPIPND 251
S I AL P K P+++P D P D
Sbjct: 121 SSDILEALIPPKFK-TPTMKPYDGSKDPKD 146
HSP 2 Score: 752.3 bits (1941), Expect = 4.7e-213
Identity = 428/651 (65.75%), Postives = 463/651 (71.12%), Query Frame = 0
Query: 229 GAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFS 288
GAP EKGAPSIQPG+REPIPNDEGVDY+LRDNDLRKHLT+KKK+AS EPEDS SYSREFS
Sbjct: 6 GAPGEKGAPSIQPGNREPIPNDEGVDYSLRDNDLRKHLTDKKKKASWEPEDSLSYSREFS 65
Query: 289 NSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFT 348
NSNLKAQSKYKPL PEAVI REEFDL+KHRFDEQVEALKARCEKKES FDD DLGESPFT
Sbjct: 66 NSNLKAQSKYKPLIPEAVINREEFDLMKHRFDEQVEALKARCEKKESPFDDDDLGESPFT 125
Query: 349 SDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSAR 408
SDIMEAPIPPKFKTP+MKPYDGSKDPKDYVEVF+G+MDFQAATDAIK AFQIALTGSAR
Sbjct: 126 SDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCLAFQIALTGSAR 185
Query: 409 LWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQ 468
LW RRL ARSISTYSQLRKEFI QFS RHYDRKT THLATIRQKE
Sbjct: 186 LWCRRLPARSISTYSQLRKEFIGQFSFRHYDRKTATHLATIRQKE--------------- 245
Query: 469 LKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK--------------- 528
DETLTVKLGEEAPATFAEVLQ AKK
Sbjct: 246 --------------------DETLTVKLGEEAPATFAEVLQNAKKVIDGQELLRTKTDRP 305
Query: 529 -----------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEI 588
+ R+ D KSKDKG S S SR EYRRS+ GP+RSRPYER
Sbjct: 306 EKQIDQKRLSQKKRKDDSKSKDKGSSSSGSRTEYRRSESGPSRSRPYER----------- 365
Query: 589 LTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGY 648
CWE KRQIEDLIQD Y
Sbjct: 366 --------------------------------------------CWELKRQIEDLIQDSY 425
Query: 649 FKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARR 708
FKKFVGKPRSNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ NKRKELA EARR
Sbjct: 426 FKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQFENKRKELACEARR 485
Query: 709 EVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLP 768
+V IIREQKPTCSI+F D DLEGVHLPHNDALVIAPLIDHVLVRRVL+DGGASANIL LP
Sbjct: 486 KVSIIREQKPTCSITFKDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLP 545
Query: 769 TYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE-ILD 828
TYLAL TRSQLKKSPTPLVGFS ESVSPEGCIDLPVTIGQD+TQ T +A V+++ L
Sbjct: 546 TYLALRGTRSQLKKSPTPLVGFSAESVSPEGCIDLPVTIGQDSTQVTQMAEFVVIDGRLA 566
Query: 829 THSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLR 852
++I E ++ P+ + ++++ N + ++K R L+
Sbjct: 606 YNAIFERPIIHSFQAVPSILHQVLKYSTPNGVGTVRGEQKTSRECYASALK 566
BLAST of Moc08g14960 vs. NCBI nr
Match:
XP_022149377.1 (uncharacterized protein LOC111017807 [Momordica charantia])
HSP 1 Score: 654.1 bits (1686), Expect = 1.7e-183
Identity = 347/427 (81.26%), Postives = 366/427 (85.71%), Query Frame = 0
Query: 372 KDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFIS 431
+DPKDYVEVF+G+MDFQAATDAIK RAFQIALTG ARLWYRRL ARSISTYSQLRKEFIS
Sbjct: 34 QDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGXARLWYRRLPARSISTYSQLRKEFIS 93
Query: 432 QFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADET 491
QF SRHYDRKT THLATIRQKE ETLREYVTR EEQLKV HCSDDSAM YFLTGLADET
Sbjct: 94 QFXSRHYDRKTATHLATIRQKEXETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADET 153
Query: 492 LTVKLGEEAPATFAEVLQKAKK--------------------------ETRRPDVKSKDK 551
LTVKLGEEAPATFAEVLQKAKK E R+ D KS+DK
Sbjct: 154 LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLIQEKRKTDSKSRDK 213
Query: 552 GPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDP 611
G S S+SR E+RR + GP+RSRPYERYTPT I ISEILTNIEE+ MEKLLK PEKLRGDP
Sbjct: 214 GSSSSASRAEHRRLESGPSRSRPYERYTPTTILISEILTNIEESGMEKLLKSPEKLRGDP 273
Query: 612 EKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR 671
EKR+KDK+CRFHRDH HNT+SCWE KRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR
Sbjct: 274 EKRSKDKNCRFHRDHDHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR 333
Query: 672 TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEG 731
TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSI+F DADLEG
Sbjct: 334 TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEG 393
Query: 732 VHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFS 773
VHLPHNDALVIAPLIDHVLV +L+DGGASANIL LPTYLALGWTR QLKKSPT + S
Sbjct: 394 VHLPHNDALVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTRLQLKKSPTRWLD-S 453
BLAST of Moc08g14960 vs. NCBI nr
Match:
XP_022149377.1 (uncharacterized protein LOC111017807 [Momordica charantia])
HSP 1 Score: 69.3 bits (168), Expect = 1.8e-07
Identity = 33/34 (97.06%), Postives = 34/34 (100.00%), Query Frame = 0
Query: 103 MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQ 137
MVHPANSANTTEQRGVNADNGP+RDLGARIVEDQ
Sbjct: 1 MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQ 34
HSP 2 Score: 631.7 bits (1628), Expect = 9.3e-177
Identity = 328/423 (77.54%), Postives = 356/423 (84.16%), Query Frame = 0
Query: 333 KESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATD 392
K+ S +DGDLGES FTSD++EAPIPPKFK P++KPYDGSKDPKDYVEVF+G+MDF AA+D
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 393 AIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQK 452
AIK RAFQIALTGSARLWYRRL ARSISTYSQLR+EF++QFSSR Y +KT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 453 EGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAK 512
EG TLREYVTR EEQLKVAHCSDDSAM YFLTGLADE LTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 513 --------------------------KETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRS 572
K+ R D KSKDKG SFSS R EYRR++ GP +S
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAESGPTKS 283
Query: 573 RPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSS 632
RPYER+TPT IPISEILTNIEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS
Sbjct: 284 RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD 343
Query: 633 CWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG 692
CWE KRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSG
Sbjct: 344 CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSG 403
Query: 693 GQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVR 730
GQSG+KRKELAR ARREVCIIREQ PTC I+F AD E VHLPHNDA VIAPLIDHV+VR
Sbjct: 404 GQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVR 463
BLAST of Moc08g14960 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 785.0 bits (2026), Expect = 3.2e-223
Identity = 410/530 (77.36%), Postives = 447/530 (84.34%), Query Frame = 0
Query: 292 LKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDI 351
+KA+S P P VITREEFD ++ + D QVEALKA+CE+KE +DGDLGESPFTSD+
Sbjct: 2 VKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV 61
Query: 352 MEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWY 411
+EAPIPPKFK P++KPYDGSKDPKDYVEVF+ +MDFQAA+DAIK RAF+IALTGSARLWY
Sbjct: 62 LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWY 121
Query: 412 RRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKV 471
RRL A SISTYSQLR+EF++ FSSRHYD+KT THLATIRQKEGETLREYVTR EEQLKV
Sbjct: 122 RRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV 181
Query: 472 AHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK----------ETRRP--- 531
AHCSDDSAM YFLTGLADE LTVKLGEEAPATFAEVLQKAKK +T RP
Sbjct: 182 AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERK 241
Query: 532 -------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTN 591
D KSKDKG SFSS R EYRR++ GP RSRPYER+TPT IPISEILTN
Sbjct: 242 IGRGRSGKDIENADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 301
Query: 592 IEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKK 651
IEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS WE KRQIE+LIQDGYFKK
Sbjct: 302 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 361
Query: 652 FVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVC 711
FVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG KRKELAR ARREVC
Sbjct: 362 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 421
Query: 712 IIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYL 771
IIREQ+PTC I+F ADLE VHLPHNDALVIAPLIDHV+V RVL+DGG SANIL LPTYL
Sbjct: 422 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 481
Query: 772 ALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSV 795
ALGWTRSQLKKSPTPLVGFSGESV PEG IDLPVT+GQD TQ T +A V
Sbjct: 482 ALGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g14960 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 773.5 bits (1996), Expect = 9.5e-220
Identity = 416/532 (78.20%), Postives = 443/532 (83.27%), Query Frame = 0
Query: 293 KAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIM 352
KA+S Y P+ P VITREEFD +K +FD QVEALKARCEKKESSFDDGDLGE F+SDI+
Sbjct: 64 KAESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDIL 123
Query: 353 EAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYR 412
EA IPPKFKTP+MKPYDGSKDPKDYVEVF+ +MDFQAATDAIK AFQIALTGSARLWYR
Sbjct: 124 EALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYR 183
Query: 413 RLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVA 472
RL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTR EEQLKVA
Sbjct: 184 RLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVA 243
Query: 473 HCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK----------ETRRP---- 532
HCSDDSAM YFLTGLADETLTVKL EEAPATFAEVLQK KK +T RP
Sbjct: 244 HCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNI 303
Query: 533 ------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNI 592
D KS+DKGPS SSSR++YRRS+ N+SRPYE YTPT IPI EILTNI
Sbjct: 304 DQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNI 363
Query: 593 EETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKF 652
EET MEKLLKRPEKLRGDPEKRN DK CRFHRDHGHNTS+ WE KRQIEDLIQDGYFKKF
Sbjct: 364 EETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKF 423
Query: 653 VGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCI 712
VGKPRSNSVEKKEERKR RTPPRRDDRPAVI NK+KELAREARREVCI
Sbjct: 424 VGKPRSNSVEKKEERKRLRTPPRRDDRPAVI-------------NKKKELAREARREVCI 483
Query: 713 IREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLA 772
IREQ+PT SI+F ADLEGVHLPHNDALVIAPLID VLVRR+L+DGGASANIL L TYLA
Sbjct: 484 IREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLA 543
Query: 773 LGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE 798
LGWTRSQLKKSPTPLVGFSGES+S EGCIDLPV+I QD TQ T +A V+++
Sbjct: 544 LGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVID 581
BLAST of Moc08g14960 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 48.5 bits (114), Expect = 1.6e-01
Identity = 46/150 (30.67%), Postives = 69/150 (46.00%), Query Frame = 0
Query: 103 MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHP 162
MV PANS NT ++R + A++G +R++GA +VE Q + RSAR LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 163 KPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMRHRLRTVEEMYAEATRANRTA 222
KP KA + R + S +++ ++ R E + + +
Sbjct: 61 KPSKAESSYNPITPGVITREE---FDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSF 120
Query: 223 SPSI--ALGAPDEKGAPSIQPGDREPIPND 251
S I AL P K P+++P D P D
Sbjct: 121 SSDILEALIPPKFK-TPTMKPYDGSKDPKD 146
HSP 2 Score: 752.3 bits (1941), Expect = 2.3e-213
Identity = 428/651 (65.75%), Postives = 463/651 (71.12%), Query Frame = 0
Query: 229 GAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFS 288
GAP EKGAPSIQPG+REPIPNDEGVDY+LRDNDLRKHLT+KKK+AS EPEDS SYSREFS
Sbjct: 6 GAPGEKGAPSIQPGNREPIPNDEGVDYSLRDNDLRKHLTDKKKKASWEPEDSLSYSREFS 65
Query: 289 NSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFT 348
NSNLKAQSKYKPL PEAVI REEFDL+KHRFDEQVEALKARCEKKES FDD DLGESPFT
Sbjct: 66 NSNLKAQSKYKPLIPEAVINREEFDLMKHRFDEQVEALKARCEKKESPFDDDDLGESPFT 125
Query: 349 SDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSAR 408
SDIMEAPIPPKFKTP+MKPYDGSKDPKDYVEVF+G+MDFQAATDAIK AFQIALTGSAR
Sbjct: 126 SDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCLAFQIALTGSAR 185
Query: 409 LWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQ 468
LW RRL ARSISTYSQLRKEFI QFS RHYDRKT THLATIRQKE
Sbjct: 186 LWCRRLPARSISTYSQLRKEFIGQFSFRHYDRKTATHLATIRQKE--------------- 245
Query: 469 LKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK--------------- 528
DETLTVKLGEEAPATFAEVLQ AKK
Sbjct: 246 --------------------DETLTVKLGEEAPATFAEVLQNAKKVIDGQELLRTKTDRP 305
Query: 529 -----------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEI 588
+ R+ D KSKDKG S S SR EYRRS+ GP+RSRPYER
Sbjct: 306 EKQIDQKRLSQKKRKDDSKSKDKGSSSSGSRTEYRRSESGPSRSRPYER----------- 365
Query: 589 LTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGY 648
CWE KRQIEDLIQD Y
Sbjct: 366 --------------------------------------------CWELKRQIEDLIQDSY 425
Query: 649 FKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARR 708
FKKFVGKPRSNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ NKRKELA EARR
Sbjct: 426 FKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQFENKRKELACEARR 485
Query: 709 EVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLP 768
+V IIREQKPTCSI+F D DLEGVHLPHNDALVIAPLIDHVLVRRVL+DGGASANIL LP
Sbjct: 486 KVSIIREQKPTCSITFKDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLP 545
Query: 769 TYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE-ILD 828
TYLAL TRSQLKKSPTPLVGFS ESVSPEGCIDLPVTIGQD+TQ T +A V+++ L
Sbjct: 546 TYLALRGTRSQLKKSPTPLVGFSAESVSPEGCIDLPVTIGQDSTQVTQMAEFVVIDGRLA 566
Query: 829 THSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLR 852
++I E ++ P+ + ++++ N + ++K R L+
Sbjct: 606 YNAIFERPIIHSFQAVPSILHQVLKYSTPNGVGTVRGEQKTSRECYASALK 566
BLAST of Moc08g14960 vs. ExPASy TrEMBL
Match:
A0A6J1D7S8 (uncharacterized protein LOC111017807 OS=Momordica charantia OX=3673 GN=LOC111017807 PE=4 SV=1)
HSP 1 Score: 654.1 bits (1686), Expect = 8.4e-184
Identity = 347/427 (81.26%), Postives = 366/427 (85.71%), Query Frame = 0
Query: 372 KDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFIS 431
+DPKDYVEVF+G+MDFQAATDAIK RAFQIALTG ARLWYRRL ARSISTYSQLRKEFIS
Sbjct: 34 QDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGXARLWYRRLPARSISTYSQLRKEFIS 93
Query: 432 QFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADET 491
QF SRHYDRKT THLATIRQKE ETLREYVTR EEQLKV HCSDDSAM YFLTGLADET
Sbjct: 94 QFXSRHYDRKTATHLATIRQKEXETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADET 153
Query: 492 LTVKLGEEAPATFAEVLQKAKK--------------------------ETRRPDVKSKDK 551
LTVKLGEEAPATFAEVLQKAKK E R+ D KS+DK
Sbjct: 154 LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLIQEKRKTDSKSRDK 213
Query: 552 GPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDP 611
G S S+SR E+RR + GP+RSRPYERYTPT I ISEILTNIEE+ MEKLLK PEKLRGDP
Sbjct: 214 GSSSSASRAEHRRLESGPSRSRPYERYTPTTILISEILTNIEESGMEKLLKSPEKLRGDP 273
Query: 612 EKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR 671
EKR+KDK+CRFHRDH HNT+SCWE KRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR
Sbjct: 274 EKRSKDKNCRFHRDHDHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR 333
Query: 672 TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEG 731
TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSI+F DADLEG
Sbjct: 334 TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFDDADLEG 393
Query: 732 VHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFS 773
VHLPHNDALVIAPLIDHVLV +L+DGGASANIL LPTYLALGWTR QLKKSPT + S
Sbjct: 394 VHLPHNDALVIAPLIDHVLV--LLVDGGASANILSLPTYLALGWTRLQLKKSPTRWLD-S 453
BLAST of Moc08g14960 vs. ExPASy TrEMBL
Match:
A0A6J1D7S8 (uncharacterized protein LOC111017807 OS=Momordica charantia OX=3673 GN=LOC111017807 PE=4 SV=1)
HSP 1 Score: 69.3 bits (168), Expect = 8.9e-08
Identity = 33/34 (97.06%), Postives = 34/34 (100.00%), Query Frame = 0
Query: 103 MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQ 137
MVHPANSANTTEQRGVNADNGP+RDLGARIVEDQ
Sbjct: 1 MVHPANSANTTEQRGVNADNGPQRDLGARIVEDQ 34
HSP 2 Score: 631.7 bits (1628), Expect = 4.5e-177
Identity = 328/423 (77.54%), Postives = 356/423 (84.16%), Query Frame = 0
Query: 333 KESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATD 392
K+ S +DGDLGES FTSD++EAPIPPKFK P++KPYDGSKDPKDYVEVF+G+MDF AA+D
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 393 AIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQK 452
AIK RAFQIALTGSARLWYRRL ARSISTYSQLR+EF++QFSSR Y +KT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 453 EGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAK 512
EG TLREYVTR EEQLKVAHCSDDSAM YFLTGLADE LTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 513 --------------------------KETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRS 572
K+ R D KSKDKG SFSS R EYRR++ GP +S
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKG-SFSSGRAEYRRAESGPTKS 283
Query: 573 RPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSS 632
RPYER+TPT IPISEILTNIEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS
Sbjct: 284 RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD 343
Query: 633 CWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG 692
CWE KRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSG
Sbjct: 344 CWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSG 403
Query: 693 GQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVR 730
GQSG+KRKELAR ARREVCIIREQ PTC I+F AD E VHLPHNDA VIAPLIDHV+VR
Sbjct: 404 GQSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVR 463
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 3.2e-223 | 77.36 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 9.5e-220 | 78.20 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DHB3 | 1.6e-01 | 30.67 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D7S8 | 8.4e-184 | 81.26 | uncharacterized protein LOC111017807 OS=Momordica charantia OX=3673 GN=LOC111017... | [more] |
A0A6J1D7S8 | 8.9e-08 | 97.06 | uncharacterized protein LOC111017807 OS=Momordica charantia OX=3673 GN=LOC111017... | [more] |
Match Name | E-value | Identity | Description | |