Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGACGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCTTCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAAGGAGATGATACTAACTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATGCGGGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGGCACACTCGCCAGAGGAGACCTTCGTGAACACGTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCACGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAGCTAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGTCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCTGAGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGATGAGTCTGGAGTGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGAGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTACCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACGAGTTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACTTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTTGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGTCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTTCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCACAAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACTTGCCGAGGAGGGAGTTTGCCGCACCCACTGAAGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGATGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGAAAATTATGACGCATCGCCTCAGCATAGATCCGTCATTCCGACCTGTAAAGCAAAAAAGAAGACCTATAAACAAGGAGCGGGGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACACCACAGCCGGGCACGAACTACTCACCTTCATGAACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATCAAGGTCATACCGCATTCATAACAGACCAAGGTCTGTACTGTTACAAGGTCATGCCCTTCGGTTTAAAGAATGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGATCTGACCGAAGCCTTCGAGGTTCTGAGGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGCTCGAGATGGAGGCACCTAAGACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAGTCCTACGGAGGAAAGGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCATTTCAGCAGTTGAAAAGCTACCTCTGTTCGGCACCTTTGCTCGCCAAGCCCATGCCGGGGGACAAGCTCCAATTGTACTTAGCAGTGTCTGATAGCGCCGTCAGCTCAGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGATCAGATACCCTCAAATGGAAAAGTTGGCTCTTGCTTTAGTCACCTCGGCCCGACGGATCAGACCATACTTCCAAGCCCATACTGTGGTGGTGCTCACTAATTTGCCCCTAAAAAACATCTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCGTGGACAATCTACGTCGACGGATCCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGCGACTCCCAGCTAGTTGTGAGCCAGATCAAGGAAGAGTACCAAGCCAAAGACTCCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAATACGCCGGGTTCCCCGAGCAGAAAATTCTAATGCTGACGCCTTAGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTGGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGACGTGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGAAGAGGGTCTGTACGTCCTCAGAGAAATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAACCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTAAGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGGTAGATATCATTGGTCCTTTCCCATTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCGCAAAGTGGGCCGAGGCCGAGGCGCTCTCCCACATAACGGAATCCAGAGTCACGTCCTTCGTATGGACAAATATCATATGTCACTTTGGTATACCGCAGGCCATTGTGACGGACAATGGGAAGCAGTTTGACAACGCTAAGTTCAAAGACTTTTGCAGCAAGCTTGGCATAAGTCACCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGGTTCTATGGTCGTACTGGACCACCCAAAGGGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGGGCATTACGAGCCTACGGCAAATGCGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGACAGAATATCAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAATGGCATAATCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCAGAGCACCTGAAGCGTTATTATCCTTGAAATGCCAAAAGGGTATTCAATGGATCTGTAAAAACTGTTTCACAAGAATTATGATCGGAATAAATGTGATGATTTAATTTCATGATTCCGAGCTCGACCAGAAATTAAATGGGGGCCACGGACTCCCACGCGATCACATTCCAGCAGTCGGTTAAAATTCAACCCTCAAAACCTAAGGGTACGAGGTGCGATGCCAAAACTACTGATGAACTTAAAATTCAAACCTTCAAGGTAAAGGGACGATGTGAAAAGTTCAAAATGATCAAGCCTCTGAACCTGAAGGTACGATGTACGATATGAAGAACAGCTACGGACTTAGGATTTTAGCCTTTTAACGTTTTTAAGTTGAGGGTGCGATGCTAAAAATCCAGAGTTGCTGCAATGCATTGAAAGCTAGACAAAGATTCAAATTCAAACTTTCAAATTATTTTAATTGAAGGCGCGACCTCAAAAGTAAGAGGTGCGAGGTGACATTGGCTGCAGTTCAAAGAAAAAGCGAAGAAAGGAAAGATGTTGCCAACAACAAAGTAAAAACAAATGGGAGCTTCTTTATTGAATGGGGAGCAGAGTCAAGGCTTATAGACCTTGCAGCTCTGCCTGACACTTACAAACAAAAAAGCAAAAGAGAAGAGAAGAAAGACAGCGAAAAGCCTCTTGAAGGGTGGTTGCCTAAGCGCCTGCTTGAGGAGCGCCCTCTTGAGTGGTGCCGACCTGATCCTCTTCGAGGTCGGAGTAGTCAGAGTCCAGATCTCTGA
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGACGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCTTCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGGCACACTCGCCAGAGGAGACCTTCGTGAACACGTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCACGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAGCTAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGTCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGATGAGTCTGGAGTGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGAGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTACCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACGAGTTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACTTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTTGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGTCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTTCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCACAAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACTTGCCGAGGAGGGAGTTTGCCGCACCCACTGAAGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTGGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGACGTGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGAAGAGGTTCAAAGAAAAAGCGAAGAAAGGAAAGATGTTGCCAACAACAAAGTAAAAACAAATGGGAGCTTCTTTATTGAATGGGGAGCAGAGTCAAGGCTTATAGACCTTGCAGCTCTGCCTGACACTTACAAACAAAAAAGCAAAAGAGAAGAGAAGAAAGACAGCGAAAAGCCTCTTGAAGGGTGGTTGCCTAAGCGCCTGCTTGAGGAGCGCCCTCTTGAGTGGTGCCGACCTGATCCTCTTCGAGGTCGGAGTAGTCAGAGTCCAGATCTCTGA
Coding sequence (CDS)
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGACGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCTTCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGGCACACTCGCCAGAGGAGACCTTCGTGAACACGTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGACGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCACGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAGCTAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGTCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGATGAGTCTGGAGTGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGAGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTACCGCTTCCATCGGGAGCACGACCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACGAGTTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACTTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTTGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGTCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTTCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCACAAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACTTGCCGAGGAGGGAGTTTGCCGCACCCACTGAAGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTGGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGACGTGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGAAGAGGTTCAAAGAAAAAGCGAAGAAAGGAAAGATGTTGCCAACAACAAAGTAAAAACAAATGGGAGCTTCTTTATTGAATGGGGAGCAGAGTCAAGGCTTATAGACCTTGCAGCTCTGCCTGACACTTACAAACAAAAAAGCAAAAGAGAAGAGAAGAAAGACAGCGAAAAGCCTCTTGAAGGGTGGTTGCCTAAGCGCCTGCTTGAGGAGCGCCCTCTTGAGTGGTGCCGACCTGATCCTCTTCGAGGTCGGAGTAGTCAGAGTCCAGATCTCTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREDGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPASAPPSENFDALQREMEAMRTQMRSMEEIKGVPTSAQSRRNIPKTTRARGTLARGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGIITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKTDPKSRPYERFTPTTIPISEILTNIDESGVEKLLKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQLKKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIDAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEVQRKSEERKDVANNKVKTNGSFFIEWGAESRLIDLAALPDTYKQKSKREEKKDSEKPLEGWLPKRLLEERPLEWCRPDPLRGRSSQSPDL
Homology
BLAST of Moc08g38160 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 926.4 bits (2393), Expect = 2.0e-265
Identity = 513/765 (67.06%), Postives = 551/765 (72.03%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREDGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQRE GA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPASAPPSENFDALQREMEAMRTQMRSMEEIKGVPTSAQSRR 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 NIPKTTRARGTLARGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGIIT 180
AESS NP TP G+IT
Sbjct: 121 ---------------------------------------------AESSYNPITP-GVIT 180
Query: 181 REEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPY 240
REEFDQL+ + DAQVEALKA+CE+KE +D DLGE F+SD+LEA IPPKFK PT+KPY
Sbjct: 181 REEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKPY 240
Query: 241 DGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRRE 300
DGSKDPKDYVEVFE L DFQAA+DAIKC AFQIALT SARLWYRRLPAR ISTYSQLR+E
Sbjct: 241 DGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRKE 300
Query: 301 FLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA 360
F+ QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLA
Sbjct: 301 FISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGLA 360
Query: 361 DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKTDPK- 420
DE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD K D K
Sbjct: 361 DETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSKS 420
Query: 421 -----------------------SRPYERFTPTTIPISEILTNIDESGVEKLLKRPEKLR 480
SRPYE +TPTTIPI EILTNI+E+G+EKLLKRPEKLR
Sbjct: 421 RDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPEKLR 480
Query: 481 GAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK 540
G PE+R+ DKY RFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK
Sbjct: 481 GDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERK 540
Query: 541 RSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCIIREQRPTCSITFDGAD 600
R RTPPRR DR AVIN K+KELA ARREVCIIREQRPT SI F+ AD
Sbjct: 541 RLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFNHAD 600
Query: 601 LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQLKKSLTPLV 660
LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYL LGWTRSQLKKS TPLV
Sbjct: 601 LEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPTPLV 650
Query: 661 GFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTL 720
GFSGES+ EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTL
Sbjct: 661 GFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTL 650
Query: 721 HQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETLTSRD 742
HQVLKYST NGVGTVRGE SRECYA K SSVCALE T RD
Sbjct: 721 HQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALEEQTIRD 650
BLAST of Moc08g38160 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 921.4 bits (2380), Expect = 6.4e-264
Identity = 500/631 (79.24%), Postives = 513/631 (81.30%), Query Frame = 0
Query: 161 SSNQQAESSRNPATPAGIITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFT 220
SSNQQAESS NPATP G+ITREEFDQLRG+L+AQVEALKAKCEQKEGPLND DLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 221 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSAR 280
SDVLE APTVK YDGSKDPKDYVEVFEGL DFQAASDAIKCRAFQIALT SAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 281 LWYRRLPARSISTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 340
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 341 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 400
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 401 ERKIGRGRSGKDIEKTDPK-----------------------SRPYERFTPTTIPISEIL 460
ER I RGRSGKD EK D K SRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 461 TNIDESGVEKLLKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYF 520
TNI+ESG+EKLLKRPEKLRGAPERR+KDKY RFHREHDHNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 521 KKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARRE 580
KKFVGKPRTSSAEKKEERK SRTP RR DR AVINTIFGGPSGGQSGHKRKELA AARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 581 VCIIREQRPTCSITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 640
VCIIREQRPTC ITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 641 YLVLGWTRSQLKKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAY 700
YL LGWTRSQLKKS TPLVGFS ESV+PEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 701 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETL 760
NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA ALKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 761 TSRDGTLEFEADLPRREFAAPTEELELVPLL 769
SRDGTLEF+A+LPRREFAAPTEELELVPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc08g38160 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 919.5 bits (2375), Expect = 2.4e-263
Identity = 476/528 (90.15%), Postives = 484/528 (91.67%), Query Frame = 0
Query: 165 QAESSRNPATPAGIITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFTSDVL 224
+AESSRNPATPAG+ITREEFDQLRGQLDAQVEALKAKCEQKEGPLND DLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 225 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSARLWYR 284
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFE L DFQAASDAIKCRAF+IALT SARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 285 RLPARSISTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 344
RLPA SISTYSQLRREFL FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 345 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 404
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 405 GRGRSGKDIEKTDPK-----------------------SRPYERFTPTTIPISEILTNID 464
GRGRSGKDIE DPK SRPYERFTPTTIPISEILTNI+
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 465 ESGVEKLLKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFV 524
ESG+EKLLKRPEKLRGAPERRSKDKY RFHREH HNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 525 GKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCII 584
GKPRTSSAEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSG KRKELA AARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 585 REQRPTCSITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVL 644
REQRPTC ITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYL L
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 645 GWTRSQLKKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFV 670
GWTRSQLKKS TPLVGFSGESV+PEG IDLPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g38160 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 745.3 bits (1923), Expect = 6.3e-211
Identity = 390/446 (87.44%), Postives = 398/446 (89.24%), Query Frame = 0
Query: 352 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 411
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 412 DIEKTDPK-----------------------SRPYERFTPTTIPISEILTNIDESGVEKL 471
D+E TDPK SRPYERFTPTTIPISEILTNI+ESG+EKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 472 LKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 531
LKRPEKLRGAPERRSKDKY RFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 532 AEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCIIREQRPTC 591
AEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSGHKRK+LA AARREVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 592 SITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQL 651
ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL LGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 652 KKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 711
KKS TPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 712 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETLTSRDGTLEFEA 771
FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA LKG+SVCALETLTSRDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 772 DLPRREFAAPTEELELVPLLSPEKQL 775
DLP REFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc08g38160 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 738.4 bits (1905), Expect = 7.7e-209
Identity = 394/543 (72.56%), Postives = 428/543 (78.82%), Query Frame = 0
Query: 258 DFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLVQFSSRHYDKKTATH 317
DFQAA+DAIKCRAFQIALT SARLWYRRLPARSISTYSQLR+EF+ QFSS HYD+KTATH
Sbjct: 2 DFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTATH 61
Query: 318 LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA 377
LATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 62 LATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTFV 121
Query: 378 EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKTDPK------------------ 437
EVLQKAKKVIDGQELLRTKTGRPE++I + + ++ K D K
Sbjct: 122 EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRL 181
Query: 438 ------SRPYERFTPTTIPISEILTNIDESGVEKLLKRPEKLRGAPERRSKDKYYRFHRE 497
SRPYER+T +TIPISEILTNI+ESG+EKLLKRPEKLRG E+R+K+KY RFHR+
Sbjct: 182 ESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHRD 241
Query: 498 HDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINT 557
H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DR AVINT
Sbjct: 242 HGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINT 301
Query: 558 IFGGPSGGQSGHKRKELACAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIAPL 617
IFGGP+GGQSG+KRKELA ARREVCIIRE +PTCSITF ADLE VHLPHNDALVIA L
Sbjct: 302 IFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIASL 361
Query: 618 IDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQLKKSLTPLVGFSGESVVPEGCIDLPV 677
IDH +VRRVL+DG GCIDLPV
Sbjct: 362 IDHDLVRRVLIDG----------------------------------------GCIDLPV 421
Query: 678 TLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRG 737
T+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRG
Sbjct: 422 TIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVRG 481
Query: 738 EQTASRECYAFALKGSSVCALETLTSRDGTLEFEADLP---RREFAAPTEELELVPLLSP 774
EQ SRECYA ALKGS+VCALE T+R E EADLP +R+F PTEELELVPLLSP
Sbjct: 482 EQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLSP 504
BLAST of Moc08g38160 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 926.4 bits (2393), Expect = 9.7e-266
Identity = 513/765 (67.06%), Postives = 551/765 (72.03%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREDGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQRE GA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPASAPPSENFDALQREMEAMRTQMRSMEEIKGVPTSAQSRR 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 NIPKTTRARGTLARGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGIIT 180
AESS NP TP G+IT
Sbjct: 121 ---------------------------------------------AESSYNPITP-GVIT 180
Query: 181 REEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFTSDVLEAPIPPKFKAPTVKPY 240
REEFDQL+ + DAQVEALKA+CE+KE +D DLGE F+SD+LEA IPPKFK PT+KPY
Sbjct: 181 REEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKPY 240
Query: 241 DGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRRE 300
DGSKDPKDYVEVFE L DFQAA+DAIKC AFQIALT SARLWYRRLPAR ISTYSQLR+E
Sbjct: 241 DGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRKE 300
Query: 301 FLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA 360
F+ QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLA
Sbjct: 301 FISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGLA 360
Query: 361 DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKTDPK- 420
DE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD K D K
Sbjct: 361 DETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSKS 420
Query: 421 -----------------------SRPYERFTPTTIPISEILTNIDESGVEKLLKRPEKLR 480
SRPYE +TPTTIPI EILTNI+E+G+EKLLKRPEKLR
Sbjct: 421 RDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPEKLR 480
Query: 481 GAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERK 540
G PE+R+ DKY RFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK
Sbjct: 481 GDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERK 540
Query: 541 RSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCIIREQRPTCSITFDGAD 600
R RTPPRR DR AVIN K+KELA ARREVCIIREQRPT SI F+ AD
Sbjct: 541 RLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFNHAD 600
Query: 601 LEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQLKKSLTPLV 660
LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYL LGWTRSQLKKS TPLV
Sbjct: 601 LEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPTPLV 650
Query: 661 GFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTL 720
GFSGES+ EGCIDLPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTL
Sbjct: 661 GFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTL 650
Query: 721 HQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETLTSRD 742
HQVLKYST NGVGTVRGE SRECYA K SSVCALE T RD
Sbjct: 721 HQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALEEQTIRD 650
BLAST of Moc08g38160 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 921.4 bits (2380), Expect = 3.1e-264
Identity = 500/631 (79.24%), Postives = 513/631 (81.30%), Query Frame = 0
Query: 161 SSNQQAESSRNPATPAGIITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFT 220
SSNQQAESS NPATP G+ITREEFDQLRG+L+AQVEALKAKCEQKEGPLND DLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 221 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSAR 280
SDVLE APTVK YDGSKDPKDYVEVFEGL DFQAASDAIKCRAFQIALT SAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 281 LWYRRLPARSISTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 340
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 341 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 400
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 401 ERKIGRGRSGKDIEKTDPK-----------------------SRPYERFTPTTIPISEIL 460
ER I RGRSGKD EK D K SRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 461 TNIDESGVEKLLKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYF 520
TNI+ESG+EKLLKRPEKLRGAPERR+KDKY RFHREHDHNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 521 KKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARRE 580
KKFVGKPRTSSAEKKEERK SRTP RR DR AVINTIFGGPSGGQSGHKRKELA AARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 581 VCIIREQRPTCSITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 640
VCIIREQRPTC ITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 641 YLVLGWTRSQLKKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAY 700
YL LGWTRSQLKKS TPLVGFS ESV+PEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 701 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETL 760
NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA ALKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 761 TSRDGTLEFEADLPRREFAAPTEELELVPLL 769
SRDGTLEF+A+LPRREFAAPTEELELVPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc08g38160 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 919.5 bits (2375), Expect = 1.2e-263
Identity = 476/528 (90.15%), Postives = 484/528 (91.67%), Query Frame = 0
Query: 165 QAESSRNPATPAGIITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDDDLGESPFTSDVL 224
+AESSRNPATPAG+ITREEFDQLRGQLDAQVEALKAKCEQKEGPLND DLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 225 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLTDFQAASDAIKCRAFQIALTDSARLWYR 284
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFE L DFQAASDAIKCRAF+IALT SARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 285 RLPARSISTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 344
RLPA SISTYSQLRREFL FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 345 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 404
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 405 GRGRSGKDIEKTDPK-----------------------SRPYERFTPTTIPISEILTNID 464
GRGRSGKDIE DPK SRPYERFTPTTIPISEILTNI+
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 465 ESGVEKLLKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFV 524
ESG+EKLLKRPEKLRGAPERRSKDKY RFHREH HNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 525 GKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCII 584
GKPRTSSAEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSG KRKELA AARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 585 REQRPTCSITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVL 644
REQRPTC ITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYL L
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 645 GWTRSQLKKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFV 670
GWTRSQLKKS TPLVGFSGESV+PEG IDLPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g38160 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 745.3 bits (1923), Expect = 3.1e-211
Identity = 390/446 (87.44%), Postives = 398/446 (89.24%), Query Frame = 0
Query: 352 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 411
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 412 DIEKTDPK-----------------------SRPYERFTPTTIPISEILTNIDESGVEKL 471
D+E TDPK SRPYERFTPTTIPISEILTNI+ESG+EKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 472 LKRPEKLRGAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 531
LKRPEKLRGAPERRSKDKY RFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 532 AEKKEERKRSRTPPRRTDRLAVINTIFGGPSGGQSGHKRKELACAARREVCIIREQRPTC 591
AEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSGHKRK+LA AARREVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 592 SITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQL 651
ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL LGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 652 KKSLTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 711
KKS TPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 712 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAFALKGSSVCALETLTSRDGTLEFEA 771
FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA LKG+SVCALETLTSRDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 772 DLPRREFAAPTEELELVPLLSPEKQL 775
DLP REFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc08g38160 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 738.4 bits (1905), Expect = 3.7e-209
Identity = 394/543 (72.56%), Postives = 428/543 (78.82%), Query Frame = 0
Query: 258 DFQAASDAIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLVQFSSRHYDKKTATH 317
DFQAA+DAIKCRAFQIALT SARLWYRRLPARSISTYSQLR+EF+ QFSS HYD+KTATH
Sbjct: 2 DFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTATH 61
Query: 318 LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA 377
LATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 62 LATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTFV 121
Query: 378 EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKTDPK------------------ 437
EVLQKAKKVIDGQELLRTKTGRPE++I + + ++ K D K
Sbjct: 122 EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRRL 181
Query: 438 ------SRPYERFTPTTIPISEILTNIDESGVEKLLKRPEKLRGAPERRSKDKYYRFHRE 497
SRPYER+T +TIPISEILTNI+ESG+EKLLKRPEKLRG E+R+K+KY RFHR+
Sbjct: 182 ESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHRD 241
Query: 498 HDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINT 557
H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DR AVINT
Sbjct: 242 HGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINT 301
Query: 558 IFGGPSGGQSGHKRKELACAARREVCIIREQRPTCSITFDGADLEEVHLPHNDALVIAPL 617
IFGGP+GGQSG+KRKELA ARREVCIIRE +PTCSITF ADLE VHLPHNDALVIA L
Sbjct: 302 IFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIASL 361
Query: 618 IDHVVVRRVLVDGGASANILSLPTYLVLGWTRSQLKKSLTPLVGFSGESVVPEGCIDLPV 677
IDH +VRRVL+DG GCIDLPV
Sbjct: 362 IDHDLVRRVLIDG----------------------------------------GCIDLPV 421
Query: 678 TLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRG 737
T+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRG
Sbjct: 422 TIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVRG 481
Query: 738 EQTASRECYAFALKGSSVCALETLTSRDGTLEFEADLP---RREFAAPTEELELVPLLSP 774
EQ SRECYA ALKGS+VCALE T+R E EADLP +R+F PTEELELVPLLSP
Sbjct: 482 EQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLSP 504
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DHB3 | 9.7e-266 | 67.06 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9E1 | 3.1e-264 | 79.24 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 1.2e-263 | 90.15 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DD03 | 3.1e-211 | 87.44 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1DZB9 | 3.7e-209 | 72.56 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |