Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATACCCACCAGAGGGAGGGCGGAGCAGCAACGGTAGAGGGGCAAGGTCACGACAGCCTAGTAACGGAACCCCTCCGCAAGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCACCCACGGATCCAACAAGCGAGAACCTGGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCTCTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAGCGATGGCGACTTGGGATAATCGCCTTTCACCTCGGACGTGTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTACTATTTCCTCACCAGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCACCGAGGTACTCTAGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACTTGAACGAAAGATCGGCCGGGGCAGCAGCGGAAAAGATGAAAGGACAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAAAAGACGCAACAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAATACGTCGGACTGCGGGGAATTGAAGCGTCAAATTGAGGATCTAATTTAAGACAGCTACTTCAAGAAGTTCGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCACCGACCGACCTACGGTCATCAATACCATTTTTGGAGGGCCAAGCGGAGGTCAATCCGGACATAAGAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGTCATCCCAGAGGCTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCCCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTATGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAAGAGCTAATCCACTTCCTCAGATCCAACTCGGACATCTTTGCGTGGTCCCATGAGGACATGCCTGGCATAGACCCGCGGATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTAAAACAAAAAAGAAGACCTATGAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTGAATACATAAGAGAAATTTTGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAATTTAAATAAGACATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCACGAATTGCTCACCTTCATGGACGCCTATTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATTCCCTTCGGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGTCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACTTGACCGAAGCCTTCGAGGTCTTGAGGGCATATCAAATGAAGCTCAACCCTGCTAAATGTGCCTTTGGGGTCACCTCGGGAAAATTCCTTGGCTTCATGGTAAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCGAAAACGCTGAAGCAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGGTCGACAGACAAGTGCCTCCCTTTCTTCAAGGTCTTACGAAAGAAGGGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCGGCACCCTTGCTTGCCAAGCCCATACCGGGGGACAAGCTCCAATTGTACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGATCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGATACCCTCAAATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGGTGGTGGTGCTCACTGACTTGCCCCTAAAAAGCATCTTCCATAAGCCAGAAGCTTCTGGTTGCCTAATGAAGTGGGCAATGGAGTTAAGTGAGTACGACATCCAGTTTGAACCCAGAACTGCGTTGAAAGGACAAGCCGCGGCAGATTTCATAGCCGAGCTCACACCACCCTCCGAGCTGCGGGAGTCTGACCTACCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGTGAGCGATTTGAGTATGCCTTGCGGTTCAGCTTCCGGACTTCTAACAACGAGGCAGAGTATGAAGCATTTATTGCCGGCCTGCGACTCGCTCGAGCATTGGGGGCCTCTTATGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGACAAGGTCAGATCATACCTCGCCCAGTTTCGAACTTACGAGGTAAGCCGGATTCCGCGAGTAGAAAATTCTAATGCTGACGCTTGGCCAAGCTAGCATCGGCGTACGAGACCGACCTTGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCGGGGCCAGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATCGCGGACATCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGTCGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGGGTGTGCGGCAATCACTCGGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATATTATTGGCCGACCCTCAGCCAGGACGCCAGGAAGTTCGTTGGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGAGGGTAGATATCATTGGTCCTTTCCCCTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAGGCGCTCTCCCACATAACGGAATCCAGGGTCACGTCCTTCGTATGGACGAATATCATATGTCGCTTTGGTATACCGCAGGCCATAGTGACAGATAATAGGAAGCAGTTTGACAACGTCAAGTTCAAAGACTTTTGCAGAAAACTTGGCATAAGTCACCTCATCTCGTCCCCCGCACATCCGCAAGCAAATGGACAGGTGGAGGCGGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGGTTCTATGGTCGTACCGGACCACCCAACGAGAGTCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGACATGCCATCTGATAGAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGGGCATGGCCCAGCTACGCCTGGCAAAATATCAGGGTAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGATTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATACCCACCAGAGGGAGGGCGGAGCAGCAACGGTAGAGGGGCAAGGTCACGACAGCCTAGTAACGGAACCCCTCCGCAAGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCACCCACGGATCCAACAAGCGAGAACCTGGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCTCTAAAGGCCAAATCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTACTATTTCCTCACCAGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCACCGAGATCGGCCGGGGCAGCAGCGGAAAAGATGAAAGGACAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAAAAGACGCAACAAGGACAATTCGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCACCGACCGACCTACGGTCATCAATACCATTTTTGGAGGGCCAAGCGGAGGTCAATCCGGACATAAGAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGTCATCCCAGAGGCTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCCCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTTGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCGGGGCCAGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATCGCGGACATCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGTCGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAAAGAAGGGCATGGCCCAGCTACGCCTGGCAAAATATCAGGGTAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGATTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATACCCACCAGAGGGAGGGCGGAGCAGCAACGGTAGAGGGGCAAGGTCACGACAGCCTAGTAACGGAACCCCTCCGCAAGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCACCCACGGATCCAACAAGCGAGAACCTGGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGTTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCTCTAAAGGCCAAATCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTACTATTTCCTCACCAGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCACCGAGATCGGCCGGGGCAGCAGCGGAAAAGATGAAAGGACAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAAAAGACGCAACAAGGACAATTCGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCACCGACCGACCTACGGTCATCAATACCATTTTTGGAGGGCCAAGCGGAGGTCAATCCGGACATAAGAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGTCATCCCAGAGGCTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCCCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTTGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCGGGGCCAGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATCGCGGACATCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGTCGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAAAGAAGGGCATGGCCCAGCTACGCCTGGCAAAATATCAGGGTAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MIQPANSTNTTDRRTLAASDTHQREGGAATVEGQGHDSLVTEPLRKSARITAPALPPAHPRTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMRTQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKSPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMYYFLTSLADEALTVKLGEEAPATFTEIGRGSSGKDERTDPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPEKTQQGQFVGKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYAAALKGPSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISGPDLMEIGAPESSWMDPIADIIKGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLRCLTPEEGLKKGMAQLRLAKYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYMLADLKGDVLAHPWNAEHLKRYYP
Homology
BLAST of Moc04g24550 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 753.8 bits (1945), Expect = 1.7e-213
Identity = 430/622 (69.13%), Postives = 452/622 (72.67%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAK---------------SPIP 246
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAK SP
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 PK-FKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA 306
+APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 62 SDVLEAPTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW------ 121
Query: 307 RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSD 366
FQE+QLKVA SD
Sbjct: 122 -----------------------------------------------FQEDQLKVAQSSD 181
Query: 367 DSAMYYFLTSLADEALTVKLGEEAPATFTE-------------------------IGRGS 426
DSAM YFLT LADEALTVKLG+EAPATF E I RG
Sbjct: 182 DSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGR 241
Query: 427 SGKDERTDPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGME 486
SGKDE+ D KSKDKGSFSSGR E+RR +GPTRSRPYERFTPTTIPISEILTNIE+SGME
Sbjct: 242 SGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGME 301
Query: 487 KLLKRPEKLRGAPEKTQQG---------------------------------QFVGKPRT 546
KLLKRPEKLRGAPE+ + +FVGKPRT
Sbjct: 302 KLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRT 361
Query: 547 SSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP 606
SSAEKKEERK SRTP RR DRP VINTIFGGPSGGQSGHKRKELARAARREVCIIREQ P
Sbjct: 362 SSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRP 421
Query: 607 TCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRS 666
TCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRS
Sbjct: 422 TCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRS 481
Query: 667 QLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPII 726
QLK+S TPLVGFS ESVIPE CIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPII
Sbjct: 482 QLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPII 541
Query: 727 HSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYAAALKGPSVCALETL--RDGTLEF 730
HSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA+ALKG SVCALETL RDGTLEF
Sbjct: 542 HSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLVSRDGTLEF 570
BLAST of Moc04g24550 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 740.0 bits (1909), Expect = 2.5e-209
Identity = 408/528 (77.27%), Postives = 421/528 (79.73%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAK----------------------- 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAK
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 -SPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
+PIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMYYFLTSLADEALTVKLGEEAPATFTE-------------------------I 430
HCSDDSAM YFLT LADEALTVKLGEEAPATF E I
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGSSGKD-ERTDPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIE 490
GRG SGKD E DPKSKDKGSFSSGR EYRR E+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPEK------------------------------TQQG---QFV 550
+SGMEKLLKRPEKLRGAPE+ Q G +FV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKKEERKRSRTPPRRTDRP VINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 633
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
BLAST of Moc04g24550 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 739.2 bits (1907), Expect = 4.3e-209
Identity = 447/790 (56.58%), Postives = 490/790 (62.03%), Query Frame = 0
Query: 1 MIQPANSTNTTDRRTLAASDTHQREGGAATVEGQGHDSLVTEPLRKSARITAPALPPAHP 60
M+QPANSTNT DRR LAA+ HQRE GA VEGQGH+ L TEPL +SARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMRTQMRSMEEMYNEMMLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAK-------------- 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ----------SPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
+ IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMYYFLTSLADEALTVKLGEEAPATFTE----------------- 420
F EEQLKVAHCSDDSAM YFLT LADE LTVKL EEAPATF E
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 --------IGRGSSGKDE-RTDPKSKDKG-SFSSGRPEYRRVESGPTRSRPYERFTPTTI 480
I +G +GKD+ + D KS+DKG S SS R +YRR S +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPEK------------------------------ 540
PI EILTNIE++GMEKLLKRPEKLRG PEK
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 TQQG---QFVGKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELA 600
Q G +FVGKPR++S EKKEERKR RTPPRR DRP VIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVI 703
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ E CIDLPV++ QD T+VTQMAEFVVI
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
BLAST of Moc04g24550 vs. NCBI nr
Match:
XP_022157676.1 (uncharacterized protein LOC111024332 [Momordica charantia])
HSP 1 Score: 664.8 bits (1714), Expect = 1.0e-186
Identity = 347/402 (86.32%), Postives = 353/402 (87.81%), Query Frame = 0
Query: 358 LADEALTVKLGEEAPATFTE-------------------------IGRGSSGKDERTDPK 417
+ADEALTVKLGEEAPATF E IGRG SGKDER DPK
Sbjct: 1 MADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPK 60
Query: 418 SKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 477
SKDKGSFSSGR EYRR E+GPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR
Sbjct: 61 SKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 120
Query: 478 GAPEKTQQGQFVGKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKE 537
GAPE+ K +TSSAEKKEERKRSRTPPRRTDRP VINTIFGGPSGGQSGHKRKE
Sbjct: 121 GAPERR------SKDKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE 180
Query: 538 LARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS 597
LAR ARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS
Sbjct: 181 LAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS 240
Query: 598 ANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFV 657
ANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPE CIDLPVTLGQD+TRVTQM EFV
Sbjct: 241 ANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMTEFV 300
Query: 658 VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYAAALKGP 717
V+DGRS YNAIFGRPIIHSFR IPSTLHQVLKY TPNGVGTVRGEQT SRECYAAALKG
Sbjct: 301 VVDGRSTYNAIFGRPIIHSFRXIPSTLHQVLKYSTPNGVGTVRGEQTVSRECYAAALKGS 360
Query: 718 SVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ 735
SVCALETLRDGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Sbjct: 361 SVCALETLRDGTLELEADLPRKEFAAPTEELELVPLLSPEKQ 396
BLAST of Moc04g24550 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 650.2 bits (1676), Expect = 2.6e-182
Identity = 351/441 (79.59%), Postives = 365/441 (82.77%), Query Frame = 0
Query: 351 MYYFLTSLADEALTVKLGEEAPATF------------------TEIGRGSSGKD-ERTDP 410
M YFLT LADEALTVKL EEAPATF T+IG+G SGKD E TDP
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTKIGQGRSGKDMENTDP 60
Query: 411 KSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKL 470
KSKDKGSFS+GR EYRR E+GPTRSRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKL
Sbjct: 61 KSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKL 120
Query: 471 RGAPEK------------------------------TQQG---QFVGKPRTSSAEKKEER 530
RGAPE+ Q G +FVGKPRTSSAEKKEER
Sbjct: 121 RGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSSAEKKEER 180
Query: 531 KRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA 590
KRSRTPPRRTDRP VINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD A
Sbjct: 181 KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTCPITFDXA 240
Query: 591 DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPL 650
DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL
Sbjct: 241 DLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL 300
Query: 651 VGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPST 710
VGFSGESV+PE CIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPST
Sbjct: 301 VGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPST 360
Query: 711 LHQVLKYPTPNGVGTVRGEQTASRECYAAALKGPSVCALETL--RDGTLEFEADLPRKEF 738
LHQVLKY TPNGVGTVRGEQTASRECYA+ LKG SVCALETL RDGTLEFEADLP +EF
Sbjct: 361 LHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEADLPXREF 420
BLAST of Moc04g24550 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 753.8 bits (1945), Expect = 8.2e-214
Identity = 430/622 (69.13%), Postives = 452/622 (72.67%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAK---------------SPIP 246
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAK SP
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 PK-FKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA 306
+APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 62 SDVLEAPTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW------ 121
Query: 307 RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSD 366
FQE+QLKVA SD
Sbjct: 122 -----------------------------------------------FQEDQLKVAQSSD 181
Query: 367 DSAMYYFLTSLADEALTVKLGEEAPATFTE-------------------------IGRGS 426
DSAM YFLT LADEALTVKLG+EAPATF E I RG
Sbjct: 182 DSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGR 241
Query: 427 SGKDERTDPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGME 486
SGKDE+ D KSKDKGSFSSGR E+RR +GPTRSRPYERFTPTTIPISEILTNIE+SGME
Sbjct: 242 SGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGME 301
Query: 487 KLLKRPEKLRGAPEKTQQG---------------------------------QFVGKPRT 546
KLLKRPEKLRGAPE+ + +FVGKPRT
Sbjct: 302 KLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRT 361
Query: 547 SSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGP 606
SSAEKKEERK SRTP RR DRP VINTIFGGPSGGQSGHKRKELARAARREVCIIREQ P
Sbjct: 362 SSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRP 421
Query: 607 TCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRS 666
TCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRS
Sbjct: 422 TCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRS 481
Query: 667 QLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPII 726
QLK+S TPLVGFS ESVIPE CIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPII
Sbjct: 482 QLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPII 541
Query: 727 HSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYAAALKGPSVCALETL--RDGTLEF 730
HSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA+ALKG SVCALETL RDGTLEF
Sbjct: 542 HSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLVSRDGTLEF 570
BLAST of Moc04g24550 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 740.0 bits (1909), Expect = 1.2e-209
Identity = 408/528 (77.27%), Postives = 421/528 (79.73%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAK----------------------- 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAK
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 -SPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
+PIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMYYFLTSLADEALTVKLGEEAPATFTE-------------------------I 430
HCSDDSAM YFLT LADEALTVKLGEEAPATF E I
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGSSGKD-ERTDPKSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIE 490
GRG SGKD E DPKSKDKGSFSSGR EYRR E+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPEK------------------------------TQQG---QFV 550
+SGMEKLLKRPEKLRGAPE+ Q G +FV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKKEERKRSRTPPRRTDRP VINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 633
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
BLAST of Moc04g24550 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 739.2 bits (1907), Expect = 2.1e-209
Identity = 447/790 (56.58%), Postives = 490/790 (62.03%), Query Frame = 0
Query: 1 MIQPANSTNTTDRRTLAASDTHQREGGAATVEGQGHDSLVTEPLRKSARITAPALPPAHP 60
M+QPANSTNT DRR LAA+ HQRE GA VEGQGH+ L TEPL +SARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPPTDPTSENLDALKREMEAMRTQMRSMEEMYNEMMLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAK-------------- 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ----------SPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
+ IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMYYFLTSLADEALTVKLGEEAPATFTE----------------- 420
F EEQLKVAHCSDDSAM YFLT LADE LTVKL EEAPATF E
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 --------IGRGSSGKDE-RTDPKSKDKG-SFSSGRPEYRRVESGPTRSRPYERFTPTTI 480
I +G +GKD+ + D KS+DKG S SS R +YRR S +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPEK------------------------------ 540
PI EILTNIE++GMEKLLKRPEKLRG PEK
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 TQQG---QFVGKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELA 600
Q G +FVGKPR++S EKKEERKR RTPPRR DRP VIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVI 703
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ E CIDLPV++ QD T+VTQMAEFVVI
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
BLAST of Moc04g24550 vs. ExPASy TrEMBL
Match:
A0A6J1DYW5 (uncharacterized protein LOC111024332 OS=Momordica charantia OX=3673 GN=LOC111024332 PE=4 SV=1)
HSP 1 Score: 664.8 bits (1714), Expect = 5.0e-187
Identity = 347/402 (86.32%), Postives = 353/402 (87.81%), Query Frame = 0
Query: 358 LADEALTVKLGEEAPATFTE-------------------------IGRGSSGKDERTDPK 417
+ADEALTVKLGEEAPATF E IGRG SGKDER DPK
Sbjct: 1 MADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPK 60
Query: 418 SKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 477
SKDKGSFSSGR EYRR E+GPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR
Sbjct: 61 SKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLR 120
Query: 478 GAPEKTQQGQFVGKPRTSSAEKKEERKRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKE 537
GAPE+ K +TSSAEKKEERKRSRTPPRRTDRP VINTIFGGPSGGQSGHKRKE
Sbjct: 121 GAPERR------SKDKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE 180
Query: 538 LARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS 597
LAR ARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS
Sbjct: 181 LAREARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS 240
Query: 598 ANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFV 657
ANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPE CIDLPVTLGQD+TRVTQM EFV
Sbjct: 241 ANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMTEFV 300
Query: 658 VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYAAALKGP 717
V+DGRS YNAIFGRPIIHSFR IPSTLHQVLKY TPNGVGTVRGEQT SRECYAAALKG
Sbjct: 301 VVDGRSTYNAIFGRPIIHSFRXIPSTLHQVLKYSTPNGVGTVRGEQTVSRECYAAALKGS 360
Query: 718 SVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ 735
SVCALETLRDGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Sbjct: 361 SVCALETLRDGTLELEADLPRKEFAAPTEELELVPLLSPEKQ 396
BLAST of Moc04g24550 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 650.2 bits (1676), Expect = 1.3e-182
Identity = 351/441 (79.59%), Postives = 365/441 (82.77%), Query Frame = 0
Query: 351 MYYFLTSLADEALTVKLGEEAPATF------------------TEIGRGSSGKD-ERTDP 410
M YFLT LADEALTVKL EEAPATF T+IG+G SGKD E TDP
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTKIGQGRSGKDMENTDP 60
Query: 411 KSKDKGSFSSGRPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKL 470
KSKDKGSFS+GR EYRR E+GPTRSRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKL
Sbjct: 61 KSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKL 120
Query: 471 RGAPEK------------------------------TQQG---QFVGKPRTSSAEKKEER 530
RGAPE+ Q G +FVGKPRTSSAEKKEER
Sbjct: 121 RGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSSAEKKEER 180
Query: 531 KRSRTPPRRTDRPTVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA 590
KRSRTPPRRTDRP VINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTCPITFD A
Sbjct: 181 KRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTCPITFDXA 240
Query: 591 DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPL 650
DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL
Sbjct: 241 DLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL 300
Query: 651 VGFSGESVIPEACIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPST 710
VGFSGESV+PE CIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPST
Sbjct: 301 VGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPST 360
Query: 711 LHQVLKYPTPNGVGTVRGEQTASRECYAAALKGPSVCALETL--RDGTLEFEADLPRKEF 738
LHQVLKY TPNGVGTVRGEQTASRECYA+ LKG SVCALETL RDGTLEFEADLP +EF
Sbjct: 361 LHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEADLPXREF 420
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D9E1 | 8.2e-214 | 69.13 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 1.2e-209 | 77.27 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 2.1e-209 | 56.58 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DYW5 | 5.0e-187 | 86.32 | uncharacterized protein LOC111024332 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
A0A6J1DD03 | 1.3e-182 | 79.59 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
Match Name | E-value | Identity | Description | |