Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCCGCAAACTCGACCAACACGGCAGATCGAAAGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCACAGGTCGGCACGAATCACCGCGCTTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCAATGGAGGAAATACGCGATCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAGGGGGTTCCCACCTCGGCCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACACAAAGATAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGTTGAATCCTCTCACAACCCAGTAACTCCCGCAGGGGTGATCACAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGTTGAGGTTGAGGGCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGTCATTCACCTCGGACGTTTTGAAAGCGTCGATCCCTCCGAAGTTCAAAGCTCTTACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACTTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCAACTCATTTCGCCACCATCAGACAAAAGGAAGGTGAGACGCTGCAGGAGTATGTCACCAGATTCCAGGATGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACTAAAACCGGCCGACCAGAACGAAGGATCGGCCGGGACAGAAGCGGAAAAGGTGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGTCGTTTCCATCGGGAGCACGGCCACAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAAAGCCCAGGACCAGCCCGGCAGAGAAAAAAGAGGAGCGTAAGCGTTCGAGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGGTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCGTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGGAGAACAGACTGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGAGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAAATGAGCTAATTCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCACGAGGACATGTCTGGCATTGACCCGCAAATTATGACGCATCGCCTCAGCATAGATCCGTCATTCCGACCTGTAAAACAAAAAAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTGTTGAAAGCTGAATACATAAGAGAAATTTTGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAGTGGAGAATGTGCGTAGACTTTACGAACTTAAATAAGGCATGCCCGAAAGATTACTTTCCACTGCAGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCACGAACTGCTCACTTTCATGGTTGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCTGGATAAAGATCATACCGCATTCATAACAGACCAAGGTCTGTACTGTTACAAGGTCATGCCCTTCGGTTTAAAGAACGCAGGAGCGACCTACCAGAGAATAGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTCGTCAAGAGCAAGCAGTCTAAGTCGCATCTTTCCGATCTGACCGAAGCCTTCGAGGTTCTAAGGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGAAAAATTCCTTGGCTTCATGGTGAACCACCGGGGGATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGCTCGAGATGGAGGCACCCAAAATGCTGAGACAGCTTCAGTGCCTCAATGGTAGGATTGCAGCCCTGAACCGGTTTGTTTCAAGATCGACAGATAAGTGCCTCCCTTTCTTCAAAGTCCTACGAAAGAAAAGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCATTTCAGCAATTGAAGAGCTACCTCTGCTCGGCACCTTTGCTCGACAAGCCCCTGCCAGGGGACAAGCTCCAGTTGTACTTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACCAGATACCCTCAAATGGAAAAGTTGGCTCTCGCTTTAGTCTCTTCGGCCCGACGGCTTAGACCATACTTCCATGCCCATACTGTGGTGGTGCTCACTAACTTGCCCCTAAAAAACATTTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCAAACCCAGAACTGCCTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCGTGGACAATCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGTCTGCGAATCGCTAGAGCATTGGGGACCTCTTGTGTTAAGGTCTTCAGCGACTCCCAGCTGGTTGTGAGCCAGATCAAGGAAGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGGCAATCCCTCGATCTCAGAGCCAGATCTGATGGAGGTCGACGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAACGCAGAAAGTTGGCAAGGCAAGCAGCTCGGTTTGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTATACGTCCTCAGAAAAATCCACGAAGGAGTGTGCGGCAATCACTTAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAATGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTTGTTAGAACTTGCGACAATTGCCAACGCTACGGAAACGTAATCCACCAACCTCCCGAGCTGCTTACCCCCATCACGGCCTCATGGCCATTCGCCCAGTGGGGGTAGATATTATTGGTCCTTTCCCTCTAGGCAAAGGCCAGACAAAGTTCGCTGTAGTGGCTGTGGATTACTTCACAAAGTGGGTCGAGGGCGAGGCGCTCTCCCACATAACGGAATCCAGAGTCACATCCTCCGTATGGACAAATATCATATGTCGCTTCGGTATACCGCAGGCCATTGTGACAGACAATGGGAAGCAGTTTGACAATGCCAAGTTCAAAGACTTTTGCAGCAAGCTTGGCATAAGTCACCTTAGCTCGTCCCCTGCACATCCGCAAGCAAACAGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGGTTCTATGGTCGTACCGAACCACCCAAAGAGAATCGACGGGTGAAACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTACAACAAATGAGGAAGAGCTGCTCTTCAACCTCGACTTGTTGGAAGAAACAAGAGCAATGACCCAGCTACGCCTGGCAGAATATCAGGGCAGAATGGCCAGACACTACAACGCCCGCGTTCGACCTCGGACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGAGCCGTTTGAGGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCTTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTACTATCCTTGA
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAACACGGCAGATCGAAAGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCACAGGTCGGCACGAATCACCGCGCTTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCAATGGAGGAAATACGCGATCCCGATCTGAGAACCGAGGGTGATCACAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGTTGAGGTTGAGGGCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGTCATTCACCTCGGACGTTTTGAAAGCGTCGATCCCTCCGAAGTTCAAAGCTCTTACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACTTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCAACTCATTTCGCCACCATCAGACAAAAGGAAGGTGAGACGCTGCAGGAGTATGTCACCAGATTCCAGGATGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACTAAAACCGGCCGACCAGAACGAAGGATCGGCCGGGACAGAAGCGGAAAAGGTGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGTCGTTTCCATCGGGAGCACGGCCACAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAAAGCCCAGGACCAGCCCGGCAGAGAAAAAAGAGGAGCGTAAGCGTTCGAGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGGTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCGTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGGAGAACAGACTGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGAGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAAATGAGCTAATTCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCACGAGGACATGTCGGTCCCCGTCGAGATCTTAGGCAATCCCTCGATCTCAGAGCCAGATCTGATGGAGGTCGACGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAACGCAGAAAGTTGGCAAGGCAAGCAGCTCGGTTTGTGGTCCGAGGTGGAGCATTGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGAGCCGTTTGAGGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCTTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTACTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCCGCAAACTCGACCAACACGGCAGATCGAAAGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCACAGGTCGGCACGAATCACCGCGCTTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCAATGGAGGAAATACGCGATCCCGATCTGAGAACCGAGGGTGATCACAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGTTGAGGTTGAGGGCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGTCATTCACCTCGGACGTTTTGAAAGCGTCGATCCCTCCGAAGTTCAAAGCTCTTACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACTTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCAACTCATTTCGCCACCATCAGACAAAAGGAAGGTGAGACGCTGCAGGAGTATGTCACCAGATTCCAGGATGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACTAAAACCGGCCGACCAGAACGAAGGATCGGCCGGGACAGAAGCGGAAAAGGTGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGTCGTTTCCATCGGGAGCACGGCCACAACACGTCGAACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAAAGCCCAGGACCAGCCCGGCAGAGAAAAAAGAGGAGCGTAAGCGTTCGAGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGGTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCGTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGGAGAACAGACTGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGAGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAAATGAGCTAATTCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCACGAGGACATGTCGGTCCCCGTCGAGATCTTAGGCAATCCCTCGATCTCAGAGCCAGATCTGATGGAGGTCGACGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAACGCAGAAAGTTGGCAAGGCAAGCAGCTCGGTTTGTGGTCCGAGGTGGAGCATTGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGAGCCGTTTGAGGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCTTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTACTATCCTTGA
Protein sequence
MVQPANSTNTADRKTLAASDAHQREVGAAVVEGQGHDGLATEPLHRSARITALVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSMEEIRDPDLRTEGDHRAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSDVLKASIPPKFKALTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGKGEKADPKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQVSIGTKLGATDRNELIHFLRSNSDVFAWSHEDMSVPVEILGNPSISEPDLMEVDAPEPSWMDPIVDFIRGNSPQDPKERRKLARQAARFVVRGGALKVQTHVGALDPTWEEPFEVKGIVRPGTYILADLKGDVFAHPWNAEHLKRYYP
Homology
BLAST of Moc10g05220 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 953.0 bits (2462), Expect = 1.8e-273
Identity = 512/706 (72.52%), Postives = 562/706 (79.60%), Query Frame = 0
Query: 1 MVQPANSTNTADRKTLAASDAHQREVGAAVVEGQGHDGLATEPLHRSARITALVLPPAHP 60
MVQPANSTNTADR+ LAA+ HQREVGA VVEGQGH+ L TEPL RSARIT VLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 PRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSMEEIRDPDLRTEGD 120
P+ A +S N P + T
Sbjct: 61 K-------------------PSKAESSYNPIT--------------------PGVIT--- 120
Query: 121 HRAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSDVLKASIPPKFKALTVKP 180
R EFDQL+ K D +VE LKA+CE+KE +DGDLGE SF+SD+L+A IPPKFK T+KP
Sbjct: 121 -REEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKP 180
Query: 181 YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRR 240
YDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTY+QLR+
Sbjct: 181 YDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRK 240
Query: 241 EFLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGL 300
EF++QFSSRHYD+KT TH ATIRQKEGETL+EYVTRF +EQLKVAHCSDDSAMCYFLTGL
Sbjct: 241 EFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGL 300
Query: 301 ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGKGE-KADPK 360
ADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I + R+GK + KAD K
Sbjct: 301 ADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSK 360
Query: 361 SKDKG-SFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKL 420
S+DKG S SS R ++RR+ + +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKL
Sbjct: 361 SRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPEKL 420
Query: 421 RGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEER 480
RG PE+R+ DKYCRFHR+HGHNTSN WELKRQIEDLIQDGYFKKF+GKPR++ EKKEER
Sbjct: 421 RGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEER 480
Query: 481 KRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSA 540
KR RTPPR DRPAVIN K+KELAR ARREVCIIREQRPT I F+ A
Sbjct: 481 KRLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFNHA 540
Query: 541 DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL 600
DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPL
Sbjct: 541 DLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPTPL 600
Query: 601 VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPST 660
VGFSGES+ EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PST
Sbjct: 601 VGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPST 650
Query: 661 LHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRD 705
LHQVLKYST NG GTVRGE SRECYAS K SSVCA+E RD
Sbjct: 661 LHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALEEQTIRD 650
BLAST of Moc10g05220 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 947.6 bits (2448), Expect = 7.7e-272
Identity = 504/610 (82.62%), Postives = 517/610 (84.75%), Query Frame = 0
Query: 122 RAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSDVLKASIPPKFKALTVKPY 181
R EFDQLRGKL+ +VE LKAKCEQKEGPLNDGDLGES FTSDVL+A TVK Y
Sbjct: 22 REEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAP--------TVKSY 81
Query: 182 DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRRE 241
DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 82 DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW------------------ 141
Query: 242 FLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGLA 301
FQ++QLKVA SDDSAMCYFLTGLA
Sbjct: 142 -----------------------------------FQEDQLKVAQSSDDSAMCYFLTGLA 201
Query: 302 DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGKGEKADPKSK 361
DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I R RSGK EKAD KSK
Sbjct: 202 DEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDEKADLKSK 261
Query: 362 DKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA 421
DKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA
Sbjct: 262 DKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA 321
Query: 422 PERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEERKRS 481
PERR+KDKYCRFHREH HNTS+ WELKRQIEDLIQD YFKKF+GKPRTS AEKKEERK S
Sbjct: 322 PERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLS 381
Query: 482 RTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE 541
RTP R DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE
Sbjct: 382 RTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE 441
Query: 542 EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF 601
EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Sbjct: 442 EVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGF 501
Query: 602 SGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQ 661
S ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYN IFGRPIIHSFRAIPSTLHQ
Sbjct: 502 SRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ 561
Query: 662 VLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRDGTLEFEADLPRREFAAP 721
VLKYSTPNG G VRGEQ ASRECYASALKGSSVCA+ETL SRDGTLEF+A+LPRREFAAP
Sbjct: 562 VLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLVSRDGTLEFKANLPRREFAAP 570
Query: 722 TEELELVPLL 732
TEELELVPLL
Sbjct: 622 TEELELVPLL 570
BLAST of Moc10g05220 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 932.2 bits (2408), Expect = 3.4e-267
Identity = 481/530 (90.75%), Postives = 494/530 (93.21%), Query Frame = 0
Query: 104 MRSMEEIRDPDLRTEGDHRAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSD 163
M E R+P R EFDQLRG+LD +VE LKAKCEQKEGPLNDGDLGES FTSD
Sbjct: 1 MVKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSD 60
Query: 164 VLKASIPPKFKALTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW 223
VL+A IPPKFKA TVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLW
Sbjct: 61 VLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLW 120
Query: 224 YRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLK 283
YRRLPA SISTY+QLRREFLA FSSRHYDKKTATH ATIRQKEGETL+EYVTRFQ+EQLK
Sbjct: 121 YRRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLK 180
Query: 284 VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER 343
VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
Sbjct: 181 VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER 240
Query: 344 RIGRDRSGKG-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTN 403
+IGR RSGK E ADPKSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTN
Sbjct: 241 KIGRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 300
Query: 404 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKK 463
IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELKRQIE+LIQDGYFKK
Sbjct: 301 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 360
Query: 464 FMGKPRTSPAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC 523
F+GKPRTS AEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQSG KRKELARAARREVC
Sbjct: 361 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 420
Query: 524 IIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL 583
IIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYL
Sbjct: 421 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 480
Query: 584 ALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFV 633
ALGWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTLGQDQTQVTQMAEFV
Sbjct: 481 ALGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc10g05220 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 795.8 bits (2054), Expect = 3.8e-226
Identity = 409/449 (91.09%), Postives = 421/449 (93.76%), Query Frame = 0
Query: 293 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGK 352
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK IG+ RSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTK-------IGQGRSGK 60
Query: 353 G-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 412
E DPKSKDKGSFS+GRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 413 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSP 472
LKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELK QIEDLIQDGYFKKF+GKPRTS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 473 AEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTC 532
AEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 533 PITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 592
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 593 KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHS 652
KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYN IFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 653 FRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRDGTLEFEA 712
FRAIPSTLHQVLKYSTPNG GTVRGEQTASRECYAS LKG+SVCA+ETL SRDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 713 DLPRREFAAPTEELELVPLLSPEKQVSIG 741
DLP REFAAP EELELVPLLS EKQV +G
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQLG 442
BLAST of Moc10g05220 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 775.0 bits (2000), Expect = 6.9e-220
Identity = 408/546 (74.73%), Postives = 450/546 (82.42%), Query Frame = 0
Query: 198 MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTAT 257
MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTY+QLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 258 HFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 317
H ATIRQKE ETL+EYVTRFQ+EQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 318 AEVLQKAKKVIDGQELLRTKTGRPERRIGRDR-SGKGEKADPKSKDKGSFSS-GRAEFRR 377
EVLQKAKKVIDGQELLRTKTGRPE++I + + S + KAD KS+DKGS SS R E+RR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 378 AVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHR 437
+GP+RSRPYER+T +TIPISEILTNIEESGMEKLLKRPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 438 EHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEERKRSRTPPRCTDRPAVIN 497
+HGHNT++CWELKRQIEDLIQDGYFKKF+GKPR++ EKKEERKRSRTPPR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 498 TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAP 557
TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF ADLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 558 LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLP 617
LIDH +VRRVL+DG GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 618 VTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVR 677
VT+GQD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYSTPN G VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 678 GEQTASRECYASALKGSSVCAIETLASRDGTLEFEADLP---RREFAAPTEELELVPLLS 737
GEQ SRECYASALKGS+VCA+E +R E EADLP +R+F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 506
Query: 738 PEKQVS 739
PE+Q +
Sbjct: 541 PERQAN 506
BLAST of Moc10g05220 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 953.0 bits (2462), Expect = 8.9e-274
Identity = 512/706 (72.52%), Postives = 562/706 (79.60%), Query Frame = 0
Query: 1 MVQPANSTNTADRKTLAASDAHQREVGAAVVEGQGHDGLATEPLHRSARITALVLPPAHP 60
MVQPANSTNTADR+ LAA+ HQREVGA VVEGQGH+ L TEPL RSARIT VLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 PRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSMEEIRDPDLRTEGD 120
P+ A +S N P + T
Sbjct: 61 K-------------------PSKAESSYNPIT--------------------PGVIT--- 120
Query: 121 HRAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSDVLKASIPPKFKALTVKP 180
R EFDQL+ K D +VE LKA+CE+KE +DGDLGE SF+SD+L+A IPPKFK T+KP
Sbjct: 121 -REEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKP 180
Query: 181 YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRR 240
YDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTY+QLR+
Sbjct: 181 YDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRK 240
Query: 241 EFLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGL 300
EF++QFSSRHYD+KT TH ATIRQKEGETL+EYVTRF +EQLKVAHCSDDSAMCYFLTGL
Sbjct: 241 EFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGL 300
Query: 301 ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGKGE-KADPK 360
ADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I + R+GK + KAD K
Sbjct: 301 ADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSK 360
Query: 361 SKDKG-SFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKL 420
S+DKG S SS R ++RR+ + +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKL
Sbjct: 361 SRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPEKL 420
Query: 421 RGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEER 480
RG PE+R+ DKYCRFHR+HGHNTSN WELKRQIEDLIQDGYFKKF+GKPR++ EKKEER
Sbjct: 421 RGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEER 480
Query: 481 KRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSA 540
KR RTPPR DRPAVIN K+KELAR ARREVCIIREQRPT I F+ A
Sbjct: 481 KRLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFNHA 540
Query: 541 DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL 600
DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPL
Sbjct: 541 DLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPTPL 600
Query: 601 VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPST 660
VGFSGES+ EGCIDLPV++ QD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PST
Sbjct: 601 VGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPST 650
Query: 661 LHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRD 705
LHQVLKYST NG GTVRGE SRECYAS K SSVCA+E RD
Sbjct: 661 LHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALEEQTIRD 650
BLAST of Moc10g05220 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 947.6 bits (2448), Expect = 3.8e-272
Identity = 504/610 (82.62%), Postives = 517/610 (84.75%), Query Frame = 0
Query: 122 RAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSDVLKASIPPKFKALTVKPY 181
R EFDQLRGKL+ +VE LKAKCEQKEGPLNDGDLGES FTSDVL+A TVK Y
Sbjct: 22 REEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAP--------TVKSY 81
Query: 182 DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRRE 241
DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 82 DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW------------------ 141
Query: 242 FLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGLA 301
FQ++QLKVA SDDSAMCYFLTGLA
Sbjct: 142 -----------------------------------FQEDQLKVAQSSDDSAMCYFLTGLA 201
Query: 302 DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGKGEKADPKSK 361
DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I R RSGK EKAD KSK
Sbjct: 202 DEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDEKADLKSK 261
Query: 362 DKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA 421
DKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA
Sbjct: 262 DKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGA 321
Query: 422 PERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEERKRS 481
PERR+KDKYCRFHREH HNTS+ WELKRQIEDLIQD YFKKF+GKPRTS AEKKEERK S
Sbjct: 322 PERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLS 381
Query: 482 RTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE 541
RTP R DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE
Sbjct: 382 RTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLE 441
Query: 542 EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF 601
EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Sbjct: 442 EVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGF 501
Query: 602 SGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQ 661
S ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYN IFGRPIIHSFRAIPSTLHQ
Sbjct: 502 SRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQ 561
Query: 662 VLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRDGTLEFEADLPRREFAAP 721
VLKYSTPNG G VRGEQ ASRECYASALKGSSVCA+ETL SRDGTLEF+A+LPRREFAAP
Sbjct: 562 VLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLVSRDGTLEFKANLPRREFAAP 570
Query: 722 TEELELVPLL 732
TEELELVPLL
Sbjct: 622 TEELELVPLL 570
BLAST of Moc10g05220 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 932.2 bits (2408), Expect = 1.6e-267
Identity = 481/530 (90.75%), Postives = 494/530 (93.21%), Query Frame = 0
Query: 104 MRSMEEIRDPDLRTEGDHRAEFDQLRGKLDVEVEGLKAKCEQKEGPLNDGDLGESSFTSD 163
M E R+P R EFDQLRG+LD +VE LKAKCEQKEGPLNDGDLGES FTSD
Sbjct: 1 MVKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSD 60
Query: 164 VLKASIPPKFKALTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW 223
VL+A IPPKFKA TVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLW
Sbjct: 61 VLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLW 120
Query: 224 YRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHFATIRQKEGETLQEYVTRFQDEQLK 283
YRRLPA SISTY+QLRREFLA FSSRHYDKKTATH ATIRQKEGETL+EYVTRFQ+EQLK
Sbjct: 121 YRRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLK 180
Query: 284 VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER 343
VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
Sbjct: 181 VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER 240
Query: 344 RIGRDRSGKG-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTN 403
+IGR RSGK E ADPKSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTN
Sbjct: 241 KIGRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 300
Query: 404 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKK 463
IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELKRQIE+LIQDGYFKK
Sbjct: 301 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 360
Query: 464 FMGKPRTSPAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC 523
F+GKPRTS AEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQSG KRKELARAARREVC
Sbjct: 361 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 420
Query: 524 IIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL 583
IIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYL
Sbjct: 421 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 480
Query: 584 ALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFV 633
ALGWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTLGQDQTQVTQMAEFV
Sbjct: 481 ALGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc10g05220 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 795.8 bits (2054), Expect = 1.8e-226
Identity = 409/449 (91.09%), Postives = 421/449 (93.76%), Query Frame = 0
Query: 293 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGRDRSGK 352
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK IG+ RSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTK-------IGQGRSGK 60
Query: 353 G-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 412
E DPKSKDKGSFS+GRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 413 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSP 472
LKRPEKLRGAPERRSKDKYCRFHREHGHNTS+ WELK QIEDLIQDGYFKKF+GKPRTS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 473 AEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTC 532
AEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 533 PITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 592
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 593 KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHS 652
KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYN IFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 653 FRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVCAIETLASRDGTLEFEA 712
FRAIPSTLHQVLKYSTPNG GTVRGEQTASRECYAS LKG+SVCA+ETL SRDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 713 DLPRREFAAPTEELELVPLLSPEKQVSIG 741
DLP REFAAP EELELVPLLS EKQV +G
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQLG 442
BLAST of Moc10g05220 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 775.0 bits (2000), Expect = 3.3e-220
Identity = 408/546 (74.73%), Postives = 450/546 (82.42%), Query Frame = 0
Query: 198 MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTAT 257
MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTY+QLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 258 HFATIRQKEGETLQEYVTRFQDEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 317
H ATIRQKE ETL+EYVTRFQ+EQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 318 AEVLQKAKKVIDGQELLRTKTGRPERRIGRDR-SGKGEKADPKSKDKGSFSS-GRAEFRR 377
EVLQKAKKVIDGQELLRTKTGRPE++I + + S + KAD KS+DKGS SS R E+RR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 378 AVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHR 437
+GP+RSRPYER+T +TIPISEILTNIEESGMEKLLKRPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 438 EHGHNTSNCWELKRQIEDLIQDGYFKKFMGKPRTSPAEKKEERKRSRTPPRCTDRPAVIN 497
+HGHNT++CWELKRQIEDLIQDGYFKKF+GKPR++ EKKEERKRSRTPPR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 498 TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAP 557
TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF ADLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 558 LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLP 617
LIDH +VRRVL+DG GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 618 VTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVR 677
VT+GQD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYSTPN G VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 678 GEQTASRECYASALKGSSVCAIETLASRDGTLEFEADLP---RREFAAPTEELELVPLLS 737
GEQ SRECYASALKGS+VCA+E +R E EADLP +R+F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 506
Query: 738 PEKQVS 739
PE+Q +
Sbjct: 541 PERQAN 506
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DHB3 | 8.9e-274 | 72.52 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9E1 | 3.8e-272 | 82.62 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 1.6e-267 | 90.75 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DD03 | 1.8e-226 | 91.09 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1DZB9 | 3.3e-220 | 74.73 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |