Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGAGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATACTAACTGCAGGCGCAGGTCCCGATCTGAAAACCGAGTGACGCGCGTTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAATATCCCGAAGACAGCGAGAGTGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCCCTCCGAAAAGGACAGTCACCATCCCGCTCACATCGGAGCTCCAACCAACAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCATGGATGATCACAAGGGAGGAGTTTGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGTAGAAGGAAGGTCCACTGAACGATGGCGATCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTGTGATGGGTCGAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCAAAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGTGGGAATATGTCACCTGGTTCCAGAAGGAACAATTGAAGGTCGCACACTGCTCCGATGACTCAGCCATGTGTTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGTCCCGACCACCTTCGCCGAAGTGCTACAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAATATATAGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCAAGGAGTTTGGAATGGAAAAACTCCTCAAACGCCCTGAGAAGCTTCGGGGAGCCCTAGAGAGGGGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGATACTTCAAGAAATTTGTGGGGAAGCCCGGGACCAGCTCGGCAGAAAAAAAGGAAGAGAGGAAGCGTTCGCGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAACACCATTTTCGGAGGGCCAAGCGGGGGCCAGTTCGGACATAAAAGAAAGAAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCGTCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGCGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGGTGCTAGTAGACGGGGGCGCATCTGCTAATATCCTGTCCCTACCAACCTACCACGCCCTGGGATGGACAAGGTTGCAGTTGAAGAAAAGCCCAACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGTCGGTCACATTTGGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGAAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCGCTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGAGGAGAACATACTGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGGTCATCGGTATGCGCTCTCGAGACTCACGCCAGTGGGGAAGGGACGCTCGAGTCCGCGGCCGACCTGCCGAGAAGGGAGTTTTCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACTAAGCTGGGGGCCACCGACGGAGAGGAGCTAATCCATTTCCTCAGATCCAACTCGGACGTCTTTGCATGGTCTCACGAGGACATGCCTGGTATCGACCCGAAGATTATGACGCATCGCCTCAGCATAGAGCCGTCATTCCGATCTGTAAAACAAAAAAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTATTGAGGAAGTTAACAAACTTTTGAAAGCTGAATACATAAAAGAAATTTCGTATCCCGAGTGGCTCTCCAATTTTGTATTAGTTAAAAAATCTAACGGAAGGTGGAGAATGTGCGTAGACTTTACGAACCTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATCGATCAGCTCGTGGACGCCACGGCCGGGCACAAACTGTTCACCTTCATGGATGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATCAAGATCATACCGCATTCATAACAGACCAAGGTCTGTATTGTTACAAGGTCATGCCCTTCGGTTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGAAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTTTAAGTCGCATCTTTCCGATCTGACCGAAGCCTTCGAGGTTCTAAGGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACCACCGGGGGATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGCTTGAGATGGAGGCACCCAAGACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCAGCCCTGAACCGGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAGTCCTACGAAAGAAAGGGCCGTTTGAATGGACAGCGGAGTGCGAACAAGCATTTCGGCAGTTGAAGAGCTACCTCTGCTCGGCACCTTTGCTCGCCAAGCCCCTACCAGGGGACAAGCTCCAGTTGTACTTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCTCTAATCAGGCAAGAGGAAGCGCGACAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACCAGATACCCTCAAATGGAAAAGTTGGCTCTCGCTTTAGTCACTTCCAAGCCCATACTGTGGTGGTGCTCACTAACTTGCCCCTAAAAAACATCTTCCATAAGCTAGAAGCTTCTGGACGCCTGATGAAGTGGACAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAAAACTGCGTTGAAAGGACAAGCAGTGGCAGATTTCATAGCCGAGCTCACACCATCTTTCGAGCTGAGCGAGTCCGACCTACCGTGGACAATCTATGTCGACGAATCCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTAAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGTCTGCGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGCGACTCTCAACTGGTTGTGAGCCATATCAAGGAAGAGTACCAAGCTAGAGACTCCCGAATGGAGAAATATTTGGGCAAGATCAGGTCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAACCAGGTTCTCCGAGCAGAAAATTCTAACGCTGACGCCTTGGCCAAGTTAGCATCAGCATACGAGACCGACCTGGCCAGGTCGGTCCCCGTTGAGATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACAACAAGACCCCAAGGAGCGTAGAAAGTTGGCAAGGCAGGCAGCTCGGTTCGTGGTCCGAGGAGGAGCGTTGTACCGGCGCGGCTTTTCCCTGCCTCTACTGAGATGTTTAACACCTGAAAAGGGCCTATACGTCCTCAGAGAAATCCACGAAGGAGTATGCGGCAATCACTCAGGCGCCCGATCGCTGTCAACCAAGGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAAACGTAATCCACCAACCTCCCGAGCTGCTTACCCCCATCTTGGCCCCATGGCCATTCGCGCAGTGAGGGGTAGATATTATTGGTCCTTTCCCTTTGGGCAAAGGCCAGACAAAGTTCGCTGTAGTTGCTGTGGATTACTTCACAAAGTGGGCCGAGGCCAAGGCGCTCTCCCACATAACGGAATCCAGAGTTACGTCCTTCGTATGTACAAGTATCATATGTCGCTTTGGTATACCGAAGGCCATTGTAACAGACAATGGGAAGCAGTTTGACAACGCAAGGTTCAAAGACTTTTGCAGCAAGCTTGACATAAGTCACCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGCGGCGGTCAACAAGATCATCAAGCGAGGCATCAAGCTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAAACGCTACCAGAGGTTCTATGGTCATACCGGACCACCCAAAGAGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGGATGCCATCTGATAGAGTAGAGCATTACGAGCCCACAGCAAATGAGGAAGAGCTGCTCCTCGACCTCGACTTATTGGAGGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAAGGCAGAATGGCCAGACACTACAACGCCCGCGTTCGACCTCGAACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCACGTGGGTGCCCTGGACCCGATCTGGGAGGGGCCGTTTGAAGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGAAATACCAAAATGGTTTTCAATGGATCTGTAAAACCTGTTTCAATAGGATTATGATCGGAATAAATGTGATGATTTAATTTCATGATTCTGAGTTCGACCAGAAATTAAATGGGGGCCACGGACTCCCACACGATCACATTCCAGCAGTCGGTTAAAATTCAATCCTCCAAAACCTAAGGGTACGAGGTGCAATGCCAAAACCACTGATGAACTTAAAATTCAAAACCTTCAAGGTAAAGGGGCGATGTGAAAAGTTCAAAATGATCAAGCCTCTGGACCTGAAGGTACGAAGTGCGATATGAAGAACAGCTACAGACTTAGGAGTTTAGCCTTTTAACGTTTTTAAGTTAAGGGTGCGATGTCAAAAATCCAGAGTTGGTGCAATGCATTGAATACTAGACAGAGATTCGAATTCAAAATTTCAAGTATTTTAATTGAAGGCGCGACCTCAAAAGTACGAGGTGCGAGGTGACATTGGCTTTAGCTTAAAGAAAAAGCAAAGAAAGGAAGGCAACAAAGTAAAAACAAATGAGAGCTTCTTTATTGAATGGAGAGCAGAGCCAAGGCTTATAGACCTTACAGCTCTGCCCTGA
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGAGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATACTAACTGCAGGCGCAGTCGAGGAGGAATATCCCGAAGACAGCGAGAGTGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCCCTCCGAAAAGGACAGTCACCATCCCGCTCACATCGGAGCTCCAACCAACAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCATGGATGATCACAAGGGAGGAGTTTGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTGTGATGGGTCGAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCAAAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGTGGGAATATGTCACCTGGTTCCAGAAGGAACAATTGAAGGTCGCACACTGCTCCGATGACTCAGCCATGTGTTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGTCCCGACCACCTTCGCCGAAGTGCTACAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAATATATAGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCAAGGAGTTTGGAATGGAAAAACTCCTCAAACGCCCTGAGAAGCTTCGGGGAGCCCTAGAGAGGGGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGATACTTCAAGAAATTTGTGGGGAAGCCCGGGACCAGCTCGGCAGAAAAAAAGGAAGAGAGGAAGCGTTCGCGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAACACCATTTTCGGAGGGCCAAGCGGGGGCCAGTTCGGACATAAAAGAAAGAAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCGTCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGCGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGGTGCTAGTAGACGGGGGCGCATCTGCTAATATCCTGTCCCTACCAACCTACCACGCCCTGGGATGGACAAGGTTGCAGTTGAAGAAAAGCCCAACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGTCGGTCACATTTGGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGAAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCGCTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGAGGAGAACATACTGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGGTCATCGCATCAGCATACGAGACCGACCTGGCCAGGTCGGTCCCCGTTGAGATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACAACAAGACCCCAAGGAGCGTAGAAAGTTGGCAAGGCAGGCAGCTCGGTTCGTGGTAAAGGGGCGATGTGAAAAGTTCAAAATGATCAAGCCTCTGGACCTGAAGGCGCGACCTCAAAAGTACGAGGTGCGAGGTGACATTGGCTTTAGCTTAAAGAAAAAGCAAAGAAAGGAAGGCAACAAAGTAAAAACAAATGAGAGCTTCTTTATTGAATGGAGAGCAGAGCCAAGGCTTATAGACCTTACAGCTCTGCCCTGA
Coding sequence (CDS)
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGAGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATACTAACTGCAGGCGCAGTCGAGGAGGAATATCCCGAAGACAGCGAGAGTGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCCCTCCGAAAAGGACAGTCACCATCCCGCTCACATCGGAGCTCCAACCAACAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCATGGATGATCACAAGGGAGGAGTTTGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTGTGATGGGTCGAAGGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTATCAAAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAAAAGGAAGGTGAGACGCTGTGGGAATATGTCACCTGGTTCCAGAAGGAACAATTGAAGGTCGCACACTGCTCCGATGACTCAGCCATGTGTTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGTCCCGACCACCTTCGCCGAAGTGCTACAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAATATATAGAAAAGGCGGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCAAGGAGTTTGGAATGGAAAAACTCCTCAAACGCCCTGAGAAGCTTCGGGGAGCCCTAGAGAGGGGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGATTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGATACTTCAAGAAATTTGTGGGGAAGCCCGGGACCAGCTCGGCAGAAAAAAAGGAAGAGAGGAAGCGTTCGCGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAACACCATTTTCGGAGGGCCAAGCGGGGGCCAGTTCGGACATAAAAGAAAGAAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCGTCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGCGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGAAGGGTGCTAGTAGACGGGGGCGCATCTGCTAATATCCTGTCCCTACCAACCTACCACGCCCTGGGATGGACAAGGTTGCAGTTGAAGAAAAGCCCAACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGTCGGTCACATTTGGGCAAGACAAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGAAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCGCTCCCTCAACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCGTGAGGAGAACATACTGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGGTCATCGCATCAGCATACGAGACCGACCTGGCCAGGTCGGTCCCCGTTGAGATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACAACAAGACCCCAAGGAGCGTAGAAAGTTGGCAAGGCAGGCAGCTCGGTTCGTGGTAAAGGGGCGATGTGAAAAGTTCAAAATGATCAAGCCTCTGGACCTGAAGGCGCGACCTCAAAAGTACGAGGTGCGAGGTGACATTGGCTTTAGCTTAAAGAAAAAGCAAAGAAAGGAAGGCAACAAAGTAAAAACAAATGAGAGCTTCTTTATTGAATGGAGAGCAGAGCCAAGGCTTATAGACCTTACAGCTCTGCCCTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGETSKKGARGPTPTPTSENFDALQREMEAMRTQMRSMEEMYNEMILTAGAVEEEYPEDSESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAWMITREEFDQLRGKLDAQVEALKAKSPIPPKFKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIKEFGMEKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAAPSTLHQILKYSTPNGVRRTYCFEGVLCLRTQRVIASAYETDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPIVDFIRGNSQQDPKERRKLARQAARFVVKGRCEKFKMIKPLDLKARPQKYEVRGDIGFSLKKKQRKEGNKVKTNESFFIEWRAEPRLIDLTALP
Homology
BLAST of Moc07g24540 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 890.2 bits (2299), Expect = 1.4e-254
Identity = 462/528 (87.50%), Postives = 474/528 (89.77%), Query Frame = 0
Query: 168 QAESSHNPATPAWMITREEFDQLRGKLDAQVEALKAK----------------------- 227
+AESS NPATPA +ITREEFDQLRG+LDAQVEALKAK
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 228 -SPIPPKFKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQ 287
+PIPPKFKAPTVKP DGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWY+
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 288 RLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVA 347
RLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETL EYVT FQ+EQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 348 HCSDDSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI 407
HCSDDSAMCYFLTGLADEALTVKLGEE P TFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 408 GRGRSGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIK 467
GRGRSGK IE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNI+
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 468 EFGMEKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 527
E GMEKLLKRPEKLRGA ER SKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 528 GKPGTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVI 587
GKP TSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQ G KRK+LARAARREVC+I
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 588 REQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHAL 647
REQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTY AL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 648 GWTRLQLKKSPTPLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFV 672
GWTR QLKKSPTPLVGFSGESVIPEG IDL VT GQD+TQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc07g24540 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 863.6 bits (2230), Expect = 1.4e-246
Identity = 480/735 (65.31%), Postives = 521/735 (70.88%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGETSKKGARGPTPTPTSENFDALQREMEAMRTQMRSMEEMYNEMILTAGAV 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 EEEYPEDSESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAW 180
AESS+NP TP
Sbjct: 121 ------------------------------------------------AESSYNPITPG- 180
Query: 181 MITREEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTV 240
+ITREEFDQL+ K DAQVEALKA+ + IPPKFK PT+
Sbjct: 181 VITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTM 240
Query: 241 KPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQL 300
KP DGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWY+RLPAR ISTYSQL
Sbjct: 241 KPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQL 300
Query: 301 RREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVAHCSDDSAMCYFLT 360
R+EF++QFSSRHYD+KT THLATIRQKEGETL EYVT F +EQLKVAHCSDDSAMCYFLT
Sbjct: 301 RKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLT 360
Query: 361 GLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKYIEKAD 420
GLADE LTVKL EE P TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GK KAD
Sbjct: 361 GLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKAD 420
Query: 421 PKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIKEFGMEKLLKRPE 480
KS+DKG S SS R +YRR+ + +SRPYE +TPTTIPIFEILTNI+E GMEKLLKRPE
Sbjct: 421 SKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPE 480
Query: 481 KLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKE 540
KLRG E+ + DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKE
Sbjct: 481 KLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKE 540
Query: 541 ERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVIREQRPTCPITFD 600
ERKR RTPPR DRPAVIN K+K+LAR ARREVC+IREQRPT I F+
Sbjct: 541 ERKRLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFN 600
Query: 601 SADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHALGWTRLQLKKSPT 660
ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TY ALGWTR QLKKSPT
Sbjct: 601 HADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPT 617
Query: 661 PLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAAP 711
PLVGFSGES+ EGCIDL V+ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA P
Sbjct: 661 PLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVP 617
BLAST of Moc07g24540 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 803.9 bits (2075), Expect = 1.3e-228
Identity = 437/563 (77.62%), Postives = 451/563 (80.11%), Query Frame = 0
Query: 164 SSNQQAESSHNPATPAWMITREEFDQLRGKLDAQVEALKAK---------------SPIP 223
SSNQQAESSHNPATP +ITREEFDQLRGKL+AQVEALKAK SP
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 224 PK-FKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPA 283
+APTVK DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARL
Sbjct: 62 SDVLEAPTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARL------- 121
Query: 284 RSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVAHCSD 343
WFQ++QLKVA SD
Sbjct: 122 ----------------------------------------------WFQEDQLKVAQSSD 181
Query: 344 DSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR 403
DSAMCYFLTGLADEALTVKLG+E P TFAEVLQKAKKVIDGQELLRTKTGRPER I RGR
Sbjct: 182 DSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGR 241
Query: 404 SGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIKEFGM 463
SGK EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILTNI+E GM
Sbjct: 242 SGK-DEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGM 301
Query: 464 EKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPG 523
EKLLKRPEKLRGA ER +KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP
Sbjct: 302 EKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPR 361
Query: 524 TSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVIREQR 583
TSSAEKKEERK SRTP R DRPAVINTIFGGPSGGQ GHKRK+LARAARREVC+IREQR
Sbjct: 362 TSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQR 421
Query: 584 PTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHALGWTR 643
PTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY ALGWTR
Sbjct: 422 PTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTR 481
Query: 644 LQLKKSPTPLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPI 703
QLKKS TPLVGFS ESVIPEGCIDL VT G D+TQVTQMAEFVVIDGRSAYNAIFGRPI
Sbjct: 482 SQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPI 510
Query: 704 IHSFRAAPSTLHQILKYSTPNGV 711
IHSFRA PSTLHQ+LKYSTPNGV
Sbjct: 542 IHSFRAIPSTLHQVLKYSTPNGV 510
BLAST of Moc07g24540 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 727.2 bits (1876), Expect = 1.6e-205
Identity = 368/402 (91.54%), Postives = 381/402 (94.78%), Query Frame = 0
Query: 204 KSPIPPKFKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQ 263
++PIPPKFKAPTVKP DGSKDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWY+
Sbjct: 64 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASDAIKCRAFQIALTGSARLWYR 123
Query: 264 RLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVA 323
RLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQKEG TL EYVT FQ+EQLKVA
Sbjct: 124 RLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQKEGGTLREYVTRFQEEQLKVA 183
Query: 324 HCSDDSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI 383
HCSDDSAMCYFLTGLADEALTVKLGE+ PTTFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Sbjct: 184 HCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAKKVIDGQELLRTKTGRPDRKI 243
Query: 384 GRGRSGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIK 443
GRGRSGK +E+ADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPI EILTNI+
Sbjct: 244 GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSRPYERFTPTTIPISEILTNIE 303
Query: 444 EFGMEKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 503
E GMEKLLKRPEKLRGA ER SKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV
Sbjct: 304 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 363
Query: 504 GKPGTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVI 563
GKP TSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRK+LARAARREVC+I
Sbjct: 364 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 423
Query: 564 REQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVL 606
REQ PTCPITFD AD EEVHLPHNDA VIAPLIDHVVVRRVL
Sbjct: 424 REQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRRVL 465
BLAST of Moc07g24540 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 688.3 bits (1775), Expect = 8.3e-194
Identity = 357/476 (75.00%), Postives = 390/476 (81.93%), Query Frame = 0
Query: 236 MDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFSSRHYDKKTTT 295
MDFQAA+DAIKCRAFQIALTGSARLWY+RLPARSISTYSQLR+EF++QFSS HYD+KT T
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 296 HLATIRQKEGETLWEYVTWFQKEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPTTF 355
HLATIRQKE ETL EYVT FQ+EQLKVAHCSDDSAMCYFLT LADE LTVKLGEE PTTF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 356 AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKYIEKADPKSKDKGSFSS-GRAEYRR 415
EVLQKAKKVIDGQELLRTKTGRPE++I + + + KAD KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 416 AENGPTRSRPYERFTPTTIPIFEILTNIKEFGMEKLLKRPEKLRGALERGSKDKYCRFHR 475
E+GP+RSRPYER+T +TIPI EILTNI+E GMEKLLKRPEKLRG LE+ +K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 476 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRCTDRPAVIN 535
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 536 TIFGGPSGGQFGHKRKKLARAARREVCVIREQRPTCPITFDSADLEEVHLPHNDALVIAP 595
TIFGGP+GGQ G+KRK+LAR ARREVC+IRE +PTC ITF ADLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 596 LIDHVVVRRVLVDGGASANILSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCIDLS 655
LIDH +VRRVL+DG GCIDL
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 656 VTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAAPSTLHQILKYSTPNGV 711
VT GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA PSTLHQ+LKYSTPN V
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEV 436
BLAST of Moc07g24540 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 890.2 bits (2299), Expect = 6.9e-255
Identity = 462/528 (87.50%), Postives = 474/528 (89.77%), Query Frame = 0
Query: 168 QAESSHNPATPAWMITREEFDQLRGKLDAQVEALKAK----------------------- 227
+AESS NPATPA +ITREEFDQLRG+LDAQVEALKAK
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 228 -SPIPPKFKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQ 287
+PIPPKFKAPTVKP DGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWY+
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 288 RLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVA 347
RLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETL EYVT FQ+EQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 348 HCSDDSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI 407
HCSDDSAMCYFLTGLADEALTVKLGEE P TFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 408 GRGRSGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIK 467
GRGRSGK IE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNI+
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 468 EFGMEKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 527
E GMEKLLKRPEKLRGA ER SKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 528 GKPGTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVI 587
GKP TSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQ G KRK+LARAARREVC+I
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 588 REQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHAL 647
REQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTY AL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 648 GWTRLQLKKSPTPLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFV 672
GWTR QLKKSPTPLVGFSGESVIPEG IDL VT GQD+TQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc07g24540 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 863.6 bits (2230), Expect = 6.9e-247
Identity = 480/735 (65.31%), Postives = 521/735 (70.88%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGETSKKGARGPTPTPTSENFDALQREMEAMRTQMRSMEEMYNEMILTAGAV 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 EEEYPEDSESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAW 180
AESS+NP TP
Sbjct: 121 ------------------------------------------------AESSYNPITPG- 180
Query: 181 MITREEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTV 240
+ITREEFDQL+ K DAQVEALKA+ + IPPKFK PT+
Sbjct: 181 VITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTM 240
Query: 241 KPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQL 300
KP DGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWY+RLPAR ISTYSQL
Sbjct: 241 KPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQL 300
Query: 301 RREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVAHCSDDSAMCYFLT 360
R+EF++QFSSRHYD+KT THLATIRQKEGETL EYVT F +EQLKVAHCSDDSAMCYFLT
Sbjct: 301 RKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLT 360
Query: 361 GLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKYIEKAD 420
GLADE LTVKL EE P TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GK KAD
Sbjct: 361 GLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKAD 420
Query: 421 PKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIKEFGMEKLLKRPE 480
KS+DKG S SS R +YRR+ + +SRPYE +TPTTIPIFEILTNI+E GMEKLLKRPE
Sbjct: 421 SKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPE 480
Query: 481 KLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKE 540
KLRG E+ + DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKE
Sbjct: 481 KLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKE 540
Query: 541 ERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVIREQRPTCPITFD 600
ERKR RTPPR DRPAVIN K+K+LAR ARREVC+IREQRPT I F+
Sbjct: 541 ERKRLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFN 600
Query: 601 SADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHALGWTRLQLKKSPT 660
ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TY ALGWTR QLKKSPT
Sbjct: 601 HADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPT 617
Query: 661 PLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAAP 711
PLVGFSGES+ EGCIDL V+ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA P
Sbjct: 661 PLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVP 617
BLAST of Moc07g24540 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 803.9 bits (2075), Expect = 6.5e-229
Identity = 437/563 (77.62%), Postives = 451/563 (80.11%), Query Frame = 0
Query: 164 SSNQQAESSHNPATPAWMITREEFDQLRGKLDAQVEALKAK---------------SPIP 223
SSNQQAESSHNPATP +ITREEFDQLRGKL+AQVEALKAK SP
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 224 PK-FKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPA 283
+APTVK DGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARL
Sbjct: 62 SDVLEAPTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARL------- 121
Query: 284 RSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVAHCSD 343
WFQ++QLKVA SD
Sbjct: 122 ----------------------------------------------WFQEDQLKVAQSSD 181
Query: 344 DSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR 403
DSAMCYFLTGLADEALTVKLG+E P TFAEVLQKAKKVIDGQELLRTKTGRPER I RGR
Sbjct: 182 DSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGR 241
Query: 404 SGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIKEFGM 463
SGK EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILTNI+E GM
Sbjct: 242 SGK-DEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGM 301
Query: 464 EKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPG 523
EKLLKRPEKLRGA ER +KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP
Sbjct: 302 EKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPR 361
Query: 524 TSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVIREQR 583
TSSAEKKEERK SRTP R DRPAVINTIFGGPSGGQ GHKRK+LARAARREVC+IREQR
Sbjct: 362 TSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQR 421
Query: 584 PTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYHALGWTR 643
PTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY ALGWTR
Sbjct: 422 PTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTR 481
Query: 644 LQLKKSPTPLVGFSGESVIPEGCIDLSVTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPI 703
QLKKS TPLVGFS ESVIPEGCIDL VT G D+TQVTQMAEFVVIDGRSAYNAIFGRPI
Sbjct: 482 SQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPI 510
Query: 704 IHSFRAAPSTLHQILKYSTPNGV 711
IHSFRA PSTLHQ+LKYSTPNGV
Sbjct: 542 IHSFRAIPSTLHQVLKYSTPNGV 510
BLAST of Moc07g24540 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 727.2 bits (1876), Expect = 7.8e-206
Identity = 368/402 (91.54%), Postives = 381/402 (94.78%), Query Frame = 0
Query: 204 KSPIPPKFKAPTVKPCDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYQ 263
++PIPPKFKAPTVKP DGSKDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWY+
Sbjct: 64 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASDAIKCRAFQIALTGSARLWYR 123
Query: 264 RLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLWEYVTWFQKEQLKVA 323
RLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQKEG TL EYVT FQ+EQLKVA
Sbjct: 124 RLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQKEGGTLREYVTRFQEEQLKVA 183
Query: 324 HCSDDSAMCYFLTGLADEALTVKLGEEVPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI 383
HCSDDSAMCYFLTGLADEALTVKLGE+ PTTFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Sbjct: 184 HCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAKKVIDGQELLRTKTGRPDRKI 243
Query: 384 GRGRSGKYIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIK 443
GRGRSGK +E+ADPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPI EILTNI+
Sbjct: 244 GRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSRPYERFTPTTIPISEILTNIE 303
Query: 444 EFGMEKLLKRPEKLRGALERGSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 503
E GMEKLLKRPEKLRGA ER SKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV
Sbjct: 304 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 363
Query: 504 GKPGTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQFGHKRKKLARAARREVCVI 563
GKP TSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRK+LARAARREVC+I
Sbjct: 364 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 423
Query: 564 REQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVL 606
REQ PTCPITFD AD EEVHLPHNDA VIAPLIDHVVVRRVL
Sbjct: 424 REQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRRVL 465
BLAST of Moc07g24540 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 688.3 bits (1775), Expect = 4.0e-194
Identity = 357/476 (75.00%), Postives = 390/476 (81.93%), Query Frame = 0
Query: 236 MDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFSSRHYDKKTTT 295
MDFQAA+DAIKCRAFQIALTGSARLWY+RLPARSISTYSQLR+EF++QFSS HYD+KT T
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 296 HLATIRQKEGETLWEYVTWFQKEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEVPTTF 355
HLATIRQKE ETL EYVT FQ+EQLKVAHCSDDSAMCYFLT LADE LTVKLGEE PTTF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 356 AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKYIEKADPKSKDKGSFSS-GRAEYRR 415
EVLQKAKKVIDGQELLRTKTGRPE++I + + + KAD KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 416 AENGPTRSRPYERFTPTTIPIFEILTNIKEFGMEKLLKRPEKLRGALERGSKDKYCRFHR 475
E+GP+RSRPYER+T +TIPI EILTNI+E GMEKLLKRPEKLRG LE+ +K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 476 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRCTDRPAVIN 535
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKRSRTPPR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 536 TIFGGPSGGQFGHKRKKLARAARREVCVIREQRPTCPITFDSADLEEVHLPHNDALVIAP 595
TIFGGP+GGQ G+KRK+LAR ARREVC+IRE +PTC ITF ADLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 596 LIDHVVVRRVLVDGGASANILSLPTYHALGWTRLQLKKSPTPLVGFSGESVIPEGCIDLS 655
LIDH +VRRVL+DG GCIDL
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 656 VTFGQDKTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAAPSTLHQILKYSTPNGV 711
VT GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA PSTLHQ+LKYSTPN V
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEV 436
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022137317.1 | 1.4e-254 | 87.50 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022152854.1 | 1.4e-246 | 65.31 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022150760.1 | 1.3e-228 | 77.62 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022150613.1 | 1.6e-205 | 91.54 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
XP_022158414.1 | 8.3e-194 | 75.00 | uncharacterized protein LOC111024904 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 6.9e-255 | 87.50 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 6.9e-247 | 65.31 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9E1 | 6.5e-229 | 77.62 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1D9W7 | 7.8e-206 | 91.54 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DZB9 | 4.0e-194 | 75.00 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |