Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCCCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGAAGGCAATGCGCACGCAAATGCGGTCCATAGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCTCGATCTGAGAACCGAGTGGCGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAGGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAAGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGTAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGATGGAGGTCTTAAATGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGGATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGTGCCTTTCAGATCGCGCTTACTAGCCGCGCGCGATTGTGGTATCGGAGGCTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGTGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCAGCCATGTGCTATTTTCTCATCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCAACCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGTTGAGTATCAAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGAAATGGAAAAACTACTCAGGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGTTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCATGCGCGAGGCGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTTAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGTTGGTTGGGTTCTCTGGAGAATCGGTCATTCCAGAGAGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCAGGGAGAACAAACCGCTTCGAGGGAGTGTTATGCCTCTACACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGATGCTCGGGTTCGAGGCCGACCTGCCGAGGAGAGAGTTTGCCGCACCCACTGAGGAGCTCGAGATTGTTCCTCTGCTTAGTCCCGGGAAGCAAGTAAGCATAGGAACCGAGCTGGGGGCCACCGACAGAAAGGAGCTAATCCACTTCCTCAGATCGAACTCGGATGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGAAAATTATGACGCATCGCCTCAGCATAGATCCGTCATTCCGACCTGTAAAGCAAAAAAGAAGACCTATAAACAAGGAGTGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTGAATACATAAGAGTAATTTTGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCTTGGACGCCACAGCCGGGCACGAACTGCTCACTTTCATGGACGCCTACTCTGGGTATAACCAAATCAAGATGCATGTCCCAGATCAAGATCATACCGCATTCATAACAGACCAAGGTCTGTACTGTTACAAGGTCATGTCCTTCGGTTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGACCGGAATATGGAGGTGTATGTGGACGACAGGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGATCTGACCGAAGCCTTCGAGGTTCTGAAGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGTCGTGCTCGAGATGGAGGCACCTAAGACGCTGAAGCAGCTCCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAGTCCTACGAAAGAAAGGGCCGTTTGAATGGACAACGGAGTGCGAGCAAGCATTTCAGCAGTTGAAAAGTTACCTCTGCTCGGCACCTTTGCTCGCCAAACCCCTGCCAGGGGACAAGCTCCAGTTGTACTTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACCAGATACCCTCAAATGGAAAAGTTGGCTCTCGCTTTAGTCACATCGGCCCGACGGCTTAGACCATACTTCCAAGCCCATACTGTGGTGGTGCTCACTAACTTGCCCCTAAAAAACATCTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCGTGGACAATCTATGTCGACGGATCCTCCAATGAGAAGGGGTGTGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGTCTGCGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGCGATTCCCAGCTGGTTGTGAGCCAGATCAAGGAAGAGTACCAAGCCAAAGACTCCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGGGTTCCCCGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAATACCCTAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTAAAGAGGGCCTGTACGTCCTCAGAGAAATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGCGATCCAACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCATTTCCCTTTGGGCAAGGGCCAGACTAAGTTCGCTGTGGTTGCTGTGGATTACTTCACAAAGTGGGTCGAGGCAGAGGCGCTCTCCCACATAACGAAAGCCAGAGTCACGTCCTTCGTATGGACAAATATCATATGTCGCTTTGGTATACCGCAGGCCATTGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAATTTAAAGACTTTTGCAGCAAGCTTGGCATAAGTCACCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTAGAGGCAGTGAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGATTCTATGGTCGTACCGGACCACCCAAAGAGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTACAGCAAATGAGGAAGAGCTGCTCCTCAACTTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACACTACAACGCCCGCGTTCGACCTCGGACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTATGGTCAAGGGAATAGTCTGA
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCCCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGAAGGCAATGCGCACGCAAATGCGGTCCATAGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCTCGATCTGAGAACCGAGTGGCGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAGGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAAGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGTAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGATGGAGGTCTTAAATGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGGATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGTGCCTTTCAGATCGCGCTTACTAGCCGCGCGCGATTGTGGTATCGGAGGCTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGTGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCAGCCATGTGCTATTTTCTCATCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCAACCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGTTGAGTATCAAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGAAATGGAAAAACTACTCAGGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGTTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCATGCGCGAGGCGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTTAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGTTGGTTGGGTTCTCTGGAGAATCGGTCATTCCAGAGAGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCAGGGAGAACAAACCGCTTCGAGGGAGTGTTATGCCTCTACACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGATGCTCGGGTTCGAGGCCGACCTGCCGAGGAGAGAGTTTGCCGCACCCACTGAGGAGCTCGAGATTGTTCCTCTGCTTAGTCCCGGGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAATACCCTAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTAAAGAGGGCCTAGTAGAGCATTACGAGCCTACAGCAAATGAGGAAGAGCTGCTCCTCAACTTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACACTACAACGCCCGCGTTCGACCTCGGACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTATGGTCAAGGGAATAGTCTGA
Coding sequence (CDS)
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCCCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGAAGGCAATGCGCACGCAAATGCGGTCCATAGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCTCGATCTGAGAACCGAGTGGCGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAGGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAAGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGTAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGATGGAGGTCTTAAATGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGGATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGTGCCTTTCAGATCGCGCTTACTAGCCGCGCGCGATTGTGGTATCGGAGGCTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGTGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCAGCCATGTGCTATTTTCTCATCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCAACCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGTTGAGTATCAAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGAGTCTGAAATGGAAAAACTACTCAGGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGTTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCATGCGCGAGGCGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTTAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGACGAAAAGCCCGACACCGTTGGTTGGGTTCTCTGGAGAATCGGTCATTCCAGAGAGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCAGGGAGAACAAACCGCTTCGAGGGAGTGTTATGCCTCTACACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGATGCTCGGGTTCGAGGCCGACCTGCCGAGGAGAGAGTTTGCCGCACCCACTGAGGAGCTCGAGATTGTTCCTCTGCTTAGTCCCGGGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAATACCCTAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTAAAGAGGGCCTAGTAGAGCATTACGAGCCTACAGCAAATGAGGAAGAGCTGCTCCTCAACTTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACACTACAACGCCCGCGTTCGACCTCGGACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTATGGTCAAGGGAATAGTCTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMKAMRTQMRSIEEMYNEMILAAGAGSRSENRVARVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDLGGSPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSRARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEILTNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMREACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSSVCALETLAGRDGMLGFEADLPRREFAAPTEELEIVPLLSPGKQLASAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQYPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPKEGLVEHYEPTANEEELLLNFDLLEERRAMAQLRLAEYQGRMARHYNARVRPRTFQVGHLVLRKVQTHVGALDPTWEGPFMVKGIV
Homology
BLAST of Moc09g01470 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 973.0 bits (2514), Expect = 1.9e-279
Identity = 497/528 (94.13%), Postives = 503/528 (95.27%), Query Frame = 0
Query: 191 QAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDLGGSPFTSDVL 250
+AESSRNPATP GVITREEFDQLRGQLDAQ+E L AKCEQKEGPLNDGDLG SPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSRARLWYR 310
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALT ARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQ EQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERKI 430
HCSDDSAMCYFL GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEILTNIE 490
GRGRSGKDIE ADPKSKDKGSFSSGR EY+RAENGPTRSRPYERFTPTTIPI EILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
ES MEKLL+RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMREACII 610
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAA RE CII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFV 719
GWTRSQL KSPTPLVGFSGESVIPE IDLPVTLGQDQTQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc09g01470 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 954.1 bits (2465), Expect = 9.3e-274
Identity = 511/631 (80.98%), Postives = 525/631 (83.20%), Query Frame = 0
Query: 187 SSNQQAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDLGGSPFT 246
SSNQQAESS NPATP GVITREEFDQLRG+L+AQ+E L AKCEQKEGPLNDGDLG SPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSRAR 306
SDVLE APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT AR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQ 366
LW FQ +Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRP 426
LKVA SDDSAMCYFL GLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKT RP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEIL 486
ER I RGRSGKD EKAD KSKDKGSFSSGR E++RA NGPTRSRPYERFTPTTIPI EIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYF 546
TNIEES MEKLL+RPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMRE 606
KKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAA RE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 ACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 666
CIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAY 726
YLALGWTRSQL KS TPLVGFS ESVIPE CIDLPVTLG DQTQVTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG V+GEQ ASRECYAS LKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 AGRDGMLGFEADLPRREFAAPTEELEIVPLL 818
RDG L F+A+LPRREFAAPTEELE+VPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc09g01470 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 947.2 bits (2447), Expect = 1.1e-271
Identity = 523/791 (66.12%), Postives = 567/791 (71.68%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMKAMRTQMRSIEEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVARVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKRQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDL 240
AESS NP TP GVITREEFDQL+ + DAQ+E L A+CE+KE +DGDL
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 GGSPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA 300
G F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTSRARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT 360
LT ARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQVEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR 420
RF EQLKVAHCSDDSAMCYFL GLADE LTVKL EEAPATFAEVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKTNRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRVEYQRAENGPTRSRPYERFTPTT 480
TKT RPE+ I +GR+GKD KAD KS+DKG S SS RV+Y+R+ + +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPIFEILTNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIED 540
IPIFEILTNIEE+ MEKLL+RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL 600
LIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KEL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKEL 600
Query: 601 ARAAMREACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASA 660
AR A RE CIIREQRPT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASA
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 650
Query: 661 NILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVV 720
NILSL TYLALGWTRSQL KSPTPLVGFSGES+ E CIDLPV++ QD TQVTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 650
Query: 721 IDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSS 780
IDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTV+GE SRECYAS K SS
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 650
Query: 781 VCALETLAGRD 791
VCALE RD
Sbjct: 781 VCALEEQTIRD 650
BLAST of Moc09g01470 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 787.3 bits (2032), Expect = 1.5e-223
Identity = 402/446 (90.13%), Postives = 415/446 (93.05%), Query Frame = 0
Query: 378 MCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERKIGRGRSGK 437
MCYFL GLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 438 DIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEILTNIEESEMEKL 497
D+E DPKSKDKGSFS+GR EY+RAENGPTRSRPYERFTPTTIPI EILTNIEES MEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 498 LRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 557
L+RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 558 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMREACIIREQRPTC 617
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAA RE CIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 618 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 677
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 678 TKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHS 737
KSPTPLVGFSGESV+PE CIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 738 FRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSSVCALETLAGRDGMLGFEA 797
FRAIPSTLHQVLKYSTPNGVGTV+GEQTASRECYAS LKG+SVCALETL RDG L FEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 798 DLPRREFAAPTEELEIVPLLSPGKQL 824
DLP REFAAP EELE+VPLLS KQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc09g01470 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 768.5 bits (1983), Expect = 7.3e-218
Identity = 390/422 (92.42%), Postives = 399/422 (94.55%), Query Frame = 0
Query: 231 KEGPLNDGDLGGSPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD 290
K+ LNDGDLG S FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 291 AIKCRAFQIALTSRARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 350
AIKCRAFQIALT ARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 351 EGETLREYVTRFQVEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAK 410
EG TLREYVTRFQ EQLKVAHCSDDSAMCYFL GLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 411 KVIDGQELLRTKTNRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSR 470
KVIDGQELLRTKT RP+RKIGRGRSGKD+E+ADPKSKDKGSFSSGR EY+RAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 471 PYERFTPTTIPIFEILTNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 530
PYERFTPTTIPI EILTNIEES MEKLL+RPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 531 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 590
WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 591 QSGHKRKELARAAMREACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 650
QSGHKRKELARAA RE CIIREQ PTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 651 VL 653
VL
Sbjct: 464 VL 465
BLAST of Moc09g01470 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 973.0 bits (2514), Expect = 9.4e-280
Identity = 497/528 (94.13%), Postives = 503/528 (95.27%), Query Frame = 0
Query: 191 QAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDLGGSPFTSDVL 250
+AESSRNPATP GVITREEFDQLRGQLDAQ+E L AKCEQKEGPLNDGDLG SPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSRARLWYR 310
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALT ARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQ EQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERKI 430
HCSDDSAMCYFL GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT RPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEILTNIE 490
GRGRSGKDIE ADPKSKDKGSFSSGR EY+RAENGPTRSRPYERFTPTTIPI EILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
ES MEKLL+RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMREACII 610
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAA RE CII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFV 719
GWTRSQL KSPTPLVGFSGESVIPE IDLPVTLGQDQTQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc09g01470 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 954.1 bits (2465), Expect = 4.5e-274
Identity = 511/631 (80.98%), Postives = 525/631 (83.20%), Query Frame = 0
Query: 187 SSNQQAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDLGGSPFT 246
SSNQQAESS NPATP GVITREEFDQLRG+L+AQ+E L AKCEQKEGPLNDGDLG SPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSRAR 306
SDVLE APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT AR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQ 366
LW FQ +Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRP 426
LKVA SDDSAMCYFL GLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKT RP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEIL 486
ER I RGRSGKD EKAD KSKDKGSFSSGR E++RA NGPTRSRPYERFTPTTIPI EIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYF 546
TNIEES MEKLL+RPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMRE 606
KKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAA RE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 ACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 666
CIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAY 726
YLALGWTRSQL KS TPLVGFS ESVIPE CIDLPVTLG DQTQVTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG V+GEQ ASRECYAS LKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 AGRDGMLGFEADLPRREFAAPTEELEIVPLL 818
RDG L F+A+LPRREFAAPTEELE+VPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc09g01470 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 947.2 bits (2447), Expect = 5.5e-272
Identity = 523/791 (66.12%), Postives = 567/791 (71.68%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMKAMRTQMRSIEEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVARVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKRQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSRNPATPVGVITREEFDQLRGQLDAQMEVLNAKCEQKEGPLNDGDL 240
AESS NP TP GVITREEFDQL+ + DAQ+E L A+CE+KE +DGDL
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 GGSPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA 300
G F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTSRARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT 360
LT ARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQVEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR 420
RF EQLKVAHCSDDSAMCYFL GLADE LTVKL EEAPATFAEVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKTNRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRVEYQRAENGPTRSRPYERFTPTT 480
TKT RPE+ I +GR+GKD KAD KS+DKG S SS RV+Y+R+ + +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPIFEILTNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIED 540
IPIFEILTNIEE+ MEKLL+RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL 600
LIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KEL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKEL 600
Query: 601 ARAAMREACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASA 660
AR A RE CIIREQRPT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASA
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 650
Query: 661 NILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVV 720
NILSL TYLALGWTRSQL KSPTPLVGFSGES+ E CIDLPV++ QD TQVTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 650
Query: 721 IDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSS 780
IDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTV+GE SRECYAS K SS
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 650
Query: 781 VCALETLAGRD 791
VCALE RD
Sbjct: 781 VCALEEQTIRD 650
BLAST of Moc09g01470 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 787.3 bits (2032), Expect = 7.3e-224
Identity = 402/446 (90.13%), Postives = 415/446 (93.05%), Query Frame = 0
Query: 378 MCYFLIGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTNRPERKIGRGRSGK 437
MCYFL GLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 438 DIEKADPKSKDKGSFSSGRVEYQRAENGPTRSRPYERFTPTTIPIFEILTNIEESEMEKL 497
D+E DPKSKDKGSFS+GR EY+RAENGPTRSRPYERFTPTTIPI EILTNIEES MEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 498 LRRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 557
L+RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 558 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAMREACIIREQRPTC 617
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAA RE CIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 618 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 677
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 678 TKSPTPLVGFSGESVIPESCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHS 737
KSPTPLVGFSGESV+PE CIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 738 FRAIPSTLHQVLKYSTPNGVGTVQGEQTASRECYASTLKGSSVCALETLAGRDGMLGFEA 797
FRAIPSTLHQVLKYSTPNGVGTV+GEQTASRECYAS LKG+SVCALETL RDG L FEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 798 DLPRREFAAPTEELEIVPLLSPGKQL 824
DLP REFAAP EELE+VPLLS KQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc09g01470 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 768.5 bits (1983), Expect = 3.5e-218
Identity = 390/422 (92.42%), Postives = 399/422 (94.55%), Query Frame = 0
Query: 231 KEGPLNDGDLGGSPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD 290
K+ LNDGDLG S FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 291 AIKCRAFQIALTSRARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 350
AIKCRAFQIALT ARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 351 EGETLREYVTRFQVEQLKVAHCSDDSAMCYFLIGLADEALTVKLGEEAPATFAEVLQKAK 410
EG TLREYVTRFQ EQLKVAHCSDDSAMCYFL GLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 411 KVIDGQELLRTKTNRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRVEYQRAENGPTRSR 470
KVIDGQELLRTKT RP+RKIGRGRSGKD+E+ADPKSKDKGSFSSGR EY+RAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 471 PYERFTPTTIPIFEILTNIEESEMEKLLRRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 530
PYERFTPTTIPI EILTNIEES MEKLL+RPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 531 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 590
WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 591 QSGHKRKELARAAMREACIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 650
QSGHKRKELARAA RE CIIREQ PTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 651 VL 653
VL
Sbjct: 464 VL 465
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022137317.1 | 1.9e-279 | 94.13 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022150760.1 | 9.3e-274 | 80.98 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022152854.1 | 1.1e-271 | 66.12 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022152110.1 | 1.5e-223 | 90.13 | uncharacterized protein LOC111019899 [Momordica charantia] | [more] |
XP_022150613.1 | 7.3e-218 | 92.42 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 9.4e-280 | 94.13 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1D9E1 | 4.5e-274 | 80.98 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DHB3 | 5.5e-272 | 66.12 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DD03 | 7.3e-224 | 90.13 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1D9W7 | 3.5e-218 | 92.42 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
Match Name | E-value | Identity | Description | |