MS026401 (gene) Bitter gourd (TR) v1

Overview
Name: MS026401
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: FAD-binding Berberine family protein
Location: scaffold402: 999969 .. 1006895 (-)
RNA-Seq Expression: MS026401
Synteny: MS026401
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATTACTCCGCTCCATTAATCCCTCTTGCTCTTGCTTTCATTCTTGTAGCTTCATCTTCATCATGGGGTGCAGCTTCTGCTGATAAATATGAAGCCTTTCTTCAATGTCTTTCCCATCACTCCTCAGATGGTTACTCCATTTCCAAGGTCATTTACACTCCCTCCAACTCCTCCTATTATTCTATTTTGAACTTCTCCATTCGAAACTTCAAATTCTCAACTGTTGAAATTCCAAAACCACTCCTGATTGTAACACCATCACATGTTTCTCACATTCAAGCCTCCCTCATTTGTTGCAAAACTCACGGCTTTCAAATTCGAACTCGAAGCGGCGGCCATGACTACGAGGGTCTTTCTTACGTCGCCTACCTCCCATTCATCATCGTTGACCTCATAAATCTCAGGTCGGTCTCCGTTGATACCAAAAGTAACACTGCATGGGTTCAGTCTGGAGCAACTATCGGCGAACTCTATTATAAAATTGCTGAGAAAAGTCGAACCTGGACATTTCCAGCGGGTATTTGCCCCACAGTTGGGATTGGTGGGTATTTCAGTGGCGGCGGATATGGCTTGTTGCTGAGGAAATATGGTCTTGCAGCTGATAATGTGATTGACGCTTATTTGGTTGATGCTAATGGGAAGTTTCACGATAGAGAGTCGATGGGGGAAGATTTATTTTGGGCCATTAGAGGGGGCGGTGGAGGGAGCTTTGGAATAGTGGTGTCGTGGAAGGTGAAGCTGGTTCCGGTGCCCGTGACTGTGACATTTTGCTCAACTAACAGAACTTTGGAGGAAGGTGCAGTGAAGCTAATCCAGCGGTGGCAATATGTGGCTAGCAAATTGGATGAAAATCTATTCCTTGGCATCTTTTGGACTGGTATTCTTTCAACGCTTTTCATGAATTATATTGTTTGTTATTTATTTATTTGAACACCAAATATAATAGTCTAACAATAGTTTAATTGGATAAGGTATTTATCTTTAACAAGCAAAGGAAATATATACACTTAATATAATTTTCAAGAAAAAAAGATATTATTTTCTTACTATTAAGTGGGTGACCTTAAGGAAGACTTAGTAACAAACGCTTGGAACCTTGAGAGATGCCACCTACCCTGAAGTTTGAAGTTCGAACCCACAACGATTATAGAGATTGACTAGCATAAGCTCGGTGTAAAATTAGACTTTACTTTGAAACTTTTCAAACCAATCATCAAACTTGAAAAAGGGTAAATTCAATACCATCTAATTGTTTTTTTTTTTTTTTTCCTTTGGGTGGGTTTGGGGTGTTTTTATGATTGGATTTTGAGGGTTTCTTGATTACACTTTTAATGACTGCTAAGTTAGATTTTTTGACCATTTTCTATCTTAGGTTTGAGGGGATTAATTAACCACTTATTTGCATTAAAATGTTTGGTTTGATGAAATTAAAACTTCAAAATTAAAAACATGTTTAATTTATAATTTTCACCTCTAAAATTTGAACCCTCTATATTTAATCTTCATGTGTTTAGGGGCTGAAATACAAACTTGAACTTTAAAAACAATGCACCCAATAGTGTAAGAAATGAGTTGTTATAGATAGTAACGTTCTTGTTTCTTATTCTCAGATCTCGTTTCTAATTCTTTTAAAGACGTAAGTATTTGATAGTCGTTTCTATTTCTCGTTTCTCATTTTTAAGAAACAAGAAATACAAACGTTACAAAACAAGTTCTACATTTTCCTAAAAACAATCACATCCCAAATCATTATGAACATTAAATCTAAATGATCTAAATAACTCAATTCTTGATGATAGGTGGTAATGGTTCACGTCAAGGAGGCAAAACGAACCCAACAGCTCTATTCTTCTCTTTATTTCTTGGAAAGGCTGATGAGGCTGTGGCAATTCTGAACACAACTTTTCCTGAGTTGGGTTTGGTAAAGAAAGATTGCACAGAAGTAAGCTGGATTGAGTCAGCTGCCATCGCTGCCAATGGGTTCCAAAATGGAGAAG
AAGCCATAGAATTGGAAACTTTGCTCAATAGACCTCTCACCAATATAAGCCTTAAAGTCAAATCTGACTTCGTCAAGGAACCCCTTTCAGAACTTGCAATCCAGGGTATATGGGAAAGATTAAACCGTCAAGACATAGAACTGCCACAAATTCTATTCGTTCCCTACGGAGGGAGAATGAGTCAAATTTCTGAGTCTGAAACTCCTTTCCCGCACAGAGCTGGAAATTTGTACAAGATTGGCTACTTTCTCAGATGGGAAGAACAAAGCGTCGATGCAGAGAAAATGCATCTGAATTGGATACAAGAGCTTTACAGTTATATGACTCCCTTTGTTTCAAAATCGCCCAGAACTGCGTATGTCAACTATAGAGACCTTGACATTGGATCGAACAACAAATATGGAAAGACAAGCTACAAGCAGGCAAGCGTGTGGGGTTTAAAATATTTTAGTAACAATTTTAATAGGTTGGTACGTGTTAAGACCAAGGTGGATCCTTACGATTTCTTTAGGCATGAGCAAAGCATACCCACCCTCGATAGAATTTAGTAAAATAAACAAACAATTTGGTATTGTCTCTACGTGTATTTAAACTAGACACCGCAATCTATCATTGTTTGTAGCAAAATAAGGGGTGTGACTTCTATTTGTGTGTAGTTTAGTAAAGTCATGATGGAGTTTATTTTTTCTTTCTTTTTTGTGTGTGGTTTAGTAAAGTCATGTGATGGGTTTAGTAATTCCTCCAAAAGTTCTGAAGTAGAGAGGATCGAACTCACACAATTTTGGTTGGATGTAATGCCTTAGTCAGTTGAACTATACTCAGATTGGTTGATATTTCTCCATATTTTTTCTACATTTACATTTACTTGATATTTTAACATCTTGACATATTTCCAACAAATTATATAATTTAATTAAAATAATTAAGTTCAGCCACCTTATAATTTAATTTTTTTTTTTACAAAAGCCAACTTATTTAATTAATATATATATATATACATTTTTTTGAAATAGAAATAAAATAAAACGGTTTGGAAAGAAGAAAAATTAAAAAATGATAAATAGTTGATATTAATACATTTATTTATTAGTATTTGACGGGAGAGAAAAGAGAGAGGAGAGAGGATATATTGTTTTTACGCGAAGTGTGAGGTATGATAAAGTAAACAGAGAGAAGTCGAGATGATCATCGCTCAATAGTAATTGACATATATCTCCAACCAATAAATTTGTGAGTTCGATCCGTCTCACCTCCATATTGTTGTACTTAATAAAAAAAAAGTAAATGATGAGAATAAAGTAGAGATTGAGTATTTATTTTATCTTTCTATTTCTATTAACAACTCAGCTTCAACCACTCTATTTGAAGTTTTTTTTGTAAATCAGTGTTCAAAGCTTTGAAATTTCTACAAAAATACTACATAAGTCTTTGTTTTTTTTTTTTTAATGCCCAAAGTCAATAAATTTTCAATAAAATAAATTAATTCGTGAAGATTTGATGTATCCCTTTGTTTAGTTGGACAAGCAAATGCATACGATTTTAAAAATGAGATAAAAATCATATAAGAACTCATAAACGTTTCTTCTCAATGCATTTTAAAATACTTAGGTATTTATACTTAGGTATTTATCCATCTGATAGCACATATCTTTTTTACTGACCCTCATTAAATTTTTTAATAAATAGGAAAGAGAAATGCAGAAAAAAAATATATAGAAAAACAAAAAGTCTCAATGCTCATTAAATTTTTTAATAAATAGGAAAGAAAAATGAGAAAATATATATATAGAAAAACAAAAAAGTCTCAATTTTTTTTTTNNNNNNNNNNNNNNNNNNNNTTGGTGGATGCTCACAAAAAAAATTTACTATATATTTAGTTTACTCGGTCAAATTGAATTTTAAAGTTCAAAAGGAGTGGGATCAATTTTTGTTAAATAAATATATAATTTCGTCAAAGGAGGATAAACATATCAGCCAATAATGCAACATAAATTAAATTC
TTTTCTTTCTCTTTCAACATTAATCCTTCCTACCAAAACACCATTAAATAGTGGAGCTTGCAATCACAAAATCTCAATAACCCAACCATGAACAAGTATTACTCTTCTCCATTAATCCCTCTTTTTCTTGCCTTCATTTTTCTATCTTCATCTCCATTCTGGACCTTAGCGGTTGCTGACAAACATGAATCCTTTCTTCAGTGTCTCTTCAATCACTCTCCCGATGGTAATTACTCTATTTCTAAAGTCATACACACTCCAATCAACTCCTCTTATTCTTCTGTTTTAGACTTCTCCATTCGAAACCTTAGATTCTCGACGGCTGAAACCCCAAAGCCACTCCTCATCATAACACCATCACATGTTTCCCACATCCAAGCAGCCGTTGTTTGCTCCAAACGCTATGGCTTTCAAATCCGAACTCGAAGCGGTGGCCATGATTTTGAGGGACTTTCCTACGTAGCCCATCTCCCATTCATCATAGTTGACCTTATAAATCTCAGGTCCATCTCTGTTGATGTCAAAAACAACACTGCATGGGTTCATTCTGGAGCAACTCTAGGTGAACTTTACTATAGTATTGCTCAAAAAAGTCGAACCTTGGCGTTTCCAGCAGGTGTTTGCCCCACGGTTGGAGTTGGTGGACACTTCAGTGGTGGTGGATATGGATTGTTGCTGAGGAAATATGGTCTTGCTGCTGATAATGTGATTGATGCTTACTTGGTTGATGCGAATGGGAAGTTTCACGATAGAGAGTCGATGGGGGAAGATTTGTTTTGGGCCATTAGAGGGGGTGGTGGAGGGAGCTTTGGAATCGTGGTGGCGTGGAAGGTGAAGCTGGTACCAGTGCCCGCGACTGTGACACTTTGCCAAACTAACAGAACTTTGGAGGAAGGTGCAGTCAAGCTAACCCATCGGTGGCAATATGTGGCCAACAAATTGGATGAAAATCTATTCCTAGGCATCCTTTGGCTTGGTATTCTTTCAACTTTTTCCCTTTTTGAGTTATTTGTTACTTATTTATTCCTTTTCATTGTGAGGTCACATTGAGTAGAAAAGTGAATGAACAAACACAAAATCTAAAAAGAGGGATTTTGGGACTATGCAAGTAAATAGTTAAAAAATGCTTAAAAAAATACAAAAGAAAAAAAGAAAAAAAAAGTATACATTTGTTTTCAGTGGCTTAATAATATAAACTTCAAAATTAAGCCTGCTTGGCTTGAAACAATTTTATGTTTTGATTTAGAAAAATTTTCTAAGTACGAAATCTTTGTACAAATCCTTTCATAAATAAATATTAAAATGGAAAGTCACCACGCAATGTCAAATAATTTTAAAATAATAATAATAAATAAAGAAAATTACATTTGCTTTTTACCCATTCATTGATTGAGTAGAATTAAGGACACGCAATTTTAATTTTATATTAAAATAAACATTAAATTGACTCATAGACTTTGATTTTTGACAATTATCTATCTTATGTTTGAGTAAGGTATTAGTTCACAACTTAAATTAAGACTCATAGACTATGATTTTTGGACAATTGTCTATCTTATGTGTTTGAGAAGGGTATTAATTATTGATTCACAACTTATTTAGATTCAATCTTTCATGTAATTTAAAAAGTTAAATTACAATTTTTTATTTTGTTGAGTACAATTAAATTAAAAACTTAGTCTATGAACTTCTAAAATTGTATATAGGTTGATAAGTTTTTTAACATTGAAAAGTGTCGAATATGTCTGATCCCTAAAATTTCAATTTTGTGTTAAACAAAGTCCCTAAACTTTGAAAGTGTATAATAGGTCTTTTAAATTTCAATTTGTGTGTCCATAGATCCTTGACTTATTAAACCGTTTTTAAAATTCATGAACCTCTTAGAAACATAAAGTTAAATTTTATCTTGAATTGGACCTTGACTTCTCGATTTAGAGATTTGTAAGTCTATGAATTTTAAGAAATGCCTAATAAATTTAGGAACTAAACTTACGATAACT
CAACCAAATTAAAGTTTAAAGCTAAAATAAACTTAAAGACTAAAAAGGGAAAACATAAACCTTAAAAACAAGTCTCAATCCATCACATTATACAGAAGTATGAAGTTGGGTTTTTCCCTTTTAAAAAATCTCATATATCAAATCACTACAAACACACAAATTTGGATGATTTGAACTCAACTCTTGATGGTAGGTGGGAATATTACAAGTCAAGGAGGAGGCAAAACAAACCCAGTAGCTACATTCTTCTCTTTGTTTCTTGGCCAGGCAGATGAGCTTCTGACAATCTTGAACACAAAATTTCCTGAGTTGGATTTGGCAAAGAAAGACTGTATAGAAACGAGCTGGATTGAATCGACTGTCCTTATGGGCATCGGATTTCAAACTAAGGTGACCTTGGAAGCTCTGCTAAGTAGAACACCTCTCACCAATATGAGCACAAAAATCAAATCTGACTATGTCAAGGAACCCATTTCTGAAGCTACAATCCAGGGCATAGGGGAGAGATTAAACGCTCAAGATATAGAAAGCGGAAACCTTATATTTGTTCCCTATGGTGGGAGAATGAGCCAGATTTCCGAGTCGGAAACTCCTTTCTCACATAGAGCTGGATATTTGTACAAGATTGGCTACATCGCCTCATGGTTAGACCAAAGCATTGATACTGAGAAAAGGCACCTGAGTTGGATACGAGAGCTTTACAGTTACATGGCTCCTTTCGTTTCAAAATCGCCGAGGGCTGCATATGCCAATTACAGAGATCTTGATATTGGATCAAATAAGAGGTATGGAAAGACAAGCTACAAGCAAGCCAGCACGTGGGGGTTCAAGTATTTTGGGAATAATTTTAACAGGTTGGTGCATGTTAAGACCAAGGTTGATCCTTACGATTTTTTTAGGCATGAGCAAAGCATACCCACCCTCTGA

mRNA sequence

ATGAATTACTCCGCTCCATTAATCCCTCTTGCTCTTGCTTTCATTCTTGTAGCTTCATCTTCATCATGGGGTGCAGCTTCTGCTGATAAATATGAAGCCTTTCTTCAATGTCTTTCCCATCACTCCTCAGATGGTTACTCCATTTCCAAGGTCATTTACACTCCCTCCAACTCCTCCTATTATTCTATTTTGAACTTCTCCATTCGAAACTTCAAATTCTCAACTGTTGAAATTCCAAAACCACTCCTGATTGTAACACCATCACATGTTTCTCACATTCAAGCCTCCCTCATTTGTTGCAAAACTCACGGCTTTCAAATTCGAACTCGAAGCGGCGGCCATGACTACGAGGGTCTTTCTTACGTCGCCTACCTCCCATTCATCATCGTTGACCTCATAAATCTCAGGTCGGTCTCCGTTGATACCAAAAGTAACACTGCATGGGTTCAGTCTGGAGCAACTATCGGCGAACTCTATTATAAAATTGCTGAGAAAAGTCGAACCTGGACATTTCCAGCGGGTATTTGCCCCACAGTTGGGATTGGTGGGTATTTCAGTGGCGGCGGATATGGCTTGTTGCTGAGGAAATATGGTCTTGCAGCTGATAATGTGATTGACGCTTATTTGGTTGATGCTAATGGGAAGTTTCACGATAGAGAGTCGATGGGGGAAGATTTATTTTGGGCCATTAGAGGGGGCGGTGGAGGGAGCTTTGGAATAGTGGTGTCGTGGAAGGTGAAGCTGGTTCCGGTGCCCGTGACTGTGACATTTTGCTCAACTAACAGAACTTTGGAGGAAGGTGCAGTGAAGCTAATCCAGCGGTGGCAATATGTGGCTAGCAAATTGGATGAAAATCTATTCCTTGGCATCTTTTGGACTGGTGGTAATGGTTCACGTCAAGGAGGCAAAACGAACCCAACAGCTCTATTCTTCTCTTTATTTCTTGGAAAGGCTGATGAGGCTGTGGCAATTCTGAACACAACTTTTCCTGAGTTGGGTTTGGTAAAGAAAGATTGCACAGAAGTAAGCTGGATTGAGTCAGCTGCCATCGCTGCCAATGGGTTCCAAAATGGAGAAGAAGCCATAGAATTGGAAACTTTGCTCAATAGACCTCTCACCAATATAAGCCTTAAAGTCAAATCTGACTTCGTCAAGGAACCCCTTTCAGAACTTGCAATCCAGGGTATATGGGAAAGATTAAACCGTCAAGACATAGAACTGCCACAAATTCTATTCGTTCCCTACGGAGGGAGAATGAGTCAAATTTCTGAGTCTGAAACTCCTTTCCCGCACAGAGCTGGAAATTTGTACAAGATTGGCTACTTTCTCAGATGGGAAGAACAAAGCGTCGATGCAGAGAAAATGCATCTGAATTGGATACAAGAGCTTTACAGTTATATGACTCCCTTTGTTTCAAAATCGCCCAGAACTGCGTATGTCAACTATAGAGACCTTGACATTGGATCGAACAACAAATATGGAAAGACAAGCTACAAGCAGGCAAGCGTGTGGGGTTTAAAATATTTTAGTAACAATTTTAATAGGTTGTGTCTCTTCAATCACTCTCCCGATGGTAATTACTCTATTTCTAAAGTCATACACACTCCAATCAACTCCTCTTATTCTTCTGTTTTAGACTTCTCCATTCGAAACCTTAGATTCTCGACGGCTGAAACCCCAAAGCCACTCCTCATCATAACACCATCACATGTTTCCCACATCCAAGCAGCCGTTGTTTGCTCCAAACGCTATGGCTTTCAAATCCGAACTCGAAGCGGTGGCCATGATTTTGAGGGACTTTCCTACGTAGCCCATCTCCCATTCATCATAGTTGACCTTATAAATCTCAGGTCCATCTCTGTTGATGTCAAAAACAACACTGCATGGGTTCATTCTGGAGCAACTCTAGGTGAACTTTACTATAGTATTGCTCAAAAAAGTCGAACCTTGGCGTTTCCAGCAGGTGTTTGCCCCACGGTTGGAGTTGGTGGACACTTCAG
TGGTGGTGGATATGGATTGTTGCTGAGGAAATATGGTCTTGCTGCTGATAATGTGATTGATGCTTACTTGGTTGATGCGAATGGGAAGTTTCACGATAGAGAGTCGATGGGGGAAGATTTGTTTTGGGCCATTAGAGGGGGTGGTGGAGGGAGCTTTGGAATCGTGGTGGCGTGGAAGGTGAAGCTGGTACCAGTGCCCGCGACTGTGACACTTTGCCAAACTAACAGAACTTTGGAGGAAGATGAGCTTCTGACAATCTTGAACACAAAATTTCCTGAGTTGGATTTGGCAAAGAAAGACTGTATAGAAACGAGCTGGATTGAATCGACTGTCCTTATGGGCATCGGATTTCAAACTAAGGTGACCTTGGAAGCTCTGCTAAGTAGAACACCTCTCACCAATATGAGCACAAAAATCAAATCTGACTATGTCAAGGAACCCATTTCTGAAGCTACAATCCAGGGCATAGGGGAGAGATTAAACGCTCAAGATATAGAAAGCGGAAACCTTATATTTGTTCCCTATGGTGGGAGAATGAGCCAGATTTCCGAGTCGGAAACTCCTTTCTCACATAGAGCTGGATATTTGTACAAGATTGGCTACATCGCCTCATGGTTAGACCAAAGCATTGATACTGAGAAAAGGCACCTGAGTTGGATACGAGAGCTTTACAGTTACATGGCTCCTTTCGTTTCAAAATCGCCGAGGGCTGCATATGCCAATTACAGAGATCTTGATATTGGATCAAATAAGAGGTATGGAAAGACAAGCTACAAGCAAGCCAGCACGTGGGGGTTCAAGTATTTTGGGAATAATTTTAACAGGTTGGTGCATGTTAAGACCAAGGTTGATCCTTACGATTTTTTTAGGCATGAGCAAAGCATACCCACCCTCTGA

Coding sequence (CDS)

ATGAATTACTCCGCTCCATTAATCCCTCTTGCTCTTGCTTTCATTCTTGTAGCTTCATCTTCATCATGGGGTGCAGCTTCTGCTGATAAATATGAAGCCTTTCTTCAATGTCTTTCCCATCACTCCTCAGATGGTTACTCCATTTCCAAGGTCATTTACACTCCCTCCAACTCCTCCTATTATTCTATTTTGAACTTCTCCATTCGAAACTTCAAATTCTCAACTGTTGAAATTCCAAAACCACTCCTGATTGTAACACCATCACATGTTTCTCACATTCAAGCCTCCCTCATTTGTTGCAAAACTCACGGCTTTCAAATTCGAACTCGAAGCGGCGGCCATGACTACGAGGGTCTTTCTTACGTCGCCTACCTCCCATTCATCATCGTTGACCTCATAAATCTCAGGTCGGTCTCCGTTGATACCAAAAGTAACACTGCATGGGTTCAGTCTGGAGCAACTATCGGCGAACTCTATTATAAAATTGCTGAGAAAAGTCGAACCTGGACATTTCCAGCGGGTATTTGCCCCACAGTTGGGATTGGTGGGTATTTCAGTGGCGGCGGATATGGCTTGTTGCTGAGGAAATATGGTCTTGCAGCTGATAATGTGATTGACGCTTATTTGGTTGATGCTAATGGGAAGTTTCACGATAGAGAGTCGATGGGGGAAGATTTATTTTGGGCCATTAGAGGGGGCGGTGGAGGGAGCTTTGGAATAGTGGTGTCGTGGAAGGTGAAGCTGGTTCCGGTGCCCGTGACTGTGACATTTTGCTCAACTAACAGAACTTTGGAGGAAGGTGCAGTGAAGCTAATCCAGCGGTGGCAATATGTGGCTAGCAAATTGGATGAAAATCTATTCCTTGGCATCTTTTGGACTGGTGGTAATGGTTCACGTCAAGGAGGCAAAACGAACCCAACAGCTCTATTCTTCTCTTTATTTCTTGGAAAGGCTGATGAGGCTGTGGCAATTCTGAACACAACTTTTCCTGAGTTGGGTTTGGTAAAGAAAGATTGCACAGAAGTAAGCTGGATTGAGTCAGCTGCCATCGCTGCCAATGGGTTCCAAAATGGAGAAGAAGCCATAGAATTGGAAACTTTGCTCAATAGACCTCTCACCAATATAAGCCTTAAAGTCAAATCTGACTTCGTCAAGGAACCCCTTTCAGAACTTGCAATCCAGGGTATATGGGAAAGATTAAACCGTCAAGACATAGAACTGCCACAAATTCTATTCGTTCCCTACGGAGGGAGAATGAGTCAAATTTCTGAGTCTGAAACTCCTTTCCCGCACAGAGCTGGAAATTTGTACAAGATTGGCTACTTTCTCAGATGGGAAGAACAAAGCGTCGATGCAGAGAAAATGCATCTGAATTGGATACAAGAGCTTTACAGTTATATGACTCCCTTTGTTTCAAAATCGCCCAGAACTGCGTATGTCAACTATAGAGACCTTGACATTGGATCGAACAACAAATATGGAAAGACAAGCTACAAGCAGGCAAGCGTGTGGGGTTTAAAATATTTTAGTAACAATTTTAATAGGTTGTGTCTCTTCAATCACTCTCCCGATGGTAATTACTCTATTTCTAAAGTCATACACACTCCAATCAACTCCTCTTATTCTTCTGTTTTAGACTTCTCCATTCGAAACCTTAGATTCTCGACGGCTGAAACCCCAAAGCCACTCCTCATCATAACACCATCACATGTTTCCCACATCCAAGCAGCCGTTGTTTGCTCCAAACGCTATGGCTTTCAAATCCGAACTCGAAGCGGTGGCCATGATTTTGAGGGACTTTCCTACGTAGCCCATCTCCCATTCATCATAGTTGACCTTATAAATCTCAGGTCCATCTCTGTTGATGTCAAAAACAACACTGCATGGGTTCATTCTGGAGCAACTCTAGGTGAACTTTACTATAGTATTGCTCAAAAAAGTCGAACCTTGGCGTTTCCAGCAGGTGTTTGCCCCACGGTTGGAGTTGGTGGACACTTCAG
TGGTGGTGGATATGGATTGTTGCTGAGGAAATATGGTCTTGCTGCTGATAATGTGATTGATGCTTACTTGGTTGATGCGAATGGGAAGTTTCACGATAGAGAGTCGATGGGGGAAGATTTGTTTTGGGCCATTAGAGGGGGTGGTGGAGGGAGCTTTGGAATCGTGGTGGCGTGGAAGGTGAAGCTGGTACCAGTGCCCGCGACTGTGACACTTTGCCAAACTAACAGAACTTTGGAGGAAGATGAGCTTCTGACAATCTTGAACACAAAATTTCCTGAGTTGGATTTGGCAAAGAAAGACTGTATAGAAACGAGCTGGATTGAATCGACTGTCCTTATGGGCATCGGATTTCAAACTAAGGTGACCTTGGAAGCTCTGCTAAGTAGAACACCTCTCACCAATATGAGCACAAAAATCAAATCTGACTATGTCAAGGAACCCATTTCTGAAGCTACAATCCAGGGCATAGGGGAGAGATTAAACGCTCAAGATATAGAAAGCGGAAACCTTATATTTGTTCCCTATGGTGGGAGAATGAGCCAGATTTCCGAGTCGGAAACTCCTTTCTCACATAGAGCTGGATATTTGTACAAGATTGGCTACATCGCCTCATGGTTAGACCAAAGCATTGATACTGAGAAAAGGCACCTGAGTTGGATACGAGAGCTTTACAGTTACATGGCTCCTTTCGTTTCAAAATCGCCGAGGGCTGCATATGCCAATTACAGAGATCTTGATATTGGATCAAATAAGAGGTATGGAAAGACAAGCTACAAGCAAGCCAGCACGTGGGGGTTCAAGTATTTTGGGAATAATTTTAACAGGTTGGTGCATGTTAAGACCAAGGTTGATCCTTACGATTTTTTTAGGCATGAGCAAAGCATACCCACCCTCTGA

Protein sequence

MNYSAPLIPLALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPDGNYSISKVIHTPINSSYSSVLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYVAHLPFIIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVGGHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVAWKVKLVPVPATVTLCQTNRTLEEDELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEALLSRTPLTNMSTKIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISESETPFSHRAGYLYKIGYIASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRDLDIGSNKRYGKTSYKQASTWGFKYFGNNFNRLVHVKTKVDPYDFFRHEQSIPTL
Homology
BLAST of MS026401 vs. NCBI nr
Match: KAA8519304.1 (hypothetical protein F0562_013560 [Nyssa sinensis])

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 572/1006 (56.86%), Positives = 715/1006 (71.07%), Query Frame = 0

Query: 10  LALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIR 69
           L+ AF+ +  S SW  ASAD +E FL CLS HS +  +ISKVIYTP+N SY S+L FSIR
Sbjct: 9   LSFAFVFL-FSISW-VASADTHEDFLHCLSLHSENSAAISKVIYTPNNPSYLSVLQFSIR 68

Query: 70  NFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFII 129
           N +F+    PKPL+IVTP H S IQA++ C K HG QIR RSGGHDYEGLSYV+ +PF+I
Sbjct: 69  NLRFARPTTPKPLVIVTPLHESQIQAAIYCSKEHGMQIRVRSGGHDYEGLSYVSEVPFVI 128

Query: 130 VDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGG 189
           VDLINLRS++VD +++TAWVQ+GAT+GELYY+IAEKS+T  F AG+CPTVG+GG+FSGGG
Sbjct: 129 VDLINLRSITVDIENSTAWVQAGATLGELYYRIAEKSKTIGFTAGVCPTVGVGGHFSGGG 188

Query: 190 YGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLV 249
           YG++ RK+G+A DN+IDA+L+D NG+  DRESMGEDLFWAIRGGGG SFG++++WK+KL+
Sbjct: 189 YGMMSRKHGIAVDNIIDAHLIDVNGRILDRESMGEDLFWAIRGGGGASFGVILAWKIKLI 248

Query: 250 PVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTAL 309
            VP  VT  + NRTLE+ A +L+ RWQY+A K DENL + +F    N S Q GK    A 
Sbjct: 249 VVPEKVTVFTVNRTLEQNATELVHRWQYIADKFDENLLMRVFIRRANSS-QDGKRTLQAS 308

Query: 310 FFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLN 369
           F SLFLG+ D  + ++  +FPEL L K DC E+SWIES    A GF +GE    L+ LLN
Sbjct: 309 FTSLFLGEVDTLLPLMQKSFPELRLAKDDCIEMSWIESILYFA-GFPSGE---SLDVLLN 368

Query: 370 RPLT-NISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETP 429
           R     I  K KSD+VK+P+SE  ++G WE L  + +E  + L+ PYGGR+S+ISESETP
Sbjct: 369 RTSQGGIYFKGKSDYVKQPISEKGLKGSWEMLYDEKLEGVEFLYSPYGGRLSEISESETP 428

Query: 430 FPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIG 489
           FPHR+GN+Y I Y + W E  +   + ++NWI+ +YSYMTPFVSKSPR AY NYRDLD+G
Sbjct: 429 FPHRSGNIYNIHYIVFWGEADIATSEWNINWIRRVYSYMTPFVSKSPRAAYFNYRDLDLG 488

Query: 490 SNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPDGNYSISKVIHTPINSSYSSVLDFS 549
            NNK G TSY QAS+WG+KYF NNFNRL                              FS
Sbjct: 489 VNNK-GNTSYTQASIWGVKYFKNNFNRL------------------------------FS 548

Query: 550 IRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYVAHLPF 609
           IRNLRF+   TPKPL+I+TP H S IQAA+ CSK +G QIR RSGGHD+EGLSYV+ +PF
Sbjct: 549 IRNLRFARPTTPKPLVIVTPLHESQIQAAIYCSKEHGMQIRVRSGGHDYEGLSYVSEVPF 608

Query: 610 IIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVGGHFSG 669
           +IVDLINLRSI+VD++N+TAWV +GAT+GELYY IA+KS+TL F AGVCPTVGVGGHFSG
Sbjct: 609 VIVDLINLRSITVDIENSTAWVQAGATIGELYYRIAEKSKTLGFTAGVCPTVGVGGHFSG 668

Query: 670 GGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVAWKVK 729
           GGYG++ RK+G+AADN+IDA+L+D NG+  DRESMGEDLFWAIRGGGG SFG+++AWK+K
Sbjct: 669 GGYGMMSRKHGIAADNIIDAHLIDVNGRILDRESMGEDLFWAIRGGGGASFGVILAWKIK 728

Query: 730 LVPVPATVTLCQTNRTLEE----------------------------------------- 789
           L+ VP  VT+   NRTLE+                                         
Sbjct: 729 LIVVPEKVTVFTVNRTLEQNATALVHRWQYIADKFDENLLMRVFIRRANSSQDGKRTIQA 788

Query: 790 ----------DELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEALLSRT 849
                     D LL ++   FPEL LAK DCIE SWIES +L   GF +  +L+ LL+RT
Sbjct: 789 SFTSLFLGEVDTLLPLMQKSFPELRLAKDDCIEMSWIES-ILYFAGFPSGESLDVLLNRT 848

Query: 850 PLTNMSTKIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISESETPFS 909
              +   K KSDYVK+PISE  ++G  E L  + +E+   ++ PYGGR+S+ISESETPF 
Sbjct: 849 SQGSAYFKGKSDYVKQPISEKGLKGSWEMLYDEKLEAVEFLYSPYGGRLSEISESETPFP 908

Query: 910 HRAGYLYKIGYIASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRDLDIGSN 964
           HRAG +Y I Y+ +W +  I T + +++WIR +YSYM PFVSKSPRAAY NYRDLD+G N
Sbjct: 909 HRAGNIYNIHYVVAWGEADIATSEWNINWIRRVYSYMTPFVSKSPRAAYFNYRDLDLGVN 968
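
The identity and positive percentages reported for each HSP follow directly from the residue counts on its summary line. The sketch below is a minimal illustration, not part of the database's own tooling: the function name `parse_hsp_stats` and the returned field names are hypothetical, and the regular expression assumes the summary-line layout shown on this page.

```python
import re

# Matches the HSP summary line as laid out on this page (assumed format).
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract residue counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, aligned, reported_id, positives, aligned_pos, reported_pos = match.groups()
    identical, aligned = int(identical), int(aligned)
    positives, aligned_pos = int(positives), int(aligned_pos)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100.0 * identical / aligned, 2),
        "positives_pct": round(100.0 * positives / aligned_pos, 2),
        # Percentages exactly as printed on the summary line.
        "reported_identity_pct": float(reported_id),
        "reported_positives_pct": float(reported_pos),
    }


if __name__ == "__main__":
    line = "Identity = 572/1006 (56.86%), Positives = 715/1006 (71.07%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Comparing the recomputed values with the reported ones is a quick consistency check when these summary lines are copied out of the page by hand.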

BLAST of MS026401 vs. NCBI nr
Match: RHN62809.1 (putative tetrahydroberberine oxidase [Medicago truncatula])

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 571/1076 (53.07%), Positives = 716/1076 (66.54%), Query Frame = 0

Query: 10   LALAFILVA---SSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNF 69
            L L  +L+A   S +S+   ++   + FLQCL  +S +  SISKV+YT +NSSY SIL F
Sbjct: 6    LYLTIVLIAIAFSFTSFAIDTSPHEDNFLQCLYSYSHNITSISKVVYTKTNSSYSSILKF 65

Query: 70   SIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLP 129
            SI+N +F+T E PKPL+I+TP+ +SHIQ ++IC + HG QIR RSGGHD+EGLS+V+ +P
Sbjct: 66   SIQNLRFATNETPKPLVIITPTQISHIQTAIICSQHHGMQIRIRSGGHDFEGLSFVSNVP 125

Query: 130  FIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFS 189
            F+I+DL N R + VD ++ TAWVQSGAT+GELYYKIA+KS+T  FP G+CPTVG+GG+FS
Sbjct: 126  FVIIDLTNFRGIDVDVENRTAWVQSGATLGELYYKIAQKSKTLGFPGGVCPTVGVGGHFS 185

Query: 190  GGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKV 249
            GGGYG LLRKYGLAADNVIDA+++D  G+F DRE+MGEDLFWAIRGGGG SFG++VSWK+
Sbjct: 186  GGGYGTLLRKYGLAADNVIDAHIIDVKGRFLDREAMGEDLFWAIRGGGGASFGVIVSWKI 245

Query: 250  KLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFW--TGGNGSRQG-GK 309
            KLV VP TVT  +  RTLE+ A KL+ +WQ+VA KL+ENL + I       N S+QG  K
Sbjct: 246  KLVQVPSTVTVFTVPRTLEQNATKLVHKWQFVAHKLEENLAINIILQRLDLNSSKQGEPK 305

Query: 310  TNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIE 369
            +   ALF SLFLG  D  + ++   FPELGLV++DC E+SWIES       F  GE    
Sbjct: 306  STVLALFQSLFLGSVDNLLPLMEEKFPELGLVREDCVEMSWIESVLYLFR-FPEGE---P 365

Query: 370  LETLLNRPL-TNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQI 429
            LETLLNR L    + K KSDFVK P+ E  ++G+W   +    E   ++  PYGG M +I
Sbjct: 366  LETLLNRTLAAKDNSKAKSDFVKIPIPETGLEGLWPLFDEDGAEDVLMVLFPYGGIMDKI 425

Query: 430  SESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNY 489
            SESE PFPHR G LYKI Y + W ++  + EK+H+NWI++LYSYM PFVSKSPR AY+NY
Sbjct: 426  SESEIPFPHRYGTLYKIQYAVHWHQEGDEVEKLHINWIRKLYSYMEPFVSKSPRAAYINY 485

Query: 490  RDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRL-------------------------- 549
            RDLDIG NN  G TSYKQAS+WG+KYF NNF RL                          
Sbjct: 486  RDLDIGVNNINGYTSYKQASIWGVKYFKNNFKRLAKVKTKVDPLNFFRNEQSIPSHVFLS 545

Query: 550  ---------------------------------CLFNHSPDGNYSISKVIHTPINSSYSS 609
                                             CL ++S +   SISKV++T  N SYS 
Sbjct: 546  LVCSYLSVALIAILFSYASSAIDTNTYEDNFLQCLSSYSHNST-SISKVVYTKTNPSYSP 605

Query: 610  VLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYV 669
            VL F+ +NLRF++ +TPKPL+IITP   SHIQ A++CS+ +G QIRTRSGGHDFEGLSYV
Sbjct: 606  VLKFTTQNLRFASYKTPKPLVIITPLEPSHIQTAIICSQNHGLQIRTRSGGHDFEGLSYV 665

Query: 670  AHLPFIIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVG 729
            + +PF+++DLIN + I VDV++ TAWV SGATLGELYY+I+QKSR LAFPAG CPT+GVG
Sbjct: 666  SEIPFVVIDLINFKEIDVDVESRTAWVQSGATLGELYYTISQKSRNLAFPAGACPTIGVG 725

Query: 730  GHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVV 789
            GHFSGGGYG LLRKYGLAADNVIDA+++D  G+  DRE+MGED FWAIRGGGG SFG+++
Sbjct: 726  GHFSGGGYGTLLRKYGLAADNVIDAHIIDVKGRLLDREAMGEDYFWAIRGGGGASFGVII 785

Query: 790  AWKVKLVPVPATVTLCQTNRTLEE------------------------------------ 849
            +WK+KLV VPATVT+    R LE+                                    
Sbjct: 786  SWKIKLVEVPATVTVFTVPRALEQNATKLVHKWQYLASKIDENIAINIVFQRINSSKKGE 845

Query: 850  ---------------DELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEA 909
                           D+L+ +++ KFPEL + +++CIE SWIES VL    F      E 
Sbjct: 846  TTILAIFQALFLGGVDKLIPLMDQKFPELGVVRENCIEMSWIES-VLYLFQFPKGALPEV 905

Query: 910  LLSRTPLTNMST---KIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQI 966
            LL+RT   N      K KSD+VK PI E  ++GI    + +  +   +I+ PYGG M  I
Sbjct: 906  LLNRTLAANSPRFIYKAKSDFVKTPIPENGLEGIWSLFHEEGAKGAMMIWFPYGGIMDTI 965

BLAST of MS026401 vs. NCBI nr
Match: KAA8514840.1 (hypothetical protein F0562_018019 [Nyssa sinensis])

HSP 1 Score: 1101.3 bits (2847), Expect = 0.0e+00
Identity = 563/1014 (55.52%), Positives = 687/1014 (67.75%), Query Frame = 0

Query: 4   SAPLIPLALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSI 63
           S  L+ L   F+L     SW  ASA  +E FLQCLS HS    SIS VIYTP+N+SY  I
Sbjct: 5   STSLLSLLFVFLL---PISW-EASAHTHEDFLQCLSLHSQHSASISNVIYTPNNASYLPI 64

Query: 64  LNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVA 123
           L FSI+N +FS+   PKPL+IVTP H S IQA++ C + HG QIR RSGGHDYEGLSYV+
Sbjct: 65  LEFSIQNLRFSSATTPKPLVIVTPLHESQIQATIYCSRNHGMQIRVRSGGHDYEGLSYVS 124

Query: 124 YLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGG 183
            +PFIIVDLINLRS++VD +++TAWVQ+GAT+GELYY+IAEKS+T  FPAG+CPT+G+GG
Sbjct: 125 DIPFIIVDLINLRSITVDAENSTAWVQAGATVGELYYRIAEKSKTLAFPAGVCPTIGVGG 184

Query: 184 YFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVS 243
           +FSGGGYG LLRKYGLAADN+IDA L+D NG+  DRESMGEDLFWAIRGGGG SFG++++
Sbjct: 185 HFSGGGYGTLLRKYGLAADNIIDARLMDVNGRILDRESMGEDLFWAIRGGGGASFGVILA 244

Query: 244 WKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGK 303
           WK+KL PVP TVT  +  RTLE+ A K++ +WQYVA K  E+LF+ I     N S Q  K
Sbjct: 245 WKIKLAPVPSTVTVFTVRRTLEQNATKIVHQWQYVAHKFPEDLFIRILIRSLNSS-QDEK 304

Query: 304 TNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIE 363
               A F SLFLG  D+ ++++  +FPELGLVK+DC E+SWIES    A GF +G     
Sbjct: 305 RTIQASFNSLFLGGVDKLISLMQESFPELGLVKEDCIEMSWIESILYFA-GFPSG---AS 364

Query: 364 LETLLNR-PLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQI 423
            + LL+R PL     K KSD+VKEP+SE  ++GIW++   +D+E  +++  PYGGRM +I
Sbjct: 365 FDVLLDRTPLARSYFKAKSDYVKEPISESGLEGIWKQFYEEDVEAAEMILSPYGGRMDEI 424

Query: 424 SESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNY 483
           SES  PFPHRAGN+YKI + + W E+   A K HL WI+ LYSYM P+VSKSPR AY+NY
Sbjct: 425 SESSIPFPHRAGNIYKIQHLVYWAEEGTAASKRHLTWIRRLYSYMAPYVSKSPRLAYINY 484

Query: 484 RDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPDGNYSISKVIHTPINSSYS 543
           RDLDIG NN  G TSY QAS+WG+KYF NNFNR    NH                     
Sbjct: 485 RDLDIGVNNN-GNTSYIQASIWGIKYFKNNFNR----NH--------------------- 544

Query: 544 SVLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSY 603
                                                     G QIR RSGGHD+EGLSY
Sbjct: 545 ------------------------------------------GMQIRVRSGGHDYEGLSY 604

Query: 604 VAHLPFIIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGV 663
           V+ +PFIIVDLINLRSI+VD +N+TAWV +GAT+GELYY IA+KS+TLAFPAGVCPT+GV
Sbjct: 605 VSDIPFIIVDLINLRSITVDAENSTAWVQAGATVGELYYRIAEKSKTLAFPAGVCPTIGV 664

Query: 664 GGHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIV 723
           GGHFSGGGYG LLRKYGLAADN+IDA L+D NG+  DRESMGEDLFWAIRGGGG SFG++
Sbjct: 665 GGHFSGGGYGTLLRKYGLAADNIIDARLMDVNGRILDRESMGEDLFWAIRGGGGASFGVI 724

Query: 724 VAWKVKLVPVPATVTLCQTNRTLEE----------------------------------- 783
           +AWK+KL PVP+TVT+    RTLE+                                   
Sbjct: 725 LAWKIKLAPVPSTVTVFTVRRTLEQNATKIVHQWQYVAHKFPEDLFIRILIRSLNSSQDE 784

Query: 784 ----------------DELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLE 843
                           D+L++++   FPEL L K+DCIE SWIES +L   GF +  + +
Sbjct: 785 KRTIQASFNSLFLGGVDKLISLMQESFPELGLVKEDCIEMSWIES-ILYFAGFPSGASFD 844

Query: 844 ALLSRTPLTNMSTKIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISE 903
            LL RTPL     K KSDYVKEPISE+ ++GI ++   +D+E+  +I  PYGGRM +ISE
Sbjct: 845 VLLDRTPLARSYFKAKSDYVKEPISESGLEGIWKQFYEEDVEAAEMILSPYGGRMDEISE 904

Query: 904 SETPFSHRAGYLYKIGYIASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRD 963
           S  PF HRAG +YKI ++  W ++     KRHL+WIR LYSYMAP+VSKSPR AY NYRD
Sbjct: 905 SSIPFPHRAGNIYKIQHLVYWAEEGTAASKRHLTWIRRLYSYMAPYVSKSPRLAYINYRD 939

Query: 964 LDIGSNKRYGKTSYKQASTWGFKYFGNNFNRLVHVKTKVDPYDFFRHEQSIPTL 966
           LDIG N   G TSY QAS WG KYF NNFNRLVHVKT VDP +FFR+EQSIP L
Sbjct: 965 LDIGVNNN-GNTSYIQASIWGIKYFKNNFNRLVHVKTMVDPNNFFRNEQSIPPL 939

BLAST of MS026401 vs. NCBI nr
Match: GAY33888.1 (hypothetical protein CUMW_008580 [Citrus unshiu])

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 566/1084 (52.21%), Positives = 721/1084 (66.51%), Query Frame = 0

Query: 8    IPLALAFILVASSSSWGAASAD-----------KYEA--FLQCLSHHSSDGYSISKVIYT 67
            + L  A +L+ SS  W  +SA+            Y+A  F+QCL  +S D  SISK+IYT
Sbjct: 4    LALPFALVLLLSSQCWVTSSAENHAGSDDNPEPSYDAHKFVQCLLENSEDSTSISKLIYT 63

Query: 68   PSNSSYYSILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGH 127
             +NSS+ SIL+FSI+N +FST   PKP +IVTP   SH+QA++ C + +G Q+R RSGGH
Sbjct: 64   RTNSSFSSILDFSIQNLRFSTPTTPKPQVIVTPVKESHVQAAVKCSQKYGLQVRVRSGGH 123

Query: 128  DYEGLSYVA--YLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFP 187
            DYEGLSYV+  ++PF+I+D INL SVSVD ++ TAWVQ+GAT G++Y+ IAEKS+T  FP
Sbjct: 124  DYEGLSYVSNYHVPFVIIDFINLSSVSVDPEAKTAWVQAGATNGKVYHTIAEKSKTLAFP 183

Query: 188  AGICPTVGIGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRG 247
            AG+CPTVG+GG FSGGGYG L+RKYGLAADNV+DA+L+D NG+  DR+SMGEDLFWAIRG
Sbjct: 184  AGVCPTVGVGGLFSGGGYGFLMRKYGLAADNVVDAHLIDVNGRLLDRKSMGEDLFWAIRG 243

Query: 248  GGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFW 307
            GGG SFG++++WK+KLV VP TVT  +  RTLE+ A K++ RWQ+VA +LDE+L++ +F 
Sbjct: 244  GGGASFGVIIAWKIKLVTVPETVTAFTVARTLEQNATKIVDRWQHVADQLDEDLYIRVFL 303

Query: 308  TGGNGSRQGGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAA 367
               N S QG KT   A F SLFLG AD  + ++  +FPELGLVK+DC E+SWIES    A
Sbjct: 304  RSANSSTQGKKT-IRASFESLFLGGADVLLPLMQHSFPELGLVKEDCIEMSWIESIMYFA 363

Query: 368  NGFQNGEEAIELETLLNRPLTNIS-LKVKSDFVKEPLSELAIQGIWERLNRQDIELPQIL 427
                 G     L+ LLNR   N+   K KSDFV +P+ E+A QGI+ER   ++ E  +++
Sbjct: 364  -----GFRGQSLDVLLNRTQPNVRFFKAKSDFVYDPMPEIAFQGIYERFYEKEAEAAEMI 423

Query: 428  FVPYGGRMSQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFV 487
              PYGG M+QIS+S TPFPHRAG  YKI + + WEE+  +A + H++WI+ LY Y+ P+V
Sbjct: 424  LSPYGGVMNQISDSATPFPHRAGTRYKIQHIVYWEEEGSEASQRHISWIRRLYDYVAPYV 483

Query: 488  SKSPRTAYVNYRDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRL--------------- 547
            SK+PR AY+NYRDLDIG+NNK G TSYKQAS+WGLKYF NNF RL               
Sbjct: 484  SKNPRAAYLNYRDLDIGTNNK-GYTSYKQASIWGLKYFKNNFKRLVDVKTMVDPGNFFRN 543

Query: 548  --------------------------------------------------------CLFN 607
                                                                    CL  
Sbjct: 544  EQSIPPLSSRKKKAEMKSPCSSIIQFVVLLLSYHCWVTFGNIHATSSVPNENLFLHCLSM 603

Query: 608  HSPDGNYSISKVIHTPINSSYSSVLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVC 667
            HS D   SISKVI+T  NSS+SS+LDFSI+NLRFST  TPKP +I+TP   SH+QAAV C
Sbjct: 604  HS-DNFSSISKVIYTRNNSSFSSILDFSIQNLRFSTPTTPKPQVIVTPLKESHVQAAVKC 663

Query: 668  SKRYGFQIRTRSGGHDFEGLSYVA--HLPFIIVDLINLRSISVDVKNNTAWVHSGATLGE 727
            S++YG Q+R RSGGHD+EG SYV+  H+PF+++DLINL SISVD +  TAWV +GAT+G+
Sbjct: 664  SQKYGMQVRVRSGGHDYEGSSYVSNHHVPFVVIDLINLSSISVDAEAKTAWVQAGATIGK 723

Query: 728  LYYSIAQKSRTLAFPAGVCPTVGVGGHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFH 787
            LY++IA+KS+TLAFP GVCPTVGVGGHFSGGGYG L+RKYGLAADNV+DA+L+  NG+  
Sbjct: 724  LYHAIAEKSKTLAFPGGVCPTVGVGGHFSGGGYGFLMRKYGLAADNVVDAHLIVVNGRLL 783

Query: 788  DRESMGEDLFWAIRGGGGGSFGIVVAWKVKLVPVPATVTLCQTNRTLEE----------- 847
            DR+SMGEDLFWAIRGGGG SFG++VAWK+KLV VP TVT    NRTLE+           
Sbjct: 784  DRKSMGEDLFWAIRGGGGASFGVIVAWKIKLVTVPETVTAFIVNRTLEQNATKIVDRWQY 843

Query: 848  ----------------------------------------DELLTILNTKFPELDLAKKD 907
                                                    D LL ++   FPEL L K+D
Sbjct: 844  VADKLHEDLYIRVFLSSAASSRRGKKTIRASFESLFLGGADTLLLLMQQSFPELGLVKQD 903

Query: 908  CIETSWIESTVLMGIGFQTKVTLEALLSRTPLTNMSTKIKSDYVKEPISEATIQGIGERL 952
            CIE SWIES V+    F+ + +L+ LL+RT       K KSD+VKEP+ E    GI E+ 
Sbjct: 904  CIEMSWIES-VMYFAEFRGQ-SLDVLLNRTQPNVRFFKAKSDFVKEPMPEIAFLGIYEKF 963

BLAST of MS026401 vs. NCBI nr
Match: KAA3475986.1 (tetrahydrocannabinolic acid synthase-like [Gossypium australe])

HSP 1 Score: 1067.4 bits (2759), Expect = 7.3e-308
Identity = 565/1057 (53.45%), Positives = 710/1057 (67.17%), Query Frame = 0

Query: 1    MNYSAPLIPLALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSY 60
            + +S  L PL LA +L   S   GA+    ++ FLQCLS  S+D  SIS VIYT +NSSY
Sbjct: 3    LQFSMLLPPLVLAALL---SFPMGAS---PHKDFLQCLSLLSNDSTSISNVIYTRNNSSY 62

Query: 61   YSILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLS 120
              +L  +IRN +F++ + PKPL+IVTPS  SH QA++ C + HG QIRTRSGGHDYEGLS
Sbjct: 63   SFVLESTIRNLRFNSTDTPKPLVIVTPSRTSHFQATIYCARKHGLQIRTRSGGHDYEGLS 122

Query: 121  YVAYLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVG 180
            YVA +PF++VDL+N RSV VD ++  AWVQ+GA +GE+YY+IAEKSRT  F  GI  T+G
Sbjct: 123  YVAKVPFVVVDLVNFRSVDVDAENRVAWVQAGAILGEVYYRIAEKSRTLAFAGGIYHTIG 182

Query: 181  IGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGI 240
            +GGY SGGG+GLL RKYG   DNVIDA  +D NG+  DR+SMGEDLFWAIRGGGGGSFGI
Sbjct: 183  VGGYISGGGFGLLFRKYGTGGDNVIDAQFIDVNGRILDRKSMGEDLFWAIRGGGGGSFGI 242

Query: 241  VVSWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQ 300
            V++WK+ LVPVP TVT  S +RTLE+ A +LI +WQ +A +L + +   +     N ++ 
Sbjct: 243  VLAWKLILVPVPATVTAFSVSRTLEQNATQLILQWQDIAHQLPDEMNPDVTMFSINSTQD 302

Query: 301  GGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEE 360
            G KT   ALF SLFLG  DE + I+   FPELGL ++DCTE+SWIES  +  N  QN   
Sbjct: 303  GRKT-ILALFSSLFLGTIDELLPIMQQRFPELGLSRQDCTEMSWIES-ILYYNQLQNQ-- 362

Query: 361  AIELETLLNRPLTNI----SLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYG 420
               LE LLNR   ++      K+KSD+VKEP+SE A+ G++ RL+ ++     I+F+ YG
Sbjct: 363  --PLEILLNRTFRSLVGGQYYKIKSDYVKEPISETALNGLFSRLSDEEASSAIIIFMAYG 422

Query: 421  GRMSQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPR 480
            G M +I E  TPFPHRAGNLYKI Y + W+EQ     + +++W + +YSYMTPFVSK PR
Sbjct: 423  GIMDRIPEDSTPFPHRAGNLYKIYYNVNWQEQDNVNSQKYIDWARRVYSYMTPFVSKFPR 482

Query: 481  TAYVNYRDLDIGSNN-KYGKTSYKQASVWGLKYFSNNFNRL------------------- 540
             AY NYRDLDIGSNN KY  TSY QA +WG KYF NNFNRL                   
Sbjct: 483  EAYANYRDLDIGSNNVKY--TSYAQAKIWGRKYFKNNFNRLVQSLQFSILLLSLVLAALL 542

Query: 541  --------------CLFNHSPDGNYSISKVIHTPINSSYSSVLDFSIRNLRFSTAETPKP 600
                          CL   S D + +IS VI+T  N S+S+VL+ +IRNLRF++ +TPKP
Sbjct: 543  SFSMGALPHEDFLQCLSLSSNDSS-TISSVIYTRNNPSFSTVLESTIRNLRFNSTDTPKP 602

Query: 601  LLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYVAHLPFIIVDLINLRSISVD 660
            L+I+TPS  SH QA + CS+++G QIRTRSGGHD+EGLSYVA +PF++VDL+N RS+ VD
Sbjct: 603  LVIVTPSRTSHFQATIYCSRKHGLQIRTRSGGHDYEGLSYVAKVPFVVVDLVNFRSVDVD 662

Query: 661  VKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVGGHFSGGGYGLLLRKYGLAA 720
            V+N  AWV +GA LGE+YY IA+KSRTLAF  GV  ++GVGG+ SGGG+GLL RKYG A 
Sbjct: 663  VENRVAWVQAGAILGEVYYRIAEKSRTLAFAGGVFHSIGVGGYISGGGFGLLFRKYGTAG 722

Query: 721  DNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVAWKVKLVPVPATVTLCQTN 780
            DNVIDA  +D NG+  DR+SMGEDLFWAIRGGGGGSFGIV+AWK+KLVPVPATVT    +
Sbjct: 723  DNVIDAQFIDVNGRILDRKSMGEDLFWAIRGGGGGSFGIVLAWKLKLVPVPATVTAFSVS 782

Query: 781  RTLEE---------------------------------------------------DELL 840
            RTLE+                                                   DELL
Sbjct: 783  RTLEQNATQLILRWQEIAHQLPDEMNPDLSMFSVNSTQDGRKTILASFSSLFLGTIDELL 842

Query: 841  TILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEALLSRT---PLTNMSTKIKS 900
             I+  +FP+L L+++DC E SWIES VL     Q +  LE LL+RT   P+     K+KS
Sbjct: 843  PIMQQRFPKLGLSRQDCSEMSWIES-VLYFSQLQNQ-PLEILLNRTFRNPIGGQYFKVKS 902

Query: 901  DYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISESETPFSHRAGYLYKIGY 960
            DYVKEPISE  + G+  RL+ ++  S  ++F+ YGG M +I E  TPF HRAG LYKI Y
Sbjct: 903  DYVKEPISETALNGLFSRLSDEEASSAIIVFMAYGGIMDRIPEEATPFPHRAGNLYKIYY 962

Query: 961  IASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRDLDIGSNKRYGKTSYKQA 966
              +W +Q     ++++ W R +Y+YM PFVSKSPR AYANYRDLDIGSN   G TSY QA
Sbjct: 963  NVNWQEQDNVNSQKYIDWSRRVYNYMTPFVSKSPREAYANYRDLDIGSN-NVGITSYTQA 1022

BLAST of MS026401 vs. ExPASy Swiss-Prot
Match: Q9FI21 (Berberine bridge enzyme-like 28 OS=Arabidopsis thaliana OX=3702 GN=At5g44440 PE=1 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 2.9e-142
Identity = 251/497 (50.50%), Positives = 338/497 (68.01%), Query Frame = 0

Query: 25  AASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLI 84
           +A    +E FL+CLS+  +D     KVI+T  +SS++SIL+ SI+N +FS  E PKP+ I
Sbjct: 22  SAHGSNHEDFLKCLSYRMNDNTVEPKVIHTSKDSSFFSILDSSIQNPRFSVSETPKPVSI 81

Query: 85  VTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAY-LPFIIVDLINLRSVSVDTK 144
           +TP   S +Q  + C + HG  +RTRS GH YEGLSY+AY  PF ++DL NLRS+S+D  
Sbjct: 82  ITPVKASDVQTVIRCAQLHGIHVRTRSAGHCYEGLSYIAYNKPFAVIDLRNLRSISLDVD 141

Query: 145 SNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADN 204
           + T WVQ+GAT GELYY+I + +++  FPAGI PTVG+GG FSGGGYG LLRKYGLAADN
Sbjct: 142 NRTGWVQTGATAGELYYEIGKTTKSLAFPAGIHPTVGVGGQFSGGGYGTLLRKYGLAADN 201

Query: 205 VIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRT 264
           +IDA +VDA+G+  DR++MGED FWAIRGGGG SFG+++SWKVKLV VP T+T     +T
Sbjct: 202 IIDALVVDASGRILDRQAMGEDYFWAIRGGGGSSFGVILSWKVKLVDVPSTITVFKVQKT 261

Query: 265 LEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTALFFSLFLGKADEAVA 324
            ++ AV++I++WQY A K+ ++LF+       N      K    ALF  L++G  +  +A
Sbjct: 262 SKKEAVRIIKKWQYAADKVPDDLFIRTTLERSN------KNAVHALFTGLYIGPVNNLLA 321

Query: 325 ILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPLTNISLKVKSDF 384
           ++   FPELGL K+ C E+SWIES    A+ F  GE    L  L NR  T++S K K DF
Sbjct: 322 LMEEKFPELGLEKEGCEEMSWIESVLWFAD-FPKGE---SLGVLTNRERTSLSFKGKDDF 381

Query: 385 VKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFL 444
           V+EP+ E AIQ IW RL   +  L +I+  P+GG+MS+++E ETPFPHR GNLY+I Y  
Sbjct: 382 VQEPIPEAAIQEIWRRLEAPEARLGKIILTPFGGKMSEMAEYETPFPHRGGNLYEIQYVA 441

Query: 445 RWEEQSVDAEK----MHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYK 504
            W E+  D  K     +L W+  +Y +MTP+VSKSPR AYVN++D+D+G      KT Y+
Sbjct: 442 YWREEE-DKNKTETDKYLKWVDSVYEFMTPYVSKSPRGAYVNFKDMDLGMYLGKKKTKYE 501

Query: 505 QASVWGLKYFSNNFNRL 517
           +   WG+KYF NNF RL
Sbjct: 502 EGKSWGVKYFKNNFERL 507

BLAST of MS026401 vs. ExPASy Swiss-Prot
Match: Q9SVG5 (Berberine bridge enzyme-like 18 OS=Arabidopsis thaliana OX=3702 GN=At4g20820 PE=3 SV=1)

HSP 1 Score: 503.4 bits (1295), Expect = 5.5e-141
Identity = 245/500 (49.00%), Positives = 342/500 (68.40%), Query Frame = 0

Query: 25  AASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLI 84
           +A+     +FLQCLS   +D   +SKVI+TP+++S+ S+L  SI+N +FS  ++PKP+LI
Sbjct: 28  SANRSNQSSFLQCLSLQLNDSNIVSKVIHTPNDTSFSSVLASSIQNQRFSAPDVPKPVLI 87

Query: 85  VTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFIIVDLINLRSVSVDTKS 144
           +TP   S +Q+++ C +  G  IRTRSGGHDYEGLSYV + PF+I+DL NLRS++VD  +
Sbjct: 88  LTPVQPSDVQSAVKCARRFGIHIRTRSGGHDYEGLSYVTHKPFVILDLRNLRSITVDVDN 147

Query: 145 NTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADNV 204
            + WVQ+GATIGELYY+I +K+RT  FPAG+CPTVG+GG+FSGGGYG LLRK+GLAAD+V
Sbjct: 148 RSVWVQTGATIGELYYEIGKKNRTLAFPAGVCPTVGVGGHFSGGGYGTLLRKHGLAADHV 207

Query: 205 IDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTL 264
           IDA +VDA G+  +R  MGED FWAIRGGGG SF +V+SWK+ L+ VP TVT  +  +  
Sbjct: 208 IDARVVDARGRILERREMGEDFFWAIRGGGGSSFCVVLSWKIGLINVPSTVTVFNVTKFS 267

Query: 265 EEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTALFFSLFLGKADEAVAI 324
           E+ A+K+I RWQ+VA K+ ++LF+ +         Q  K    A F  L+LG     + +
Sbjct: 268 EQSALKIIHRWQFVADKVSDDLFIRVM-------LQRYKNMVRASFPGLYLGSVKNLLKM 327

Query: 325 LNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPLTNISLKVKSDFV 384
           +N  FPELGL + DCTE+SWIES    A   + GEE I +  L  R   +++ K KSDFV
Sbjct: 328 VNKEFPELGLEEDDCTEMSWIESVIWFA---ELGEEPINV--LTKRTRASLAFKAKSDFV 387

Query: 385 KEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFLR 444
           +EP+ + AI  +W RL   + E  Q++F P+GG+MS+I++ ETPFPHR GN+Y+I Y   
Sbjct: 388 QEPMPKTAISKLWRRLQEPEAEHAQLIFTPFGGKMSEIADYETPFPHRKGNIYEIQYLNY 447

Query: 445 WEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYKQASVW 504
           W     D ++ ++ W++ +Y  M+ FV+KSPR AY+N RDLD+G      ++ Y++   W
Sbjct: 448 WRG---DVKEKYMRWVERVYDDMSEFVAKSPRGAYINLRDLDLGMYVGVKRSKYEEGKSW 507

Query: 505 GLKYFSNNFNRLCLFNHSPD 525
           G+KYF NNF RL     S D
Sbjct: 508 GVKYFKNNFERLVRVKTSVD 512

BLAST of MS026401 vs. ExPASy Swiss-Prot
Match: Q9FI25 (Berberine bridge enzyme-like 27 OS=Arabidopsis thaliana OX=3702 GN=At5g44410 PE=2 SV=1)

HSP 1 Score: 494.6 bits (1272), Expect = 2.6e-138
Identity = 252/519 (48.55%), Positives = 351/519 (67.63%), Query Frame = 0

Query: 7   LIPLALAFILV-ASSSSWGAASADK--YEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSI 66
           L+ L + F+L+  S S + + SA +  +E FL+CLSH  ++    S++I+T  + SY+SI
Sbjct: 7   LLSLFIYFLLLNLSLSHFPSISAQRTNHENFLKCLSHRINE--DDSRIIHTSKDPSYFSI 66

Query: 67  LNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVA 126
           LN SI+N +F  +E PKP+ I+TP   + +Q+++ C + HG  IRTRSGGHDYEGLSY+A
Sbjct: 67  LNSSIQNPRFFVLETPKPVSIITPVQATDVQSTIKCARLHGIHIRTRSGGHDYEGLSYMA 126

Query: 127 -YLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIG 186
              PF+++DL NLRS+++D  + T WVQSGATIGELYY+I + S++  FPAG+ PTVGIG
Sbjct: 127 KSRPFVVIDLRNLRSITLDVDNRTGWVQSGATIGELYYEIGKLSKSLAFPAGLYPTVGIG 186

Query: 187 GYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVV 246
           G F GGGYG L+RKYGL+ADNVIDA++VDANG F DR+ MGED FWAIRGGGG SF +V+
Sbjct: 187 GQFGGGGYGTLMRKYGLSADNVIDAHIVDANGSFLDRQGMGEDFFWAIRGGGGSSFSVVL 246

Query: 247 SWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGG 306
           SWK++L+ VP  VT     +T E+ AV +I +WQY+A K+  +LF+         +    
Sbjct: 247 SWKIRLLDVPSVVTVFKVVKTSEKEAVSIINKWQYIADKVPNDLFI--------RAMLQK 306

Query: 307 KTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAI 366
           +T   A F  L+LG   + +A++   FPELGL   +C E+SWIES       F  GE   
Sbjct: 307 ETEVYASFPGLYLGPVSDLLALMKDKFPELGLEIGNCREMSWIESVL----WFIKGE--- 366

Query: 367 ELETLLNRPLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQI 426
            +E L  R  T+ S K K DF++EP+ + AIQ +W R    +  L +I+  P+GG+MS+I
Sbjct: 367 SMEILAKRKRTSRSFKGKDDFIEEPIPKTAIQYLWRRFEAPEARLAKIILTPFGGKMSEI 426

Query: 427 SESETPFPHRAGNLYKIGYFLRWEEQ----SVDAEKMHLNWIQELYSYMTPFVSKSPRTA 486
           +++E PFPHR GNLY+I Y   W E+      + EK +L W++ +Y +MTP+VSKSPR A
Sbjct: 427 ADNEIPFPHREGNLYEIQYLAYWSEEEDKNKTNTEK-YLRWVESVYEFMTPYVSKSPRRA 486

Query: 487 YVNYRDLDIGSNNKYG-KTSYKQASVWGLKYFSNNFNRL 517
           YVN+RD+D+G       KT Y++A VWG+KYF NNF+RL
Sbjct: 487 YVNFRDIDLGMYLGLNMKTKYEEAKVWGVKYFKNNFDRL 507

BLAST of MS026401 vs. ExPASy Swiss-Prot
Match: Q33DQ2 (Cannabichromenic acid synthase OS=Cannabis sativa OX=3483 GN=CBCAS PE=1 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 4.4e-138
Identity = 245/503 (48.71%), Positives = 334/503 (66.40%), Query Frame = 0

Query: 28  ADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLIVTP 87
           A+  E FL+C S +  +  +  K IYT  +  Y S+LN +I+N +F++   PKPL+IVTP
Sbjct: 28  ANPQENFLKCFSEYIPNNPANPKFIYTQHDQLYMSVLNSTIQNLRFTSDTTPKPLVIVTP 87

Query: 88  SHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFIIVDLINLRSVSVDTKSNTA 147
           S+VSHIQAS++C K  G QIRTRSGGHD EGLSY++ +PF IVDL N+ +V VD  S TA
Sbjct: 88  SNVSHIQASILCSKKVGLQIRTRSGGHDAEGLSYISQVPFAIVDLRNMHTVKVDIHSQTA 147

Query: 148 WVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADNVIDA 207
           WV++GAT+GE+YY I E +  ++FP G CPTVG+GG+FSGGGYG L+R YGLAADN+IDA
Sbjct: 148 WVEAGATLGEVYYWINEMNENFSFPGGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDA 207

Query: 208 YLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTLE-E 267
           +LV+ +GK  DR+SMGEDLFWAIRGGGG +FGI+ +WK+KLV VP   T  S  + +E  
Sbjct: 208 HLVNVDGKVLDRKSMGEDLFWAIRGGGGENFGIIAAWKIKLVVVPSKATIFSVKKNMEIH 267

Query: 268 GAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPT--ALFFSLFLGKADEAVAI 327
           G VKL  +WQ +A K D++L L   +   N +   GK   T    F S+FLG  D  V +
Sbjct: 268 GLVKLFNKWQNIAYKYDKDLMLTTHFRTRNITDNHGKNKTTVHGYFSSIFLGGVDSLVDL 327

Query: 328 LNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPL-TNISLKVKSDF 387
           +N +FPELG+ K DC E+SWI++    +          + E LL+R      +  +K D+
Sbjct: 328 MNKSFPELGIKKTDCKELSWIDTTIFYSGVVNYNTANFKKEILLDRSAGKKTAFSIKLDY 387

Query: 388 VKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFL 447
           VK+ + E A+  I E+L  +++ +   +  PYGG M +ISES  PFPHRAG +Y++ Y  
Sbjct: 388 VKKLIPETAMVKILEKLYEEEVGVGMYVLYPYGGIMDEISESAIPFPHRAGIMYELWYTA 447

Query: 448 RWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYKQASV 507
            WE+Q  D EK H+NW++ +Y++ TP+VS++PR AY+NYRDLD+G  N     +Y QA +
Sbjct: 448 TWEKQE-DNEK-HINWVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNPESPNNYTQARI 507

Query: 508 WGLKYFSNNFNRLCLFNHSPDGN 527
           WG KYF  NFNRL       D N
Sbjct: 508 WGEKYFGKNFNRLVKVKTKADPN 528

BLAST of MS026401 vs. ExPASy Swiss-Prot
Match: Q8GTB6 (Tetrahydrocannabinolic acid synthase OS=Cannabis sativa OX=3483 GN=THCAS PE=1 SV=1)

HSP 1 Score: 493.4 bits (1269), Expect = 5.7e-138
Identity = 243/503 (48.31%), Positives = 332/503 (66.00%), Query Frame = 0

Query: 28  ADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLIVTP 87
           A+  E FL+C S H  +  +  K++YT  +  Y SILN +I+N +F +   PKPL+IVTP
Sbjct: 28  ANPRENFLKCFSKHIPNNVANPKLVYTQHDQLYMSILNSTIQNLRFISDTTPKPLVIVTP 87

Query: 88  SHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFIIVDLINLRSVSVDTKSNTA 147
           S+ SHIQA+++C K  G QIRTRSGGHD EG+SY++ +PF++VDL N+ S+ +D  S TA
Sbjct: 88  SNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVPFVVVDLRNMHSIKIDVHSQTA 147

Query: 148 WVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADNVIDA 207
           WV++GAT+GE+YY I EK+   +FP G CPTVG+GG+FSGGGYG L+R YGLAADN+IDA
Sbjct: 148 WVEAGATLGEVYYWINEKNENLSFPGGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDA 207

Query: 208 YLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTLE-E 267
           +LV+ +GK  DR+SMGEDLFWAIRGGGG +FGI+ +WK+KLV VP   T  S  + +E  
Sbjct: 208 HLVNVDGKVLDRKSMGEDLFWAIRGGGGENFGIIAAWKIKLVAVPSKSTIFSVKKNMEIH 267

Query: 268 GAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPT--ALFFSLFLGKADEAVAI 327
           G VKL  +WQ +A K D++L L   +   N +   GK   T    F S+F G  D  V +
Sbjct: 268 GLVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFHGGVDSLVDL 327

Query: 328 LNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPL-TNISLKVKSDF 387
           +N +FPELG+ K DC E SWI++    +          + E LL+R      +  +K D+
Sbjct: 328 MNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEILLDRSAGKKTAFSIKLDY 387

Query: 388 VKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFL 447
           VK+P+ E A+  I E+L  +D+     +  PYGG M +ISES  PFPHRAG +Y++ Y  
Sbjct: 388 VKKPIPETAMVKILEKLYEEDVGAGMYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTA 447

Query: 448 RWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYKQASV 507
            WE+Q  D EK H+NW++ +Y++ TP+VS++PR AY+NYRDLD+G  N     +Y QA +
Sbjct: 448 SWEKQE-DNEK-HINWVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARI 507

Query: 508 WGLKYFSNNFNRLCLFNHSPDGN 527
           WG KYF  NFNRL       D N
Sbjct: 508 WGEKYFGKNFNRLVKVKTKVDPN 528

BLAST of MS026401 vs. ExPASy TrEMBL
Match: A0A5J4ZNA2 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_013560 PE=3 SV=1)

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 572/1006 (56.86%), Positives = 715/1006 (71.07%), Query Frame = 0

Query: 10  LALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIR 69
           L+ AF+ +  S SW  ASAD +E FL CLS HS +  +ISKVIYTP+N SY S+L FSIR
Sbjct: 9   LSFAFVFL-FSISW-VASADTHEDFLHCLSLHSENSAAISKVIYTPNNPSYLSVLQFSIR 68

Query: 70  NFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFII 129
           N +F+    PKPL+IVTP H S IQA++ C K HG QIR RSGGHDYEGLSYV+ +PF+I
Sbjct: 69  NLRFARPTTPKPLVIVTPLHESQIQAAIYCSKEHGMQIRVRSGGHDYEGLSYVSEVPFVI 128

Query: 130 VDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGG 189
           VDLINLRS++VD +++TAWVQ+GAT+GELYY+IAEKS+T  F AG+CPTVG+GG+FSGGG
Sbjct: 129 VDLINLRSITVDIENSTAWVQAGATLGELYYRIAEKSKTIGFTAGVCPTVGVGGHFSGGG 188

Query: 190 YGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLV 249
           YG++ RK+G+A DN+IDA+L+D NG+  DRESMGEDLFWAIRGGGG SFG++++WK+KL+
Sbjct: 189 YGMMSRKHGIAVDNIIDAHLIDVNGRILDRESMGEDLFWAIRGGGGASFGVILAWKIKLI 248

Query: 250 PVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTAL 309
            VP  VT  + NRTLE+ A +L+ RWQY+A K DENL + +F    N S Q GK    A 
Sbjct: 249 VVPEKVTVFTVNRTLEQNATELVHRWQYIADKFDENLLMRVFIRRANSS-QDGKRTLQAS 308

Query: 310 FFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLN 369
           F SLFLG+ D  + ++  +FPEL L K DC E+SWIES    A GF +GE    L+ LLN
Sbjct: 309 FTSLFLGEVDTLLPLMQKSFPELRLAKDDCIEMSWIESILYFA-GFPSGE---SLDVLLN 368

Query: 370 RPLT-NISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETP 429
           R     I  K KSD+VK+P+SE  ++G WE L  + +E  + L+ PYGGR+S+ISESETP
Sbjct: 369 RTSQGGIYFKGKSDYVKQPISEKGLKGSWEMLYDEKLEGVEFLYSPYGGRLSEISESETP 428

Query: 430 FPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIG 489
           FPHR+GN+Y I Y + W E  +   + ++NWI+ +YSYMTPFVSKSPR AY NYRDLD+G
Sbjct: 429 FPHRSGNIYNIHYIVFWGEADIATSEWNINWIRRVYSYMTPFVSKSPRAAYFNYRDLDLG 488

Query: 490 SNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPDGNYSISKVIHTPINSSYSSVLDFS 549
            NNK G TSY QAS+WG+KYF NNFNRL                              FS
Sbjct: 489 VNNK-GNTSYTQASIWGVKYFKNNFNRL------------------------------FS 548

Query: 550 IRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYVAHLPF 609
           IRNLRF+   TPKPL+I+TP H S IQAA+ CSK +G QIR RSGGHD+EGLSYV+ +PF
Sbjct: 549 IRNLRFARPTTPKPLVIVTPLHESQIQAAIYCSKEHGMQIRVRSGGHDYEGLSYVSEVPF 608

Query: 610 IIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVGGHFSG 669
           +IVDLINLRSI+VD++N+TAWV +GAT+GELYY IA+KS+TL F AGVCPTVGVGGHFSG
Sbjct: 609 VIVDLINLRSITVDIENSTAWVQAGATIGELYYRIAEKSKTLGFTAGVCPTVGVGGHFSG 668

Query: 670 GGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVAWKVK 729
           GGYG++ RK+G+AADN+IDA+L+D NG+  DRESMGEDLFWAIRGGGG SFG+++AWK+K
Sbjct: 669 GGYGMMSRKHGIAADNIIDAHLIDVNGRILDRESMGEDLFWAIRGGGGASFGVILAWKIK 728

Query: 730 LVPVPATVTLCQTNRTLEE----------------------------------------- 789
           L+ VP  VT+   NRTLE+                                         
Sbjct: 729 LIVVPEKVTVFTVNRTLEQNATALVHRWQYIADKFDENLLMRVFIRRANSSQDGKRTIQA 788

Query: 790 ----------DELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEALLSRT 849
                     D LL ++   FPEL LAK DCIE SWIES +L   GF +  +L+ LL+RT
Sbjct: 789 SFTSLFLGEVDTLLPLMQKSFPELRLAKDDCIEMSWIES-ILYFAGFPSGESLDVLLNRT 848

Query: 850 PLTNMSTKIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISESETPFS 909
              +   K KSDYVK+PISE  ++G  E L  + +E+   ++ PYGGR+S+ISESETPF 
Sbjct: 849 SQGSAYFKGKSDYVKQPISEKGLKGSWEMLYDEKLEAVEFLYSPYGGRLSEISESETPFP 908

Query: 910 HRAGYLYKIGYIASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRDLDIGSN 964
           HRAG +Y I Y+ +W +  I T + +++WIR +YSYM PFVSKSPRAAY NYRDLD+G N
Sbjct: 909 HRAGNIYNIHYVVAWGEADIATSEWNINWIRRVYSYMTPFVSKSPRAAYFNYRDLDLGVN 968

BLAST of MS026401 vs. ExPASy TrEMBL
Match: A0A396IB28 (Putative tetrahydroberberine oxidase OS=Medicago truncatula OX=3880 GN=MtrunA17_Chr4g0051421 PE=3 SV=1)

HSP 1 Score: 1105.5 bits (2858), Expect = 0.0e+00
Identity = 571/1076 (53.07%), Positives = 716/1076 (66.54%), Query Frame = 0

Query: 10   LALAFILVA---SSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNF 69
            L L  +L+A   S +S+   ++   + FLQCL  +S +  SISKV+YT +NSSY SIL F
Sbjct: 6    LYLTIVLIAIAFSFTSFAIDTSPHEDNFLQCLYSYSHNITSISKVVYTKTNSSYSSILKF 65

Query: 70   SIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLP 129
            SI+N +F+T E PKPL+I+TP+ +SHIQ ++IC + HG QIR RSGGHD+EGLS+V+ +P
Sbjct: 66   SIQNLRFATNETPKPLVIITPTQISHIQTAIICSQHHGMQIRIRSGGHDFEGLSFVSNVP 125

Query: 130  FIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFS 189
            F+I+DL N R + VD ++ TAWVQSGAT+GELYYKIA+KS+T  FP G+CPTVG+GG+FS
Sbjct: 126  FVIIDLTNFRGIDVDVENRTAWVQSGATLGELYYKIAQKSKTLGFPGGVCPTVGVGGHFS 185

Query: 190  GGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKV 249
            GGGYG LLRKYGLAADNVIDA+++D  G+F DRE+MGEDLFWAIRGGGG SFG++VSWK+
Sbjct: 186  GGGYGTLLRKYGLAADNVIDAHIIDVKGRFLDREAMGEDLFWAIRGGGGASFGVIVSWKI 245

Query: 250  KLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFW--TGGNGSRQG-GK 309
            KLV VP TVT  +  RTLE+ A KL+ +WQ+VA KL+ENL + I       N S+QG  K
Sbjct: 246  KLVQVPSTVTVFTVPRTLEQNATKLVHKWQFVAHKLEENLAINIILQRLDLNSSKQGEPK 305

Query: 310  TNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIE 369
            +   ALF SLFLG  D  + ++   FPELGLV++DC E+SWIES       F  GE    
Sbjct: 306  STVLALFQSLFLGSVDNLLPLMEEKFPELGLVREDCVEMSWIESVLYLFR-FPEGE---P 365

Query: 370  LETLLNRPL-TNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQI 429
            LETLLNR L    + K KSDFVK P+ E  ++G+W   +    E   ++  PYGG M +I
Sbjct: 366  LETLLNRTLAAKDNSKAKSDFVKIPIPETGLEGLWPLFDEDGAEDVLMVLFPYGGIMDKI 425

Query: 430  SESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNY 489
            SESE PFPHR G LYKI Y + W ++  + EK+H+NWI++LYSYM PFVSKSPR AY+NY
Sbjct: 426  SESEIPFPHRYGTLYKIQYAVHWHQEGDEVEKLHINWIRKLYSYMEPFVSKSPRAAYINY 485

Query: 490  RDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRL-------------------------- 549
            RDLDIG NN  G TSYKQAS+WG+KYF NNF RL                          
Sbjct: 486  RDLDIGVNNINGYTSYKQASIWGVKYFKNNFKRLAKVKTKVDPLNFFRNEQSIPSHVFLS 545

Query: 550  ---------------------------------CLFNHSPDGNYSISKVIHTPINSSYSS 609
                                             CL ++S +   SISKV++T  N SYS 
Sbjct: 546  LVCSYLSVALIAILFSYASSAIDTNTYEDNFLQCLSSYSHNST-SISKVVYTKTNPSYSP 605

Query: 610  VLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYV 669
            VL F+ +NLRF++ +TPKPL+IITP   SHIQ A++CS+ +G QIRTRSGGHDFEGLSYV
Sbjct: 606  VLKFTTQNLRFASYKTPKPLVIITPLEPSHIQTAIICSQNHGLQIRTRSGGHDFEGLSYV 665

Query: 670  AHLPFIIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVG 729
            + +PF+++DLIN + I VDV++ TAWV SGATLGELYY+I+QKSR LAFPAG CPT+GVG
Sbjct: 666  SEIPFVVIDLINFKEIDVDVESRTAWVQSGATLGELYYTISQKSRNLAFPAGACPTIGVG 725

Query: 730  GHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVV 789
            GHFSGGGYG LLRKYGLAADNVIDA+++D  G+  DRE+MGED FWAIRGGGG SFG+++
Sbjct: 726  GHFSGGGYGTLLRKYGLAADNVIDAHIIDVKGRLLDREAMGEDYFWAIRGGGGASFGVII 785

Query: 790  AWKVKLVPVPATVTLCQTNRTLEE------------------------------------ 849
            +WK+KLV VPATVT+    R LE+                                    
Sbjct: 786  SWKIKLVEVPATVTVFTVPRALEQNATKLVHKWQYLASKIDENIAINIVFQRINSSKKGE 845

Query: 850  ---------------DELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEA 909
                           D+L+ +++ KFPEL + +++CIE SWIES VL    F      E 
Sbjct: 846  TTILAIFQALFLGGVDKLIPLMDQKFPELGVVRENCIEMSWIES-VLYLFQFPKGALPEV 905

Query: 910  LLSRTPLTNMST---KIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQI 966
            LL+RT   N      K KSD+VK PI E  ++GI    + +  +   +I+ PYGG M  I
Sbjct: 906  LLNRTLAANSPRFIYKAKSDFVKTPIPENGLEGIWSLFHEEGAKGAMMIWFPYGGIMDTI 965

BLAST of MS026401 vs. ExPASy TrEMBL
Match: A0A5J4Z8V5 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_018019 PE=3 SV=1)

HSP 1 Score: 1101.3 bits (2847), Expect = 0.0e+00
Identity = 563/1014 (55.52%), Positives = 687/1014 (67.75%), Query Frame = 0

Query: 4   SAPLIPLALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSI 63
           S  L+ L   F+L     SW  ASA  +E FLQCLS HS    SIS VIYTP+N+SY  I
Sbjct: 5   STSLLSLLFVFLL---PISW-EASAHTHEDFLQCLSLHSQHSASISNVIYTPNNASYLPI 64

Query: 64  LNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVA 123
           L FSI+N +FS+   PKPL+IVTP H S IQA++ C + HG QIR RSGGHDYEGLSYV+
Sbjct: 65  LEFSIQNLRFSSATTPKPLVIVTPLHESQIQATIYCSRNHGMQIRVRSGGHDYEGLSYVS 124

Query: 124 YLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGG 183
            +PFIIVDLINLRS++VD +++TAWVQ+GAT+GELYY+IAEKS+T  FPAG+CPT+G+GG
Sbjct: 125 DIPFIIVDLINLRSITVDAENSTAWVQAGATVGELYYRIAEKSKTLAFPAGVCPTIGVGG 184

Query: 184 YFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVS 243
           +FSGGGYG LLRKYGLAADN+IDA L+D NG+  DRESMGEDLFWAIRGGGG SFG++++
Sbjct: 185 HFSGGGYGTLLRKYGLAADNIIDARLMDVNGRILDRESMGEDLFWAIRGGGGASFGVILA 244

Query: 244 WKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGK 303
           WK+KL PVP TVT  +  RTLE+ A K++ +WQYVA K  E+LF+ I     N S Q  K
Sbjct: 245 WKIKLAPVPSTVTVFTVRRTLEQNATKIVHQWQYVAHKFPEDLFIRILIRSLNSS-QDEK 304

Query: 304 TNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIE 363
               A F SLFLG  D+ ++++  +FPELGLVK+DC E+SWIES    A GF +G     
Sbjct: 305 RTIQASFNSLFLGGVDKLISLMQESFPELGLVKEDCIEMSWIESILYFA-GFPSG---AS 364

Query: 364 LETLLNR-PLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQI 423
            + LL+R PL     K KSD+VKEP+SE  ++GIW++   +D+E  +++  PYGGRM +I
Sbjct: 365 FDVLLDRTPLARSYFKAKSDYVKEPISESGLEGIWKQFYEEDVEAAEMILSPYGGRMDEI 424

Query: 424 SESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNY 483
           SES  PFPHRAGN+YKI + + W E+   A K HL WI+ LYSYM P+VSKSPR AY+NY
Sbjct: 425 SESSIPFPHRAGNIYKIQHLVYWAEEGTAASKRHLTWIRRLYSYMAPYVSKSPRLAYINY 484

Query: 484 RDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPDGNYSISKVIHTPINSSYS 543
           RDLDIG NN  G TSY QAS+WG+KYF NNFNR    NH                     
Sbjct: 485 RDLDIGVNNN-GNTSYIQASIWGIKYFKNNFNR----NH--------------------- 544

Query: 544 SVLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSY 603
                                                     G QIR RSGGHD+EGLSY
Sbjct: 545 ------------------------------------------GMQIRVRSGGHDYEGLSY 604

Query: 604 VAHLPFIIVDLINLRSISVDVKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGV 663
           V+ +PFIIVDLINLRSI+VD +N+TAWV +GAT+GELYY IA+KS+TLAFPAGVCPT+GV
Sbjct: 605 VSDIPFIIVDLINLRSITVDAENSTAWVQAGATVGELYYRIAEKSKTLAFPAGVCPTIGV 664

Query: 664 GGHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIV 723
           GGHFSGGGYG LLRKYGLAADN+IDA L+D NG+  DRESMGEDLFWAIRGGGG SFG++
Sbjct: 665 GGHFSGGGYGTLLRKYGLAADNIIDARLMDVNGRILDRESMGEDLFWAIRGGGGASFGVI 724

Query: 724 VAWKVKLVPVPATVTLCQTNRTLEE----------------------------------- 783
           +AWK+KL PVP+TVT+    RTLE+                                   
Sbjct: 725 LAWKIKLAPVPSTVTVFTVRRTLEQNATKIVHQWQYVAHKFPEDLFIRILIRSLNSSQDE 784

Query: 784 ----------------DELLTILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLE 843
                           D+L++++   FPEL L K+DCIE SWIES +L   GF +  + +
Sbjct: 785 KRTIQASFNSLFLGGVDKLISLMQESFPELGLVKEDCIEMSWIES-ILYFAGFPSGASFD 844

Query: 844 ALLSRTPLTNMSTKIKSDYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISE 903
            LL RTPL     K KSDYVKEPISE+ ++GI ++   +D+E+  +I  PYGGRM +ISE
Sbjct: 845 VLLDRTPLARSYFKAKSDYVKEPISESGLEGIWKQFYEEDVEAAEMILSPYGGRMDEISE 904

Query: 904 SETPFSHRAGYLYKIGYIASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRD 963
           S  PF HRAG +YKI ++  W ++     KRHL+WIR LYSYMAP+VSKSPR AY NYRD
Sbjct: 905 SSIPFPHRAGNIYKIQHLVYWAEEGTAASKRHLTWIRRLYSYMAPYVSKSPRLAYINYRD 939

Query: 964 LDIGSNKRYGKTSYKQASTWGFKYFGNNFNRLVHVKTKVDPYDFFRHEQSIPTL 966
           LDIG N   G TSY QAS WG KYF NNFNRLVHVKT VDP +FFR+EQSIP L
Sbjct: 965 LDIGVNNN-GNTSYIQASIWGIKYFKNNFNRLVHVKTMVDPNNFFRNEQSIPPL 939

BLAST of MS026401 vs. ExPASy TrEMBL
Match: A0A6N2KFL4 (Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS38692 PE=3 SV=1)

HSP 1 Score: 1072.8 bits (2773), Expect = 7.2e-310
Identity = 542/973 (55.70%), Positives = 686/973 (70.50%), Query Frame = 0

Query: 48   ISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQI 107
            ISKVIYT ++SSY S+L+F+IRN +F++  + KPL IVTP   SHIQA++ C + +  QI
Sbjct: 312  ISKVIYTQNDSSYPSVLHFAIRNLRFNSTTL-KPLAIVTPMKASHIQAAIRCSQKNNLQI 371

Query: 108  RTRSGGHDYEGLSYVAYLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSR 167
            R RSGGHD+EGLSY++ LPF+I+DLIN RSV++D  + TAWVQ+GAT+GELYY IA K R
Sbjct: 372  RIRSGGHDFEGLSYMSVLPFVILDLINFRSVTIDVANKTAWVQAGATVGELYYDIARKGR 431

Query: 168  TWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLF 227
            T   PAG+ PT+G+GG+FSGGGYG+L+RK+GLAADN+IDA L+DA G+  DR SMGEDLF
Sbjct: 432  TLASPAGLGPTMGVGGHFSGGGYGILMRKHGLAADNIIDARLIDAKGRILDRASMGEDLF 491

Query: 228  WAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLF 287
            WA+RGGGG SFG+V++W +KLV VP TVT  +  RTLE+ A +LI RWQY+A+KL E+L 
Sbjct: 492  WALRGGGGNSFGVVIAWNIKLVEVPPTVTVFNVPRTLEQNATQLIHRWQYIANKLHEDLM 551

Query: 288  LGIFWTGGNGSRQGGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIES 347
            +G +    N S+  G +   A F   FL  AD  + ++N  F ELGLVK DC E SWIES
Sbjct: 552  IGTYIRRVNSSQ--GNSTIQATFSGFFLAGADRLLQLMNENFLELGLVKDDCIETSWIES 611

Query: 348  AAIAANGFQNGEEAIELETLLNR-PLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIE 407
              +  N F        LE LL+R P    + K KSD+VKEPL E+A++GI+ER   +DIE
Sbjct: 612  MIL--NRFPGN---TSLERLLDRTPPFVTNYKAKSDYVKEPLPEIALEGIFERFLEEDIE 671

Query: 408  LPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSY 467
             P++L VPYGG+M +ISES +PFPHRAGN+YKI + + W E+  +A + H+ WI+ LYSY
Sbjct: 672  TPRLLLVPYGGKMDKISESSSPFPHRAGNIYKIEHQVSWSEEGKEASERHIAWIRRLYSY 731

Query: 468  MTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPD-- 527
            MTP+VSK+PR AY+NYRDLDIG NN  G TSYKQAS+WG KYF NNF++L     + D  
Sbjct: 732  MTPYVSKNPREAYINYRDLDIGMNNLAGNTSYKQASIWGRKYFKNNFDKLVRVKTAVDPA 791

Query: 528  ----GNYSISKVIHTPINSSYSSVLDFSIRNLRFSTAETPKPLLIITPSHVSHIQAAVVC 587
                   +ISKVI+TP NSSYSS+L FSIRN RF+++E  KP +I+TP+  SHIQAA+ C
Sbjct: 792  NFFSNEQTISKVIYTPENSSYSSILHFSIRNTRFNSSEL-KPFVIVTPTDASHIQAAIHC 851

Query: 588  SKRYGFQIRTRSGGHDFEGLSYVAHLPFIIVDLINLRSISVDVKNNTAWVHSGATLGELY 647
            S+ +  +IR RSGGHDFEGLSY++ +PF+IVDLINLRSI++D  + TAWV SGATLGELY
Sbjct: 852  SQEHKLEIRIRSGGHDFEGLSYMSTVPFVIVDLINLRSITIDATDKTAWVQSGATLGELY 911

Query: 648  YSIAQKSRTLAFPAGVCPTVGVGGHFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDR 707
            Y I +KSRTLAFPAG CP +GVGGHFSGGGY  + RKYGLA+DNVIDA L+DA G+  DR
Sbjct: 912  YRIFEKSRTLAFPAGSCPMIGVGGHFSGGGYSTISRKYGLASDNVIDAQLIDAKGRILDR 971

Query: 708  ESMGEDLFWAIRGGGGGSFGIVVAWKVKLVPVPATVTLCQTNRTLEE------------- 767
            ESMGEDLFWAIRGGGG SFG+V+AWK+KLV VP  VT+    RTLE+             
Sbjct: 972  ESMGEDLFWAIRGGGGQSFGVVIAWKIKLVEVPPKVTVFTKARTLEQNATKIIHRWQYVA 1031

Query: 768  --------------------------------------DELLTILNTKFPELDLAKKDCI 827
                                                  D+LL ++   FPEL L K DC 
Sbjct: 1032 NQLPEDLIIDVMVNRVNSSEEGKSTIQAAFFSLFLGEVDQLLLLMQESFPELGLVKDDCK 1091

Query: 828  ETSWIESTVLMGIGFQTKVTLEALLSRTPLTNMSTKIKSDYVKEPISEATIQGIGERLNA 887
            E SWIES VL  + F +  +L+ALL+RTP  +   K KSDYV+EPI E   + I +R   
Sbjct: 1092 EMSWIES-VLYIVAFPSNASLDALLNRTPQPSSQFKHKSDYVQEPIPEIVFEEIWKRFFQ 1151

Query: 888  QDIESGNLIFVPYGGRMSQISESETPFSHRAGYLYKIGYIASWLDQSIDTEKRHLSWIRE 947
            +DIE      V YGG+M +ISES TPF HRAG  Y +  + SW +++ +  +RHL+WIR 
Sbjct: 1152 KDIEVPAFYMVSYGGKMDEISESSTPFPHRAGNRYILAPVVSWSEETEEASQRHLAWIRR 1211

Query: 948  LYSYMAPFVSKSPRAAYANYRDLDIGSNKRYGKTSYKQASTWGFKYFGNNFNRLVHVKTK 963
            +Y+YM P+VSK+PR AY NYRDLD+G N   G TSYKQAS WG KYF NNF+RLV VKT+
Sbjct: 1212 VYTYMTPYVSKNPRQAYVNYRDLDLGVN-NLGYTSYKQASIWGLKYFKNNFDRLVRVKTE 1271

BLAST of MS026401 vs. ExPASy TrEMBL
Match: A0A5B6W3N7 (Tetrahydrocannabinolic acid synthase-like OS=Gossypium australe OX=47621 GN=EPI10_026099 PE=3 SV=1)

HSP 1 Score: 1067.4 bits (2759), Expect = 3.6e-308
Identity = 565/1057 (53.45%), Positives = 710/1057 (67.17%), Query Frame = 0

Query: 1    MNYSAPLIPLALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSY 60
            + +S  L PL LA +L   S   GA+    ++ FLQCLS  S+D  SIS VIYT +NSSY
Sbjct: 3    LQFSMLLPPLVLAALL---SFPMGAS---PHKDFLQCLSLLSNDSTSISNVIYTRNNSSY 62

Query: 61   YSILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLS 120
              +L  +IRN +F++ + PKPL+IVTPS  SH QA++ C + HG QIRTRSGGHDYEGLS
Sbjct: 63   SFVLESTIRNLRFNSTDTPKPLVIVTPSRTSHFQATIYCARKHGLQIRTRSGGHDYEGLS 122

Query: 121  YVAYLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVG 180
            YVA +PF++VDL+N RSV VD ++  AWVQ+GA +GE+YY+IAEKSRT  F  GI  T+G
Sbjct: 123  YVAKVPFVVVDLVNFRSVDVDAENRVAWVQAGAILGEVYYRIAEKSRTLAFAGGIYHTIG 182

Query: 181  IGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGI 240
            +GGY SGGG+GLL RKYG   DNVIDA  +D NG+  DR+SMGEDLFWAIRGGGGGSFGI
Sbjct: 183  VGGYISGGGFGLLFRKYGTGGDNVIDAQFIDVNGRILDRKSMGEDLFWAIRGGGGGSFGI 242

Query: 241  VVSWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQ 300
            V++WK+ LVPVP TVT  S +RTLE+ A +LI +WQ +A +L + +   +     N ++ 
Sbjct: 243  VLAWKLILVPVPATVTAFSVSRTLEQNATQLILQWQDIAHQLPDEMNPDVTMFSINSTQD 302

Query: 301  GGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEE 360
            G KT   ALF SLFLG  DE + I+   FPELGL ++DCTE+SWIES  +  N  QN   
Sbjct: 303  GRKT-ILALFSSLFLGTIDELLPIMQQRFPELGLSRQDCTEMSWIES-ILYYNQLQNQ-- 362

Query: 361  AIELETLLNRPLTNI----SLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYG 420
               LE LLNR   ++      K+KSD+VKEP+SE A+ G++ RL+ ++     I+F+ YG
Sbjct: 363  --PLEILLNRTFRSLVGGQYYKIKSDYVKEPISETALNGLFSRLSDEEASSAIIIFMAYG 422

Query: 421  GRMSQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPR 480
            G M +I E  TPFPHRAGNLYKI Y + W+EQ     + +++W + +YSYMTPFVSK PR
Sbjct: 423  GIMDRIPEDSTPFPHRAGNLYKIYYNVNWQEQDNVNSQKYIDWARRVYSYMTPFVSKFPR 482

Query: 481  TAYVNYRDLDIGSNN-KYGKTSYKQASVWGLKYFSNNFNRL------------------- 540
             AY NYRDLDIGSNN KY  TSY QA +WG KYF NNFNRL                   
Sbjct: 483  EAYANYRDLDIGSNNVKY--TSYAQAKIWGRKYFKNNFNRLVQSLQFSILLLSLVLAALL 542

Query: 541  --------------CLFNHSPDGNYSISKVIHTPINSSYSSVLDFSIRNLRFSTAETPKP 600
                          CL   S D + +IS VI+T  N S+S+VL+ +IRNLRF++ +TPKP
Sbjct: 543  SFSMGALPHEDFLQCLSLSSNDSS-TISSVIYTRNNPSFSTVLESTIRNLRFNSTDTPKP 602

Query: 601  LLIITPSHVSHIQAAVVCSKRYGFQIRTRSGGHDFEGLSYVAHLPFIIVDLINLRSISVD 660
            L+I+TPS  SH QA + CS+++G QIRTRSGGHD+EGLSYVA +PF++VDL+N RS+ VD
Sbjct: 603  LVIVTPSRTSHFQATIYCSRKHGLQIRTRSGGHDYEGLSYVAKVPFVVVDLVNFRSVDVD 662

Query: 661  VKNNTAWVHSGATLGELYYSIAQKSRTLAFPAGVCPTVGVGGHFSGGGYGLLLRKYGLAA 720
            V+N  AWV +GA LGE+YY IA+KSRTLAF  GV  ++GVGG+ SGGG+GLL RKYG A 
Sbjct: 663  VENRVAWVQAGAILGEVYYRIAEKSRTLAFAGGVFHSIGVGGYISGGGFGLLFRKYGTAG 722

Query: 721  DNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVAWKVKLVPVPATVTLCQTN 780
            DNVIDA  +D NG+  DR+SMGEDLFWAIRGGGGGSFGIV+AWK+KLVPVPATVT    +
Sbjct: 723  DNVIDAQFIDVNGRILDRKSMGEDLFWAIRGGGGGSFGIVLAWKLKLVPVPATVTAFSVS 782

Query: 781  RTLEE---------------------------------------------------DELL 840
            RTLE+                                                   DELL
Sbjct: 783  RTLEQNATQLILRWQEIAHQLPDEMNPDLSMFSVNSTQDGRKTILASFSSLFLGTIDELL 842

Query: 841  TILNTKFPELDLAKKDCIETSWIESTVLMGIGFQTKVTLEALLSRT---PLTNMSTKIKS 900
             I+  +FP+L L+++DC E SWIES VL     Q +  LE LL+RT   P+     K+KS
Sbjct: 843  PIMQQRFPKLGLSRQDCSEMSWIES-VLYFSQLQNQ-PLEILLNRTFRNPIGGQYFKVKS 902

Query: 901  DYVKEPISEATIQGIGERLNAQDIESGNLIFVPYGGRMSQISESETPFSHRAGYLYKIGY 960
            DYVKEPISE  + G+  RL+ ++  S  ++F+ YGG M +I E  TPF HRAG LYKI Y
Sbjct: 903  DYVKEPISETALNGLFSRLSDEEASSAIIVFMAYGGIMDRIPEEATPFPHRAGNLYKIYY 962

Query: 961  IASWLDQSIDTEKRHLSWIRELYSYMAPFVSKSPRAAYANYRDLDIGSNKRYGKTSYKQA 966
              +W +Q     ++++ W R +Y+YM PFVSKSPR AYANYRDLDIGSN   G TSY QA
Sbjct: 963  NVNWQEQDNVNSQKYIDWSRRVYNYMTPFVSKSPREAYANYRDLDIGSN-NVGITSYTQA 1022

BLAST of MS026401 vs. TAIR 10
Match: AT5G44440.1 (FAD-binding Berberine family protein )

HSP 1 Score: 507.7 bits (1306), Expect = 2.1e-143
Identity = 251/497 (50.50%), Positives = 338/497 (68.01%), Query Frame = 0

Query: 25  AASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLI 84
           +A    +E FL+CLS+  +D     KVI+T  +SS++SIL+ SI+N +FS  E PKP+ I
Sbjct: 22  SAHGSNHEDFLKCLSYRMNDNTVEPKVIHTSKDSSFFSILDSSIQNPRFSVSETPKPVSI 81

Query: 85  VTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAY-LPFIIVDLINLRSVSVDTK 144
           +TP   S +Q  + C + HG  +RTRS GH YEGLSY+AY  PF ++DL NLRS+S+D  
Sbjct: 82  ITPVKASDVQTVIRCAQLHGIHVRTRSAGHCYEGLSYIAYNKPFAVIDLRNLRSISLDVD 141

Query: 145 SNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADN 204
           + T WVQ+GAT GELYY+I + +++  FPAGI PTVG+GG FSGGGYG LLRKYGLAADN
Sbjct: 142 NRTGWVQTGATAGELYYEIGKTTKSLAFPAGIHPTVGVGGQFSGGGYGTLLRKYGLAADN 201

Query: 205 VIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRT 264
           +IDA +VDA+G+  DR++MGED FWAIRGGGG SFG+++SWKVKLV VP T+T     +T
Sbjct: 202 IIDALVVDASGRILDRQAMGEDYFWAIRGGGGSSFGVILSWKVKLVDVPSTITVFKVQKT 261

Query: 265 LEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTALFFSLFLGKADEAVA 324
            ++ AV++I++WQY A K+ ++LF+       N      K    ALF  L++G  +  +A
Sbjct: 262 SKKEAVRIIKKWQYAADKVPDDLFIRTTLERSN------KNAVHALFTGLYIGPVNNLLA 321

Query: 325 ILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPLTNISLKVKSDF 384
           ++   FPELGL K+ C E+SWIES    A+ F  GE    L  L NR  T++S K K DF
Sbjct: 322 LMEEKFPELGLEKEGCEEMSWIESVLWFAD-FPKGE---SLGVLTNRERTSLSFKGKDDF 381

Query: 385 VKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFL 444
           V+EP+ E AIQ IW RL   +  L +I+  P+GG+MS+++E ETPFPHR GNLY+I Y  
Sbjct: 382 VQEPIPEAAIQEIWRRLEAPEARLGKIILTPFGGKMSEMAEYETPFPHRGGNLYEIQYVA 441

Query: 445 RWEEQSVDAEK----MHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYK 504
            W E+  D  K     +L W+  +Y +MTP+VSKSPR AYVN++D+D+G      KT Y+
Sbjct: 442 YWREEE-DKNKTETDKYLKWVDSVYEFMTPYVSKSPRGAYVNFKDMDLGMYLGKKKTKYE 501

Query: 505 QASVWGLKYFSNNFNRL 517
           +   WG+KYF NNF RL
Sbjct: 502 EGKSWGVKYFKNNFERL 507
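
Each HSP summary line reports identities and positives as matches over alignment length, followed by a percentage. As a minimal sketch (not part of the database output), the Python below parses such a line and recomputes the percentages from the raw counts. The helper names `parse_hsp_stats` and `percent` are hypothetical and introduced here only for illustration.

```python
import re

# Hypothetical helper pattern: the label of the second field is matched loosely
# ("Pos" plus any letters) so the exact spelling of the label does not matter.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos[a-z]* = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return identity and positives as (matches, length, reported_percent)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }

def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)

if __name__ == "__main__":
    line = "Identity = 251/497 (50.50%), Positives = 338/497 (68.01%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    for name, (matches, length, reported) in stats.items():
        print(name, matches, length, reported, percent(matches, length))
```

Recomputing the percentage from the counts is a cheap consistency check when these lines are extracted from a page rather than from the original BLAST output.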

BLAST of MS026401 vs. TAIR 10
Match: AT4G20820.1 (FAD-binding Berberine family protein )

HSP 1 Score: 503.4 bits (1295), Expect = 3.9e-142
Identity = 245/500 (49.00%), Positives = 342/500 (68.40%), Query Frame = 0

Query: 25  AASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSILNFSIRNFKFSTVEIPKPLLI 84
           +A+     +FLQCLS   +D   +SKVI+TP+++S+ S+L  SI+N +FS  ++PKP+LI
Sbjct: 28  SANRSNQSSFLQCLSLQLNDSNIVSKVIHTPNDTSFSSVLASSIQNQRFSAPDVPKPVLI 87

Query: 85  VTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVAYLPFIIVDLINLRSVSVDTKS 144
           +TP   S +Q+++ C +  G  IRTRSGGHDYEGLSYV + PF+I+DL NLRS++VD  +
Sbjct: 88  LTPVQPSDVQSAVKCARRFGIHIRTRSGGHDYEGLSYVTHKPFVILDLRNLRSITVDVDN 147

Query: 145 NTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIGGYFSGGGYGLLLRKYGLAADNV 204
            + WVQ+GATIGELYY+I +K+RT  FPAG+CPTVG+GG+FSGGGYG LLRK+GLAAD+V
Sbjct: 148 RSVWVQTGATIGELYYEIGKKNRTLAFPAGVCPTVGVGGHFSGGGYGTLLRKHGLAADHV 207

Query: 205 IDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVVSWKVKLVPVPVTVTFCSTNRTL 264
           IDA +VDA G+  +R  MGED FWAIRGGGG SF +V+SWK+ L+ VP TVT  +  +  
Sbjct: 208 IDARVVDARGRILERREMGEDFFWAIRGGGGSSFCVVLSWKIGLINVPSTVTVFNVTKFS 267

Query: 265 EEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGGKTNPTALFFSLFLGKADEAVAI 324
           E+ A+K+I RWQ+VA K+ ++LF+ +         Q  K    A F  L+LG     + +
Sbjct: 268 EQSALKIIHRWQFVADKVSDDLFIRVM-------LQRYKNMVRASFPGLYLGSVKNLLKM 327

Query: 325 LNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAIELETLLNRPLTNISLKVKSDFV 384
           +N  FPELGL + DCTE+SWIES    A   + GEE I +  L  R   +++ K KSDFV
Sbjct: 328 VNKEFPELGLEEDDCTEMSWIESVIWFA---ELGEEPINV--LTKRTRASLAFKAKSDFV 387

Query: 385 KEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQISESETPFPHRAGNLYKIGYFLR 444
           +EP+ + AI  +W RL   + E  Q++F P+GG+MS+I++ ETPFPHR GN+Y+I Y   
Sbjct: 388 QEPMPKTAISKLWRRLQEPEAEHAQLIFTPFGGKMSEIADYETPFPHRKGNIYEIQYLNY 447

Query: 445 WEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTAYVNYRDLDIGSNNKYGKTSYKQASVW 504
           W     D ++ ++ W++ +Y  M+ FV+KSPR AY+N RDLD+G      ++ Y++   W
Sbjct: 448 WRG---DVKEKYMRWVERVYDDMSEFVAKSPRGAYINLRDLDLGMYVGVKRSKYEEGKSW 507

Query: 505 GLKYFSNNFNRLCLFNHSPD 525
           G+KYF NNF RL     S D
Sbjct: 508 GVKYFKNNFERLVRVKTSVD 512

BLAST of MS026401 vs. TAIR 10
Match: AT5G44410.1 (FAD-binding Berberine family protein )

HSP 1 Score: 494.6 bits (1272), Expect = 1.8e-139
Identity = 252/519 (48.55%), Positives = 351/519 (67.63%), Query Frame = 0

Query: 7   LIPLALAFILV-ASSSSWGAASADK--YEAFLQCLSHHSSDGYSISKVIYTPSNSSYYSI 66
           L+ L + F+L+  S S + + SA +  +E FL+CLSH  ++    S++I+T  + SY+SI
Sbjct: 7   LLSLFIYFLLLNLSLSHFPSISAQRTNHENFLKCLSHRINE--DDSRIIHTSKDPSYFSI 66

Query: 67  LNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYVA 126
           LN SI+N +F  +E PKP+ I+TP   + +Q+++ C + HG  IRTRSGGHDYEGLSY+A
Sbjct: 67  LNSSIQNPRFFVLETPKPVSIITPVQATDVQSTIKCARLHGIHIRTRSGGHDYEGLSYMA 126

Query: 127 -YLPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVGIG 186
              PF+++DL NLRS+++D  + T WVQSGATIGELYY+I + S++  FPAG+ PTVGIG
Sbjct: 127 KSRPFVVIDLRNLRSITLDVDNRTGWVQSGATIGELYYEIGKLSKSLAFPAGLYPTVGIG 186

Query: 187 GYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGIVV 246
           G F GGGYG L+RKYGL+ADNVIDA++VDANG F DR+ MGED FWAIRGGGG SF +V+
Sbjct: 187 GQFGGGGYGTLMRKYGLSADNVIDAHIVDANGSFLDRQGMGEDFFWAIRGGGGSSFSVVL 246

Query: 247 SWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGSRQGG 306
           SWK++L+ VP  VT     +T E+ AV +I +WQY+A K+  +LF+         +    
Sbjct: 247 SWKIRLLDVPSVVTVFKVVKTSEKEAVSIINKWQYIADKVPNDLFI--------RAMLQK 306

Query: 307 KTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGEEAI 366
           +T   A F  L+LG   + +A++   FPELGL   +C E+SWIES       F  GE   
Sbjct: 307 ETEVYASFPGLYLGPVSDLLALMKDKFPELGLEIGNCREMSWIESVL----WFIKGE--- 366

Query: 367 ELETLLNRPLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRMSQI 426
            +E L  R  T+ S K K DF++EP+ + AIQ +W R    +  L +I+  P+GG+MS+I
Sbjct: 367 SMEILAKRKRTSRSFKGKDDFIEEPIPKTAIQYLWRRFEAPEARLAKIILTPFGGKMSEI 426

Query: 427 SESETPFPHRAGNLYKIGYFLRWEEQ----SVDAEKMHLNWIQELYSYMTPFVSKSPRTA 486
           +++E PFPHR GNLY+I Y   W E+      + EK +L W++ +Y +MTP+VSKSPR A
Sbjct: 427 ADNEIPFPHREGNLYEIQYLAYWSEEEDKNKTNTEK-YLRWVESVYEFMTPYVSKSPRRA 486

Query: 487 YVNYRDLDIGSNNKYG-KTSYKQASVWGLKYFSNNFNRL 517
           YVN+RD+D+G       KT Y++A VWG+KYF NNF+RL
Sbjct: 487 YVNFRDIDLGMYLGLNMKTKYEEAKVWGVKYFKNNFDRL 507

BLAST of MS026401 vs. TAIR 10
Match: AT1G30700.1 (FAD-binding Berberine family protein )

HSP 1 Score: 492.7 bits (1267), Expect = 6.9e-139
Identity = 255/531 (48.02%), Positives = 347/531 (65.35%), Query Frame = 0

Query: 1   MNYSAPLIPLALAFILVASSSSWGAASADKYEAFLQCLSHHSSDGYSISKVIYTPSNSSY 60
           M Y+  L+   + FI  +SSSS  +      E F QCL+ +S   + IS  I+   N SY
Sbjct: 1   MKYALILVLFFVVFIWQSSSSSANS------ETFTQCLTSNSDPKHPISPAIFFSGNGSY 60

Query: 61  YSILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLS 120
            S+L  +IRN +F+T   PKP LI+  +H SH+QA++ C K H  Q++ RSGGHDY+GLS
Sbjct: 61  SSVLQANIRNLRFNTTSTPKPFLIIAATHESHVQAAITCGKRHNLQMKIRSGGHDYDGLS 120

Query: 121 YVAY--LPFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPT 180
           YV Y   PF ++D+ NLRSV VD  S TAWVQ+GA +GE+YY I EKS+T  +PAGICPT
Sbjct: 121 YVTYSGKPFFVLDMFNLRSVDVDVASKTAWVQTGAILGEVYYYIWEKSKTLAYPAGICPT 180

Query: 181 VGIGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSF 240
           VG+GG+ SGGGYG ++RKYGL  DN IDA +VD NGK  DR+ MGEDL+WAI GGGGGS+
Sbjct: 181 VGVGGHISGGGYGNMMRKYGLTVDNTIDARMVDVNGKILDRKLMGEDLYWAINGGGGGSY 240

Query: 241 GIVVSWKVKLVPVPVTVTFCSTNRTLEEGAVKLIQRWQYVASKLDENLFLGIFWTGGNGS 300
           G+V+++K+ LV VP  VT    +RTLE+ A  +I RWQ VA KL + LF+       NG+
Sbjct: 241 GVVLAYKINLVEVPENVTVFRISRTLEQNATDIIHRWQQVAPKLPDELFIRTVIDVVNGT 300

Query: 301 RQGGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNG 360
               KT  T  F ++FLG     ++ILN  FPELGLV+ DCTE SWI+S     N     
Sbjct: 301 VSSQKTVRTT-FIAMFLGDTTTLLSILNRRFPELGLVRSDCTETSWIQSVLFWTNIQVGS 360

Query: 361 EEAIELETLLNRPLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGR 420
            E + L+   N+P+    LK KSD+V+EP+S   ++ IW+++   ++E+P + F PYGG 
Sbjct: 361 SETLLLQR--NQPVN--YLKRKSDYVREPISRTGLESIWKKM--IELEIPTMAFNPYGGE 420

Query: 421 MSQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEKMHLNWIQELYSYMTPFVSKSPRTA 480
           M +IS + TPFP+RAGNL+KI Y   W ++++    M L   ++LY +MTPFVSK+PR +
Sbjct: 421 MGRISSTVTPFPYRAGNLWKIQYGANWRDETLTDRYMELT--RKLYQFMTPFVSKNPRQS 480

Query: 481 YVNYRDLDIGSNNKYGK-TSYKQASVWGLKYFSNNFNRLCLFNHSPD-GNY 528
           + NYRD+D+G N+  GK +SY +   +G KYF+ NF RL       D GN+
Sbjct: 481 FFNYRDVDLGINSHNGKISSYVEGKRYGKKYFAGNFERLVKIKTRVDSGNF 516

BLAST of MS026401 vs. TAIR 10
Match: AT5G44400.1 (FAD-binding Berberine family protein )

HSP 1 Score: 485.7 bits (1249), Expect = 8.5e-137
Identity = 249/528 (47.16%), Positives = 346/528 (65.53%), Query Frame = 0

Query: 9   PLALAFILVASSSSW----GAASADKYEAFLQCLSHHSSDGYSISKVIYTPSN--SSYYS 68
           PL L  ILV   S +     ++ A   + F+ C+  ++   + + K  + P+   S +  
Sbjct: 6   PLPLFSILVLYFSLYTITPTSSLASLQDQFINCVQRNTHVYFPLEKTFFAPTKNVSMFSQ 65

Query: 69  ILNFSIRNFKFSTVEIPKPLLIVTPSHVSHIQASLICCKTHGFQIRTRSGGHDYEGLSYV 128
           +L  + +N +F    +PKP  I +P H SH+QAS+IC K     +R RSGGHDYEGLSYV
Sbjct: 66  VLESTAQNLRFLKKSMPKPGFIFSPIHESHVQASIICSKKLRMHLRVRSGGHDYEGLSYV 125

Query: 129 AYL--PFIIVDLINLRSVSVDTKSNTAWVQSGATIGELYYKIAEKSRTWTFPAGICPTVG 188
           + +  PFI++DL  +R V+++ + N+AWVQSGAT+GELYY+IAEKS+   FPAG+C ++G
Sbjct: 126 SQIDKPFILMDLSKMRQVNINIQDNSAWVQSGATVGELYYRIAEKSKVHGFPAGLCSSLG 185

Query: 189 IGGYFSGGGYGLLLRKYGLAADNVIDAYLVDANGKFHDRESMGEDLFWAIRGGGGGSFGI 248
           IGG+ +GG YG ++RKYGL ADNV+DA +VDANGK  DR +MGED FWAIRGG GGSFGI
Sbjct: 186 IGGHITGGAYGSMMRKYGLGADNVLDAKIVDANGKLLDRAAMGEDTFWAIRGGAGGSFGI 245

Query: 249 VVSWKVKLVPVPVTVTFCSTNRTLEEG-AVKLIQRWQYVASKLDENLFLGIFWTGGNGSR 308
           +++WK+KLVPVP TVT  +  +TL++    K+I +WQ VA KL E LF+ + +   N + 
Sbjct: 246 ILAWKIKLVPVPKTVTVFTVTKTLQQDVGNKIISKWQRVADKLVEELFIRVLF---NVAG 305

Query: 309 QGGKTNPTALFFSLFLGKADEAVAILNTTFPELGLVKKDCTEVSWIESAAIAANGFQNGE 368
            GG    T  + +LFLG     + ++  +FPELGL  KDC E+SW+ES A  +    +  
Sbjct: 306 TGGNKTVTTSYNALFLGGKGTLMNVMKKSFPELGLTFKDCIEMSWLESIAYISGFPTHTP 365

Query: 369 EAIELETLLNRPLTNISLKVKSDFVKEPLSELAIQGIWERLNRQDIELPQILFVPYGGRM 428
             + L+     P   +S K KSDFVK P+ E  +QGI+++L ++DI  P +++ PYGG M
Sbjct: 366 TNVLLQG--KSPFPKVSFKAKSDFVKTPIPESGLQGIFKKLLKEDI--PLMIWNPYGGMM 425

Query: 429 SQISESETPFPHRAGNLYKIGYFLRWEEQSVDAEK---MHLNWIQELYSYMTPFVSKSPR 488
           ++I ES+ PFPHR G L+K+ Y   W    +D++K    H+NWI++LYSYMTP+VS +PR
Sbjct: 426 AKIPESQIPFPHRKGVLFKVQYVTSW----LDSDKRPSRHINWIRDLYSYMTPYVSSNPR 485

Query: 489 TAYVNYRDLDIGSNNKYGKTSYKQASVWGLKYFSNNFNRLCLFNHSPD 525
            AYVNYRDLD+G N K  KT  KQA VWG  YF NNFNRL +     D
Sbjct: 486 EAYVNYRDLDLGRNTKDVKTCIKQAQVWGANYFKNNFNRLMMIKAKVD 522

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
KAA8519304.1  0.0e+00   56.86     hypothetical protein F0562_013560 [Nyssa sinensis]
RHN62809.1    0.0e+00   53.07     putative tetrahydroberberine oxidase [Medicago truncatula]
KAA8514840.1  0.0e+00   55.52     hypothetical protein F0562_018019 [Nyssa sinensis]
GAY33888.1    0.0e+00   52.21     hypothetical protein CUMW_008580 [Citrus unshiu]
KAA3475986.1  7.3e-308  53.45     tetrahydrocannabinolic acid synthase-like [Gossypium australe]

Match Name    E-value   Identity  Description
Q9FI21        2.9e-142  50.50     Berberine bridge enzyme-like 28 OS=Arabidopsis thaliana OX=3702 GN=At5g44440 PE=...
Q9SVG5        5.5e-141  49.00     Berberine bridge enzyme-like 18 OS=Arabidopsis thaliana OX=3702 GN=At4g20820 PE=...
Q9FI25        2.6e-138  48.55     Berberine bridge enzyme-like 27 OS=Arabidopsis thaliana OX=3702 GN=At5g44410 PE=...
Q33DQ2        4.4e-138  48.71     Cannabichromenic acid synthase OS=Cannabis sativa OX=3483 GN=CBCAS PE=1 SV=1
Q8GTB6        5.7e-138  48.31     Tetrahydrocannabinolic acid synthase OS=Cannabis sativa OX=3483 GN=THCAS PE=1 SV...

Match Name    E-value   Identity  Description
A0A5J4ZNA2    0.0e+00   56.86     Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_013560 PE=3 SV=1
A0A396IB28    0.0e+00   53.07     Putative tetrahydroberberine oxidase OS=Medicago truncatula OX=3880 GN=MtrunA17_...
A0A5J4Z8V5    0.0e+00   55.52     Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_018019 PE=3 SV=1
A0A6N2KFL4    7.2e-310  55.70     Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS38692 PE=3 SV=1
A0A5B6W3N7    3.6e-308  53.45     Tetrahydrocannabinolic acid synthase-like OS=Gossypium australe OX=47621 GN=EPI1...

Match Name    E-value   Identity  Description
AT5G44440.1   2.1e-143  50.50     FAD-binding Berberine family protein
AT4G20820.1   3.9e-142  49.00     FAD-binding Berberine family protein
AT5G44410.1   1.8e-139  48.55     FAD-binding Berberine family protein
AT1G30700.1   6.9e-139  48.02     FAD-binding Berberine family protein
AT5G44400.1   8.5e-137  47.16     FAD-binding Berberine family protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                          Source       Source Term      Source Description                              Alignment
IPR016169  FAD-binding, type PCMH, subdomain 2      GENE3D       3.30.465.10                                                      coord: 613..732, e-value: 2.1E-35, score: 123.1
IPR016169  FAD-binding, type PCMH, subdomain 2      GENE3D       3.30.465.10                                                      coord: 141..516, e-value: 4.1E-141, score: 472.7
IPR012951  Berberine/berberine-like                 PFAM         PF08031          BBE                                             coord: 478..516, e-value: 8.0E-7, score: 29.1
IPR012951  Berberine/berberine-like                 PFAM         PF08031          BBE                                             coord: 905..963, e-value: 1.6E-19, score: 69.7
None       No IPR available                         GENE3D       3.40.462.20                                                      coord: 733..901, e-value: 1.3E-44, score: 154.1
None       No IPR available                         GENE3D       3.40.462.20                                                      coord: 254..474, e-value: 4.1E-141, score: 472.7
None       No IPR available                         PANTHER      PTHR32448        OS08G0158400 PROTEIN                            coord: 517..748
None       No IPR available                         PANTHER      PTHR32448:SF142  CANNABIDIOLIC ACID SYNTHASE-LIKE                coord: 748..965
None       No IPR available                         PANTHER      PTHR32448:SF142  CANNABIDIOLIC ACID SYNTHASE-LIKE                coord: 8..517
None       No IPR available                         PANTHER      PTHR32448:SF142  CANNABIDIOLIC ACID SYNTHASE-LIKE                coord: 517..748
None       No IPR available                         PANTHER      PTHR32448        OS08G0158400 PROTEIN                            coord: 748..965
None       No IPR available                         PANTHER      PTHR32448        OS08G0158400 PROTEIN                            coord: 8..517
IPR016167  FAD-binding, type PCMH, subdomain 1      GENE3D       3.30.43.10                                                       coord: 520..612, e-value: 1.0E-34, score: 120.3
IPR016167  FAD-binding, type PCMH, subdomain 1      GENE3D       3.30.43.10                                                       coord: 28..135, e-value: 5.8E-42, score: 143.9
IPR006094  FAD linked oxidase, N-terminal           PFAM         PF01565          FAD_binding_4                                   coord: 562..698, e-value: 1.4E-26, score: 92.9
IPR006094  FAD linked oxidase, N-terminal           PFAM         PF01565          FAD_binding_4                                   coord: 81..217, e-value: 4.8E-26, score: 91.2
IPR016166  FAD-binding domain, PCMH-type            PROSITE      PS51387          FAD_PCMH                                        coord: 77..251, score: 19.077829
IPR016166  FAD-binding domain, PCMH-type            PROSITE      PS51387          FAD_PCMH                                        coord: 558..732, score: 18.722157
IPR036318  FAD-binding, type PCMH-like superfamily  SUPERFAMILY  56176            FAD-binding/transporter-associated domain-like  coord: 518..733
IPR036318  FAD-binding, type PCMH-like superfamily  SUPERFAMILY  56176            FAD-binding/transporter-associated domain-like  coord: 30..252
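
The alignment entries in the InterPro table above give each hit as `coord: start..end`. As a minimal sketch, assuming these are 1-based inclusive residue positions on the predicted protein (an assumption; the page does not state the convention), the Python below extracts the coordinates and the span they cover. `parse_coord` and `span_length` are hypothetical helper names.

```python
import re

# Matches the "coord: start..end" notation used in the InterPro table above.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate found in: {text!r}")
    return int(match.group(1)), int(match.group(2))

def span_length(start, end):
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

if __name__ == "__main__":
    start, end = parse_coord("coord: 613..732, e-value: 2.1E-35, score: 123.1")
    print(start, end, span_length(start, end))
```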

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MS026401.1    MS026401.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0071949      FAD binding
molecular_function  GO:0050660      flavin adenine dinucleotide binding
molecular_function  GO:0016491      oxidoreductase activity