Cla97C02G038820 (gene) Watermelon (97103) v2

NameCla97C02G038820
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionMolybdenum cofactor sulfurase
LocationCla97Chr02 : 26366484 .. 26369321 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCATCATTCACTATGGAAGCCTCTATCTCACTGTGCTGCTTTGATTATGGATAAAAAGAGTAGGAAAAAAGATGGGTCCGATTCAGCGATGGATATCAAGAAGCACAAACTGATTCTTCGAAAACTTGAAGAACATAAGCTTAGAGAAGCTCTTGAAGAAGCTTCTGAAGATGGGTCTCTCTTCAAGTCGCAAGATGTGGGCTCTGAGCCATTGCCTAACGATGATAACAATGGCTTGGGCCGATCTCGGTCGCTCGCTAGACTTCAAGCTCAGCGGGAGTTCCTCAAGGCTACAGCTATGGCGGCTGATCGTACATATGAATCTGATGATGCCATTCCTGATCTTCATGAAGCTTTCTCTAAGTTTCTCACAATGTACCCAAAGTATCAGTCCTCAGAGAAGATTGATCAGCTTCGTTCGAATGAATATTCCCATTTGATTAAGGTATGTCTTGATTACTGTGGATTTGGGCTGTTTTCTTATGTTCAGAGTCTTCATTATTGGGAGTCTTCTACATTTAGTTTGTCTGAGATTGCTGCAAATTTGAGTAATCAAGCTCTTTATGGTGGTGCTGAGAGAGGGACTGTAGAACATGATATAAAGAGTAGGATTATGGATCATTTGAACATACCTGAGCATGAGTATGGCCTTGTTTTTACTGTTAGTAGAGGGTCTGCTTTTAAACTGCTGGCTGAATCATACCCTTTCAATTCCAACAAGAAATTGTTGACTATGTTTGATTATGAGAGCCAATCTGTGAATTGGATGGCACAATGTGCTAGGGAGAAGGGTGCTAAGGCATATAGTGCTTGGTTTAAATGGCCAACTTTGAAACTTTGCTCAACCGATTTGCGGAAACAGATAACGAACAAGAAGAGGAAGAAGAAGGATTCTGTTGGTCTGTTTGTGTTTCCTGTTCAGTCTAGAGTGACTGGTGCTAAGTATTCATACCAGTGGATGGCTTTGGCACAACAGAACAATTGGCATGTGTTGCTTGACGCTGGGTCGTTAGGTCCAAAGGACATGGATTCACTTGGTTTATCGCTTTTTCGACCAGACTTCATCATAACGTCGTTTTATAGGGTTTTCGGGTATGATCCTACTGGTTTTGGATGTCTTCTGATAAAGAGATCAGTGATGGGAAGTTTACAAACTCGATCTGGATGTACTGGCTCTGGAATGGTGAAGATAACCCCTGAGTATCCCATGTACCTGAGTGACTCGATGGATGATCTTGATGGGGTGGGTCGATTTGAAGACGAGCAAGTTGCTGGTGTTGTGGATAAAACGTCTGAAACTCGTCAGGGATCACAACTGCCTGCTTTCTCTGGTGCCTTCACGTCTGCTCAGGTGAGGGATGTTTATGAAACAGAGATGGATCATGATAACAGCTCTGACAGAGATGGAACAAGCACCATACTTGAGGAAAGTGAGACTATTTCTCTTGGGGAGGTGATGAAGAGCCCGATATTCAGTGAAGATGAATCGTCGGATTGTTCAATTTGGATTGACTTAGGACAGAGTCCACTAGGATCCGATAATGCAGGTCAATCCCACAATCAGAAAATTGCTTCTCCCCTGCCTCAGCATTGGTTAAAAGGAAGGAGGAACAAGCTACTATCACCTAAGCCAACTTCTAAGATCCACAGTGAACCAACATACGACAATGACAAAGATTTTAACTTGGGGCCTTATAATGAGCAACCTGTAAGATCTTTTGATGCTGCCGTCCAGTCAGTCTGCCAGGAACTGGACTGCATCAAGGAGGTCCCCAGAGAATTATTTTCAGAAACAAGTGCCACGTCGACTAATAGCAAAAATGGCTCCAATAACAGAGTCGATACAGAGATCCATGAAGTAACAGAAGCTAGTAAACCACTTTCCAACGGTTCCTCCATGAATTCTACATTAGACAATGGATTTCATCTTGACATTTCTGCCTCCAATTTTCATTACTGTGGACTGGAAAATGGTACGACATCAGAAATATGTGCGGAGATGAAGGAGAGTGCCATTAGAAGGGAAACAGAAGGTGAATTCAGGTTGTTAGGGAGAAGGGAAGGGAATAAACATGTTGGTGGAAGGTTTTTTGGTTTGGAAGAGAGTAATATGCCAAGCCGAGGAAGACGGGTTTCTTTTAGGATGGAAGAGAATGGTAAGGAGCACTTAAACCATAACATCGAGCCCAGAGAAATATTGGTGACGAGCTTGGATGATGAAGATTATACCAGCAATGGAGAGTATGATGATGAGGAAGAGTGGAACAGGAGGGAGCCTGAGATTATATGTCGACATCTCGATCACATAAATTTGTTAGGTCTGAACAAGACAACACTTCGACTTCGATTTTTGATCAATTGGCTTGTCACATCGTTGCTTCAACTAAAGTTTCCCAGTTCAGAAGGAAGCAACAAAGTGAACCTAGTTCAGATTTATGGACCAAAAATAAAATATGAAAGGGGAGCAGCAGTAGCTTTCAATGTGAGAAACAGAAACAGGGGACTAATCAATCCAGAATTTGTTCAGAAGCTAGCTGAAAGAGATGGCATATCTCTTGGCATTGGATTCCTCAGTCACATTCGGGTTTTGGACAGCTCGAGACGGCAACATGGTGTTTTAAATCTTGAAGAATCGTCCCTGTGCAGGCAAACAAAAAATGGGAGGCGTGGGAAGCACGGATTTGCACGGCTTGAGGTCGTAACAGCTTCGTTGGGATTCTTGACGAACTTCGAAGATGTTTACAGACTGTGGGGGTTTGTGGCCAAGTTCTTAAATCCTTCCTTTATCAGAGAGGGAACACTTGCTCCTGTTGAAGAAGATTCTGAAACAACCTGA

mRNA sequence

ATGCATCATTCACTATGGAAGCCTCTATCTCACTGTGCTGCTTTGATTATGGATAAAAAGAGTAGGAAAAAAGATGGGTCCGATTCAGCGATGGATATCAAGAAGCACAAACTGATTCTTCGAAAACTTGAAGAACATAAGCTTAGAGAAGCTCTTGAAGAAGCTTCTGAAGATGGGTCTCTCTTCAAGTCGCAAGATGTGGGCTCTGAGCCATTGCCTAACGATGATAACAATGGCTTGGGCCGATCTCGGTCGCTCGCTAGACTTCAAGCTCAGCGGGAGTTCCTCAAGGCTACAGCTATGGCGGCTGATCGTACATATGAATCTGATGATGCCATTCCTGATCTTCATGAAGCTTTCTCTAAGTTTCTCACAATGTACCCAAAGTATCAGTCCTCAGAGAAGATTGATCAGCTTCGTTCGAATGAATATTCCCATTTGATTAAGGTATGTCTTGATTACTGTGGATTTGGGCTGTTTTCTTATGTTCAGAGTCTTCATTATTGGGAGTCTTCTACATTTAGTTTGTCTGAGATTGCTGCAAATTTGAGTAATCAAGCTCTTTATGGTGGTGCTGAGAGAGGGACTGTAGAACATGATATAAAGAGTAGGATTATGGATCATTTGAACATACCTGAGCATGAGTATGGCCTTGTTTTTACTGTTAGTAGAGGGTCTGCTTTTAAACTGCTGGCTGAATCATACCCTTTCAATTCCAACAAGAAATTGTTGACTATGTTTGATTATGAGAGCCAATCTGTGAATTGGATGGCACAATGTGCTAGGGAGAAGGGTGCTAAGGCATATAGTGCTTGGTTTAAATGGCCAACTTTGAAACTTTGCTCAACCGATTTGCGGAAACAGATAACGAACAAGAAGAGGAAGAAGAAGGATTCTGTTGGTCTGTTTGTGTTTCCTGTTCAGTCTAGAGTGACTGGTGCTAAGTATTCATACCAGTGGATGGCTTTGGCACAACAGAACAATTGGCATGTGTTGCTTGACGCTGGGTCGTTAGGTCCAAAGGACATGGATTCACTTGGTTTATCGCTTTTTCGACCAGACTTCATCATAACGTCGTTTTATAGGGTTTTCGGGTATGATCCTACTGGTTTTGGATGTCTTCTGATAAAGAGATCAGTGATGGGAAGTTTACAAACTCGATCTGGATGTACTGGCTCTGGAATGGTGAAGATAACCCCTGAGTATCCCATGTACCTGAGTGACTCGATGGATGATCTTGATGGGGTGGGTCGATTTGAAGACGAGCAAGTTGCTGGTGTTGTGGATAAAACGTCTGAAACTCGTCAGGGATCACAACTGCCTGCTTTCTCTGGTGCCTTCACGTCTGCTCAGGTGAGGGATGTTTATGAAACAGAGATGGATCATGATAACAGCTCTGACAGAGATGGAACAAGCACCATACTTGAGGAAAGTGAGACTATTTCTCTTGGGGAGGTGATGAAGAGCCCGATATTCAGTGAAGATGAATCGTCGGATTGTTCAATTTGGATTGACTTAGGACAGAGTCCACTAGGATCCGATAATGCAGGTCAATCCCACAATCAGAAAATTGCTTCTCCCCTGCCTCAGCATTGGTTAAAAGGAAGGAGGAACAAGCTACTATCACCTAAGCCAACTTCTAAGATCCACAGTGAACCAACATACGACAATGACAAAGATTTTAACTTGGGGCCTTATAATGAGCAACCTGTAAGATCTTTTGATGCTGCCGTCCAGTCAGTCTGCCAGGAACTGGACTGCATCAAGGAGGTCCCCAGAGAATTATTTTCAGAAACAAGTGCCACGTCGACTAATAGCAAAAATGGCTCCAATAACAGAGTCGATACAGAGATCCATGAAGTAACAGAAGCTAGTAAACCACTTTCCAACGGTTCCTCCATGAATTCTACATTAGACAATGGATTTCATCTTGACATTTCTGCCTCCAATTTTCATTACTGTGGACTGGAAAATGGTACGACATCAGAAATATGTGCGGAGATGAAGGAGAGTGCCATTAGAAGGGAAACAGAAGGTGAATTCAGGTTGTTAGGGAGAAGGGAAGGGAATAAACATGTTGGTGGAAGGTTTTTTGGTTTGGAAGAGAGTAATATGCCAAGCCGAGGAAGACGGGTTTCTTTTAGGATGGAAGAGAATGGTAAGGAGCACTTAAACCATAACATCGAGCCCAGAGAAATATTGGTGACGAGCTTGGATGATGAAGATTATACCAGCAATGGAGAGTATGATGATGAGGAAGAGTGGAACAGGAGGGAGCCTGAGATTATATGTCGACATCTCGATCACATAAATTTGTTAGGTCTGAACAAGACAACACTTCGACTTCGATTTTTGATCAATTGGCTTGTCACATCGTTGCTTCAACTAAAGTTTCCCAGTTCAGAAGGAAGCAACAAAGTGAACCTAGTTCAGATTTATGGACCAAAAATAAAATATGAAAGGGGAGCAGCAGTAGCTTTCAATGTGAGAAACAGAAACAGGGGACTAATCAATCCAGAATTTGTTCAGAAGCTAGCTGAAAGAGATGGCATATCTCTTGGCATTGGATTCCTCAGTCACATTCGGGTTTTGGACAGCTCGAGACGGCAACATGGTGTTTTAAATCTTGAAGAATCGTCCCTGTGCAGGCAAACAAAAAATGGGAGGCGTGGGAAGCACGGATTTGCACGGCTTGAGGTCGTAACAGCTTCGTTGGGATTCTTGACGAACTTCGAAGATGTTTACAGACTGTGGGGGTTTGTGGCCAAGTTCTTAAATCCTTCCTTTATCAGAGAGGGAACACTTGCTCCTGTTGAAGAAGATTCTGAAACAACCTGA

Coding sequence (CDS)

ATGCATCATTCACTATGGAAGCCTCTATCTCACTGTGCTGCTTTGATTATGGATAAAAAGAGTAGGAAAAAAGATGGGTCCGATTCAGCGATGGATATCAAGAAGCACAAACTGATTCTTCGAAAACTTGAAGAACATAAGCTTAGAGAAGCTCTTGAAGAAGCTTCTGAAGATGGGTCTCTCTTCAAGTCGCAAGATGTGGGCTCTGAGCCATTGCCTAACGATGATAACAATGGCTTGGGCCGATCTCGGTCGCTCGCTAGACTTCAAGCTCAGCGGGAGTTCCTCAAGGCTACAGCTATGGCGGCTGATCGTACATATGAATCTGATGATGCCATTCCTGATCTTCATGAAGCTTTCTCTAAGTTTCTCACAATGTACCCAAAGTATCAGTCCTCAGAGAAGATTGATCAGCTTCGTTCGAATGAATATTCCCATTTGATTAAGGTATGTCTTGATTACTGTGGATTTGGGCTGTTTTCTTATGTTCAGAGTCTTCATTATTGGGAGTCTTCTACATTTAGTTTGTCTGAGATTGCTGCAAATTTGAGTAATCAAGCTCTTTATGGTGGTGCTGAGAGAGGGACTGTAGAACATGATATAAAGAGTAGGATTATGGATCATTTGAACATACCTGAGCATGAGTATGGCCTTGTTTTTACTGTTAGTAGAGGGTCTGCTTTTAAACTGCTGGCTGAATCATACCCTTTCAATTCCAACAAGAAATTGTTGACTATGTTTGATTATGAGAGCCAATCTGTGAATTGGATGGCACAATGTGCTAGGGAGAAGGGTGCTAAGGCATATAGTGCTTGGTTTAAATGGCCAACTTTGAAACTTTGCTCAACCGATTTGCGGAAACAGATAACGAACAAGAAGAGGAAGAAGAAGGATTCTGTTGGTCTGTTTGTGTTTCCTGTTCAGTCTAGAGTGACTGGTGCTAAGTATTCATACCAGTGGATGGCTTTGGCACAACAGAACAATTGGCATGTGTTGCTTGACGCTGGGTCGTTAGGTCCAAAGGACATGGATTCACTTGGTTTATCGCTTTTTCGACCAGACTTCATCATAACGTCGTTTTATAGGGTTTTCGGGTATGATCCTACTGGTTTTGGATGTCTTCTGATAAAGAGATCAGTGATGGGAAGTTTACAAACTCGATCTGGATGTACTGGCTCTGGAATGGTGAAGATAACCCCTGAGTATCCCATGTACCTGAGTGACTCGATGGATGATCTTGATGGGGTGGGTCGATTTGAAGACGAGCAAGTTGCTGGTGTTGTGGATAAAACGTCTGAAACTCGTCAGGGATCACAACTGCCTGCTTTCTCTGGTGCCTTCACGTCTGCTCAGGTGAGGGATGTTTATGAAACAGAGATGGATCATGATAACAGCTCTGACAGAGATGGAACAAGCACCATACTTGAGGAAAGTGAGACTATTTCTCTTGGGGAGGTGATGAAGAGCCCGATATTCAGTGAAGATGAATCGTCGGATTGTTCAATTTGGATTGACTTAGGACAGAGTCCACTAGGATCCGATAATGCAGGTCAATCCCACAATCAGAAAATTGCTTCTCCCCTGCCTCAGCATTGGTTAAAAGGAAGGAGGAACAAGCTACTATCACCTAAGCCAACTTCTAAGATCCACAGTGAACCAACATACGACAATGACAAAGATTTTAACTTGGGGCCTTATAATGAGCAACCTGTAAGATCTTTTGATGCTGCCGTCCAGTCAGTCTGCCAGGAACTGGACTGCATCAAGGAGGTCCCCAGAGAATTATTTTCAGAAACAAGTGCCACGTCGACTAATAGCAAAAATGGCTCCAATAACAGAGTCGATACAGAGATCCATGAAGTAACAGAAGCTAGTAAACCACTTTCCAACGGTTCCTCCATGAATTCTACATTAGACAATGGATTTCATCTTGACATTTCTGCCTCCAATTTTCATTACTGTGGACTGGAAAATGGTACGACATCAGAAATATGTGCGGAGATGAAGGAGAGTGCCATTAGAAGGGAAACAGAAGGTGAATTCAGGTTGTTAGGGAGAAGGGAAGGGAATAAACATGTTGGTGGAAGGTTTTTTGGTTTGGAAGAGAGTAATATGCCAAGCCGAGGAAGACGGGTTTCTTTTAGGATGGAAGAGAATGGTAAGGAGCACTTAAACCATAACATCGAGCCCAGAGAAATATTGGTGACGAGCTTGGATGATGAAGATTATACCAGCAATGGAGAGTATGATGATGAGGAAGAGTGGAACAGGAGGGAGCCTGAGATTATATGTCGACATCTCGATCACATAAATTTGTTAGGTCTGAACAAGACAACACTTCGACTTCGATTTTTGATCAATTGGCTTGTCACATCGTTGCTTCAACTAAAGTTTCCCAGTTCAGAAGGAAGCAACAAAGTGAACCTAGTTCAGATTTATGGACCAAAAATAAAATATGAAAGGGGAGCAGCAGTAGCTTTCAATGTGAGAAACAGAAACAGGGGACTAATCAATCCAGAATTTGTTCAGAAGCTAGCTGAAAGAGATGGCATATCTCTTGGCATTGGATTCCTCAGTCACATTCGGGTTTTGGACAGCTCGAGACGGCAACATGGTGTTTTAAATCTTGAAGAATCGTCCCTGTGCAGGCAAACAAAAAATGGGAGGCGTGGGAAGCACGGATTTGCACGGCTTGAGGTCGTAACAGCTTCGTTGGGATTCTTGACGAACTTCGAAGATGTTTACAGACTGTGGGGGTTTGTGGCCAAGTTCTTAAATCCTTCCTTTATCAGAGAGGGAACACTTGCTCCTGTTGAAGAAGATTCTGAAACAACCTGA

Protein sequence

MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGSLFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAFSKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIAANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSNKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSVGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRFEDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNKLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSETSATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEENGKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTTLRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT
BLAST of Cla97C02G038820 vs. NCBI nr
Match: XP_008457860.1 (PREDICTED: uncharacterized protein LOC103497444 [Cucumis melo])

HSP 1 Score: 1775.0 bits (4596), Expect = 0.0e+00
Identity = 883/945 (93.44%), Postives = 916/945 (96.93%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS
Sbjct: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           L KSQDV SEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESD  IPDLHEAF
Sbjct: 61  LSKSQDVDSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDGDIPDLHEAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180
           SKFLTMYPKYQSSEKIDQLRSNEYSHL+KVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA
Sbjct: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLVKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180

Query: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSN 240
           ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFN+N
Sbjct: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNTN 240

Query: 241 KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSV 300
           KKLLTMFDYESQSVNWMAQCAR+KGAKAYSAWFKWPTLKLCSTDLRKQITNK+RKKKDSV
Sbjct: 241 KKLLTMFDYESQSVNWMAQCARDKGAKAYSAWFKWPTLKLCSTDLRKQITNKRRKKKDSV 300

Query: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360
           GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF
Sbjct: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360

Query: 361 YRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRFE 420
           YRVFGYDPTGFGCLLIK+SVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVG+FE
Sbjct: 361 YRVFGYDPTGFGCLLIKKSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGQFE 420

Query: 421 DEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET 480
           D+QVAGVVDKTSETRQGSQLPAFSGAFTSAQVRD+YETEMDHDNSSDRDGTSTILEESET
Sbjct: 421 DDQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDIYETEMDHDNSSDRDGTSTILEESET 480

Query: 481 ISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNKL 540
           ISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDN GQ + QKIASPLPQHWLKGR+NKL
Sbjct: 481 ISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNGGQLYKQKIASPLPQHWLKGRKNKL 540

Query: 541 LSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSETS 600
           LSPKPTSKIHSEPTYDN+K+FN  P +EQPV SFDAAVQSVCQELDCI+EVP +LF+ETS
Sbjct: 541 LSPKPTSKIHSEPTYDNEKEFNFRPCDEQPVLSFDAAVQSVCQELDCIEEVPGDLFAETS 600

Query: 601 ATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENGT 660
               N+K  SNNRVDTEIHEVTEASKPLSNGSS + T++NGFHLDIS S+F Y GLENGT
Sbjct: 601 TMPANTKINSNNRVDTEIHEVTEASKPLSNGSSKSYTMNNGFHLDISTSDFRYRGLENGT 660

Query: 661 TSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEENG 720
           TSEIC E+KESAIRRETEGEFRLLGRREG+KHVGGRFFGLEESNM SRGRRVSFRMEENG
Sbjct: 661 TSEICPEVKESAIRRETEGEFRLLGRREGSKHVGGRFFGLEESNMQSRGRRVSFRMEENG 720

Query: 721 KEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTTL 780
           KEHL+HNI+P E+ VTSLDD+DYTSNGEYDDEEEWNRREPEIICRHLDHIN+LGLNKTTL
Sbjct: 721 KEHLSHNIDPGEVSVTSLDDDDYTSNGEYDDEEEWNRREPEIICRHLDHINMLGLNKTTL 780

Query: 781 RLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840
           RLRFLINWLVTSLLQLKFP SEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE
Sbjct: 781 RLRFLINWLVTSLLQLKFPGSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840

Query: 841 FVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEV 900
           FVQKLAERDGISLGIGFLSHIRVLDSS+RQ+GVLNLEESSLCR+TKNGRRGKHGFARLEV
Sbjct: 841 FVQKLAERDGISLGIGFLSHIRVLDSSKRQYGVLNLEESSLCRETKNGRRGKHGFARLEV 900

Query: 901 VTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VTASLGFLTNFEDVY+LWGFVAKFLNPSFIREGTLAPVEE SETT
Sbjct: 901 VTASLGFLTNFEDVYKLWGFVAKFLNPSFIREGTLAPVEEGSETT 945

BLAST of Cla97C02G038820 vs. NCBI nr
Match: XP_004148049.1 (PREDICTED: uncharacterized protein LOC101209057 [Cucumis sativus] >KGN62047.1 hypothetical protein Csa_2G292770 [Cucumis sativus])

HSP 1 Score: 1767.7 bits (4577), Expect = 0.0e+00
Identity = 881/945 (93.23%), Postives = 914/945 (96.72%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS
Sbjct: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           LFKSQDV SEPLPNDD+NGLGRSRSLARLQAQREFLKATAMAADRTYESDD IPDLHEAF
Sbjct: 61  LFKSQDVDSEPLPNDDSNGLGRSRSLARLQAQREFLKATAMAADRTYESDDDIPDLHEAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180
           SKFLTMYPKYQSSEKIDQLRSNEYSHL+KVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA
Sbjct: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLVKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180

Query: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSN 240
           ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFN+N
Sbjct: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNTN 240

Query: 241 KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSV 300
           KKLLTMFDYESQSVNW+AQCAR+KGAKAYSAWFKWPTLKLCSTDLRKQITNK+RKKKDSV
Sbjct: 241 KKLLTMFDYESQSVNWLAQCARDKGAKAYSAWFKWPTLKLCSTDLRKQITNKRRKKKDSV 300

Query: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360
           GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF
Sbjct: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360

Query: 361 YRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRFE 420
           YRVFGYDPTGFGCLLIK+SVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGV RFE
Sbjct: 361 YRVFGYDPTGFGCLLIKKSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVSRFE 420

Query: 421 DEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET 480
           D+QVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET
Sbjct: 421 DDQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET 480

Query: 481 ISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNKL 540
           ISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDN GQ + QKIASPLPQHWLKGR+NKL
Sbjct: 481 ISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNGGQMYKQKIASPLPQHWLKGRKNKL 540

Query: 541 LSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSETS 600
           LSPKPTSKIHSEPTYDN+KDFN  P +EQPV SFDAAVQSVCQELDC++EVP+ELF+E S
Sbjct: 541 LSPKPTSKIHSEPTYDNEKDFNFRPCDEQPVLSFDAAVQSVCQELDCVEEVPKELFAEAS 600

Query: 601 ATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENGT 660
               NSK  SNNRV TEI EVTEASKPLSNGSS + T++NGFHLDIS S+F Y GLENGT
Sbjct: 601 TMPANSKIISNNRVVTEIDEVTEASKPLSNGSSKSYTVNNGFHLDISTSDFRYRGLENGT 660

Query: 661 TSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEENG 720
           TSEIC E+KESAIRRETEGEFRLLGRR+G+KHVGGRFFGLE+SNM SRGRRVSFRMEENG
Sbjct: 661 TSEICPEVKESAIRRETEGEFRLLGRRDGSKHVGGRFFGLEDSNMQSRGRRVSFRMEENG 720

Query: 721 KEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTTL 780
           KE L+HNI+P E+ VTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHIN+LGLNKTTL
Sbjct: 721 KEQLSHNIDPGEVSVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINMLGLNKTTL 780

Query: 781 RLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840
           RLRFLINWLVTSLLQLKFP SEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE
Sbjct: 781 RLRFLINWLVTSLLQLKFPGSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840

Query: 841 FVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEV 900
           FVQKLAERDGISLGIGFLSHIRVLDSS+RQ+GVLNLEESSLCR+TKNGRRGKHGFARLEV
Sbjct: 841 FVQKLAERDGISLGIGFLSHIRVLDSSKRQYGVLNLEESSLCRETKNGRRGKHGFARLEV 900

Query: 901 VTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VTASLGFLTNFEDVY+LWGFVAKFLNPSFIREGTLAPVEE SETT
Sbjct: 901 VTASLGFLTNFEDVYKLWGFVAKFLNPSFIREGTLAPVEEGSETT 945

BLAST of Cla97C02G038820 vs. NCBI nr
Match: XP_022158238.1 (uncharacterized protein LOC111024771 [Momordica charantia])

HSP 1 Score: 1624.8 bits (4206), Expect = 0.0e+00
Identity = 824/946 (87.10%), Postives = 875/946 (92.49%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MH+SLWKPLSHCAALIMDKKSRKKDGSDSA++IKK KLILRKLEEHKLREALEEASEDG 
Sbjct: 1   MHYSLWKPLSHCAALIMDKKSRKKDGSDSAIEIKKKKLILRKLEEHKLREALEEASEDGC 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           LFKSQDVGS+P+P+     LGRSRSLARLQAQREFL+ATAMAADRTYESDDAIP+LHEAF
Sbjct: 61  LFKSQDVGSDPVPS-----LGRSRSLARLQAQREFLQATAMAADRTYESDDAIPELHEAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180
           SKFLTMYPKY+SSE IDQLRSNEYSHL+KVCLDYCGFGLFSYVQ+LHYWESSTFSLSEIA
Sbjct: 121 SKFLTMYPKYESSEMIDQLRSNEYSHLMKVCLDYCGFGLFSYVQTLHYWESSTFSLSEIA 180

Query: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSN 240
           ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLL+ESYPF++N
Sbjct: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLSESYPFHTN 240

Query: 241 KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKD-S 300
           KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNK++KKKD +
Sbjct: 241 KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKRKKKKDLA 300

Query: 301 VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS 360
            GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS
Sbjct: 301 AGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS 360

Query: 361 FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRF 420
           FYRVFGYDPTGFGCLLIKRSVMGSLQT+SGCTGSGMVKITPEYPMYLSDS+DDLDG+GR 
Sbjct: 361 FYRVFGYDPTGFGCLLIKRSVMGSLQTQSGCTGSGMVKITPEYPMYLSDSIDDLDGLGRI 420

Query: 421 EDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESE 480
           ED++VAGVVD+T ETRQGSQLPAFSGAFTSAQVRDV+ETEMDH N+SDRDGTSTI EESE
Sbjct: 421 EDDEVAGVVDQTFETRQGSQLPAFSGAFTSAQVRDVFETEMDHGNNSDRDGTSTIFEESE 480

Query: 481 TISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNK 540
           TISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDNA Q + QKIASPLPQ+WL G++NK
Sbjct: 481 TISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNANQLNKQKIASPLPQYWLNGKKNK 540

Query: 541 LLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSET 600
           LLS KP SKIHS  TYD+ KDFN GPY+E  V SFDAAVQSV QELD ++EVPREL +ET
Sbjct: 541 LLSHKPNSKIHSHLTYDDHKDFNSGPYDEHRVLSFDAAVQSVYQELDSVEEVPRELSAET 600

Query: 601 SATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENG 660
           S             V TEIHEVTE  KPLSNGSS+NSTL+NGFHL  S SN         
Sbjct: 601 SXXXXXXXXXXXXXVITEIHEVTETRKPLSNGSSINSTLNNGFHL--SGSN--------- 660

Query: 661 TTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEEN 720
           +TSEIC+E+KESAIRRETEGEFRLLGRREG KHVGGR FGLEE++M SRGRRVSFRMEEN
Sbjct: 661 STSEICSEVKESAIRRETEGEFRLLGRREGTKHVGGRIFGLEETSMQSRGRRVSFRMEEN 720

Query: 721 GKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTT 780
           GKE LNHN+E  E+ VTSLD+EDYTSNGEY DEEEWNRREPEIICRHLDHIN+LGLNKTT
Sbjct: 721 GKEQLNHNVETGEVSVTSLDNEDYTSNGEYGDEEEWNRREPEIICRHLDHINMLGLNKTT 780

Query: 781 LRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINP 840
           LRLRFLINWLVTSLLQLKFP SEGSNKVNLVQIYGPKIKYERGAAVAFNVR+RNRGLINP
Sbjct: 781 LRLRFLINWLVTSLLQLKFPGSEGSNKVNLVQIYGPKIKYERGAAVAFNVRDRNRGLINP 840

Query: 841 EFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLE 900
           EFVQKLAERDGISLGIGFLSHIRVLDS RRQHGVLNLE+SSLCRQT+NGRRGK+GFARLE
Sbjct: 841 EFVQKLAERDGISLGIGFLSHIRVLDSPRRQHGVLNLEDSSLCRQTENGRRGKNGFARLE 900

Query: 901 VVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VVTASLGFLTNFEDVY+LW FVAKFLNPSFIREG LAPVEE SETT
Sbjct: 901 VVTASLGFLTNFEDVYKLWAFVAKFLNPSFIREGALAPVEEGSETT 930

BLAST of Cla97C02G038820 vs. NCBI nr
Match: XP_023513272.1 (uncharacterized protein LOC111777789 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1615.9 bits (4183), Expect = 0.0e+00
Identity = 818/946 (86.47%), Postives = 860/946 (90.91%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MHHSLWKPLSHC ALIMDKKSR KDG DSAMDIKKH++ILRKLEEHKLREALEEASEDGS
Sbjct: 1   MHHSLWKPLSHCVALIMDKKSRTKDGYDSAMDIKKHQMILRKLEEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPND-DNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEA 120
           LFKSQ+V SEPL ND D+NGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDL EA
Sbjct: 61  LFKSQNVDSEPLRNDGDDNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLREA 120

Query: 121 FSKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180
           FSKFLTMYPKYQSSEKID+LRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEI
Sbjct: 121 FSKFLTMYPKYQSSEKIDELRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180

Query: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNS 240
           AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPF++
Sbjct: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFHT 240

Query: 241 NKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDS 300
           NKKLLTMFDYESQSVNWMAQ A+EKGAKAY+AWFKWP+LKLCSTDLRK+ITNK+RKKK+S
Sbjct: 241 NKKLLTMFDYESQSVNWMAQFAKEKGAKAYNAWFKWPSLKLCSTDLRKRITNKRRKKKES 300

Query: 301 VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS 360
           VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS
Sbjct: 301 VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS 360

Query: 361 FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRF 420
           FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYP+YLSDS+DDLD VGRF
Sbjct: 361 FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPVYLSDSIDDLDQVGRF 420

Query: 421 EDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESE 480
           ED++VAGVVDKTSETRQGSQLPAFSGAFTSAQVRDV ETEMDHDN SDRDGTSTILEESE
Sbjct: 421 EDDRVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVLETEMDHDNISDRDGTSTILEESE 480

Query: 481 TISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNK 540
           TISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDNAGQ H QK+ASPLPQHWLKG++NK
Sbjct: 481 TISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNAGQLHTQKLASPLPQHWLKGKKNK 540

Query: 541 LLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSET 600
           LLSPKPTSKIHSEP+YD D DFN GPY++ PV SFDAAVQS CQE+DCIKEVPREL +ET
Sbjct: 541 LLSPKPTSKIHSEPSYDKDNDFNSGPYDDHPVLSFDAAVQSACQEIDCIKEVPRELLAET 600

Query: 601 SATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENG 660
           SA S NSK  SNN+V TEIHE TEASKPLSNG+                           
Sbjct: 601 SAMSANSKKDSNNQVVTEIHEATEASKPLSNGA--------------------------- 660

Query: 661 TTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEEN 720
             SEIC+E KESAIRRETEGEFRLLGRREGNKHV                RRVSFRME+N
Sbjct: 661 --SEICSETKESAIRRETEGEFRLLGRREGNKHV----------------RRVSFRMEDN 720

Query: 721 GKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTT 780
           G EHLNH+IEP E+ +TSLDDEDYTSNGEYDDEE WNRREPEIICRHLDHIN+LGLNKTT
Sbjct: 721 GNEHLNHSIEPGEVTMTSLDDEDYTSNGEYDDEETWNRREPEIICRHLDHINMLGLNKTT 780

Query: 781 LRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINP 840
           LRLRFLINWLVTSLLQLKF  SEG+NK NLVQIYGPKIKYERGAAVAFNVRNRNRGLINP
Sbjct: 781 LRLRFLINWLVTSLLQLKFQGSEGNNKANLVQIYGPKIKYERGAAVAFNVRNRNRGLINP 840

Query: 841 EFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLE 900
           EFVQK+AERDGISLGIGFLSHIRVLDS + Q GVLNLEESSLC+Q +NGRRG+HGFARLE
Sbjct: 841 EFVQKVAERDGISLGIGFLSHIRVLDSPKWQRGVLNLEESSLCKQAENGRRGEHGFARLE 900

Query: 901 VVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VVTASLGFLTNFEDVY+LW FVAKFLNPSFIREGTLA VEE S+TT
Sbjct: 901 VVTASLGFLTNFEDVYKLWAFVAKFLNPSFIREGTLALVEEGSQTT 901

BLAST of Cla97C02G038820 vs. NCBI nr
Match: XP_022971719.1 (uncharacterized protein LOC111470388 [Cucurbita maxima])

HSP 1 Score: 1606.3 bits (4158), Expect = 0.0e+00
Identity = 813/946 (85.94%), Postives = 855/946 (90.38%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MHHSLWKPLSHC ALIMDK+SR KDG DSAMD+ KH++ILRKLEEHKLREALEEASEDGS
Sbjct: 1   MHHSLWKPLSHCVALIMDKRSRTKDGYDSAMDVNKHQMILRKLEEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPND-DNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEA 120
           LFKSQ+V SEPL ND D NGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDL EA
Sbjct: 61  LFKSQNVDSEPLRNDGDENGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLREA 120

Query: 121 FSKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180
           FSKFLTMYPKYQSSEKID+LRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEI
Sbjct: 121 FSKFLTMYPKYQSSEKIDELRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180

Query: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNS 240
           AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPF++
Sbjct: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFHT 240

Query: 241 NKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDS 300
           NKKLLTMFDYESQSVNWMAQ A+EKGAKAY+AWFKWP+LKLCSTDLRK+ITNK+RKKK+S
Sbjct: 241 NKKLLTMFDYESQSVNWMAQFAKEKGAKAYNAWFKWPSLKLCSTDLRKRITNKRRKKKES 300

Query: 301 VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS 360
           VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS
Sbjct: 301 VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITS 360

Query: 361 FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRF 420
           FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYP+YLSDS+DDLD VGRF
Sbjct: 361 FYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPVYLSDSIDDLDQVGRF 420

Query: 421 EDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESE 480
           ED++VAGVVDKTSETRQGSQLPAFSGAFTSAQVRDV ETEMDHDN SDRDGTSTILEESE
Sbjct: 421 EDDRVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVLETEMDHDNISDRDGTSTILEESE 480

Query: 481 TISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNK 540
           TISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDNAGQ H QK+ASPLPQHWLKG++NK
Sbjct: 481 TISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNAGQLHTQKLASPLPQHWLKGKKNK 540

Query: 541 LLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSET 600
           LLSPKPTSKIHSEP+YD D DFN GPY++ PV SFDAAVQS CQELD + EVPREL +ET
Sbjct: 541 LLSPKPTSKIHSEPSYDKDNDFNSGPYDDHPVLSFDAAVQSACQELDFVDEVPRELLAET 600

Query: 601 SATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENG 660
           SA S NSK  SNNRV TEIHE TEASKPLSNG+                           
Sbjct: 601 SAMSANSKKDSNNRVVTEIHEATEASKPLSNGA--------------------------- 660

Query: 661 TTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEEN 720
             SEIC E KESAIRRETEGEFRLLGRREGNKHV                RRVSFRME+N
Sbjct: 661 --SEICPETKESAIRRETEGEFRLLGRREGNKHV----------------RRVSFRMEDN 720

Query: 721 GKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTT 780
           G EHLNH+IEP E+ +TSLDDEDYTSNGEY+DEE WNRREPEIICRHLDHIN+LGLNKTT
Sbjct: 721 GNEHLNHSIEPGEVTMTSLDDEDYTSNGEYEDEETWNRREPEIICRHLDHINMLGLNKTT 780

Query: 781 LRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINP 840
           LRLRFLINWLVTSLLQLKF  SEG+NK NLVQIYGPKIKYERGAAVAFNVRNRNRGLINP
Sbjct: 781 LRLRFLINWLVTSLLQLKFQDSEGNNKANLVQIYGPKIKYERGAAVAFNVRNRNRGLINP 840

Query: 841 EFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLE 900
           EFVQK+AERDGISLGIGFLSHIRVLDS +RQ GVLNLEE SLC+Q +NGRRG+HGFARLE
Sbjct: 841 EFVQKVAERDGISLGIGFLSHIRVLDSPKRQRGVLNLEEPSLCKQAENGRRGEHGFARLE 900

Query: 901 VVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VVTASLGFLTNFEDVY+LW FVAKFLNPSFIREGTLA VEE S+TT
Sbjct: 901 VVTASLGFLTNFEDVYKLWAFVAKFLNPSFIREGTLALVEEGSQTT 901

BLAST of Cla97C02G038820 vs. TrEMBL
Match: tr|A0A1S3C752|A0A1S3C752_CUCME (uncharacterized protein LOC103497444 OS=Cucumis melo OX=3656 GN=LOC103497444 PE=4 SV=1)

HSP 1 Score: 1775.0 bits (4596), Expect = 0.0e+00
Identity = 883/945 (93.44%), Postives = 916/945 (96.93%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS
Sbjct: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           L KSQDV SEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESD  IPDLHEAF
Sbjct: 61  LSKSQDVDSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDGDIPDLHEAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180
           SKFLTMYPKYQSSEKIDQLRSNEYSHL+KVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA
Sbjct: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLVKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180

Query: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSN 240
           ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFN+N
Sbjct: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNTN 240

Query: 241 KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSV 300
           KKLLTMFDYESQSVNWMAQCAR+KGAKAYSAWFKWPTLKLCSTDLRKQITNK+RKKKDSV
Sbjct: 241 KKLLTMFDYESQSVNWMAQCARDKGAKAYSAWFKWPTLKLCSTDLRKQITNKRRKKKDSV 300

Query: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360
           GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF
Sbjct: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360

Query: 361 YRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRFE 420
           YRVFGYDPTGFGCLLIK+SVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVG+FE
Sbjct: 361 YRVFGYDPTGFGCLLIKKSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGQFE 420

Query: 421 DEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET 480
           D+QVAGVVDKTSETRQGSQLPAFSGAFTSAQVRD+YETEMDHDNSSDRDGTSTILEESET
Sbjct: 421 DDQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDIYETEMDHDNSSDRDGTSTILEESET 480

Query: 481 ISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNKL 540
           ISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDN GQ + QKIASPLPQHWLKGR+NKL
Sbjct: 481 ISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNGGQLYKQKIASPLPQHWLKGRKNKL 540

Query: 541 LSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSETS 600
           LSPKPTSKIHSEPTYDN+K+FN  P +EQPV SFDAAVQSVCQELDCI+EVP +LF+ETS
Sbjct: 541 LSPKPTSKIHSEPTYDNEKEFNFRPCDEQPVLSFDAAVQSVCQELDCIEEVPGDLFAETS 600

Query: 601 ATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENGT 660
               N+K  SNNRVDTEIHEVTEASKPLSNGSS + T++NGFHLDIS S+F Y GLENGT
Sbjct: 601 TMPANTKINSNNRVDTEIHEVTEASKPLSNGSSKSYTMNNGFHLDISTSDFRYRGLENGT 660

Query: 661 TSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEENG 720
           TSEIC E+KESAIRRETEGEFRLLGRREG+KHVGGRFFGLEESNM SRGRRVSFRMEENG
Sbjct: 661 TSEICPEVKESAIRRETEGEFRLLGRREGSKHVGGRFFGLEESNMQSRGRRVSFRMEENG 720

Query: 721 KEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTTL 780
           KEHL+HNI+P E+ VTSLDD+DYTSNGEYDDEEEWNRREPEIICRHLDHIN+LGLNKTTL
Sbjct: 721 KEHLSHNIDPGEVSVTSLDDDDYTSNGEYDDEEEWNRREPEIICRHLDHINMLGLNKTTL 780

Query: 781 RLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840
           RLRFLINWLVTSLLQLKFP SEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE
Sbjct: 781 RLRFLINWLVTSLLQLKFPGSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840

Query: 841 FVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEV 900
           FVQKLAERDGISLGIGFLSHIRVLDSS+RQ+GVLNLEESSLCR+TKNGRRGKHGFARLEV
Sbjct: 841 FVQKLAERDGISLGIGFLSHIRVLDSSKRQYGVLNLEESSLCRETKNGRRGKHGFARLEV 900

Query: 901 VTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VTASLGFLTNFEDVY+LWGFVAKFLNPSFIREGTLAPVEE SETT
Sbjct: 901 VTASLGFLTNFEDVYKLWGFVAKFLNPSFIREGTLAPVEEGSETT 945

BLAST of Cla97C02G038820 vs. TrEMBL
Match: tr|A0A0A0LMR8|A0A0A0LMR8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G292770 PE=4 SV=1)

HSP 1 Score: 1767.7 bits (4577), Expect = 0.0e+00
Identity = 881/945 (93.23%), Postives = 914/945 (96.72%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS
Sbjct: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           LFKSQDV SEPLPNDD+NGLGRSRSLARLQAQREFLKATAMAADRTYESDD IPDLHEAF
Sbjct: 61  LFKSQDVDSEPLPNDDSNGLGRSRSLARLQAQREFLKATAMAADRTYESDDDIPDLHEAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLIKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180
           SKFLTMYPKYQSSEKIDQLRSNEYSHL+KVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA
Sbjct: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLVKVCLDYCGFGLFSYVQSLHYWESSTFSLSEIA 180

Query: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSN 240
           ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFN+N
Sbjct: 181 ANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNTN 240

Query: 241 KKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSV 300
           KKLLTMFDYESQSVNW+AQCAR+KGAKAYSAWFKWPTLKLCSTDLRKQITNK+RKKKDSV
Sbjct: 241 KKLLTMFDYESQSVNWLAQCARDKGAKAYSAWFKWPTLKLCSTDLRKQITNKRRKKKDSV 300

Query: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360
           GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF
Sbjct: 301 GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSF 360

Query: 361 YRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRFE 420
           YRVFGYDPTGFGCLLIK+SVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGV RFE
Sbjct: 361 YRVFGYDPTGFGCLLIKKSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVSRFE 420

Query: 421 DEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET 480
           D+QVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET
Sbjct: 421 DDQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEESET 480

Query: 481 ISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRNKL 540
           ISLGEVMKSP+FSEDESSDCSIWIDLGQSPLGSDN GQ + QKIASPLPQHWLKGR+NKL
Sbjct: 481 ISLGEVMKSPVFSEDESSDCSIWIDLGQSPLGSDNGGQMYKQKIASPLPQHWLKGRKNKL 540

Query: 541 LSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFSETS 600
           LSPKPTSKIHSEPTYDN+KDFN  P +EQPV SFDAAVQSVCQELDC++EVP+ELF+E S
Sbjct: 541 LSPKPTSKIHSEPTYDNEKDFNFRPCDEQPVLSFDAAVQSVCQELDCVEEVPKELFAEAS 600

Query: 601 ATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGLENGT 660
               NSK  SNNRV TEI EVTEASKPLSNGSS + T++NGFHLDIS S+F Y GLENGT
Sbjct: 601 TMPANSKIISNNRVVTEIDEVTEASKPLSNGSSKSYTVNNGFHLDISTSDFRYRGLENGT 660

Query: 661 TSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRMEENG 720
           TSEIC E+KESAIRRETEGEFRLLGRR+G+KHVGGRFFGLE+SNM SRGRRVSFRMEENG
Sbjct: 661 TSEICPEVKESAIRRETEGEFRLLGRRDGSKHVGGRFFGLEDSNMQSRGRRVSFRMEENG 720

Query: 721 KEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLNKTTL 780
           KE L+HNI+P E+ VTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHIN+LGLNKTTL
Sbjct: 721 KEQLSHNIDPGEVSVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINMLGLNKTTL 780

Query: 781 RLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840
           RLRFLINWLVTSLLQLKFP SEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE
Sbjct: 781 RLRFLINWLVTSLLQLKFPGSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPE 840

Query: 841 FVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEV 900
           FVQKLAERDGISLGIGFLSHIRVLDSS+RQ+GVLNLEESSLCR+TKNGRRGKHGFARLEV
Sbjct: 841 FVQKLAERDGISLGIGFLSHIRVLDSSKRQYGVLNLEESSLCRETKNGRRGKHGFARLEV 900

Query: 901 VTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           VTASLGFLTNFEDVY+LWGFVAKFLNPSFIREGTLAPVEE SETT
Sbjct: 901 VTASLGFLTNFEDVYKLWGFVAKFLNPSFIREGTLAPVEEGSETT 945

BLAST of Cla97C02G038820 vs. TrEMBL
Match: tr|A0A2C9USJ7|A0A2C9USJ7_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_13G130600 PE=4 SV=1)

HSP 1 Score: 1403.7 bits (3632), Expect = 0.0e+00
Identity = 715/949 (75.34%), Postives = 827/949 (87.14%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MH SLWKP+SHCAALI+DKK RKKDGS+S ++IKK+  ILRKL+E+KLREALEEASEDGS
Sbjct: 1   MHLSLWKPISHCAALILDKKGRKKDGSESNLEIKKNPSILRKLQENKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           LFKSQD+ SE L N D + LGRSRSLARL AQREFL+ATA+AA+R +E++D+IPDL EAF
Sbjct: 61  LFKSQDMESESLGNQDES-LGRSRSLARLHAQREFLRATALAAERIFETEDSIPDLREAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLI-KVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180
           SKFLTMYPKYQSSEKIDQLRS+EY+HL  KVCLDYCGFGLFSY+Q+LHYWESSTFSLSEI
Sbjct: 121 SKFLTMYPKYQSSEKIDQLRSDEYAHLTPKVCLDYCGFGLFSYLQTLHYWESSTFSLSEI 180

Query: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNS 240
            ANLSN ALYGGAE+GTVEHDIK+RIMD+LNIPEHEYGLVFTVSRGSAFKLLAESYPF++
Sbjct: 181 TANLSNHALYGGAEKGTVEHDIKTRIMDYLNIPEHEYGLVFTVSRGSAFKLLAESYPFHT 240

Query: 241 NKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDS 300
           NKKLLTMFDYESQSV+WMAQ AREKGAK YSAWFKWPTLKLCSTDLRKQI++KKR+KKDS
Sbjct: 241 NKKLLTMFDYESQSVSWMAQSAREKGAKVYSAWFKWPTLKLCSTDLRKQISSKKRRKKDS 300

Query: 301 -VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 360
            VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT
Sbjct: 301 AVGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 360

Query: 361 SFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGR 420
           SFYRVFG+DPTGFGCLLIK+SVMGSLQ +SG TGSGMVKITPEYP+YLSDS+D LD +  
Sbjct: 361 SFYRVFGHDPTGFGCLLIKKSVMGSLQNQSGSTGSGMVKITPEYPLYLSDSVDGLDRLVG 420

Query: 421 FEDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEES 480
            ED++VAG  + T+ETR GSQLPAFSGAFTSAQVRDV+ETEM+ DNSSDRDGTSTI EE+
Sbjct: 421 IEDDEVAGNAETTTETRPGSQLPAFSGAFTSAQVRDVFETEMEQDNSSDRDGTSTIFEET 480

Query: 481 ETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRN 540
           E+IS+GEVMKSP+FSEDESSD S WIDLGQSPLGSD AGQ + QK++SPLP  W  G++N
Sbjct: 481 ESISVGEVMKSPVFSEDESSDNSFWIDLGQSPLGSDTAGQLNKQKMSSPLPPFWFSGKKN 540

Query: 541 -KLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVP-RELF 600
            K LSPKPTSKI+  P YD DK  N+GP+++  + SFDAAV SV QELD +KEVP  E F
Sbjct: 541 HKRLSPKPTSKIYGSPLYD-DKGINMGPHDDHHMLSFDAAVMSVSQELDRVKEVPEEEQF 600

Query: 601 SETSATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGL 660
           ++ + T  N + GS++    EI E   +S  +S GS  NS + N  HL+ S     + GL
Sbjct: 601 ADANCTPQNGRKGSDHPHVHEIEEEPGSSNTVSVGSLSNSDV-NRSHLNNSKLAAAHHGL 660

Query: 661 ENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRM 720
            NG  S I +E+KESAIRRETEGEFRLLGRREGN++ GGRFFGLEE+  PSRGRRVSF M
Sbjct: 661 ANGLISAIGSEVKESAIRRETEGEFRLLGRREGNRYAGGRFFGLEENEHPSRGRRVSFSM 720

Query: 721 EENGKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLN 780
           E+N KEHL+H +EP E+ VTSLDD++YTS+GEY D +EW+RREPEIICRHL+H+N+LGLN
Sbjct: 721 EDNRKEHLSHTLEPGEVSVTSLDDDEYTSDGEYGDGQEWDRREPEIICRHLNHVNMLGLN 780

Query: 781 KTTLRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGL 840
           KTTLRLRFLINWLVTSLLQL+FPSS+G  +V+LV IYGPKIKYERGAAVAFN+R+RN+GL
Sbjct: 781 KTTLRLRFLINWLVTSLLQLRFPSSDGEGRVHLVHIYGPKIKYERGAAVAFNIRDRNQGL 840

Query: 841 INPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGR-RGKHGF 900
           INPE VQKLAER+GISLGIG+LSHIR+LDS ++Q G LNLE+++LC   +NG+  GK GF
Sbjct: 841 INPEVVQKLAEREGISLGIGYLSHIRILDSPKQQRGALNLEDTTLCMPMENGQNNGKSGF 900

Query: 901 ARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSET 945
            R+EVVTASLGFLTNFEDVY+LWGF++KFLNP+FI+EG+L  VEE +E+
Sbjct: 901 LRIEVVTASLGFLTNFEDVYKLWGFISKFLNPAFIKEGSLPTVEEGTES 946

BLAST of Cla97C02G038820 vs. TrEMBL
Match: tr|A0A061DG23|A0A061DG23_THECC (Pyridoxal phosphate-dependent transferases superfamily protein OS=Theobroma cacao OX=3641 GN=TCM_000523 PE=4 SV=1)

HSP 1 Score: 1402.1 bits (3628), Expect = 0.0e+00
Identity = 730/950 (76.84%), Postives = 823/950 (86.63%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MH SLWKP+SHCAALI+DKKSR++DGS+SA +IKK+  ILRKL E+KLREALEEASEDGS
Sbjct: 1   MHLSLWKPISHCAALILDKKSRRRDGSESAAEIKKNPSILRKLHENKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           LFKSQD+  + L N D + LGRSRSLARL AQREFL+ATA+AA+R +ES+D+IPD+ EAF
Sbjct: 61  LFKSQDMEPDSLGNQDES-LGRSRSLARLHAQREFLRATALAAERIFESEDSIPDVREAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLI-KVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180
           +KFLTMYPKY SSEKIDQLRS+EY+HL  KVCLDYCGFGLFSYVQ+LHYWESSTFSLSEI
Sbjct: 121 NKFLTMYPKYHSSEKIDQLRSDEYAHLSPKVCLDYCGFGLFSYVQTLHYWESSTFSLSEI 180

Query: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNS 240
            ANLSN ALYGGAE+GTVE+DIKSRIMD+LNIPEHEYGLVFTVSRGSAFKLLA+SYPF++
Sbjct: 181 TANLSNHALYGGAEKGTVEYDIKSRIMDYLNIPEHEYGLVFTVSRGSAFKLLADSYPFHT 240

Query: 241 NKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDS 300
           NKKLLTMFDYESQSVNWMAQ AREKGAK YSAWFKWPTLKLCSTDLRKQI+NKKR+KKDS
Sbjct: 241 NKKLLTMFDYESQSVNWMAQSAREKGAKVYSAWFKWPTLKLCSTDLRKQISNKKRRKKDS 300

Query: 301 -VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 360
             GLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT
Sbjct: 301 ATGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 360

Query: 361 SFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGR 420
           SFYRVFGYDPTGFGCLLIK+SVMGSLQ +SGCTGSGMVKITPEYP+YLSDS+D L   G 
Sbjct: 361 SFYRVFGYDPTGFGCLLIKKSVMGSLQNQSGCTGSGMVKITPEYPLYLSDSVDGLXXXGG 420

Query: 421 FEDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEES 480
            ED++V    DK SE+R GSQLPAFSGAFTSAQVRDV+ETEMD DNSSDRDG STI EE+
Sbjct: 421 IEDDEVGANGDKPSESRPGSQLPAFSGAFTSAQVRDVFETEMDPDNSSDRDGASTIFEET 480

Query: 481 ETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRRN 540
           E+IS+GEVMKSP+FSEDESSD S+WIDLGQSPLGSD+AGQ + QKIASPLP  W  G++N
Sbjct: 481 ESISVGEVMKSPVFSEDESSDNSLWIDLGQSPLGSDSAGQLNKQKIASPLPPFWFSGKKN 540

Query: 541 -KLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVP-RELF 600
            K LSPKPTSKI+  P YD DKD NLG +++  V SFDAAV SV QELD ++E+P  E  
Sbjct: 541 HKRLSPKPTSKIYGSPIYD-DKDVNLG-HDDHHVLSFDAAVLSVSQELDRVREIPEEEQL 600

Query: 601 SETSATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGL 660
           + T+ TS N K  S+     EI E    SKPLS GS  +S + NG  L+ ++S F   GL
Sbjct: 601 AGTNITSRNHKKTSHYSHVLEIQEEQGTSKPLSVGSVSSSAI-NGARLN-NSSVFRNNGL 660

Query: 661 ENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRM 720
            NG+TSEI +E+KESAIRRETEGEFRLLGRREGN++ GGRFFGLE+ + PSRGRRVSF M
Sbjct: 661 ANGSTSEISSEIKESAIRRETEGEFRLLGRREGNRYNGGRFFGLEDEH-PSRGRRVSFSM 720

Query: 721 EENGKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLN 780
           EE  KE L+H +EP E+ VTSLDDEDYTS+GEY D ++W+RREPEI CRHLDH+N+LGLN
Sbjct: 721 EEGRKERLSHTLEPGEVSVTSLDDEDYTSDGEYGDGQDWDRREPEITCRHLDHVNMLGLN 780

Query: 781 KTTLRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGL 840
           KTTLRLRFLINWLVTSLLQLK PSS+G  +VNLV IYGPKIKYERGAAVAFNVR++NRGL
Sbjct: 781 KTTLRLRFLINWLVTSLLQLKLPSSDGDGRVNLVHIYGPKIKYERGAAVAFNVRDKNRGL 840

Query: 841 INPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRR-GKHGF 900
           INPE VQKLAER+GISLGIGFLSHIR+LDS R+Q G LNLE+++LCR  +NGR  GK GF
Sbjct: 841 INPEIVQKLAEREGISLGIGFLSHIRILDSPRQQRGALNLEDTTLCRPMENGRHDGKSGF 900

Query: 901 ARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPV-EEDSET 945
            R+EVVTASLGFLTNFEDVY+LW FVAKFLN +FIREGTL  V EE+SET
Sbjct: 901 IRVEVVTASLGFLTNFEDVYKLWAFVAKFLNTAFIREGTLPTVAEEESET 944

BLAST of Cla97C02G038820 vs. TrEMBL
Match: tr|B9S8P3|B9S8P3_RICCO (Molybdopterin cofactor sulfurase, putative OS=Ricinus communis OX=3988 GN=RCOM_0603310 PE=4 SV=1)

HSP 1 Score: 1394.0 bits (3607), Expect = 0.0e+00
Identity = 723/953 (75.87%), Postives = 820/953 (86.04%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MH SLWKP+SHCAALI+DKKSRKKDGS+  ++IKK+  ILRKL+EHKLREALEEASEDGS
Sbjct: 1   MHLSLWKPISHCAALILDKKSRKKDGSEPNLEIKKNPSILRKLQEHKLREALEEASEDGS 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           LFKSQD+ SE L N D + LGRSRSLARL AQREFL+ATA+AA+R +ES+D+IPDLHEAF
Sbjct: 61  LFKSQDMESESLGNQDES-LGRSRSLARLHAQREFLRATALAAERIFESEDSIPDLHEAF 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLI-KVCLDYCGFGLFSYVQSLHYWESSTFSLSEI 180
           SKFLTMYPKYQSSE+IDQLRS+EY+HL  KVCLDYCGFGLFSY+Q+LHYWESSTFSLSEI
Sbjct: 121 SKFLTMYPKYQSSERIDQLRSDEYAHLCPKVCLDYCGFGLFSYLQTLHYWESSTFSLSEI 180

Query: 181 AANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNS 240
            ANLSN ALYGGAE+GTVE+DIK+RIMD+LNIPEHEYGLVFTVSRGSAFKLLAESYPF++
Sbjct: 181 TANLSNHALYGGAEKGTVEYDIKTRIMDYLNIPEHEYGLVFTVSRGSAFKLLAESYPFHT 240

Query: 241 NKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDS 300
           NKKLLTMFDYESQSVNWMAQ A+EKGAK YSAWFKWPTLKLCSTDLRKQI++KKR+KKDS
Sbjct: 241 NKKLLTMFDYESQSVNWMAQSAKEKGAKVYSAWFKWPTLKLCSTDLRKQISSKKRRKKDS 300

Query: 301 -VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 360
            VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT
Sbjct: 301 AVGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 360

Query: 361 SFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDG-VG 420
           SFYRVFGYDPTGFGCLLIK+SVMG+LQ +SG TGSGMVKITPEYPMYLSDS+DDLD  VG
Sbjct: 361 SFYRVFGYDPTGFGCLLIKKSVMGNLQNQSGSTGSGMVKITPEYPMYLSDSVDDLDRLVG 420

Query: 421 RFEDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEE 480
             +D++VA   + TSE R G QLPAFSGAFTSAQVRDV+ETEM+ DNSSDRDGTSTI EE
Sbjct: 421 NDDDDEVAANGETTSEVRPGLQLPAFSGAFTSAQVRDVFETEMEQDNSSDRDGTSTIFEE 480

Query: 481 SETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRR 540
           +E+IS+GEVMKSP+FSEDESSD S WIDLGQSPLGSD  GQ H QK+ASPLP  W  G++
Sbjct: 481 TESISVGEVMKSPVFSEDESSDNSFWIDLGQSPLGSDAGGQ-HKQKLASPLPPFWFSGKK 540

Query: 541 N-KLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVP-REL 600
           N K LSPKP+SKI+  P Y  DK  N+GP+++  V SFDAAV SV QELD +KEVP  E 
Sbjct: 541 NHKRLSPKPSSKIYGSPIY--DKGVNMGPHDDNHVLSFDAAVMSVSQELDRVKEVPEEEQ 600

Query: 601 FSETSATSTNSKNGSNNRVDTEIHEVTE---ASKPLSNGSSMNSTLDNGFHLDISASNFH 660
           F+ETS T  N++ G        IHE+ E    S PLS  S  NS ++        A+  H
Sbjct: 601 FTETSYTPRNNRMG-------HIHEIEEEPGTSDPLSASSLSNSAVNRS-----QAAGHH 660

Query: 661 YCGLENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRV 720
              L NG+TS I +EMKESAIRRETEGEFRLLGRREGN++ GGRFFGLEE+  PSRGRRV
Sbjct: 661 --SLANGSTSAIGSEMKESAIRRETEGEFRLLGRREGNRYGGGRFFGLEENEHPSRGRRV 720

Query: 721 SFRMEENGKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINL 780
           SF ME+N KE L+H +EP EI VTSLDDE+YTS+GEY D +EW+RREPEIIC+HLDH+N+
Sbjct: 721 SFSMEDNRKERLSHALEPGEISVTSLDDEEYTSDGEYGDGQEWDRREPEIICKHLDHVNM 780

Query: 781 LGLNKTTLRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNR 840
           LGLNKTTLRLRFL+NWLVTSLLQL+ P+S+G  +V LV IYGPKIKYERGAAVAFNVR+R
Sbjct: 781 LGLNKTTLRLRFLVNWLVTSLLQLRLPNSDGEGRVPLVHIYGPKIKYERGAAVAFNVRDR 840

Query: 841 NRGLINPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGR-RG 900
           NRGLINPE VQKLAER+GISLGIGFLSHIR+LDS ++Q G LNLE+++LCR  +NG+  G
Sbjct: 841 NRGLINPEVVQKLAEREGISLGIGFLSHIRILDSPKQQRGALNLEDTTLCRPMENGQHNG 900

Query: 901 KHGFARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSET 945
           K GF R+EVVTASLGFLTNFEDVY+LW FV+KFLNP+FI++G L  VEE SET
Sbjct: 901 KSGFIRVEVVTASLGFLTNFEDVYKLWAFVSKFLNPAFIKDGGLPTVEEGSET 935

BLAST of Cla97C02G038820 vs. Swiss-Prot
Match: sp|Q16P90|MOCO3_AEDAE (Molybdenum cofactor sulfurase 3 OS=Aedes aegypti OX=7159 GN=mal3 PE=3 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 3.1e-16
Identity = 79/269 (29.37%), Postives = 114/269 (42.38%), Query Frame = 0

Query: 143 EYSHLIKVC-LDYCGFGLFSYVQSLHYWESSTFSLSEIAANLSNQALYGGAERGTVEHD- 202
           E+S L + C LD+ G  L        Y +S   S+ E  A    Q LY          D 
Sbjct: 22  EFSRLKEKCYLDHAGTTL--------YADSQIRSVCEGLA----QNLYCNPHTSRTTEDL 81

Query: 203 ---IKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSNKKLLTMFDYESQSVNWM 262
              ++ R++ H N    EY L+FT    ++ KLLAESY F      + + D  +  +   
Sbjct: 82  LDQVRYRVLRHFNTRSSEYSLIFTSGTTASLKLLAESYEFAPEGAFVYLKDSHTSVLGMR 141

Query: 263 AQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSVGLFVFPVQSRVTGAKYS 322
                E+          +P  +     L K++ + +R   +   L VFP Q    G KY 
Sbjct: 142 EIVGTER---------IYPVER---EQLLKELDSSERSDSEHSSLIVFPAQCNFNGVKYP 201

Query: 323 YQWMALAQQN--------NWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGYDPT 382
            + +   Q+N         + V LDA S        L LS ++PDF+  SFY++FGY PT
Sbjct: 202 LELVRKIQRNGISGYGKERFRVCLDAASF--VSTSFLDLSKYQPDFVCLSFYKIFGY-PT 261

Query: 383 GFGCLLIKRSVMGSLQTRSGCTGSGMVKI 399
           G G LL+  +     Q R    G G VKI
Sbjct: 262 GLGALLVHHTAAD--QLRKKYYGGGTVKI 261

BLAST of Cla97C02G038820 vs. Swiss-Prot
Match: sp|Q16GH0|MOCO1_AEDAE (Molybdenum cofactor sulfurase 1 OS=Aedes aegypti OX=7159 GN=mal1 PE=3 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 4.1e-16
Identity = 86/297 (28.96%), Postives = 126/297 (42.42%), Query Frame = 0

Query: 142 NEYSHLIKVC-LDYCGFGLFSYVQSLHYWESSTFSLSEIAANLSNQALYGGAERGTVEHD 201
           NE+S L + C LD+ G  L        Y +S   S+ E  A    Q LY          D
Sbjct: 21  NEFSRLKEKCYLDHAGTTL--------YADSQIRSVCEGLA----QNLYCNPHTSRTTED 80

Query: 202 ----IKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSNKKLLTMFDYESQSVNW 261
               ++ R++ H N    EY L+FT    ++ KLLAES+ F      + + D  +  +  
Sbjct: 81  LLDQVRYRVLRHFNTRSSEYSLIFTSGTTASLKLLAESFEFAPEGAFVYLKDSHTSVLGM 140

Query: 262 MAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSVGLFVFPVQSRVTGAKY 321
                 E+          +P  +     L K++ + +R   +   L VFP Q    G KY
Sbjct: 141 REIVGTER---------IYPVER---EQLLKELDSSERSDNEHSSLIVFPAQCNFNGVKY 200

Query: 322 SYQWMALAQQN--------NWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGYDP 381
             + +   Q++         + V LDA S        L LS ++PDF+  SFY++FGY P
Sbjct: 201 PLELVRKIQRDGISGYGKERFRVCLDAASF--VSTSFLDLSKYQPDFVCLSFYKIFGY-P 260

Query: 382 TGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGRFEDEQVA 426
           TG G LL+  +     Q R    G G VKI     ++     D L  V RFED  +A
Sbjct: 261 TGLGALLVHHTAAD--QLRKKYYGGGTVKIAMAGRIF-HVKRDPL--VERFEDGTLA 285

BLAST of Cla97C02G038820 vs. Swiss-Prot
Match: sp|Q9C5X8|MOCOS_ARATH (Molybdenum cofactor sulfurase OS=Arabidopsis thaliana OX=3702 GN=ABA3 PE=1 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 9.1e-16
Identity = 76/295 (25.76%), Postives = 138/295 (46.78%), Query Frame = 0

Query: 118 EAFSKFLTMYPKYQSSEK-IDQLRSNEYSHLIK--VCLDYCGFGLFSYVQSLHYWESSTF 177
           EAF K    Y  Y    K I ++R  E+  L K  V LD+ G  L+S +Q  + ++  T 
Sbjct: 2   EAFLKEFGDYYGYPDGPKNIQEIRDTEFKRLDKGVVYLDHAGSTLYSELQMEYIFKDFT- 61

Query: 178 SLSEIAANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAES 237
             S +  N  +Q+    A    +  D + +++++ N    +Y  +FT    +A KL+ E+
Sbjct: 62  --SNVFGNPHSQSDISSATSDLIA-DARHQVLEYFNASPEDYSCLFTSGATAALKLVGET 121

Query: 238 YPFNSNKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFK------------WPTLKLCS 297
           +P+  +   L   +    SV  + + A  +GA A +   +             P++K+  
Sbjct: 122 FPWTQDSNFLYTME-NHNSVLGIREYALAQGASACAVDIEEAANQPGQLTNSGPSIKVKH 181

Query: 298 TDLRKQITNKKRKKK---DSVGLFVFPVQSRVTGAKYSYQWMALAQQN------------ 357
             ++ + T+K +K++   ++  LF FP +   +G +++   + L ++N            
Sbjct: 182 RAVQMRNTSKLQKEESRGNAYNLFAFPSECNFSGLRFNLDLVKLMKENTETVLQGSPFSK 241

Query: 358 --NWHVLLDAG---SLGPKDMDSLGLSLFRPDFIITSFYRVFGYDPTGFGCLLIK 378
              W VL+DA    +  P D     LS +  DF++ SFY++FGY PTG G LL++
Sbjct: 242 SKRWMVLIDAAKGCATLPPD-----LSEYPADFVVLSFYKLFGY-PTGLGALLVR 285

BLAST of Cla97C02G038820 vs. Swiss-Prot
Match: sp|Q8LGM7|MOCOS_SOLLC (Molybdenum cofactor sulfurase OS=Solanum lycopersicum OX=4081 GN=FLACCA PE=2 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 2.6e-15
Identity = 74/294 (25.17%), Postives = 135/294 (45.92%), Query Frame = 0

Query: 118 EAFSKFLTMYPKYQSSEK-IDQLRSNEYSHL-IKVCLDYCGFGLFSYVQSLHYWESSTFS 177
           E F K    Y  Y +S K ID++R+ E+  L   V LD+ G  L+S  Q    ++    +
Sbjct: 8   EQFLKEFGSYYGYANSPKNIDEIRATEFKRLNDTVYLDHAGATLYSESQMEAVFKDLNST 67

Query: 178 L-----SEIAANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKL 237
           L     S+   +L+ + + G A         + +++   N    EY  +FT    +A KL
Sbjct: 68  LYGNPHSQSTCSLATEDIVGKA---------RQQVLSFFNASPREYSCIFTSGATAALKL 127

Query: 238 LAESYPFNSNKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFK----------WPTLKL 297
           + E++P++SN   +   +    SV  + + A  KGA A++   +             LKL
Sbjct: 128 VGETFPWSSNSSFMYSME-NHNSVLGIREYALSKGAAAFAVDIEDTHVGESESPQSNLKL 187

Query: 298 CSTDLRKQITN---KKRKKKDSVGLFVFPVQSRVTGAKYSYQWMALAQQNN--------- 357
               ++++      K+    ++  LF FP +   +G K+    + + ++ +         
Sbjct: 188 TQHHIQRRNEGGVLKEGMTGNTYNLFAFPSECNFSGRKFDPNLIKIIKEGSERILESSQY 247

Query: 358 ----WHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGYDPTGFGCLLIKR 379
               W VL+DA      +  +  LS+F+ DF++ SFY++FGY PTG G L++++
Sbjct: 248 SRGCWLVLIDAAKGCATNPPN--LSMFKADFVVFSFYKLFGY-PTGLGALIVRK 288

BLAST of Cla97C02G038820 vs. Swiss-Prot
Match: sp|Q7QFL7|MOCOS_ANOGA (Molybdenum cofactor sulfurase OS=Anopheles gambiae OX=7165 GN=mal PE=3 SV=5)

HSP 1 Score: 82.0 bits (201), Expect = 3.8e-14
Identity = 86/303 (28.38%), Postives = 122/303 (40.26%), Query Frame = 0

Query: 135 KIDQLRSNEYSHLIKVC-LDYCGFGLFSYVQSLHYWESSTFSLSEIAANLSNQALYGGAE 194
           KI+Q    ++S L   C LD+ G  L        Y ES   ++ E+ A      LY    
Sbjct: 14  KIEQ----DFSRLADKCYLDHAGTAL--------YGESQLRAVQELLAG----GLYCNPH 73

Query: 195 RGTVEHD----IKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSNKKLLTMFDY 254
                 D    ++ R++        +Y LVFT    ++ KL+AES+ F         F Y
Sbjct: 74  TSRTMEDLIDLVRYRVLRWFQTRPADYSLVFTSGTTASLKLVAESFEFGPGDAEPGSFVY 133

Query: 255 ---ESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSVGLFVFP 314
                 SV  M +  R    +        P  +        +  + +R+      L VFP
Sbjct: 134 LRDSHTSVLGMRELVRTGRVQ--------PIERAELLQALNEPEDPRRQHPHRPSLLVFP 193

Query: 315 VQSRVTGAKYSYQWMALAQQNN--------WHVLLDAGSLGPKDMDSLGLSLFRPDFIIT 374
            Q    GAKY  +   L ++N         +HV LDA S        L LS +RP F+  
Sbjct: 194 AQCNFNGAKYPLELCELIERNGLRGYGGDAFHVCLDAAS--HVSTSPLDLSRYRPSFVCL 253

Query: 375 SFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVGR 422
           SFY++FGY PTG G LL++R     L+ +    G G VKI    P    +  D L    R
Sbjct: 254 SFYKIFGY-PTGLGALLVRRDAEPLLRGKR-YYGGGTVKIALSGPDRFHERRDALP--DR 286

BLAST of Cla97C02G038820 vs. TAIR10
Match: AT2G23520.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 1124.8 bits (2908), Expect = 0.0e+00
Identity = 610/961 (63.48%), Postives = 738/961 (76.80%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDK-KSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDG 60
           MH  LWK + HCA LI+DK KSR++DGSDS +D+++   +LRKL E KLR+ALEEASE+G
Sbjct: 1   MHFPLWKQIHHCATLILDKSKSRRRDGSDSPIDVRRKASMLRKLYEDKLRDALEEASENG 60

Query: 61  SLFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEA 120
           SLFKSQDV +E    + +  LGRSRSLARL AQREFL+ATA+AA+R +ES+D IP+L EA
Sbjct: 61  SLFKSQDVENE----NQDESLGRSRSLARLHAQREFLRATALAAERAFESEDDIPELLEA 120

Query: 121 FSKFLTMYPKYQSSEKIDQLRSNEYSHLI--KVCLDYCGFGLFSYVQSLHYWESSTFSLS 180
           F+KFLTMYPK+++SEK+DQLRS+EY HL+  KVCLDYCGFGLFSYVQ+LHYW+S TFSLS
Sbjct: 121 FNKFLTMYPKFETSEKVDQLRSDEYGHLLDSKVCLDYCGFGLFSYVQTLHYWDSCTFSLS 180

Query: 181 EIAANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPF 240
           EI ANLSN ALYGGAE GTVEHD+K+RIMD+LNIPE EYGLVFT SRGSAF+LLAESYPF
Sbjct: 181 EITANLSNHALYGGAEIGTVEHDLKTRIMDYLNIPESEYGLVFTGSRGSAFRLLAESYPF 240

Query: 241 NSNKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKK 300
           ++NK+LLTMFD+ESQSVNWMAQ AREKGAKAY+AWFKWPTLKLCSTDL+K++++KKRKKK
Sbjct: 241 HTNKRLLTMFDHESQSVNWMAQTAREKGAKAYNAWFKWPTLKLCSTDLKKRLSHKKRKKK 300

Query: 301 DS-VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFI 360
           DS VGLFVFP QSRVTG+KYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRP+FI
Sbjct: 301 DSAVGLFVFPAQSRVTGSKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPEFI 360

Query: 361 ITSFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGV 420
           ITSFY+VFG+DPTGFGCLLIK+SVMG+LQ++SG TGSG+VKITP+YP+YLSDS+D     
Sbjct: 361 ITSFYKVFGHDPTGFGCLLIKKSVMGNLQSQSGKTGSGIVKITPQYPLYLSDSIDXXXXX 420

Query: 421 GRFEDEQVAGVVDK---TSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGT-S 480
              ED  +    DK   T   R+G+Q+P FSGA+TSAQVRDV+ET++  DN+SDRDGT S
Sbjct: 421 XXLEDHDIGTNGDKPATTDAARRGAQMPVFSGAYTSAQVRDVFETDLLEDNASDRDGTSS 480

Query: 481 TILEESETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHW 540
           TI EE+E++S+GE+MKSP FSEDESSD S WIDLGQSPLGSD+AG  ++ KIASPLP  W
Sbjct: 481 TIFEENESVSVGELMKSPAFSEDESSDNSFWIDLGQSPLGSDSAGHLNHHKIASPLPPFW 540

Query: 541 LKGRRNKLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVP 600
              +R    SPKP +K +S P YD            + V SFDAAV SV QE++      
Sbjct: 541 FTSKRQ---SPKPVAKSYSSPMYDG-----------KDVLSFDAAVMSVTQEIN------ 600

Query: 601 RELFSETSATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDISASNFH 660
                  S  S N +N SNN    EI E    +     GS   S                
Sbjct: 601 -------STPSRNLRN-SNNLQIQEIQEENCGNIVYRAGSGFGS---------------- 660

Query: 661 YCGLENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRV 720
                NG++S+I ++MK++AIRRETEGEFRLLGRR      GGR  GLE+   PSRG RV
Sbjct: 661 -----NGSSSKISSDMKDNAIRRETEGEFRLLGRR----GTGGRLLGLED-EQPSRGTRV 720

Query: 721 SFRMEENGKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINL 780
           SF M     + ++H+++  E  + S+ DE   S+GE  +E++W+RREPEI+C H+DH+N+
Sbjct: 721 SFNM-----DRVSHSLDQGEASLASVYDE---SDGENPNEDDWDRREPEIVCSHIDHVNM 780

Query: 781 LGLNKTTLRLRFLINWLVTSLLQLKF--PSSEGSNK-VNLVQIYGPKIKYERGAAVAFNV 840
           LGLNKTT RLRFLINWLV SLLQLK   P S+GS++ +NLVQIYGPKIKYERGAAVAFNV
Sbjct: 781 LGLNKTTSRLRFLINWLVISLLQLKVPEPGSDGSSRYMNLVQIYGPKIKYERGAAVAFNV 840

Query: 841 RNRNRGLINPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNL-EESSLCRQTKNG 900
           +++++G ++PE V KLAER+G+SLGIG LSHIR++D  R   G   + E+SSL  Q + G
Sbjct: 841 KDKSKGFVSPEIVLKLAEREGVSLGIGILSHIRIMDLPRNHRGGARIKEDSSLHLQREAG 895

Query: 901 RR-GKHGFARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPV----EEDSE 945
           +R GK+GF R EVVTASL FL+NFEDVY+LW FVAKFLNP F REG+L  V     EDSE
Sbjct: 901 KRGGKNGFVRFEVVTASLSFLSNFEDVYKLWAFVAKFLNPGFSREGSLPTVIEEEAEDSE 895

BLAST of Cla97C02G038820 vs. TAIR10
Match: AT4G37100.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 1097.0 bits (2836), Expect = 0.0e+00
Identity = 610/966 (63.15%), Postives = 725/966 (75.05%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKS---RKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASE 60
           MH SLWK + HCA+LI+DK     R++DGSDS++++KK   ++RKL E KLREALEEASE
Sbjct: 1   MHFSLWKQIHHCASLILDKSKSSRRRRDGSDSSLNVKKKAALIRKLYEDKLREALEEASE 60

Query: 61  DGSLFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLH 120
           +GSLFKSQD+  +    + +  LGRSRSLARL AQREFL+ATA+AA+R  ES+D+IP+L 
Sbjct: 61  NGSLFKSQDIDQD----NGDGSLGRSRSLARLHAQREFLRATALAAERIIESEDSIPELR 120

Query: 121 EAFSKFLTMYPKYQSSEKIDQLRSNEYSHL----IKVCLDYCGFGLFSYVQSLHYWESST 180
           EA +KFL+MYPKYQ+SEKIDQLRS+EYSHL     KVCLDYCGFGLFSYVQ+LHYW++ T
Sbjct: 121 EALTKFLSMYPKYQASEKIDQLRSDEYSHLSSSASKVCLDYCGFGLFSYVQTLHYWDTCT 180

Query: 181 FSLSEIAANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAE 240
           FSLSEI ANLSN ALYGGAE GTVEHDIK+RIMD+LNIPE+EYGLVFTVSRGSAF+LLAE
Sbjct: 181 FSLSEITANLSNHALYGGAESGTVEHDIKTRIMDYLNIPENEYGLVFTVSRGSAFRLLAE 240

Query: 241 SYPFNSNKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKK 300
           SYPF SNK+LLTMFD+ESQSVNWMAQ AREKGAKAY+AWFKWPTLKLCSTDL+K+++ KK
Sbjct: 241 SYPFQSNKRLLTMFDHESQSVNWMAQTAREKGAKAYNAWFKWPTLKLCSTDLKKRLSYKK 300

Query: 301 RKKKDS-VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFR 360
           RKKKDS VGLFVFP QSRVTG KYSYQWMALAQQN+WHVLLDAGSLGPKDMDSLGLSLFR
Sbjct: 301 RKKKDSAVGLFVFPAQSRVTGTKYSYQWMALAQQNHWHVLLDAGSLGPKDMDSLGLSLFR 360

Query: 361 PDFIITSFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDD 420
           P+FIITSFYRVFG+DPTGFGCLLIK+SVMGSLQ++SG TGSG+VKITPEYP+YLSDS+D 
Sbjct: 361 PEFIITSFYRVFGHDPTGFGCLLIKKSVMGSLQSQSGKTGSGIVKITPEYPLYLSDSVDG 420

Query: 421 LDGVGRFEDEQVAGVVDKTSET-RQGSQLPAFSGAFTSAQVRDVYETEMDHDN-SSDRDG 480
           LDG+  FED       DKT E  R G+Q+PAFSGA+TSAQVRDV+ETE+  DN SSDRDG
Sbjct: 421 LDGLVGFEDHN----DDKTKEAHRPGTQMPAFSGAYTSAQVRDVFETELLEDNISSDRDG 480

Query: 481 T--STILEESETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPL 540
           T  +TI EE+E++S+GE+MKSP+FSEDESSD S WIDLGQSPLGSD     HN KIASPL
Sbjct: 481 TTSTTIFEETESVSVGELMKSPVFSEDESSDNSFWIDLGQSPLGSD----QHN-KIASPL 540

Query: 541 PQHWLKGRRN--KLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELD 600
           P  WL  +R            K +S P YD +            V SFDAAV SV     
Sbjct: 541 PPIWLTNKRKXXXXXXXXXIPKSYSSPLYDGN-----------DVLSFDAAVMSV----- 600

Query: 601 CIKEVPRELFSETSATSTNSKNGSNNRVDTEIHEVTEASKPLSNGSSMNSTLDNGFHLDI 660
                     +E    ST S+N  ++     + E+ E +   S  + + S          
Sbjct: 601 ----------TEHGTNSTPSRNRRSSSNHLHVQEIQEENCGHSFANGLKS---------- 660

Query: 661 SASNFHYCGLENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMP 720
                          S I +E+KESAIRRETEGEFRLLG R+G +    R  G+E+ + P
Sbjct: 661 ---------------SNISSEIKESAIRRETEGEFRLLGGRDGGR---SRLLGVEDEH-P 720

Query: 721 SRGRRVSFRMEENGKEHLNHNI-EPREILVTSLDDEDY--TSNGEYDDEE----EWNRR- 780
           S+GRRVSF M     E ++H+I EP E  + S+ DEDY  TS+ E  D+E    EW+RR 
Sbjct: 721 SKGRRVSFNM-----ERVSHSIVEPGEASLASVYDEDYINTSDVENGDDEGADDEWDRRD 780

Query: 781 -EPEIICRHLDHINLLGLNKTTLRLRFLINWLVTSLLQLKFPSSEGSNKVNLVQIYGPKI 840
            E EI+CRH+DH+N+LGLNKTT RLRFLINWLV SLLQL+ P S G   +NLVQIYGPKI
Sbjct: 781 TETEIVCRHIDHVNMLGLNKTTTRLRFLINWLVISLLQLQVPES-GGRHMNLVQIYGPKI 840

Query: 841 KYERGAAVAFNVRNRNRGLINPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLE 900
           KYERGAAVAFNVR++++G ++PE VQ+L +R+G+SLGIG LSHIR++D   R H     E
Sbjct: 841 KYERGAAVAFNVRDKSKGFVSPEIVQRLGDREGVSLGIGILSHIRIVDEKPRNHRARTKE 889

Query: 901 ESSLCRQTKNGRRGKHGFARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAP 944
           +S+L  Q +    GK+GF R EVVTASL FLTNFEDVY+LW FVAKFLNP F REG+L  
Sbjct: 901 DSALHLQNE---AGKNGFIRFEVVTASLSFLTNFEDVYKLWVFVAKFLNPGFSREGSLPT 889

BLAST of Cla97C02G038820 vs. TAIR10
Match: AT5G66950.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 1008.8 bits (2607), Expect = 2.2e-294
Identity = 561/951 (58.99%), Postives = 684/951 (71.92%), Query Frame = 0

Query: 1   MHHSLWKPLSHCAALIMDKKSRKKDGSDSAMDIKKHKLILRKLEEHKLREALEEASEDGS 60
           MH SLWKP+ HCAA ++                    +  RKL E KLREALE+ASEDG 
Sbjct: 1   MHISLWKPIYHCAAALV---XXXXXXXXXXXXXXXRDVTQRKLHESKLREALEQASEDGL 60

Query: 61  LFKSQDVGSEPLPNDDNNGLGRSRSLARLQAQREFLKATAMAADRTYESDDAIPDLHEAF 120
           L KSQD+  E    D    LGRSRSLARL AQREFL+AT++AA R +ES++ +P+L EA 
Sbjct: 61  LVKSQDMEEEDESQDQI--LGRSRSLARLNAQREFLRATSLAAQRAFESEETLPELEEAL 120

Query: 121 SKFLTMYPKYQSSEKIDQLRSNEYSHLI--KVCLDYCGFGLFSYVQSLHYWESSTFSLSE 180
           + FLTMYPKYQSSEK+D+LR++EY HL   KVCLDYCGFGLFSY+Q++HYW++ TFSLSE
Sbjct: 121 TIFLTMYPKYQSSEKVDELRNDEYFHLSLPKVCLDYCGFGLFSYLQTVHYWDTCTFSLSE 180

Query: 181 IAANLSNQALYGGAERGTVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFN 240
           I+ANLSN A+YGGAE+G++EHDIK RIMD+LNIPE+EYGLVFTVSRGSAFKLLAESYPF+
Sbjct: 181 ISANLSNHAIYGGAEKGSIEHDIKIRIMDYLNIPENEYGLVFTVSRGSAFKLLAESYPFH 240

Query: 241 SNKKLLTMFDYESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKD 300
           +NKKLLTMFD+ESQSV+WM QCA+EKGAK  SAWFKWPTL+LCS DL+K+I +KK++KKD
Sbjct: 241 TNKKLLTMFDHESQSVSWMGQCAKEKGAKVGSAWFKWPTLRLCSMDLKKEILSKKKRKKD 300

Query: 301 S-VGLFVFPVQSRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFII 360
           S  GLFVFPVQSRVTG+KYSYQWMALAQQNNWHVLLDAG+LGPKDMDSLGLSLFRPDFII
Sbjct: 301 SATGLFVFPVQSRVTGSKYSYQWMALAQQNNWHVLLDAGALGPKDMDSLGLSLFRPDFII 360

Query: 361 TSFYRVFGYDPTGFGCLLIKRSVMGSLQTRSGCTGSGMVKITPEYPMYLSDSMDDLDGVG 420
           TSFYRVFGYDPTGFGCLLIK+SV+  LQ++SG T SG+VKITPEYP+YLSDSMD L+G+ 
Sbjct: 361 TSFYRVFGYDPTGFGCLLIKKSVISCLQSQSGKTSSGIVKITPEYPLYLSDSMDGLEGLT 420

Query: 421 RFEDEQVAGVVDKTSETRQGSQLPAFSGAFTSAQVRDVYETEMDHDNSSDRDGTSTILEE 480
             +D    G+         G+QLPAFSGA+TSAQV+DV+ET+MDH+  SDRD TS + EE
Sbjct: 421 GIQDN---GIAINGDNKALGTQLPAFSGAYTSAQVQDVFETDMDHEIGSDRDNTSAVFEE 480

Query: 481 SETISLGEVMKSPIFSEDESSDCSIWIDLGQSPLGSDNAGQSHNQKIASPLPQHWLKGRR 540
           +E+IS+GE++KSP+FSEDESSD S+WIDLGQSP  SDNAG  + QK  SPL    ++   
Sbjct: 481 AESISVGELIKSPVFSEDESSDSSLWIDLGQSPADSDNAGHLNKQK--SPL---LVRKNH 540

Query: 541 NKLLSPKPTSKIHSEPTYDNDKDFNLGPYNEQPVRSFDAAVQSVCQELDCIKEVPRELFS 600
            +  SPKP SK             N G    + V SFDAAV SV        EV  E+  
Sbjct: 541 KRRSSPKPASKA------------NNGSNGGRHVLSFDAAVLSVSH------EVGEEVIE 600

Query: 601 ETSATSTNSKNGSNNRVDTEIH-EVTEASKPLSNGSSMNSTLDNGFHLDISASNFHYCGL 660
           E        +N   N++DT     VTE             T                   
Sbjct: 601 E--------ENSEMNQIDTSRRLRVTEIEXXXXXXXXXKLTAH----------------- 660

Query: 661 ENGTTSEICAEMKESAIRRETEGEFRLLGRREGNKHVGGRFFGLEESNMPSRGRRVSFRM 720
            NG++S I    K+SAIRRETEGEFRLLGRRE +++ GGR   + E   PS+ RRVSFR 
Sbjct: 661 ANGSSSGI----KDSAIRRETEGEFRLLGRREKSQYNGGRLL-VNEDEHPSK-RRVSFRS 720

Query: 721 EENGKEHLNHNIEPREILVTSLDDEDYTSNGEYDDEEEWNRREPEIICRHLDHINLLGLN 780
            ++G           E  V SL DED   +G    E + ++REPEI+CRH+DH+N+LGLN
Sbjct: 721 VDHG-----------EASVISLGDEDEEEDGSNGVEWDDDQREPEIVCRHIDHVNMLGLN 780

Query: 781 KTTLRLRFLINWLVTSLLQLKFP--SSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNR 840
           KTT RLR+LINWLVTSLLQL+ P   S+G +K NLVQIYGPKIKYERG++VAFN+R+   
Sbjct: 781 KTTSRLRYLINWLVTSLLQLRLPRSDSDGEHK-NLVQIYGPKIKYERGSSVAFNIRDLKS 840

Query: 841 GLINPEFVQKLAERDGISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHG 900
           G+++PE VQKLAER+GISLGIG+LSHI+++D+          + SS     + GR   +G
Sbjct: 841 GMVHPEIVQKLAEREGISLGIGYLSHIKIIDNRSE-------DSSSWKPVDREGR--NNG 868

Query: 901 FARLEVVTASLGFLTNFEDVYRLWGFVAKFLNPSFIREGTLAPVEEDSETT 946
           F R+EVVTASLGFLTNFEDVYRLW FVAKFL+P F ++GTL  V E+ +++
Sbjct: 901 FIRVEVVTASLGFLTNFEDVYRLWNFVAKFLSPGFAKQGTLPTVIEEDDSS 868

BLAST of Cla97C02G038820 vs. TAIR10
Match: AT5G51920.1 (Pyridoxal phosphate (PLP)-dependent transferases superfamily protein)

HSP 1 Score: 295.0 bits (754), Expect = 1.6e-79
Identity = 149/326 (45.71%), Postives = 211/326 (64.72%), Query Frame = 0

Query: 92  QREFLKAT--AMAADRTYESDDAIPDLHEAFSKFLTMYPKYQSSEKIDQLRSNEYSHL-- 151
           +R F + T   +  D  +   +++P   E+FS F+  YP Y  + KID+LRS+ Y HL  
Sbjct: 46  RRNFAQTTVSTIFPDTEFTDPNSLPSHQESFSDFIQAYPNYSDTYKIDRLRSDHYFHLGL 105

Query: 152 -IKVCLDYCGFGLFSYVQSLHY-----------WESSTFSLSEIAANLSNQALYGGAERG 211
               CLDY G GL+SY Q L+Y            ES  FS+S    NL  + L  G +  
Sbjct: 106 SHYTCLDYIGIGLYSYSQLLNYDPSTYQISSSLSESPFFSVSPKIGNLKEKLLNDGGQET 165

Query: 212 TVEHDIKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSNKKLLTMFDYESQSVN 271
             E+ +K RIM  L I E +Y +VFT +R SAF+L+AESYPFNS +KLLT++DYES++V+
Sbjct: 166 EFEYSMKRRIMGFLKISEEDYSMVFTANRTSAFRLVAESYPFNSKRKLLTVYDYESEAVS 225

Query: 272 WMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQIT-NKKRKKKDSVGLFVFPVQSRVTGA 331
            + + + ++GAK  +A F WP LKLCS+ LRK +T  K   K    G++VFP+ SRVTG+
Sbjct: 226 EINRVSEKRGAKVAAAEFSWPRLKLCSSKLRKLVTAGKNGSKTKKKGIYVFPLHSRVTGS 285

Query: 332 KYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGYDPTGFGCL 391
           +Y Y WM++AQ+N WHV++DA  LGPKDMDS GLS++ PDF++ SFY+VFG +P+GFGCL
Sbjct: 286 RYPYLWMSVAQENGWHVMIDACGLGPKDMDSFGLSIYNPDFMVCSFYKVFGENPSGFGCL 345

Query: 392 LIKRSVMGSLQTRSGCTGSGMVKITP 401
            +K+S +  L++    TG GM+ + P
Sbjct: 346 FVKKSTISILES---STGPGMINLVP 368


HSP 2 Score: 120.2 bits (300), Expect = 7.0e-27
Identity = 79/185 (42.70%), Postives = 101/185 (54.59%), Query Frame = 0

Query: 746 NGEYDDEEEWNRREPEIICRHLDHINLLGLNKTTLRLRFLINWLVTSLLQLKFPSSEGSN 805
           N    D EE       +  + LDH++ LGL  T  R R LINWLV++L +LK      S 
Sbjct: 381 NRTQTDSEETYSFSSSVEYKGLDHVDSLGLVATGNRSRCLINWLVSALYKLKH-----ST 440

Query: 806 KVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPEFVQKLAERDGISLGIGFLSHIRVLD 865
              LV+IYGPK+ + RG AVAFN+ N     I P  VQKLAE   ISLG  FL +I   +
Sbjct: 441 TSRLVKIYGPKVNFNRGPAVAFNLFNHKGEKIEPFIVQKLAECSNISLGKSFLKNILFQE 500

Query: 866 SSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEVVTASLGFLTNFEDVYRLWGFVAKFL 925
                 GV +       R  +  R       R+ V+TA+LGFL NFEDVY+LW FVA+FL
Sbjct: 501 D---YEGVKD-------RVFEKKRNRDVDEPRISVLTAALGFLANFEDVYKLWIFVARFL 550

Query: 926 NPSFI 931
           +  F+
Sbjct: 561 DSEFV 550

BLAST of Cla97C02G038820 vs. TAIR10
Match: AT4G22980.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 221.1 bits (562), Expect = 2.9e-57
Identity = 127/312 (40.71%), Postives = 184/312 (58.97%), Query Frame = 0

Query: 83  SRSLARLQAQREFLKATA----MAADRTYESDDAIPDLHEAFSKFLTMYPKYQSSEKIDQ 142
           S S++    + EF   T     +  +  + S +++P L  +F   +T +P Y  + + D 
Sbjct: 24  SHSMSEKPEELEFSVTTTGTSFLTRNTKFTSQESLPRLRTSFYDLITAFPDYLQTNQADH 83

Query: 143 LRSNEYSHLIKVCLDYCGFG----LFSYVQSLHYWESSTFSLSEIAANLSNQALYGGAER 202
           LRS EY +L         FG    LFSY Q     ES +  L+     LS + +  G E 
Sbjct: 84  LRSTEYQNLSS---SSHVFGQQQPLFSYSQFREISESES-DLNHSLLTLSCKQVSSGKEL 143

Query: 203 GTVEHD------IKSRIMDHLNIPEHEYGLVFTVSRGSAFKLLAESYPFNSNKKLLTMFD 262
            + E +      I+ RI   +N+ E EY ++ T  R SAFK++AE Y F +N  LLT+++
Sbjct: 144 LSFEEESRFQSRIRKRITSFMNLEESEYHMILTQDRSSAFKIVAELYSFKTNPNLLTVYN 203

Query: 263 YESQSVNWMAQCAREKGAKAYSAWFKWPTLKLCSTDLRKQITNKKRKKKDSVGLFVFPVQ 322
           YE ++V  M + + +KG K  SA F WP+ ++ S  L+++IT  KR+ K   GLFVFP+Q
Sbjct: 204 YEDEAVEEMIRISEKKGIKPQSAEFSWPSTEILSEKLKRRITRSKRRGKR--GLFVFPLQ 263

Query: 323 SRVTGAKYSYQWMALAQQNNWHVLLDAGSLGPKDMDSLGLSLFRPDFIITSFYRVFGY-D 380
           S VTGA YSY WM+LA+++ WHVLLD  +LG KDM++LGLSLF+PDF+I SF  V G  D
Sbjct: 264 SLVTGASYSYSWMSLARESEWHVLLDTSALGSKDMETLGLSLFQPDFLICSFTEVLGQDD 323


HSP 2 Score: 95.9 bits (237), Expect = 1.4e-19
Identity = 66/204 (32.35%), Postives = 110/204 (53.92%), Query Frame = 0

Query: 735 VTSLDDEDY----TSNGEYDDEEEWNRREPEII-CRHLDHINLLGLNKTTLRLRFLINWL 794
           +T +D ED+    TS+ E  + E   +++  +I  + LDH + LGL   + R + L  WL
Sbjct: 366 ITPVDHEDHKAASTSSSEIVEIESSVKQDKAMIEFQGLDHADSLGLILISRRSKSLTLWL 425

Query: 795 VTSLLQLKFPSSEGSNKVNLVQIYGPKIKYERGAAVAFNVRNRNRGLINPEFVQKLAERD 854
           + +L  L+ P      ++ LV++YGPK K  RG +++FN+ +     ++P  V++LAER+
Sbjct: 426 LRALRTLQHPGYH-QTEMPLVKLYGPKTKPSRGPSISFNIFDWQGEKVDPLMVERLAERE 485

Query: 855 GISLGIGFLSHIRVLDSSRRQHGVLNLEESSLCRQTKNGRRGKHGFARLEVVTASL-GFL 914
            I L   +L   R+  + RR    ++L                    RL VVT  L GF+
Sbjct: 486 KIGLRCAYLHKFRI-GNKRRSDEAVSL--------------------RLSVVTVRLGGFM 545

Query: 915 TNFEDVYRLWGFVAKFLNPSFIRE 933
           TNFEDV+++W FV++FL+  F+ +
Sbjct: 546 TNFEDVFKVWEFVSRFLDADFVEK 547

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008457860.10.0e+0093.44PREDICTED: uncharacterized protein LOC103497444 [Cucumis melo][more]
XP_004148049.10.0e+0093.23PREDICTED: uncharacterized protein LOC101209057 [Cucumis sativus] >KGN62047.1 hy... [more]
XP_022158238.10.0e+0087.10uncharacterized protein LOC111024771 [Momordica charantia][more]
XP_023513272.10.0e+0086.47uncharacterized protein LOC111777789 [Cucurbita pepo subsp. pepo][more]
XP_022971719.10.0e+0085.94uncharacterized protein LOC111470388 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3C752|A0A1S3C752_CUCME0.0e+0093.44uncharacterized protein LOC103497444 OS=Cucumis melo OX=3656 GN=LOC103497444 PE=... [more]
tr|A0A0A0LMR8|A0A0A0LMR8_CUCSA0.0e+0093.23Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G292770 PE=4 SV=1[more]
tr|A0A2C9USJ7|A0A2C9USJ7_MANES0.0e+0075.34Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_13G130600 PE=4 SV=... [more]
tr|A0A061DG23|A0A061DG23_THECC0.0e+0076.84Pyridoxal phosphate-dependent transferases superfamily protein OS=Theobroma caca... [more]
tr|B9S8P3|B9S8P3_RICCO0.0e+0075.87Molybdopterin cofactor sulfurase, putative OS=Ricinus communis OX=3988 GN=RCOM_0... [more]
Match NameE-valueIdentityDescription
sp|Q16P90|MOCO3_AEDAE3.1e-1629.37Molybdenum cofactor sulfurase 3 OS=Aedes aegypti OX=7159 GN=mal3 PE=3 SV=1[more]
sp|Q16GH0|MOCO1_AEDAE4.1e-1628.96Molybdenum cofactor sulfurase 1 OS=Aedes aegypti OX=7159 GN=mal1 PE=3 SV=1[more]
sp|Q9C5X8|MOCOS_ARATH9.1e-1625.76Molybdenum cofactor sulfurase OS=Arabidopsis thaliana OX=3702 GN=ABA3 PE=1 SV=1[more]
sp|Q8LGM7|MOCOS_SOLLC2.6e-1525.17Molybdenum cofactor sulfurase OS=Solanum lycopersicum OX=4081 GN=FLACCA PE=2 SV=... [more]
sp|Q7QFL7|MOCOS_ANOGA3.8e-1428.38Molybdenum cofactor sulfurase OS=Anopheles gambiae OX=7165 GN=mal PE=3 SV=5[more]
Match NameE-valueIdentityDescription
AT2G23520.10.0e+0063.48Pyridoxal phosphate (PLP)-dependent transferases superfamily protein[more]
AT4G37100.10.0e+0063.15Pyridoxal phosphate (PLP)-dependent transferases superfamily protein[more]
AT5G66950.12.2e-29458.99Pyridoxal phosphate (PLP)-dependent transferases superfamily protein[more]
AT5G51920.11.6e-7945.71Pyridoxal phosphate (PLP)-dependent transferases superfamily protein[more]
AT4G22980.12.9e-5740.71FUNCTIONS IN: molecular_function unknown[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003824catalytic activity
Vocabulary: INTERPRO
TermDefinition
IPR015424PyrdxlP-dep_Trfase
IPR015422PyrdxlP-dep_Trfase_dom1
IPR015421PyrdxlP-dep_Trfase_major
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003824 catalytic activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0030170 pyridoxal phosphate binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G038820.1Cla97C02G038820.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 86..106
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 541..561
NoneNo IPR availablePANTHERPTHR14237:SF31CATALYTIC/ PYRIDOXAL PHOSPHATE BINDING PROTEINcoord: 38..943
NoneNo IPR availablePANTHERPTHR14237MOLYBDOPTERIN COFACTOR SULFURASE MOSCcoord: 38..943
IPR015421Pyridoxal phosphate-dependent transferase, major domainGENE3DG3DSA:3.40.640.10coord: 173..424
e-value: 1.6E-17
score: 65.3
IPR015422Pyridoxal phosphate-dependent transferase domain 1GENE3DG3DSA:3.90.1150.10coord: 761..930
e-value: 7.3E-6
score: 27.9
IPR015424Pyridoxal phosphate-dependent transferaseSUPERFAMILYSSF53383PLP-dependent transferasescoord: 187..389