Cla97C03G059850 (gene) Watermelon (97103) v2

Name: Cla97C03G059850
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr03 : 9768102 .. 9770321 (+)
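
As a quick consistency check on the location above, the span length can be compared with the length expected for the coding sequence. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive, and it takes the protein length (739 residues) from the alignment denominators reported in the BLAST hits. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates from the Location field of this record.
gene_len = span_length(9768102, 9770321)

# Assumed protein length (739 aa), plus one stop codon, three bases per codon.
protein_len = 739
expected_cds_len = 3 * (protein_len + 1)

print(gene_len, expected_cds_len)
```

If the two values agree, the gene span is fully accounted for by the coding sequence, which is consistent with the gene, mRNA and CDS sequences in this record being identical.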
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCCACAAATTTTGTTTCTCATTTTACAGATTTCTTCCTCGCTTCTCTCCTTCTAGGTTTCTCTTTTCTACTCAATCCAACTACAAAACCCCAATAAACCCCACTTTGTTTTCGTCAAATGCGGAATCGCTACTAGCATCAATCTTTCAAGCTTGTAACGACCATTCTCTTCTTCGTCAAGGTAAACAATCTCATGCTCAGGCCATTATCAGTGGAGTTGTCCACAATGGGGATTTAGGTTCTAGGATTTTGGGCATGTATGTGCGTACTGGCAGTCTTGAGGATGCAAAGAACTTGTTTTATACTCTTCTATTGGGATGTACTTCGGCTTGGAATTGGATGATTAGGGGGTTTACAATGATGGGTCTGTGTAATTATGCTTTGTTGTTTTATTTTAAGATGTTGGGTGCTGGAGTTTCTCCTGATAAGTATACATTTCCTTATGTGATTAAATCCTGTGGTGCTTTGAACAGTGTGAAGATGGGTAAGATTGTTCATGAGACTGTTAATTTAATGGGTCTTAAGGAGGATGCCTTTGTGGGTAGTTCTTTAATTAAGTTGTATGCAGACAATGGTCAGTTGAGTGATGCACAGTATCTGTTTGATAATATTCCTCAGAAGGATTGTGTTCTGTGGAATGTTATGCTGAATGGTTATGTGAAAAATGGTGACTCTGGTAATGCCGTTAAGATCTTTTTGGAAATGAGACACAGTGAGACTAAGCCCAACTCAGTAACCTTTGCTTGTATTTTATCTGTTTGTGCCTCAGAGGCAATGCTTGACTTAGGTACTCAACTTCACGGGATTGCTGTTAGTTGTGGGCTGGAGTtggattctccagtggctaatacattgttggctatgtactcgaaatgccaatgcttacaagctgcacgtaaactgtttgatacgatgccgcaatgtgacttggtgagttggaatggaataatttctggatatgtacagaatggtttgaggagtgaggctgagcatttgTttcgggggatgatatctgcaggaataaagcccgactcaatcacttttgcaagttttctaccatgtgttaaTgagttgctgagtctcaaacattgtaaggaaattcatggttacattgtaagacatgctgtagttttggacgtgttcttaaaaagtgctctaattgatatatacttcaagtgcagggatgtggaaatggcacgaaaagtgttgtgtcaaagtagttcgttTGatactgtagtgtgtacgaccatgatttcagggtacgtgcttaatgggatgaacacagaagcattggaggtatttagatggttgctgcaagaGagaatgaagCCTACTTCGGTGACTTTTGCTAGTGTCTTTCCAGCTTTTGCTGGCTTGGCCGCTTTAAACTTGGGGAAGGAATTGCATGGTAGTATCATAAAGAATAAGCTTGATGAAAAATGTCATGTAGGCAGTGCTGTTCTGGACATGTATGCAAAATGTGGAAGATTGGATCTTGCTCGTCGAGTTTTTAACAGAATGACTGGAAAGGATGCTATTTGCTGGAACTCCATGATTACGAGTTGTTCCCAGAATGGCAGGCCAGGGGAGGCCATCGATCTTTTCCGTCAGATGGGAATGGAGGGAACTCAGTATGACTGTGTGAGCATATCTGGTGCCATATCTGCTTGTGCAAACTTACCTGCTCTTCATTATGGAAAAGAGATCCATGGCTTCATGATCAAAGGCCCTTTAAGATCTGACCTTTATGCTGAGAGTTCACTGATAGACATGTATGCTAAGTGTGGAAACTTGAACTTCTCTCGGCGGGTGTTCGACATGATGCAAGAAAAAAATGAAGTCTCATGGAATAGCATTATTTCTGCCTATGGAAACCACGGTGATTTGAAGGAGTGTCTTGCTCTATTCCATGAAATGTTGAGAAACGACATTCAGCCTGATCATGTCACCTTTCTTGGTATCATATCTGCTTGTGGCCATGCTGGCCGAGTCGATGA
AGGAATTAGATATTACCATCTCATGACAGAGGAATACGGGATCCCAGCTCGAATGGAGCACTATGCCTGTGTGGCTGATTTGTTTGGCCGTGCAGGTCGTCTGGATGAAGCATTTGAAACCATAAATAGCATGCCATTCCCTCCAGATGCTGGTGTATGGGGAACACTACTCGGGGCCTGCCATGTTCATGGAAATGTTGAGCTCGCCGAAGTGGCATAA

mRNA sequence

ATGTTCCACAAATTTTGTTTCTCATTTTACAGATTTCTTCCTCGCTTCTCTCCTTCTAGGTTTCTCTTTTCTACTCAATCCAACTACAAAACCCCAATAAACCCCACTTTGTTTTCGTCAAATGCGGAATCGCTACTAGCATCAATCTTTCAAGCTTGTAACGACCATTCTCTTCTTCGTCAAGGTAAACAATCTCATGCTCAGGCCATTATCAGTGGAGTTGTCCACAATGGGGATTTAGGTTCTAGGATTTTGGGCATGTATGTGCGTACTGGCAGTCTTGAGGATGCAAAGAACTTGTTTTATACTCTTCTATTGGGATGTACTTCGGCTTGGAATTGGATGATTAGGGGGTTTACAATGATGGGTCTGTGTAATTATGCTTTGTTGTTTTATTTTAAGATGTTGGGTGCTGGAGTTTCTCCTGATAAGTATACATTTCCTTATGTGATTAAATCCTGTGGTGCTTTGAACAGTGTGAAGATGGGTAAGATTGTTCATGAGACTGTTAATTTAATGGGTCTTAAGGAGGATGCCTTTGTGGGTAGTTCTTTAATTAAGTTGTATGCAGACAATGGTCAGTTGAGTGATGCACAGTATCTGTTTGATAATATTCCTCAGAAGGATTGTGTTCTGTGGAATGTTATGCTGAATGGTTATGTGAAAAATGGTGACTCTGGTAATGCCGTTAAGATCTTTTTGGAAATGAGACACAGTGAGACTAAGCCCAACTCAGTAACCTTTGCTTGTATTTTATCTGTTTGTGCCTCAGAGGCAATGCTTGACTTAGGTACTCAACTTCACGGGATTGCTGTTAGTTGTGGGCTGGAGTtggattctccagtggctaatacattgttggctatgtactcgaaatgccaatgcttacaagctgcacgtaaactgtttgatacgatgccgcaatgtgacttggtgagttggaatggaataatttctggatatgtacagaatggtttgaggagtgaggctgagcatttgTttcgggggatgatatctgcaggaataaagcccgactcaatcacttttgcaagttttctaccatgtgttaaTgagttgctgagtctcaaacattgtaaggaaattcatggttacattgtaagacatgctgtagttttggacgtgttcttaaaaagtgctctaattgatatatacttcaagtgcagggatgtggaaatggcacgaaaagtgttgtgtcaaagtagttcgttTGatactgtagtgtgtacgaccatgatttcagggtacgtgcttaatgggatgaacacagaagcattggaggtatttagatggttgctgcaagaGagaatgaagCCTACTTCGGTGACTTTTGCTAGTGTCTTTCCAGCTTTTGCTGGCTTGGCCGCTTTAAACTTGGGGAAGGAATTGCATGGTAGTATCATAAAGAATAAGCTTGATGAAAAATGTCATGTAGGCAGTGCTGTTCTGGACATGTATGCAAAATGTGGAAGATTGGATCTTGCTCGTCGAGTTTTTAACAGAATGACTGGAAAGGATGCTATTTGCTGGAACTCCATGATTACGAGTTGTTCCCAGAATGGCAGGCCAGGGGAGGCCATCGATCTTTTCCGTCAGATGGGAATGGAGGGAACTCAGTATGACTGTGTGAGCATATCTGGTGCCATATCTGCTTGTGCAAACTTACCTGCTCTTCATTATGGAAAAGAGATCCATGGCTTCATGATCAAAGGCCCTTTAAGATCTGACCTTTATGCTGAGAGTTCACTGATAGACATGTATGCTAAGTGTGGAAACTTGAACTTCTCTCGGCGGGTGTTCGACATGATGCAAGAAAAAAATGAAGTCTCATGGAATAGCATTATTTCTGCCTATGGAAACCACGGTGATTTGAAGGAGTGTCTTGCTCTATTCCATGAAATGTTGAGAAACGACATTCAGCCTGATCATGTCACCTTTCTTGGTATCATATCTGCTTGTGGCCATGCTGGCCGAGTCGATGA
AGGAATTAGATATTACCATCTCATGACAGAGGAATACGGGATCCCAGCTCGAATGGAGCACTATGCCTGTGTGGCTGATTTGTTTGGCCGTGCAGGTCGTCTGGATGAAGCATTTGAAACCATAAATAGCATGCCATTCCCTCCAGATGCTGGTGTATGGGGAACACTACTCGGGGCCTGCCATGTTCATGGAAATGTTGAGCTCGCCGAAGTGGCATAA

Coding sequence (CDS)

ATGTTCCACAAATTTTGTTTCTCATTTTACAGATTTCTTCCTCGCTTCTCTCCTTCTAGGTTTCTCTTTTCTACTCAATCCAACTACAAAACCCCAATAAACCCCACTTTGTTTTCGTCAAATGCGGAATCGCTACTAGCATCAATCTTTCAAGCTTGTAACGACCATTCTCTTCTTCGTCAAGGTAAACAATCTCATGCTCAGGCCATTATCAGTGGAGTTGTCCACAATGGGGATTTAGGTTCTAGGATTTTGGGCATGTATGTGCGTACTGGCAGTCTTGAGGATGCAAAGAACTTGTTTTATACTCTTCTATTGGGATGTACTTCGGCTTGGAATTGGATGATTAGGGGGTTTACAATGATGGGTCTGTGTAATTATGCTTTGTTGTTTTATTTTAAGATGTTGGGTGCTGGAGTTTCTCCTGATAAGTATACATTTCCTTATGTGATTAAATCCTGTGGTGCTTTGAACAGTGTGAAGATGGGTAAGATTGTTCATGAGACTGTTAATTTAATGGGTCTTAAGGAGGATGCCTTTGTGGGTAGTTCTTTAATTAAGTTGTATGCAGACAATGGTCAGTTGAGTGATGCACAGTATCTGTTTGATAATATTCCTCAGAAGGATTGTGTTCTGTGGAATGTTATGCTGAATGGTTATGTGAAAAATGGTGACTCTGGTAATGCCGTTAAGATCTTTTTGGAAATGAGACACAGTGAGACTAAGCCCAACTCAGTAACCTTTGCTTGTATTTTATCTGTTTGTGCCTCAGAGGCAATGCTTGACTTAGGTACTCAACTTCACGGGATTGCTGTTAGTTGTGGGCTGGAGTtggattctccagtggctaatacattgttggctatgtactcgaaatgccaatgcttacaagctgcacgtaaactgtttgatacgatgccgcaatgtgacttggtgagttggaatggaataatttctggatatgtacagaatggtttgaggagtgaggctgagcatttgTttcgggggatgatatctgcaggaataaagcccgactcaatcacttttgcaagttttctaccatgtgttaaTgagttgctgagtctcaaacattgtaaggaaattcatggttacattgtaagacatgctgtagttttggacgtgttcttaaaaagtgctctaattgatatatacttcaagtgcagggatgtggaaatggcacgaaaagtgttgtgtcaaagtagttcgttTGatactgtagtgtgtacgaccatgatttcagggtacgtgcttaatgggatgaacacagaagcattggaggtatttagatggttgctgcaagaGagaatgaagCCTACTTCGGTGACTTTTGCTAGTGTCTTTCCAGCTTTTGCTGGCTTGGCCGCTTTAAACTTGGGGAAGGAATTGCATGGTAGTATCATAAAGAATAAGCTTGATGAAAAATGTCATGTAGGCAGTGCTGTTCTGGACATGTATGCAAAATGTGGAAGATTGGATCTTGCTCGTCGAGTTTTTAACAGAATGACTGGAAAGGATGCTATTTGCTGGAACTCCATGATTACGAGTTGTTCCCAGAATGGCAGGCCAGGGGAGGCCATCGATCTTTTCCGTCAGATGGGAATGGAGGGAACTCAGTATGACTGTGTGAGCATATCTGGTGCCATATCTGCTTGTGCAAACTTACCTGCTCTTCATTATGGAAAAGAGATCCATGGCTTCATGATCAAAGGCCCTTTAAGATCTGACCTTTATGCTGAGAGTTCACTGATAGACATGTATGCTAAGTGTGGAAACTTGAACTTCTCTCGGCGGGTGTTCGACATGATGCAAGAAAAAAATGAAGTCTCATGGAATAGCATTATTTCTGCCTATGGAAACCACGGTGATTTGAAGGAGTGTCTTGCTCTATTCCATGAAATGTTGAGAAACGACATTCAGCCTGATCATGTCACCTTTCTTGGTATCATATCTGCTTGTGGCCATGCTGGCCGAGTCGATGA
AGGAATTAGATATTACCATCTCATGACAGAGGAATACGGGATCCCAGCTCGAATGGAGCACTATGCCTGTGTGGCTGATTTGTTTGGCCGTGCAGGTCGTCTGGATGAAGCATTTGAAACCATAAATAGCATGCCATTCCCTCCAGATGCTGGTGTATGGGGAACACTACTCGGGGCCTGCCATGTTCATGGAAATGTTGAGCTCGCCGAAGTGGCATAA

Protein sequence

MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLRQGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA
BLAST of Cla97C03G059850 vs. NCBI nr
Match: XP_008459124.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Cucumis melo])

HSP 1 Score: 1357.0 bits (3511), Expect = 0.0e+00
Identity = 662/739 (89.58%), Positives = 691/739 (93.50%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KFC S   FL  FSP RFLFSTQSN+KTP   TLFSSNAES+LASIF ACNDH+LL 
Sbjct: 1   MFYKFCSSSCTFLSHFSPPRFLFSTQSNFKTP--TTLFSSNAESVLASIFHACNDHTLLP 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           Q KQSHAQ I+ G+  NG LG R+LGMYVR GSL+DAKNLFYTL LGCTSAWNWMIRGFT
Sbjct: 61  QAKQSHAQTIVRGLAQNGYLGPRVLGMYVRAGSLKDAKNLFYTLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           MMG  NYALLFY KMLGAGVSPDKYTFPYV+K+C  L SVKMGKIVHETVNLMGL ED F
Sbjct: 121 MMGQFNYALLFYLKMLGAGVSPDKYTFPYVVKACCGLKSVKMGKIVHETVNLMGLMEDVF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG LSDAQYLFDNIPQKD VLWNVMLNGYVKNGDSGNA++IFLEMRHSE
Sbjct: 181 VGSSLIKLYAENGHLSDAQYLFDNIPQKDSVLWNVMLNGYVKNGDSGNAIEIFLEMRHSE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR
Sbjct: 241 IKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFD MPQ DLVSWNGIISGYVQNGL SEAE+LFRGMISAGIKPDSITFASFLPCV+ELL
Sbjct: 301 KLFDRMPQSDLVSWNGIISGYVQNGLMSEAENLFRGMISAGIKPDSITFASFLPCVSELL 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SLKHCKEIHGYIVRHAVVLDVFLKSALIDIY KCR+++MA+K+LCQSSSFDTVVCTTMIS
Sbjct: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYCKCRNMKMAQKILCQSSSFDTVVCTTMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNGMN EALE FRWLLQER+KPTSVTF+S+FPAFAGLAALNLGKELHGSIIK KLDE
Sbjct: 421 GYVLNGMNKEALEAFRWLLQERLKPTSVTFSSIFPAFAGLAALNLGKELHGSIIKTKLDE 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSAVLDMYAKCGRLDLA RVFNRMT KDAICWNSMITSCSQNGRPGEAI+LFRQMG
Sbjct: 481 KCHVGSAVLDMYAKCGRLDLACRVFNRMTEKDAICWNSMITSCSQNGRPGEAINLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEG +YDCVSISGA+SACANLPALHYGK+IHG MIKGPLRSDLYAESSLIDMYAKCGNLN
Sbjct: 541 MEGNRYDCVSISGALSACANLPALHYGKQIHGLMIKGPLRSDLYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFD MQEKNEVSWNSII AYGNHGDLKECLALFHEMLRN IQPDHVTFL IISACG
Sbjct: 601 FSRRVFDRMQEKNEVSWNSIICAYGNHGDLKECLALFHEMLRNGIQPDHVTFLSIISACG 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
           HAG+VDEGIRYYHLMTEEYGIPARM HYAC+ADLFGRAGRLDEAFETINSMPFPPDAGVW
Sbjct: 661 HAGQVDEGIRYYHLMTEEYGIPARMHHYACMADLFGRAGRLDEAFETINSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACHVHGNVELAEVA
Sbjct: 721 GTLLGACHVHGNVELAEVA 737
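
The identity and positives percentages reported for a hit are ratios of matching columns to the alignment length. A minimal sketch that reproduces the figures for the Cucumis melo hit above, assuming the counts 662, 691 and 739 from its statistics line (the helper name `percent` is hypothetical):

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    return round(100.0 * matches / aligned, 2)


# Counts taken from the statistics line of the hit.
identity_pct = percent(662, 739)   # identical residues / alignment length
positives_pct = percent(691, 739)  # identical or conservatively substituted residues

print(identity_pct, positives_pct)
```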

BLAST of Cla97C03G059850 vs. NCBI nr
Match: XP_004135750.2 (PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Cucumis sativus] >XP_011659873.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Cucumis sativus] >KGN65963.1 hypothetical protein Csa_1G553510 [Cucumis sativus])

HSP 1 Score: 1349.0 bits (3490), Expect = 0.0e+00
Identity = 656/739 (88.77%), Positives = 683/739 (92.42%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KFC S   FL R SP RFLFSTQSN+KTPINPTL SSNAES+LASI QACNDH+ L 
Sbjct: 1   MFYKFCSSSSTFLSRLSPPRFLFSTQSNFKTPINPTLLSSNAESVLASILQACNDHTHLP 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           QGKQSHAQAI+SG+  NGDLG R+LGMYVRTGSL+DAKNLFYTL LGCTSAWNWMIRGFT
Sbjct: 61  QGKQSHAQAIVSGLAQNGDLGPRVLGMYVRTGSLKDAKNLFYTLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           MMG  NYALLFY KMLGAGVSPDKYTFPYV+K+C  L SVKMGKIVHETVNLMGLKED F
Sbjct: 121 MMGQFNYALLFYLKMLGAGVSPDKYTFPYVVKACCGLKSVKMGKIVHETVNLMGLKEDVF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG LSDAQYLFDNIPQKD VLWNVMLNGYVKNGDSGNA+KIFLEMRHSE
Sbjct: 181 VGSSLIKLYAENGHLSDAQYLFDNIPQKDSVLWNVMLNGYVKNGDSGNAIKIFLEMRHSE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFAC+LSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR
Sbjct: 241 IKPNSVTFACVLSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFDT PQ DLVSWNGIISGYVQNGL  EAEHLFRGMISAGIKPDSITFASFLPCVNELL
Sbjct: 301 KLFDTSPQSDLVSWNGIISGYVQNGLMGEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SLKHCKEIHGYI+RHAVVLDVFLKSALIDIYFKCRDVEMA+K+LCQSSSFDTVVCTTMIS
Sbjct: 361 SLKHCKEIHGYIIRHAVVLDVFLKSALIDIYFKCRDVEMAQKILCQSSSFDTVVCTTMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNG N EALE FRWL+QERMKPTSVTF+S+FPAFAGLAALNLGKELHGSIIK KLDE
Sbjct: 421 GYVLNGKNKEALEAFRWLVQERMKPTSVTFSSIFPAFAGLAALNLGKELHGSIIKTKLDE 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSA+LDMYAKCGRLDLA RVFNR+T KDAICWNSMITSCSQNGRPGEAI+LFRQMG
Sbjct: 481 KCHVGSAILDMYAKCGRLDLACRVFNRITEKDAICWNSMITSCSQNGRPGEAINLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEGT+YDCVSISGA+SACANLPALHYGKEIHG MIKGPLRSDLYAESSLIDMYAKCGNLN
Sbjct: 541 MEGTRYDCVSISGALSACANLPALHYGKEIHGLMIKGPLRSDLYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFD MQE+NEVSWNSIISAYGNHGDLKECLALFHEMLRN IQPDHVTFLG      
Sbjct: 601 FSRRVFDRMQERNEVSWNSIISAYGNHGDLKECLALFHEMLRNGIQPDHVTFLGXXXXXX 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
                       HLMTEEYGIPARMEHYACVAD+FGRAGRLDEAFETINSMPFPPDAGVW
Sbjct: 661 XXXXXXXXXXXXHLMTEEYGIPARMEHYACVADMFGRAGRLDEAFETINSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACH+HGNVELAEVA
Sbjct: 721 GTLLGACHIHGNVELAEVA 739

BLAST of Cla97C03G059850 vs. NCBI nr
Match: XP_022142608.1 (pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Momordica charantia])

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 634/739 (85.79%), Positives = 675/739 (91.34%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KF FS YRFLP FS   FLFST+SN K PINPTLFS+N E+ LASIFQACN HSLLR
Sbjct: 1   MFYKFRFSLYRFLPHFSRPEFLFSTESNSKNPINPTLFSTNVEAALASIFQACNHHSLLR 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           QG+QSHAQAI SG+  NGD+G RILGMYV TGSL+DAKN+FY+L LGCTSAWNWMIRGFT
Sbjct: 61  QGQQSHAQAIASGISQNGDMGPRILGMYVLTGSLKDAKNVFYSLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           +MG  NYALLFYFKMLGAG+ PDKYTFPYV+K+CGALN+VKMGKIVHETVNLMGL++DAF
Sbjct: 121 VMGWFNYALLFYFKMLGAGIYPDKYTFPYVVKACGALNNVKMGKIVHETVNLMGLEKDAF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG+LSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDS NA+KIFLEMRH E
Sbjct: 181 VGSSLIKLYAENGRLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSRNAIKIFLEMRHGE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFAC+LSVCA EAMLDLGTQLHG+AV+CGL+LDSPVANTLLAMYSKC+CLQAAR
Sbjct: 241 IKPNSVTFACVLSVCAMEAMLDLGTQLHGLAVTCGLDLDSPVANTLLAMYSKCRCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFD MPQ DLVSWNGIISGYVQNGL SEAE LFRGM+SAG+KPDSITFASFLPCV EL 
Sbjct: 301 KLFDMMPQSDLVSWNGIISGYVQNGLMSEAEQLFRGMVSAGMKPDSITFASFLPCVTELF 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SL+HCK IHGYIVRHAVVLDVFLKSALID+YFKCRDVEMA+K+L QSS  DTVVCT MIS
Sbjct: 361 SLEHCKAIHGYIVRHAVVLDVFLKSALIDVYFKCRDVEMAQKILRQSSLVDTVVCTAMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNGMN EALE FRWLLQ+R+KPTSVTFASVFPAFAGLAALNLGKELH SI+KN+LD 
Sbjct: 421 GYVLNGMNIEALEAFRWLLQKRLKPTSVTFASVFPAFAGLAALNLGKELHCSIVKNRLDV 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSAVLDMYAKCGRLDLA +VFNRMT KDAI WNSMITSCSQNGRPGEAIDLFRQMG
Sbjct: 481 KCHVGSAVLDMYAKCGRLDLACQVFNRMTEKDAIFWNSMITSCSQNGRPGEAIDLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEGTQYDCVSISGA+SACANLPALHYGKEIHGFMIKGPLRSD+YAESSLIDMYAKCGNLN
Sbjct: 541 MEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKGPLRSDIYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFDMMQ KNEVSWNSIISAYGNHGDLKECLALFHEML+N IQPDHVTFLG      
Sbjct: 601 FSRRVFDMMQGKNEVSWNSIISAYGNHGDLKECLALFHEMLKNGIQPDHVTFLGXXXXXX 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
                        LMTE+YGIPARMEHYAC+ADLFGRAGRLDEAFETI SMPFPPDAGVW
Sbjct: 661 XXXXXXXXXXXXXLMTEDYGIPARMEHYACMADLFGRAGRLDEAFETIKSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACHVHGNVELAEVA
Sbjct: 721 GTLLGACHVHGNVELAEVA 739

BLAST of Cla97C03G059850 vs. NCBI nr
Match: XP_022142610.1 (pentatricopeptide repeat-containing protein At4g21300 isoform X3 [Momordica charantia])

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 634/739 (85.79%), Positives = 675/739 (91.34%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KF FS YRFLP FS   FLFST+SN K PINPTLFS+N E+ LASIFQACN HSLLR
Sbjct: 1   MFYKFRFSLYRFLPHFSRPEFLFSTESNSKNPINPTLFSTNVEAALASIFQACNHHSLLR 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           QG+QSHAQAI SG+  NGD+G RILGMYV TGSL+DAKN+FY+L LGCTSAWNWMIRGFT
Sbjct: 61  QGQQSHAQAIASGISQNGDMGPRILGMYVLTGSLKDAKNVFYSLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           +MG  NYALLFYFKMLGAG+ PDKYTFPYV+K+CGALN+VKMGKIVHETVNLMGL++DAF
Sbjct: 121 VMGWFNYALLFYFKMLGAGIYPDKYTFPYVVKACGALNNVKMGKIVHETVNLMGLEKDAF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG+LSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDS NA+KIFLEMRH E
Sbjct: 181 VGSSLIKLYAENGRLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSRNAIKIFLEMRHGE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFAC+LSVCA EAMLDLGTQLHG+AV+CGL+LDSPVANTLLAMYSKC+CLQAAR
Sbjct: 241 IKPNSVTFACVLSVCAMEAMLDLGTQLHGLAVTCGLDLDSPVANTLLAMYSKCRCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFD MPQ DLVSWNGIISGYVQNGL SEAE LFRGM+SAG+KPDSITFASFLPCV EL 
Sbjct: 301 KLFDMMPQSDLVSWNGIISGYVQNGLMSEAEQLFRGMVSAGMKPDSITFASFLPCVTELF 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SL+HCK IHGYIVRHAVVLDVFLKSALID+YFKCRDVEMA+K+L QSS  DTVVCT MIS
Sbjct: 361 SLEHCKAIHGYIVRHAVVLDVFLKSALIDVYFKCRDVEMAQKILRQSSLVDTVVCTAMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNGMN EALE FRWLLQ+R+KPTSVTFASVFPAFAGLAALNLGKELH SI+KN+LD 
Sbjct: 421 GYVLNGMNIEALEAFRWLLQKRLKPTSVTFASVFPAFAGLAALNLGKELHCSIVKNRLDV 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSAVLDMYAKCGRLDLA +VFNRMT KDAI WNSMITSCSQNGRPGEAIDLFRQMG
Sbjct: 481 KCHVGSAVLDMYAKCGRLDLACQVFNRMTEKDAIFWNSMITSCSQNGRPGEAIDLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEGTQYDCVSISGA+SACANLPALHYGKEIHGFMIKGPLRSD+YAESSLIDMYAKCGNLN
Sbjct: 541 MEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKGPLRSDIYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFDMMQ KNEVSWNSIISAYGNHGDLKECLALFHEML+N IQPDHVTFLG      
Sbjct: 601 FSRRVFDMMQGKNEVSWNSIISAYGNHGDLKECLALFHEMLKNGIQPDHVTFLGXXXXXX 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
                        LMTE+YGIPARMEHYAC+ADLFGRAGRLDEAFETI SMPFPPDAGVW
Sbjct: 661 XXXXXXXXXXXXXLMTEDYGIPARMEHYACMADLFGRAGRLDEAFETIKSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACHVHGNVELAEVA
Sbjct: 721 GTLLGACHVHGNVELAEVA 739

BLAST of Cla97C03G059850 vs. NCBI nr
Match: XP_022142609.1 (pentatricopeptide repeat-containing protein At4g21300 isoform X2 [Momordica charantia])

HSP 1 Score: 1306.2 bits (3379), Expect = 0.0e+00
Identity = 634/739 (85.79%), Positives = 675/739 (91.34%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KF FS YRFLP FS   FLFST+SN K PINPTLFS+N E+ LASIFQACN HSLLR
Sbjct: 1   MFYKFRFSLYRFLPHFSRPEFLFSTESNSKNPINPTLFSTNVEAALASIFQACNHHSLLR 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           QG+QSHAQAI SG+  NGD+G RILGMYV TGSL+DAKN+FY+L LGCTSAWNWMIRGFT
Sbjct: 61  QGQQSHAQAIASGISQNGDMGPRILGMYVLTGSLKDAKNVFYSLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           +MG  NYALLFYFKMLGAG+ PDKYTFPYV+K+CGALN+VKMGKIVHETVNLMGL++DAF
Sbjct: 121 VMGWFNYALLFYFKMLGAGIYPDKYTFPYVVKACGALNNVKMGKIVHETVNLMGLEKDAF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG+LSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDS NA+KIFLEMRH E
Sbjct: 181 VGSSLIKLYAENGRLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSRNAIKIFLEMRHGE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFAC+LSVCA EAMLDLGTQLHG+AV+CGL+LDSPVANTLLAMYSKC+CLQAAR
Sbjct: 241 IKPNSVTFACVLSVCAMEAMLDLGTQLHGLAVTCGLDLDSPVANTLLAMYSKCRCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFD MPQ DLVSWNGIISGYVQNGL SEAE LFRGM+SAG+KPDSITFASFLPCV EL 
Sbjct: 301 KLFDMMPQSDLVSWNGIISGYVQNGLMSEAEQLFRGMVSAGMKPDSITFASFLPCVTELF 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SL+HCK IHGYIVRHAVVLDVFLKSALID+YFKCRDVEMA+K+L QSS  DTVVCT MIS
Sbjct: 361 SLEHCKAIHGYIVRHAVVLDVFLKSALIDVYFKCRDVEMAQKILRQSSLVDTVVCTAMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNGMN EALE FRWLLQ+R+KPTSVTFASVFPAFAGLAALNLGKELH SI+KN+LD 
Sbjct: 421 GYVLNGMNIEALEAFRWLLQKRLKPTSVTFASVFPAFAGLAALNLGKELHCSIVKNRLDV 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSAVLDMYAKCGRLDLA +VFNRMT KDAI WNSMITSCSQNGRPGEAIDLFRQMG
Sbjct: 481 KCHVGSAVLDMYAKCGRLDLACQVFNRMTEKDAIFWNSMITSCSQNGRPGEAIDLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEGTQYDCVSISGA+SACANLPALHYGKEIHGFMIKGPLRSD+YAESSLIDMYAKCGNLN
Sbjct: 541 MEGTQYDCVSISGALSACANLPALHYGKEIHGFMIKGPLRSDIYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFDMMQ KNEVSWNSIISAYGNHGDLKECLALFHEML+N IQPDHVTFLG      
Sbjct: 601 FSRRVFDMMQGKNEVSWNSIISAYGNHGDLKECLALFHEMLKNGIQPDHVTFLGXXXXXX 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
                        LMTE+YGIPARMEHYAC+ADLFGRAGRLDEAFETI SMPFPPDAGVW
Sbjct: 661 XXXXXXXXXXXXXLMTEDYGIPARMEHYACMADLFGRAGRLDEAFETIKSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACHVHGNVELAEVA
Sbjct: 721 GTLLGACHVHGNVELAEVA 739

BLAST of Cla97C03G059850 vs. TrEMBL
Match: tr|A0A1S3C9G4|A0A1S3C9G4_CUCME (pentatricopeptide repeat-containing protein At4g21300 OS=Cucumis melo OX=3656 GN=LOC103498325 PE=4 SV=1)

HSP 1 Score: 1357.0 bits (3511), Expect = 0.0e+00
Identity = 662/739 (89.58%), Positives = 691/739 (93.50%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KFC S   FL  FSP RFLFSTQSN+KTP   TLFSSNAES+LASIF ACNDH+LL 
Sbjct: 1   MFYKFCSSSCTFLSHFSPPRFLFSTQSNFKTP--TTLFSSNAESVLASIFHACNDHTLLP 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           Q KQSHAQ I+ G+  NG LG R+LGMYVR GSL+DAKNLFYTL LGCTSAWNWMIRGFT
Sbjct: 61  QAKQSHAQTIVRGLAQNGYLGPRVLGMYVRAGSLKDAKNLFYTLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           MMG  NYALLFY KMLGAGVSPDKYTFPYV+K+C  L SVKMGKIVHETVNLMGL ED F
Sbjct: 121 MMGQFNYALLFYLKMLGAGVSPDKYTFPYVVKACCGLKSVKMGKIVHETVNLMGLMEDVF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG LSDAQYLFDNIPQKD VLWNVMLNGYVKNGDSGNA++IFLEMRHSE
Sbjct: 181 VGSSLIKLYAENGHLSDAQYLFDNIPQKDSVLWNVMLNGYVKNGDSGNAIEIFLEMRHSE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR
Sbjct: 241 IKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFD MPQ DLVSWNGIISGYVQNGL SEAE+LFRGMISAGIKPDSITFASFLPCV+ELL
Sbjct: 301 KLFDRMPQSDLVSWNGIISGYVQNGLMSEAENLFRGMISAGIKPDSITFASFLPCVSELL 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SLKHCKEIHGYIVRHAVVLDVFLKSALIDIY KCR+++MA+K+LCQSSSFDTVVCTTMIS
Sbjct: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYCKCRNMKMAQKILCQSSSFDTVVCTTMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNGMN EALE FRWLLQER+KPTSVTF+S+FPAFAGLAALNLGKELHGSIIK KLDE
Sbjct: 421 GYVLNGMNKEALEAFRWLLQERLKPTSVTFSSIFPAFAGLAALNLGKELHGSIIKTKLDE 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSAVLDMYAKCGRLDLA RVFNRMT KDAICWNSMITSCSQNGRPGEAI+LFRQMG
Sbjct: 481 KCHVGSAVLDMYAKCGRLDLACRVFNRMTEKDAICWNSMITSCSQNGRPGEAINLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEG +YDCVSISGA+SACANLPALHYGK+IHG MIKGPLRSDLYAESSLIDMYAKCGNLN
Sbjct: 541 MEGNRYDCVSISGALSACANLPALHYGKQIHGLMIKGPLRSDLYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFD MQEKNEVSWNSII AYGNHGDLKECLALFHEMLRN IQPDHVTFL IISACG
Sbjct: 601 FSRRVFDRMQEKNEVSWNSIICAYGNHGDLKECLALFHEMLRNGIQPDHVTFLSIISACG 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
           HAG+VDEGIRYYHLMTEEYGIPARM HYAC+ADLFGRAGRLDEAFETINSMPFPPDAGVW
Sbjct: 661 HAGQVDEGIRYYHLMTEEYGIPARMHHYACMADLFGRAGRLDEAFETINSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACHVHGNVELAEVA
Sbjct: 721 GTLLGACHVHGNVELAEVA 737

BLAST of Cla97C03G059850 vs. TrEMBL
Match: tr|A0A0A0LW16|A0A0A0LW16_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553510 PE=4 SV=1)

HSP 1 Score: 1349.0 bits (3490), Expect = 0.0e+00
Identity = 656/739 (88.77%), Positives = 683/739 (92.42%), Query Frame = 0

Query: 1   MFHKFCFSFYRFLPRFSPSRFLFSTQSNYKTPINPTLFSSNAESLLASIFQACNDHSLLR 60
           MF+KFC S   FL R SP RFLFSTQSN+KTPINPTL SSNAES+LASI QACNDH+ L 
Sbjct: 1   MFYKFCSSSSTFLSRLSPPRFLFSTQSNFKTPINPTLLSSNAESVLASILQACNDHTHLP 60

Query: 61  QGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFT 120
           QGKQSHAQAI+SG+  NGDLG R+LGMYVRTGSL+DAKNLFYTL LGCTSAWNWMIRGFT
Sbjct: 61  QGKQSHAQAIVSGLAQNGDLGPRVLGMYVRTGSLKDAKNLFYTLQLGCTSAWNWMIRGFT 120

Query: 121 MMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAF 180
           MMG  NYALLFY KMLGAGVSPDKYTFPYV+K+C  L SVKMGKIVHETVNLMGLKED F
Sbjct: 121 MMGQFNYALLFYLKMLGAGVSPDKYTFPYVVKACCGLKSVKMGKIVHETVNLMGLKEDVF 180

Query: 181 VGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSE 240
           VGSSLIKLYA+NG LSDAQYLFDNIPQKD VLWNVMLNGYVKNGDSGNA+KIFLEMRHSE
Sbjct: 181 VGSSLIKLYAENGHLSDAQYLFDNIPQKDSVLWNVMLNGYVKNGDSGNAIKIFLEMRHSE 240

Query: 241 TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300
            KPNSVTFAC+LSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR
Sbjct: 241 IKPNSVTFACVLSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAAR 300

Query: 301 KLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360
           KLFDT PQ DLVSWNGIISGYVQNGL  EAEHLFRGMISAGIKPDSITFASFLPCVNELL
Sbjct: 301 KLFDTSPQSDLVSWNGIISGYVQNGLMGEAEHLFRGMISAGIKPDSITFASFLPCVNELL 360

Query: 361 SLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMIS 420
           SLKHCKEIHGYI+RHAVVLDVFLKSALIDIYFKCRDVEMA+K+LCQSSSFDTVVCTTMIS
Sbjct: 361 SLKHCKEIHGYIIRHAVVLDVFLKSALIDIYFKCRDVEMAQKILCQSSSFDTVVCTTMIS 420

Query: 421 GYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDE 480
           GYVLNG N EALE FRWL+QERMKPTSVTF+S+FPAFAGLAALNLGKELHGSIIK KLDE
Sbjct: 421 GYVLNGKNKEALEAFRWLVQERMKPTSVTFSSIFPAFAGLAALNLGKELHGSIIKTKLDE 480

Query: 481 KCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMG 540
           KCHVGSA+LDMYAKCGRLDLA RVFNR+T KDAICWNSMITSCSQNGRPGEAI+LFRQMG
Sbjct: 481 KCHVGSAILDMYAKCGRLDLACRVFNRITEKDAICWNSMITSCSQNGRPGEAINLFRQMG 540

Query: 541 MEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLN 600
           MEGT+YDCVSISGA+SACANLPALHYGKEIHG MIKGPLRSDLYAESSLIDMYAKCGNLN
Sbjct: 541 MEGTRYDCVSISGALSACANLPALHYGKEIHGLMIKGPLRSDLYAESSLIDMYAKCGNLN 600

Query: 601 FSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACG 660
           FSRRVFD MQE+NEVSWNSIISAYGNHGDLKECLALFHEMLRN IQPDHVTFLG      
Sbjct: 601 FSRRVFDRMQERNEVSWNSIISAYGNHGDLKECLALFHEMLRNGIQPDHVTFLGXXXXXX 660

Query: 661 HAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVW 720
                       HLMTEEYGIPARMEHYACVAD+FGRAGRLDEAFETINSMPFPPDAGVW
Sbjct: 661 XXXXXXXXXXXXHLMTEEYGIPARMEHYACVADMFGRAGRLDEAFETINSMPFPPDAGVW 720

Query: 721 GTLLGACHVHGNVELAEVA 740
           GTLLGACH+HGNVELAEVA
Sbjct: 721 GTLLGACHIHGNVELAEVA 739

BLAST of Cla97C03G059850 vs. TrEMBL
Match: tr|A0A2N9IUG5|A0A2N9IUG5_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56050 PE=4 SV=1)

HSP 1 Score: 1044.3 bits (2699), Expect = 1.3e-301
Identity = 507/731 (69.36%), Positives = 595/731 (81.40%), Query Frame = 0

Query: 13  LPRFSPSRFLFSTQSNYKTPINPTLFSSNAE----SLLASIFQACNDHSLLRQGKQSHAQ 72
           L R SP+ F+ +  ++ +   +  +  SN E    S LASI QACN  S+L+QG+Q HAQ
Sbjct: 5   LTRISPANFIHTNCNHIE---HSRVLYSNTEHAWASQLASILQACNGPSVLQQGRQVHAQ 64

Query: 73  AIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTSAWNWMIRGFTMMGLCNYA 132
            I+ G  +N  LG+++LGMYV  GS+  AKN+FY L LG +  W  MIRGFT MG  ++A
Sbjct: 65  VIVCGYSNNALLGTKLLGMYVLCGSIVAAKNMFYKLELGSSLPWTLMIRGFTKMGWFDFA 124

Query: 133 LLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAFVGSSLIKL 192
           LLFYFKMLG GV PDKYTFP VIK+CG LN+V++GK+VH T+ LMG + D FVGSSLIKL
Sbjct: 125 LLFYFKMLGCGVFPDKYTFPCVIKACGGLNNVRLGKLVHRTIRLMGFEFDVFVGSSLIKL 184

Query: 193 YADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSETKPNSVTF 252
           YA+NG + +A+YLFD +P +DCV+WNVMLN YVKN DS NA+++FLEMR+SE +PNSVTF
Sbjct: 185 YAENGCIGEARYLFDKMPHRDCVMWNVMLNVYVKNADSINALQMFLEMRNSEIRPNSVTF 244

Query: 253 ACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAARKLFDTMPQ 312
           AC+LSVCASEAM+ LGTQLHG+ V CGLELDSPVANTLLAMYSKCQ L  A  LFD MPQ
Sbjct: 245 ACVLSVCASEAMVVLGTQLHGLVVRCGLELDSPVANTLLAMYSKCQKLFDACALFDMMPQ 304

Query: 313 CDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELLSLKHCKEI 372
            DLV+WNG+ISG+VQNG   EA HLFR MIS G+KPDSIT ASFLP V E   L   KEI
Sbjct: 305 TDLVTWNGMISGFVQNGFMGEASHLFREMISVGVKPDSITLASFLPSVTESACLNQGKEI 364

Query: 373 HGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMISGYVLNGMN 432
           HGY+VRH V LDVFLKSALIDIY KCR+VEMARK+  QS++ D VVCTTMISG+ LNGMN
Sbjct: 365 HGYMVRHGVPLDVFLKSALIDIYLKCRNVEMARKIFSQSNTTDIVVCTTMISGFTLNGMN 424

Query: 433 TEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDEKCHVGSAV 492
           ++AL +FRWLL+E+M+P SVT AS  PA AGLAAL LGKELH +I+KN LD +CHVGSA+
Sbjct: 425 SDALAIFRWLLKEKMRPNSVTLASTLPACAGLAALKLGKELHCNILKNGLDGRCHVGSAI 484

Query: 493 LDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMGMEGTQYDC 552
            DMYAKCGRLDLA + F R++ +D +CWNSMITSC+QNG+P +AIDLFRQ+G  GT+YDC
Sbjct: 485 TDMYAKCGRLDLAHQTFERISERDTVCWNSMITSCAQNGKPEDAIDLFRQLGKGGTKYDC 544

Query: 553 VSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLNFSRRVFDM 612
           VSIS A+SACANL +LH+GKEIH FMIKG   SDL+AES+LIDMYAKCGNL+ +R VFDM
Sbjct: 545 VSISAALSACANLSSLHHGKEIHSFMIKGAFSSDLFAESALIDMYAKCGNLDSARSVFDM 604

Query: 613 MQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACGHAGRVDEG 672
           MQ KNEVSWNSIISAYGNHG LK+ L LFHEML N IQPDHVTFL IISACGHAG+VD G
Sbjct: 605 MQGKNEVSWNSIISAYGNHGHLKDSLTLFHEMLENGIQPDHVTFLAIISACGHAGQVDNG 664

Query: 673 IRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACH 732
            RY+  MTE+YGIPA+MEHYAC+ DLFGRAG+L+EAFETI SMPF PDAGVWGTLLGAC 
Sbjct: 665 ARYFRCMTEQYGIPAQMEHYACMVDLFGRAGQLNEAFETIKSMPFTPDAGVWGTLLGACR 724

Query: 733 VHGNVELAEVA 740
           VHGNVELAEVA
Sbjct: 725 VHGNVELAEVA 732

BLAST of Cla97C03G059850 vs. TrEMBL
Match: tr|A0A2P5ENX2|A0A2P5ENX2_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_169790 PE=4 SV=1)

HSP 1 Score: 1033.5 bits (2671), Expect = 2.3e-298
Identity = 496/691 (71.78%), Positives = 577/691 (83.50%), Query Frame = 0

Query: 48  SIFQACNDHSLLRQGKQSHAQAIISGVVHNGD-LGSRILGMYVRTGSLEDAKNLFYTLLL 107
           SI Q C DH+L+RQGKQ HAQ ++ G    G  LG++IL MYV  GS   AKN F+ L L
Sbjct: 40  SILQVCCDHALVRQGKQVHAQVVVGGRQCKGSLLGTKILAMYVLCGSFLHAKNTFHHLEL 99

Query: 108 GCTSAWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIV 167
           G  S WNWMIRGFTMMGL  YAL+FYFKMLG G SPDKYTFP VIK+CG LN+V + K V
Sbjct: 100 GLASPWNWMIRGFTMMGLFKYALMFYFKMLGYGTSPDKYTFPPVIKACGGLNNVSLAKQV 159

Query: 168 HETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDS 227
           HE V LMGL+ D FVGSSLIKLYA+NG + DA+ LFD +PQ+D VLWNVMLNGYVKNGD+
Sbjct: 160 HERVRLMGLEVDVFVGSSLIKLYAENGCIDDARNLFDKMPQRDSVLWNVMLNGYVKNGDT 219

Query: 228 GNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTL 287
            ++V++FLEM  S+ KPNSVT+AC+LSVC+SE ++  GTQ+HG  VSCGLELDSPVANTL
Sbjct: 220 KHSVEMFLEMSESDVKPNSVTYACMLSVCSSEELICFGTQIHGHVVSCGLELDSPVANTL 279

Query: 288 LAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDS 347
           LAMYSKCQ L  A KLF+ MPQ DLV+WNG+ISG+VQNG   EA + FR MISA I+PDS
Sbjct: 280 LAMYSKCQRLSDACKLFNLMPQTDLVTWNGMISGHVQNGFMIEASNCFRDMISARIEPDS 339

Query: 348 ITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQ 407
           ITFASFLP V E  SLK  KEIHGYI+RH V LDVFLKSALID+YFKCR+VEMARK+L Q
Sbjct: 340 ITFASFLPSVTEFASLKLGKEIHGYIIRHGVPLDVFLKSALIDLYFKCRNVEMARKILHQ 399

Query: 408 SSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLG 467
           S++ D +VCT MISG+VLNGMNT+A+E+FRWLL+ +M+P SVT ASV PAFAG+AAL LG
Sbjct: 400 STTVDLIVCTAMISGFVLNGMNTDAMEIFRWLLKAKMRPNSVTLASVLPAFAGMAALKLG 459

Query: 468 KELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQN 527
           KELHG+IIK  LD +C+VGSA+ DMYAKCGRLDLAR+VF +M  +DA+CWNSMITSCSQN
Sbjct: 460 KELHGNIIKTGLDRRCYVGSAITDMYAKCGRLDLARQVFRKMIERDAVCWNSMITSCSQN 519

Query: 528 GRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAE 587
           G P EAIDLFRQMG+EGT+YDCVSIS A+S+CANLPALH+GKEIHGFMI+    SD+++ 
Sbjct: 520 GEPEEAIDLFRQMGLEGTKYDCVSISAALSSCANLPALHHGKEIHGFMIRRAFSSDIFSG 579

Query: 588 SSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQ 647
           S+LIDMYAKCG+L+F+RRVF+ M+ KNEVSWNSII+AYGNHG L+E L LFHEML   I 
Sbjct: 580 SALIDMYAKCGSLDFARRVFNFMEGKNEVSWNSIIAAYGNHGQLEESLTLFHEMLEQGIL 639

Query: 648 PDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFE 707
           PDHVTFLG+ISACGHAGRVD+G  ++ LMTEEY IPAR EHYAC+ DLFGRAG L+EAFE
Sbjct: 640 PDHVTFLGMISACGHAGRVDDGSHFFRLMTEEYKIPARTEHYACMVDLFGRAGCLNEAFE 699

Query: 708 TINSMPFPPDAGVWGTLLGACHVHGNVELAE 738
           TINSMPF PDAGVWGTLLGAC VHGNVELAE
Sbjct: 700 TINSMPFSPDAGVWGTLLGACRVHGNVELAE 730

BLAST of Cla97C03G059850 vs. TrEMBL
Match: tr|A0A2I4F578|A0A2I4F578_9ROSI (pentatricopeptide repeat-containing protein At4g21300 OS=Juglans regia OX=51240 GN=LOC108995659 PE=4 SV=1)

HSP 1 Score: 1021.1 bits (2639), Expect = 1.2e-294
Identity = 498/735 (67.76%), Positives = 595/735 (80.95%), Query Frame = 0

Query: 8   SFYRFLPRFSPSRFLFSTQSNYKTPIN--PTLFSSNAESLLASIFQACNDHSLLRQGKQS 67
           S +    R SP + + +  +N+K      P +  + A   LASI QAC+  S+LR GKQ 
Sbjct: 12  SIHTRFTRTSPIKLVHANCNNFKHSFRSYPNIEQAFA-CQLASILQACSGPSVLRHGKQV 71

Query: 68  HAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLLLGCTS-AWNWMIRGFTMMGL 127
           HAQ I+SG+ ++G LG R+LGMY+  GS  DAKN+FY L L  +S  WNWMIRGFT++G 
Sbjct: 72  HAQIIVSGIGNHGLLGGRVLGMYILCGSFMDAKNMFYRLELRSSSLPWNWMIRGFTVLGR 131

Query: 128 CNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKIVHETVNLMGLKEDAFVGSS 187
            ++ALLFYFKMLG G SPDKYTFPYVIK+C  LN+V +GK+VH T+ LMG + D FVGSS
Sbjct: 132 FDFALLFYFKMLGYGTSPDKYTFPYVIKACIGLNNVNLGKLVHGTIQLMGFELDVFVGSS 191

Query: 188 LIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKNGDSGNAVKIFLEMRHSETKPN 247
           LIK+YA+N  ++DA+ LFD IP KD VL NVMLNGYVK GD+ +A+K+FLEMR+SE +PN
Sbjct: 192 LIKMYAENDCINDARCLFDRIPHKDGVLCNVMLNGYVKTGDTSSALKMFLEMRNSEIRPN 251

Query: 248 SVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVANTLLAMYSKCQCLQAARKLFD 307
           SVTFACI+SVCASEAM+  GTQLHG+ V CGLEL+S VANTLLAMYSKCQ L  A KLFD
Sbjct: 252 SVTFACIISVCASEAMIGFGTQLHGLVVRCGLELESSVANTLLAMYSKCQYLFDACKLFD 311

Query: 308 TMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIKPDSITFASFLPCVNELLSLKH 367
            +PQ DLV+WNG+ISG+VQNG   EA +LFR MIS  +KPDSITFASFLP V E+  LK 
Sbjct: 312 MIPQTDLVTWNGMISGFVQNGFMREASNLFREMISVSVKPDSITFASFLPSVTEIAGLKQ 371

Query: 368 CKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKVLCQSSSFDTVVCTTMISGYVL 427
            KEIHGY+VRH V LD+F+KSALIDIYFKCRDV MARKV  QS++ D +VCT MISG+VL
Sbjct: 372 GKEIHGYMVRHGVPLDLFVKSALIDIYFKCRDVGMARKVFGQSNTVDVIVCTAMISGFVL 431

Query: 428 NGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAALNLGKELHGSIIKNKLDEKCHV 487
           NG+N++ALE+FRWLL+E+M+P SVT ASV PA A LAAL LGKELH +I+KN LD +CHV
Sbjct: 432 NGINSDALEIFRWLLKEKMRPNSVTLASVLPACAALAALKLGKELHCNILKNGLDGRCHV 491

Query: 488 GSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSCSQNGRPGEAIDLFRQMGMEGT 547
           GSA+ DMYAKCGRLDLA  +F+RM+ +D +CWN MITSCSQN  P EAI LFRQMG+EGT
Sbjct: 492 GSAITDMYAKCGRLDLAHHIFDRMSQRDTVCWNVMITSCSQNSEPEEAIQLFRQMGIEGT 551

Query: 548 QYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDLYAESSLIDMYAKCGNLNFSRR 607
           ++DCVSIS A+SACANLP+L YGKEIHGFM+KG   SDL++ES+LIDMYAKCGNL+ +  
Sbjct: 552 KFDCVSISAALSACANLPSLQYGKEIHGFMVKGTFSSDLFSESALIDMYAKCGNLDSACL 611

Query: 608 VFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRNDIQPDHVTFLGIISACGHAGR 667
           VF+MM EKNEVSWNSIISAYGNHG LK+CL LF+ ML N I PDHVTFL IISACGHAG+
Sbjct: 612 VFNMMGEKNEVSWNSIISAYGNHGCLKDCLRLFNRMLENGILPDHVTFLAIISACGHAGQ 671

Query: 668 VDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDEAFETINSMPFPPDAGVWGTLL 727
           +D G RY++ MT+EYG+PARMEHYAC+ DLFGRAGRL+EAFE I S+PF PDAGVWGTLL
Sbjct: 672 IDNGARYFYSMTKEYGLPARMEHYACMVDLFGRAGRLNEAFEIIKSIPFIPDAGVWGTLL 731

Query: 728 GACHVHGNVELAEVA 740
           GAC VHGNVELAE+A
Sbjct: 732 GACRVHGNVELAEIA 745

BLAST of Cla97C03G059850 vs. Swiss-Prot
Match: sp|Q9STE1|PP333_ARATH (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 687.6 bits (1773), Expect = 1.6e-196
Identity = 346/697 (49.64%), Positives = 453/697 (64.99%), Query Frame = 0

Query: 46  LASIFQACNDHSLLRQGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLL 105
           L+ + QAC++ +LLRQGKQ HA  I++ +  +     RILGMY   GS  D   +FY L 
Sbjct: 38  LSLLLQACSNPNLLRQGKQVHAFLIVNSISGDSYTDERILGMYAMCGSFSDCGKMFYRLD 97

Query: 106 LGCTS--AWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMG 165
           L  +S   WN +I  F   GL N AL FYFKML  GVSPD  TFP ++K+C AL + K  
Sbjct: 98  LRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVSTFPCLVKACVALKNFKGI 157

Query: 166 KIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKN 225
             + +TV+ +G+  + FV SSLIK Y + G++     LFD + QKDCV+WNVMLNGY K 
Sbjct: 158 DFLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKC 217

Query: 226 GDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVA 285
           G   + +K F  MR  +  PN+VTF C+LSVCAS+ ++DLG QLHG+ V  G++ +  + 
Sbjct: 218 GALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIK 277

Query: 286 NTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIK 345
           N+LL+MYSKC     A KLF  M + D V+WN +ISGYVQ+GL  E+   F  MIS+G+ 
Sbjct: 278 NSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVL 337

Query: 346 PDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKV 405
           PD+ITF+S LP V++  +L++CK+IH YI+RH++ LD+FL SALID YFKCR V MA+ +
Sbjct: 338 PDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNI 397

Query: 406 LCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAAL 465
             Q +S D VV T MISGY+ NG+  ++LE+FRWL++ ++ P  +T  S+ P    L AL
Sbjct: 398 FSQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLAL 457

Query: 466 NLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSC 525
            LG+ELHG IIK   D +C++G AV+DMYAKCGR++LA  +F R++ +D + WNSMIT C
Sbjct: 458 KLGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRC 517

Query: 526 SQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDL 585
           +Q+  P  AID+FRQMG+ G  YDCVSIS A+SACANLP+  +GK IHGFMIK  L SD+
Sbjct: 518 AQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDV 577

Query: 586 YAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEML-R 645
           Y+ES+LIDMYAKCGNL  +  VF  M+EKN VSWNSII+A GNHG LK+ L LFHEM+ +
Sbjct: 578 YSESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEK 637

Query: 646 NDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLD 705
           + I+PD                                                      
Sbjct: 638 SGIRPDQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 697

Query: 706 EAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
               T+ SMPFPPDAGVWGTLLGAC +H NVELAEVA
Sbjct: 698 XXXXTVKSMPFPPDAGVWGTLLGACRLHKNVELAEVA 734

BLAST of Cla97C03G059850 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 436.4 bits (1121), Expect = 6.2e-121
Identity = 220/590 (37.29%), Positives = 338/590 (57.29%), Query Frame = 0

Query: 150 VIKSCGALNSVKMGKIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKD 209
           +++ C +L  ++    +   V   GL ++ F  + L+ L+   G + +A  +F+ I  K 
Sbjct: 43  LLERCSSLKELRQ---ILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKL 102

Query: 210 CVLWNVMLNGYVKNGDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHG 269
            VL++ ML G+ K  D   A++ F+ MR+ + +P    F  +L VC  EA L +G ++HG
Sbjct: 103 NVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHG 162

Query: 270 IAVSCGLELDSPVANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSE 329
           + V  G  LD      L  MY+KC+ +  ARK+FD MP+ DLVSWN I++GY QNG+   
Sbjct: 163 LLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARM 222

Query: 330 AEHLFRGMISAGIKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALID 389
           A  + + M    +KP  IT  S LP V+ L  +   KEIHGY +R      V + +AL+D
Sbjct: 223 ALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVD 282

Query: 390 IYFKCRDVEMARKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVT 449
           +Y KC  +E AR++       + V   +MI  YV N    EA+ +F+ +L E +KPT V+
Sbjct: 283 MYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVS 342

Query: 450 FASVFPAFAGLAALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMT 509
                 A A L  L  G+ +H   ++  LD    V ++++ MY KC  +D A  +F ++ 
Sbjct: 343 VMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQ 402

Query: 510 GKDAICWNSMITSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKE 569
            +  + WN+MI   +QNGRP +A++ F QM     + D  +    I+A A L   H+ K 
Sbjct: 403 SRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKW 462

Query: 570 IHGFMIKGPLRSDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGD 629
           IHG +++  L  +++  ++L+DMYAKCG +  +R +FDMM E++  +WN++I  YG HG 
Sbjct: 463 IHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGF 522

Query: 630 LKECLALFHEMLRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYA 689
            K  L LF EM +  I+P+ VTFL +ISAC H+G V+ G++ +++M E Y I   M+HY 
Sbjct: 523 GKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYG 582

Query: 690 CVADLFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
            + DL GRAGRL+EA++ I  MP  P   V+G +LGAC +H NV  AE A
Sbjct: 583 AMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKA 629

BLAST of Cla97C03G059850 vs. Swiss-Prot
Match: sp|Q9M1V3|PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 434.5 bits (1116), Expect = 2.4e-120
Identity = 234/696 (33.62%), Positives = 379/696 (54.45%), Query Frame = 0

Query: 47  ASIFQACNDHSLLRQGKQSHAQAIISGVVHNGD-LGSRILGMYVRTGSLEDAKNLFYTLL 106
           A + + C     + QG+Q H++   +      D L  +++ MY + GSL+DA+ +F  + 
Sbjct: 84  AYVLELCGKRRAVSQGRQLHSRIFKTFPSFELDFLAGKLVFMYGKCGSLDDAEKVFDEMP 143

Query: 107 LGCTSAWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKI 166
                AWN MI  +   G    AL  Y+ M   GV     +FP ++K+C  L  ++ G  
Sbjct: 144 DRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSE 203

Query: 167 VHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQK-DCVLWNVMLNGYVKNG 226
           +H  +  +G     F+ ++L+ +YA N  LS A+ LFD   +K D VLWN +L+ Y  +G
Sbjct: 204 LHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSG 263

Query: 227 DSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLH-GIAVSCGLELDSPVA 286
            S   +++F EM  +   PNS T    L+ C   +   LG ++H  +  S     +  V 
Sbjct: 264 KSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVC 323

Query: 287 NTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIK 346
           N L+AMY++C  +  A ++   M   D+V+WN +I GYVQN +  EA   F  MI+AG K
Sbjct: 324 NALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHK 383

Query: 347 PDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKV 406
            D ++  S +     L +L    E+H Y+++H    ++ + + LID+Y KC       + 
Sbjct: 384 SDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGRA 443

Query: 407 LCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAAL 466
             +    D +  TT+I+GY  N  + EALE+FR + ++RM+   +   S+  A + L ++
Sbjct: 444 FLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLKSM 503

Query: 467 NLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSC 526
            + KE+H  I++  L +   + + ++D+Y KC  +  A RVF  + GKD + W SMI+S 
Sbjct: 504 LIVKEIHCHILRKGLLDTV-IQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMISSS 563

Query: 527 SQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDL 586
           + NG   EA++LFR+M   G   D V++   +SA A+L AL+ G+EIH ++++     + 
Sbjct: 564 ALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFCLEG 623

Query: 587 YAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRN 646
               +++DMYA CG+L  ++ VFD ++ K  + + S+I+AYG HG  K  + LF +M   
Sbjct: 624 SIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKMRHE 683

Query: 647 DIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDE 706
           ++ PDH++FL ++ AC HAG +DEG  +  +M  EY +    EHY C+ D+ GRA  + E
Sbjct: 684 NVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANCVVE 743

Query: 707 AFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
           AFE +  M   P A VW  LL AC  H   E+ E+A
Sbjct: 744 AFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIA 778

BLAST of Cla97C03G059850 vs. Swiss-Prot
Match: sp|Q0WN60|PPR48_ARATH (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 431.4 bits (1108), Expect = 2.0e-119
Identity = 238/701 (33.95%), Positives = 373/701 (53.21%), Query Frame = 0

Query: 46  LASIFQACNDHSLLRQGKQSHAQAIISGVVHNGD-LGSRILGMYVRTGSLEDAKNLFYTL 105
           L  + QA      +  G++ H     S  + N D L +RI+ MY   GS +D++ +F  L
Sbjct: 87  LGLLLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVFDAL 146

Query: 106 LLGCTSAWNWMIRGFTMMGLCNYALLFYFKMLG-AGVSPDKYTFPYVIKSCGALNSVKMG 165
                  WN +I  ++   L +  L  + +M+    + PD +T+P VIK+C  ++ V +G
Sbjct: 147 RSKNLFQWNAVISSYSRNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIG 206

Query: 166 KIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKN 225
             VH  V   GL ED FVG++L+  Y  +G ++DA  LFD +P+++ V WN M+  +  N
Sbjct: 207 LAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDN 266

Query: 226 GDSGNAVKIFLEMRHSE----TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELD 285
           G S  +  +  EM          P+  T   +L VCA E  + LG  +HG AV   L+ +
Sbjct: 267 GFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKE 326

Query: 286 SPVANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMIS 345
             + N L+ MYSKC C+  A+ +F      ++VSWN ++ G+   G       + R M++
Sbjct: 327 LVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLA 386

Query: 346 AG--IKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDV 405
            G  +K D +T  + +P       L   KE+H Y ++   V +  + +A +  Y KC  +
Sbjct: 387 GGEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSL 446

Query: 406 EMARKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAF 465
             A++V     S        +I G+  +     +L+    +    + P S T  S+  A 
Sbjct: 447 SYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSAC 506

Query: 466 AGLAALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWN 525
           + L +L LGKE+HG II+N L+    V  +VL +Y  CG L   + +F+ M  K  + WN
Sbjct: 507 SKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWN 566

Query: 526 SMITSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKG 585
           ++IT   QNG P  A+ +FRQM + G Q   +S+     AC+ LP+L  G+E H + +K 
Sbjct: 567 TVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKH 626

Query: 586 PLRSDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALF 645
            L  D +   SLIDMYAK G++  S +VF+ ++EK+  SWN++I  YG HG  KE + LF
Sbjct: 627 LLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLF 686

Query: 646 HEMLRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGR 705
            EM R    PD +TFLG+++AC H+G + EG+RY   M   +G+   ++HYACV D+ GR
Sbjct: 687 EEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGR 746

Query: 706 AGRLDEAFETI-NSMPFPPDAGVWGTLLGACHVHGNVELAE 738
           AG+LD+A   +   M    D G+W +LL +C +H N+E+ E
Sbjct: 747 AGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGE 787

BLAST of Cla97C03G059850 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 406.4 bits (1043), Expect = 6.9e-112
Identity = 221/699 (31.62%), Positives = 374/699 (53.51%), Query Frame = 0

Query: 43  ESLLASIFQACNDHSL-LRQGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLF 102
           E   + + +AC   S+     +Q HA+ +  G+  +  + + ++ +Y R G ++ A+ +F
Sbjct: 186 EGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVF 245

Query: 103 YTLLLGCTSAWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVK 162
             L L   S+W  MI G +       A+  +  M   G+ P  Y F  V+ +C  + S++
Sbjct: 246 DGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLE 305

Query: 163 MGKIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYV 222
           +G+ +H  V  +G   D +V ++L+ LY   G L  A+++F N+ Q+D V +N ++NG  
Sbjct: 306 IGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLS 365

Query: 223 KNGDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSP 282
           + G    A+++F  M     +P+S T A ++  C+++  L  G QLH      G   ++ 
Sbjct: 366 QCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNK 425

Query: 283 VANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGY-VQNGLRSEAEHLFRGMISA 342
           +   LL +Y+KC  ++ A   F      ++V WN ++  Y + + LR+ +  +FR M   
Sbjct: 426 IEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRN-SFRIFRQMQIE 485

Query: 343 GIKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMA 402
            I P+  T+ S L     L  L+  ++IH  I++    L+ ++ S LID+Y K   ++ A
Sbjct: 486 EIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTA 545

Query: 403 RKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGL 462
             +L + +  D V  TTMI+GY     + +AL  FR +L   ++   V   +   A AGL
Sbjct: 546 WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 605

Query: 463 AALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMI 522
            AL  G+++H     +         +A++ +Y++CG+++ +   F +    D I WN+++
Sbjct: 606 QALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALV 665

Query: 523 TSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLR 582
           +   Q+G   EA+ +F +M  EG   +  +   A+ A +    +  GK++H  + K    
Sbjct: 666 SGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYD 725

Query: 583 SDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEM 642
           S+    ++LI MYAKCG+++ + + F  +  KNEVSWN+II+AY  HG   E L  F +M
Sbjct: 726 SETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQM 785

Query: 643 LRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGR 702
           + ++++P+HVT +G++SAC H G VD+GI Y+  M  EYG+  + EHY CV D+  RAG 
Sbjct: 786 IHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGL 845

Query: 703 LDEAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
           L  A E I  MP  PDA VW TLL AC VH N+E+ E A
Sbjct: 846 LSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFA 883

BLAST of Cla97C03G059850 vs. TAIR10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 687.6 bits (1773), Expect = 8.6e-198
Identity = 346/697 (49.64%), Positives = 453/697 (64.99%), Query Frame = 0

Query: 46  LASIFQACNDHSLLRQGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLFYTLL 105
           L+ + QAC++ +LLRQGKQ HA  I++ +  +     RILGMY   GS  D   +FY L 
Sbjct: 38  LSLLLQACSNPNLLRQGKQVHAFLIVNSISGDSYTDERILGMYAMCGSFSDCGKMFYRLD 97

Query: 106 LGCTS--AWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMG 165
           L  +S   WN +I  F   GL N AL FYFKML  GVSPD  TFP ++K+C AL + K  
Sbjct: 98  LRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVSTFPCLVKACVALKNFKGI 157

Query: 166 KIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKN 225
             + +TV+ +G+  + FV SSLIK Y + G++     LFD + QKDCV+WNVMLNGY K 
Sbjct: 158 DFLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKC 217

Query: 226 GDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSPVA 285
           G   + +K F  MR  +  PN+VTF C+LSVCAS+ ++DLG QLHG+ V  G++ +  + 
Sbjct: 218 GALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIK 277

Query: 286 NTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIK 345
           N+LL+MYSKC     A KLF  M + D V+WN +ISGYVQ+GL  E+   F  MIS+G+ 
Sbjct: 278 NSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVL 337

Query: 346 PDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKV 405
           PD+ITF+S LP V++  +L++CK+IH YI+RH++ LD+FL SALID YFKCR V MA+ +
Sbjct: 338 PDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNI 397

Query: 406 LCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAAL 465
             Q +S D VV T MISGY+ NG+  ++LE+FRWL++ ++ P  +T  S+ P    L AL
Sbjct: 398 FSQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLAL 457

Query: 466 NLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSC 525
            LG+ELHG IIK   D +C++G AV+DMYAKCGR++LA  +F R++ +D + WNSMIT C
Sbjct: 458 KLGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRC 517

Query: 526 SQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDL 585
           +Q+  P  AID+FRQMG+ G  YDCVSIS A+SACANLP+  +GK IHGFMIK  L SD+
Sbjct: 518 AQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDV 577

Query: 586 YAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEML-R 645
           Y+ES+LIDMYAKCGNL  +  VF  M+EKN VSWNSII+A GNHG LK+ L LFHEM+ +
Sbjct: 578 YSESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEK 637

Query: 646 NDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLD 705
           + I+PD                                                      
Sbjct: 638 SGIRPDQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 697

Query: 706 EAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
               T+ SMPFPPDAGVWGTLLGAC +H NVELAEVA
Sbjct: 698 XXXXTVKSMPFPPDAGVWGTLLGACRLHKNVELAEVA 734

BLAST of Cla97C03G059850 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 436.4 bits (1121), Expect = 3.5e-122
Identity = 220/590 (37.29%), Positives = 338/590 (57.29%), Query Frame = 0

Query: 150 VIKSCGALNSVKMGKIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKD 209
           +++ C +L  ++    +   V   GL ++ F  + L+ L+   G + +A  +F+ I  K 
Sbjct: 43  LLERCSSLKELRQ---ILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKL 102

Query: 210 CVLWNVMLNGYVKNGDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHG 269
            VL++ ML G+ K  D   A++ F+ MR+ + +P    F  +L VC  EA L +G ++HG
Sbjct: 103 NVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHG 162

Query: 270 IAVSCGLELDSPVANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSE 329
           + V  G  LD      L  MY+KC+ +  ARK+FD MP+ DLVSWN I++GY QNG+   
Sbjct: 163 LLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARM 222

Query: 330 AEHLFRGMISAGIKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALID 389
           A  + + M    +KP  IT  S LP V+ L  +   KEIHGY +R      V + +AL+D
Sbjct: 223 ALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVD 282

Query: 390 IYFKCRDVEMARKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVT 449
           +Y KC  +E AR++       + V   +MI  YV N    EA+ +F+ +L E +KPT V+
Sbjct: 283 MYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVS 342

Query: 450 FASVFPAFAGLAALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMT 509
                 A A L  L  G+ +H   ++  LD    V ++++ MY KC  +D A  +F ++ 
Sbjct: 343 VMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQ 402

Query: 510 GKDAICWNSMITSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKE 569
            +  + WN+MI   +QNGRP +A++ F QM     + D  +    I+A A L   H+ K 
Sbjct: 403 SRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKW 462

Query: 570 IHGFMIKGPLRSDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGD 629
           IHG +++  L  +++  ++L+DMYAKCG +  +R +FDMM E++  +WN++I  YG HG 
Sbjct: 463 IHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGF 522

Query: 630 LKECLALFHEMLRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYA 689
            K  L LF EM +  I+P+ VTFL +ISAC H+G V+ G++ +++M E Y I   M+HY 
Sbjct: 523 GKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYG 582

Query: 690 CVADLFGRAGRLDEAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
            + DL GRAGRL+EA++ I  MP  P   V+G +LGAC +H NV  AE A
Sbjct: 583 AMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKA 629

BLAST of Cla97C03G059850 vs. TAIR10
Match: AT3G63370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 434.5 bits (1116), Expect = 1.3e-121
Identity = 234/696 (33.62%), Positives = 379/696 (54.45%), Query Frame = 0

Query: 47  ASIFQACNDHSLLRQGKQSHAQAIISGVVHNGD-LGSRILGMYVRTGSLEDAKNLFYTLL 106
           A + + C     + QG+Q H++   +      D L  +++ MY + GSL+DA+ +F  + 
Sbjct: 84  AYVLELCGKRRAVSQGRQLHSRIFKTFPSFELDFLAGKLVFMYGKCGSLDDAEKVFDEMP 143

Query: 107 LGCTSAWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVKMGKI 166
                AWN MI  +   G    AL  Y+ M   GV     +FP ++K+C  L  ++ G  
Sbjct: 144 DRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSE 203

Query: 167 VHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQK-DCVLWNVMLNGYVKNG 226
           +H  +  +G     F+ ++L+ +YA N  LS A+ LFD   +K D VLWN +L+ Y  +G
Sbjct: 204 LHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSG 263

Query: 227 DSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLH-GIAVSCGLELDSPVA 286
            S   +++F EM  +   PNS T    L+ C   +   LG ++H  +  S     +  V 
Sbjct: 264 KSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVC 323

Query: 287 NTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMISAGIK 346
           N L+AMY++C  +  A ++   M   D+V+WN +I GYVQN +  EA   F  MI+AG K
Sbjct: 324 NALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHK 383

Query: 347 PDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMARKV 406
            D ++  S +     L +L    E+H Y+++H    ++ + + LID+Y KC       + 
Sbjct: 384 SDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGRA 443

Query: 407 LCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGLAAL 466
             +    D +  TT+I+GY  N  + EALE+FR + ++RM+   +   S+  A + L ++
Sbjct: 444 FLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLKSM 503

Query: 467 NLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMITSC 526
            + KE+H  I++  L +   + + ++D+Y KC  +  A RVF  + GKD + W SMI+S 
Sbjct: 504 LIVKEIHCHILRKGLLDTV-IQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMISSS 563

Query: 527 SQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLRSDL 586
           + NG   EA++LFR+M   G   D V++   +SA A+L AL+ G+EIH ++++     + 
Sbjct: 564 ALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFCLEG 623

Query: 587 YAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEMLRN 646
               +++DMYA CG+L  ++ VFD ++ K  + + S+I+AYG HG  K  + LF +M   
Sbjct: 624 SIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKMRHE 683

Query: 647 DIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGRLDE 706
           ++ PDH++FL ++ AC HAG +DEG  +  +M  EY +    EHY C+ D+ GRA  + E
Sbjct: 684 NVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANCVVE 743

Query: 707 AFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
           AFE +  M   P A VW  LL AC  H   E+ E+A
Sbjct: 744 AFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIA 778

BLAST of Cla97C03G059850 vs. TAIR10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 431.4 bits (1108), Expect = 1.1e-120
Identity = 238/701 (33.95%), Positives = 373/701 (53.21%), Query Frame = 0

Query: 46  LASIFQACNDHSLLRQGKQSHAQAIISGVVHNGD-LGSRILGMYVRTGSLEDAKNLFYTL 105
           L  + QA      +  G++ H     S  + N D L +RI+ MY   GS +D++ +F  L
Sbjct: 87  LGLLLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVFDAL 146

Query: 106 LLGCTSAWNWMIRGFTMMGLCNYALLFYFKMLG-AGVSPDKYTFPYVIKSCGALNSVKMG 165
                  WN +I  ++   L +  L  + +M+    + PD +T+P VIK+C  ++ V +G
Sbjct: 147 RSKNLFQWNAVISSYSRNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGIG 206

Query: 166 KIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYVKN 225
             VH  V   GL ED FVG++L+  Y  +G ++DA  LFD +P+++ V WN M+  +  N
Sbjct: 207 LAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSDN 266

Query: 226 GDSGNAVKIFLEMRHSE----TKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELD 285
           G S  +  +  EM          P+  T   +L VCA E  + LG  +HG AV   L+ +
Sbjct: 267 GFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDKE 326

Query: 286 SPVANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGYVQNGLRSEAEHLFRGMIS 345
             + N L+ MYSKC C+  A+ +F      ++VSWN ++ G+   G       + R M++
Sbjct: 327 LVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQMLA 386

Query: 346 AG--IKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDV 405
            G  +K D +T  + +P       L   KE+H Y ++   V +  + +A +  Y KC  +
Sbjct: 387 GGEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGSL 446

Query: 406 EMARKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAF 465
             A++V     S        +I G+  +     +L+    +    + P S T  S+  A 
Sbjct: 447 SYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSAC 506

Query: 466 AGLAALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWN 525
           + L +L LGKE+HG II+N L+    V  +VL +Y  CG L   + +F+ M  K  + WN
Sbjct: 507 SKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSWN 566

Query: 526 SMITSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKG 585
           ++IT   QNG P  A+ +FRQM + G Q   +S+     AC+ LP+L  G+E H + +K 
Sbjct: 567 TVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALKH 626

Query: 586 PLRSDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALF 645
            L  D +   SLIDMYAK G++  S +VF+ ++EK+  SWN++I  YG HG  KE + LF
Sbjct: 627 LLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLF 686

Query: 646 HEMLRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGR 705
            EM R    PD +TFLG+++AC H+G + EG+RY   M   +G+   ++HYACV D+ GR
Sbjct: 687 EEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGR 746

Query: 706 AGRLDEAFETI-NSMPFPPDAGVWGTLLGACHVHGNVELAE 738
           AG+LD+A   +   M    D G+W +LL +C +H N+E+ E
Sbjct: 747 AGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGE 787

BLAST of Cla97C03G059850 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 406.4 bits (1043), Expect = 3.8e-113
Identity = 221/699 (31.62%), Positives = 374/699 (53.51%), Query Frame = 0

Query: 43  ESLLASIFQACNDHSL-LRQGKQSHAQAIISGVVHNGDLGSRILGMYVRTGSLEDAKNLF 102
           E   + + +AC   S+     +Q HA+ +  G+  +  + + ++ +Y R G ++ A+ +F
Sbjct: 186 EGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVF 245

Query: 103 YTLLLGCTSAWNWMIRGFTMMGLCNYALLFYFKMLGAGVSPDKYTFPYVIKSCGALNSVK 162
             L L   S+W  MI G +       A+  +  M   G+ P  Y F  V+ +C  + S++
Sbjct: 246 DGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLE 305

Query: 163 MGKIVHETVNLMGLKEDAFVGSSLIKLYADNGQLSDAQYLFDNIPQKDCVLWNVMLNGYV 222
           +G+ +H  V  +G   D +V ++L+ LY   G L  A+++F N+ Q+D V +N ++NG  
Sbjct: 306 IGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLS 365

Query: 223 KNGDSGNAVKIFLEMRHSETKPNSVTFACILSVCASEAMLDLGTQLHGIAVSCGLELDSP 282
           + G    A+++F  M     +P+S T A ++  C+++  L  G QLH      G   ++ 
Sbjct: 366 QCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNK 425

Query: 283 VANTLLAMYSKCQCLQAARKLFDTMPQCDLVSWNGIISGY-VQNGLRSEAEHLFRGMISA 342
           +   LL +Y+KC  ++ A   F      ++V WN ++  Y + + LR+ +  +FR M   
Sbjct: 426 IEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRN-SFRIFRQMQIE 485

Query: 343 GIKPDSITFASFLPCVNELLSLKHCKEIHGYIVRHAVVLDVFLKSALIDIYFKCRDVEMA 402
            I P+  T+ S L     L  L+  ++IH  I++    L+ ++ S LID+Y K   ++ A
Sbjct: 486 EIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTA 545

Query: 403 RKVLCQSSSFDTVVCTTMISGYVLNGMNTEALEVFRWLLQERMKPTSVTFASVFPAFAGL 462
             +L + +  D V  TTMI+GY     + +AL  FR +L   ++   V   +   A AGL
Sbjct: 546 WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 605

Query: 463 AALNLGKELHGSIIKNKLDEKCHVGSAVLDMYAKCGRLDLARRVFNRMTGKDAICWNSMI 522
            AL  G+++H     +         +A++ +Y++CG+++ +   F +    D I WN+++
Sbjct: 606 QALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALV 665

Query: 523 TSCSQNGRPGEAIDLFRQMGMEGTQYDCVSISGAISACANLPALHYGKEIHGFMIKGPLR 582
           +   Q+G   EA+ +F +M  EG   +  +   A+ A +    +  GK++H  + K    
Sbjct: 666 SGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYD 725

Query: 583 SDLYAESSLIDMYAKCGNLNFSRRVFDMMQEKNEVSWNSIISAYGNHGDLKECLALFHEM 642
           S+    ++LI MYAKCG+++ + + F  +  KNEVSWN+II+AY  HG   E L  F +M
Sbjct: 726 SETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQM 785

Query: 643 LRNDIQPDHVTFLGIISACGHAGRVDEGIRYYHLMTEEYGIPARMEHYACVADLFGRAGR 702
           + ++++P+HVT +G++SAC H G VD+GI Y+  M  EYG+  + EHY CV D+  RAG 
Sbjct: 786 IHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGL 845

Query: 703 LDEAFETINSMPFPPDAGVWGTLLGACHVHGNVELAEVA 740
           L  A E I  MP  PDA VW TLL AC VH N+E+ E A
Sbjct: 846 LSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFA 883
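The percentages in the HSP summary are simple ratios over the alignment length (699 columns, gaps included). As a rough sketch, the Python below re-derives them from the counts reported for the AT4G13650.1 hit. The function name `parse_hsp_stats` and the variable `HSP_LINE` are illustrative and not part of any BLAST tool.

```python
import re

# Summary statistics as reported for the AT4G13650.1 HSP above.
HSP_LINE = "Identity = 221/699 (31.62%), Positives = 374/699 (53.51%), Query Frame = 0"


def parse_hsp_stats(line):
    """Return {label: (count, alignment_length, percent)} for each 'label = n/m' ratio."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        num, den = int(num), int(den)
        # Percent is recomputed from the counts, not read from the line.
        stats[label] = (num, den, round(100 * num / den, 2))
    return stats


stats = parse_hsp_stats(HSP_LINE)
for label, (count, length, percent) in stats.items():
    print(f"{label}: {count}/{length} = {percent}%")
```

The recomputed values agree with the percentages printed in the summary line, which is a quick consistency check when copying alignment statistics by hand.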

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_008459124.1  0.0e+00  89.58     PREDICTED: pentatricopeptide repeat-containing protein At4g21300 [Cucumis melo]
XP_004135750.2  0.0e+00  88.77     PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Cucumis s...
XP_022142608.1  0.0e+00  85.79     pentatricopeptide repeat-containing protein At4g21300 isoform X1 [Momordica char...
XP_022142610.1  0.0e+00  85.79     pentatricopeptide repeat-containing protein At4g21300 isoform X3 [Momordica char...
XP_022142609.1  0.0e+00  85.79     pentatricopeptide repeat-containing protein At4g21300 isoform X2 [Momordica char...

Match Name                      E-value   Identity  Description
tr|A0A1S3C9G4|A0A1S3C9G4_CUCME  0.0e+00   89.58     pentatricopeptide repeat-containing protein At4g21300 OS=Cucumis melo OX=3656 GN...
tr|A0A0A0LW16|A0A0A0LW16_CUCSA  0.0e+00   88.77     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G553510 PE=4 SV=1
tr|A0A2N9IUG5|A0A2N9IUG5_FAGSY  1.3e-301  69.36     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56050 PE=4 SV=1
tr|A0A2P5ENX2|A0A2P5ENX2_9ROSA  2.3e-298  71.78     DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_169790 ...
tr|A0A2I4F578|A0A2I4F578_9ROSI  1.2e-294  67.76     pentatricopeptide repeat-containing protein At4g21300 OS=Juglans regia OX=51240 ...

Match Name             E-value   Identity  Description
sp|Q9STE1|PP333_ARATH  1.6e-196  49.64     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...
sp|Q3E6Q1|PPR32_ARATH  6.2e-121  37.29     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
sp|Q9M1V3|PP296_ARATH  2.4e-120  33.62     Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...
sp|Q0WN60|PPR48_ARATH  2.0e-119  33.95     Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX...
sp|Q9SVP7|PP307_ARATH  6.9e-112  31.62     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...

Match Name   E-value   Identity  Description
AT4G21300.1  8.6e-198  49.64     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1  3.5e-122  37.29     Pentatricopeptide repeat (PPR) superfamily protein
AT3G63370.1  1.3e-121  33.62     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G18485.1  1.1e-120  33.95     Pentatricopeptide repeat (PPR) superfamily protein
AT4G13650.1  3.8e-113  31.62     Pentatricopeptide repeat (PPR) superfamily protein
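The hit lists are ordered by E-value, which need not match the order by percent identity, because the E-value also reflects alignment length and score. The following Python sketch illustrates this with the five TAIR10 hits. The values are transcribed from the table above, and the variable names are hypothetical.

```python
# (match name, E-value, percent identity) for the TAIR10 hits listed above.
tair10_hits = [
    ("AT4G21300.1", 8.6e-198, 49.64),
    ("AT1G11290.1", 3.5e-122, 37.29),
    ("AT3G63370.1", 1.3e-121, 33.62),
    ("AT1G18485.1", 1.1e-120, 33.95),
    ("AT4G13650.1", 3.8e-113, 31.62),
]

# Smaller E-value means a more significant hit.
by_evalue = sorted(tair10_hits, key=lambda hit: hit[1])
# Higher identity first.
by_identity = sorted(tair10_hits, key=lambda hit: hit[2], reverse=True)

best_hit = by_evalue[0][0]
same_order = [h[0] for h in by_evalue] == [h[0] for h in by_identity]

print("Best hit by E-value:", best_hit)
print("E-value order equals identity order:", same_order)
```

Here AT3G63370.1 has a smaller E-value than AT1G18485.1 despite a slightly lower identity, so the two orderings differ in the middle of the list.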
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
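For downstream use it can help to group the GO assignments by category and to set aside the root terms (GO:0008150 and GO:0005575), whose names equal their category and so carry no specific information. This is a minimal Python sketch that assumes the whitespace-separated row layout shown above; `group_go_terms` is an illustrative helper, not part of any GO library.

```python
from collections import defaultdict

# Rows copied from the GO assignment table above: category, accession, term name.
GO_ROWS = """\
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Term names may contain spaces, so split only on the first two gaps.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_by_category = group_go_terms(GO_ROWS)

# Root terms are those whose name is identical to the category label.
informative = {
    category: [(acc, name) for acc, name in terms if name != category]
    for category, terms in go_by_category.items()
}

for category, terms in informative.items():
    print(category, [acc for acc, _ in terms])
```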

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C03G059850.1  Cla97C03G059850.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 688..711, e-value: 1.0, score: 9.7
    coord: 515..543, e-value: 1.8E-7, score: 30.8
    coord: 487..511, e-value: 0.008, score: 16.3
    coord: 284..309, e-value: 0.56, score: 10.5
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 209..256, e-value: 8.3E-11, score: 41.8
    coord: 411..458, e-value: 1.8E-7, score: 31.1
    coord: 612..659, e-value: 2.3E-12, score: 46.8
    coord: 310..352, e-value: 2.2E-8, score: 34.0
IPR002885    Pentatricopeptide repeat    TIGRFAM    TIGR00756    TIGR00756
    coord: 312..346, e-value: 4.5E-8, score: 30.8
    coord: 650..681, e-value: 8.0E-5, score: 20.6
    coord: 615..648, e-value: 9.1E-10, score: 36.1
    coord: 514..544, e-value: 8.9E-7, score: 26.7
    coord: 211..244, e-value: 1.6E-6, score: 25.9
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 716..739, score: 5.086
    coord: 143..177, score: 5.119
    coord: 613..647, score: 12.858
    coord: 411..445, score: 10.271
    coord: 310..344, score: 12.178
    coord: 481..511, score: 7.333
    coord: 380..410, score: 5.831
    coord: 648..682, score: 8.659
    coord: 108..142, score: 8.232
    coord: 209..243, score: 11.312
    coord: 178..208, score: 7.114
    coord: 279..309, score: 7.015
    coord: 512..546, score: 10.896
    coord: 244..278, score: 6.577
    coord: 582..612, score: 8.265
    coord: 684..714, score: 5.974
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    G3DSA:1.25.40.10
    coord: 161..242, e-value: 5.3E-8, score: 34.5
    coord: 358..461, e-value: 2.7E-14, score: 54.9
    coord: 462..587, e-value: 1.8E-19, score: 71.7
    coord: 25..160, e-value: 4.2E-10, score: 41.2
    coord: 257..357, e-value: 2.3E-19, score: 71.9
    coord: 588..739, e-value: 4.6E-38, score: 133.3
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED
    coord: 411..511
    coord: 329..404
    coord: 499..739
    coord: 38..331
None    No IPR available    PANTHER    PTHR24015:SF388    SUBFAMILY NOT NAMED
    coord: 38..331
    coord: 411..511
    coord: 499..739
    coord: 329..404
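The PROSITE PS51375 hits tile much of the protein as back-to-back repeats. One way to summarize them is to merge directly adjacent or overlapping coordinate ranges into contiguous blocks. The Python sketch below does this for the 16 PROSITE ranges listed above. The `merge_intervals` helper and its `max_gap` parameter are illustrative assumptions, not part of InterProScan.

```python
# PROSITE PS51375 (PPR) coordinates from the InterPro annotation above.
PROSITE_PPR = [
    (716, 739), (143, 177), (613, 647), (411, 445), (310, 344), (481, 511),
    (380, 410), (648, 682), (108, 142), (209, 243), (178, 208), (279, 309),
    (512, 546), (244, 278), (582, 612), (684, 714),
]


def merge_intervals(intervals, max_gap=0):
    """Merge inclusive (start, end) ranges that overlap or lie within max_gap residues."""
    merged = []
    for start, end in sorted(intervals):
        # With max_gap=0, ranges merge only if they overlap or touch (end + 1 == start).
        if merged and start <= merged[-1][1] + 1 + max_gap:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


blocks = merge_intervals(PROSITE_PPR)
covered = sum(end - start + 1 for start, end in blocks)

for start, end in blocks:
    print(f"{start}..{end} ({end - start + 1} residues)")
print("Residues covered by PPR motifs:", covered)
```

With `max_gap=0` the two C-terminal motifs stay separate because a single uncovered residue lies on each side of 684..714; a larger `max_gap` would join them to the preceding block.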

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism               Block
Cla97C03G059850  Watermelon (97103) v2  wmbwmbB014
Cla97C03G059850  Melon (DHL92) v3.5.1   mewmbB157