CSPI01G02030 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI01G02030
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr1: 1327553 .. 1331315 (-)
RNA-Seq ExpressionCSPI01G02030
SyntenyCSPI01G02030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGCTTACCTCACGGTCTAACGTGCCCCCTATTTTCCGTCCAACTTAGTAGGAAAAAAGCTTGACGCGCAGGGTAGAGGTGAAAAATGGCAGAGAAATAGATGAACCTTCAGCCGCCTCCTCTTTCACTTCCTTCTTCAGCTCTAGTCTTTAGTTACTCCTTACAAGTTCTAGGTTTCTCTTTCTCTCTTACCATTCTCTTCTTTTGTTCTGCATAATGAGCTTATTTTGTTTGTACTTTTTTCGTTCATTCATGGCCGGTCATAATTTTGTTGTTGTTATTGTTGTGTTTTTGGAATTGATTATATGGGTTTGTTTCTGCTTTGGATTTGGTTGTATGGGTGTGTTTCTGCTTTGTGATTCGTGGTTGGATGAGTTTGATTGGTGTTTCGATTCTCAGAATGAGTTTTGACCCCTTCTCTGTTCATTTTTGTTTCCAATGAAGTTGAATTTTGACATTTGTATATTGGTTTTTTCAATGTTTTTGATAGGGTAATTGGGGTTTCATGTGTATGTTTGTGTAATTTTTAGTTTTGTGATCGTTTTTGTGTAAGACCTTTGGGTGGCTGAGAATGATAATTGTTTTTTTTTTTTTTTTTTTGTGTTGCTATGTCACTATTCTCACTTGGTGTTAACATGTCAGGAAAAACCTGGTGTAGGGGTAGGAAATGGGAATAGTATGTGGTTTGCATTATTGGAAGTTGGATTGTTTTATGGTAGCAACTGGCTAATTTTGCAACCCTTTTATGGTAGTGATTTGCTTGTTATTACAGATTATGTTGCGAGCAAAGCAGATTGGCAGTCTTTCAAACAGTGCCAGATCATTTTTTCTTAGTGGATCACGATGTAATGCAGACGGGGCTTCGTGCACATGCCCTGAAGATGAAACTTGTGTTTCCGAGAGACAAAATGCTAGAAATGAAACCCTGCCCTCACAAAAGCCATCTACCTTGGTAGCCAATAGTTCACCTAGAGTAGGACCTTTAATTGCCGAAGAAGCAGCAAAAGTAATTGTATCTCACAAAACTGACAATGTTGATCTCTCAGTTTCTATTCGACAAGTTGCGAACACTGGCCCCAATCACCAGAGGGGAGCAGAATGTGTAAGATACGCCAGTGGCCTTAACACTGTTTTGGATGGTGAGTGCACTTCACCAAGGATAGCAGATCAGGTTGTTAAGGCAGGTATTATGGCTGTAAACTTATTCTCTGACTTTGTGAATTTTAAAATCCCCTCATCTGACTATGGTGGAACATTTAGCTCATCCAAGAATTGTATGGTTGATCCTGCCCGGTCCATTACTTCTGTCAAACCTTCAAAAATTAAGCATCTGAGAAGAGAGAATATTTCTAGAGTTCATTCTAGACCATCTGTTGAAATCCCTGTAGATTCTAAGCCCCAAAGTAGTAGTAACCATGGTTCAAATTGCAAGCCCGCACAATCCAGTTATGTCAAAGGCTCAAGGCAAGAAGTTTCCGAGGCCAGAACACAAAAGTTGGTGGTATTTCAAAATATTTCCTCAGACAAGTGTGATAAAAGGAATTTACCACAGAGAACAAGGGTTCATTCAAACAGCTTTACTTCACATTTTCATTCCATTGCACAGACCACAGGGTCAGACTTCACAAATTCTTCTAAGAATTTTAAAAAGTTTCCAGATAATTTAAAAAGCCCCACAGGAATGGCACCAATCACTTCATCGTTTCTGAATGCCCCAAATGTTGTGGAAAGTGTTTCTTGCATATTGCAACAACTTAAATGGGGCCCTGCTGCAGAAGAGGCTATTGGGAAATTGAACTGTTCGATAGATGCTTACCAGGCAAACCAAATTCTGAAGCGGGTAGATGACCATGCTGTTGCTCTTGGGTTCTTTTATTGGTTAAAGCGTCTACCTAGGTTTAGACATGATGGGCACACTTATACTACTATGATTGGCCTCCTTGGTCGTGCCAAACAGTTTGCTGCTATAAATAAATTGCTTGATCAGATGATCAAGGATGGGTGCCAGCCCAATGTTGTAACATATAATCGTATAATTCATAGTTATGGTCGTGCAAACTATTTGCAAGACGCTGTTAATGTATTCAAACAAATGCAGGAAGCAGGATGTGAGCCTGATAGAGTCACCTACTGCACACTCATTGACATTCATGCAAAATCTGGCTTTCTCGATGTTGCCATGGGAATGTATGAGAAGATGCAAGATGCTGGCCTCACTCCCGACACATTTACTTACAGTGTTATGATCAACTGCTTGGGGAAAGCTGGCCATTTAAATGCTGCTCATAGGCTATTCTGCAGGATGGTTGATGAAGGCTGTGTTCCAAATTTGGTAACCTACAATATCATGATTGCTCTTCAAGCAAAAGCAAGGAATTACGAGATTGCATTGAAGCTGTACCGTGATATGCAACAATCAGGTTTTGAGCCAGATAAAGTGACTTACTGCATAGTTATGGAAGTATTAGGTCATTGTGGTTTCCTTGAGGAGGCTGAAGGTATATTTATTGAAATGCAAAAGAAGAACTGGGTGCCTGATGAACCTGTTTATGGTCTATTAGTGGACTTGTGGGGAAAATCTGGTAATGTTCAAAAGGCATGGGAATGGTATCATGCTATGCTTAAGGCGGGTTTAAAGCCGAATGTTCCTACTTGCAATTCCTTGCTTAGTGCCTTTCTTAGGGTACACCAACTATCCGATGCCTATCAGCTGTTGCAATCTATGCTGACTTTTGGTTTAAAACCTTCTCTACAAACTTATACTTTGCTGCTCAGTTGTTGCACTGATGCGCAAACGAATGACATGGGGTTTTGTTGTGAACTCATGCAAGTCACTGGTCACCCAGCACACACATTCCTGGTGTCATTGCCATCGGCTGGACCTAATGGTCAAAATGTGCGGGATCACATGAGCAAATTTTTGGACCTCATGCACAGTGAAGACAGAGAGAGCAAGAGGGGGCTCGTAGATGCAGTTGTAGATTTTCTTCATAAATCAGGACTTAAGGAGGAGGCAGGCTGTGTCTGGGAGGCTGCCATGCAAAAGAATGTCTATCCAGATGCTGTAAAGGAGAAAAGCTCCTGTTATTGGCTCATTAACTTGCACGTCATGTCTGATGGCACCGCCGTAACAGCTTTGTCTAGGACTCTTGCTTGGTTTCGCCAGCAACTACTTCTTTCAGGTGTCGGTCCCAGCCGAATTGATATTGTGACCGGATGGGGTCGGCGAAGTAAGGTCACTGGATCTTCCCTAGTGAGACAGGCAGTTCAGGACCTGCTTAGCATTTTTAGCTTCCCTTTCTTCACTGAAAATGGTAATTCTGGATGTTTTGTGGGGTGTGGGGAGCCTCTAAGTAGATGGTTGCACCAATCTTATGTGGAGAGGATGCATTTGTTGTAGTAGTGCACCTCTTCTTAACTCTCTCTTCATAACCAAAATTCAATTTTGTATGGATTGTATTTAGATTACTTGTTGGATCCAGGTATTTGTAGAAATCTCTCCATATTCAAGTTGAAAGCTCAGCATTTAATTTAAAAGCTGCTTCACAACTTTCTGGAATGGAGTTTGGCTGTATGTTAGTAATTATAAGAACCTTTTACTTTTTAAAGGGATCTATTATTGGTTTGTAGCAAACAGGTTTTTGTCTTTCTATAGTAGGGAACTTTTGCAAAAGATGAATGTAAATTTATTATCTATATGGCCTATGCAAGAATTACATACCTTGATCTAGTTATAAATACAAACTGCTCTCATGGAATTTGC

mRNA sequence

GGGCTTACCTCACGGTCTAACGTGCCCCCTATTTTCCGTCCAACTTAGTAGGAAAAAAGCTTGACGCGCAGGGTAGAGGTGAAAAATGGCAGAGAAATAGATGAACCTTCAGCCGCCTCCTCTTTCACTTCCTTCTTCAGCTCTAGTCTTTAGTTACTCCTTACAAGTTCTAGATTATGTTGCGAGCAAAGCAGATTGGCAGTCTTTCAAACAGTGCCAGATCATTTTTTCTTAGTGGATCACGATGTAATGCAGACGGGGCTTCGTGCACATGCCCTGAAGATGAAACTTGTGTTTCCGAGAGACAAAATGCTAGAAATGAAACCCTGCCCTCACAAAAGCCATCTACCTTGGTAGCCAATAGTTCACCTAGAGTAGGACCTTTAATTGCCGAAGAAGCAGCAAAAGTAATTGTATCTCACAAAACTGACAATGTTGATCTCTCAGTTTCTATTCGACAAGTTGCGAACACTGGCCCCAATCACCAGAGGGGAGCAGAATGTGTAAGATACGCCAGTGGCCTTAACACTGTTTTGGATGGTGAGTGCACTTCACCAAGGATAGCAGATCAGGTTGTTAAGGCAGGTATTATGGCTGTAAACTTATTCTCTGACTTTGTGAATTTTAAAATCCCCTCATCTGACTATGGTGGAACATTTAGCTCATCCAAGAATTGTATGGTTGATCCTGCCCGGTCCATTACTTCTGTCAAACCTTCAAAAATTAAGCATCTGAGAAGAGAGAATATTTCTAGAGTTCATTCTAGACCATCTGTTGAAATCCCTGTAGATTCTAAGCCCCAAAGTAGTAGTAACCATGGTTCAAATTGCAAGCCCGCACAATCCAGTTATGTCAAAGGCTCAAGGCAAGAAGTTTCCGAGGCCAGAACACAAAAGTTGGTGGTATTTCAAAATATTTCCTCAGACAAGTGTGATAAAAGGAATTTACCACAGAGAACAAGGGTTCATTCAAACAGCTTTACTTCACATTTTCATTCCATTGCACAGACCACAGGGTCAGACTTCACAAATTCTTCTAAGAATTTTAAAAAGTTTCCAGATAATTTAAAAAGCCCCACAGGAATGGCACCAATCACTTCATCGTTTCTGAATGCCCCAAATGTTGTGGAAAGTGTTTCTTGCATATTGCAACAACTTAAATGGGGCCCTGCTGCAGAAGAGGCTATTGGGAAATTGAACTGTTCGATAGATGCTTACCAGGCAAACCAAATTCTGAAGCGGGTAGATGACCATGCTGTTGCTCTTGGGTTCTTTTATTGGTTAAAGCGTCTACCTAGGTTTAGACATGATGGGCACACTTATACTACTATGATTGGCCTCCTTGGTCGTGCCAAACAGTTTGCTGCTATAAATAAATTGCTTGATCAGATGATCAAGGATGGGTGCCAGCCCAATGTTGTAACATATAATCGTATAATTCATAGTTATGGTCGTGCAAACTATTTGCAAGACGCTGTTAATGTATTCAAACAAATGCAGGAAGCAGGATGTGAGCCTGATAGAGTCACCTACTGCACACTCATTGACATTCATGCAAAATCTGGCTTTCTCGATGTTGCCATGGGAATGTATGAGAAGATGCAAGATGCTGGCCTCACTCCCGACACATTTACTTACAGTGTTATGATCAACTGCTTGGGGAAAGCTGGCCATTTAAATGCTGCTCATAGGCTATTCTGCAGGATGGTTGATGAAGGCTGTGTTCCAAATTTGGTAACCTACAATATCATGATTGCTCTTCAAGCAAAAGCAAGGAATTACGAGATTGCATTGAAGCTGTACCGTGATATGCAACAATCAGGTTTTGAGCCAGATAAAGTGACTTACTGCATAGTTATGGAAGTATTAGGTCATTGTGGTTTCCTTGAGGAGGCTGAAGGTATATTTATTGAAATGCAAAAGAAGAACTGGGTGCCTGATGAACCTGTTTATGGTCTATTAGTGGACTTGTGGGGAAAATCTGGTAATGTTCAAAAGGCATGGGAATGGTATCATGCTATGCTTAAGGCGGGTTTAAAGCCGAATGTTCCTACTTGCAATTCCTTGCTTAGTGCCTTTCTTAGGGTACACCAACTATCCGATGCCTATCAGCTGTTGCAATCTATGCTGACTTTTGGTTTAAAACCTTCTCTACAAACTTATACTTTGCTGCTCAGTTGTTGCACTGATGCGCAAACGAATGACATGGGGTTTTGTTGTGAACTCATGCAAGTCACTGGTCACCCAGCACACACATTCCTGGTGTCATTGCCATCGGCTGGACCTAATGGTCAAAATGTGCGGGATCACATGAGCAAATTTTTGGACCTCATGCACAGTGAAGACAGAGAGAGCAAGAGGGGGCTCGTAGATGCAGTTGTAGATTTTCTTCATAAATCAGGACTTAAGGAGGAGGCAGGCTGTGTCTGGGAGGCTGCCATGCAAAAGAATGTCTATCCAGATGCTGTAAAGGAGAAAAGCTCCTGTTATTGGCTCATTAACTTGCACGTCATGTCTGATGGCACCGCCGTAACAGCTTTGTCTAGGACTCTTGCTTGGTTTCGCCAGCAACTACTTCTTTCAGGTGTCGGTCCCAGCCGAATTGATATTGTGACCGGATGGGGTCGGCGAAGTAAGGTCACTGGATCTTCCCTAGTGAGACAGGCAGTTCAGGACCTGCTTAGCATTTTTAGCTTCCCTTTCTTCACTGAAAATGGTAATTCTGGATGTTTTGTGGGGTGTGGGGAGCCTCTAAGTAGATGGTTGCACCAATCTTATGTGGAGAGGATGCATTTGTTGTAGTAGTGCACCTCTTCTTAACTCTCTCTTCATAACCAAAATTCAATTTTGTATGGATTGTATTTAGATTACTTGTTGGATCCAGGTATTTGTAGAAATCTCTCCATATTCAAGTTGAAAGCTCAGCATTTAATTTAAAAGCTGCTTCACAACTTTCTGGAATGGAGTTTGGCTGTATGTTAGTAATTATAAGAACCTTTTACTTTTTAAAGGGATCTATTATTGGTTTGTAGCAAACAGGTTTTTGTCTTTCTATAGTAGGGAACTTTTGCAAAAGATGAATGTAAATTTATTATCTATATGGCCTATGCAAGAATTACATACCTTGATCTAGTTATAAATACAAACTGCTCTCATGGAATTTGC

Coding sequence (CDS)

ATGTTGCGAGCAAAGCAGATTGGCAGTCTTTCAAACAGTGCCAGATCATTTTTTCTTAGTGGATCACGATGTAATGCAGACGGGGCTTCGTGCACATGCCCTGAAGATGAAACTTGTGTTTCCGAGAGACAAAATGCTAGAAATGAAACCCTGCCCTCACAAAAGCCATCTACCTTGGTAGCCAATAGTTCACCTAGAGTAGGACCTTTAATTGCCGAAGAAGCAGCAAAAGTAATTGTATCTCACAAAACTGACAATGTTGATCTCTCAGTTTCTATTCGACAAGTTGCGAACACTGGCCCCAATCACCAGAGGGGAGCAGAATGTGTAAGATACGCCAGTGGCCTTAACACTGTTTTGGATGGTGAGTGCACTTCACCAAGGATAGCAGATCAGGTTGTTAAGGCAGGTATTATGGCTGTAAACTTATTCTCTGACTTTGTGAATTTTAAAATCCCCTCATCTGACTATGGTGGAACATTTAGCTCATCCAAGAATTGTATGGTTGATCCTGCCCGGTCCATTACTTCTGTCAAACCTTCAAAAATTAAGCATCTGAGAAGAGAGAATATTTCTAGAGTTCATTCTAGACCATCTGTTGAAATCCCTGTAGATTCTAAGCCCCAAAGTAGTAGTAACCATGGTTCAAATTGCAAGCCCGCACAATCCAGTTATGTCAAAGGCTCAAGGCAAGAAGTTTCCGAGGCCAGAACACAAAAGTTGGTGGTATTTCAAAATATTTCCTCAGACAAGTGTGATAAAAGGAATTTACCACAGAGAACAAGGGTTCATTCAAACAGCTTTACTTCACATTTTCATTCCATTGCACAGACCACAGGGTCAGACTTCACAAATTCTTCTAAGAATTTTAAAAAGTTTCCAGATAATTTAAAAAGCCCCACAGGAATGGCACCAATCACTTCATCGTTTCTGAATGCCCCAAATGTTGTGGAAAGTGTTTCTTGCATATTGCAACAACTTAAATGGGGCCCTGCTGCAGAAGAGGCTATTGGGAAATTGAACTGTTCGATAGATGCTTACCAGGCAAACCAAATTCTGAAGCGGGTAGATGACCATGCTGTTGCTCTTGGGTTCTTTTATTGGTTAAAGCGTCTACCTAGGTTTAGACATGATGGGCACACTTATACTACTATGATTGGCCTCCTTGGTCGTGCCAAACAGTTTGCTGCTATAAATAAATTGCTTGATCAGATGATCAAGGATGGGTGCCAGCCCAATGTTGTAACATATAATCGTATAATTCATAGTTATGGTCGTGCAAACTATTTGCAAGACGCTGTTAATGTATTCAAACAAATGCAGGAAGCAGGATGTGAGCCTGATAGAGTCACCTACTGCACACTCATTGACATTCATGCAAAATCTGGCTTTCTCGATGTTGCCATGGGAATGTATGAGAAGATGCAAGATGCTGGCCTCACTCCCGACACATTTACTTACAGTGTTATGATCAACTGCTTGGGGAAAGCTGGCCATTTAAATGCTGCTCATAGGCTATTCTGCAGGATGGTTGATGAAGGCTGTGTTCCAAATTTGGTAACCTACAATATCATGATTGCTCTTCAAGCAAAAGCAAGGAATTACGAGATTGCATTGAAGCTGTACCGTGATATGCAACAATCAGGTTTTGAGCCAGATAAAGTGACTTACTGCATAGTTATGGAAGTATTAGGTCATTGTGGTTTCCTTGAGGAGGCTGAAGGTATATTTATTGAAATGCAAAAGAAGAACTGGGTGCCTGATGAACCTGTTTATGGTCTATTAGTGGACTTGTGGGGAAAATCTGGTAATGTTCAAAAGGCATGGGAATGGTATCATGCTATGCTTAAGGCGGGTTTAAAGCCGAATGTTCCTACTTGCAATTCCTTGCTTAGTGCCTTTCTTAGGGTACACCAACTATCCGATGCCTATCAGCTGTTGCAATCTATGCTGACTTTTGGTTTAAAACCTTCTCTACAAACTTATACTTTGCTGCTCAGTTGTTGCACTGATGCGCAAACGAATGACATGGGGTTTTGTTGTGAACTCATGCAAGTCACTGGTCACCCAGCACACACATTCCTGGTGTCATTGCCATCGGCTGGACCTAATGGTCAAAATGTGCGGGATCACATGAGCAAATTTTTGGACCTCATGCACAGTGAAGACAGAGAGAGCAAGAGGGGGCTCGTAGATGCAGTTGTAGATTTTCTTCATAAATCAGGACTTAAGGAGGAGGCAGGCTGTGTCTGGGAGGCTGCCATGCAAAAGAATGTCTATCCAGATGCTGTAAAGGAGAAAAGCTCCTGTTATTGGCTCATTAACTTGCACGTCATGTCTGATGGCACCGCCGTAACAGCTTTGTCTAGGACTCTTGCTTGGTTTCGCCAGCAACTACTTCTTTCAGGTGTCGGTCCCAGCCGAATTGATATTGTGACCGGATGGGGTCGGCGAAGTAAGGTCACTGGATCTTCCCTAGTGAGACAGGCAGTTCAGGACCTGCTTAGCATTTTTAGCTTCCCTTTCTTCACTGAAAATGGTAATTCTGGATGTTTTGTGGGGTGTGGGGAGCCTCTAAGTAGATGGTTGCACCAATCTTATGTGGAGAGGATGCATTTGTTGTAG

Protein sequence

MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLVANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL*
Homology
BLAST of CSPI01G02030 vs. ExPASy Swiss-Prot
Match: Q8GYP6 (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX=3702 GN=At1g18900 PE=2 SV=1)

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 542/879 (61.66%), Postives = 672/879 (76.45%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           M+RAK I +LS++ARSFFL+GSR +  DG SC   +DE CVS+RQ  R E   ++K  + 
Sbjct: 1   MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASG-LNT 120
           +      VG ++  E  K +V  K D+      + Q  ++ P     +  V YAS  +  
Sbjct: 61  ILPKPSVVGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVRE 120

Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGG-TFSSSKNCMVDPARSITS 180
            ++G+ +S  I DQ+ KAGI+AVN  SD  N KIPS D G   F   K+CMVDP R I+S
Sbjct: 121 EVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPISS 180

Query: 181 VKPSKIKHLRRENISRVHSRPSV-EIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
           VK S +K +RRE+ ++++ R +  E  V +    SSN     +  ++ +VKG RQ VS +
Sbjct: 181 VKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQ-VSNS 240

Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
              K +   N +  K  + ++ QR  + SN F            S F+NSS       + 
Sbjct: 241 VVGKSLPTTNNTYGK--RTSVLQRPHIDSNRFVP----------SGFSNSS------VEM 300

Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
           +K P+G A  +  + N+ ++VE+VS +L++ +WGPAAEEA+  L   IDAYQANQ+LK++
Sbjct: 301 MKGPSGTALTSRQYCNSGHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQM 360

Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
           +D+  ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF AINKLLD+M++DGCQPN VT
Sbjct: 361 NDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT 420

Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
           YNR+IHSYGRANYL +A+NVF QMQEAGC+PDRVTYCTLIDIHAK+GFLD+AM MY++MQ
Sbjct: 421 YNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQ 480

Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
             GL+PDTFTYSV+INCLGKAGHL AAH+LFC MVD+GC PNLVTYNIM+ L AKARNY+
Sbjct: 481 AGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQ 540

Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
            ALKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE +F EMQ+KNW+PDEPVYGLLV
Sbjct: 541 NALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPVYGLLV 600

Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
           DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLS FLRV+++++AY+LLQ+ML  GL+
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLR 660

Query: 661 PSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
           PSLQTYTLLLSCCTD ++  DMGFC +LM  TGHPAH FL+ +P+AGP+G+NVR+H + F
Sbjct: 661 PSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNF 720

Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
           LDLMHSEDRESKRGLVDAVVDFLHKSG KEEAG VWE A QKNV+PDA++EKS  YWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLIN 780

Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
           LHVMS+GTAVTALSRTLAWFR+Q+L SG  PSRIDIVTGWGRRS+VTG+S+VRQAV++LL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELL 840

Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           +IF  PFFTE+GNSGCFVG GEPL+RWL QS+VERMHLL
Sbjct: 841 NIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHVERMHLL 860

BLAST of CSPI01G02030 vs. ExPASy Swiss-Prot
Match: Q9SSF9 (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX=3702 GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 1069.7 bits (2765), Expect = 1.8e-311
Identity = 547/880 (62.16%), Postives = 662/880 (75.23%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           M+RAK I +LS+SARSFFLSGSR + ADG SCTC EDE+ VS+RQ  R E + + K ++ 
Sbjct: 1   MIRAKHISNLSSSARSFFLSGSRPSAADGNSCTCAEDESGVSKRQQIRTEVVQTGKRASN 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
           +A  +   G ++  EA K +V  KT       S+     + P     A+ V +AS +   
Sbjct: 61  LA--AGLAGSILPVEAGKPLVVPKTVEHFTRPSLLPQHVSSPALPGKADSVNHASAIIK- 120

Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
              E     I DQ+ KAGI  VNL SD  N+KIP SD        K+CMVDP R I+ VK
Sbjct: 121 ---EDVGVPIGDQIFKAGIGNVNLLSDIANYKIPLSDGTEVVGLPKSCMVDPTRPISGVK 180

Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
            S +K +RRE++++V+ R +  +P++S P                   G++Q  ++   +
Sbjct: 181 SSNVKVIRREHLAKVYPRSADRVPINSSP-------------------GTKQASNDVAGK 240

Query: 241 KLVVFQNISSDKCDKRN-LPQRTRVHSNSFTSH--FHSIAQTTGSDFTNSSKNF-KKFPD 300
                  +S++   KR  +PQR    S  + S    +S+  +      +S + F K   +
Sbjct: 241 SFEAHDLLSNNVSGKRKIMPQRPYTDSTRYASGGCDYSVHSSDDRTIISSVEGFGKPSRE 300

Query: 301 NLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKR 360
            +K     AP      N   VVE+VS IL++ KWG AAEEA+      +DAYQANQ+LK+
Sbjct: 301 MMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAEEALHNFGFRMDAYQANQVLKQ 360

Query: 361 VDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVV 420
           +D++A ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF  INKLLD+M++DGC+PN V
Sbjct: 361 MDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTV 420

Query: 421 TYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKM 480
           TYNR+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLD+AM MY++M
Sbjct: 421 TYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRM 480

Query: 481 QDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNY 540
           Q+AGL+PDTFTYSV+INCLGKAGHL AAHRLFC MV +GC PNLVT+NIMIAL AKARNY
Sbjct: 481 QEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNY 540

Query: 541 EIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLL 600
           E ALKLYRDMQ +GF+PDKVTY IVMEVLGHCGFLEEAEG+F EMQ+KNWVPDEPVYGLL
Sbjct: 541 ETALKLYRDMQNAGFQPDKVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWVPDEPVYGLL 600

Query: 601 VDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGL 660
           VDLWGK+GNV KAW+WY AML+AGL+PNVPTCNSLLS FLRVH++S+AY LLQSML  GL
Sbjct: 601 VDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGL 660

Query: 661 KPSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSK 720
            PSLQTYTLLLSCCTDA++N DMGFC +LM V+GHPAH FL+ +P AGP+GQ VRDH+S 
Sbjct: 661 HPSLQTYTLLLSCCTDARSNFDMGFCGQLMAVSGHPAHMFLLKMPPAGPDGQKVRDHVSN 720

Query: 721 FLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLI 780
           FLD MHSEDRESKRGL+DAVVDFLHKSGLKEEAG VWE A  KNVYPDA++EKS  YWLI
Sbjct: 721 FLDFMHSEDRESKRGLMDAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLI 780

Query: 781 NLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDL 840
           NLHVMS+GTAV ALSRTLAWFR+Q+L+SG  PSRIDIVTGWGRRS+VTG+S+VRQAV++L
Sbjct: 781 NLHVMSEGTAVIALSRTLAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTGTSMVRQAVEEL 840

Query: 841 LSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           L+IF+FPFFTENGNSGCFVG GEPL  WL +SYVERMHLL
Sbjct: 841 LNIFNFPFFTENGNSGCFVGSGEPLKNWLLESYVERMHLL 855

BLAST of CSPI01G02030 vs. ExPASy Swiss-Prot
Match: Q9SIC9 (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 2.9e-48
Identity = 129/530 (24.34%), Postives = 240/530 (45.28%), Query Frame = 0

Query: 374 RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDA 433
           R   D  +Y T++  + +  Q     ++L QM      PNVV+Y+ +I  + +A    +A
Sbjct: 369 RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEA 428

Query: 434 VNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINC 493
           +N+F +M+  G   DRV+Y TL+ I+ K G  + A+ +  +M   G+  D  TY+ ++  
Sbjct: 429 LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488

Query: 494 LGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPD 553
            GK G  +   ++F  M  E  +PNL+TY+ +I   +K   Y+ A++++R+ + +G   D
Sbjct: 489 YGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 548

Query: 554 KVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYH 613
            V Y  +++ L   G +  A  +  EM K+   P+   Y  ++D +G+S  + ++ ++ +
Sbjct: 549 VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSN 608

Query: 614 AMLKAGLKPNVPTCNSLLSAFLR----------------------------VHQLSDAYQ 673
                    ++P  +S LSA                               + +LS   +
Sbjct: 609 G-------GSLPFSSSALSALTETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILE 668

Query: 674 LLQSMLTFGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPN 733
           + + M    +KP++ T++ +L+ C+   +  D     E +++  +  +  +  L      
Sbjct: 669 VFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMG--Q 728

Query: 734 GQNVRDHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAV 793
            +NV        D ++  D  +     +A+ D L   G K  A  V      + V+ +  
Sbjct: 729 RENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVW 788

Query: 794 KEKSSCYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGS 853
            +  SC   ++LH+MS G A   +   L   R  +      P  + I+TGWG+ SKV G 
Sbjct: 789 SD--SC---LDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGD 848

Query: 854 SLVRQAVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
             +R+AV+ LL     PF     N G F   G  ++ WL +S   ++ +L
Sbjct: 849 GALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLIL 884

BLAST of CSPI01G02030 vs. ExPASy Swiss-Prot
Match: Q9SAK0 (Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=EMB2217 PE=2 SV=1)

HSP 1 Score: 186.8 bits (473), Expect = 1.0e-45
Identity = 121/441 (27.44%), Postives = 218/441 (49.43%), Query Frame = 0

Query: 378 DGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVF 437
           DG TY  +I  L ++ +  A  KL  QM +   +P+   ++ ++ S G+A  L  ++ V+
Sbjct: 312 DGSTYELIIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFSSLVDSMGKAGRLDTSMKVY 371

Query: 438 KQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKA 497
            +MQ  G  P    + +LID +AK+G LD A+ ++++M+ +G  P+   Y+++I    K+
Sbjct: 372 MEMQGFGHRPSATMFVSLIDSYAKAGKLDTALRLWDEMKKSGFRPNFGLYTMIIESHAKS 431

Query: 498 GHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTY 557
           G L  A  +F  M   G +P   TY+ ++ + A +   + A+K+Y  M  +G  P   +Y
Sbjct: 432 GKLEVAMTVFKDMEKAGFLPTPSTYSCLLEMHAGSGQVDSAMKIYNSMTNAGLRPGLSSY 491

Query: 558 CIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLK 617
             ++ +L +   ++ A  I +EM+   +  D     +L+ ++ K  +V  A +W   M  
Sbjct: 492 ISLLTLLANKRLVDVAGKILLEMKAMGYSVDVCASDVLM-IYIKDASVDLALKWLRFMGS 551

Query: 618 AGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLLSCCTDAQTNDM 677
           +G+K N      L  + ++      A  LL++++    K  L  YT +L+     Q  D 
Sbjct: 552 SGIKTNNFIIRQLFESCMKNGLYDSARPLLETLVHSAGKVDLVLYTSILAHLVRCQDEDK 611

Query: 678 -GFCCELMQVTGHPAHTFLVSLPSAGP--NGQNVRDHMSKFLDLMHSEDRE-SKRGLVDA 737
                 ++  T H AH F+  L   GP    Q V   + +F   +  E  E + R  V+ 
Sbjct: 612 ERQLMSILSATKHKAHAFMCGL-FTGPEQRKQPVLTFVREFYQGIDYELEEGAARYFVNV 671

Query: 738 VVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMSDGTAVTALSRTLA 797
           ++++L   G    A CVW+ A +  ++P A+       W +++  +S G A+ A+  TL 
Sbjct: 672 LLNYLVLMGQINRARCVWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLH 731

Query: 798 WFRQQLLLSGVGPSRIDIVTG 815
            FR+++L  GV P RI +VTG
Sbjct: 732 RFRKRMLYYGVVPRRIKLVTG 750

BLAST of CSPI01G02030 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 1.3e-43
Identity = 113/395 (28.61%), Postives = 192/395 (48.61%), Query Frame = 0

Query: 274 SIAQTTGSDFTNSSKNFKKFPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAA 333
           S+     SDF+ S       PD   S      +T    + P+   S S            
Sbjct: 60  SVVSMKSSDFSGSMIRKSSKPDLSSS----EEVTRGLKSFPDTDSSFSYF---------- 119

Query: 334 EEAIGKLNCSIDAYQANQILK--RVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGR 393
           +   G LN        N +L+  RVD     + + + L +    + D +TY T+   L  
Sbjct: 120 KSVAGNLNLVHTTETCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSV 179

Query: 394 AKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVT 453
                     L +M + G   N  +YN +IH   ++ +  +A+ V+++M   G  P   T
Sbjct: 180 KGGLKQAPYALRKMREFGFVLNAYSYNGLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQT 239

Query: 454 YCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMV 513
           Y +L+    K   +D  MG+ ++M+  GL P+ +T+++ I  LG+AG +N A+ +  RM 
Sbjct: 240 YSSLMVGLGKRRDIDSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKINEAYEILKRMD 299

Query: 514 DEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLE 573
           DEGC P++VTY ++I     AR  + A +++  M+    +PD+VTY  +++       L+
Sbjct: 300 DEGCGPDVVTYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLD 359

Query: 574 EAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLL 633
             +  + EM+K   VPD   + +LVD   K+GN  +A++    M   G+ PN+ T N+L+
Sbjct: 360 SVKQFWSEMEKDGHVPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLI 419

Query: 634 SAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLL 667
              LRVH+L DA +L  +M + G+KP+  TY + +
Sbjct: 420 CGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFI 440

BLAST of CSPI01G02030 vs. ExPASy TrEMBL
Match: A0A0A0LRL7 (Smr domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G008500 PE=3 SV=1)

HSP 1 Score: 1775.4 bits (4597), Expect = 0.0e+00
Identity = 874/874 (100.00%), Postives = 874/874 (100.00%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
           MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV
Sbjct: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60

Query: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
           ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL
Sbjct: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120

Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
           DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180

Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
           SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240

Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
           LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP
Sbjct: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300

Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
           TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360

Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
           VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420

Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
           IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480

Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
           TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540

Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
           LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600

Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
           KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660

Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
           TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720

Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
           SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780

Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
           DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840

Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874

BLAST of CSPI01G02030 vs. ExPASy TrEMBL
Match: A0A5D3BK75 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G002310 PE=3 SV=1)

HSP 1 Score: 1741.1 bits (4508), Expect = 0.0e+00
Identity = 856/874 (97.94%), Postives = 865/874 (98.97%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
           MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVS+RQNARNETLPSQKPSTLV
Sbjct: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSQRQNARNETLPSQKPSTLV 60

Query: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
           ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQV NTGPNHQRGAECVRY+SGLNTVL
Sbjct: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVTNTGPNHQRGAECVRYSSGLNTVL 120

Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
           DGEC+SPRIADQVVKAGIMAVNLFSDFVNFKIP SDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECSSPRIADQVVKAGIMAVNLFSDFVNFKIPLSDYGGTFSSSKNCMVDPARSITSVKP 180

Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
           SKIKHLRRENISRVHSRPSVE  VDSKPQSSSNHGSNCKPAQSSYVKGSRQEVS+ARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVETHVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSKARTQK 240

Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
            VVFQ+ISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD T+SSKN KKFPDNLKSP
Sbjct: 241 SVVFQDISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDLTSSSKNLKKFPDNLKSP 300

Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
           TGMAPI SSFLN+PNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPINSSFLNSPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360

Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
           VALGFFYWLKRL RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLARFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420

Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
           IHSYGRANYLQ+AVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQEAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480

Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
           TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYEIALK 540

Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
           LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600

Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
           KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660

Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
           TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720

Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
           SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780

Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
           DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840

Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874

BLAST of CSPI01G02030 vs. ExPASy TrEMBL
Match: A0A1S3BVJ8 (pentatricopeptide repeat-containing protein At1g18900 OS=Cucumis melo OX=3656 GN=LOC103493965 PE=3 SV=1)

HSP 1 Score: 1741.1 bits (4508), Expect = 0.0e+00
Identity = 856/874 (97.94%), Postives = 865/874 (98.97%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
           MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVS+RQNARNETLPSQKPSTLV
Sbjct: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSQRQNARNETLPSQKPSTLV 60

Query: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
           ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQV NTGPNHQRGAECVRY+SGLNTVL
Sbjct: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVTNTGPNHQRGAECVRYSSGLNTVL 120

Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
           DGEC+SPRIADQVVKAGIMAVNLFSDFVNFKIP SDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECSSPRIADQVVKAGIMAVNLFSDFVNFKIPLSDYGGTFSSSKNCMVDPARSITSVKP 180

Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
           SKIKHLRRENISRVHSRPSVE  VDSKPQSSSNHGSNCKPAQSSYVKGSRQEVS+ARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVETHVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSKARTQK 240

Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
            VVFQ+ISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD T+SSKN KKFPDNLKSP
Sbjct: 241 SVVFQDISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDLTSSSKNLKKFPDNLKSP 300

Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
           TGMAPI SSFLN+PNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPINSSFLNSPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360

Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
           VALGFFYWLKRL RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLARFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420

Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
           IHSYGRANYLQ+AVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQEAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480

Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
           TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYEIALK 540

Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
           LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600

Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
           KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660

Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
           TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720

Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
           SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780

Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
           DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840

Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874

BLAST of CSPI01G02030 vs. ExPASy TrEMBL
Match: A0A6J1C013 (pentatricopeptide repeat-containing protein At1g18900 OS=Momordica charantia OX=3673 GN=LOC111007135 PE=3 SV=1)

HSP 1 Score: 1575.1 bits (4077), Expect = 0.0e+00
Identity = 775/879 (88.17%), Postives = 821/879 (93.40%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           MLRAKQIGSLS+SARSFFLSGSRCN ADG+SCTC EDETCVS+RQNAR E LPS KPSTL
Sbjct: 1   MLRAKQIGSLSSSARSFFLSGSRCNGADGSSCTCSEDETCVSQRQNARIEILPSSKPSTL 60

Query: 61  VA--NSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLN 120
           VA  NSS R+G LIAE+AAKVIVSHKTD VDLS+++R V NTGP+ QRG ECVRYASGLN
Sbjct: 61  VARPNSSARLGTLIAEDAAKVIVSHKTDKVDLSIAVRPVTNTGPSPQRGPECVRYASGLN 120

Query: 121 TVLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITS 180
           TVLD ECTSP+IADQ VKAGI+AVNLFSDFVNFK+P SDYGGTFSSSKNCMVDPARSITS
Sbjct: 121 TVLDDECTSPKIADQFVKAGIVAVNLFSDFVNFKVPLSDYGGTFSSSKNCMVDPARSITS 180

Query: 181 VKPSKIKHLRRENISRVHSRPSVEIPVDSKPQ-SSSNHGSNCKPAQSSYVKGSRQEVSEA 240
           VKPSK+KHLRRENIS VHS+PSV+IPVDSKPQ SSS+HG  CK  +S+YVKG +Q V EA
Sbjct: 181 VKPSKVKHLRRENISSVHSKPSVDIPVDSKPQSSSSHHGPKCKSEKSNYVKGLKQ-VPEA 240

Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
           RT+K VVF N+SSDKCDKR LPQR+R+H NSFTSHFHS AQT GS+FTNSSKN  K PDN
Sbjct: 241 RTRKPVVFHNMSSDKCDKRILPQRSRIHLNSFTSHFHSNAQTMGSEFTNSSKNLNKLPDN 300

Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
           +KS  GMAP T    +  + VESV CILQQLKWGP AEEA+GKLNCSID YQANQ+LKR+
Sbjct: 301 IKSSMGMAPTTMQLSSTSHAVESVFCILQQLKWGPTAEEALGKLNCSIDVYQANQVLKRL 360

Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
           DD++VALGFF WLKRLPRFRHDGHTYTTMIGLLGRAKQF AINKLLDQM+KDGCQPNVVT
Sbjct: 361 DDYSVALGFFNWLKRLPRFRHDGHTYTTMIGLLGRAKQFGAINKLLDQMVKDGCQPNVVT 420

Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
           YNRIIHSYGRANYLQ+AV+VFKQMQEAGCEPDRVTYCTLIDIHAKSGFLD+AMGMYE+MQ
Sbjct: 421 YNRIIHSYGRANYLQEAVDVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDIAMGMYERMQ 480

Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
           +AGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYE
Sbjct: 481 EAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYE 540

Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
           IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAE IFIEMQKKNWVPDEPVYGLLV
Sbjct: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAESIFIEMQKKNWVPDEPVYGLLV 600

Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
           DLWGKSGNVQKAWEWYH ML AGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSML FGLK
Sbjct: 601 DLWGKSGNVQKAWEWYHVMLNAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLHFGLK 660

Query: 661 PSLQTYTLLLSCCTDAQ-TNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
           PSLQTYTLLLSCCTDAQ TNDMGFCCELMQ+TGHPAHTFLVSLPSAGPNGQNVRDHM+ F
Sbjct: 661 PSLQTYTLLLSCCTDAQSTNDMGFCCELMQITGHPAHTFLVSLPSAGPNGQNVRDHMNTF 720

Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
           LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKS+CYWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSNCYWLIN 780

Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
           LHVMS+GTAVTALSRTLAWFRQQ+L SGV PSRIDIVTGWGRRS+VTGSSLVRQAVQDLL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRQQMLHSGVSPSRIDIVTGWGRRSRVTGSSLVRQAVQDLL 840

Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           +IFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 NIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 878

BLAST of CSPI01G02030 vs. ExPASy TrEMBL
Match: A0A5N6RSC0 (Smr domain-containing protein OS=Carpinus fangiana OX=176857 GN=FH972_019013 PE=3 SV=1)

HSP 1 Score: 1253.4 bits (3242), Expect = 0.0e+00
Identity = 625/885 (70.62%), Postives = 728/885 (82.26%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           MLRAK IG+LSNSARSFFL+GSRC+ ADG SCTCPEDETCVS RQ+ RNE L +QKPSTL
Sbjct: 1   MLRAKHIGNLSNSARSFFLNGSRCSAADGNSCTCPEDETCVSRRQSRRNEVLLAQKPSTL 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLN-T 120
           V+ SS RVG L++EE+ KV+ S K  NVD    ++QV  + P+  R ++CV YA+G++ T
Sbjct: 61  VSTSSGRVGTLVSEESVKVLGSQKAKNVDHPSPLKQVV-SAPSSLRRSDCVSYATGIDAT 120

Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSV 180
             D   +SP IADQ VKAGI  VN  SD VN+K+P S   G  +S  NCMVDP R ++S+
Sbjct: 121 QKDVVQSSPLIADQFVKAGIATVNFLSDLVNYKLPLSGGDGLLNSPGNCMVDPTRPLSSI 180

Query: 181 KPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEART 240
           K S ++H++REN S VH R S +I   S   +++ H +  K  +S++VK S + V  A T
Sbjct: 181 KSSNVRHIKRENFSSVHPRSSPQIAAGSN-HTTNAHETKGKGDKSNFVK-SLKHVPYAGT 240

Query: 241 QKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNS--------SKNF 300
              V    ISSD  +K+  PQR R +SN FTS+++   QT+ ++F  S        S+ F
Sbjct: 241 GNSVATHGISSDAPEKKTAPQRPRANSNRFTSNYNLNMQTSDAEFVGSNSRGFNRHSRGF 300

Query: 301 KKFPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQAN 360
            K P +     G+API    +N  + V SV  ILQQLKWGPAAE+A+G L C +DA+QAN
Sbjct: 301 NKPPSDTSVAAGIAPIKRQIVNPGHAVGSVYQILQQLKWGPAAEKALGDLRCPMDAFQAN 360

Query: 361 QILKRVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGC 420
           QILK++ DH+VALGFF WLKR P F+HDGHTYTTM+G+LGRA+QF AINKLLDQM+KDGC
Sbjct: 361 QILKQLQDHSVALGFFCWLKRQPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVKDGC 420

Query: 421 QPNVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMG 480
           QPNVVTYNR+IHSYGRANYL++A+ VF QMQEAGCEPDRVTYCTLIDIHAKSGFLDVAM 
Sbjct: 421 QPNVVTYNRLIHSYGRANYLKEALKVFNQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMC 480

Query: 481 MYEKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQA 540
           MYE+MQ+AGL+PDTFTYSV+INCLGKAG+L AA  LFC M  +GCVPNLVTYNIMIALQA
Sbjct: 481 MYERMQEAGLSPDTFTYSVIINCLGKAGNLTAAQTLFCEMRGQGCVPNLVTYNIMIALQA 540

Query: 541 KARNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEP 600
           KARNYE ALKLYRDMQ +GFEPDKV+Y IVMEVLGHCG+LEEAE +F+EM++KNWVPDEP
Sbjct: 541 KARNYETALKLYRDMQNAGFEPDKVSYSIVMEVLGHCGYLEEAEAVFVEMKRKNWVPDEP 600

Query: 601 VYGLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSM 660
           VYGLLVDLWGK+GNV+KAWEWY AML AGL+PNVPTCNSLLSAFLRVH+LSDAY LLQSM
Sbjct: 601 VYGLLVDLWGKAGNVEKAWEWYQAMLYAGLRPNVPTCNSLLSAFLRVHRLSDAYNLLQSM 660

Query: 661 LTFGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVR 720
           +  GL PSLQTYTLLLSC T+AQ+  DM FCCELM +TGHPAHTFL+S+P+AGP+GQNVR
Sbjct: 661 VGLGLNPSLQTYTLLLSCTTEAQSPYDMSFCCELMTITGHPAHTFLLSMPAAGPDGQNVR 720

Query: 721 DHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSS 780
           DH+SKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAG VWE A QKNVYPDAVKEKSS
Sbjct: 721 DHVSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSS 780

Query: 781 CYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQ 840
           CYWLINLHVMSDGTAVTALSRTLAWFRQQ+L+SG+GPSRIDIVTGWGRRS+VTGSS+VRQ
Sbjct: 781 CYWLINLHVMSDGTAVTALSRTLAWFRQQMLMSGIGPSRIDIVTGWGRRSRVTGSSMVRQ 840

Query: 841 AVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           AVQ+LL+IF FPFFTENGNSGCFVGCGEPL+RWLHQSYVERMHLL
Sbjct: 841 AVQELLNIFRFPFFTENGNSGCFVGCGEPLNRWLHQSYVERMHLL 882

BLAST of CSPI01G02030 vs. NCBI nr
Match: XP_004138146.1 (pentatricopeptide repeat-containing protein At1g18900 [Cucumis sativus] >KGN63644.1 hypothetical protein Csa_014149 [Cucumis sativus])

HSP 1 Score: 1775.4 bits (4597), Expect = 0.0e+00
Identity = 874/874 (100.00%), Postives = 874/874 (100.00%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
           MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV
Sbjct: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60

Query: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
           ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL
Sbjct: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120

Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
           DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180

Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
           SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240

Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
           LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP
Sbjct: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300

Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
           TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360

Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
           VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420

Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
           IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480

Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
           TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540

Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
           LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600

Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
           KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660

Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
           TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720

Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
           SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780

Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
           DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840

Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874

BLAST of CSPI01G02030 vs. NCBI nr
Match: XP_008453170.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g18900 [Cucumis melo] >KAA0057920.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ98608.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1741.1 bits (4508), Expect = 0.0e+00
Identity = 856/874 (97.94%), Postives = 865/874 (98.97%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
           MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVS+RQNARNETLPSQKPSTLV
Sbjct: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSQRQNARNETLPSQKPSTLV 60

Query: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
           ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQV NTGPNHQRGAECVRY+SGLNTVL
Sbjct: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVTNTGPNHQRGAECVRYSSGLNTVL 120

Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
           DGEC+SPRIADQVVKAGIMAVNLFSDFVNFKIP SDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECSSPRIADQVVKAGIMAVNLFSDFVNFKIPLSDYGGTFSSSKNCMVDPARSITSVKP 180

Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
           SKIKHLRRENISRVHSRPSVE  VDSKPQSSSNHGSNCKPAQSSYVKGSRQEVS+ARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVETHVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSKARTQK 240

Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
            VVFQ+ISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD T+SSKN KKFPDNLKSP
Sbjct: 241 SVVFQDISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDLTSSSKNLKKFPDNLKSP 300

Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
           TGMAPI SSFLN+PNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPINSSFLNSPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360

Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
           VALGFFYWLKRL RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLARFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420

Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
           IHSYGRANYLQ+AVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQEAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480

Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
           TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYEIALK 540

Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
           LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600

Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
           KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660

Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
           TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720

Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
           SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780

Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
           DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840

Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874

BLAST of CSPI01G02030 vs. NCBI nr
Match: XP_038878936.1 (pentatricopeptide repeat-containing protein At1g18900-like [Benincasa hispida])

HSP 1 Score: 1667.1 bits (4316), Expect = 0.0e+00
Identity = 819/875 (93.60%), Postives = 841/875 (96.11%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
           MLRAK IGSLSN+ARSFFLSGSRCNADG SCTCPEDETCVS+RQNARNE LPSQKPSTLV
Sbjct: 1   MLRAKHIGSLSNNARSFFLSGSRCNADGTSCTCPEDETCVSQRQNARNEILPSQKPSTLV 60

Query: 61  ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
           ANSSPRVGPL+AEEAAKVI SHKTDNVDL VSIRQV  TGP+HQRGAECVRYASGLNTVL
Sbjct: 61  ANSSPRVGPLVAEEAAKVIASHKTDNVDLPVSIRQVTKTGPSHQRGAECVRYASGLNTVL 120

Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
           DGECTSP IADQVVKAGI+AVNLF+DFVNFK+P SDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECTSPYIADQVVKAGIVAVNLFTDFVNFKVPLSDYGGTFSSSKNCMVDPARSITSVKP 180

Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSN-HGSNCKPAQSSYVKGSRQEVSEARTQ 240
           SKIK LRRENIS VHSRPSVEIPVDSKPQ+SSN HG NCK  QS+YVKGS+Q V E R Q
Sbjct: 181 SKIKQLRRENISSVHSRPSVEIPVDSKPQNSSNHHGPNCKAVQSNYVKGSKQ-VPEVRPQ 240

Query: 241 KLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKS 300
           K VVF NISSDKCDKR  PQRTRVHSNSFTSHFHS AQTTGS+FTNSS N KK PDNLKS
Sbjct: 241 KSVVFHNISSDKCDKRTPPQRTRVHSNSFTSHFHSHAQTTGSEFTNSSMNLKKLPDNLKS 300

Query: 301 PTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDH 360
            TG+AP T SFLN P+VVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDH
Sbjct: 301 STGIAPTTPSFLNGPHVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDH 360

Query: 361 AVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNR 420
           +VALGFFYWLKRL RFRHDGHTYTTMIGLLGRAKQFAAIN+LLDQMIKDGCQPNVVTYNR
Sbjct: 361 SVALGFFYWLKRLARFRHDGHTYTTMIGLLGRAKQFAAINRLLDQMIKDGCQPNVVTYNR 420

Query: 421 IIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAG 480
           IIHSYGRANYLQ+AVNVFKQM EAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ+AG
Sbjct: 421 IIHSYGRANYLQEAVNVFKQMHEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQEAG 480

Query: 481 LTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIAL 540
           LTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYEIAL
Sbjct: 481 LTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYEIAL 540

Query: 541 KLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLW 600
           KLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQ KNWVPDEPVYGLLVDLW
Sbjct: 541 KLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQNKNWVPDEPVYGLLVDLW 600

Query: 601 GKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSL 660
           GKSGNVQKAWEWYHAML+AGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSL
Sbjct: 601 GKSGNVQKAWEWYHAMLEAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSL 660

Query: 661 QTYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLM 720
           QTYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLM
Sbjct: 661 QTYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLM 720

Query: 721 HSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVM 780
           HSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVM
Sbjct: 721 HSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVM 780

Query: 781 SDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFS 840
           SDGTAVTALSRTLAWFRQQ+LLSGVGP+RIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFS
Sbjct: 781 SDGTAVTALSRTLAWFRQQMLLSGVGPNRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFS 840

Query: 841 FPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           FPFFTENGNSGCFVGCGEPLSRWLH+SYVERMHLL
Sbjct: 841 FPFFTENGNSGCFVGCGEPLSRWLHESYVERMHLL 874

BLAST of CSPI01G02030 vs. NCBI nr
Match: XP_022135050.1 (pentatricopeptide repeat-containing protein At1g18900 [Momordica charantia])

HSP 1 Score: 1575.1 bits (4077), Expect = 0.0e+00
Identity = 775/879 (88.17%), Postives = 821/879 (93.40%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           MLRAKQIGSLS+SARSFFLSGSRCN ADG+SCTC EDETCVS+RQNAR E LPS KPSTL
Sbjct: 1   MLRAKQIGSLSSSARSFFLSGSRCNGADGSSCTCSEDETCVSQRQNARIEILPSSKPSTL 60

Query: 61  VA--NSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLN 120
           VA  NSS R+G LIAE+AAKVIVSHKTD VDLS+++R V NTGP+ QRG ECVRYASGLN
Sbjct: 61  VARPNSSARLGTLIAEDAAKVIVSHKTDKVDLSIAVRPVTNTGPSPQRGPECVRYASGLN 120

Query: 121 TVLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITS 180
           TVLD ECTSP+IADQ VKAGI+AVNLFSDFVNFK+P SDYGGTFSSSKNCMVDPARSITS
Sbjct: 121 TVLDDECTSPKIADQFVKAGIVAVNLFSDFVNFKVPLSDYGGTFSSSKNCMVDPARSITS 180

Query: 181 VKPSKIKHLRRENISRVHSRPSVEIPVDSKPQ-SSSNHGSNCKPAQSSYVKGSRQEVSEA 240
           VKPSK+KHLRRENIS VHS+PSV+IPVDSKPQ SSS+HG  CK  +S+YVKG +Q V EA
Sbjct: 181 VKPSKVKHLRRENISSVHSKPSVDIPVDSKPQSSSSHHGPKCKSEKSNYVKGLKQ-VPEA 240

Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
           RT+K VVF N+SSDKCDKR LPQR+R+H NSFTSHFHS AQT GS+FTNSSKN  K PDN
Sbjct: 241 RTRKPVVFHNMSSDKCDKRILPQRSRIHLNSFTSHFHSNAQTMGSEFTNSSKNLNKLPDN 300

Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
           +KS  GMAP T    +  + VESV CILQQLKWGP AEEA+GKLNCSID YQANQ+LKR+
Sbjct: 301 IKSSMGMAPTTMQLSSTSHAVESVFCILQQLKWGPTAEEALGKLNCSIDVYQANQVLKRL 360

Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
           DD++VALGFF WLKRLPRFRHDGHTYTTMIGLLGRAKQF AINKLLDQM+KDGCQPNVVT
Sbjct: 361 DDYSVALGFFNWLKRLPRFRHDGHTYTTMIGLLGRAKQFGAINKLLDQMVKDGCQPNVVT 420

Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
           YNRIIHSYGRANYLQ+AV+VFKQMQEAGCEPDRVTYCTLIDIHAKSGFLD+AMGMYE+MQ
Sbjct: 421 YNRIIHSYGRANYLQEAVDVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDIAMGMYERMQ 480

Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
           +AGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYE
Sbjct: 481 EAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYE 540

Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
           IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAE IFIEMQKKNWVPDEPVYGLLV
Sbjct: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAESIFIEMQKKNWVPDEPVYGLLV 600

Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
           DLWGKSGNVQKAWEWYH ML AGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSML FGLK
Sbjct: 601 DLWGKSGNVQKAWEWYHVMLNAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLHFGLK 660

Query: 661 PSLQTYTLLLSCCTDAQ-TNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
           PSLQTYTLLLSCCTDAQ TNDMGFCCELMQ+TGHPAHTFLVSLPSAGPNGQNVRDHM+ F
Sbjct: 661 PSLQTYTLLLSCCTDAQSTNDMGFCCELMQITGHPAHTFLVSLPSAGPNGQNVRDHMNTF 720

Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
           LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKS+CYWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSNCYWLIN 780

Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
           LHVMS+GTAVTALSRTLAWFRQQ+L SGV PSRIDIVTGWGRRS+VTGSSLVRQAVQDLL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRQQMLHSGVSPSRIDIVTGWGRRSRVTGSSLVRQAVQDLL 840

Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           +IFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 NIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 878

BLAST of CSPI01G02030 vs. NCBI nr
Match: KAE8124103.1 (hypothetical protein FH972_019013 [Carpinus fangiana])

HSP 1 Score: 1253.4 bits (3242), Expect = 0.0e+00
Identity = 625/885 (70.62%), Postives = 728/885 (82.26%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           MLRAK IG+LSNSARSFFL+GSRC+ ADG SCTCPEDETCVS RQ+ RNE L +QKPSTL
Sbjct: 1   MLRAKHIGNLSNSARSFFLNGSRCSAADGNSCTCPEDETCVSRRQSRRNEVLLAQKPSTL 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLN-T 120
           V+ SS RVG L++EE+ KV+ S K  NVD    ++QV  + P+  R ++CV YA+G++ T
Sbjct: 61  VSTSSGRVGTLVSEESVKVLGSQKAKNVDHPSPLKQVV-SAPSSLRRSDCVSYATGIDAT 120

Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSV 180
             D   +SP IADQ VKAGI  VN  SD VN+K+P S   G  +S  NCMVDP R ++S+
Sbjct: 121 QKDVVQSSPLIADQFVKAGIATVNFLSDLVNYKLPLSGGDGLLNSPGNCMVDPTRPLSSI 180

Query: 181 KPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEART 240
           K S ++H++REN S VH R S +I   S   +++ H +  K  +S++VK S + V  A T
Sbjct: 181 KSSNVRHIKRENFSSVHPRSSPQIAAGSN-HTTNAHETKGKGDKSNFVK-SLKHVPYAGT 240

Query: 241 QKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNS--------SKNF 300
              V    ISSD  +K+  PQR R +SN FTS+++   QT+ ++F  S        S+ F
Sbjct: 241 GNSVATHGISSDAPEKKTAPQRPRANSNRFTSNYNLNMQTSDAEFVGSNSRGFNRHSRGF 300

Query: 301 KKFPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQAN 360
            K P +     G+API    +N  + V SV  ILQQLKWGPAAE+A+G L C +DA+QAN
Sbjct: 301 NKPPSDTSVAAGIAPIKRQIVNPGHAVGSVYQILQQLKWGPAAEKALGDLRCPMDAFQAN 360

Query: 361 QILKRVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGC 420
           QILK++ DH+VALGFF WLKR P F+HDGHTYTTM+G+LGRA+QF AINKLLDQM+KDGC
Sbjct: 361 QILKQLQDHSVALGFFCWLKRQPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVKDGC 420

Query: 421 QPNVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMG 480
           QPNVVTYNR+IHSYGRANYL++A+ VF QMQEAGCEPDRVTYCTLIDIHAKSGFLDVAM 
Sbjct: 421 QPNVVTYNRLIHSYGRANYLKEALKVFNQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMC 480

Query: 481 MYEKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQA 540
           MYE+MQ+AGL+PDTFTYSV+INCLGKAG+L AA  LFC M  +GCVPNLVTYNIMIALQA
Sbjct: 481 MYERMQEAGLSPDTFTYSVIINCLGKAGNLTAAQTLFCEMRGQGCVPNLVTYNIMIALQA 540

Query: 541 KARNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEP 600
           KARNYE ALKLYRDMQ +GFEPDKV+Y IVMEVLGHCG+LEEAE +F+EM++KNWVPDEP
Sbjct: 541 KARNYETALKLYRDMQNAGFEPDKVSYSIVMEVLGHCGYLEEAEAVFVEMKRKNWVPDEP 600

Query: 601 VYGLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSM 660
           VYGLLVDLWGK+GNV+KAWEWY AML AGL+PNVPTCNSLLSAFLRVH+LSDAY LLQSM
Sbjct: 601 VYGLLVDLWGKAGNVEKAWEWYQAMLYAGLRPNVPTCNSLLSAFLRVHRLSDAYNLLQSM 660

Query: 661 LTFGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVR 720
           +  GL PSLQTYTLLLSC T+AQ+  DM FCCELM +TGHPAHTFL+S+P+AGP+GQNVR
Sbjct: 661 VGLGLNPSLQTYTLLLSCTTEAQSPYDMSFCCELMTITGHPAHTFLLSMPAAGPDGQNVR 720

Query: 721 DHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSS 780
           DH+SKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAG VWE A QKNVYPDAVKEKSS
Sbjct: 721 DHVSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSS 780

Query: 781 CYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQ 840
           CYWLINLHVMSDGTAVTALSRTLAWFRQQ+L+SG+GPSRIDIVTGWGRRS+VTGSS+VRQ
Sbjct: 781 CYWLINLHVMSDGTAVTALSRTLAWFRQQMLMSGIGPSRIDIVTGWGRRSRVTGSSMVRQ 840

Query: 841 AVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           AVQ+LL+IF FPFFTENGNSGCFVGCGEPL+RWLHQSYVERMHLL
Sbjct: 841 AVQELLNIFRFPFFTENGNSGCFVGCGEPLNRWLHQSYVERMHLL 882

BLAST of CSPI01G02030 vs. TAIR 10
Match: AT1G18900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 542/879 (61.66%), Postives = 672/879 (76.45%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           M+RAK I +LS++ARSFFL+GSR +  DG SC   +DE CVS+RQ  R E   ++K  + 
Sbjct: 1   MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASG-LNT 120
           +      VG ++  E  K +V  K D+      + Q  ++ P     +  V YAS  +  
Sbjct: 61  ILPKPSVVGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVRE 120

Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGG-TFSSSKNCMVDPARSITS 180
            ++G+ +S  I DQ+ KAGI+AVN  SD  N KIPS D G   F   K+CMVDP R I+S
Sbjct: 121 EVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPISS 180

Query: 181 VKPSKIKHLRRENISRVHSRPSV-EIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
           VK S +K +RRE+ ++++ R +  E  V +    SSN     +  ++ +VKG RQ VS +
Sbjct: 181 VKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQ-VSNS 240

Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
              K +   N +  K  + ++ QR  + SN F            S F+NSS       + 
Sbjct: 241 VVGKSLPTTNNTYGK--RTSVLQRPHIDSNRFVP----------SGFSNSS------VEM 300

Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
           +K P+G A  +  + N+ ++VE+VS +L++ +WGPAAEEA+  L   IDAYQANQ+LK++
Sbjct: 301 MKGPSGTALTSRQYCNSGHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQM 360

Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
           +D+  ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF AINKLLD+M++DGCQPN VT
Sbjct: 361 NDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT 420

Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
           YNR+IHSYGRANYL +A+NVF QMQEAGC+PDRVTYCTLIDIHAK+GFLD+AM MY++MQ
Sbjct: 421 YNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQ 480

Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
             GL+PDTFTYSV+INCLGKAGHL AAH+LFC MVD+GC PNLVTYNIM+ L AKARNY+
Sbjct: 481 AGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQ 540

Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
            ALKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE +F EMQ+KNW+PDEPVYGLLV
Sbjct: 541 NALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPVYGLLV 600

Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
           DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLS FLRV+++++AY+LLQ+ML  GL+
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLR 660

Query: 661 PSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
           PSLQTYTLLLSCCTD ++  DMGFC +LM  TGHPAH FL+ +P+AGP+G+NVR+H + F
Sbjct: 661 PSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNF 720

Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
           LDLMHSEDRESKRGLVDAVVDFLHKSG KEEAG VWE A QKNV+PDA++EKS  YWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLIN 780

Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
           LHVMS+GTAVTALSRTLAWFR+Q+L SG  PSRIDIVTGWGRRS+VTG+S+VRQAV++LL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELL 840

Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           +IF  PFFTE+GNSGCFVG GEPL+RWL QS+VERMHLL
Sbjct: 841 NIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHVERMHLL 860

BLAST of CSPI01G02030 vs. TAIR 10
Match: AT1G18900.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 542/879 (61.66%), Postives = 672/879 (76.45%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           M+RAK I +LS++ARSFFL+GSR +  DG SC   +DE CVS+RQ  R E   ++K  + 
Sbjct: 1   MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASG-LNT 120
           +      VG ++  E  K +V  K D+      + Q  ++ P     +  V YAS  +  
Sbjct: 61  ILPKPSVVGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVRE 120

Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGG-TFSSSKNCMVDPARSITS 180
            ++G+ +S  I DQ+ KAGI+AVN  SD  N KIPS D G   F   K+CMVDP R I+S
Sbjct: 121 EVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPISS 180

Query: 181 VKPSKIKHLRRENISRVHSRPSV-EIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
           VK S +K +RRE+ ++++ R +  E  V +    SSN     +  ++ +VKG RQ VS +
Sbjct: 181 VKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQ-VSNS 240

Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
              K +   N +  K  + ++ QR  + SN F            S F+NSS       + 
Sbjct: 241 VVGKSLPTTNNTYGK--RTSVLQRPHIDSNRFVP----------SGFSNSS------VEM 300

Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
           +K P+G A  +  + N+ ++VE+VS +L++ +WGPAAEEA+  L   IDAYQANQ+LK++
Sbjct: 301 MKGPSGTALTSRQYCNSGHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQM 360

Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
           +D+  ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF AINKLLD+M++DGCQPN VT
Sbjct: 361 NDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT 420

Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
           YNR+IHSYGRANYL +A+NVF QMQEAGC+PDRVTYCTLIDIHAK+GFLD+AM MY++MQ
Sbjct: 421 YNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQ 480

Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
             GL+PDTFTYSV+INCLGKAGHL AAH+LFC MVD+GC PNLVTYNIM+ L AKARNY+
Sbjct: 481 AGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQ 540

Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
            ALKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE +F EMQ+KNW+PDEPVYGLLV
Sbjct: 541 NALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPVYGLLV 600

Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
           DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLS FLRV+++++AY+LLQ+ML  GL+
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLR 660

Query: 661 PSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
           PSLQTYTLLLSCCTD ++  DMGFC +LM  TGHPAH FL+ +P+AGP+G+NVR+H + F
Sbjct: 661 PSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNF 720

Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
           LDLMHSEDRESKRGLVDAVVDFLHKSG KEEAG VWE A QKNV+PDA++EKS  YWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLIN 780

Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
           LHVMS+GTAVTALSRTLAWFR+Q+L SG  PSRIDIVTGWGRRS+VTG+S+VRQAV++LL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELL 840

Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           +IF  PFFTE+GNSGCFVG GEPL+RWL QS+VERMHLL
Sbjct: 841 NIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHVERMHLL 860

BLAST of CSPI01G02030 vs. TAIR 10
Match: AT1G74750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 1069.7 bits (2765), Expect = 1.3e-312
Identity = 547/880 (62.16%), Postives = 662/880 (75.23%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           M+RAK I +LS+SARSFFLSGSR + ADG SCTC EDE+ VS+RQ  R E + + K ++ 
Sbjct: 1   MIRAKHISNLSSSARSFFLSGSRPSAADGNSCTCAEDESGVSKRQQIRTEVVQTGKRASN 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
           +A  +   G ++  EA K +V  KT       S+     + P     A+ V +AS +   
Sbjct: 61  LA--AGLAGSILPVEAGKPLVVPKTVEHFTRPSLLPQHVSSPALPGKADSVNHASAIIK- 120

Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
              E     I DQ+ KAGI  VNL SD  N+KIP SD        K+CMVDP R I+ VK
Sbjct: 121 ---EDVGVPIGDQIFKAGIGNVNLLSDIANYKIPLSDGTEVVGLPKSCMVDPTRPISGVK 180

Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
            S +K +RRE++++V+ R +  +P++S P                   G++Q  ++   +
Sbjct: 181 SSNVKVIRREHLAKVYPRSADRVPINSSP-------------------GTKQASNDVAGK 240

Query: 241 KLVVFQNISSDKCDKRN-LPQRTRVHSNSFTSH--FHSIAQTTGSDFTNSSKNF-KKFPD 300
                  +S++   KR  +PQR    S  + S    +S+  +      +S + F K   +
Sbjct: 241 SFEAHDLLSNNVSGKRKIMPQRPYTDSTRYASGGCDYSVHSSDDRTIISSVEGFGKPSRE 300

Query: 301 NLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKR 360
            +K     AP      N   VVE+VS IL++ KWG AAEEA+      +DAYQANQ+LK+
Sbjct: 301 MMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAEEALHNFGFRMDAYQANQVLKQ 360

Query: 361 VDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVV 420
           +D++A ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF  INKLLD+M++DGC+PN V
Sbjct: 361 MDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTV 420

Query: 421 TYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKM 480
           TYNR+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLD+AM MY++M
Sbjct: 421 TYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRM 480

Query: 481 QDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNY 540
           Q+AGL+PDTFTYSV+INCLGKAGHL AAHRLFC MV +GC PNLVT+NIMIAL AKARNY
Sbjct: 481 QEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNY 540

Query: 541 EIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLL 600
           E ALKLYRDMQ +GF+PDKVTY IVMEVLGHCGFLEEAEG+F EMQ+KNWVPDEPVYGLL
Sbjct: 541 ETALKLYRDMQNAGFQPDKVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWVPDEPVYGLL 600

Query: 601 VDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGL 660
           VDLWGK+GNV KAW+WY AML+AGL+PNVPTCNSLLS FLRVH++S+AY LLQSML  GL
Sbjct: 601 VDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGL 660

Query: 661 KPSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSK 720
            PSLQTYTLLLSCCTDA++N DMGFC +LM V+GHPAH FL+ +P AGP+GQ VRDH+S 
Sbjct: 661 HPSLQTYTLLLSCCTDARSNFDMGFCGQLMAVSGHPAHMFLLKMPPAGPDGQKVRDHVSN 720

Query: 721 FLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLI 780
           FLD MHSEDRESKRGL+DAVVDFLHKSGLKEEAG VWE A  KNVYPDA++EKS  YWLI
Sbjct: 721 FLDFMHSEDRESKRGLMDAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLI 780

Query: 781 NLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDL 840
           NLHVMS+GTAV ALSRTLAWFR+Q+L+SG  PSRIDIVTGWGRRS+VTG+S+VRQAV++L
Sbjct: 781 NLHVMSEGTAVIALSRTLAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTGTSMVRQAVEEL 840

Query: 841 LSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
           L+IF+FPFFTENGNSGCFVG GEPL  WL +SYVERMHLL
Sbjct: 841 LNIFNFPFFTENGNSGCFVGSGEPLKNWLLESYVERMHLL 855

BLAST of CSPI01G02030 vs. TAIR 10
Match: AT1G18900.3 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 1062.0 bits (2745), Expect = 2.6e-310
Identity = 535/873 (61.28%), Postives = 666/873 (76.29%), Query Frame = 0

Query: 1   MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
           M+RAK I +LS++ARSFFL+GSR +  DG SC   +DE CVS+RQ  R E   ++K  + 
Sbjct: 1   MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60

Query: 61  VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASG-LNT 120
           +      VG ++  E  K +V  K D+      + Q  ++ P     +  V YAS  +  
Sbjct: 61  ILPKPSVVGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVRE 120

Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGG-TFSSSKNCMVDPARSITS 180
            ++G+ +S  I DQ+ KAGI+AVN  SD  N KIPS D G   F   K+CMVDP R I+S
Sbjct: 121 EVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPISS 180

Query: 181 VKPSKIKHLRRENISRVHSRPSV-EIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
           VK S +K +RRE+ ++++ R +  E  V +    SSN     +  ++ +VKG RQ VS +
Sbjct: 181 VKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQ-VSNS 240

Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
              K +   N +  K  + ++ QR  + SN F            S F+NSS       + 
Sbjct: 241 VVGKSLPTTNNTYGK--RTSVLQRPHIDSNRFVP----------SGFSNSS------VEM 300

Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
           +K P+G A  +  + N+ ++VE+VS +L++ +WGPAAEEA+  L   IDAYQANQ+LK++
Sbjct: 301 MKGPSGTALTSRQYCNSGHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQM 360

Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
           +D+  ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF AINKLLD+M++DGCQPN VT
Sbjct: 361 NDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT 420

Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
           YNR+IHSYGRANYL +A+NVF QMQEAGC+PDRVTYCTLIDIHAK+GFLD+AM MY++MQ
Sbjct: 421 YNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQ 480

Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
             GL+PDTFTYSV+INCLGKAGHL AAH+LFC MVD+GC PNLVTYNIM+ L AKARNY+
Sbjct: 481 AGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQ 540

Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
            ALKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE +F EMQ+KNW+PDEPVYGLLV
Sbjct: 541 NALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPVYGLLV 600

Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
           DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLS FLRV+++++AY+LLQ+ML  GL+
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLR 660

Query: 661 PSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
           PSLQTYTLLLSCCTD ++  DMGFC +LM  TGHPAH FL+ +P+AGP+G+NVR+H + F
Sbjct: 661 PSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNF 720

Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
           LDLMHSEDRESKRGLVDAVVDFLHKSG KEEAG VWE A QKNV+PDA++EKS  YWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLIN 780

Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
           LHVMS+GTAVTALSRTLAWFR+Q+L SG  PSRIDIVTGWGRRS+VTG+S+VRQAV++LL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELL 840

Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYV 869
           +IF  PFFTE+GNSGCFVG GEPL+RWL QS++
Sbjct: 841 NIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHL 854

BLAST of CSPI01G02030 vs. TAIR 10
Match: AT2G31400.1 (genomes uncoupled 1 )

HSP 1 Score: 195.3 bits (495), Expect = 2.1e-49
Identity = 129/530 (24.34%), Postives = 240/530 (45.28%), Query Frame = 0

Query: 374 RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDA 433
           R   D  +Y T++  + +  Q     ++L QM      PNVV+Y+ +I  + +A    +A
Sbjct: 369 RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEA 428

Query: 434 VNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINC 493
           +N+F +M+  G   DRV+Y TL+ I+ K G  + A+ +  +M   G+  D  TY+ ++  
Sbjct: 429 LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488

Query: 494 LGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPD 553
            GK G  +   ++F  M  E  +PNL+TY+ +I   +K   Y+ A++++R+ + +G   D
Sbjct: 489 YGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 548

Query: 554 KVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYH 613
            V Y  +++ L   G +  A  +  EM K+   P+   Y  ++D +G+S  + ++ ++ +
Sbjct: 549 VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSN 608

Query: 614 AMLKAGLKPNVPTCNSLLSAFLR----------------------------VHQLSDAYQ 673
                    ++P  +S LSA                               + +LS   +
Sbjct: 609 G-------GSLPFSSSALSALTETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILE 668

Query: 674 LLQSMLTFGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPN 733
           + + M    +KP++ T++ +L+ C+   +  D     E +++  +  +  +  L      
Sbjct: 669 VFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMG--Q 728

Query: 734 GQNVRDHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAV 793
            +NV        D ++  D  +     +A+ D L   G K  A  V      + V+ +  
Sbjct: 729 RENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVW 788

Query: 794 KEKSSCYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGS 853
            +  SC   ++LH+MS G A   +   L   R  +      P  + I+TGWG+ SKV G 
Sbjct: 789 SD--SC---LDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGD 848

Query: 854 SLVRQAVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
             +R+AV+ LL     PF     N G F   G  ++ WL +S   ++ +L
Sbjct: 849 GALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLIL 884

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8GYP60.0e+0061.66Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana OX... [more]
Q9SSF91.8e-31162.16Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana OX... [more]
Q9SIC92.9e-4824.34Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop... [more]
Q9SAK01.0e-4527.44Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidop... [more]
Q9SZ521.3e-4328.61Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LRL70.0e+00100.00Smr domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G008500 PE=3 SV... [more]
A0A5D3BK750.0e+0097.94Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BVJ80.0e+0097.94pentatricopeptide repeat-containing protein At1g18900 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1C0130.0e+0088.17pentatricopeptide repeat-containing protein At1g18900 OS=Momordica charantia OX=... [more]
A0A5N6RSC00.0e+0070.62Smr domain-containing protein OS=Carpinus fangiana OX=176857 GN=FH972_019013 PE=... [more]
Match NameE-valueIdentityDescription
XP_004138146.10.0e+00100.00pentatricopeptide repeat-containing protein At1g18900 [Cucumis sativus] >KGN6364... [more]
XP_008453170.10.0e+0097.94PREDICTED: pentatricopeptide repeat-containing protein At1g18900 [Cucumis melo] ... [more]
XP_038878936.10.0e+0093.60pentatricopeptide repeat-containing protein At1g18900-like [Benincasa hispida][more]
XP_022135050.10.0e+0088.17pentatricopeptide repeat-containing protein At1g18900 [Momordica charantia][more]
KAE8124103.10.0e+0070.62hypothetical protein FH972_019013 [Carpinus fangiana][more]
Match NameE-valueIdentityDescription
AT1G18900.10.0e+0061.66Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.20.0e+0061.66Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G74750.11.3e-31262.16Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G18900.32.6e-31061.28Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G31400.12.1e-4924.34genomes uncoupled 1 [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002625Smr domainSMARTSM00463SMR_2coord: 771..857
e-value: 5.5E-15
score: 65.8
IPR002625Smr domainPROSITEPS50828SMRcoord: 774..855
score: 15.899639
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 626..658
e-value: 6.4E-5
score: 20.9
coord: 485..518
e-value: 1.1E-10
score: 39.0
coord: 450..484
e-value: 5.0E-7
score: 27.5
coord: 380..414
e-value: 7.2E-7
score: 27.0
coord: 555..589
e-value: 2.5E-6
score: 25.3
coord: 591..624
e-value: 2.1E-5
score: 22.4
coord: 520..553
e-value: 5.5E-8
score: 30.5
coord: 415..448
e-value: 1.0E-9
score: 36.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 591..620
e-value: 0.047
score: 14.0
coord: 381..410
e-value: 0.005
score: 17.0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 479..511
e-value: 2.9E-8
score: 33.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 517..563
e-value: 6.3E-14
score: 51.9
coord: 622..669
e-value: 9.8E-10
score: 38.5
coord: 412..458
e-value: 4.5E-15
score: 55.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 413..447
score: 13.076888
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 448..482
score: 11.531345
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 588..622
score: 10.764054
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 623..657
score: 10.851745
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 378..412
score: 10.632519
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 518..552
score: 12.068449
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 483..517
score: 12.989198
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 553..587
score: 10.829822
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 331..476
e-value: 2.7E-35
score: 123.6
coord: 477..549
e-value: 2.4E-19
score: 71.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 558..678
e-value: 1.9E-27
score: 97.8
NoneNo IPR availableGENE3D3.30.1370.110coord: 751..853
e-value: 4.4E-5
score: 25.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 192..231
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 203..231
NoneNo IPR availablePANTHERPTHR47447OS03G0856100 PROTEINcoord: 1..874
NoneNo IPR availablePANTHERPTHR47447:SF4BNAA07G31720D PROTEINcoord: 1..874
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 412..621
IPR036063Smr domain superfamilySUPERFAMILY160443SMR domain-likecoord: 772..852

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G02030.1CSPI01G02030.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding