CsaV3_1G007460 (gene) Cucumber (Chinese Long) v3

NameCsaV3_1G007460
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionpentatricopeptide repeat-containing protein At2g13600-like
Locationchr1 : 4741195 .. 4743844 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CTATGACATTGAGTACTTCACTCAATTACTCACAAATTCAAGTTAGTTTCCTTTCTATTGCATTGAGTTTGAGGAAGGTTGTTTATCAACTCAAAGATGATCCCTTTGGAATTAGGCTTTTGAGTATGTTATCCTTTTAGCTAGGTCATTTTCTCACTAGATTACACTCCTTAGAAATCTCAAACATCAATTCACTATTATGTTACTGAACTACGACCACACATAGTATGATTCTCTTATACCATTTGTTGGAACACAACCAACCTCCAAAAATCTTATAATATTATACGCTTTAGGCATAACAATTGTATCTTAATCATTCTAGTTAGTACTTACTAATGCGTAACTTTGTCGTATTGCTAACAATTTGGCCCTAAAATATATACTAATTCTAATTAATTAAACGGAGCTTGGGTGCCGAGGCAACTAGGATTGTTTCATAGATTCTCCTGTTCCGCCTAAATTGAGTTTGTTCCAGGATTTCAAATGTTTATATAAATTAGATGCCTCTTTGTTTTCATCTCGCTCGTCCATTATTTCTTATTTCAAAATCCACCGATTTACAAAAATCAATAGCTTTGAGAATTTCTCGCAAATCTTTCGTTTCGAAATCGGAGAACTCATCGGTGAAACTAGAAGATTTCTATGTCAGTTTCTTGCAACGGTGTGTTCTAACCTCCGATTCCCGCCATGGATCCGCAATTCATGCGAAGTTCCTCAAAGGGTTTCTTCCATTTTCTCTTTTCTTTCACAACCATGTACTTAACTTTTATGTCAAATGTGGACGTCTATCATATGGCCTGCAACTGTTCGACGAAATGCCTGAGAGAAACGTTGTGTCCTGGTCTGCGATCATTGCTGGGTTCGTCCAACATGGCCGACCCAACGAAGCCCTCTCTCTATTTGGGCGTATGCATTGCGATGGCACGATAATGCCAAACGAGTTCACTCTTGTAAGTGCTCTCCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCATACCAAATTTATGCATTTATTGTTCGCTTAGGGTATGGGTCGAATGTTTTTCTCATGAATGCGTTCTTAACTGCTCTAATTAGGCATGAGAAATTGCTAGAGGCTTTAGAAGTTTTCGAGAGTTGTTTATCCAAAGATACTGTGTCTTGGAATGCAATGATGGCTGGTTATTTGCAATTAGCATATTTTGAACTGCCGAAGTTTTGGCGACGGATGAATCTCGAGAGCGTTAAGCCTGATAATTTTACATTTGCTAGCATCTTAACTGGATTGGCTGCTCTATCTGAGTTTAGGCTGGGGTTGCAAGTTCATGGTCAGCTTGTGAAAAGTGGCTATGGCAATGACATTTGTGTAGGGAATTCCTTGTGTGATATGTACGTCAAGAATCAGAAGTTGTTAGATGGTTTTAAAGCTTTTGATGAAATGTCTTCAAGTGATGTATGCTCTTGGACCCAAATGGCTGCAGGGTGTCTCCAGTGTGGGGAACCAATGAAAGCTCTTGAGGTCATTTATGAGATGAAAAATGTCGGCGTGAGGTTAAATAAGTTCACCCTTGCAACTGCCTTGAATAGTTGTGCCAATTTGGCCTCCATTGAAGAAGGAAAGAAATTCCATGGATTGAGAATTAAACTTGGAACCGATGTTGATGTTTGTGTTGATAACGCTCTACTTGATATGTATGCAAAATGTGGATGTATGACCAGTGCAAATGTCGTATTTCGTTCGATGGATGAACGATCTGTCGTCTCGTGGACTACTATGATTATGGGATTTGCACATAATGGTCAAACAAAAGAAGCCCTTCAAATCTTTGATGAAATGAGAAAAGGGGAAGCTGAACCTAACCACATCACTTTTATTTGTGTTCTCAATGCTTGTAGCCAAGGAGGTTTCATTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCCGACCATGGGATTGCACCTTCAGAAGATCACTATGTGTGTATGGTGAATCTATTAGGCCGAGCTGGGTGTATAAAAGAAGCCGAGGATTTGATCCTACAAATGCCATTTCAACCTGGTTCATTGGTCTGGCAAACGTTGCTGGGTGCTTGCTTGGTTCATGGTGACATAGAGACAGGAAAACGAGCAGCCGAGCACGCGTTGAATTTGGATCGAAACGATCCATCGACTTACATCTTGTTATCAAACATGTTTGCTGGTGGTGATAACTGGGACAGTGTTGGAATTTTGAGAGAACTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATCAAGTTGGATGTCAAACATGAGAAGAACTATTGATTGATTTGTTCATCTGCTTTTAAAGATTTTTTTTATTAAATAAATAGTTTGATGTATGAGATCTCTTTCTCCTATTTTTATCTGTGTATCTTTTAGGAAAGTTATGTTAATGGGAGACCTTTTTTCCTATGAGCATCCCTTAGGATGAAATAGATTTTAATTAGACATGAAATGTGGTGGTCTGAATTTTTTTATCATCTAACGTTGATTTGGTAGTCATTCATTTTTTAAGAGGAGGTTAGAGAAATAATTATGTTCATTTTTTATGGGCAGTGAACAAATAAAACGAAAGTATAAGAATGAAAATGATAAAATTTATTTTTTCGTCGGACCAAATAATAAAATTATATA

mRNA sequence

ATGCCTCTTTGTTTTCATCTCGCTCGTCCATTATTTCTTATTTCAAAATCCACCGATTTACAAAAATCAATAGCTTTGAGAATTTCTCGCAAATCTTTCGTTTCGAAATCGGAGAACTCATCGGTGAAACTAGAAGATTTCTATGTCAGTTTCTTGCAACGGTGTGTTCTAACCTCCGATTCCCGCCATGGATCCGCAATTCATGCGAAGTTCCTCAAAGGGTTTCTTCCATTTTCTCTTTTCTTTCACAACCATGTACTTAACTTTTATGTCAAATGTGGACGTCTATCATATGGCCTGCAACTGTTCGACGAAATGCCTGAGAGAAACGTTGTGTCCTGGTCTGCGATCATTGCTGGGTTCGTCCAACATGGCCGACCCAACGAAGCCCTCTCTCTATTTGGGCGTATGCATTGCGATGGCACGATAATGCCAAACGAGTTCACTCTTGTAAGTGCTCTCCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCATACCAAATTTATGCATTTATTGTTCGCTTAGGGTATGGGTCGAATGTTTTTCTCATGAATGCGTTCTTAACTGCTCTAATTAGGCATGAGAAATTGCTAGAGGCTTTAGAAGTTTTCGAGAGTTGTTTATCCAAAGATACTGTGTCTTGGAATGCAATGATGGCTGGTTATTTGCAATTAGCATATTTTGAACTGCCGAAGTTTTGGCGACGGATGAATCTCGAGAGCGTTAAGCCTGATAATTTTACATTTGCTAGCATCTTAACTGGATTGGCTGCTCTATCTGAGTTTAGGCTGGGGTTGCAAGTTCATGGTCAGCTTGTGAAAAGTGGCTATGGCAATGACATTTGTGTAGGGAATTCCTTGTGTGATATGTACGTCAAGAATCAGAAGTTGTTAGATGGTTTTAAAGCTTTTGATGAAATGTCTTCAAGTGATGTATGCTCTTGGACCCAAATGGCTGCAGGGTGTCTCCAGTGTGGGGAACCAATGAAAGCTCTTGAGGTCATTTATGAGATGAAAAATGTCGGCGTGAGGTTAAATAAGTTCACCCTTGCAACTGCCTTGAATAGTTGTGCCAATTTGGCCTCCATTGAAGAAGGAAAGAAATTCCATGGATTGAGAATTAAACTTGGAACCGATGTTGATGTTTGTGTTGATAACGCTCTACTTGATATGTATGCAAAATGTGGATGTATGACCAGTGCAAATGTCGTATTTCGTTCGATGGATGAACGATCTGTCGTCTCGTGGACTACTATGATTATGGGATTTGCACATAATGGTCAAACAAAAGAAGCCCTTCAAATCTTTGATGAAATGAGAAAAGGGGAAGCTGAACCTAACCACATCACTTTTATTTGTGTTCTCAATGCTTGTAGCCAAGGAGGTTTCATTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCCGACCATGGGATTGCACCTTCAGAAGATCACTATGTGTGTATGGTGAATCTATTAGGCCGAGCTGGGTGTATAAAAGAAGCCGAGGATTTGATCCTACAAATGCCATTTCAACCTGGTTCATTGGTCTGGCAAACGTTGCTGGGTGCTTGCTTGGTTCATGGTGACATAGAGACAGGAAAACGAGCAGCCGAGCACGCGTTGAATTTGGATCGAAACGATCCATCGACTTACATCTTGTTATCAAACATGTTTGCTGGTGGTGATAACTGGGACAGTGTTGGAATTTTGAGAGAACTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATCAAGTTGGATGTCAAACATGAGAAGAACTATTGATTGA

Coding sequence (CDS)

ATGCCTCTTTGTTTTCATCTCGCTCGTCCATTATTTCTTATTTCAAAATCCACCGATTTACAAAAATCAATAGCTTTGAGAATTTCTCGCAAATCTTTCGTTTCGAAATCGGAGAACTCATCGGTGAAACTAGAAGATTTCTATGTCAGTTTCTTGCAACGGTGTGTTCTAACCTCCGATTCCCGCCATGGATCCGCAATTCATGCGAAGTTCCTCAAAGGGTTTCTTCCATTTTCTCTTTTCTTTCACAACCATGTACTTAACTTTTATGTCAAATGTGGACGTCTATCATATGGCCTGCAACTGTTCGACGAAATGCCTGAGAGAAACGTTGTGTCCTGGTCTGCGATCATTGCTGGGTTCGTCCAACATGGCCGACCCAACGAAGCCCTCTCTCTATTTGGGCGTATGCATTGCGATGGCACGATAATGCCAAACGAGTTCACTCTTGTAAGTGCTCTCCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCATACCAAATTTATGCATTTATTGTTCGCTTAGGGTATGGGTCGAATGTTTTTCTCATGAATGCGTTCTTAACTGCTCTAATTAGGCATGAGAAATTGCTAGAGGCTTTAGAAGTTTTCGAGAGTTGTTTATCCAAAGATACTGTGTCTTGGAATGCAATGATGGCTGGTTATTTGCAATTAGCATATTTTGAACTGCCGAAGTTTTGGCGACGGATGAATCTCGAGAGCGTTAAGCCTGATAATTTTACATTTGCTAGCATCTTAACTGGATTGGCTGCTCTATCTGAGTTTAGGCTGGGGTTGCAAGTTCATGGTCAGCTTGTGAAAAGTGGCTATGGCAATGACATTTGTGTAGGGAATTCCTTGTGTGATATGTACGTCAAGAATCAGAAGTTGTTAGATGGTTTTAAAGCTTTTGATGAAATGTCTTCAAGTGATGTATGCTCTTGGACCCAAATGGCTGCAGGGTGTCTCCAGTGTGGGGAACCAATGAAAGCTCTTGAGGTCATTTATGAGATGAAAAATGTCGGCGTGAGGTTAAATAAGTTCACCCTTGCAACTGCCTTGAATAGTTGTGCCAATTTGGCCTCCATTGAAGAAGGAAAGAAATTCCATGGATTGAGAATTAAACTTGGAACCGATGTTGATGTTTGTGTTGATAACGCTCTACTTGATATGTATGCAAAATGTGGATGTATGACCAGTGCAAATGTCGTATTTCGTTCGATGGATGAACGATCTGTCGTCTCGTGGACTACTATGATTATGGGATTTGCACATAATGGTCAAACAAAAGAAGCCCTTCAAATCTTTGATGAAATGAGAAAAGGGGAAGCTGAACCTAACCACATCACTTTTATTTGTGTTCTCAATGCTTGTAGCCAAGGAGGTTTCATTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCCGACCATGGGATTGCACCTTCAGAAGATCACTATGTGTGTATGGTGAATCTATTAGGCCGAGCTGGGTGTATAAAAGAAGCCGAGGATTTGATCCTACAAATGCCATTTCAACCTGGTTCATTGGTCTGGCAAACGTTGCTGGGTGCTTGCTTGGTTCATGGTGACATAGAGACAGGAAAACGAGCAGCCGAGCACGCGTTGAATTTGGATCGAAACGATCCATCGACTTACATCTTGTTATCAAACATGTTTGCTGGTGGTGATAACTGGGACAGTGTTGGAATTTTGAGAGAACTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATCAAGTTGGATGTCAAACATGAGAAGAACTATTGATTGA

Protein sequence

MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID
BLAST of CsaV3_1G007460 vs. NCBI nr
Match: XP_011650978.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g15130 [Cucumis sativus] >KGN64206.1 hypothetical protein Csa_1G043090 [Cucumis sativus])

HSP 1 Score: 1226.5 bits (3172), Expect = 0.0e+00
Identity = 599/599 (100.00%), Postives = 599/599 (100.00%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD
Sbjct: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID 600
           EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID 599

BLAST of CsaV3_1G007460 vs. NCBI nr
Match: XP_008467246.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo] >XP_016903099.1 PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo] >XP_016903114.1 PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo])

HSP 1 Score: 1206.8 bits (3121), Expect = 0.0e+00
Identity = 587/599 (98.00%), Postives = 593/599 (99.00%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPLCFHLARPL LISKSTDLQKSIALRIS KSF+SKSE+SSVKLEDFYVSFLQRCV TSD
Sbjct: 1   MPLCFHLARPLILISKSTDLQKSIALRISHKSFISKSEDSSVKLEDFYVSFLQRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLN Y+KCGRLSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYLKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMY+KNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEMSSSDVCSWTQMA+GCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMASGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID 600
           EHALNLDRNDPSTYILLSNMFAGG+NWD VG LRELMETRDVKKVPGSSWMSNMRRTID
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGNNWDGVGSLRELMETRDVKKVPGSSWMSNMRRTID 599

BLAST of CsaV3_1G007460 vs. NCBI nr
Match: XP_022954723.1 (pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata])

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 527/591 (89.17%), Postives = 554/591 (93.74%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPL  H+ RPL  +SKST  + SIALRIS KSF+SKSE S VKLEDFYV+ L RCV TSD
Sbjct: 1   MPL--HIVRPLIFVSKSTKTRHSIALRISHKSFISKSEISYVKLEDFYVNLLHRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLP+SLFFHNHVLNFYVKCG LS GLQLFDEMPERNVVSWSA+IAG
Sbjct: 61  SRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSAVIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSLTQRLICSYQIYA ++RLGYGS
Sbjct: 121 FVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           N+FLMNAFLTALIRHEKLLEALEVF S  SKD VSWNAMMAGYLQL+Y ELPKFWRRMNL
Sbjct: 181 NIFLMNAFLTALIRHEKLLEALEVFGSSSSKDIVSWNAMMAGYLQLSYLELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           E++KPDNFTFASILTGLAALSEF+LGLQVHGQLVKSGYGNDICVGNSLCDMY+KNQKLLD
Sbjct: 241 ENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEM SSDVCSWTQMAAGCL CGEPMKALEVIY+MKNVGVRLNKFTLATALN+ AN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIYDMKNVGVRLNKFTLATALNASAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLG D+DVCVDNALLDMYAKCGCM+SANVVFRSMDE+SVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGADIDVCVDNALLDMYAKCGCMSSANVVFRSMDEQSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQ KEALQIFDEMRK  AEPNHITFICVL ACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQAKEALQIFDEMRKEGAEPNHITFICVLYACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLI +MPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLIGRMPFKPGSLVWQTLLGACLVHGDVETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWM 592
           EHALNLD+NDPSTY+LLSNMFAG  NWD VG LRELMETRDVKKVPG SWM
Sbjct: 541 EHALNLDQNDPSTYVLLSNMFAGRSNWDGVGSLRELMETRDVKKVPGFSWM 589

BLAST of CsaV3_1G007460 vs. NCBI nr
Match: XP_022994939.1 (pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita maxima])

HSP 1 Score: 1086.2 bits (2808), Expect = 0.0e+00
Identity = 527/591 (89.17%), Postives = 554/591 (93.74%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPL  H+ARPL  +SKST  + SIALRIS KSF+SKSE SSVKLEDFYV+ L RCV TSD
Sbjct: 1   MPL--HIARPLIFVSKSTKTRHSIALRISHKSFISKSEISSVKLEDFYVNLLHRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLP+SLFFHNHVLNFYVKCG LS GLQLFDEMPERNVVSWSA+IAG
Sbjct: 61  SRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSAVIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSLTQRLICSYQIYA ++RLGYGS
Sbjct: 121 FVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           N+FLMNAFLTALIRHEKLLEALEVFE+  SKD VSWNAMMAGYLQL+YFELPKFWRRMNL
Sbjct: 181 NIFLMNAFLTALIRHEKLLEALEVFENSSSKDIVSWNAMMAGYLQLSYFELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           E +KPDNFTFASILTGLAALSEF+LGLQVHG LVKSGYGNDICVGNSLCDMY+KNQKLLD
Sbjct: 241 EDIKPDNFTFASILTGLAALSEFKLGLQVHGLLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEM SSDVCSWTQMAAGCL CGEPMKALEVIY+MKNVGVRLNKFTLATALN+ AN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIYDMKNVGVRLNKFTLATALNASAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLG D+DVCVDNALLDMYAKCGCM+SANVVFRSMDE+SVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGADIDVCVDNALLDMYAKCGCMSSANVVFRSMDEQSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQ KEALQIFDEMRK  AEPNHITFICVL ACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQAKEALQIFDEMRKEGAEPNHITFICVLYACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLI +M F+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLIGRMLFKPGSLVWQTLLGACLVHGDVETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWM 592
           EHALNLD+ND STY+LLSNMFAG  NWD VG LRELMETRDVKKVPG SWM
Sbjct: 541 EHALNLDQNDSSTYVLLSNMFAGRSNWDGVGSLRELMETRDVKKVPGFSWM 589

BLAST of CsaV3_1G007460 vs. NCBI nr
Match: XP_023542233.1 (pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 525/591 (88.83%), Postives = 552/591 (93.40%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPL  H+ARPL  +SKST  + SIALRIS KSF+SKSE SSVKLEDFYV  L RCV TSD
Sbjct: 1   MPL--HIARPLICVSKSTRTRLSIALRISHKSFISKSEISSVKLEDFYVDLLHRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCG LS GLQLFDEMPERNVVSWSA+IAG
Sbjct: 61  SRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSALIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSLTQRLICSYQIYA ++RLGYGS
Sbjct: 121 FVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           N+FLMNAFLTALIRHEKLL+ALEVFES  SKD VSWNAMMAGYLQL+Y ELPKFWRRMNL
Sbjct: 181 NIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLSYLELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           E++KPDNFTFASILTGLAALSEF+LGLQVHGQLVKSGYGNDICVGNSLCDMY+KNQKL D
Sbjct: 241 ENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLFD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEM SSDVCSWTQMAAGCL CGEPMKALEVIY+MKNVGVRLNKFTLATALN+ AN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIYDMKNVGVRLNKFTLATALNASAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLG D+DVCVDNALLDMYAKCGCM+SANVVFRSMDE+SVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGADIDVCVDNALLDMYAKCGCMSSANVVFRSMDEQSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQ KEALQIFDEMRK   EPNHITF+CVL ACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQAKEALQIFDEMRKEGTEPNHITFVCVLYACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLI +MPF+PG LVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLIGRMPFKPGPLVWQTLLGACLVHGDVETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWM 592
           EHALNLD+NDPSTY+LLSNMFAG  NWD VG LRELMETRDVKKVPG SWM
Sbjct: 541 EHALNLDQNDPSTYVLLSNMFAGRSNWDGVGSLRELMETRDVKKVPGFSWM 589

BLAST of CsaV3_1G007460 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 377.1 bits (967), Expect = 2.0e-104
Identity = 199/546 (36.45%), Postives = 313/546 (57.33%), Query Frame = 0

Query: 48  YVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMP 107
           ++  L   V       G  +H   LK  L   L   N ++N Y K  +  +   +FD M 
Sbjct: 318 FILMLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMS 377

Query: 108 ERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHAC-SLTQRLICS 167
           ER+++SW+++IAG  Q+G   EA+ LF ++   G + P+++T+ S L A  SL + L  S
Sbjct: 378 ERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG-LKPDQYTMTSVLKAASSLPEGLSLS 437

Query: 168 YQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQL 227
            Q++   +++   S+ F+  A + A  R+  + EA  +FE   + D V+WNAMMAGY Q 
Sbjct: 438 KQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFER-HNFDLVAWNAMMAGYTQS 497

Query: 228 -AYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVG 287
               +  K +  M+ +  + D+FT A++      L     G QVH   +KSGY  D+ V 
Sbjct: 498 HDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVS 557

Query: 288 NSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVR 347
           + + DMYVK   +     AFD +   D  +WT M +GC++ GE  +A  V  +M+ +GV 
Sbjct: 558 SGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVL 617

Query: 348 LNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVV 407
            ++FT+AT   + + L ++E+G++ H   +KL    D  V  +L+DMYAKCG +  A  +
Sbjct: 618 PDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCL 677

Query: 408 FRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFI 467
           F+ ++  ++ +W  M++G A +G+ KE LQ+F +M+    +P+ +TFI VL+ACS  G +
Sbjct: 678 FKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLV 737

Query: 468 DEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLG 527
            EA+K+  SM  D+GI P  +HY C+ + LGRAG +K+AE+LI  M  +  + +++TLL 
Sbjct: 738 SEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLA 797

Query: 528 ACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKV 587
           AC V GD ETGKR A   L L+  D S Y+LLSNM+A    WD + + R +M+   VKK 
Sbjct: 798 ACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKD 857

Query: 588 PGSSWM 592
           PG SW+
Sbjct: 858 PGFSWI 861

BLAST of CsaV3_1G007460 vs. TAIR10
Match: AT3G15130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 374.8 bits (961), Expect = 1.0e-103
Identity = 194/549 (35.34%), Postives = 308/549 (56.10%), Query Frame = 0

Query: 49  VSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPE 108
           VS L+ C     S  G  +H   LK     +L   N++++ Y KC       ++FD MPE
Sbjct: 10  VSILRVCTRKGLSDQGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPE 69

Query: 109 RNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQ 168
           RNVVSWSA+++G V +G    +LSLF  M   G I PNEFT  + L AC L   L    Q
Sbjct: 70  RNVVSWSALMSGHVLNGDLKGSLSLFSEMGRQG-IYPNEFTFSTNLKACGLLNALEKGLQ 129

Query: 169 IYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAY 228
           I+ F +++G+   V + N+ +    +  ++ EA +VF   + +  +SWNAM+AG++   Y
Sbjct: 130 IHGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGY 189

Query: 229 ----FELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGY--GNDI 288
                +     +  N++  +PD FT  S+L   ++      G Q+HG LV+SG+   +  
Sbjct: 190 GSKALDTFGMMQEANIKE-RPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSA 249

Query: 289 CVGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNV 348
            +  SL D+YVK   L    KAFD++    + SW+ +  G  Q GE ++A+ +   ++ +
Sbjct: 250 TITGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQEL 309

Query: 349 GVRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSA 408
             +++ F L++ +   A+ A + +GK+   L +KL + ++  V N+++DMY KCG +  A
Sbjct: 310 NSQIDSFALSSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEA 369

Query: 409 NVVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQG 468
              F  M  + V+SWT +I G+  +G  K++++IF EM +   EP+ + ++ VL+ACS  
Sbjct: 370 EKCFAEMQLKDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHS 429

Query: 469 GFIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQT 528
           G I E  + FS +   HGI P  +HY C+V+LLGRAG +KEA+ LI  MP +P   +WQT
Sbjct: 430 GMIKEGEELFSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQT 489

Query: 529 LLGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDV 588
           LL  C VHGDIE GK   +  L +D  +P+ Y+++SN++     W+  G  REL   + +
Sbjct: 490 LLSLCRVHGDIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGL 549

Query: 589 KKVPGSSWM 592
           KK  G SW+
Sbjct: 550 KKEAGMSWV 556

BLAST of CsaV3_1G007460 vs. TAIR10
Match: AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.1 bits (897), Expect = 2.7e-96
Identity = 187/544 (34.38%), Postives = 292/544 (53.68%), Query Frame = 0

Query: 50  SFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPER 109
           S ++ C  +SD   G  +HA+ +K      L   N ++  YV+  ++S   ++F  +P +
Sbjct: 173 SIIKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMK 232

Query: 110 NVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQI 169
           +++SWS+IIAGF Q G   EALS    M   G   PNE+   S+L ACS   R     QI
Sbjct: 233 DLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQI 292

Query: 170 YAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYF 229
           +   ++     N     +      R   L  A  VF+     DT SWN ++AG     Y 
Sbjct: 293 HGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERPDTASWNVIIAGLANNGYA 352

Query: 230 -ELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSL 289
            E    + +M      PD  +  S+L           G+Q+H  ++K G+  D+ V NSL
Sbjct: 353 DEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIHSYIIKWGFLADLTVCNSL 412

Query: 290 CDMYVKNQKLLDGFKAFDEM-SSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLN 349
             MY     L   F  F++  +++D  SW  +   CLQ  +P++ L +   M       +
Sbjct: 413 LTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQPVEMLRLFKLMLVSECEPD 472

Query: 350 KFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFR 409
             T+   L  C  ++S++ G + H   +K G   +  + N L+DMYAKCG +  A  +F 
Sbjct: 473 HITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIFD 532

Query: 410 SMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDE 469
           SMD R VVSW+T+I+G+A +G  +EAL +F EM+    EPNH+TF+ VL ACS  G ++E
Sbjct: 533 SMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNHVTFVGVLTACSHVGLVEE 592

Query: 470 AWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGAC 529
             K +++M  +HGI+P+++H  C+V+LL RAG + EAE  I +M  +P  +VW+TLL AC
Sbjct: 593 GLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFIDEMKLEPDVVVWKTLLSAC 652

Query: 530 LVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPG 589
              G++   ++AAE+ L +D  + + ++LL +M A   NW++  +LR  M+  DVKK+PG
Sbjct: 653 KTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWENAALLRSSMKKHDVKKIPG 712

Query: 590 SSWM 592
            SW+
Sbjct: 713 QSWI 716

BLAST of CsaV3_1G007460 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 340.9 bits (873), Expect = 1.6e-93
Identity = 174/543 (32.04%), Postives = 290/543 (53.41%), Query Frame = 0

Query: 50  SFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPER 109
           S +  C        G  +HA   K     +      +LN Y KC  +   L  F E    
Sbjct: 394 SLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVE 453

Query: 110 NVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQI 169
           NVV W+ ++  +        +  +F +M  +  I+PN++T  S L  C     L    QI
Sbjct: 454 NVVLWNVMLVAYGLLDDLRNSFRIFRQMQIE-EIVPNQYTYPSILKTCIRLGDLELGEQI 513

Query: 170 YAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYF 229
           ++ I++  +  N ++ +  +    +  KL  A ++      KD VSW  M+AGY Q  + 
Sbjct: 514 HSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFD 573

Query: 230 ELP-KFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSL 289
           +     +R+M    ++ D     + ++  A L   + G Q+H Q   SG+ +D+   N+L
Sbjct: 574 DKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNAL 633

Query: 290 CDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNK 349
             +Y +  K+ + + AF++  + D  +W  + +G  Q G   +AL V   M   G+  N 
Sbjct: 634 VTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNN 693

Query: 350 FTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRS 409
           FT  +A+ + +  A++++GK+ H +  K G D +  V NAL+ MYAKCG ++ A   F  
Sbjct: 694 FTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLE 753

Query: 410 MDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEA 469
           +  ++ VSW  +I  ++ +G   EAL  FD+M      PNH+T + VL+ACS  G +D+ 
Sbjct: 754 VSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKG 813

Query: 470 WKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACL 529
             YF SM++++G++P  +HYVC+V++L RAG +  A++ I +MP +P +LVW+TLL AC+
Sbjct: 814 IAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACV 873

Query: 530 VHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGS 589
           VH ++E G+ AA H L L+  D +TY+LLSN++A    WD+  + R+ M+ + VKK PG 
Sbjct: 874 VHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQ 933

Query: 590 SWM 592
           SW+
Sbjct: 934 SWI 935

BLAST of CsaV3_1G007460 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 338.6 bits (867), Expect = 8.0e-93
Identity = 184/546 (33.70%), Postives = 297/546 (54.40%), Query Frame = 0

Query: 48  YVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMP 107
           +   L+ C   ++ R G  IH   +K      LF    + N Y KC +++   ++FD MP
Sbjct: 138 FTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMP 197

Query: 108 ERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSY 167
           ER++VSW+ I+AG+ Q+G    AL +   M C+  + P+  T+VS L A S  + +    
Sbjct: 198 ERDLVSWNTIVAGYSQNGMARMALEMVKSM-CEENLKPSFITIVSVLPAVSALRLISVGK 257

Query: 168 QIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLA 227
           +I+ + +R G+ S V +  A +    +   L  A ++F+  L ++ VSWN+M+  Y+Q  
Sbjct: 258 EIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQ-- 317

Query: 228 YFELPK----FWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDIC 287
             E PK     +++M  E VKP + +    L   A L +   G  +H   V+ G   ++ 
Sbjct: 318 -NENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVS 377

Query: 288 VGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVG 347
           V NSL  MY K +++      F ++ S  + SW  M  G  Q G P+ AL    +M++  
Sbjct: 378 VVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRT 437

Query: 348 VRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSAN 407
           V+ + FT  + + + A L+     K  HG+ ++   D +V V  AL+DMYAKCG +  A 
Sbjct: 438 VKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIAR 497

Query: 408 VVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGG 467
           ++F  M ER V +W  MI G+  +G  K AL++F+EM+KG  +PN +TF+ V++ACS  G
Sbjct: 498 LIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSG 557

Query: 468 FIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTL 527
            ++   K F  M  ++ I  S DHY  MV+LLGRAG + EA D I+QMP +P   V+  +
Sbjct: 558 LVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAM 617

Query: 528 LGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVK 587
           LGAC +H ++   ++AAE    L+ +D   ++LL+N++     W+ VG +R  M  + ++
Sbjct: 618 LGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLR 677

Query: 588 KVPGSS 590
           K PG S
Sbjct: 678 KTPGCS 679

BLAST of CsaV3_1G007460 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 3.7e-103
Identity = 199/546 (36.45%), Postives = 313/546 (57.33%), Query Frame = 0

Query: 48  YVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMP 107
           ++  L   V       G  +H   LK  L   L   N ++N Y K  +  +   +FD M 
Sbjct: 318 FILMLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMS 377

Query: 108 ERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHAC-SLTQRLICS 167
           ER+++SW+++IAG  Q+G   EA+ LF ++   G + P+++T+ S L A  SL + L  S
Sbjct: 378 ERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG-LKPDQYTMTSVLKAASSLPEGLSLS 437

Query: 168 YQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQL 227
            Q++   +++   S+ F+  A + A  R+  + EA  +FE   + D V+WNAMMAGY Q 
Sbjct: 438 KQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFER-HNFDLVAWNAMMAGYTQS 497

Query: 228 -AYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVG 287
               +  K +  M+ +  + D+FT A++      L     G QVH   +KSGY  D+ V 
Sbjct: 498 HDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVS 557

Query: 288 NSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVR 347
           + + DMYVK   +     AFD +   D  +WT M +GC++ GE  +A  V  +M+ +GV 
Sbjct: 558 SGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVL 617

Query: 348 LNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVV 407
            ++FT+AT   + + L ++E+G++ H   +KL    D  V  +L+DMYAKCG +  A  +
Sbjct: 618 PDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCL 677

Query: 408 FRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFI 467
           F+ ++  ++ +W  M++G A +G+ KE LQ+F +M+    +P+ +TFI VL+ACS  G +
Sbjct: 678 FKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLV 737

Query: 468 DEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLG 527
            EA+K+  SM  D+GI P  +HY C+ + LGRAG +K+AE+LI  M  +  + +++TLL 
Sbjct: 738 SEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLA 797

Query: 528 ACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKV 587
           AC V GD ETGKR A   L L+  D S Y+LLSNM+A    WD + + R +M+   VKK 
Sbjct: 798 ACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKD 857

Query: 588 PGSSWM 592
           PG SW+
Sbjct: 858 PGFSWI 861

BLAST of CsaV3_1G007460 vs. Swiss-Prot
Match: sp|P0C898|PP232_ARATH (Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H86 PE=3 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 1.8e-102
Identity = 194/549 (35.34%), Postives = 308/549 (56.10%), Query Frame = 0

Query: 49  VSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPE 108
           VS L+ C     S  G  +H   LK     +L   N++++ Y KC       ++FD MPE
Sbjct: 10  VSILRVCTRKGLSDQGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPE 69

Query: 109 RNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQ 168
           RNVVSWSA+++G V +G    +LSLF  M   G I PNEFT  + L AC L   L    Q
Sbjct: 70  RNVVSWSALMSGHVLNGDLKGSLSLFSEMGRQG-IYPNEFTFSTNLKACGLLNALEKGLQ 129

Query: 169 IYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAY 228
           I+ F +++G+   V + N+ +    +  ++ EA +VF   + +  +SWNAM+AG++   Y
Sbjct: 130 IHGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGY 189

Query: 229 ----FELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGY--GNDI 288
                +     +  N++  +PD FT  S+L   ++      G Q+HG LV+SG+   +  
Sbjct: 190 GSKALDTFGMMQEANIKE-RPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSA 249

Query: 289 CVGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNV 348
            +  SL D+YVK   L    KAFD++    + SW+ +  G  Q GE ++A+ +   ++ +
Sbjct: 250 TITGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQEL 309

Query: 349 GVRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSA 408
             +++ F L++ +   A+ A + +GK+   L +KL + ++  V N+++DMY KCG +  A
Sbjct: 310 NSQIDSFALSSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEA 369

Query: 409 NVVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQG 468
              F  M  + V+SWT +I G+  +G  K++++IF EM +   EP+ + ++ VL+ACS  
Sbjct: 370 EKCFAEMQLKDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHS 429

Query: 469 GFIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQT 528
           G I E  + FS +   HGI P  +HY C+V+LLGRAG +KEA+ LI  MP +P   +WQT
Sbjct: 430 GMIKEGEELFSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQT 489

Query: 529 LLGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDV 588
           LL  C VHGDIE GK   +  L +D  +P+ Y+++SN++     W+  G  REL   + +
Sbjct: 490 LLSLCRVHGDIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGL 549

Query: 589 KKVPGSSWM 592
           KK  G SW+
Sbjct: 550 KKEAGMSWV 556

BLAST of CsaV3_1G007460 vs. Swiss-Prot
Match: sp|Q9LFI1|PP280_ARATH (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 4.8e-95
Identity = 187/544 (34.38%), Postives = 292/544 (53.68%), Query Frame = 0

Query: 50  SFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPER 109
           S ++ C  +SD   G  +HA+ +K      L   N ++  YV+  ++S   ++F  +P +
Sbjct: 173 SIIKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMK 232

Query: 110 NVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQI 169
           +++SWS+IIAGF Q G   EALS    M   G   PNE+   S+L ACS   R     QI
Sbjct: 233 DLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQI 292

Query: 170 YAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYF 229
           +   ++     N     +      R   L  A  VF+     DT SWN ++AG     Y 
Sbjct: 293 HGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERPDTASWNVIIAGLANNGYA 352

Query: 230 -ELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSL 289
            E    + +M      PD  +  S+L           G+Q+H  ++K G+  D+ V NSL
Sbjct: 353 DEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIHSYIIKWGFLADLTVCNSL 412

Query: 290 CDMYVKNQKLLDGFKAFDEM-SSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLN 349
             MY     L   F  F++  +++D  SW  +   CLQ  +P++ L +   M       +
Sbjct: 413 LTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQPVEMLRLFKLMLVSECEPD 472

Query: 350 KFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFR 409
             T+   L  C  ++S++ G + H   +K G   +  + N L+DMYAKCG +  A  +F 
Sbjct: 473 HITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGLIDMYAKCGSLGQARRIFD 532

Query: 410 SMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDE 469
           SMD R VVSW+T+I+G+A +G  +EAL +F EM+    EPNH+TF+ VL ACS  G ++E
Sbjct: 533 SMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNHVTFVGVLTACSHVGLVEE 592

Query: 470 AWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGAC 529
             K +++M  +HGI+P+++H  C+V+LL RAG + EAE  I +M  +P  +VW+TLL AC
Sbjct: 593 GLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFIDEMKLEPDVVVWKTLLSAC 652

Query: 530 LVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPG 589
              G++   ++AAE+ L +D  + + ++LL +M A   NW++  +LR  M+  DVKK+PG
Sbjct: 653 KTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWENAALLRSSMKKHDVKKIPG 712

Query: 590 SSWM 592
            SW+
Sbjct: 713 QSWI 716

BLAST of CsaV3_1G007460 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 340.9 bits (873), Expect = 2.9e-92
Identity = 174/543 (32.04%), Postives = 290/543 (53.41%), Query Frame = 0

Query: 50  SFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPER 109
           S +  C        G  +HA   K     +      +LN Y KC  +   L  F E    
Sbjct: 394 SLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVE 453

Query: 110 NVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQI 169
           NVV W+ ++  +        +  +F +M  +  I+PN++T  S L  C     L    QI
Sbjct: 454 NVVLWNVMLVAYGLLDDLRNSFRIFRQMQIE-EIVPNQYTYPSILKTCIRLGDLELGEQI 513

Query: 170 YAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYF 229
           ++ I++  +  N ++ +  +    +  KL  A ++      KD VSW  M+AGY Q  + 
Sbjct: 514 HSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFD 573

Query: 230 ELP-KFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSL 289
           +     +R+M    ++ D     + ++  A L   + G Q+H Q   SG+ +D+   N+L
Sbjct: 574 DKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNAL 633

Query: 290 CDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNK 349
             +Y +  K+ + + AF++  + D  +W  + +G  Q G   +AL V   M   G+  N 
Sbjct: 634 VTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNN 693

Query: 350 FTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRS 409
           FT  +A+ + +  A++++GK+ H +  K G D +  V NAL+ MYAKCG ++ A   F  
Sbjct: 694 FTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLE 753

Query: 410 MDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEA 469
           +  ++ VSW  +I  ++ +G   EAL  FD+M      PNH+T + VL+ACS  G +D+ 
Sbjct: 754 VSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKG 813

Query: 470 WKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACL 529
             YF SM++++G++P  +HYVC+V++L RAG +  A++ I +MP +P +LVW+TLL AC+
Sbjct: 814 IAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACV 873

Query: 530 VHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGS 589
           VH ++E G+ AA H L L+  D +TY+LLSN++A    WD+  + R+ M+ + VKK PG 
Sbjct: 874 VHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQ 933

Query: 590 SWM 592
           SW+
Sbjct: 934 SWI 935

BLAST of CsaV3_1G007460 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 1.4e-91
Identity = 184/546 (33.70%), Postives = 297/546 (54.40%), Query Frame = 0

Query: 48  YVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMP 107
           +   L+ C   ++ R G  IH   +K      LF    + N Y KC +++   ++FD MP
Sbjct: 138 FTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMP 197

Query: 108 ERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSY 167
           ER++VSW+ I+AG+ Q+G    AL +   M C+  + P+  T+VS L A S  + +    
Sbjct: 198 ERDLVSWNTIVAGYSQNGMARMALEMVKSM-CEENLKPSFITIVSVLPAVSALRLISVGK 257

Query: 168 QIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLA 227
           +I+ + +R G+ S V +  A +    +   L  A ++F+  L ++ VSWN+M+  Y+Q  
Sbjct: 258 EIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQ-- 317

Query: 228 YFELPK----FWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDIC 287
             E PK     +++M  E VKP + +    L   A L +   G  +H   V+ G   ++ 
Sbjct: 318 -NENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVS 377

Query: 288 VGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVG 347
           V NSL  MY K +++      F ++ S  + SW  M  G  Q G P+ AL    +M++  
Sbjct: 378 VVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRT 437

Query: 348 VRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSAN 407
           V+ + FT  + + + A L+     K  HG+ ++   D +V V  AL+DMYAKCG +  A 
Sbjct: 438 VKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIAR 497

Query: 408 VVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGG 467
           ++F  M ER V +W  MI G+  +G  K AL++F+EM+KG  +PN +TF+ V++ACS  G
Sbjct: 498 LIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSG 557

Query: 468 FIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTL 527
            ++   K F  M  ++ I  S DHY  MV+LLGRAG + EA D I+QMP +P   V+  +
Sbjct: 558 LVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAM 617

Query: 528 LGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVK 587
           LGAC +H ++   ++AAE    L+ +D   ++LL+N++     W+ VG +R  M  + ++
Sbjct: 618 LGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLR 677

Query: 588 KVPGSS 590
           K PG S
Sbjct: 678 KTPGCS 679

BLAST of CsaV3_1G007460 vs. TrEMBL
Match: tr|A0A0A0LVZ1|A0A0A0LVZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043090 PE=4 SV=1)

HSP 1 Score: 1226.5 bits (3172), Expect = 0.0e+00
Identity = 599/599 (100.00%), Postives = 599/599 (100.00%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD
Sbjct: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID 600
           EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID 599

BLAST of CsaV3_1G007460 vs. TrEMBL
Match: tr|A0A1S4E4G2|A0A1S4E4G2_CUCME (pentatricopeptide repeat-containing protein At2g13600-like OS=Cucumis melo OX=3656 GN=LOC103504641 PE=4 SV=1)

HSP 1 Score: 1206.8 bits (3121), Expect = 0.0e+00
Identity = 587/599 (98.00%), Postives = 593/599 (99.00%), Query Frame = 0

Query: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60
           MPLCFHLARPL LISKSTDLQKSIALRIS KSF+SKSE+SSVKLEDFYVSFLQRCV TSD
Sbjct: 1   MPLCFHLARPLILISKSTDLQKSIALRISHKSFISKSEDSSVKLEDFYVSFLQRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLN Y+KCGRLSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYLKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240
           NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300
           ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMY+KNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360
           GFKAFDEMSSSDVCSWTQMA+GCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMASGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420
           LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480
           IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540
           IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWMSNMRRTID 600
           EHALNLDRNDPSTYILLSNMFAGG+NWD VG LRELMETRDVKKVPGSSWMSNMRRTID
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGNNWDGVGSLRELMETRDVKKVPGSSWMSNMRRTID 599

BLAST of CsaV3_1G007460 vs. TrEMBL
Match: tr|A0A2I4F2Q0|A0A2I4F2Q0_9ROSI (pentatricopeptide repeat-containing protein At2g13600-like OS=Juglans regia OX=51240 GN=LOC108994941 PE=4 SV=1)

HSP 1 Score: 837.4 bits (2162), Expect = 2.0e-239
Identity = 397/566 (70.14%), Postives = 471/566 (83.22%), Query Frame = 0

Query: 26  LRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNH 85
           +R  R+ F S S  +S + E+ Y + L+R    SDS HG  IHAKF+KG LPFSLF  NH
Sbjct: 17  VRCCRRFFASDSFENSAE-EEIYSTLLRRYAEKSDSYHGRGIHAKFVKGSLPFSLFLRNH 76

Query: 86  VLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMP 145
           +LN YVKCG L  GL LF+EMPE+NVVSWSA+IAG VQHG PNEALSLF RMH DGT  P
Sbjct: 77  LLNMYVKCGDLVSGLLLFEEMPEKNVVSWSAVIAGLVQHGCPNEALSLFCRMHRDGTTKP 136

Query: 146 NEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVF 205
           NEFTLVS LHACSL + L  +YQ+Y+F+VRLG+ SN+FL+NAFLTALIRH +L  ALEVF
Sbjct: 137 NEFTLVSTLHACSLVENLTQAYQLYSFVVRLGFESNIFLLNAFLTALIRHGELAGALEVF 196

Query: 206 ESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRL 265
           E+C +KD VSWNAMMAG+LQ ++ ++PKFW RM  E VKPD+FTF+S+LTGLAALS+F+L
Sbjct: 197 ENCRTKDIVSWNAMMAGFLQFSFADVPKFWYRMTCEGVKPDSFTFSSVLTGLAALSDFKL 256

Query: 266 GLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQ 325
           GLQVH QLV+ GYG +ICVGNSL DMY+KNQKL++GF+AFD M   DVCSWTQMAAGCL 
Sbjct: 257 GLQVHAQLVRYGYGVEICVGNSLADMYIKNQKLVEGFRAFDGMPVRDVCSWTQMAAGCLL 316

Query: 326 CGEPMKALEVIYEMKNVGVRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCV 385
           CGEP KALE++ EMK +GV+ NKFTLATALN+CANLAS+EEGKK HGLRIKLGTD+DVCV
Sbjct: 317 CGEPSKALEIVAEMKKMGVKPNKFTLATALNACANLASLEEGKKVHGLRIKLGTDIDVCV 376

Query: 386 DNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEA 445
           DNALLDMYAKCGC+ +A  VF+SM++ S+VSWTTMIM  A NGQ ++AL+I+DEMR  + 
Sbjct: 377 DNALLDMYAKCGCVDNAWTVFQSMNDHSIVSWTTMIMACAINGQARKALKIYDEMRLKDI 436

Query: 446 EPNHITFICVLNACSQGGFIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAE 505
           +PN+ITFICVL AC QGGFIDE W+ FSSM+ D+GI P EDHY CMVNLLGRAGC+KEAE
Sbjct: 437 QPNYITFICVLYACGQGGFIDEGWELFSSMTRDYGILPVEDHYACMVNLLGRAGCVKEAE 496

Query: 506 DLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGD 565
           + IL+MPFQPG LVW+TLLGAC V+GD+E GKRAAEH LNLD  DPSTY++LSN+FAG  
Sbjct: 497 EFILRMPFQPGVLVWKTLLGACHVYGDMEIGKRAAEHVLNLDGKDPSTYVVLSNLFAGLS 556

Query: 566 NWDSVGILRELMETRDVKKVPGSSWM 592
           NWD VG+LRELME RDVKK+P SSW+
Sbjct: 557 NWDGVGMLRELMEARDVKKMPASSWI 581

BLAST of CsaV3_1G007460 vs. TrEMBL
Match: tr|A0A1U8AYJ6|A0A1U8AYJ6_NELNU (pentatricopeptide repeat-containing protein At4g33170-like isoform X1 OS=Nelumbo nucifera OX=4432 GN=LOC104607798 PE=4 SV=1)

HSP 1 Score: 785.4 bits (2027), Expect = 9.0e-224
Identity = 379/547 (69.29%), Postives = 444/547 (81.17%), Query Frame = 0

Query: 45  EDFYVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFD 104
           E+ Y + LQRC   S   HG AIHAKF+K  L  SLF HN++LN YV+C  L+  + LF+
Sbjct: 34  EEKYFNLLQRCGEISTLGHGKAIHAKFIKESLISSLFLHNNLLNMYVRCDDLAGAVNLFE 93

Query: 105 EMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLI 164
           EMPERNVVSWSAIIAG VQHG  +EALSLF RM   G + PNEFTLVS L+ACSL++   
Sbjct: 94  EMPERNVVSWSAIIAGLVQHGCVHEALSLFSRMQQTG-VSPNEFTLVSTLNACSLSENPS 153

Query: 165 CSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYL 224
            +YQIYA+IV+ G+ SNVFL+NAFLTALIRH KL+EA E F+ C  +D VSWNAM+AGYL
Sbjct: 154 HAYQIYAWIVKFGFESNVFLINAFLTALIRHGKLVEAEEFFDKCPCRDIVSWNAMIAGYL 213

Query: 225 QLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICV 284
           Q +Y ++P FW RMN E VKPDNFTFA++LTGLA + + ++GLQVH QL+KSG+G++ICV
Sbjct: 214 QFSYIDVPGFWYRMNQEGVKPDNFTFATVLTGLATICDLKMGLQVHAQLIKSGHGDEICV 273

Query: 285 GNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGV 344
           GNSL DMY+KN+ L++G KAF EM   DV SWTQMA+GCLQCG+P KAL VI EMK VGV
Sbjct: 274 GNSLVDMYLKNRNLIEGSKAFGEMPQKDVISWTQMASGCLQCGQPRKALTVIDEMKKVGV 333

Query: 345 RLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANV 404
           + NKFTLATA NSCA+LAS+EEGKK HGLRIKLGT++D+CVDNALLDMYAKCGCM  A  
Sbjct: 334 KPNKFTLATAFNSCASLASLEEGKKVHGLRIKLGTEIDICVDNALLDMYAKCGCMDGAWG 393

Query: 405 VFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGF 464
           VF+SMD RSVVSWTTMIMGFA NG  +EAL IF+ MR    EPN+ITFICVL ACSQGGF
Sbjct: 394 VFQSMDTRSVVSWTTMIMGFAQNGYAREALDIFERMRFAGVEPNYITFICVLYACSQGGF 453

Query: 465 IDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLL 524
           +DE WKYFSSM+ DHGIAP EDHY CMV++LGRAG I+EAE LIL MPFQPG LVWQTLL
Sbjct: 454 LDEGWKYFSSMTHDHGIAPGEDHYACMVDILGRAGQIREAEALILNMPFQPGVLVWQTLL 513

Query: 525 GACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKK 584
           GAC  HGD++TGKRAA  AL L++ DPSTY+LLSNMFA   NWD+VG LRELMETR+VKK
Sbjct: 514 GACRAHGDLDTGKRAAAQALALEKEDPSTYLLLSNMFADFGNWDNVGKLRELMETREVKK 573

Query: 585 VPGSSWM 592
           VPG SW+
Sbjct: 574 VPGCSWI 579

BLAST of CsaV3_1G007460 vs. TrEMBL
Match: tr|A0A1R3FUM0|A0A1R3FUM0_COCAP (Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_30939 PE=4 SV=1)

HSP 1 Score: 778.1 bits (2008), Expect = 1.4e-221
Identity = 373/547 (68.19%), Postives = 445/547 (81.35%), Query Frame = 0

Query: 45  EDFYVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFD 104
           ED    FL  C  TS+  HG AIHAKF+KG +P SL+  NH+LNFY+KCG +  G ++FD
Sbjct: 36  EDLCSKFLASCTRTSNLVHGKAIHAKFIKGSIPNSLYLENHILNFYLKCGDIINGHKVFD 95

Query: 105 EMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLI 164
           EMP+RNVVSWSA+++GF QHG   EALSLF  M  DGT  PNEFT VS L ACSL + L 
Sbjct: 96  EMPQRNVVSWSAMVSGFAQHGFFIEALSLFIYMLRDGTSKPNEFTFVSVLQACSLHESLC 155

Query: 165 CSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYL 224
            +YQ+YA I+RLG+GSNVFL+NAFLTAL+RH K  EALEVF+ CL+KD V+WN M++GY 
Sbjct: 156 LAYQVYAVILRLGFGSNVFLVNAFLTALMRHGKKEEALEVFKECLNKDVVTWNVMLSGYS 215

Query: 225 QLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICV 284
           + +  +LPK W +MN E VKPD FTFAS+LTGLA+L +  +GLQVHGQ+VKSG+G +ICV
Sbjct: 216 ESSCLDLPKLWVQMNNEGVKPDCFTFASVLTGLASLGDLNMGLQVHGQIVKSGHGGEICV 275

Query: 285 GNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGV 344
           GNSL DMY+KNQ+LLDG K F+EM + DVCSWTQMAAG L+ G+P KALEV+ EM+ +GV
Sbjct: 276 GNSLVDMYIKNQRLLDGLKVFNEMGTKDVCSWTQMAAGWLEYGKPEKALEVVAEMRMMGV 335

Query: 345 RLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANV 404
             NKFTLATA N+CANLAS+EEGKK HGLRIKLG ++DVCVDN+L+DMYAKCG M +A  
Sbjct: 336 NPNKFTLATAFNACANLASLEEGKKVHGLRIKLGVEIDVCVDNSLIDMYAKCGSMDAAWG 395

Query: 405 VFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGF 464
           VF+ MD+ SVVSWTTMIMG A NGQ +EA++IFDEM     +PN+ITF+CVL ACSQG F
Sbjct: 396 VFKVMDDHSVVSWTTMIMGCAQNGQAREAVKIFDEMIAKGIKPNYITFVCVLYACSQGMF 455

Query: 465 IDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLL 524
           IDEAWKYFSSM+ DHGI+P EDHYV MV+LLG+AG IKEAE+LIL MPFQPG+ VWQTLL
Sbjct: 456 IDEAWKYFSSMTIDHGISPGEDHYVYMVHLLGQAGHIKEAEELILSMPFQPGASVWQTLL 515

Query: 525 GACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKK 584
            AC VHGDIETGKRAAE A+NLDR DPS+Y+LLSNM AG ++WD V  LRELMETRDVKK
Sbjct: 516 NACQVHGDIETGKRAAERAINLDRKDPSSYVLLSNMLAGFNSWDDVRKLRELMETRDVKK 575

Query: 585 VPGSSWM 592
           VPGSSW+
Sbjct: 576 VPGSSWI 582

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011650978.10.0e+00100.00PREDICTED: putative pentatricopeptide repeat-containing protein At3g15130 [Cucum... [more]
XP_008467246.10.0e+0098.00PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis m... [more]
XP_022954723.10.0e+0089.17pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata][more]
XP_022994939.10.0e+0089.17pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita maxima][more]
XP_023542233.10.0e+0088.83pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp... [more]
Match NameE-valueIdentityDescription
AT4G33170.12.0e-10436.45Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G15130.11.0e-10335.34Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G53360.12.7e-9634.38Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G13650.11.6e-9332.04Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G11290.18.0e-9333.70Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9SMZ2|PP347_ARATH3.7e-10336.45Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
sp|P0C898|PP232_ARATH1.8e-10235.34Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis th... [more]
sp|Q9LFI1|PP280_ARATH4.8e-9534.38Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop... [more]
sp|Q9SVP7|PP307_ARATH2.9e-9232.04Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... [more]
sp|Q3E6Q1|PPR32_ARATH1.4e-9133.70Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LVZ1|A0A0A0LVZ1_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043090 PE=4 SV=1[more]
tr|A0A1S4E4G2|A0A1S4E4G2_CUCME0.0e+0098.00pentatricopeptide repeat-containing protein At2g13600-like OS=Cucumis melo OX=36... [more]
tr|A0A2I4F2Q0|A0A2I4F2Q0_9ROSI2.0e-23970.14pentatricopeptide repeat-containing protein At2g13600-like OS=Juglans regia OX=5... [more]
tr|A0A1U8AYJ6|A0A1U8AYJ6_NELNU9.0e-22469.29pentatricopeptide repeat-containing protein At4g33170-like isoform X1 OS=Nelumbo... [more]
tr|A0A1R3FUM0|A0A1R3FUM0_COCAP1.4e-22168.19Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_30939 PE=4 ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008568 microtubule-severing ATPase activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G007460.1CsaV3_1G007460.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 11..167
e-value: 1.1E-18
score: 69.8
coord: 464..597
e-value: 8.7E-13
score: 50.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 168..260
e-value: 9.6E-12
score: 46.6
coord: 366..463
e-value: 9.9E-22
score: 79.1
coord: 261..365
e-value: 5.2E-16
score: 60.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 450..484
e-value: 7.0E-5
score: 20.7
coord: 415..448
e-value: 1.3E-7
score: 29.3
coord: 112..142
e-value: 2.5E-6
score: 25.3
coord: 83..112
e-value: 9.8E-4
score: 17.2
coord: 315..347
e-value: 0.0032
score: 15.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 109..157
e-value: 2.6E-8
score: 33.8
coord: 413..460
e-value: 5.8E-13
score: 48.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 315..344
e-value: 0.0013
score: 18.8
coord: 488..511
e-value: 0.048
score: 13.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 181..215
score: 7.355
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 382..412
score: 7.169
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 347..381
score: 5.261
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 448..483
score: 8.802
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 413..447
score: 11.926
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 484..514
score: 6.577
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 79..109
score: 8.035
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 281..311
score: 6.358
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 110..144
score: 10.501
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 550..584
score: 5.941
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 246..280
score: 7.235
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 312..346
score: 9.449
NoneNo IPR availablePANTHERPTHR24015:SF374SUBFAMILY NOT NAMEDcoord: 23..592
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 23..592