CsaV3_6G017310 (gene) Cucumber (Chinese Long) v3

NameCsaV3_6G017310
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr6 : 12838228 .. 12840229 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAAATACAAAAAGGATTTTTATTGACTAATGTTGAAACGAAATCTCTATTTTGCCCTCACCAGGCCTTATCTTTCCCGCGAAAACGAACGGCCATTTCTTCAAACCATTGAAAACCTCAGTCGCCATCTTCGAAATTGCAACGATTTGATTTCTTCAATCTCGACTCACTCCATAGCCTTAAAGCTTGGATTCTTAAACAACACTGTCAATGTCAACCATCTCATCAATTGCTATGTCCGATTCCGCAGCATTGCAACCGCACACCAACTGTTCGATGAAATGCCCAACCCAAATGTTGTGTCGTGGACCTCACTCATGGCTGGTTACGTCGACAACGGCCAACCGAGTACCGCTCTTTTTCTTTTTGGGGAAATGTTGAGAAGTCCGGTTGTTCCCAATGACTTCACTTTTGCTACTGCCATTAAGGCCTGTTCGATACTTTCGAATTTAAGACATGGTGAAATGTTTCATGCCCATGTTGAGATTTTTGGTTATGGAGGTAATATTGTGGTTTGTTCTTCTCTTATTGATATGTATGGGAAGTGTAATGATGTTGTTAAAGCTAGGGGCGTCTTTAATTCCATGTCTTGTAAGAATATAGTTTCTTGGACTTCAATGATTGCTGCTTATGCTCAGAATGCTCATGGCGATGAAGCATTAAAAGTATTTAGGGAATTCACTAGTTTAAGTTCTGAACATCCAAATCCTTACATGTTAGCTAGTGTGATTAGTGCTTGTGCAAGCTTGGGTAGGCTGGTTTCGGGAAAAGTCATGCATGGTGCAGCTATTTCTCTTGGCTGTGATTCAAGCGAGGTAGTTGCGAGTGTGTTGGTTGATATGTATGCTAAATGTGGGAGTCTTGATTATTCTGATAAGGTTTTTAACAGAATCTCAAACCCTTCTGTTATTCCTTATACTTCAATGATTGTTAGCACAGCAAAGTATGGATTTGGGAGAAAGTCTCTTCAACTCTTTGAAGAAATGGTTAGAAAAGGACTGAAACCTAACCATATCACTCTTGTTGGAGTCTTGCATGCTTGTAGCCATTCAGGGCTCCCTAATGAGGGCCTTTATTATTTGACATCCATGTATGAGAAATATGGGATAATGCCTGAGACTAAGCACTATACATGTGTTGTCGATATGCTAGCTCGAGTTGGAGAGCTAGATAAAGCCTTTGACCTAGCGAAATCGATGGATGTAGCACCTGATGACAAGGCATTGTTGTGGGGAGCATTGCTTTCAGCTAGTAGGTGTCATGGCAGGGTAGACATTGCAGCTGAAGCTTGTCAGCAACTTGTGAATTCCAATAGACAAGTTGCTGGTGCATATGTTACGTTGTCAAATGTTTACGCTTCGGCTGGAGATATGGAGAAGGCTCACAAACTCCGAGTTGAGATGAAACGTACTGGGGTTCACAAAGAACCAGGATGCAGTTGGATCGAGATAAAAGATTCAAGTTATATATTCTACGCCGGGGAAATAACTTCGTGCCCAAGAGGCGATGAAGTGTTGTGTTTGCTGAGAGAGTTGGACCAGAAAATGAAGGATAGAGGTTATGTAAGAGGAAGGAAAGGGTTGGTGTTTGTTGATATAGAAGAAGAGGCAGAGGAGGAAAAAGTTTGGTTGCACAGTGAGAGATTGGCATTGGGATTTGGTTTGATTAGCATTCCAAAAGGACTTACTATAAGAATAATGAAGAACTTGAGAATGTGCAGTGATTGTCATGAGGCTTTCAAGCTTATAAGTGAGATTATGGAGAGAGAATTTGTGGTTAGAGACATTAATAGATTTCATCATTTCAAAAATGGTTGTTGCACTTGCAATGGTTTCTGGTAATTTGTTAAGAACACAGTGAATTATTGAATCCCAAACCCAATTTTGTTTTGTTTTTTAACAAGAAAAACTTGTAAATCACTGTAAATACTTGCTGGAGGATTGGTTACATTATAATGTTAGTCTTCCAACTAGCTTTTATAGATTTAAACCTAGTAGGCTAC

mRNA sequence

ATGTTGAAACGAAATCTCTATTTTGCCCTCACCAGGCCTTATCTTTCCCGCGAAAACGAACGGCCATTTCTTCAAACCATTGAAAACCTCAGTCGCCATCTTCGAAATTGCAACGATTTGATTTCTTCAATCTCGACTCACTCCATAGCCTTAAAGCTTGGATTCTTAAACAACACTGTCAATGTCAACCATCTCATCAATTGCTATGTCCGATTCCGCAGCATTGCAACCGCACACCAACTGTTCGATGAAATGCCCAACCCAAATGTTGTGTCGTGGACCTCACTCATGGCTGGTTACGTCGACAACGGCCAACCGAGTACCGCTCTTTTTCTTTTTGGGGAAATGTTGAGAAGTCCGGTTGTTCCCAATGACTTCACTTTTGCTACTGCCATTAAGGCCTGTTCGATACTTTCGAATTTAAGACATGGTGAAATGTTTCATGCCCATGTTGAGATTTTTGGTTATGGAGGTAATATTGTGGTTTGTTCTTCTCTTATTGATATGTATGGGAAGTGTAATGATGTTGTTAAAGCTAGGGGCGTCTTTAATTCCATGTCTTGTAAGAATATAGTTTCTTGGACTTCAATGATTGCTGCTTATGCTCAGAATGCTCATGGCGATGAAGCATTAAAAGTATTTAGGGAATTCACTAGTTTAAGTTCTGAACATCCAAATCCTTACATGTTAGCTAGTGTGATTAGTGCTTGTGCAAGCTTGGGTAGGCTGGTTTCGGGAAAAGTCATGCATGGTGCAGCTATTTCTCTTGGCTGTGATTCAAGCGAGGTAGTTGCGAGTGTGTTGGTTGATATGTATGCTAAATGTGGGAGTCTTGATTATTCTGATAAGGTTTTTAACAGAATCTCAAACCCTTCTGTTATTCCTTATACTTCAATGATTGTTAGCACAGCAAAGTATGGATTTGGGAGAAAGTCTCTTCAACTCTTTGAAGAAATGGTTAGAAAAGGACTGAAACCTAACCATATCACTCTTGTTGGAGTCTTGCATGCTTGTAGCCATTCAGGGCTCCCTAATGAGGGCCTTTATTATTTGACATCCATGTATGAGAAATATGGGATAATGCCTGAGACTAAGCACTATACATGTGTTGTCGATATGCTAGCTCGAGTTGGAGAGCTAGATAAAGCCTTTGACCTAGCGAAATCGATGGATGTAGCACCTGATGACAAGGCATTGTTGTGGGGAGCATTGCTTTCAGCTAGTAGGTGTCATGGCAGGGTAGACATTGCAGCTGAAGCTTGTCAGCAACTTGTGAATTCCAATAGACAAGTTGCTGGTGCATATGTTACGTTGTCAAATGTTTACGCTTCGGCTGGAGATATGGAGAAGGCTCACAAACTCCGAGTTGAGATGAAACGTACTGGGGTTCACAAAGAACCAGGATGCAGTTGGATCGAGATAAAAGATTCAAGTTATATATTCTACGCCGGGGAAATAACTTCGTGCCCAAGAGGCGATGAAGTGTTGTGTTTGCTGAGAGAGTTGGACCAGAAAATGAAGGATAGAGGTTATGTAAGAGGAAGGAAAGGGTTGGTGTTTGTTGATATAGAAGAAGAGGCAGAGGAGGAAAAAGTTTGGTTGCACAGTGAGAGATTGGCATTGGGATTTGGTTTGATTAGCATTCCAAAAGGACTTACTATAAGAATAATGAAGAACTTGAGAATGTGCAGTGATTGTCATGAGGCTTTCAAGCTTATAAGTGAGATTATGGAGAGAGAATTTGTGGTTAGAGACATTAATAGATTTCATCATTTCAAAAATGGTTGTTGCACTTGCAATGGTTTCTGGTAA

Coding sequence (CDS)

ATGTTGAAACGAAATCTCTATTTTGCCCTCACCAGGCCTTATCTTTCCCGCGAAAACGAACGGCCATTTCTTCAAACCATTGAAAACCTCAGTCGCCATCTTCGAAATTGCAACGATTTGATTTCTTCAATCTCGACTCACTCCATAGCCTTAAAGCTTGGATTCTTAAACAACACTGTCAATGTCAACCATCTCATCAATTGCTATGTCCGATTCCGCAGCATTGCAACCGCACACCAACTGTTCGATGAAATGCCCAACCCAAATGTTGTGTCGTGGACCTCACTCATGGCTGGTTACGTCGACAACGGCCAACCGAGTACCGCTCTTTTTCTTTTTGGGGAAATGTTGAGAAGTCCGGTTGTTCCCAATGACTTCACTTTTGCTACTGCCATTAAGGCCTGTTCGATACTTTCGAATTTAAGACATGGTGAAATGTTTCATGCCCATGTTGAGATTTTTGGTTATGGAGGTAATATTGTGGTTTGTTCTTCTCTTATTGATATGTATGGGAAGTGTAATGATGTTGTTAAAGCTAGGGGCGTCTTTAATTCCATGTCTTGTAAGAATATAGTTTCTTGGACTTCAATGATTGCTGCTTATGCTCAGAATGCTCATGGCGATGAAGCATTAAAAGTATTTAGGGAATTCACTAGTTTAAGTTCTGAACATCCAAATCCTTACATGTTAGCTAGTGTGATTAGTGCTTGTGCAAGCTTGGGTAGGCTGGTTTCGGGAAAAGTCATGCATGGTGCAGCTATTTCTCTTGGCTGTGATTCAAGCGAGGTAGTTGCGAGTGTGTTGGTTGATATGTATGCTAAATGTGGGAGTCTTGATTATTCTGATAAGGTTTTTAACAGAATCTCAAACCCTTCTGTTATTCCTTATACTTCAATGATTGTTAGCACAGCAAAGTATGGATTTGGGAGAAAGTCTCTTCAACTCTTTGAAGAAATGGTTAGAAAAGGACTGAAACCTAACCATATCACTCTTGTTGGAGTCTTGCATGCTTGTAGCCATTCAGGGCTCCCTAATGAGGGCCTTTATTATTTGACATCCATGTATGAGAAATATGGGATAATGCCTGAGACTAAGCACTATACATGTGTTGTCGATATGCTAGCTCGAGTTGGAGAGCTAGATAAAGCCTTTGACCTAGCGAAATCGATGGATGTAGCACCTGATGACAAGGCATTGTTGTGGGGAGCATTGCTTTCAGCTAGTAGGTGTCATGGCAGGGTAGACATTGCAGCTGAAGCTTGTCAGCAACTTGTGAATTCCAATAGACAAGTTGCTGGTGCATATGTTACGTTGTCAAATGTTTACGCTTCGGCTGGAGATATGGAGAAGGCTCACAAACTCCGAGTTGAGATGAAACGTACTGGGGTTCACAAAGAACCAGGATGCAGTTGGATCGAGATAAAAGATTCAAGTTATATATTCTACGCCGGGGAAATAACTTCGTGCCCAAGAGGCGATGAAGTGTTGTGTTTGCTGAGAGAGTTGGACCAGAAAATGAAGGATAGAGGTTATGTAAGAGGAAGGAAAGGGTTGGTGTTTGTTGATATAGAAGAAGAGGCAGAGGAGGAAAAAGTTTGGTTGCACAGTGAGAGATTGGCATTGGGATTTGGTTTGATTAGCATTCCAAAAGGACTTACTATAAGAATAATGAAGAACTTGAGAATGTGCAGTGATTGTCATGAGGCTTTCAAGCTTATAAGTGAGATTATGGAGAGAGAATTTGTGGTTAGAGACATTAATAGATTTCATCATTTCAAAAATGGTTGTTGCACTTGCAATGGTTTCTGGTAA

Protein sequence

MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVNVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLGRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW
BLAST of CsaV3_6G017310 vs. NCBI nr
Match: XP_004143199.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g15720 [Cucumis sativus] >KGN47194.1 hypothetical protein Csa_6G197240 [Cucumis sativus])

HSP 1 Score: 1227.6 bits (3175), Expect = 0.0e+00
Identity = 603/603 (100.00%), Postives = 603/603 (100.00%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV
Sbjct: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
           NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP
Sbjct: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
           GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL
Sbjct: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI
Sbjct: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420
           MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA
Sbjct: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480
           CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600
           LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600

Query: 601 GFW 604
           GFW
Sbjct: 601 GFW 603

BLAST of CsaV3_6G017310 vs. NCBI nr
Match: XP_008458473.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g15720 [Cucumis melo])

HSP 1 Score: 1184.5 bits (3063), Expect = 0.0e+00
Identity = 582/603 (96.52%), Postives = 590/603 (97.84%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLKRNL+FALTRPYLSRENERPFLQTIENLS HLRNCNDLISSI THSIALKLGFLNNTV
Sbjct: 1   MLKRNLHFALTRPYLSRENERPFLQTIENLSLHLRNCNDLISSIFTHSIALKLGFLNNTV 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
            VNHLINCYVRFRSIATAH+LFDEMPNPNVVSWTSLMAGYVDNGQPSTAL LFGEMLRSP
Sbjct: 61  TVNHLINCYVRFRSIATAHRLFDEMPNPNVVSWTSLMAGYVDNGQPSTALLLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR+GE FHAHVEIFGYG NIVVCSSLIDMYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRYGERFHAHVEIFGYGCNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
            VFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSE PNPYMLASVISACASL
Sbjct: 181 SVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSERPNPYMLASVISACASL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLVSGKV+HGAAI LGCDSSEVVASVLVDMYAKCGSLDYSD VFNRISNPSVIPYTSMI
Sbjct: 241 GRLVSGKVVHGAAICLGCDSSEVVASVLVDMYAKCGSLDYSDNVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHITL+GVLHACSHSGLPNEGLYYLTSMYEKYGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLLGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420
           MPETKHYTCVVDMLAR GELDKAFD+AKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA
Sbjct: 361 MPETKHYTCVVDMLARSGELDKAFDIAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480
           CQQLVNSNRQVAGAYVTLSN YASAGDMEKAHKLRVEMK TGVHKEPGCSWIEIKDSSYI
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHKLRVEMKHTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600
           LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHF++GCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFQSGCCTCN 600

Query: 601 GFW 604
           GFW
Sbjct: 601 GFW 603

BLAST of CsaV3_6G017310 vs. NCBI nr
Match: XP_022138437.1 (pentatricopeptide repeat-containing protein At4g15720 [Momordica charantia])

HSP 1 Score: 1008.1 bits (2605), Expect = 1.3e-290
Identity = 488/603 (80.93%), Postives = 544/603 (90.22%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLK N++FALTRP LSRENER   QTI N SRHLRNCNDLIS+ S H IALKLGFL  T+
Sbjct: 1   MLKPNIHFALTRPRLSRENERLSFQTIANFSRHLRNCNDLISATSAHPIALKLGFLTQTL 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
            VNHLINCYVR R ++ AH LFDEMP PNVVSWTSLMAGY+D  QPSTAL LFGEM RSP
Sbjct: 61  PVNHLINCYVRLRRVSIAHHLFDEMPTPNVVSWTSLMAGYIDACQPSTALSLFGEMSRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR G+ FHA+VEI GYGGN+VVCSSLIDMYGKCNDVV+AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLREGKKFHAYVEISGYGGNVVVCSSLIDMYGKCNDVVRAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
            VF+SM+CKNIVSWTSMIA YAQNAHGD+ALK+FREF+SL+ EHPN +MLAS ISACASL
Sbjct: 181 RVFDSMACKNIVSWTSMIATYAQNAHGDDALKLFREFSSLNYEHPNHFMLASAISACASL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLV G+V+HGA I LG DS++V++SVLVDMYAKCGSL+ SDKVF+RI NPSVIPYTSMI
Sbjct: 241 GRLVLGRVVHGAVIRLGHDSNDVISSVLVDMYAKCGSLNCSDKVFSRILNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VS AKYG GR+SLQLFEEMVRKGLKPNH+T VGVL+ACSHSGL +EGL YLTSMYE++GI
Sbjct: 301 VSRAKYGLGRQSLQLFEEMVRKGLKPNHVTFVGVLYACSHSGLLDEGLSYLTSMYERHGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420
           MPE+KHYTCVVDMLARVG+LD+A+ LAKSM+V PDD+ALLWGALLSASR HGRVDIA EA
Sbjct: 361 MPESKHYTCVVDMLARVGQLDRAYQLAKSMEVEPDDEALLWGALLSASRYHGRVDIAVEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480
           CQ+LVNSN+QVAGAYVTLSN YASAGDMEKAH+LRVEMK TG++KEPGCSW+EIK+ +Y+
Sbjct: 421 CQRLVNSNQQVAGAYVTLSNAYASAGDMEKAHRLRVEMKHTGINKEPGCSWLEIKNLTYV 480

Query: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAG++ SCP   EVLCLLREL++KMKDRGY RG KGLVFVD+EEEAEEE V LHSERLA
Sbjct: 481 FYAGDVESCPHSKEVLCLLRELERKMKDRGYGRGSKGLVFVDVEEEAEEEAVGLHSERLA 540

Query: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600
           LGFGLISIPKG+TIRIMKNLRMCSDCHEAFKLISEI+ER+FVVRD+NRFHHF  GCCTCN
Sbjct: 541 LGFGLISIPKGITIRIMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFTGGCCTCN 600

Query: 601 GFW 604
            FW
Sbjct: 601 DFW 603

BLAST of CsaV3_6G017310 vs. NCBI nr
Match: XP_022930461.1 (pentatricopeptide repeat-containing protein At4g15720 [Cucurbita moschata])

HSP 1 Score: 1005.7 bits (2599), Expect = 6.4e-290
Identity = 497/604 (82.28%), Postives = 541/604 (89.57%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLKR+L FAL        NERP LQTIENLS HLR+C DLISS S HSIALKLGFLN T+
Sbjct: 1   MLKRSLPFAL--------NERPPLQTIENLSHHLRSCKDLISSTSIHSIALKLGFLNQTL 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
             NHLINCYVRFR +A AHQLFDEMP  NVVSWTSLMAGYVDNGQPS AL LFG M R+ 
Sbjct: 61  TANHLINCYVRFRRVAIAHQLFDEMPTRNVVSWTSLMAGYVDNGQPSIALSLFGAMSRTS 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR GE FHA +EIFGYGGN+VVCSSLIDMYGKCNDV++AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRDGEKFHACMEIFGYGGNVVVCSSLIDMYGKCNDVIRAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
            VF+SMSCKNIVSWTSMIAAYAQNAHGD+ALK+FREF S S E PN +MLASVISACA L
Sbjct: 181 RVFDSMSCKNIVSWTSMIAAYAQNAHGDDALKLFREFGSSSWERPNHFMLASVISACACL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLV+GKV+H AAI LGCDS++VVASVLVDMYAKCGSL+YSD+VF RI NP VI YTSMI
Sbjct: 241 GRLVAGKVVHSAAIRLGCDSNDVVASVLVDMYAKCGSLEYSDRVFKRILNPCVISYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VSTAKYG GR+SL+LFEEMVRKGLKPNH+T VGVLHACSHSGLP+EGL+YLTSMYEK+GI
Sbjct: 301 VSTAKYGLGRQSLELFEEMVRKGLKPNHVTFVGVLHACSHSGLPDEGLHYLTSMYEKHGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAP-DDKALLWGALLSASRCHGRVDIAAE 420
           MPETKHYTCVVDMLAR GELDKA+ LAKSM   P DD+ALLWGALLS SR HGRVDIAAE
Sbjct: 361 MPETKHYTCVVDMLARAGELDKAYQLAKSMKAMPDDDQALLWGALLSTSRLHGRVDIAAE 420

Query: 421 ACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSY 480
           ACQ+LV++NRQVAGAYVTLSN YASAGDMEKA +LRVEMK +GV+KEPGCSW+EIKDSSY
Sbjct: 421 ACQRLVDANRQVAGAYVTLSNAYASAGDMEKAQRLRVEMKHSGVYKEPGCSWVEIKDSSY 480

Query: 481 IFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERL 540
           +FYAG++ SCPRGDEVL LLREL++KMKDRGY RG KGLVFVDIEEEAEEEKVWLHSERL
Sbjct: 481 VFYAGDVMSCPRGDEVLRLLRELERKMKDRGYSRGSKGLVFVDIEEEAEEEKVWLHSERL 540

Query: 541 ALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTC 600
           ALGF LISIP+GLTIR+MKNLRMCSDCHEAFKLISEI+ER+FVVRD+NRFHHFK G CTC
Sbjct: 541 ALGFCLISIPEGLTIRMMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFKTGYCTC 596

Query: 601 NGFW 604
           N FW
Sbjct: 601 NDFW 596

BLAST of CsaV3_6G017310 vs. NCBI nr
Match: XP_023000616.1 (pentatricopeptide repeat-containing protein At4g15720 [Cucurbita maxima])

HSP 1 Score: 996.5 bits (2575), Expect = 3.9e-287
Identity = 491/604 (81.29%), Postives = 539/604 (89.24%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLKR+L FAL        NERP LQTIENLS HLR+C DL+SS S HSIALKLGFLN T+
Sbjct: 1   MLKRSLPFAL--------NERPPLQTIENLSHHLRSCKDLVSSTSIHSIALKLGFLNQTI 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
             NHLINCYVRFR +A AHQLFDEMP  NVVSWTSLMAGYV++GQP  AL LFG M R+ 
Sbjct: 61  TANHLINCYVRFRRVAIAHQLFDEMPTRNVVSWTSLMAGYVNDGQPCIALSLFGAMSRTS 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR GE FHA +EIFGYGGN+VVCSSLIDMYGKCN+V++AR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRDGEKFHACMEIFGYGGNVVVCSSLIDMYGKCNNVIRAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
            VF+SMSCKNI+SWTSMIAAYAQNAH D+ALK+FREF S S EHPN +MLASVISACA L
Sbjct: 181 RVFDSMSCKNIISWTSMIAAYAQNAHSDDALKLFREFGSSSWEHPNHFMLASVISACACL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLVSGKV+H AAI LGCDS++VVASVLVDMYAKCGSL+YSD+VF RI NP VI YTSMI
Sbjct: 241 GRLVSGKVVHSAAIRLGCDSNDVVASVLVDMYAKCGSLEYSDRVFKRILNPCVISYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VSTAKYG GR+SL+LFEEMVRKGLKPNH+T VGVLHACSHSGLP+EGL+YLTSMYEK+GI
Sbjct: 301 VSTAKYGLGRQSLELFEEMVRKGLKPNHVTFVGVLHACSHSGLPDEGLHYLTSMYEKHGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAP-DDKALLWGALLSASRCHGRVDIAAE 420
           MPETKHYTCVVDMLAR GELDKA+ LAKSM   P DD+ALLWGALLS SR HGRVDIAAE
Sbjct: 361 MPETKHYTCVVDMLARAGELDKAYQLAKSMKAMPDDDQALLWGALLSTSRLHGRVDIAAE 420

Query: 421 ACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSY 480
           ACQ+LV++NRQVAGAYVTLSN YASAGDMEKA +LRVEMK +GV+KEPGCSW+EIKDSSY
Sbjct: 421 ACQRLVDANRQVAGAYVTLSNAYASAGDMEKAQRLRVEMKHSGVYKEPGCSWVEIKDSSY 480

Query: 481 IFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERL 540
           +FYAG++ SCPRG+EVL LLREL++KMK RGY RG KGLVFVDIEEEAEEEKVWLHSERL
Sbjct: 481 VFYAGDVMSCPRGNEVLHLLRELERKMKGRGYSRGSKGLVFVDIEEEAEEEKVWLHSERL 540

Query: 541 ALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTC 600
           ALGF LISIP+GLTIRIMKNLRMCSDCHEAFKLISEI+ER+FVVRD+NRFHHFK G CTC
Sbjct: 541 ALGFCLISIPEGLTIRIMKNLRMCSDCHEAFKLISEIVERDFVVRDVNRFHHFKTGYCTC 596

Query: 601 NGFW 604
           N FW
Sbjct: 601 NDFW 596

BLAST of CsaV3_6G017310 vs. TAIR10
Match: AT4G15720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 649.0 bits (1673), Expect = 2.8e-186
Identity = 326/567 (57.50%), Postives = 424/567 (74.78%), Query Frame = 0

Query: 47  HSIALKLGFLNNTVNVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQP 106
           H++ LKLGF ++T  VNHL+  YV+ + I TA +LFDEM  PNVVSWTS+++GY D G+P
Sbjct: 52  HTLTLKLGFASDTFTVNHLVISYVKLKEINTARKLFDEMCEPNVVSWTSVISGYNDMGKP 111

Query: 107 STALFLFGEMLRS-PVVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSS 166
             AL +F +M    PV PN++TFA+  KACS L+  R G+  HA +EI G   NIVV SS
Sbjct: 112 QNALSMFQKMHEDRPVPPNEYTFASVFKACSALAESRIGKNIHARLEISGLRRNIVVSSS 171

Query: 167 LIDMYGKCNDVVKARGVFNSM--SCKNIVSWTSMIAAYAQNAHGDEALKVFREF-TSLSS 226
           L+DMYGKCNDV  AR VF+SM    +N+VSWTSMI AYAQNA G EA+++FR F  +L+S
Sbjct: 172 LVDMYGKCNDVETARRVFDSMIGYGRNVVSWTSMITAYAQNARGHEAIELFRSFNAALTS 231

Query: 227 EHPNPYMLASVISACASLGRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSD 286
           +  N +MLASVISAC+SLGRL  GKV HG     G +S+ VVA+ L+DMYAKCGSL  ++
Sbjct: 232 DRANQFMLASVISACSSLGRLQWGKVAHGLVTRGGYESNTVVATSLLDMYAKCGSLSCAE 291

Query: 287 KVFNRISNPSVIPYTSMIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSG 346
           K+F RI   SVI YTSMI++ AK+G G  +++LF+EMV   + PN++TL+GVLHACSHSG
Sbjct: 292 KIFLRIRCHSVISYTSMIMAKAKHGLGEAAVKLFDEMVAGRINPNYVTLLGVLHACSHSG 351

Query: 347 LPNEGLYYLTSMYEKYGIMPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWG 406
           L NEGL YL+ M EKYG++P+++HYTCVVDML R G +D+A++LAK+++V  +  ALLWG
Sbjct: 352 LVNEGLEYLSLMAEKYGVVPDSRHYTCVVDMLGRFGRVDEAYELAKTIEVGAEQGALLWG 411

Query: 407 ALLSASRCHGRVDIAAEACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTG 466
           ALLSA R HGRV+I +EA ++L+ SN+QV  AY+ LSN YA +G  E +  LR+EMKR+G
Sbjct: 412 ALLSAGRLHGRVEIVSEASKRLIQSNQQVTSAYIALSNAYAVSGGWEDSESLRLEMKRSG 471

Query: 467 VHKEPGCSWIEIKDSSYIFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGL---- 526
             KE  CSWIE KDS Y+F+AG++ SC    E+   L++L+++MK+RG+ RG   +    
Sbjct: 472 NVKERACSWIENKDSVYVFHAGDL-SCDESGEIERFLKDLEKRMKERGH-RGSSSMITTS 531

Query: 527 --VFVDIEEEAEEEKVWLHSERLALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEI 586
             VFVD++EEA++E V LH ERLAL +GL+ +P G TIRIM NLRMC DCHEAFKLISEI
Sbjct: 532 SSVFVDVDEEAKDEMVSLHCERLALAYGLLHLPAGSTIRIMNNLRMCRDCHEAFKLISEI 591

Query: 587 MEREFVVRDINRFHHFKNGCCTCNGFW 604
           +ERE VVRD+NRFH FKNG CTC  +W
Sbjct: 592 VEREIVVRDVNRFHCFKNGSCTCRDYW 616

BLAST of CsaV3_6G017310 vs. TAIR10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 401.4 bits (1030), Expect = 1.0e-111
Identity = 219/594 (36.87%), Postives = 335/594 (56.40%), Query Frame = 0

Query: 15  LSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVNVNHLINCYVRFRS 74
           LS+E+  P  QT E L     + + L  ++  H   L  G   +      LI  Y    S
Sbjct: 69  LSQESS-PSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGS 128

Query: 75  IATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFTFATAIKA 134
           +  A ++FD+     +  W +L       G     L L+ +M R  V  + FT+   +KA
Sbjct: 129 VDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKA 188

Query: 135 C----SILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMSCKN 194
           C      +++L  G+  HAH+   GY  ++ + ++L+DMY +   V  A  VF  M  +N
Sbjct: 189 CVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRN 248

Query: 195 IVSWTSMIAAYAQNAHGDEALKVFRE-FTSLSSEHPNPYMLASVISACASLGRLVSGKVM 254
           +VSW++MIA YA+N    EAL+ FRE         PN   + SV+ ACASL  L  GK++
Sbjct: 249 VVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLI 308

Query: 255 HGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYGFG 314
           HG  +  G DS   V S LV MY +CG L+   +VF+R+ +  V+ + S+I S   +G+G
Sbjct: 309 HGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYG 368

Query: 315 RKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHYTC 374
           +K++Q+FEEM+  G  P  +T V VL ACSH GL  EG     +M+  +GI P+ +HY C
Sbjct: 369 KKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYAC 428

Query: 375 VVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNSNR 434
           +VD+L R   LD+A  + + M   P  K  +WG+LL + R HG V++A  A ++L     
Sbjct: 429 MVDLLGRANRLDEAAKMVQDMRTEPGPK--VWGSLLGSCRIHGNVELAERASRRLFALEP 488

Query: 435 QVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEITSC 494
           + AG YV L+++YA A   ++  +++  ++  G+ K PG  W+E++   Y F + +  + 
Sbjct: 489 KNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFN- 548

Query: 495 PRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIP 554
           P  +++   L +L + MK++GY+   KG+++ ++E E +E  V  HSE+LAL FGLI+  
Sbjct: 549 PLMEQIHAFLVKLAEDMKEKGYIPQTKGVLY-ELETEEKERIVLGHSEKLALAFGLINTS 608

Query: 555 KGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
           KG  IRI KNLR+C DCH   K IS+ ME+E +VRD+NRFH FKNG C+C  +W
Sbjct: 609 KGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of CsaV3_6G017310 vs. TAIR10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 399.4 bits (1025), Expect = 3.8e-111
Identity = 202/536 (37.69%), Postives = 328/536 (61.19%), Query Frame = 0

Query: 68  CYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFT 127
           C + F  I +  ++F+ MP  +VVS+ +++AGY  +G    AL +  EM  + + P+ FT
Sbjct: 186 CIMPF-GIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFT 245

Query: 128 FATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMS 187
            ++ +   S   ++  G+  H +V   G   ++ + SSL+DMY K   +  +  VF+ + 
Sbjct: 246 LSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLY 305

Query: 188 CKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLGRLVSGK 247
           C++ +SW S++A Y QN   +EAL++FR+  + +   P     +SVI ACA L  L  GK
Sbjct: 306 CRDGISWNSLVAGYVQNGRYNEALRLFRQMVT-AKVKPGAVAFSSVIPACAHLATLHLGK 365

Query: 248 VMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYG 307
            +HG  +  G  S+  +AS LVDMY+KCG++  + K+F+R++    + +T++I+  A +G
Sbjct: 366 QLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHG 425

Query: 308 FGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHY 367
            G +++ LFEEM R+G+KPN +  V VL ACSH GL +E   Y  SM + YG+  E +HY
Sbjct: 426 HGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHY 485

Query: 368 TCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNS 427
             V D+L R G+L++A++    M V P     +W  LLS+   H  +++A +  +++   
Sbjct: 486 AAVADLLGRAGKLEEAYNFISKMCVEPTGS--VWSTLLSSCSVHKNLELAEKVAEKIFTV 545

Query: 428 NRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEIT 487
           + +  GAYV + N+YAS G  ++  KLR+ M++ G+ K+P CSWIE+K+ ++ F +G+  
Sbjct: 546 DSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGD-R 605

Query: 488 SCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLIS 547
           S P  D++   L+ + ++M+  GYV    G V  D++EE + E ++ HSERLA+ FG+I+
Sbjct: 606 SHPSMDKINEFLKAVMEQMEKEGYVADTSG-VLHDVDEEHKRELLFGHSERLAVAFGIIN 665

Query: 548 IPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
              G TIR+ KN+R+C+DCH A K IS+I ERE +VRD +RFHHF  G C+C  +W
Sbjct: 666 TEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of CsaV3_6G017310 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 396.4 bits (1017), Expect = 3.2e-110
Identity = 219/590 (37.12%), Postives = 330/590 (55.93%), Query Frame = 0

Query: 15  LSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVNVN-HLINCYVRFR 74
           +  EN +P   TI ++   +     +      H  A++ GF ++ VN++  L++ Y +  
Sbjct: 227 MCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGF-DSLVNISTALVDMYAKCG 286

Query: 75  SIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFTFATAIK 134
           S+ TA QLFD M   NVVSW S++  YV N  P  A+ +F +ML   V P D +   A+ 
Sbjct: 287 SLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALH 346

Query: 135 ACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMSCKNIVS 194
           AC+ L +L  G   H      G   N+ V +SLI MY KC +V  A  +F  +  + +VS
Sbjct: 347 ACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVS 406

Query: 195 WTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLGRLVSGKVMHGAA 254
           W +MI  +AQN    +AL  F +  S + + P+ +   SVI+A A L      K +HG  
Sbjct: 407 WNAMILGFAQNGRPIDALNYFSQMRSRTVK-PDTFTYVSVITAIAELSITHHAKWIHGVV 466

Query: 255 ISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYGFGRKSL 314
           +    D +  V + LVDMYAKCG++  +  +F+ +S   V  + +MI     +GFG+ +L
Sbjct: 467 MRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAAL 526

Query: 315 QLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHYTCVVDM 374
           +LFEEM +  +KPN +T + V+ ACSHSGL   GL     M E Y I     HY  +VD+
Sbjct: 527 ELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDL 586

Query: 375 LARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNSNRQVAG 434
           L R G L++A+D    M V P     ++GA+L A + H  V+ A +A ++L   N    G
Sbjct: 587 LGRAGRLNEAWDFIMQMPVKP--AVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGG 646

Query: 435 AYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEITSCPRGD 494
            +V L+N+Y +A   EK  ++RV M R G+ K PGCS +EIK+  + F++G  T+ P   
Sbjct: 647 YHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGS-TAHPDSK 706

Query: 495 EVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIPKGLT 554
           ++   L +L   +K+ GYV      + + +E + +E+ +  HSE+LA+ FGL++   G T
Sbjct: 707 KIYAFLEKLICHIKEAGYVPDTN--LVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTT 766

Query: 555 IRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
           I + KNLR+C+DCH A K IS +  RE VVRD+ RFHHFKNG C+C  +W
Sbjct: 767 IHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CsaV3_6G017310 vs. TAIR10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 394.0 bits (1011), Expect = 1.6e-109
Identity = 215/566 (37.99%), Postives = 336/566 (59.36%), Query Frame = 0

Query: 47  HSIALKLGFLNNTVNVNHLINCYVRFR---SIATAHQLFDEMPNPNVVSWTSLMAGYVDN 106
           HS A++ G +++      L++ Y +     S+    ++FD M + +V+SWT+L+ GY+ N
Sbjct: 292 HSWAIRSGLVDDV--ECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWTALITGYMKN 351

Query: 107 GQPST-ALFLFGEML-RSPVVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIV 166
              +T A+ LF EM+ +  V PN FTF++A KAC  LS+ R G+         G   N  
Sbjct: 352 CNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSS 411

Query: 167 VCSSLIDMYGKCNDVVKARGVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLS 226
           V +S+I M+ K + +  A+  F S+S KN+VS+ + +    +N + ++A K+  E T   
Sbjct: 412 VANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITE-R 471

Query: 227 SEHPNPYMLASVISACASLGRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYS 286
               + +  AS++S  A++G +  G+ +H   + LG   ++ V + L+ MY+KCGS+D +
Sbjct: 472 ELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISMYSKCGSIDTA 531

Query: 287 DKVFNRISNPSVIPYTSMIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHS 346
            +VFN + N +VI +TSMI   AK+GF  + L+ F +M+ +G+KPN +T V +L ACSH 
Sbjct: 532 SRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTYVAILSACSHV 591

Query: 347 GLPNEGLYYLTSMYEKYGIMPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLW 406
           GL +EG  +  SMYE + I P+ +HY C+VD+L R G L  AF+   +M    D   L+W
Sbjct: 592 GLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMPFQAD--VLVW 651

Query: 407 GALLSASRCHGRVDIAAEACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRT 466
              L A R H   ++   A ++++  +     AY+ LSN+YA AG  E++ ++R +MK  
Sbjct: 652 RTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTEMRRKMKER 711

Query: 467 GVHKEPGCSWIEIKDSSYIFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFV 526
            + KE GCSWIE+ D  + FY G+ T+ P   ++   L  L  ++K  GYV     LV  
Sbjct: 712 NLVKEGGCSWIEVGDKIHKFYVGD-TAHPNAHQIYDELDRLITEIKRCGYVPD-TDLVLH 771

Query: 527 DIEE---EAEEEK-VWLHSERLALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIM 586
            +EE   EAE+E+ ++ HSE++A+ FGLIS  K   +R+ KNLR+C DCH A K IS + 
Sbjct: 772 KLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCHNAMKYISTVS 831

Query: 587 EREFVVRDINRFHHFKNGCCTCNGFW 604
            RE V+RD+NRFHHFK+G C+CN +W
Sbjct: 832 GREIVLRDLNRFHHFKDGKCSCNDYW 850

BLAST of CsaV3_6G017310 vs. Swiss-Prot
Match: sp|Q8VYH0|PP313_ARATH (Pentatricopeptide repeat-containing protein At4g15720 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H1 PE=2 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 5.0e-185
Identity = 326/567 (57.50%), Postives = 424/567 (74.78%), Query Frame = 0

Query: 47  HSIALKLGFLNNTVNVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQP 106
           H++ LKLGF ++T  VNHL+  YV+ + I TA +LFDEM  PNVVSWTS+++GY D G+P
Sbjct: 52  HTLTLKLGFASDTFTVNHLVISYVKLKEINTARKLFDEMCEPNVVSWTSVISGYNDMGKP 111

Query: 107 STALFLFGEMLRS-PVVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSS 166
             AL +F +M    PV PN++TFA+  KACS L+  R G+  HA +EI G   NIVV SS
Sbjct: 112 QNALSMFQKMHEDRPVPPNEYTFASVFKACSALAESRIGKNIHARLEISGLRRNIVVSSS 171

Query: 167 LIDMYGKCNDVVKARGVFNSM--SCKNIVSWTSMIAAYAQNAHGDEALKVFREF-TSLSS 226
           L+DMYGKCNDV  AR VF+SM    +N+VSWTSMI AYAQNA G EA+++FR F  +L+S
Sbjct: 172 LVDMYGKCNDVETARRVFDSMIGYGRNVVSWTSMITAYAQNARGHEAIELFRSFNAALTS 231

Query: 227 EHPNPYMLASVISACASLGRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSD 286
           +  N +MLASVISAC+SLGRL  GKV HG     G +S+ VVA+ L+DMYAKCGSL  ++
Sbjct: 232 DRANQFMLASVISACSSLGRLQWGKVAHGLVTRGGYESNTVVATSLLDMYAKCGSLSCAE 291

Query: 287 KVFNRISNPSVIPYTSMIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSG 346
           K+F RI   SVI YTSMI++ AK+G G  +++LF+EMV   + PN++TL+GVLHACSHSG
Sbjct: 292 KIFLRIRCHSVISYTSMIMAKAKHGLGEAAVKLFDEMVAGRINPNYVTLLGVLHACSHSG 351

Query: 347 LPNEGLYYLTSMYEKYGIMPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWG 406
           L NEGL YL+ M EKYG++P+++HYTCVVDML R G +D+A++LAK+++V  +  ALLWG
Sbjct: 352 LVNEGLEYLSLMAEKYGVVPDSRHYTCVVDMLGRFGRVDEAYELAKTIEVGAEQGALLWG 411

Query: 407 ALLSASRCHGRVDIAAEACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTG 466
           ALLSA R HGRV+I +EA ++L+ SN+QV  AY+ LSN YA +G  E +  LR+EMKR+G
Sbjct: 412 ALLSAGRLHGRVEIVSEASKRLIQSNQQVTSAYIALSNAYAVSGGWEDSESLRLEMKRSG 471

Query: 467 VHKEPGCSWIEIKDSSYIFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGL---- 526
             KE  CSWIE KDS Y+F+AG++ SC    E+   L++L+++MK+RG+ RG   +    
Sbjct: 472 NVKERACSWIENKDSVYVFHAGDL-SCDESGEIERFLKDLEKRMKERGH-RGSSSMITTS 531

Query: 527 --VFVDIEEEAEEEKVWLHSERLALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEI 586
             VFVD++EEA++E V LH ERLAL +GL+ +P G TIRIM NLRMC DCHEAFKLISEI
Sbjct: 532 SSVFVDVDEEAKDEMVSLHCERLALAYGLLHLPAGSTIRIMNNLRMCRDCHEAFKLISEI 591

Query: 587 MEREFVVRDINRFHHFKNGCCTCNGFW 604
           +ERE VVRD+NRFH FKNG CTC  +W
Sbjct: 592 VEREIVVRDVNRFHCFKNGSCTCRDYW 616

BLAST of CsaV3_6G017310 vs. Swiss-Prot
Match: sp|Q9STF3|PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.8e-110
Identity = 219/594 (36.87%), Postives = 335/594 (56.40%), Query Frame = 0

Query: 15  LSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVNVNHLINCYVRFRS 74
           LS+E+  P  QT E L     + + L  ++  H   L  G   +      LI  Y    S
Sbjct: 69  LSQESS-PSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGS 128

Query: 75  IATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFTFATAIKA 134
           +  A ++FD+     +  W +L       G     L L+ +M R  V  + FT+   +KA
Sbjct: 129 VDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKA 188

Query: 135 C----SILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMSCKN 194
           C      +++L  G+  HAH+   GY  ++ + ++L+DMY +   V  A  VF  M  +N
Sbjct: 189 CVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRN 248

Query: 195 IVSWTSMIAAYAQNAHGDEALKVFRE-FTSLSSEHPNPYMLASVISACASLGRLVSGKVM 254
           +VSW++MIA YA+N    EAL+ FRE         PN   + SV+ ACASL  L  GK++
Sbjct: 249 VVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLI 308

Query: 255 HGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYGFG 314
           HG  +  G DS   V S LV MY +CG L+   +VF+R+ +  V+ + S+I S   +G+G
Sbjct: 309 HGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYG 368

Query: 315 RKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHYTC 374
           +K++Q+FEEM+  G  P  +T V VL ACSH GL  EG     +M+  +GI P+ +HY C
Sbjct: 369 KKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYAC 428

Query: 375 VVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNSNR 434
           +VD+L R   LD+A  + + M   P  K  +WG+LL + R HG V++A  A ++L     
Sbjct: 429 MVDLLGRANRLDEAAKMVQDMRTEPGPK--VWGSLLGSCRIHGNVELAERASRRLFALEP 488

Query: 435 QVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEITSC 494
           + AG YV L+++YA A   ++  +++  ++  G+ K PG  W+E++   Y F + +  + 
Sbjct: 489 KNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFN- 548

Query: 495 PRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIP 554
           P  +++   L +L + MK++GY+   KG+++ ++E E +E  V  HSE+LAL FGLI+  
Sbjct: 549 PLMEQIHAFLVKLAEDMKEKGYIPQTKGVLY-ELETEEKERIVLGHSEKLALAFGLINTS 608

Query: 555 KGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
           KG  IRI KNLR+C DCH   K IS+ ME+E +VRD+NRFH FKNG C+C  +W
Sbjct: 609 KGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of CsaV3_6G017310 vs. Swiss-Prot
Match: sp|Q9LW63|PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 6.9e-110
Identity = 202/536 (37.69%), Postives = 328/536 (61.19%), Query Frame = 0

Query: 68  CYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFT 127
           C + F  I +  ++F+ MP  +VVS+ +++AGY  +G    AL +  EM  + + P+ FT
Sbjct: 186 CIMPF-GIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFT 245

Query: 128 FATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMS 187
            ++ +   S   ++  G+  H +V   G   ++ + SSL+DMY K   +  +  VF+ + 
Sbjct: 246 LSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLY 305

Query: 188 CKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLGRLVSGK 247
           C++ +SW S++A Y QN   +EAL++FR+  + +   P     +SVI ACA L  L  GK
Sbjct: 306 CRDGISWNSLVAGYVQNGRYNEALRLFRQMVT-AKVKPGAVAFSSVIPACAHLATLHLGK 365

Query: 248 VMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYG 307
            +HG  +  G  S+  +AS LVDMY+KCG++  + K+F+R++    + +T++I+  A +G
Sbjct: 366 QLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHG 425

Query: 308 FGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHY 367
            G +++ LFEEM R+G+KPN +  V VL ACSH GL +E   Y  SM + YG+  E +HY
Sbjct: 426 HGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHY 485

Query: 368 TCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNS 427
             V D+L R G+L++A++    M V P     +W  LLS+   H  +++A +  +++   
Sbjct: 486 AAVADLLGRAGKLEEAYNFISKMCVEPTGS--VWSTLLSSCSVHKNLELAEKVAEKIFTV 545

Query: 428 NRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEIT 487
           + +  GAYV + N+YAS G  ++  KLR+ M++ G+ K+P CSWIE+K+ ++ F +G+  
Sbjct: 546 DSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGD-R 605

Query: 488 SCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLIS 547
           S P  D++   L+ + ++M+  GYV    G V  D++EE + E ++ HSERLA+ FG+I+
Sbjct: 606 SHPSMDKINEFLKAVMEQMEKEGYVADTSG-VLHDVDEEHKRELLFGHSERLAVAFGIIN 665

Query: 548 IPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
              G TIR+ KN+R+C+DCH A K IS+I ERE +VRD +RFHHF  G C+C  +W
Sbjct: 666 TEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of CsaV3_6G017310 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 5.9e-109
Identity = 219/590 (37.12%), Postives = 330/590 (55.93%), Query Frame = 0

Query: 15  LSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVNVN-HLINCYVRFR 74
           +  EN +P   TI ++   +     +      H  A++ GF ++ VN++  L++ Y +  
Sbjct: 227 MCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGF-DSLVNISTALVDMYAKCG 286

Query: 75  SIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFTFATAIK 134
           S+ TA QLFD M   NVVSW S++  YV N  P  A+ +F +ML   V P D +   A+ 
Sbjct: 287 SLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALH 346

Query: 135 ACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMSCKNIVS 194
           AC+ L +L  G   H      G   N+ V +SLI MY KC +V  A  +F  +  + +VS
Sbjct: 347 ACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVS 406

Query: 195 WTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLGRLVSGKVMHGAA 254
           W +MI  +AQN    +AL  F +  S + + P+ +   SVI+A A L      K +HG  
Sbjct: 407 WNAMILGFAQNGRPIDALNYFSQMRSRTVK-PDTFTYVSVITAIAELSITHHAKWIHGVV 466

Query: 255 ISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYGFGRKSL 314
           +    D +  V + LVDMYAKCG++  +  +F+ +S   V  + +MI     +GFG+ +L
Sbjct: 467 MRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAAL 526

Query: 315 QLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHYTCVVDM 374
           +LFEEM +  +KPN +T + V+ ACSHSGL   GL     M E Y I     HY  +VD+
Sbjct: 527 ELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDL 586

Query: 375 LARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNSNRQVAG 434
           L R G L++A+D    M V P     ++GA+L A + H  V+ A +A ++L   N    G
Sbjct: 587 LGRAGRLNEAWDFIMQMPVKP--AVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGG 646

Query: 435 AYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEITSCPRGD 494
            +V L+N+Y +A   EK  ++RV M R G+ K PGCS +EIK+  + F++G  T+ P   
Sbjct: 647 YHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGS-TAHPDSK 706

Query: 495 EVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLISIPKGLT 554
           ++   L +L   +K+ GYV      + + +E + +E+ +  HSE+LA+ FGL++   G T
Sbjct: 707 KIYAFLEKLICHIKEAGYVPDTN--LVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTT 766

Query: 555 IRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
           I + KNLR+C+DCH A K IS +  RE VVRD+ RFHHFKNG C+C  +W
Sbjct: 767 IHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CsaV3_6G017310 vs. Swiss-Prot
Match: sp|Q5G1T1|PP272_ARATH (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 2.9e-108
Identity = 215/566 (37.99%), Postives = 336/566 (59.36%), Query Frame = 0

Query: 47  HSIALKLGFLNNTVNVNHLINCYVRFR---SIATAHQLFDEMPNPNVVSWTSLMAGYVDN 106
           HS A++ G +++      L++ Y +     S+    ++FD M + +V+SWT+L+ GY+ N
Sbjct: 292 HSWAIRSGLVDDV--ECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWTALITGYMKN 351

Query: 107 GQPST-ALFLFGEML-RSPVVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIV 166
              +T A+ LF EM+ +  V PN FTF++A KAC  LS+ R G+         G   N  
Sbjct: 352 CNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSS 411

Query: 167 VCSSLIDMYGKCNDVVKARGVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLS 226
           V +S+I M+ K + +  A+  F S+S KN+VS+ + +    +N + ++A K+  E T   
Sbjct: 412 VANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITE-R 471

Query: 227 SEHPNPYMLASVISACASLGRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYS 286
               + +  AS++S  A++G +  G+ +H   + LG   ++ V + L+ MY+KCGS+D +
Sbjct: 472 ELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISMYSKCGSIDTA 531

Query: 287 DKVFNRISNPSVIPYTSMIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHS 346
            +VFN + N +VI +TSMI   AK+GF  + L+ F +M+ +G+KPN +T V +L ACSH 
Sbjct: 532 SRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTYVAILSACSHV 591

Query: 347 GLPNEGLYYLTSMYEKYGIMPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLW 406
           GL +EG  +  SMYE + I P+ +HY C+VD+L R G L  AF+   +M    D   L+W
Sbjct: 592 GLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMPFQAD--VLVW 651

Query: 407 GALLSASRCHGRVDIAAEACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRT 466
              L A R H   ++   A ++++  +     AY+ LSN+YA AG  E++ ++R +MK  
Sbjct: 652 RTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTEMRRKMKER 711

Query: 467 GVHKEPGCSWIEIKDSSYIFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFV 526
            + KE GCSWIE+ D  + FY G+ T+ P   ++   L  L  ++K  GYV     LV  
Sbjct: 712 NLVKEGGCSWIEVGDKIHKFYVGD-TAHPNAHQIYDELDRLITEIKRCGYVPD-TDLVLH 771

Query: 527 DIEE---EAEEEK-VWLHSERLALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIM 586
            +EE   EAE+E+ ++ HSE++A+ FGLIS  K   +R+ KNLR+C DCH A K IS + 
Sbjct: 772 KLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCHNAMKYISTVS 831

Query: 587 EREFVVRDINRFHHFKNGCCTCNGFW 604
            RE V+RD+NRFHHFK+G C+CN +W
Sbjct: 832 GREIVLRDLNRFHHFKDGKCSCNDYW 850

BLAST of CsaV3_6G017310 vs. TrEMBL
Match: tr|A0A0A0KEC7|A0A0A0KEC7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G197240 PE=4 SV=1)

HSP 1 Score: 1227.6 bits (3175), Expect = 0.0e+00
Identity = 603/603 (100.00%), Postives = 603/603 (100.00%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV
Sbjct: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
           NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP
Sbjct: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
           GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL
Sbjct: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI
Sbjct: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420
           MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA
Sbjct: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480
           CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600
           LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600

Query: 601 GFW 604
           GFW
Sbjct: 601 GFW 603

BLAST of CsaV3_6G017310 vs. TrEMBL
Match: tr|A0A1S3C8I5|A0A1S3C8I5_CUCME (pentatricopeptide repeat-containing protein At4g15720 OS=Cucumis melo OX=3656 GN=LOC103497869 PE=4 SV=1)

HSP 1 Score: 1184.5 bits (3063), Expect = 0.0e+00
Identity = 582/603 (96.52%), Postives = 590/603 (97.84%), Query Frame = 0

Query: 1   MLKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTV 60
           MLKRNL+FALTRPYLSRENERPFLQTIENLS HLRNCNDLISSI THSIALKLGFLNNTV
Sbjct: 1   MLKRNLHFALTRPYLSRENERPFLQTIENLSLHLRNCNDLISSIFTHSIALKLGFLNNTV 60

Query: 61  NVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSP 120
            VNHLINCYVRFRSIATAH+LFDEMPNPNVVSWTSLMAGYVDNGQPSTAL LFGEMLRSP
Sbjct: 61  TVNHLINCYVRFRSIATAHRLFDEMPNPNVVSWTSLMAGYVDNGQPSTALLLFGEMLRSP 120

Query: 121 VVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKAR 180
           VVPNDFTFATAIKACSILSNLR+GE FHAHVEIFGYG NIVVCSSLIDMYGKCNDVVKAR
Sbjct: 121 VVPNDFTFATAIKACSILSNLRYGERFHAHVEIFGYGCNIVVCSSLIDMYGKCNDVVKAR 180

Query: 181 GVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASL 240
            VFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSE PNPYMLASVISACASL
Sbjct: 181 SVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSERPNPYMLASVISACASL 240

Query: 241 GRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMI 300
           GRLVSGKV+HGAAI LGCDSSEVVASVLVDMYAKCGSLDYSD VFNRISNPSVIPYTSMI
Sbjct: 241 GRLVSGKVVHGAAICLGCDSSEVVASVLVDMYAKCGSLDYSDNVFNRISNPSVIPYTSMI 300

Query: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGI 360
           VSTAKYGFGRKSLQLFEEMVRKGLKPNHITL+GVLHACSHSGLPNEGLYYLTSMYEKYGI
Sbjct: 301 VSTAKYGFGRKSLQLFEEMVRKGLKPNHITLLGVLHACSHSGLPNEGLYYLTSMYEKYGI 360

Query: 361 MPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420
           MPETKHYTCVVDMLAR GELDKAFD+AKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA
Sbjct: 361 MPETKHYTCVVDMLARSGELDKAFDIAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEA 420

Query: 421 CQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYI 480
           CQQLVNSNRQVAGAYVTLSN YASAGDMEKAHKLRVEMK TGVHKEPGCSWIEIKDSSYI
Sbjct: 421 CQQLVNSNRQVAGAYVTLSNAYASAGDMEKAHKLRVEMKHTGVHKEPGCSWIEIKDSSYI 480

Query: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540
           FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA
Sbjct: 481 FYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLA 540

Query: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCN 600
           LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHF++GCCTCN
Sbjct: 541 LGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFQSGCCTCN 600

Query: 601 GFW 604
           GFW
Sbjct: 601 GFW 603

BLAST of CsaV3_6G017310 vs. TrEMBL
Match: tr|M5XKX1|M5XKX1_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G138900 PE=4 SV=1)

HSP 1 Score: 812.8 bits (2098), Expect = 5.3e-232
Identity = 399/602 (66.28%), Postives = 479/602 (79.57%), Query Frame = 0

Query: 2   LKRNLYFALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVN 61
           L +NL  ALT   LSR+N+     +  +L + LR+C D  S+ S HS  +K G L +T  
Sbjct: 5   LNQNLISALTASNLSRQNQ----LSPSHLIQQLRSCKDSDSAKSLHSNGIKSGSLYDTFT 64

Query: 62  VNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPV 121
            NHLINCYVR + I  A QLFDEMP PNVVSWTSLMAGYVD GQP  AL++FG+M    V
Sbjct: 65  TNHLINCYVRLQRIDLASQLFDEMPEPNVVSWTSLMAGYVDTGQPRMALWVFGKMPECSV 124

Query: 122 VPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARG 181
           +PN+FTFAT I ACSIL++LR G+  HA VE+ G+  N+VVCSSL+DMYGKCNDV  A+ 
Sbjct: 125 LPNEFTFATVINACSILAHLRTGKKIHALVELLGFQSNLVVCSSLVDMYGKCNDVDHAQR 184

Query: 182 VFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLG 241
           VF+ M C+N+VSWTS+IAAYAQNA GDEAL++FREF  L  E PN +MLASV++ACASLG
Sbjct: 185 VFDLMGCRNVVSWTSIIAAYAQNAQGDEALQLFREFNRLMLERPNHFMLASVVNACASLG 244

Query: 242 RLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIV 301
           RLVSGKV HGA I  G DS+ V+AS L+DMYAK G ++YSDKVF RI NPSVIPYTSMIV
Sbjct: 245 RLVSGKVAHGAVIRGGYDSNAVIASALLDMYAKSGCVEYSDKVFRRIRNPSVIPYTSMIV 304

Query: 302 STAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIM 361
           + AKYG GR SLQLF+EM+ + +KPN +T VGVLHACSHSGL +EGL  L SM+EK+GI 
Sbjct: 305 AAAKYGLGRMSLQLFQEMIDRRIKPNDVTFVGVLHACSHSGLVDEGLQQLESMHEKHGIT 364

Query: 362 PETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEAC 421
           P  KHYTC+VDML R G L++A++LAKS+    + +ALLWG LLSASR HGRVDIA EA 
Sbjct: 365 PTAKHYTCIVDMLGRTGRLNEAYELAKSIQAEANQEALLWGTLLSASRLHGRVDIAVEAS 424

Query: 422 QQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIF 481
           ++L++SN+QV GAYVTLSN YA  G+ E AH LR+EM+RTGV KEPGCSW+E+KDSSY+F
Sbjct: 425 RRLIDSNQQVVGAYVTLSNAYALNGEWETAHDLRLEMRRTGVQKEPGCSWVEMKDSSYVF 484

Query: 482 YAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLAL 541
           YAG+++SC RG EV+ LLREL+ KMK RGYV G +GLVFVD+EEEA+E  V LHSERLAL
Sbjct: 485 YAGDVSSCTRGSEVVTLLRELEGKMKQRGYVGGSRGLVFVDVEEEAKEGIVGLHSERLAL 544

Query: 542 GFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNG 601
           GF L+SIPKG+TIRIMKNLRMC DCHEAFKLIS+I+ERE VVRD+NRFHHFK+G CTC  
Sbjct: 545 GFALLSIPKGVTIRIMKNLRMCRDCHEAFKLISDIVERECVVRDVNRFHHFKSGSCTCRD 602

Query: 602 FW 604
           FW
Sbjct: 605 FW 602

BLAST of CsaV3_6G017310 vs. TrEMBL
Match: tr|A0A2C9UBS1|A0A2C9UBS1_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_15G014100 PE=4 SV=1)

HSP 1 Score: 808.9 bits (2088), Expect = 7.7e-231
Identity = 397/606 (65.51%), Postives = 473/606 (78.05%), Query Frame = 0

Query: 2   LKRNLYFALTRPYLSRENERPFLQTIENLSRH----LRNCNDLISSISTHSIALKLGFLN 61
           L R     LT   L R+N+R    T   L  H    LRNCN +I + S H   LK G L+
Sbjct: 5   LSRKCLSVLTNSRLPRQNKRSNFHT--QLQAHFIEKLRNCNHVICATSAHCYLLKSGLLH 64

Query: 62  NTVNVNHLINCYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEML 121
           +T  +NHLINCYVR    + A QLFDEMP PNVVSWTSLMAGY+D G+P  AL+L+ +ML
Sbjct: 65  DTFTINHLINCYVRLPKTSHAQQLFDEMPEPNVVSWTSLMAGYIDTGRPDFALWLYRKML 124

Query: 122 RSPVVPNDFTFATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVV 181
            S V PNDFTFAT I ACS+L+NL  G+  H H+EIFG+ GN+VV SSL+DMYGKCNDV 
Sbjct: 125 ESSVAPNDFTFATVINACSMLANLETGKQIHTHIEIFGFQGNLVVYSSLVDMYGKCNDVD 184

Query: 182 KARGVFNSMSCKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISAC 241
            AR VF+ M  KN+VSWTSMI+AYAQNA G +AL+VF+EF+    E PN +ML SVISAC
Sbjct: 185 GARRVFDIMDYKNVVSWTSMISAYAQNARGHDALEVFKEFSCSMQERPNHFMLGSVISAC 244

Query: 242 ASLGRLVSGKVMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYT 301
           ASLG+LVSGKV HGA I  G + S+VVAS LVDMYAKCG   YSDKVF RI +PSVIPYT
Sbjct: 245 ASLGKLVSGKVTHGAVIRSGHELSDVVASALVDMYAKCGCFSYSDKVFRRIQDPSVIPYT 304

Query: 302 SMIVSTAKYGFGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEK 361
           SMIV  AKYG G+ SLQLF+EM+ + +KPN +T VG+LHACSHSGL +EGL +L SM+EK
Sbjct: 305 SMIVGAAKYGLGKLSLQLFKEMIDRRIKPNDVTFVGLLHACSHSGLVDEGLEHLNSMHEK 364

Query: 362 YGIMPETKHYTCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIA 421
           +G++P+ KHYTCVVDML+RVG +D+A+ LAKS  V   + ALLWG LLSASR HGRVDIA
Sbjct: 365 HGLVPDAKHYTCVVDMLSRVGRIDEAYRLAKSTRVDHHEGALLWGTLLSASRLHGRVDIA 424

Query: 422 AEACQQLVNSNRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDS 481
            EA + L+  N+QVAGAYVTLSN YA AG+ E AH LR EMKRTGVHKEPGCSW+EIKDS
Sbjct: 425 VEASKWLIECNQQVAGAYVTLSNTYALAGEWENAHSLRTEMKRTGVHKEPGCSWVEIKDS 484

Query: 482 SYIFYAGEITSCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSE 541
           +Y+FYAG++ SC RG+EVLCLL+EL+++MK+RGYV G  GLVFVD+E E  EE V LHSE
Sbjct: 485 TYVFYAGDL-SCERGNEVLCLLKELERRMKERGYVGGSMGLVFVDVEPEVREEIVGLHSE 544

Query: 542 RLALGFGLISIPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCC 601
           RLAL FGL++IPKG+TIR+MKNLRMC DCH+AFKLISEI+ER+FVVRDINRFHHF +G C
Sbjct: 545 RLALAFGLMTIPKGITIRVMKNLRMCKDCHDAFKLISEIVERDFVVRDINRFHHFMDGSC 604

Query: 602 TCNGFW 604
           +C  FW
Sbjct: 605 SCRDFW 607

BLAST of CsaV3_6G017310 vs. TrEMBL
Match: tr|A0A067KYD1|A0A067KYD1_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_01996 PE=4 SV=1)

HSP 1 Score: 808.1 bits (2086), Expect = 1.3e-230
Identity = 395/596 (66.28%), Postives = 474/596 (79.53%), Query Frame = 0

Query: 8   FALTRPYLSRENERPFLQTIENLSRHLRNCNDLISSISTHSIALKLGFLNNTVNVNHLIN 67
           F+LT  +LSR+ +R    T  N    LRNCN   ++ISTHS   K G L +T  +NHLIN
Sbjct: 11  FSLTTLHLSRQKKRSNYHTKANFIEKLRNCNHFNAAISTHSSLFKSGLLFDTFTINHLIN 70

Query: 68  CYVRFRSIATAHQLFDEMPNPNVVSWTSLMAGYVDNGQPSTALFLFGEMLRSPVVPNDFT 127
           CY+R      A QLFDEMP PNVVSWTS++AGYVD GQP  AL+L+ +M  S VVPNDFT
Sbjct: 71  CYIRLHKTGHAQQLFDEMPEPNVVSWTSIIAGYVDTGQPKFALWLYTKMPESSVVPNDFT 130

Query: 128 FATAIKACSILSNLRHGEMFHAHVEIFGYGGNIVVCSSLIDMYGKCNDVVKARGVFNSMS 187
           F+T I ACSIL++L  G+  H H+E+ G+ GN+VVCSSLIDMYGKCND   AR VF+ M 
Sbjct: 131 FSTVINACSILADLETGKKIHTHIELLGFQGNLVVCSSLIDMYGKCNDPDGARRVFDLME 190

Query: 188 CKNIVSWTSMIAAYAQNAHGDEALKVFREFTSLSSEHPNPYMLASVISACASLGRLVSGK 247
            +N+VSWTS++AAYAQNA G EAL+VFREF+SL  E PN +MLASVI+ACASLG+LVSGK
Sbjct: 191 YRNVVSWTSLVAAYAQNARGHEALQVFREFSSLMRESPNHFMLASVINACASLGKLVSGK 250

Query: 248 VMHGAAISLGCDSSEVVASVLVDMYAKCGSLDYSDKVFNRISNPSVIPYTSMIVSTAKYG 307
           V HGA I  G + ++VVAS LVDMYAKCG   YSDKVF RI++PSVIPYTSMIV  AKYG
Sbjct: 251 VAHGAVIRSGHEINDVVASALVDMYAKCGCFSYSDKVFRRITDPSVIPYTSMIVGAAKYG 310

Query: 308 FGRKSLQLFEEMVRKGLKPNHITLVGVLHACSHSGLPNEGLYYLTSMYEKYGIMPETKHY 367
            G+ SLQ F+EM+ + +KPN IT VGVLHACSHSGL +EGL +L SMYEK+GIMP+TKHY
Sbjct: 311 LGKLSLQFFDEMIERRIKPNDITFVGVLHACSHSGLVDEGLKHLNSMYEKHGIMPDTKHY 370

Query: 368 TCVVDMLARVGELDKAFDLAKSMDVAPDDKALLWGALLSASRCHGRVDIAAEACQQLVNS 427
           TCVVDML+R+G LD+A  LAKS+ V PD+ ALLWG LLSASR  GRVDIA EA + L+ S
Sbjct: 371 TCVVDMLSRIGCLDEAHRLAKSIRVNPDEGALLWGTLLSASRLQGRVDIAVEASKWLIES 430

Query: 428 NRQVAGAYVTLSNVYASAGDMEKAHKLRVEMKRTGVHKEPGCSWIEIKDSSYIFYAGEIT 487
           N+QVAGAYV LSN YA AG+ E A+ LR EM+R GV+KEPGCSW+E KDS+Y+FYAG++ 
Sbjct: 431 NQQVAGAYVILSNTYALAGEWENANSLRSEMRRVGVYKEPGCSWVENKDSTYVFYAGDL- 490

Query: 488 SCPRGDEVLCLLRELDQKMKDRGYVRGRKGLVFVDIEEEAEEEKVWLHSERLALGFGLIS 547
           S  RG EVL LL++L+++MK++GYV G  GLVFVD+E+EA+EE V LHSERLAL FGLIS
Sbjct: 491 SFERGSEVLSLLKDLERRMKEKGYVGGSMGLVFVDVEQEAKEEIVGLHSERLALAFGLIS 550

Query: 548 IPKGLTIRIMKNLRMCSDCHEAFKLISEIMEREFVVRDINRFHHFKNGCCTCNGFW 604
           +PKG+TIR+MKNLRMC DCHEAFKLISEI+ERE VVRD+NRFHHFKNG C+C  FW
Sbjct: 551 MPKGITIRVMKNLRMCKDCHEAFKLISEIVEREIVVRDVNRFHHFKNGSCSCMDFW 605

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004143199.10.0e+00100.00PREDICTED: pentatricopeptide repeat-containing protein At4g15720 [Cucumis sativu... [more]
XP_008458473.10.0e+0096.52PREDICTED: pentatricopeptide repeat-containing protein At4g15720 [Cucumis melo][more]
XP_022138437.11.3e-29080.93pentatricopeptide repeat-containing protein At4g15720 [Momordica charantia][more]
XP_022930461.16.4e-29082.28pentatricopeptide repeat-containing protein At4g15720 [Cucurbita moschata][more]
XP_023000616.13.9e-28781.29pentatricopeptide repeat-containing protein At4g15720 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT4G15720.12.8e-18657.50Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G46790.11.0e-11136.87Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G23330.13.8e-11137.69Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G11290.13.2e-11037.12Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G49170.11.6e-10937.99Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q8VYH0|PP313_ARATH5.0e-18557.50Pentatricopeptide repeat-containing protein At4g15720 OS=Arabidopsis thaliana OX... [more]
sp|Q9STF3|PP265_ARATH1.8e-11036.87Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
sp|Q9LW63|PP251_ARATH6.9e-11037.69Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
sp|Q3E6Q1|PPR32_ARATH5.9e-10937.12Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
sp|Q5G1T1|PP272_ARATH2.9e-10837.99Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0KEC7|A0A0A0KEC7_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G197240 PE=4 SV=1[more]
tr|A0A1S3C8I5|A0A1S3C8I5_CUCME0.0e+0096.52pentatricopeptide repeat-containing protein At4g15720 OS=Cucumis melo OX=3656 GN... [more]
tr|M5XKX1|M5XKX1_PRUPE5.3e-23266.28Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G138900 PE=4 SV=1[more]
tr|A0A2C9UBS1|A0A2C9UBS1_MANES7.7e-23165.51Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_15G014100 PE=4 SV=... [more]
tr|A0A067KYD1|A0A067KYD1_JATCU1.3e-23066.28Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_01996 PE=4 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR032867DYW_dom
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_6G017310.1CsaV3_6G017310.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 161..192
e-value: 4.6E-5
score: 21.3
coord: 192..218
e-value: 8.1E-4
score: 17.4
coord: 91..124
e-value: 1.3E-5
score: 23.1
coord: 296..327
e-value: 1.5E-4
score: 19.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 366..390
e-value: 0.3
score: 11.3
coord: 296..324
e-value: 6.5E-4
score: 19.7
coord: 192..217
e-value: 4.3E-5
score: 23.4
coord: 161..191
e-value: 6.6E-4
score: 19.7
coord: 435..463
e-value: 0.0024
score: 17.9
coord: 63..86
e-value: 0.087
score: 13.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 88..135
e-value: 4.7E-11
score: 42.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 89..123
score: 11.115
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 58..88
score: 7.267
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 327..362
score: 6.412
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 363..393
score: 6.73
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 261..291
score: 6.062
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 431..465
score: 8.199
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 292..326
score: 10.358
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 190..224
score: 9.01
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 226..260
score: 6.106
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 159..189
score: 7.706
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 268..534
e-value: 8.6E-32
score: 112.7
coord: 17..148
e-value: 9.9E-20
score: 73.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 149..224
e-value: 5.4E-7
score: 31.3
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 191..231
coord: 371..453
coord: 27..123
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 467..593
e-value: 2.8E-30
score: 104.7
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 28..515
NoneNo IPR availablePANTHERPTHR24015:SF117SUBFAMILY NOT NAMEDcoord: 28..515

The following gene(s) are paralogous to this gene:

None