CsaV3_3G035640 (gene) Cucumber (Chinese Long) v3

NameCsaV3_3G035640
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr3 : 29819010 .. 29821404 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCATTTATAAGAACCTTCAGTTCTATATATTTTTTAACTTTAGCGTCAGATGCTATTGTTTTCTTCAACCCTTTTAGTCCAACTTCGTCAATAAAGAACTTCAAACCTCAACTGCATTTAGCACTCTCCCATATTCGACTCAATACCCAGCTTGCCTTTTCTCCCTGCCCCCTTACTGTACATTATCCTCATGATGATAAGATAATCTCTCTCTGCAAGAAAAACTTGCACAGAGAAGCTCTTAAAGCATTTGACATCTTTCAAAAGTGTTCAAGTTCTCCATTGAAGTCTGTCACCTATACCCATCTAATCAACGCATGCTCTTCTCTAAGATCCCTAGAGCATGGCAGGAAAATTCATCGCCATATGTTGACATGCAACTACCAGCCTGATATGATACTTCAGAATCATATTCTTAGTATGTATGGAAAATGTGGGTCTTTGAAGGAAGCAAGAAATATGTTTGATTCAATGCCTCTGAAGAATGTAGTATCTTGGACCTCCATGATATCTGGGTACTCACGTTATGGTGAAGAGGATAATGCCATTACATTGTATGTTCAAATGTTACGATCGGGACACATTCCTGATCACTTTACATTTGGAAGTATTGTAAAATCTTGCTCTGGACTTGATGACTTTAAGTTAGCAAGGCAACTGCATGCTCACGTTCTGAAGTCTGAATTTGGCGCTGACCTAATTGCACAGAATGCTCTAATCTCAATGTATACCAAGTTCAGTCAAATGGCTGATGCAATAAATGTCTTCTCTCGTATTATTATAAAGGATCTAATTTCCTGGGGCTCAATGATTGCAGGGTTTTCTCAGCTTGGCTATGAGCTTGAAGCTTTATGTCATTTTAGGGAAATGCTTTCTCAGTCTGTCTATCAGCCAAATGAGTTTGTCTTCGGTAGCGCCTTCAGTGCTTGTAGCAAGCTCTTAGAACCAGATTGTGGACGCCAAATACATGGCTTGTGTATAAAATTTGGGTTAGGAAGTGATCTTTTTGCTGGATGCTCCCTTTGTGACATGTATGCGAAGTGTGGGTTTTTAGAATCTGCAAGAACAGTATTTTATCATATTGAAAAGCCTGATCTAGTAGCCTGGAATGCTATTATTGCCGGATTTGCTAGTGTCAGTAATGCAAAGGAATCCTCGTCATTCTTTTCACAAATGAGGCATACAGGACTTGTTCCAAATGATGTCACTGTTCTCTCTTTACTTTGTGCTTGTTCGGAGCCTGTGATGCTTAATCATGGAATACAGGTTCACTCCTACATTGTCAAGATGGGTTTCAATTTAGATATTCCTGTGTGTAACAGTTTGCTCAGCATGTATTCGAAGTGCTCAAATTTGAATGATGCACTTCAAGTATTTGAAGATATAGGAAACAAAGCTGATATAGTTTCTTGGAACACCTTGCTTACAGCATGTCTCCAGCAGAACCAAGCTGGAGAGGTTTTAAGATTAACAAAGCTAATGTTTGCTTCTCGCATTAAGCCTGACCATGTTACTTTAACTAATGTGTTGGTGTCCTCTGGACAAATAGCATCTTATGAAGTGGGAAGTCAAATTCATTGTTTTATCATGAAATCAGGACTGAATCTTGATATTTCTGTTTCTAATGCGTTAATCAACATGTATACGAAGTGTGGATCCCTTGAATGTGCTCGAAAGATGTTTGATTCCATTGGCAATCCTGATATCATTTCATGGAGTAGCTTGATTGTTGGATATGCACAAGCTGGATGCGGCAAGGAGGCTTTTGAGCTTTTCAGAACCATGAGAGGCCTCGGTGTAAAGCCAAATGAAATTACATTTGTAGGAATTCTTACTGCTTGTAGTCATATTGGAATGGTAGAAGAAGGTTTGAAGTTATACAGGACAATGCAAGAGGATTATCGCATTTCACCAACCAAAGAACACTGTTCATGTATGGTCGACTTGCTCGCTCGTGCTGGATGCTTGGATGTAGCAGAGGACTTCATTAAGCAGATGCCTTTCGTTCCTGATGTTGTAGTCTGGAAGACTCTGCTAGCAGCATGTAAAGTCCATGGCAATCTTGAGGTTGGCAAGAGGGCTGCAGAGAATGTATTAAAAATTGATCCATCAAACTCCGCCGCAGTCGTAATGCTTTGTAACATACATGCTTCTTCCGGGCATTGGAAAGATTTCGCTCGACTTAGGAGTTCAATGAGACGAATGGATGTGGGCAAAGTTCCAGGTCAGAGCTGGATTGAGATCAAGGATAAAGTTCATGTGTTTCTTGCAGAAGATAACTTGCATCCTGAGAGAGGTAAAATTTACACGATGCTGGAAGAGTTGATGTTGCAAATTTTAGATGATGGTTGTGATCCATTACAGATGGTGTCTTGATTGGGTTACAAAGTAG

mRNA sequence

ATGGCATTTATAAGAACCTTCAGTTCTATATATTTTTTAACTTTAGCGTCAGATGCTATTGTTTTCTTCAACCCTTTTAGTCCAACTTCGTCAATAAAGAACTTCAAACCTCAACTGCATTTAGCACTCTCCCATATTCGACTCAATACCCAGCTTGCCTTTTCTCCCTGCCCCCTTACTGTACATTATCCTCATGATGATAAGATAATCTCTCTCTGCAAGAAAAACTTGCACAGAGAAGCTCTTAAAGCATTTGACATCTTTCAAAAGTGTTCAAGTTCTCCATTGAAGTCTGTCACCTATACCCATCTAATCAACGCATGCTCTTCTCTAAGATCCCTAGAGCATGGCAGGAAAATTCATCGCCATATGTTGACATGCAACTACCAGCCTGATATGATACTTCAGAATCATATTCTTAGTATGTATGGAAAATGTGGGTCTTTGAAGGAAGCAAGAAATATGTTTGATTCAATGCCTCTGAAGAATGTAGTATCTTGGACCTCCATGATATCTGGGTACTCACGTTATGGTGAAGAGGATAATGCCATTACATTGTATGTTCAAATGTTACGATCGGGACACATTCCTGATCACTTTACATTTGGAAGTATTGTAAAATCTTGCTCTGGACTTGATGACTTTAAGTTAGCAAGGCAACTGCATGCTCACGTTCTGAAGTCTGAATTTGGCGCTGACCTAATTGCACAGAATGCTCTAATCTCAATGTATACCAAGTTCAGTCAAATGGCTGATGCAATAAATGTCTTCTCTCGTATTATTATAAAGGATCTAATTTCCTGGGGCTCAATGATTGCAGGGTTTTCTCAGCTTGGCTATGAGCTTGAAGCTTTATGTCATTTTAGGGAAATGCTTTCTCAGTCTGTCTATCAGCCAAATGAGTTTGTCTTCGGTAGCGCCTTCAGTGCTTGTAGCAAGCTCTTAGAACCAGATTGTGGACGCCAAATACATGGCTTGTGTATAAAATTTGGGTTAGGAAGTGATCTTTTTGCTGGATGCTCCCTTTGTGACATGTATGCGAAGTGTGGGTTTTTAGAATCTGCAAGAACAGTATTTTATCATATTGAAAAGCCTGATCTAGTAGCCTGGAATGCTATTATTGCCGGATTTGCTAGTGTCAGTAATGCAAAGGAATCCTCGTCATTCTTTTCACAAATGAGGCATACAGGACTTGTTCCAAATGATGTCACTGTTCTCTCTTTACTTTGTGCTTGTTCGGAGCCTGTGATGCTTAATCATGGAATACAGGTTCACTCCTACATTGTCAAGATGGGTTTCAATTTAGATATTCCTGTGTGTAACAGTTTGCTCAGCATGTATTCGAAGTGCTCAAATTTGAATGATGCACTTCAAGTATTTGAAGATATAGGAAACAAAGCTGATATAGTTTCTTGGAACACCTTGCTTACAGCATGTCTCCAGCAGAACCAAGCTGGAGAGATGATGGTTGTGATCCATTACAGATGGTGTCTTGATTGGGTTACAAAGTAG

Coding sequence (CDS)

ATGGCATTTATAAGAACCTTCAGTTCTATATATTTTTTAACTTTAGCGTCAGATGCTATTGTTTTCTTCAACCCTTTTAGTCCAACTTCGTCAATAAAGAACTTCAAACCTCAACTGCATTTAGCACTCTCCCATATTCGACTCAATACCCAGCTTGCCTTTTCTCCCTGCCCCCTTACTGTACATTATCCTCATGATGATAAGATAATCTCTCTCTGCAAGAAAAACTTGCACAGAGAAGCTCTTAAAGCATTTGACATCTTTCAAAAGTGTTCAAGTTCTCCATTGAAGTCTGTCACCTATACCCATCTAATCAACGCATGCTCTTCTCTAAGATCCCTAGAGCATGGCAGGAAAATTCATCGCCATATGTTGACATGCAACTACCAGCCTGATATGATACTTCAGAATCATATTCTTAGTATGTATGGAAAATGTGGGTCTTTGAAGGAAGCAAGAAATATGTTTGATTCAATGCCTCTGAAGAATGTAGTATCTTGGACCTCCATGATATCTGGGTACTCACGTTATGGTGAAGAGGATAATGCCATTACATTGTATGTTCAAATGTTACGATCGGGACACATTCCTGATCACTTTACATTTGGAAGTATTGTAAAATCTTGCTCTGGACTTGATGACTTTAAGTTAGCAAGGCAACTGCATGCTCACGTTCTGAAGTCTGAATTTGGCGCTGACCTAATTGCACAGAATGCTCTAATCTCAATGTATACCAAGTTCAGTCAAATGGCTGATGCAATAAATGTCTTCTCTCGTATTATTATAAAGGATCTAATTTCCTGGGGCTCAATGATTGCAGGGTTTTCTCAGCTTGGCTATGAGCTTGAAGCTTTATGTCATTTTAGGGAAATGCTTTCTCAGTCTGTCTATCAGCCAAATGAGTTTGTCTTCGGTAGCGCCTTCAGTGCTTGTAGCAAGCTCTTAGAACCAGATTGTGGACGCCAAATACATGGCTTGTGTATAAAATTTGGGTTAGGAAGTGATCTTTTTGCTGGATGCTCCCTTTGTGACATGTATGCGAAGTGTGGGTTTTTAGAATCTGCAAGAACAGTATTTTATCATATTGAAAAGCCTGATCTAGTAGCCTGGAATGCTATTATTGCCGGATTTGCTAGTGTCAGTAATGCAAAGGAATCCTCGTCATTCTTTTCACAAATGAGGCATACAGGACTTGTTCCAAATGATGTCACTGTTCTCTCTTTACTTTGTGCTTGTTCGGAGCCTGTGATGCTTAATCATGGAATACAGGTTCACTCCTACATTGTCAAGATGGGTTTCAATTTAGATATTCCTGTGTGTAACAGTTTGCTCAGCATGTATTCGAAGTGCTCAAATTTGAATGATGCACTTCAAGTATTTGAAGATATAGGAAACAAAGCTGATATAGTTTCTTGGAACACCTTGCTTACAGCATGTCTCCAGCAGAACCAAGCTGGAGAGATGATGGTTGTGATCCATTACAGATGGTGTCTTGATTGGGTTACAAAGTAG

Protein sequence

MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLTVHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACLQQNQAGEMMVVIHYRWCLDWVTK
BLAST of CsaV3_3G035640 vs. NCBI nr
Match: XP_004137966.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucumis sativus] >KGN58944.1 hypothetical protein Csa_3G736910 [Cucumis sativus])

HSP 1 Score: 993.4 bits (2567), Expect = 2.8e-286
Identity = 487/489 (99.59%), Postives = 489/489 (100.00%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT
Sbjct: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
           VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI
Sbjct: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL
Sbjct: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
           HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL
Sbjct: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480

Query: 481 QQNQAGEMM 490
           QQNQAGE++
Sbjct: 481 QQNQAGEVL 489

BLAST of CsaV3_3G035640 vs. NCBI nr
Match: XP_008442662.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucumis melo])

HSP 1 Score: 943.3 bits (2437), Expect = 3.3e-271
Identity = 459/489 (93.87%), Postives = 477/489 (97.55%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MA IRTF+SIYFLTLASDAIVFFNPFSP++S KNFKPQLHLA SHIRL TQLAFSPCP+T
Sbjct: 1   MAPIRTFNSIYFLTLASDAIVFFNPFSPSTSTKNFKPQLHLAHSHIRLGTQLAFSPCPVT 60

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
           VHYPHDDKIISLCKKNLHREAL+AFDIF+KCSSSPLKS+TYTHLINACSSLRSLEHGRKI
Sbjct: 61  VHYPHDDKIISLCKKNLHREALQAFDIFRKCSSSPLKSITYTHLINACSSLRSLEHGRKI 120

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARN+FD+MPLKNVVSWTSMISGYSRYG+E
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSRYGQE 180

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLY+QMLRSG+IPDHFTFGSIVKSCSGLDDF LARQLHAHVLK EFG  LIAQNAL
Sbjct: 181 DNAITLYIQMLRSGYIPDHFTFGSIVKSCSGLDDFMLARQLHAHVLKFEFGGHLIAQNAL 240

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACSKLLEPDCGRQIHGLCIK GLGSD+FAGCSLCDMYAKCGFLESARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKLGLGSDIFAGCSLCDMYAKCGFLESARTVFY 360

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
           HIEKPDLVAWNAIIAGFASVSNAKES SFFSQMRH G+VPNDVTVLSLLCACSEPVMLN+
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESLSFFSQMRHRGVVPNDVTVLSLLCACSEPVMLNN 420

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           G+QVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFE IGNKADIVSWNTLLTACL
Sbjct: 421 GMQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEGIGNKADIVSWNTLLTACL 480

Query: 481 QQNQAGEMM 490
           QQNQAGE++
Sbjct: 481 QQNQAGEVL 489

BLAST of CsaV3_3G035640 vs. NCBI nr
Match: XP_023526527.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023526528.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023526529.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 842.0 bits (2174), Expect = 1.0e-240
Identity = 407/489 (83.23%), Postives = 446/489 (91.21%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MAFIRT  SIYF+T +S+AIVFFNP SP +SIKNFKPQLHLALSHIRLN+Q+AFSP P+ 
Sbjct: 29  MAFIRTLHSIYFVTYSSEAIVFFNPCSPANSIKNFKPQLHLALSHIRLNSQIAFSPSPVA 88

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
            H  +DD IISLCKK LHREAL+AFDIFQKCS+SPL S+TYTHLI+ACSSLRSLEHGRKI
Sbjct: 89  EH-SYDDNIISLCKKKLHREALQAFDIFQKCSNSPLNSITYTHLIHACSSLRSLEHGRKI 148

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           H HM T NYQPD+ILQNHIL+MYGKCGSLKEARN+FD+MPLKN VSWTSMISGYS YG++
Sbjct: 149 HCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGQD 208

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLYVQMLRSGHIPDHFTFGS+VKSCSGLDD  LARQLHAHVLKSEFG + IAQNAL
Sbjct: 209 DNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNAL 268

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQ+ADA NVFS II K+LISWGSMIAGFSQLGYE+EALCHFREMLSQ +YQPN
Sbjct: 269 ISMYTKFSQIADATNVFSHIITKNLISWGSMIAGFSQLGYEIEALCHFREMLSQPIYQPN 328

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACS L EP+CGRQIHGLCIKFGLG D FAGCSLCDMYAKCGFL SARTVF 
Sbjct: 329 EFVFGSAFSACSNLSEPNCGRQIHGLCIKFGLGRDRFAGCSLCDMYAKCGFLGSARTVFC 388

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
            IEKPDLVAWNAIIAGFASV +AKES SFFSQMRHTGL  NDVTVLSLLCACSEP+MLN 
Sbjct: 389 QIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSEPMMLNQ 448

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           G+QVHSYIVK GF+L++PVCN LLSMYSKCS+LND+L++FEDIGNKAD+VSWNT+LT C 
Sbjct: 449 GMQVHSYIVKTGFDLEVPVCNGLLSMYSKCSDLNDSLKIFEDIGNKADVVSWNTMLTVCR 508

Query: 481 QQNQAGEMM 490
            QNQAGE++
Sbjct: 509 LQNQAGEVL 516

BLAST of CsaV3_3G035640 vs. NCBI nr
Match: XP_022934158.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita moschata] >XP_022934159.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita moschata] >XP_022934160.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 835.1 bits (2156), Expect = 1.3e-238
Identity = 406/489 (83.03%), Postives = 443/489 (90.59%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MAFIRT  SIYF+TL+S+AIVFFNP SP +SIKNFKPQL LALSHIR ++ LA  P    
Sbjct: 29  MAFIRTLHSIYFVTLSSEAIVFFNPCSPATSIKNFKPQLRLALSHIRFSSLLASPPPSPV 88

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
             + +DD IISLCKK LHREAL+AFDIFQKCSSSPL S+TYTHLI+ACSSLRSLEHGRKI
Sbjct: 89  TEHSYDDNIISLCKKKLHREALQAFDIFQKCSSSPLNSITYTHLIHACSSLRSLEHGRKI 148

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           H HM T NYQPD+ILQNHIL+MYGKCGSLKEARN+FD+MPLKN VSWTSMISGYS YG++
Sbjct: 149 HCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGQD 208

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLYVQMLRSGHIPDHFTFGS+VKSCSGLDD  LARQLHAHVLKSEFG + IAQNAL
Sbjct: 209 DNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNAL 268

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQ+ADA NVFS IIIKDLISWGSMIAGFSQLGYE+EALCHFREMLSQ++YQPN
Sbjct: 269 ISMYTKFSQIADATNVFSHIIIKDLISWGSMIAGFSQLGYEIEALCHFREMLSQAIYQPN 328

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACS L EP+CGRQIHGLCIKFGLGSD FAGCSLCDMYAKCGFL SARTVF 
Sbjct: 329 EFVFGSAFSACSSLSEPNCGRQIHGLCIKFGLGSDRFAGCSLCDMYAKCGFLGSARTVFC 388

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
            IEKPDLVAWNAIIAGFASV +AKES SFFSQMRHTGL  NDVTVLSLLCACS+P+MLN 
Sbjct: 389 QIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSDPMMLNQ 448

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           G+QVHSYIVK GF+L++PVCN LLSMYSKCS LND+L++FEDIGNKADIVSWNT+LTAC 
Sbjct: 449 GMQVHSYIVKTGFDLEVPVCNGLLSMYSKCSVLNDSLKIFEDIGNKADIVSWNTMLTACR 508

Query: 481 QQNQAGEMM 490
            QNQAGE++
Sbjct: 509 LQNQAGEVL 517

BLAST of CsaV3_3G035640 vs. NCBI nr
Match: XP_022983903.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita maxima] >XP_022983904.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita maxima] >XP_022983905.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 833.2 bits (2151), Expect = 4.8e-238
Identity = 406/489 (83.03%), Postives = 443/489 (90.59%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MAFIRT +SIYF+TL+S+AIVFFNP SP +SIKNFKPQLHLALSHIR ++ LA  P P+T
Sbjct: 29  MAFIRTLNSIYFVTLSSEAIVFFNPCSPATSIKNFKPQLHLALSHIRFSSLLASPPSPVT 88

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
            H  HDD IISLCKK LHREAL+AFDIF KCSSSPL S+TYTHLI+ACSSLR LEHGRKI
Sbjct: 89  EH-SHDDNIISLCKKKLHREALQAFDIFHKCSSSPLNSITYTHLIHACSSLRFLEHGRKI 148

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           H HM T NYQPD+ILQNHIL+MYGKCGSLKEARN+FD+MPLKN VSWTSMISGYS YGE+
Sbjct: 149 HCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGED 208

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLYVQMLRSGHIPDHFTFGS+VKSCSGLDD  LARQLHAHVLKSEFG + IAQNAL
Sbjct: 209 DNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNAL 268

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQ+ADA NVFS II KDLISWGSMIAGFSQLG E+EALCHFREMLSQ +YQPN
Sbjct: 269 ISMYTKFSQIADATNVFSHIITKDLISWGSMIAGFSQLGCEIEALCHFREMLSQPIYQPN 328

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACS L EP+CGRQIHGLCIKFGLGSD FAGCSLCDMYAKCGFL SARTVF 
Sbjct: 329 EFVFGSAFSACSNLSEPNCGRQIHGLCIKFGLGSDRFAGCSLCDMYAKCGFLGSARTVFC 388

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
            IEKPDLVAWNAIIAGFASV +AKES SFFSQMRHTGL  NDVTVLSLLCACSEP+MLN 
Sbjct: 389 QIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSEPMMLNQ 448

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           G+QVHSYIVK GF+L++ VCN LLSMYSKCS+LND+L++FEDIGNKAD+VSWNT+LTAC 
Sbjct: 449 GMQVHSYIVKTGFDLEVLVCNGLLSMYSKCSDLNDSLKIFEDIGNKADVVSWNTMLTACR 508

Query: 481 QQNQAGEMM 490
            +NQAGE++
Sbjct: 509 LRNQAGEVL 516

BLAST of CsaV3_3G035640 vs. TAIR10
Match: AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 503.1 bits (1294), Expect = 2.1e-142
Identity = 247/424 (58.25%), Postives = 312/424 (73.58%), Query Frame = 0

Query: 66  DDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKIHRHML 125
           +D I SLCK N +REAL+AFD  QK SS  ++  TY  LI ACSS RSL  GRKIH H+L
Sbjct: 35  NDHINSLCKSNFYREALEAFDFAQKNSSFKIRLRTYISLICACSSSRSLAQGRKIHDHIL 94

Query: 126 TCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAIT 185
             N + D IL NHILSMYGKCGSL++AR +FD MP +N+VS+TS+I+GYS+ G+   AI 
Sbjct: 95  NSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMPERNLVSYTSVITGYSQNGQGAEAIR 154

Query: 186 LYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYT 245
           LY++ML+   +PD F FGSI+K+C+   D  L +QLHA V+K E  + LIAQNALI+MY 
Sbjct: 155 LYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYV 214

Query: 246 KFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFG 305
           +F+QM+DA  VF  I +KDLISW S+IAGFSQLG+E EAL H +EMLS  V+ PNE++FG
Sbjct: 215 RFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFG 274

Query: 306 SAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKP 365
           S+  ACS LL PD G QIHGLCIK  L  +  AGCSLCDMYA+CGFL SAR VF  IE+P
Sbjct: 275 SSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERP 334

Query: 366 DLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVH 425
           D  +WN IIAG A+   A E+ S FSQMR +G +P+ +++ SLLCA ++P+ L+ G+Q+H
Sbjct: 335 DTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIH 394

Query: 426 SYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACLQQNQA 485
           SYI+K GF  D+ VCNSLL+MY+ CS+L     +FED  N AD VSWNT+LTACLQ  Q 
Sbjct: 395 SYIIKWGFLADLTVCNSLLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQP 454

Query: 486 GEMM 490
            EM+
Sbjct: 455 VEML 458

BLAST of CsaV3_3G035640 vs. TAIR10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 246.1 bits (627), Expect = 4.5e-65
Identity = 131/396 (33.08%), Postives = 222/396 (56.06%), Query Frame = 0

Query: 100 TYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSM 159
           T+  ++  C  +  L  G+++H H++   Y+ D+ + N +++MY KCG +K AR +FD M
Sbjct: 198 TFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRM 257

Query: 160 PLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLAR 219
           P ++++SW +MISGY   G     + L+  M      PD  T  S++ +C  L D +L R
Sbjct: 258 PRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRLGR 317

Query: 220 QLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLG 279
            +HA+V+ + F  D+   N+L  MY       +A  +FSR+  KD++SW +MI+G+    
Sbjct: 318 DIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNF 377

Query: 280 YELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAG 339
              +A+  +R M+ Q   +P+E    +  SAC+ L + D G ++H L IK  L S +   
Sbjct: 378 LPDKAIDTYR-MMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVA 437

Query: 340 CSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLV 399
            +L +MY+KC  ++ A  +F++I + ++++W +IIAG    +   E+  F  QM+ T L 
Sbjct: 438 NNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMT-LQ 497

Query: 400 PNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQV 459
           PN +T+ + L AC+    L  G ++H+++++ G  LD  + N+LL MY +C  +N A   
Sbjct: 498 PNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQ 557

Query: 460 FEDIGNKADIVSWNTLLTACLQQNQAGEMMVVIHYR 496
           F     K D+ SWN LLT   ++ Q G M+V +  R
Sbjct: 558 FN--SQKKDVTSWNILLTGYSERGQ-GSMVVELFDR 588

BLAST of CsaV3_3G035640 vs. TAIR10
Match: AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 242.3 bits (617), Expect = 6.5e-64
Identity = 128/378 (33.86%), Postives = 215/378 (56.88%), Query Frame = 0

Query: 104 LINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKN 163
           +++ACS L  LE G++IH H+L    + D  L N ++  Y KCG +  A  +F+ MP KN
Sbjct: 255 VLSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKN 314

Query: 164 VVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHA 223
           ++SWT+++SGY +      A+ L+  M + G  PD +   SI+ SC+ L       Q+HA
Sbjct: 315 IISWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCASLHALGFGTQVHA 374

Query: 224 HVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLG--YE 283
           + +K+  G D    N+LI MY K   + DA  VF      D++ + +MI G+S+LG  +E
Sbjct: 375 YTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGYSRLGTQWE 434

Query: 284 L-EALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGC 343
           L EAL  FR+M  + + +P+   F S   A + L      +QIHGL  K+GL  D+FAG 
Sbjct: 435 LHEALNIFRDMRFRLI-RPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLNLDIFAGS 494

Query: 344 SLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVP 403
           +L D+Y+ C  L+ +R VF  ++  DLV WN++ AG+   S  +E+ + F +++ +   P
Sbjct: 495 ALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLELQLSRERP 554

Query: 404 NDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVF 463
           ++ T  +++ A      +  G + H  ++K G   +  + N+LL MY+KC +  DA + F
Sbjct: 555 DEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSPEDAHKAF 614

Query: 464 EDIGNKADIVSWNTLLTA 479
           +   ++ D+V WN+++++
Sbjct: 615 DSAASR-DVVCWNSVISS 630

BLAST of CsaV3_3G035640 vs. TAIR10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 239.6 bits (610), Expect = 4.2e-63
Identity = 150/471 (31.85%), Postives = 241/471 (51.17%), Query Frame = 0

Query: 27  SPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLTVHYPHDDKIISLCKKNLHREALKAFD 86
           S  S IK F+P + L  S+   +  L  S            +   LC +     A+KA D
Sbjct: 3   SVMSKIKLFRPVVTLRCSYSSTDQTLLLS------------EFTRLCYQRDLPRAMKAMD 62

Query: 87  IFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKC 146
             Q        S TY+ LI  C S R++  G  I RH+    ++P M L N +++MY K 
Sbjct: 63  SLQS-HGLWADSATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKF 122

Query: 147 GSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIV 206
             L +A  +FD MP +NV+SWT+MIS YS+      A+ L V MLR    P+ +T+ S++
Sbjct: 123 NLLNDAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVL 182

Query: 207 KSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLI 266
           +SC+G+ D    R LH  ++K    +D+  ++ALI ++ K  +  DA++VF  ++  D I
Sbjct: 183 RSCNGMSD---VRMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAI 242

Query: 267 SWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGL 326
            W S+I GF+Q      AL  F+ M  ++ +   +    S   AC+ L   + G Q H  
Sbjct: 243 VWNSIIGGFAQNSRSDVALELFKRM-KRAGFIAEQATLTSVLRACTGLALLELGMQAHVH 302

Query: 327 CIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKES 386
            +K+    DL    +L DMY KCG LE A  VF  +++ D++ W+ +I+G A    ++E+
Sbjct: 303 IVKY--DQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEA 362

Query: 387 SSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPV---CNSL 446
              F +M+ +G  PN +T++ +L ACS   +L  G      + K+ + +D PV      +
Sbjct: 363 LKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKL-YGID-PVREHYGCM 422

Query: 447 LSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACLQQNQAGEMMVVIHY 495
           + +  K   L+DA+++  ++  + D V+W TLL AC  Q      MV+  Y
Sbjct: 423 IDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRN----MVLAEY 448

BLAST of CsaV3_3G035640 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 238.8 bits (608), Expect = 7.2e-63
Identity = 127/398 (31.91%), Postives = 223/398 (56.03%), Query Frame = 0

Query: 83  KAFDIFQKCSSSPLK--SVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHIL 142
           KA ++F++     L+  S T   L+ ACS+  +L  G+++H +     +  +  ++  +L
Sbjct: 372 KAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALL 431

Query: 143 SMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHF 202
           ++Y KC  ++ A + F    ++NVV W  M+  Y    +  N+  ++ QM     +P+ +
Sbjct: 432 NLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQY 491

Query: 203 TFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRI 262
           T+ SI+K+C  L D +L  Q+H+ ++K+ F  +    + LI MY K  ++  A ++  R 
Sbjct: 492 TYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRF 551

Query: 263 IIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCG 322
             KD++SW +MIAG++Q  ++ +AL  FR+ML + + + +E    +A SAC+ L     G
Sbjct: 552 AGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGI-RSDEVGLTNAVSACAGLQALKEG 611

Query: 323 RQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASV 382
           +QIH      G  SDL    +L  +Y++CG +E +   F   E  D +AWNA+++GF   
Sbjct: 612 QQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQS 671

Query: 383 SNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVC 442
            N +E+   F +M   G+  N+ T  S + A SE   +  G QVH+ I K G++ +  VC
Sbjct: 672 GNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVC 731

Query: 443 NSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTA 479
           N+L+SMY+KC +++DA + F ++  K + VSWN ++ A
Sbjct: 732 NALISMYAKCGSISDAEKQFLEVSTKNE-VSWNAIINA 767

BLAST of CsaV3_3G035640 vs. Swiss-Prot
Match: sp|Q9LFI1|PP280_ARATH (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.7e-141
Identity = 247/424 (58.25%), Postives = 312/424 (73.58%), Query Frame = 0

Query: 66  DDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKIHRHML 125
           +D I SLCK N +REAL+AFD  QK SS  ++  TY  LI ACSS RSL  GRKIH H+L
Sbjct: 35  NDHINSLCKSNFYREALEAFDFAQKNSSFKIRLRTYISLICACSSSRSLAQGRKIHDHIL 94

Query: 126 TCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAIT 185
             N + D IL NHILSMYGKCGSL++AR +FD MP +N+VS+TS+I+GYS+ G+   AI 
Sbjct: 95  NSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMPERNLVSYTSVITGYSQNGQGAEAIR 154

Query: 186 LYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYT 245
           LY++ML+   +PD F FGSI+K+C+   D  L +QLHA V+K E  + LIAQNALI+MY 
Sbjct: 155 LYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYV 214

Query: 246 KFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFG 305
           +F+QM+DA  VF  I +KDLISW S+IAGFSQLG+E EAL H +EMLS  V+ PNE++FG
Sbjct: 215 RFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFG 274

Query: 306 SAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKP 365
           S+  ACS LL PD G QIHGLCIK  L  +  AGCSLCDMYA+CGFL SAR VF  IE+P
Sbjct: 275 SSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERP 334

Query: 366 DLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVH 425
           D  +WN IIAG A+   A E+ S FSQMR +G +P+ +++ SLLCA ++P+ L+ G+Q+H
Sbjct: 335 DTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIH 394

Query: 426 SYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACLQQNQA 485
           SYI+K GF  D+ VCNSLL+MY+ CS+L     +FED  N AD VSWNT+LTACLQ  Q 
Sbjct: 395 SYIIKWGFLADLTVCNSLLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQP 454

Query: 486 GEMM 490
            EM+
Sbjct: 455 VEML 458

BLAST of CsaV3_3G035640 vs. Swiss-Prot
Match: sp|Q9M9E2|PPR45_ARATH (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 8.2e-64
Identity = 131/396 (33.08%), Postives = 222/396 (56.06%), Query Frame = 0

Query: 100 TYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSM 159
           T+  ++  C  +  L  G+++H H++   Y+ D+ + N +++MY KCG +K AR +FD M
Sbjct: 198 TFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRM 257

Query: 160 PLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLAR 219
           P ++++SW +MISGY   G     + L+  M      PD  T  S++ +C  L D +L R
Sbjct: 258 PRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRLGR 317

Query: 220 QLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLG 279
            +HA+V+ + F  D+   N+L  MY       +A  +FSR+  KD++SW +MI+G+    
Sbjct: 318 DIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNF 377

Query: 280 YELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAG 339
              +A+  +R M+ Q   +P+E    +  SAC+ L + D G ++H L IK  L S +   
Sbjct: 378 LPDKAIDTYR-MMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVA 437

Query: 340 CSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLV 399
            +L +MY+KC  ++ A  +F++I + ++++W +IIAG    +   E+  F  QM+ T L 
Sbjct: 438 NNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMT-LQ 497

Query: 400 PNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQV 459
           PN +T+ + L AC+    L  G ++H+++++ G  LD  + N+LL MY +C  +N A   
Sbjct: 498 PNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQ 557

Query: 460 FEDIGNKADIVSWNTLLTACLQQNQAGEMMVVIHYR 496
           F     K D+ SWN LLT   ++ Q G M+V +  R
Sbjct: 558 FN--SQKKDVTSWNILLTGYSERGQ-GSMVVELFDR 588

BLAST of CsaV3_3G035640 vs. Swiss-Prot
Match: sp|Q9SVA5|PP357_ARATH (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 1.2e-62
Identity = 128/378 (33.86%), Postives = 215/378 (56.88%), Query Frame = 0

Query: 104 LINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKN 163
           +++ACS L  LE G++IH H+L    + D  L N ++  Y KCG +  A  +F+ MP KN
Sbjct: 255 VLSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKN 314

Query: 164 VVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHA 223
           ++SWT+++SGY +      A+ L+  M + G  PD +   SI+ SC+ L       Q+HA
Sbjct: 315 IISWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCASLHALGFGTQVHA 374

Query: 224 HVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLG--YE 283
           + +K+  G D    N+LI MY K   + DA  VF      D++ + +MI G+S+LG  +E
Sbjct: 375 YTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNAMIEGYSRLGTQWE 434

Query: 284 L-EALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGC 343
           L EAL  FR+M  + + +P+   F S   A + L      +QIHGL  K+GL  D+FAG 
Sbjct: 435 LHEALNIFRDMRFRLI-RPSLLTFVSLLRASASLTSLGLSKQIHGLMFKYGLNLDIFAGS 494

Query: 344 SLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVP 403
           +L D+Y+ C  L+ +R VF  ++  DLV WN++ AG+   S  +E+ + F +++ +   P
Sbjct: 495 ALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALNLFLELQLSRERP 554

Query: 404 NDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVF 463
           ++ T  +++ A      +  G + H  ++K G   +  + N+LL MY+KC +  DA + F
Sbjct: 555 DEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPYITNALLDMYAKCGSPEDAHKAF 614

Query: 464 EDIGNKADIVSWNTLLTA 479
           +   ++ D+V WN+++++
Sbjct: 615 DSAASR-DVVCWNSVISS 630

BLAST of CsaV3_3G035640 vs. Swiss-Prot
Match: sp|Q9SI53|PP147_ARATH (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 7.6e-62
Identity = 150/471 (31.85%), Postives = 241/471 (51.17%), Query Frame = 0

Query: 27  SPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLTVHYPHDDKIISLCKKNLHREALKAFD 86
           S  S IK F+P + L  S+   +  L  S            +   LC +     A+KA D
Sbjct: 3   SVMSKIKLFRPVVTLRCSYSSTDQTLLLS------------EFTRLCYQRDLPRAMKAMD 62

Query: 87  IFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKC 146
             Q        S TY+ LI  C S R++  G  I RH+    ++P M L N +++MY K 
Sbjct: 63  SLQS-HGLWADSATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKF 122

Query: 147 GSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIV 206
             L +A  +FD MP +NV+SWT+MIS YS+      A+ L V MLR    P+ +T+ S++
Sbjct: 123 NLLNDAHQLFDQMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVL 182

Query: 207 KSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLI 266
           +SC+G+ D    R LH  ++K    +D+  ++ALI ++ K  +  DA++VF  ++  D I
Sbjct: 183 RSCNGMSD---VRMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAI 242

Query: 267 SWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGL 326
            W S+I GF+Q      AL  F+ M  ++ +   +    S   AC+ L   + G Q H  
Sbjct: 243 VWNSIIGGFAQNSRSDVALELFKRM-KRAGFIAEQATLTSVLRACTGLALLELGMQAHVH 302

Query: 327 CIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKES 386
            +K+    DL    +L DMY KCG LE A  VF  +++ D++ W+ +I+G A    ++E+
Sbjct: 303 IVKY--DQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEA 362

Query: 387 SSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPV---CNSL 446
              F +M+ +G  PN +T++ +L ACS   +L  G      + K+ + +D PV      +
Sbjct: 363 LKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKL-YGID-PVREHYGCM 422

Query: 447 LSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACLQQNQAGEMMVVIHY 495
           + +  K   L+DA+++  ++  + D V+W TLL AC  Q      MV+  Y
Sbjct: 423 IDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRN----MVLAEY 448

BLAST of CsaV3_3G035640 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 238.8 bits (608), Expect = 1.3e-61
Identity = 127/398 (31.91%), Postives = 223/398 (56.03%), Query Frame = 0

Query: 83  KAFDIFQKCSSSPLK--SVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHIL 142
           KA ++F++     L+  S T   L+ ACS+  +L  G+++H +     +  +  ++  +L
Sbjct: 372 KAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALL 431

Query: 143 SMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHF 202
           ++Y KC  ++ A + F    ++NVV W  M+  Y    +  N+  ++ QM     +P+ +
Sbjct: 432 NLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQY 491

Query: 203 TFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRI 262
           T+ SI+K+C  L D +L  Q+H+ ++K+ F  +    + LI MY K  ++  A ++  R 
Sbjct: 492 TYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRF 551

Query: 263 IIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCG 322
             KD++SW +MIAG++Q  ++ +AL  FR+ML + + + +E    +A SAC+ L     G
Sbjct: 552 AGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGI-RSDEVGLTNAVSACAGLQALKEG 611

Query: 323 RQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASV 382
           +QIH      G  SDL    +L  +Y++CG +E +   F   E  D +AWNA+++GF   
Sbjct: 612 QQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQS 671

Query: 383 SNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVC 442
            N +E+   F +M   G+  N+ T  S + A SE   +  G QVH+ I K G++ +  VC
Sbjct: 672 GNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVC 731

Query: 443 NSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTA 479
           N+L+SMY+KC +++DA + F ++  K + VSWN ++ A
Sbjct: 732 NALISMYAKCGSISDAEKQFLEVSTKNE-VSWNAIINA 767

BLAST of CsaV3_3G035640 vs. TrEMBL
Match: tr|A0A0A0LDF3|A0A0A0LDF3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736910 PE=4 SV=1)

HSP 1 Score: 993.4 bits (2567), Expect = 1.8e-286
Identity = 487/489 (99.59%), Postives = 489/489 (100.00%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT
Sbjct: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
           VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI
Sbjct: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL
Sbjct: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
           HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL
Sbjct: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480

Query: 481 QQNQAGEMM 490
           QQNQAGE++
Sbjct: 481 QQNQAGEVL 489

BLAST of CsaV3_3G035640 vs. TrEMBL
Match: tr|A0A1S3B684|A0A1S3B684_CUCME (pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103486463 PE=4 SV=1)

HSP 1 Score: 943.3 bits (2437), Expect = 2.2e-271
Identity = 459/489 (93.87%), Postives = 477/489 (97.55%), Query Frame = 0

Query: 1   MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
           MA IRTF+SIYFLTLASDAIVFFNPFSP++S KNFKPQLHLA SHIRL TQLAFSPCP+T
Sbjct: 1   MAPIRTFNSIYFLTLASDAIVFFNPFSPSTSTKNFKPQLHLAHSHIRLGTQLAFSPCPVT 60

Query: 61  VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
           VHYPHDDKIISLCKKNLHREAL+AFDIF+KCSSSPLKS+TYTHLINACSSLRSLEHGRKI
Sbjct: 61  VHYPHDDKIISLCKKNLHREALQAFDIFRKCSSSPLKSITYTHLINACSSLRSLEHGRKI 120

Query: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
           HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARN+FD+MPLKNVVSWTSMISGYSRYG+E
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSRYGQE 180

Query: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
           DNAITLY+QMLRSG+IPDHFTFGSIVKSCSGLDDF LARQLHAHVLK EFG  LIAQNAL
Sbjct: 181 DNAITLYIQMLRSGYIPDHFTFGSIVKSCSGLDDFMLARQLHAHVLKFEFGGHLIAQNAL 240

Query: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
           ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300

Query: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
           EFVFGSAFSACSKLLEPDCGRQIHGLCIK GLGSD+FAGCSLCDMYAKCGFLESARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKLGLGSDIFAGCSLCDMYAKCGFLESARTVFY 360

Query: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
           HIEKPDLVAWNAIIAGFASVSNAKES SFFSQMRH G+VPNDVTVLSLLCACSEPVMLN+
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESLSFFSQMRHRGVVPNDVTVLSLLCACSEPVMLNN 420

Query: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
           G+QVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFE IGNKADIVSWNTLLTACL
Sbjct: 421 GMQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEGIGNKADIVSWNTLLTACL 480

Query: 481 QQNQAGEMM 490
           QQNQAGE++
Sbjct: 481 QQNQAGEVL 489

BLAST of CsaV3_3G035640 vs. TrEMBL
Match: tr|A5BS92|A5BS92_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_032420 PE=4 SV=1)

HSP 1 Score: 587.8 bits (1514), Expect = 2.3e-164
Identity = 285/457 (62.36%), Postives = 355/457 (77.68%), Query Frame = 0

Query: 32  IKNFKPQLHLALSHIRLNTQLAFSPCPLTVHYPHDDKIISLCKKNLHREALKAFDIFQKC 91
           IK  +PQ+  A ++++  T L+     L      ++ I +LCK+ L  EA+KAF+  QK 
Sbjct: 2   IKALRPQVGFATNNVK-ETVLS----KLRAEQSSNEYITTLCKQKLFNEAIKAFEFLQKK 61

Query: 92  SSSPLKSVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKE 151
           +   L   TY +LI+ACS LRSLEHGRKIH HML     PD+ LQNHIL+MYGKCGSLK+
Sbjct: 62  TGFCLTLSTYAYLISACSYLRSLEHGRKIHDHMLKSKSHPDLTLQNHILNMYGKCGSLKD 121

Query: 152 ARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSG 211
           A+ +FD+MP +NVVSWTS+I+GYS+ G+  NA+  Y QML+SG +PD FTFGSI+K+CS 
Sbjct: 122 AQKVFDAMPERNVVSWTSVIAGYSQNGQGGNALEFYFQMLQSGVMPDQFTFGSIIKACSS 181

Query: 212 LDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSM 271
           L D  L RQLHAHVLKSEFGA +IAQNALISMYTK + + DA++VFSR+  +DLISWGSM
Sbjct: 182 LGDIGLGRQLHAHVLKSEFGAHIIAQNALISMYTKSNVIIDALDVFSRMATRDLISWGSM 241

Query: 272 IAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFG 331
           IAGFSQLGYELEALC+F+EML Q VY PNEF+FGS FSACS LL+P+ GRQ+HG+ IKFG
Sbjct: 242 IAGFSQLGYELEALCYFKEMLHQGVYLPNEFIFGSVFSACSSLLQPEYGRQLHGMSIKFG 301

Query: 332 LGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFS 391
           LG D+FAGCSLCDMYAKCG L  AR VFY I +PDLVAWNAIIAGFA   +AKE+ +FFS
Sbjct: 302 LGRDVFAGCSLCDMYAKCGLLSCARVVFYQIGRPDLVAWNAIIAGFAYGGDAKEAIAFFS 361

Query: 392 QMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCS 451
           QMRH GL+P+++TV SLLCAC+ P  L  G+QVH YI KMG +LD+PVCN+LL+MY+KCS
Sbjct: 362 QMRHQGLIPDEITVRSLLCACTSPSELYQGMQVHGYINKMGLDLDVPVCNTLLTMYAKCS 421

Query: 452 NLNDALQVFEDIGNKADIVSWNTLLTACLQQNQAGEM 489
            L DA+  FE++   AD+VSWN +LTAC+  +QA E+
Sbjct: 422 ELRDAIFFFEEMRCNADLVSWNAILTACMHHDQAEEV 453

BLAST of CsaV3_3G035640 vs. TrEMBL
Match: tr|A0A2H5Q525|A0A2H5Q525_CITUN (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_196310 PE=4 SV=1)

HSP 1 Score: 581.3 bits (1497), Expect = 2.2e-162
Identity = 281/461 (60.95%), Postives = 353/461 (76.57%), Query Frame = 0

Query: 32  IKNFKPQLHLALSHIRLNTQLAFSPCPLTVHYPHD----DKIISLCKKNLHREALKAFDI 91
           I+N K QL     H +      F P     ++ ++    D I SLCK+NL+ EAL AFD 
Sbjct: 2   IRNLKTQLRFTFYHSQ-----PFVPSNAQTYFRNEQFSNDYISSLCKQNLYNEALVAFDF 61

Query: 92  FQKCSSSPLKSVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCG 151
            Q  ++  ++  TY  LI+ACSSLRSL+ GRK+H H+L+ N QPD++L NHIL+MYGKCG
Sbjct: 62  LQNNTNFRIRPSTYAGLISACSSLRSLQLGRKVHDHILSSNCQPDVVLHNHILNMYGKCG 121

Query: 152 SLKEARNMFDSMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVK 211
           SL++AR +FD MP +NVVSWT+MI+G S+ G+E++AI LY+QMLRSG +PD FTFGSI+K
Sbjct: 122 SLEDARMVFDEMPQRNVVSWTAMIAGCSQNGQENDAIKLYIQMLRSGVMPDQFTFGSIIK 181

Query: 212 SCSGLDDFKLARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLIS 271
           +CSGL    L RQLHAHV+KSE G+ LIAQNALI+MYTKF Q+ DA NVFS I  KD+ S
Sbjct: 182 ACSGLGSVGLGRQLHAHVIKSEHGSHLIAQNALIAMYTKFDQILDAWNVFSGIARKDITS 241

Query: 272 WGSMIAGFSQLGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLC 331
           WGSMIA FS+LGYELEALCHF EML    YQPNEF+FGS FSACS LL  +CGRQIHG+C
Sbjct: 242 WGSMIAAFSKLGYELEALCHFNEMLHHGAYQPNEFIFGSVFSACSSLLHYECGRQIHGIC 301

Query: 332 IKFGLGSDLFAGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESS 391
           IKFGLG D +AGCSLCDMYA+CG L+ AR VF  IE PDL +WNA+IAG AS SNA E+ 
Sbjct: 302 IKFGLGRDTYAGCSLCDMYARCGLLDFARIVFNEIESPDLASWNALIAGVASHSNANEAM 361

Query: 392 SFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMY 451
           S FS+MR   L+P+ +TV SLLCAC+ P+ L  G+QVHSYI+KMGF+ ++PVCN++L+MY
Sbjct: 362 SLFSEMRDRELIPDGLTVRSLLCACTGPLTLYQGMQVHSYIIKMGFDSNVPVCNAILTMY 421

Query: 452 SKCSNLNDALQVFEDIGNKADIVSWNTLLTACLQQNQAGEM 489
           +KCS L +AL VF+++G  AD VSWN+++ ACLQ NQAGE+
Sbjct: 422 AKCSVLCNALLVFKELGKNADSVSWNSIIAACLQHNQAGEL 457

BLAST of CsaV3_3G035640 vs. TrEMBL
Match: tr|A0A251NXJ7|A0A251NXJ7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G239900 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 1.2e-160
Identity = 275/451 (60.98%), Postives = 350/451 (77.61%), Query Frame = 0

Query: 55  SPCPLTVHYPH-----------------DDKIISLCKKNLHREALKAFDIFQKCSSSPLK 114
           +P PLT HY                   +D I SLC++ L++EAL+AF+  +  ++  + 
Sbjct: 6   APLPLTFHYSQPSLPSILLPKFKNQQCSNDYISSLCRQKLYKEALQAFEFLEGNTNFQIF 65

Query: 115 SVTYTHLINACSSLRSLEHGRKIHRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFD 174
             TY  L++ACS LRSL+HGRKIH H+L    +PD+IL NHIL+MYGKCGS+K+A  +FD
Sbjct: 66  PSTYADLVSACSFLRSLDHGRKIHDHILASKCEPDIILYNHILNMYGKCGSVKDAGKVFD 125

Query: 175 SMPLKNVVSWTSMISGYSRYGEEDNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKL 234
           +MP +NVVSWTS+ISG+S+  +ED AI LY +MLRSG  PDHFTFGSI+K+CSGL +  L
Sbjct: 126 AMPERNVVSWTSLISGHSQNKQEDKAIELYFEMLRSGCRPDHFTFGSIIKACSGLGNAWL 185

Query: 235 ARQLHAHVLKSEFGADLIAQNALISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQ 294
            RQ+HAHVLKSE G+  IAQNAL SMYTKF  +ADA +VFS +  KDLISWGSMIAGFSQ
Sbjct: 186 GRQVHAHVLKSETGSHSIAQNALTSMYTKFGLIADAFDVFSHVQTKDLISWGSMIAGFSQ 245

Query: 295 LGYELEALCHFREMLSQSVYQPNEFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLF 354
           LGY+ E+L HF+EML +  +QPNEF+FGSAFSACS LL+P+ G+Q+HG+CIKFGLG D+F
Sbjct: 246 LGYDKESLGHFKEMLCEGAHQPNEFIFGSAFSACSSLLQPEYGKQMHGMCIKFGLGRDIF 305

Query: 355 AGCSLCDMYAKCGFLESARTVFYHIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTG 414
           AGCSLCDMYAKCG+LESARTVFY IE+PDLV+WNAII+GF++  +A E+ SFFSQMRH G
Sbjct: 306 AGCSLCDMYAKCGYLESARTVFYQIERPDLVSWNAIISGFSNGGDANEAISFFSQMRHKG 365

Query: 415 LVPNDVTVLSLLCACSEPVMLNHGIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDAL 474
           LVP++++VLS+L AC+ P  L  G QVHSY++K  F+  + VCN+LL+MY+KCSNL DA 
Sbjct: 366 LVPDEISVLSILSACTSPSTLYQGRQVHSYLIKRAFDCIVIVCNALLTMYAKCSNLYDAF 425

Query: 475 QVFEDIGNKADIVSWNTLLTACLQQNQAGEM 489
            VFEDI N  D VSWN ++T+C+Q NQAGE+
Sbjct: 426 IVFEDIRNHTDSVSWNAIITSCMQHNQAGEV 456

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004137966.12.8e-28699.59PREDICTED: pentatricopeptide repeat-containing protein At3g53360, mitochondrial ... [more]
XP_008442662.13.3e-27193.87PREDICTED: pentatricopeptide repeat-containing protein At3g53360, mitochondrial ... [more]
XP_023526527.11.0e-24083.23pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita ... [more]
XP_022934158.11.3e-23883.03pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucur... [more]
XP_022983903.14.8e-23883.03pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
AT3G53360.12.1e-14258.25Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G15510.14.5e-6533.08Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G39530.16.5e-6433.86Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G03880.14.2e-6331.85Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G13650.17.2e-6331.91Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9LFI1|PP280_ARATH3.7e-14158.25Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop... [more]
sp|Q9M9E2|PPR45_ARATH8.2e-6433.08Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop... [more]
sp|Q9SVA5|PP357_ARATH1.2e-6233.86Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX... [more]
sp|Q9SI53|PP147_ARATH7.6e-6231.85Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
sp|Q9SVP7|PP307_ARATH1.3e-6131.91Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LDF3|A0A0A0LDF3_CUCSA1.8e-28699.59Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736910 PE=4 SV=1[more]
tr|A0A1S3B684|A0A1S3B684_CUCME2.2e-27193.87pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Cucumis ... [more]
tr|A5BS92|A5BS92_VITVI2.3e-16462.36Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_032420 PE=4 SV=1[more]
tr|A0A2H5Q525|A0A2H5Q525_CITUN2.2e-16260.95Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_196310 PE=4 SV=1[more]
tr|A0A251NXJ7|A0A251NXJ7_PRUPE1.2e-16060.98Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G239900 PE=4 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_3G035640.1CsaV3_3G035640.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 99..145
e-value: 0.0099
score: 15.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 99..132
e-value: 1.9E-4
score: 19.4
coord: 165..198
e-value: 2.6E-7
score: 28.4
coord: 368..401
e-value: 3.5E-4
score: 18.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 162..209
e-value: 2.2E-9
score: 37.2
coord: 365..412
e-value: 1.1E-7
score: 31.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 266..294
e-value: 0.0018
score: 18.3
coord: 440..463
e-value: 0.027
score: 14.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 401..435
score: 5.557
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 436..466
score: 8.309
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 97..131
score: 9.186
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 163..197
score: 11.849
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 366..400
score: 10.402
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 264..298
score: 8.265
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 233..263
score: 6.566
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 335..365
score: 5.612
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 198..232
score: 6.862
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 132..162
score: 7.64
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 468..502
score: 5.667
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 65..225
e-value: 7.9E-33
score: 116.3
coord: 416..494
e-value: 6.8E-12
score: 47.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 282..415
e-value: 3.1E-22
score: 81.3
NoneNo IPR availablePANTHERPTHR24015:SF128SUBFAMILY NOT NAMEDcoord: 420..482
NoneNo IPR availablePANTHERPTHR24015:SF128SUBFAMILY NOT NAMEDcoord: 235..423
coord: 80..210
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 80..210
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 132..312
NoneNo IPR availablePANTHERPTHR24015:SF128SUBFAMILY NOT NAMEDcoord: 132..312
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 235..423
NoneNo IPR availablePANTHERPTHR24015:SF128SUBFAMILY NOT NAMEDcoord: 394..477
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 394..477
coord: 420..482

The following gene(s) are paralogous to this gene:

None