CsaV3_1G014370 (gene) Cucumber (Chinese Long) v3

NameCsaV3_1G014370
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionpentatricopeptide repeat-containing protein At2g01860
Locationchr1 : 9840146 .. 9853424 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCAAATTAACCTCAAATTGATCTTATTTGAAAGTATTTTTTACTACTATTTTTTTTCTCTCTCATCTAGGAGGGCCAAATGTAAAAATGAGATTAATTAAAATCCTAATATTATATTTGACCACAATTGTGTCTCATGCATCAACAAAAGCATAAAGATTGATTCGAGGATGGAAGTTGCATGCAGTCGGTGCTAATTTTTCTTTAAAAAAGGAAGTTGCTATCAGATACAATAATTTGGAAAATATTCAAAGCATAAAAGAAGGTAAAATTGAAAAAAAACCCCCTAAAATATCCACGTTTTTTCTTAACCATTAGTCGAATGGAAGTTGCCTTTATCGTTTGAGAAAAAGAAAAGTGATCTTCATATGCTTTGCTGCAGTTCGATCATCGATCCTAGTAATGTATTCCAAAGTGGGTCGTCGCCCCCTCCATTTACTGTCTTCGTCTTAGAGGGATTGAACGCTATGGATCACTTGTTTCTACCTGAGAGTTTGCTTCTAAAGAGTGTGTTTTCTTTTTAGTACTTTTCTGAACGTTCTTAGCGAACTAAATTCAATTTGAAAATCACAAAACTAATAGAAAACAAAAATTAAAAATCAACACACTGTACCCAACCGAACTGATTTTTATTTTAAAAGATAGGATTTTTCTCTTTCTTCTTCAAACTAACGGCCTCTCAATCTCTATTCTCAAATTGCATCTCTCTCACCCAGCTCAGTAACTCACGTTCACGTCTTCAACTTCACGCCGCCCATCGCTCCACTTCTCCATAGCTCATTATCTCACTCTCACATTTCATCTCTCTTGACTCAGACTCAGACTCACGTTCACGTCCTCCACCGGTGACCGACCACCGCTCCTCCTCTCTCTCACGCTCACCCCTCTGACTCTCATCTCACGCTCCGAAATCGCGGAGATTCATTCAGTAAAACGGTTTTTGCAACTTTTTTCTCCCTTCGGTTCCCCCCTTGAATGGAAGATTCTCGTGGGCATTTCTTGCTTTGAAGAATATCTTCAAATTTTGAATCTTATAATCAATTTTATTTATGGTATGGCTTCTTCTATATTGTCTTTCATAATCTTATTGAATATTGTTATTGATTTGCAACTTTTTTCATCATTGTGTTGCTTTGTAGTATATTCCTCAAAATGCAAGTTGAAAATGTCCATTGGATTTTGTTCGCATTTCGTACTTCACATGTGTTTTCAATATTGTTTACAGTCTGTTGCACTGCATTCTTGTTGTTTCTGGGGTAGTTGAACAAATCCAAACCATGGTGAATTTGTGTATCTTTTATTAAATTAGAGTTGAAGTGTTTGGTGTTTGGTGGAGATGGTTATTTTATGTCATTTTCCTGTGAAATTCATGGTTTTTCCTTTCCTTTTTCTTTATTATTCCTTCCATAGGACAAATTCGAAGCAAGACCAAGGAGCCAGAGGAAGAAGTTAGAACGAGACCAATGGGCTACAATGTATAAAAGAGACGAGAGAGAGATAAGATTGAAGGGAAACAGAGTGTATCACTGATACTACTAATTGGATAGGTTCTTGTAATTTTATTTTCACTTGTAAGGTCAACTTTTGGTAGAAAACTATTTTTATAACCCCGTGGAGTCTGTAATTTGCACCTTCAAAGCTTCTCACTCGGAATTATGGATTCCATTGTCTCAGCAACTTCAGTGTCTTCCATTCTGGTGAAAGGAAATGGAGGAATTGGCTGCCAGAATACAATGGTTCATTTCAAGGCTAACTCCAGAAGACGCCCACCTAAAAACCTCCTCTGTCCACGACGGGCCAAGCTTCCTCCTAACCCTGCCGTCAACCAATTCTTTAACAACAAAACCTCAGCCCCTTCCCCACCCTTCACCGATTTGATTTCCTCTAAGATTTTCCAAGATGAGCATGAAGAAATCCATGCTCATGACTATACTAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCGTCACTCTTCCAAGGGAGGATTCCTCAGAAACCTGGTAAATTGAATCGGGAGAGACCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCCTAAAATCCGCCCAACAACAGTGGTGTCTTCCCGCGCTTTGCTGTCTAAGCAAGTCTACAAGCGTCCTGATTTTCTTATTGGCCTTGCTAGGGAGATTAGAGATCTATCCCCAGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCGTTTTTGCAGAAGGGATCTCTTTCATTGACAATCAAGGAACTAGGGCATATGGGTCTTCCTGATAGAGCTCTAAACACGTTCTGTTGGGCACAGGAACAACATCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTTCAAGGAACCATGAACTGAAGGTAGCTGTAAACTTGGAAGAGTTCACTAAACTTGCAAGTCGTGGTGTGCTCGAGGCAATGATGCGAGGGTTTATCAGAGGTGGGAGCTTAAATCTTGCTTGGAAGCTCCTCGTAGCTGCAAAGAAGGGTAAGAGAATGTTGGATCCCAGCGTCTATGTGAAGTTGATATTGGAGCTTGGTAAAAACCCTGATAAAAACATGTTGGTTCTTACCTTACTGGAAGAGCTAGGACAGAGAGAAGCCTTGAAGTTAAACCAACAAGATTGTACAACTATAGTTAAGGTCTGCACAAGGCTTGGTAAATTTGAAATTGCTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCGAGTATAGTTATGTATACTGCCTTGGTTCATAGTCGCTACTCAGACAGGAAATATAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGTCTGGAAACTGTCCTTTTGATCTTCCTGCTTATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTTCCCCTACATATAATGTATATAGGAATATGATCACCATTTATTTAGTCTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATAAGGAAGCAGAGAATGCTGGATTTATGATGGACAAACAAATTACTTCAATGTTGTTGCAAGCAAAAAGATGAATCACCTGTGGCAAGAGATATTTTTTATTATCTATCACTGAGTTGCACTATCGGTTTACCTATCATCTAGTAGGTGAGTTGTTCTCAATGTATTGGAATTGCATTTTCATCTGCACTAAATAAATTGTTAATGAATATTCTTTGTTTCCCTTCATCTACCACGTTCATATTGCATCTAATCTATGGATTTCAGTAGTACATCTTAAAACGTACTTGGATTTTTGCTTTAGGATATATACATATGACTTATGCACTTTTCTTTTTTTTTGTTTTTTTGTTTTTTTCCTTTTGAAAACAAAGCATGAAGCTCAAATGCATTACGCCAGAAATCTCTGATTTGCCCCAATACATTGTGATATCCAATCTGGCACATCAGTAACCCAACAAGAGAAAACCTGACAAGAAAGAGAAAATTTGGCCGATAAATGGGCCACACTATTACTCGTTCTGGGCAATTAAGTAAAGTGAAAACATCCTTATAAATAACTCCGTATACAGATAGATAACTCCTTATGCAGATAAGGAAAAGTTGAAAGAATGTCTCCTGGACTAATAATATACATTGTACAAATTCAACGAAGATTTTGACATATACTGACCAACAATTCTTATTCTCATCTAGCTTCTATATATTGTTAGTTCTATTATTCTATGAAGTAACTCAATGCTTTGTCCTTCCAAGTACCTGTTTGTAGATAAGTTTCATAAAAGTGTACCATTCCTGTTGTAGTTATAGTCAATACAGATGTGCTTCTGGTACCAAAACATCCCTGTGTTGAAATTTTTTTAAAAACTAAAGTCAAAATTTCTGCTATGAATGAAATCAAATTGTATGCTTATCAGAGGGGTCTGAAATTGGACAAAGACAGAGCTTGTATTGTACTCCCAGTCAGGGGAGGATATATGAGGAAGTTTGCTTTCATCAGTTTTTACATTGCCCCTCATTAATCTCTCATGGTATTTCACCTTCAGCATATATACGAAGCTGTTCATTAAATTTTAGCCTCAAGCGCTGCATCTGCATGTAAGTCAAAGCCTTCAGTCACTCTATTTTATTTTTCTTCACTTTTTCTTTCTTGCGGGTTCATTTGCCTTTTCACATGTGCCCATGTGTGTGTATTTGTAACCAATCTCCCCTTGTTGATGAATATAGATTTGAACATTTTGGGACCAACGAAGAAAGTTATCCCATTAAGTTGAATGGTAGTTATTTGGACAGTGGGACTGTTGTTTAGGGTGCGGTTGTCTAAGACTTTAGTGGCAATGGGCTTGGTGTCTGACATATTGCTGAAGAAACACCAACCCAAAGAGAAAAACCAAAGATCTGGAAAGAAAACACCAGATTAAAGAAGAAAAAACAGGGGTGGCGGTGATTGGAGGTGTAACATGGCAGAAGGTGACGGAACACGACATTGGTGTGATAACACCAACTAGGAGTGATCTTATTCAATCGGTAAAAACGAAAGAAATTACAAGGATGAATCGATAGATTGTCAATTCGGTGAAATAAATGCAAGAATACAAGAAAATAGTTGATCGCTCTCCTTCTATCTCTCCTCTCTCTCAAGGATGAAGATAAAGGTCCTTCGAAAATATTTCCTCCCCTCTCACAGTGACTCTCCCCGCTATTTAAACCTCCCATTCTCTCCTAACTAACTGTGGGCCCAGTATTAGCACTCACACTCACTCCCCATTACACATGCTTTCTCTTTCTTTCCTCTCCTGCTATATTGCAATATTATTGGAGGTCTAACATTGTTGCAGACTTGACTGACCTATGGTAGGAGCTATCCTAAGCAAAAAGCGGCTCCTTTGGACAAGTTTCACTCGAGGGAAAGAAAAAAAAAGCTTTGACGGCTCTGGCTCCGGGTTCAGCTTGGATTTGAAGGATTGACCACGATGGTAAACAAGGAAAAGTTGACGACCCTGGCAGCGGGGGGCAAATAGGGTGTTTTTAGTGTTTGAGAAATAGGTGTTTTTAGAGTTTTAGGACAATTTAATCTCTGATACCATGTGTAAAAATGTGTTTTCTCTTATTTCCCTCGGAAAGAAAGGGGTTTTCTTAAATAGAAATACCCAATACAAAAAATGAAAAATACAAATAAGGAAAAAATAAATACAGAAATATATCTTAAATACAATGAAAATATTAACAATGAAGGAAATAATCAACAATAAACAAACTTCTTAACTAAGATTAGTTGATTTTAACATGATGGAAGTTTTCTTTCACGAAGAATTTGGCTGAAATCTTGCCTGAAGAATTCAGTTAAATTCTTCTATCATTATGAGATCCAATATAAAATGTAAATTTAAAAATAATAAATAAGTGAAACTCTTGGCATCTCTTTCTTGGAGGGAATATCGATATTACAAAAAGGGAGGGTGGGGGGATGAAAAGAAAATAAATAGTGATGTATATCAGTGGATTGAATTTCCTCTAATTAAAGGGGATATAGATAAGAAATAGAAATTGGGAAAAGAATTGGAGAAACGAGAGAGATTGATTTGAGAGAAGCCGAGGAAAAAACTGAATGACATTTTGTGTATGGAGTACTTGACCAGTTCCAGCTTTCTGAGTGGGATGGTATAATGTACTGTTGGCTCTGGCTTTATGTGGTGTCCTTGCAATGGCACTCTCCACTACATGTTCACAATGTTGAAAGCTAAAGAAAAAGTGCAATGGGAGTCGACTATTGAAGGTACACCACATGACACCCTTGCACAAGGCAAGAAGATGGTTACGTGCCTCACCTTGCCTAGGACCATAAGCTTGCCTCAAAATGCACCTCAATAACACTGCTGAGTACCTTTTCAATTTATACTTTTCTGCCTGAAAGCTGGTTTGCTAATTGAAAGTTACCGGTGGTTTTAGCAAGAATGTGTTACGTACTTAAGCACCTTGGAGAGCCACGGGGTTTCTGTTGGGGTAGGAGAGCCAAGCTACTTAGGGCCCGTTTGGTAACGTTCCCGTTTTCCGTTTCCCATTTCTTGTTTCTGGTTCTAATTTTTTAAGAAACGGATGTGTTTTGTAACGTTCCTGTTTCTTGTTCCCAAAATAATTAGAAATGTTTCTAATTTAATAAGAAATTTGTGGGAACAACAAAAAAGAGTTTCTCCTCGTTCCATTCTTGTTCTCTTCCCTTTGTTTCTCCTCTTTGTTCTTGGGCTTTCTTCTCCGACTCTCTTTTTCTCTTCGTTCGTTCCTCTTCCGTTTCTCTTTGTTTAGTTCCTCTCCGATTACCTTTTCTCTCTTTGTTCTTCGTTTCTTCTCCGGCTTTCTTCTTTGCCTCTCTTCTAAAGTTATGGCTATAGTTTTCTCAAGCTTCCTCTTCATCAAACGTACAGCTACCCTTTCCTCTTCATCAATTTCCAAATTCATGATTTTCTTTGCTCCATCAACAGGAAGCTCTTCTCTTTCTCCTCACTCCCAAAGCGGAACTTGGGAACAGAATTCTAGGTATTCCCTTAATTTTCTCCCATTTTCCTATCATTTCTGTTATGGGTTTCTCAAATTTTCAATCCTAAGATGAATTTTGTTTAGATTTAAACCTTTTTGTTATCTTTGTTTGGCAGGATTTTTGTTTTCTATAGTAAATTTCAGCTGTAGTTTAGTGGAAAGGTGGATATTGTTTTTGGAAATTGAATCTATGTATTTGCTGCTTATTGTTATCATTGTTCATTATCAAATTAGAACTCCAATCCTCTGTTTTTGCCTTCAAAACCCCTTTCAGGTTCCAGGCATTCCTCGATTTATACTGATTTTATGCCTCATTTAATGATAATTAGCCTTATGGGTACTCTTGGGTCATTTTAACAACATTGCCAAATTTTAGCCTTTGTGTTCTATGTATTAGTGACTCATCAGTTTAAGCAAAATACTCTTTAATGATCTCAGGTTCTCAACATTTTTTTATACAGCCGTCTAGGAAGTGAAACCTTCAATGGCACAATATGCTAGTTTAGTCCATACTAGCCTCTTAAATTATGGCTTTACTCACAAAATGAATCCTTCATATTGGAGATATGATATGGTTAAATCACGACCTTCCCCTCAACGTAGCTTTCGTGTCAAAGTTGTGCAAGATACCGAAGGTCCTAGTAGGATAGTTGATATCATTAGACTCGTGCCTGAGCTCTCAAGAAATTACTTTAGAAGTCCTTCGAGGAGGGCCCTTTTTGGATGAATCTCATTGTTGGGTGGCTTTTATGTGGCACAAAATATCTCATTGTCATTTGGAGCTTTTGGAGTAAATGATGTTTTTGCTACTGTGGTATGCATTCTCCTCACCGATGTTACTCGATTTTATTACAATCGACCAAAGGTAACTTTCCCCATTGCTCTACTGAACAACTTCAAAATGGGTTTCACTTGTGGTCTTTTCATTGATGCTTTCAAACTTACTATTTAACAGCACTTGAAAAAACCATTCTGGTGGTAGCTATAGCATTTACATTTGTAAAGTTCTTTGAGGGATAATTTTGGTTTTACTCTCTATCTTTACAGCTTGTTCTAATTCTTTTTGTTGAATTTTTTTCCATTTTCCCCTTTGTTTCTTCTGTATATTTTCAACATACGGTTTTCATTGGAGAGTTAAATGTTCTATACACCCTCAAATCAAATATAGTAACAAATTTGAATGTAATTGATGACTTTTAAGAGTTTAATTCTTGTTTGTGTATTGAAATGCATCCCTTGCAATGGAGAATTGTTTGGAAACCATGAATCCAGTGTTGACTATCAGCGACAAATTTAAGATGGTAAGTTGTTAATAAGATACAAGCTTTCACCTAGTGTACTTCAACAGAACCATAATGTGTAGTCACTTGGCATTATTGTGTTTACTATTGTTGTTAAGAATATCTCTTATCTTTTGGTTTGATTCATAGATAAGCTTCAGATGTATTGAAATTTATGCTGATACATAATTTAAATGATATTGATCTTACCTTCTTATCTTTTGGTTTGATTCAAGGCTGTTGTAGTCTATATTTTTGTGAATAACTGGCATTTCTTTCAAACTTTCATATAAATATTTGTTTTAAATCTCTTTTGTAGATTGGAGAATATGAAGAAACCATAGGCACATGTCTTACACTGAAGTCATGTTCTCTAATGTAGAGGAAGTCCTTGTGGTTGAAGAAGCTCAGCCAACTGAAACAAATCATTGCACAAGAGAAGAAGTTGAACCAAAACAAGCTACCAAAAAGGAGTTGAAGCCCATTGCCTGTGTACATAAGATCCTTATATTTAACTTGATTCTGTTGTCCCAAAGCTCAATATCTTAGTGCATTAAAGGGACTAGTGTTTTCATATATGTATATATTGAGCTGTAAGTTTATGTATATGTAGATGCAATGGGACAGATAAGCTAAGAATTTAGATGAGTTGTCGGTAGTCATCCAAGAAGAACAAATATAGAGAATCATGAAGTTTGTAAAACAAAACAACCTTGAATGTTTGGTTTGTATTGTGAATTATCTATGCCTCTTTCTCCTTTCTAGTTTTAATGATTTTCTTTTTTATTCCAATT

mRNA sequence

ATGGATTCCATTGTCTCAGCAACTTCAGTGTCTTCCATTCTGGTGAAAGGAAATGGAGGAATTGGCTGCCAGAATACAATGGTTCATTTCAAGGCTAACTCCAGAAGACGCCCACCTAAAAACCTCCTCTGTCCACGACGGGCCAAGCTTCCTCCTAACCCTGCCGTCAACCAATTCTTTAACAACAAAACCTCAGCCCCTTCCCCACCCTTCACCGATTTGATTTCCTCTAAGATTTTCCAAGATGAGCATGAAGAAATCCATGCTCATGACTATACTAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCGTCACTCTTCCAAGGGAGGATTCCTCAGAAACCTGGTAAATTGAATCGGGAGAGACCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCCTAAAATCCGCCCAACAACAGTGGTGTCTTCCCGCGCTTTGCTGTCTAAGCAAGTCTACAAGCGTCCTGATTTTCTTATTGGCCTTGCTAGGGAGATTAGAGATCTATCCCCAGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCGTTTTTGCAGAAGGGATCTCTTTCATTGACAATCAAGGAACTAGGGCATATGGGTCTTCCTGATAGAGCTCTAAACACGTTCTGTTGGGCACAGGAACAACATCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTTCAAGGAACCATGAACTGAAGGTAGCTGTAAACTTGGAAGAGTTCACTAAACTTGCAAGTCGTGGTGTGCTCGAGGCAATGATGCGAGGGTTTATCAGAGGTGGGAGCTTAAATCTTGCTTGGAAGCTCCTCGTAGCTGCAAAGAAGGGTAAGAGAATGTTGGATCCCAGCGTCTATGTGAAGTTGATATTGGAGCTTGGTAAAAACCCTGATAAAAACATGTTGGTTCTTACCTTACTGGAAGAGCTAGGACAGAGAGAAGCCTTGAAGTTAAACCAACAAGATTGTACAACTATAGTTAAGGTCTGCACAAGGCTTGGTAAATTTGAAATTGCTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCGAGTATAGTTATGTATACTGCCTTGGTTCATAGTCGCTACTCAGACAGGAAATATAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGTCTGGAAACTGTCCTTTTGATCTTCCTGCTTATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTTCCCCTACATATAATGTATATAGGAATATGATCACCATTTATTTAGTCTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATAAGGAAGCAGAGAATGCTGGATTTATGATGGACAAACAAATTACTTCAATGTTGTTGCAAGCAAAAAGATGA

Coding sequence (CDS)

ATGGATTCCATTGTCTCAGCAACTTCAGTGTCTTCCATTCTGGTGAAAGGAAATGGAGGAATTGGCTGCCAGAATACAATGGTTCATTTCAAGGCTAACTCCAGAAGACGCCCACCTAAAAACCTCCTCTGTCCACGACGGGCCAAGCTTCCTCCTAACCCTGCCGTCAACCAATTCTTTAACAACAAAACCTCAGCCCCTTCCCCACCCTTCACCGATTTGATTTCCTCTAAGATTTTCCAAGATGAGCATGAAGAAATCCATGCTCATGACTATACTAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCGTCACTCTTCCAAGGGAGGATTCCTCAGAAACCTGGTAAATTGAATCGGGAGAGACCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCCTAAAATCCGCCCAACAACAGTGGTGTCTTCCCGCGCTTTGCTGTCTAAGCAAGTCTACAAGCGTCCTGATTTTCTTATTGGCCTTGCTAGGGAGATTAGAGATCTATCCCCAGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCGTTTTTGCAGAAGGGATCTCTTTCATTGACAATCAAGGAACTAGGGCATATGGGTCTTCCTGATAGAGCTCTAAACACGTTCTGTTGGGCACAGGAACAACATCGACTCTTTCCAGATGATCGTGTTTTGGCCTCAACCGTTGAGGTCCTTTCAAGGAACCATGAACTGAAGGTAGCTGTAAACTTGGAAGAGTTCACTAAACTTGCAAGTCGTGGTGTGCTCGAGGCAATGATGCGAGGGTTTATCAGAGGTGGGAGCTTAAATCTTGCTTGGAAGCTCCTCGTAGCTGCAAAGAAGGGTAAGAGAATGTTGGATCCCAGCGTCTATGTGAAGTTGATATTGGAGCTTGGTAAAAACCCTGATAAAAACATGTTGGTTCTTACCTTACTGGAAGAGCTAGGACAGAGAGAAGCCTTGAAGTTAAACCAACAAGATTGTACAACTATAGTTAAGGTCTGCACAAGGCTTGGTAAATTTGAAATTGCTGAGAAACTTTATAGCTGGTATGTTGAATCTGGACATGAACCGAGTATAGTTATGTATACTGCCTTGGTTCATAGTCGCTACTCAGACAGGAAATATAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGTCTGGAAACTGTCCTTTTGATCTTCCTGCTTATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTTCCCCTACATATAATGTATATAGGAATATGATCACCATTTATTTAGTCTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATAAGGAAGCAGAGAATGCTGGATTTATGATGGACAAACAAATTACTTCAATGTTGTTGCAAGCAAAAAGATGA

Protein sequence

MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFFNNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQITSMLLQAKR
BLAST of CsaV3_1G014370 vs. NCBI nr
Match: XP_004139567.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654198.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654204.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >KGN64877.1 hypothetical protein Csa_1G144300 [Cucumis sativus])

HSP 1 Score: 972.6 bits (2513), Expect = 4.9e-280
Identity = 489/489 (100.00%), Postives = 489/489 (100.00%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60

Query: 61  NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120
           NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP
Sbjct: 61  NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120

Query: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL 180
           GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL
Sbjct: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL 180

Query: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS 240
           SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS
Sbjct: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS 240

Query: 241 TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD 300
           TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD
Sbjct: 241 TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD 300

Query: 301 PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL 360
           PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL
Sbjct: 301 PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL 360

Query: 361 YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA 420
           YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA
Sbjct: 361 YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA 420

Query: 421 LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI 480
           LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI
Sbjct: 421 LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI 480

Query: 481 TSMLLQAKR 490
           TSMLLQAKR
Sbjct: 481 TSMLLQAKR 489

BLAST of CsaV3_1G014370 vs. NCBI nr
Match: XP_008462173.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462181.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462189.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902994.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902996.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo])

HSP 1 Score: 918.7 bits (2373), Expect = 8.4e-264
Identity = 464/489 (94.89%), Postives = 474/489 (96.93%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           M SIVSATSVSSILVKGNGGIGCQ TMVHFKANSRRRPPKNLLCPRRAKLPP+PAVNQF 
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120
           NNKTSAPSP FTDLISSKIFQDEHEEIHA+DYTKDTDVVWDSDEIEAISSLFQGRIPQKP
Sbjct: 61  NNKTSAPSPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120

Query: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL 180
           GKLNRERPLPLPLPHKLRPPRLPNPKIRPTT VSSRALLSK+VYKRPDFLIGLAR IRDL
Sbjct: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLARAIRDL 180

Query: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS 240
           SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRAL TFCW QEQ RLFPDDRVLAS
Sbjct: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFPDDRVLAS 240

Query: 241 TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD 300
           TVEVLSRNHELKV VNLEEFTKLASRGVLEAMMRGFI+GGSLNLAWKLLVAAKKGKRMLD
Sbjct: 241 TVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAKKGKRMLD 300

Query: 301 PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL 360
           PSVYVKLILELGKNPDKN+LVLTLLEELGQREALKLNQQD TTI+KVCTRL KFEIAEKL
Sbjct: 301 PSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRKFEIAEKL 360

Query: 361 YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA 420
           Y WYVESGHEPS+VMYTALVHSRYSDRKYREALSLVWEMES NCPFDLPAY+VVIKLFVA
Sbjct: 361 YCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNVVIKLFVA 420

Query: 421 LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI 480
           LGDLSRAVRYFAKLKEAGFSPTY+VYRNMITIYLVSGRLAK KEIYKEAENAGF+MDKQI
Sbjct: 421 LGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAGFIMDKQI 480

Query: 481 TSMLLQAKR 490
           TSMLLQAKR
Sbjct: 481 TSMLLQAKR 489

BLAST of CsaV3_1G014370 vs. NCBI nr
Match: XP_022951807.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita moschata])

HSP 1 Score: 790.0 bits (2039), Expect = 4.5e-225
Identity = 402/501 (80.24%), Postives = 439/501 (87.62%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           MDS+ S T++SSILVK NGGI CQ  + HF+ NSRRRPPKNLL PRR KLPP+P VNQF 
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP--PFTDLISSKIF------QDEHEEIHAHDY----TKDTDVVWDSDEIEAI 120
             +TS P P   F DLISS+         DE EE  A +Y      D+DVVWDS+EIEAI
Sbjct: 61  KKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPP LPNPKIRP T VSSRAL+SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQ 240
           FLIGLAR IRDL PEENVSKVLNRW PFLQKGSLSLTIKELGHMGL DRAL TFCW QEQ
Sbjct: 181 FLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 HRLFPDDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKL 300
            RL+PDDRVLASTVEVL+RNHELK+  NL+EFTKLASRGVLEAMMRGFI+GG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVAAKKGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVC 360
           LVAAK GKRMLDPSVYVKLILE+GKNPDKNMLVL LL+ELGQREAL LNQQD + I+KV 
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDL 420
           TRLGKFEIAE+LYSWYVESGHEPS+VMYTALVH+RYS+RKYREALS+VWEME+ N PFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFDL 420

Query: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKE 480
           PAYSVV+KLFVALGDLSRAVRYFAKLKEAGF+PTY +YRN+ITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFMMDKQITSMLLQAKR 490
           AENAG++MDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501

BLAST of CsaV3_1G014370 vs. NCBI nr
Match: XP_023537574.1 (pentatricopeptide repeat-containing protein At2g01860 [Cucurbita pepo subsp. pepo] >XP_023537576.1 pentatricopeptide repeat-containing protein At2g01860 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 788.9 bits (2036), Expect = 1.0e-224
Identity = 400/501 (79.84%), Postives = 437/501 (87.23%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           MDS+ S T++SSILVK NGG+ CQ  M HF+ NSRRRPPKNLL PRR KLPP+P VNQF 
Sbjct: 1   MDSLFSTTTISSILVKRNGGVSCQIPMAHFQTNSRRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP--PFTDLISSKIF------QDEHEEIHAHDY----TKDTDVVWDSDEIEAI 120
             +TS P P     DLI S+         DE EE  A +Y      D+D+VWDS+EIEAI
Sbjct: 61  KKRTSGPHPDTSLPDLIPSEKIGPPEEELDELEETAADNYFANDDNDSDIVWDSEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPP LPNPKIRP T VSSRAL+SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQ 240
           FLIGLAR IRDL PEENVSKVLNRW PFLQKGSLSLTIKELGHMGL DRAL TFCW QEQ
Sbjct: 181 FLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 HRLFPDDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKL 300
            RL+PDDRVLASTVEVL+RNHELK+  NL+EFTKLASRGVLEAMMRGFI+GG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVAAKKGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVC 360
           LVAAK GKRMLDPSVYVKLILE+GKNPDKNMLVL LL+ELGQREAL LNQQD + I+KV 
Sbjct: 301 LVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDL 420
           TRLGKFEIAE+LYSWYVESGHEPS+VMYTALVH+RYS+RKYREALS+VWEME+  CPFDL
Sbjct: 361 TRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAAKCPFDL 420

Query: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKE 480
           PAYSVVIKLFVALGDLSRAVRYFAKLKEAGF+PTY +YRN+ITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFMMDKQITSMLLQAKR 490
           AENAG++MDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501

BLAST of CsaV3_1G014370 vs. NCBI nr
Match: XP_023001961.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita maxima])

HSP 1 Score: 785.4 bits (2027), Expect = 1.1e-223
Identity = 400/501 (79.84%), Postives = 437/501 (87.23%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           MDS+ S T+VSSILVK NGGI CQ  M HF  NS+RRPPKNLL PRR KLPP+P VNQF 
Sbjct: 1   MDSLFSTTAVSSILVKRNGGISCQIPMAHFLTNSKRRPPKNLLYPRRTKLPPDPGVNQFL 60

Query: 61  NNKTSAPSP--PFTDLISSKIF------QDEHEEIHAHDY----TKDTDVVWDSDEIEAI 120
             +TS P P   + DLI S+         DE EE  A +Y      D+D+VWD +EIEAI
Sbjct: 61  KKRTSDPHPDTSYPDLIPSEKIGLPEEELDELEETAADNYFANDDNDSDIVWDPEEIEAI 120

Query: 121 SSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPD 180
           +SLF+GRIPQKPGKLNRERPLPLPLPHKLRPP LPNPKIRP T VSSRAL+SKQVYKRPD
Sbjct: 121 TSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRPD 180

Query: 181 FLIGLAREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQ 240
           FLIGLAR IRDL PEENVSKVLNRW PFLQKGSLSLTIKELGHMGL DRAL TFCW QEQ
Sbjct: 181 FLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQEQ 240

Query: 241 HRLFPDDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKL 300
            RL+PDDRVLASTVEVL+RNHELK+  NL+EFTKLASRGVLEAMMRGFI+GG L+LAWKL
Sbjct: 241 PRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWKL 300

Query: 301 LVAAKKGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVC 360
           LVAAK GKRMLDPSV+VKLILE+GKNPDKNMLVL LL+ELGQREAL L+QQD + I+KV 
Sbjct: 301 LVAAKNGKRMLDPSVHVKLILEIGKNPDKNMLVLALLDELGQREALNLSQQDTSAIIKVS 360

Query: 361 TRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDL 420
           TRLGKFEIAEKLYSWYVESGHEPS+VMYTALVH+RYS+RKYREALS+VWEME+ NCPFDL
Sbjct: 361 TRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFDL 420

Query: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKE 480
           PAYSVVIKLFVALGDLSRAVRYFAKLKEAGF+PTY +YRN+ITIYL +GRLAKCKEIYKE
Sbjct: 421 PAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYKE 480

Query: 481 AENAGFMMDKQITSMLLQAKR 490
           AENAG++MDKQITSMLLQAKR
Sbjct: 481 AENAGYVMDKQITSMLLQAKR 501

BLAST of CsaV3_1G014370 vs. TAIR10
Match: AT2G01860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 457.6 bits (1176), Expect = 9.6e-129
Identity = 250/476 (52.52%), Postives = 322/476 (67.65%), Query Frame = 0

Query: 17  GNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFFNNKTSAPSPPFTDLIS 76
           GN G+   N     + N  ++  KNL  PRR KLPP+  VN F       P         
Sbjct: 23  GNIGVTRVNAS---QRNHSKKLTKNLRNPRRTKLPPDFGVNLFLRKPKIEP--------- 82

Query: 77  SKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHK 136
             +                    W+ +EIEAISSLFQ RIPQKP K +R RPLPL     
Sbjct: 83  -LVIXXXXXXXXXXXXXXXXXXXWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPL---XX 142

Query: 137 LRPPRLPNPKIRPTTVVSSRAL--LSKQVYKRPDFLIGLAREIRDL-SPEENVSKVLNRW 196
                          ++ S AL  +SKQVYK P FLIGLAREI+ L S + +VS VLN+W
Sbjct: 143 XXXXXXXXXXXXXXNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKW 202

Query: 197 GPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKV 256
             FL+KGSLS TI+ELGHMGLP+RAL T+ WA++   L PD+R+LAST++VL+++HELK+
Sbjct: 203 VSFLRKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL 262

Query: 257 AVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLILELGK 316
              L+    LAS+ V+EAM++G I GG LNLA KL++ +K   R+LD SVYVK+ILE+ K
Sbjct: 263 ---LKFDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEIAK 322

Query: 317 NPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSI 376
           NPDK  LV+ LLEEL +RE LKL+QQDCT+I+K+C +LG+FE+ E L+ W+  S  EPS+
Sbjct: 323 NPDKYHLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREPSV 382

Query: 377 VMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVRYFAK 436
           VMYT ++HSRYS++KYREA+S+VWEME  NC  DLPAY VVIKLFVAL DL RA+RY++K
Sbjct: 383 VMYTTMIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYYSK 442

Query: 437 LKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQITSMLLQAKR 490
           LKEAGFSPTY++YR+MI++Y  SGRL KCKEI KE E+AG  +DK  +  LLQ ++
Sbjct: 443 LKEAGFSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479

BLAST of CsaV3_1G014370 vs. TAIR10
Match: AT5G25630.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 70.5 bits (171), Expect = 3.3e-12
Identity = 35/124 (28.23%), Postives = 64/124 (51.61%), Query Frame = 0

Query: 342 TTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMES 401
           T ++ V    G+   A+ ++    E+GH PS++ YT L+ +    ++Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 402 GNCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAK 461
                D   ++ VI  F   G++  AV+   K+KE G +PT + Y  +I  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 462 CKEI 466
             E+
Sbjct: 169 SSEL 172

BLAST of CsaV3_1G014370 vs. TAIR10
Match: AT5G48730.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 53.5 bits (127), Expect = 4.2e-07
Identity = 61/258 (23.64%), Postives = 115/258 (44.57%), Query Frame = 0

Query: 202 LSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKVAVNLEEFT 261
           +S++ +E  +    D++ NT    +E  +     + L +  +V  R+ +    +  ++ T
Sbjct: 30  ISISPREPNYAITSDKSNNTSLSLRETRQ----SKWLINAEDVNERDSK---EIKEDKNT 89

Query: 262 KLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLILE--LGKNPDKNM 321
           K+ASR  +  ++R          A K ++  KKG + L P   ++ + E       +  +
Sbjct: 90  KIASRKAISIILR--------REATKSIIEKKKGSKKLLPRTVLESLHERITALRWESAI 149

Query: 322 LVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIV---MY 381
            V  LL     RE L   + +    VK+   LGK +  EK +  + E  +E  +V   +Y
Sbjct: 150 QVFELL-----REQL-WYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVY 209

Query: 382 TALVHSRYSDRKYREALSLVWEMESG-NCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLK 441
           TALV +     ++  A +L+  M+S  NC  D+  YS++IK F+ +    +     + ++
Sbjct: 210 TALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMR 266

Query: 442 EAGFSPTYNVYRNMITIY 454
             G  P    Y  +I  Y
Sbjct: 270 RQGIRPNTITYNTLIDAY 266

BLAST of CsaV3_1G014370 vs. TAIR10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 48.5 bits (114), Expect = 1.3e-05
Identity = 37/158 (23.42%), Postives = 69/158 (43.67%), Query Frame = 0

Query: 335 KLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIVM----YTALVHSRYSDRKYR 394
           +L + D  ++VK     G +E A  L+ W V S +  ++ +        V     + +Y 
Sbjct: 133 ELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYS 192

Query: 395 EALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMI 454
            A  L+ ++       D+ AY+ ++  +   G   +A+  F ++KE G SPT   Y  ++
Sbjct: 193 VAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVIL 252

Query: 455 TIYLVSGR-LAKCKEIYKEAENAGFMMDKQITSMLLQA 488
            ++   GR   K   +  E  + G   D+   S +L A
Sbjct: 253 DVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSA 290

BLAST of CsaV3_1G014370 vs. TAIR10
Match: AT1G20300.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 44.7 bits (104), Expect = 1.9e-04
Identity = 32/129 (24.81%), Postives = 59/129 (45.74%), Query Frame = 0

Query: 344 IVKVCTRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGN 403
           ++ +  ++ +F++A  L         E SI  +T L+          EA+     ME   
Sbjct: 157 MIDLSGKVRQFDLAWHLIDLMKSRNVEISIETFTILIRRYVRAGLASEAVHCFNRMEDYG 216

Query: 404 CPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCK 463
           C  D  A+S+VI         S A  +F  LK+  F P   VY N++  +  +G +++ +
Sbjct: 217 CVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDR-FEPDVIVYTNLVRGWCRAGEISEAE 276

Query: 464 EIYKEAENA 473
           +++KE + A
Sbjct: 277 KVFKEMKLA 284

BLAST of CsaV3_1G014370 vs. Swiss-Prot
Match: sp|Q5XET4|PP142_ARATH (Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX=3702 GN=EMB975 PE=2 SV=1)

HSP 1 Score: 457.6 bits (1176), Expect = 1.7e-127
Identity = 250/476 (52.52%), Postives = 322/476 (67.65%), Query Frame = 0

Query: 17  GNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFFNNKTSAPSPPFTDLIS 76
           GN G+   N     + N  ++  KNL  PRR KLPP+  VN F       P         
Sbjct: 23  GNIGVTRVNAS---QRNHSKKLTKNLRNPRRTKLPPDFGVNLFLRKPKIEP--------- 82

Query: 77  SKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHK 136
             +                    W+ +EIEAISSLFQ RIPQKP K +R RPLPL     
Sbjct: 83  -LVIXXXXXXXXXXXXXXXXXXXWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPL---XX 142

Query: 137 LRPPRLPNPKIRPTTVVSSRAL--LSKQVYKRPDFLIGLAREIRDL-SPEENVSKVLNRW 196
                          ++ S AL  +SKQVYK P FLIGLAREI+ L S + +VS VLN+W
Sbjct: 143 XXXXXXXXXXXXXXNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKW 202

Query: 197 GPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKV 256
             FL+KGSLS TI+ELGHMGLP+RAL T+ WA++   L PD+R+LAST++VL+++HELK+
Sbjct: 203 VSFLRKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL 262

Query: 257 AVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLILELGK 316
              L+    LAS+ V+EAM++G I GG LNLA KL++ +K   R+LD SVYVK+ILE+ K
Sbjct: 263 ---LKFDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEIAK 322

Query: 317 NPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSI 376
           NPDK  LV+ LLEEL +RE LKL+QQDCT+I+K+C +LG+FE+ E L+ W+  S  EPS+
Sbjct: 323 NPDKYHLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREPSV 382

Query: 377 VMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVRYFAK 436
           VMYT ++HSRYS++KYREA+S+VWEME  NC  DLPAY VVIKLFVAL DL RA+RY++K
Sbjct: 383 VMYTTMIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYYSK 442

Query: 437 LKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQITSMLLQAKR 490
           LKEAGFSPTY++YR+MI++Y  SGRL KCKEI KE E+AG  +DK  +  LLQ ++
Sbjct: 443 LKEAGFSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479

BLAST of CsaV3_1G014370 vs. Swiss-Prot
Match: sp|Q8GZ63|PP397_ARATH (Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX=3702 GN=At5g25630 PE=2 SV=2)

HSP 1 Score: 70.5 bits (171), Expect = 6.0e-11
Identity = 35/124 (28.23%), Postives = 64/124 (51.61%), Query Frame = 0

Query: 342 TTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMES 401
           T ++ V    G+   A+ ++    E+GH PS++ YT L+ +    ++Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 402 GNCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAK 461
                D   ++ VI  F   G++  AV+   K+KE G +PT + Y  +I  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 462 CKEI 466
             E+
Sbjct: 169 SSEL 172

BLAST of CsaV3_1G014370 vs. Swiss-Prot
Match: sp|Q9FKC3|PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 53.5 bits (127), Expect = 7.6e-06
Identity = 61/258 (23.64%), Postives = 115/258 (44.57%), Query Frame = 0

Query: 202 LSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKVAVNLEEFT 261
           +S++ +E  +    D++ NT    +E  +     + L +  +V  R+ +    +  ++ T
Sbjct: 30  ISISPREPNYAITSDKSNNTSLSLRETRQ----SKWLINAEDVNERDSK---EIKEDKNT 89

Query: 262 KLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLILE--LGKNPDKNM 321
           K+ASR  +  ++R          A K ++  KKG + L P   ++ + E       +  +
Sbjct: 90  KIASRKAISIILR--------REATKSIIEKKKGSKKLLPRTVLESLHERITALRWESAI 149

Query: 322 LVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIV---MY 381
            V  LL     RE L   + +    VK+   LGK +  EK +  + E  +E  +V   +Y
Sbjct: 150 QVFELL-----REQL-WYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVY 209

Query: 382 TALVHSRYSDRKYREALSLVWEMESG-NCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLK 441
           TALV +     ++  A +L+  M+S  NC  D+  YS++IK F+ +    +     + ++
Sbjct: 210 TALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMR 266

Query: 442 EAGFSPTYNVYRNMITIY 454
             G  P    Y  +I  Y
Sbjct: 270 RQGIRPNTITYNTLIDAY 266

BLAST of CsaV3_1G014370 vs. Swiss-Prot
Match: sp|O64624|PP163_ARATH (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 2.4e-04
Identity = 37/158 (23.42%), Postives = 69/158 (43.67%), Query Frame = 0

Query: 335 KLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIVM----YTALVHSRYSDRKYR 394
           +L + D  ++VK     G +E A  L+ W V S +  ++ +        V     + +Y 
Sbjct: 133 ELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYS 192

Query: 395 EALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMI 454
            A  L+ ++       D+ AY+ ++  +   G   +A+  F ++KE G SPT   Y  ++
Sbjct: 193 VAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVIL 252

Query: 455 TIYLVSGR-LAKCKEIYKEAENAGFMMDKQITSMLLQA 488
            ++   GR   K   +  E  + G   D+   S +L A
Sbjct: 253 DVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSA 290

BLAST of CsaV3_1G014370 vs. TrEMBL
Match: tr|A0A0A0LVM0|A0A0A0LVM0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1)

HSP 1 Score: 972.6 bits (2513), Expect = 3.2e-280
Identity = 489/489 (100.00%), Postives = 489/489 (100.00%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60

Query: 61  NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120
           NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP
Sbjct: 61  NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120

Query: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL 180
           GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL
Sbjct: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL 180

Query: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS 240
           SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS
Sbjct: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS 240

Query: 241 TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD 300
           TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD
Sbjct: 241 TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD 300

Query: 301 PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL 360
           PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL
Sbjct: 301 PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL 360

Query: 361 YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA 420
           YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA
Sbjct: 361 YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA 420

Query: 421 LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI 480
           LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI
Sbjct: 421 LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI 480

Query: 481 TSMLLQAKR 490
           TSMLLQAKR
Sbjct: 481 TSMLLQAKR 489

BLAST of CsaV3_1G014370 vs. TrEMBL
Match: tr|A0A1S3CGD0|A0A1S3CGD0_CUCME (pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN=LOC103500594 PE=4 SV=1)

HSP 1 Score: 918.7 bits (2373), Expect = 5.6e-264
Identity = 464/489 (94.89%), Postives = 474/489 (96.93%), Query Frame = 0

Query: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNPAVNQFF 60
           M SIVSATSVSSILVKGNGGIGCQ TMVHFKANSRRRPPKNLLCPRRAKLPP+PAVNQF 
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDPAVNQFL 60

Query: 61  NNKTSAPSPPFTDLISSKIFQDEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120
           NNKTSAPSP FTDLISSKIFQDEHEEIHA+DYTKDTDVVWDSDEIEAISSLFQGRIPQKP
Sbjct: 61  NNKTSAPSPSFTDLISSKIFQDEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKP 120

Query: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDL 180
           GKLNRERPLPLPLPHKLRPPRLPNPKIRPTT VSSRALLSK+VYKRPDFLIGLAR IRDL
Sbjct: 121 GKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLARAIRDL 180

Query: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLAS 240
           SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRAL TFCW QEQ RLFPDDRVLAS
Sbjct: 181 SPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFPDDRVLAS 240

Query: 241 TVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLD 300
           TVEVLSRNHELKV VNLEEFTKLASRGVLEAMMRGFI+GGSLNLAWKLLVAAKKGKRMLD
Sbjct: 241 TVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAKKGKRMLD 300

Query: 301 PSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKL 360
           PSVYVKLILELGKNPDKN+LVLTLLEELGQREALKLNQQD TTI+KVCTRL KFEIAEKL
Sbjct: 301 PSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRKFEIAEKL 360

Query: 361 YSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVA 420
           Y WYVESGHEPS+VMYTALVHSRYSDRKYREALSLVWEMES NCPFDLPAY+VVIKLFVA
Sbjct: 361 YCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNVVIKLFVA 420

Query: 421 LGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDKQI 480
           LGDLSRAVRYFAKLKEAGFSPTY+VYRNMITIYLVSGRLAK KEIYKEAENAGF+MDKQI
Sbjct: 421 LGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAGFIMDKQI 480

Query: 481 TSMLLQAKR 490
           TSMLLQAKR
Sbjct: 481 TSMLLQAKR 489

BLAST of CsaV3_1G014370 vs. TrEMBL
Match: tr|A0A2N9HFH8|A0A2N9HFH8_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS38191 PE=4 SV=1)

HSP 1 Score: 604.0 bits (1556), Expect = 3.0e-169
Identity = 306/462 (66.23%), Postives = 375/462 (81.17%), Query Frame = 0

Query: 32  ANSRRRPPKNLLCPRRAKLPPNPAVNQFFNNKTSAPSPPFTDLISSKIFQDEHEEIHAHD 91
           ++++RR PKNL  PR  KLPP+  VN F   KT+ PS   TDLI+S + ++  E+     
Sbjct: 33  SSTKRRLPKNLRYPRSTKLPPDFGVNLFLKKKTTDPS--LTDLINSHLAEEGEEDTQ--- 92

Query: 92  YTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTT 151
             +DT +VWDSDEIEAISSLF+GRIPQKPGKLNR+RPL     +KLRP  LP PK    +
Sbjct: 93  -EEDTGIVWDSDEIEAISSLFRGRIPQKPGKLNRQRPLXXXXXYKLRPAGLPAPKKHVKS 152

Query: 152 V----VSSRALLSKQVYKRPDFLIGLAREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIK 211
           V    +SSRA LSKQ+YK P  LIG+AREI+ LS EE+VS +LN+W  FL+KGSLSLTI+
Sbjct: 153 VSPSALSSRASLSKQLYKNPGVLIGIAREIKSLSSEEDVSVILNKWASFLRKGSLSLTIR 212

Query: 212 ELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKVAVNLEEFTKLASRG 271
           ELGHMGLP+RAL TFCWAQ+Q +LFPDDR+LASTVEVL+RNHELKV  NLE+FT LASRG
Sbjct: 213 ELGHMGLPERALKTFCWAQKQPQLFPDDRILASTVEVLARNHELKVPFNLEKFTALASRG 272

Query: 272 VLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEE 331
           V+EAM+RGFIRGGSL+LA K+L+ AK GKRMLD SVY KLILELGKNPDK +LV+ LL+E
Sbjct: 273 VIEAMVRGFIRGGSLHLARKVLLIAKHGKRMLDSSVYAKLILELGKNPDKQLLVVALLDE 332

Query: 332 LGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDR 391
           LG+R+   L+QQDCT I+KVC RL KF+I E L++W+ +SGH+PS+VMYT L+HSRYS++
Sbjct: 333 LGERDDFNLSQQDCTAIMKVCIRLRKFDIVESLFNWFKQSGHDPSVVMYTTLIHSRYSEK 392

Query: 392 KYREALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYR 451
           KYREAL++VWEME+ NC FDLPAY VVI+LFVAL DLSRAVRYF+KLKEAGF PTY++YR
Sbjct: 393 KYREALAVVWEMEASNCLFDLPAYRVVIRLFVALSDLSRAVRYFSKLKEAGFCPTYDLYR 452

Query: 452 NMITIYLVSGRLAKCKEIYKEAENAGFMMDKQITSMLLQAKR 490
           ++I IY++SGRLAKCKE+ KEA  AGF +DK+ TS LLQ +R
Sbjct: 453 DLIKIYMISGRLAKCKEVCKEAGQAGFKLDKETTSWLLQFER 488

BLAST of CsaV3_1G014370 vs. TrEMBL
Match: tr|A0A2I4GWH4|A0A2I4GWH4_9ROSI (pentatricopeptide repeat-containing protein At2g01860 isoform X2 OS=Juglans regia OX=51240 GN=LOC109011476 PE=4 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 1.2e-162
Identity = 301/469 (64.18%), Postives = 362/469 (77.19%), Query Frame = 0

Query: 31  KANSRRRPPKNLLCPRRAKLPPNPAVNQFFNNKTSAPSPPFTDLISSKIFQDEHEEI--- 90
           ++ +RRRPPKNL  PR  K PPN  VN F   KTS  S   TD+  + +   +   +   
Sbjct: 30  RSKTRRRPPKNLRYPRHPKSPPNFGVNLFL-KKTSTNS---TDISLAYLIDGKKPRLAGK 89

Query: 91  --------------HAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPL 150
                               ++T + WDSDEIEAISSLFQGR+PQKPGKLNRERPL    
Sbjct: 90  KGXXXXXXXXXXXXXXXXXRQETGICWDSDEIEAISSLFQGRVPQKPGKLNRERPLXXXX 149

Query: 151 PHKLRPPRLPNPKIRPTT----VVSSRALLSKQVYKRPDFLIGLAREIRDLSPEENVSKV 210
            +KL P  LP PK    +    VVSSRA LSKQVYK P  LIG+AREI+ +S EE+VS V
Sbjct: 150 XYKLXPLGLPTPKKHVKSASPLVVSSRASLSKQVYKNPGVLIGIAREIKMISSEEDVSVV 209

Query: 211 LNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNH 270
           LN+W  FL+KGSLSLTI+ELGHMGLP+RAL TFCWAQ+Q +LFPDDR+LASTVEVL+RNH
Sbjct: 210 LNKWARFLRKGSLSLTIRELGHMGLPERALQTFCWAQKQTQLFPDDRILASTVEVLARNH 269

Query: 271 ELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLIL 330
           ELKV   L +FT LASRGV+EAM+RGFIRGGSL+LAWKLL  A+ GKRMLDPS+Y KLIL
Sbjct: 270 ELKVPFKLGKFTSLASRGVMEAMVRGFIRGGSLHLAWKLLSVARDGKRMLDPSIYAKLIL 329

Query: 331 ELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGH 390
           ELGKNPDK+MLV++LL+ELG+RE L L+QQDCT I+K+C RLGKF++ + L++W+ +SG+
Sbjct: 330 ELGKNPDKHMLVVSLLDELGEREDLNLSQQDCTAIMKICIRLGKFDVVDGLFNWFKQSGY 389

Query: 391 EPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVR 450
           EPS+VMYT L+HS YS+RKYREAL+LVWEME+ NC  DLPAY VVIKLFVAL D+SRAVR
Sbjct: 390 EPSVVMYTTLIHSHYSERKYREALALVWEMEASNCLLDLPAYRVVIKLFVALNDISRAVR 449

Query: 451 YFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDK 479
           YF+KLKEAGFSPTY++YR +I IY+VSGRLAKCKE+ KEAE AGF +DK
Sbjct: 450 YFSKLKEAGFSPTYDMYRELIKIYMVSGRLAKCKEVCKEAEIAGFKLDK 494

BLAST of CsaV3_1G014370 vs. TrEMBL
Match: tr|A0A2I4GWH7|A0A2I4GWH7_9ROSI (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Juglans regia OX=51240 GN=LOC109011476 PE=4 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 1.2e-162
Identity = 301/469 (64.18%), Postives = 362/469 (77.19%), Query Frame = 0

Query: 31  KANSRRRPPKNLLCPRRAKLPPNPAVNQFFNNKTSAPSPPFTDLISSKIFQDEHEEI--- 90
           ++ +RRRPPKNL  PR  K PPN  VN F   KTS  S   TD+  + +   +   +   
Sbjct: 84  RSKTRRRPPKNLRYPRHPKSPPNFGVNLFL-KKTSTNS---TDISLAYLIDGKKPRLAGK 143

Query: 91  --------------HAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPL 150
                               ++T + WDSDEIEAISSLFQGR+PQKPGKLNRERPL    
Sbjct: 144 KGXXXXXXXXXXXXXXXXXRQETGICWDSDEIEAISSLFQGRVPQKPGKLNRERPLXXXX 203

Query: 151 PHKLRPPRLPNPKIRPTT----VVSSRALLSKQVYKRPDFLIGLAREIRDLSPEENVSKV 210
            +KL P  LP PK    +    VVSSRA LSKQVYK P  LIG+AREI+ +S EE+VS V
Sbjct: 204 XYKLXPLGLPTPKKHVKSASPLVVSSRASLSKQVYKNPGVLIGIAREIKMISSEEDVSVV 263

Query: 211 LNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNH 270
           LN+W  FL+KGSLSLTI+ELGHMGLP+RAL TFCWAQ+Q +LFPDDR+LASTVEVL+RNH
Sbjct: 264 LNKWARFLRKGSLSLTIRELGHMGLPERALQTFCWAQKQTQLFPDDRILASTVEVLARNH 323

Query: 271 ELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAKKGKRMLDPSVYVKLIL 330
           ELKV   L +FT LASRGV+EAM+RGFIRGGSL+LAWKLL  A+ GKRMLDPS+Y KLIL
Sbjct: 324 ELKVPFKLGKFTSLASRGVMEAMVRGFIRGGSLHLAWKLLSVARDGKRMLDPSIYAKLIL 383

Query: 331 ELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGKFEIAEKLYSWYVESGH 390
           ELGKNPDK+MLV++LL+ELG+RE L L+QQDCT I+K+C RLGKF++ + L++W+ +SG+
Sbjct: 384 ELGKNPDKHMLVVSLLDELGEREDLNLSQQDCTAIMKICIRLGKFDVVDGLFNWFKQSGY 443

Query: 391 EPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSVVIKLFVALGDLSRAVR 450
           EPS+VMYT L+HS YS+RKYREAL+LVWEME+ NC  DLPAY VVIKLFVAL D+SRAVR
Sbjct: 444 EPSVVMYTTLIHSHYSERKYREALALVWEMEASNCLLDLPAYRVVIKLFVALNDISRAVR 503

Query: 451 YFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAGFMMDK 479
           YF+KLKEAGFSPTY++YR +I IY+VSGRLAKCKE+ KEAE AGF +DK
Sbjct: 504 YFSKLKEAGFSPTYDMYRELIKIYMVSGRLAKCKEVCKEAEIAGFKLDK 548

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004139567.14.9e-280100.00PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativu... [more]
XP_008462173.18.4e-26494.89PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] ... [more]
XP_022951807.14.5e-22580.24pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita mosc... [more]
XP_023537574.11.0e-22479.84pentatricopeptide repeat-containing protein At2g01860 [Cucurbita pepo subsp. pep... [more]
XP_023001961.11.1e-22379.84pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita maxi... [more]
Match NameE-valueIdentityDescription
AT2G01860.19.6e-12952.52Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G25630.23.3e-1228.23Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G48730.14.2e-0723.64Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G18940.11.3e-0523.42Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G20300.11.9e-0424.81Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q5XET4|PP142_ARATH1.7e-12752.52Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX... [more]
sp|Q8GZ63|PP397_ARATH6.0e-1128.23Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX... [more]
sp|Q9FKC3|PP424_ARATH7.6e-0623.64Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop... [more]
sp|O64624|PP163_ARATH2.4e-0423.42Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LVM0|A0A0A0LVM0_CUCSA3.2e-280100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1[more]
tr|A0A1S3CGD0|A0A1S3CGD0_CUCME5.6e-26494.89pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A2N9HFH8|A0A2N9HFH8_FAGSY3.0e-16966.23Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS38191 PE=4 SV=1[more]
tr|A0A2I4GWH4|A0A2I4GWH4_9ROSI1.2e-16264.18pentatricopeptide repeat-containing protein At2g01860 isoform X2 OS=Juglans regi... [more]
tr|A0A2I4GWH7|A0A2I4GWH7_9ROSI1.2e-16264.18pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Juglans regi... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G014370.1CsaV3_1G014370.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 395..452
e-value: 0.0017
score: 18.3
coord: 330..380
e-value: 0.011
score: 15.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 410..442
e-value: 0.001
score: 17.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 265..299
score: 5.382
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 442..476
score: 7.092
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 372..406
score: 7.487
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 407..441
score: 9.986
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 337..371
score: 8.813
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 173..317
e-value: 2.1E-7
score: 32.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 318..489
e-value: 2.1E-26
score: 95.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 116..148
NoneNo IPR availablePANTHERPTHR24015:SF642SUBFAMILY NOT NAMEDcoord: 27..488
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 27..488