CsGy7G018360.1 (mRNA) Cucumber (Gy14) v2

NameCsGy7G018360.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionpentatricopeptide repeat-containing protein At1g80880, mitochondrial
LocationChr7 : 21430959 .. 21434291 (+)
Sequence length2469
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGTAGATTTTCCATTTTAGTTTGTCAAATAATTTTTTTTTAAAAGTAAGTGAGAAAAACTAAAGGACTAAGAAATTTAATTCTACAACTATAACTTATGGTATATAAGTTAGACTTAAAATATATACGTAGGTATTCCTCTTTTGCACGGCCACTACTAAAAGGTTTATAAACACAATTGAGCTTCACTTTCCCCAATATATATATATTAAGGGATTGTAATTCAAATTTTACCGCGCAGCGAAACTCCCTCTCTGTATCTTCAGTCGTTTTTCATGGTAATCCATGGCAAGCCTTTCTTCTATTGCCAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCATCTCCTTTATCTCAACCGTTTACGTATCCCCGATTCCCCCTTTCAGGCATTTCATCAAACATTTAGTCTGCATTCTTTCTTCGCTCGTCAATTCTCAGCTCTTCCATCTTTTTCTCAAAAACTTGGCGACCCATTTCTGTTTGACACAGGAAGATTCCAAAACTATCGCCAGAGTGACGCGTGTAATGCCCGATTCATCGAATTGTTCAAACGGGTCGCTCTTTTGCCATCGGAAGTGGAGGCTGTTGCTGCATTGGACGAGTTTGATGTCAAGGCAGATTTGGATTTGGTTTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCCCTTCTAGCATTCAAATGGGGTGAGAAAGTGGGAGCCATTGATGAAGAGATTTGTAATTTGATGATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTCTTTGATCAGAGAATTGCACGGATCCTTGCTGAATTCGATGCAAGCGATGCTTGTCATGATTGATAGGTGAAGTTTATTTTCGCATATCTTCATTTATTCTACACTTGATTGTTGATATTCTGATATGCCATTTGTAACTATTGTCAAAGTCGTTTTGCATTTTGACTTTATTCGATAGCTTAGTTTATATGCTTAAGTGTAGAAAGCTAAGACGTATACTACATGACGGCATGCTATATTTTAAAAGTCTAGGAATGACACGACAAGGAAACTTTTATCAAAAGATCTTTTTTTAATGCATTCTGTTAATTGGTTTCTTACTGATTTATGTAGTAAATGAGAAAATGAAGGGAGATTACATCCATAGTTATCTTTTCGATTTATATGCTGCTAACAAGGAAGTTCCTGGTTTAAGGTATGCATATGCAAATGAGGCAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAGTTCAGATTGACACCGGATCAAGAGGCTTTTCACGTGCTTCTCAATTCTCTCTGTAAATATGGGAATATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGGAACAGAAAGCTTTAATATTATTCTCAACGGTTGGTGCAATGTAACTGTTGATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATGTCTAAATGTTGCATTTTACCAGATTCAACTTCTTATACCCACATGATTTCCTGTTTTTCGAAGAATGGGAACCTTTTCGACTCGCTTAGATTCTATGATCAGATGAAGAAAAGGGATTGGATTCCAAGCGTCGAAGTCTATAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAATGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAGTGGGCTTGCGGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGGCGGGAAAGCTGGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCAGTCCGACGATCGAAACCTACCATTCTTTTATTCAGGCTGCAGATTCTAAAATGAGCTTTGAACTTCTTAAGCGGATGAGACAAGATGGTTTGGGTCCTACAGAGGGTACCTTTCTTATCATGTTTAATAAGTCATTTGAATTAGAAGAACCGGAGTATGCATTGAATGTGTGGGTAGAAATGAAGCGGTACGAGGTATTTCCGAGTTGTGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATGTGGTCACTTAAAAAAGGCCAGGGAATTATATGACGAAATGATATTACATGGATTTATCGCACATCCAAAGATTAAAACGCTTCTGAAGGAACCAGATTTAGGTAGCATTGACGAAGCAAGGCAGCAAGTGAGACACAACAACAAAGGTAAGTTCATTCCTCATAGGAAAGGGAGAACGATGAGGTGGAAATCACATAAACAACGATCTAAAGGGGCTGCATCATTTGAATAGGTAATTTATCTGTATACGTGTTACTCGAATAAATTGAATCATATTCTATGTTCTGCTAGTATTGTATAGGTCCTATAGGAGATGAGCTAGATATCAATCTGATCATTCTGCTCGATACTCCTTTCACAATCTTTCAATCAGCACCCTTTAGTCCATAATTTTATGCTTGTGTAGAAAAAAATCATTAGAGGTTCATGGGTGCCTTCATGTGAGGGTTACAATTGAGGTTTTCGATTAGATCTCGTCAGGATGTGAACACAGTTCCACTTAAGTTAGAAATATACAAAATCCTTGGTATAAAAATAAGGAGAACTTCTTGTTAGGAATAGGATGAATCCTTGTCCTAACAAATTAATTATTCACTCTTCCATTGAATGCAGGATGACTTAATAATCTTGCAGCCCATATAAAAGTTAGATAGATTTCTGATTGAATGATGGGGGAGGTATTTTTTAAGATTGAGATTTAAATCTCTGATTTACTATTTGAACTATGTTTAAGTTGAGTAAGATCTTACATATTTGCTTTTTGGATGAACTGTTGACTGTTTGTCTTTTATCTGAATCCTTTGTCTGATAAGGAACAAGAACTCATTGTGGAAGCAGGTGAAAAAGCATTCCTTCCTTCGCTTTGGCACAAAGCTCGAAGAGAACACTCGCATGGGCTAGAGAAAAGATACGACATCGTTTTCCTTTTTTTCTTTTTCCTCTTTTGAAGGTATAAGCTGTTTGGCCTGGCAGTTGGATGATCGAGAAAGTTTAATTCTCAGTTGTCCTTTTTGTAGCTTCATTTGTTGGAATTGGCCACACTATAAAAGGTTATAAATTCTCACCAGAAGGGCAACCTTTTCAAGCATGGACACAGTTTTAAGGATTTTTCTTTTCATTGGAAAGATAATTAAGAGAATATTTCTTGTTTGTATAAATTAAAGTTGGGAAGTTTTTGTTTGGATCTTATAAATATGAACAAGTGTCAAAAAAACAAAAGATTAAATTAGAGTGAAAATATTTGAACTACATATCTATTGATTTTTAGCTTGAGCTAAAACTTTAATTTGAC

mRNA sequence

CGTAGATTTTCCATTTTAGTTTGTCAAATAATTTTTTTTTAAAAGTAAGTGAGAAAAACTAAAGGACTAAGAAATTTAATTCTACAACTATAACTTATGGTATATAAGTTAGACTTAAAATATATACGTAGGTATTCCTCTTTTGCACGGCCACTACTAAAAGGTTTATAAACACAATTGAGCTTCACTTTCCCCAATATATATATATTAAGGGATTGTAATTCAAATTTTACCGCGCAGCGAAACTCCCTCTCTGTATCTTCAGTCGTTTTTCATGGTAATCCATGGCAAGCCTTTCTTCTATTGCCAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCATCTCCTTTATCTCAACCGTTTACGTATCCCCGATTCCCCCTTTCAGGCATTTCATCAAACATTTAGTCTGCATTCTTTCTTCGCTCGTCAATTCTCAGCTCTTCCATCTTTTTCTCAAAAACTTGGCGACCCATTTCTGTTTGACACAGGAAGATTCCAAAACTATCGCCAGAGTGACGCGTGTAATGCCCGATTCATCGAATTGTTCAAACGGGTCGCTCTTTTGCCATCGGAAGTGGAGGCTGTTGCTGCATTGGACGAGTTTGATGTCAAGGCAGATTTGGATTTGGTTTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCCCTTCTAGCATTCAAATGGGGTGAGAAAGTGGGAGCCATTGATGAAGAGATTTGTAATTTGATGATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTCTTTGATCAGAGAATTGCACGGATCCTTGCTGAATTCGATGCAAGCGATGCTTGTCATGATTGATAGGTATGCATATGCAAATGAGGCAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAGTTCAGATTGACACCGGATCAAGAGGCTTTTCACGTGCTTCTCAATTCTCTCTGTAAATATGGGAATATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGGAACAGAAAGCTTTAATATTATTCTCAACGGTTGGTGCAATGTAACTGTTGATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATGTCTAAATGTTGCATTTTACCAGATTCAACTTCTTATACCCACATGATTTCCTGTTTTTCGAAGAATGGGAACCTTTTCGACTCGCTTAGATTCTATGATCAGATGAAGAAAAGGGATTGGATTCCAAGCGTCGAAGTCTATAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAATGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAGTGGGCTTGCGGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGGCGGGAAAGCTGGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCAGTCCGACGATCGAAACCTACCATTCTTTTATTCAGGCTGCAGATTCTAAAATGAGCTTTGAACTTCTTAAGCGGATGAGACAAGATGGTTTGGGTCCTACAGAGGGTACCTTTCTTATCATGTTTAATAAGTCATTTGAATTAGAAGAACCGGAGTATGCATTGAATGTGTGGGTAGAAATGAAGCGGTACGAGGTATTTCCGAGTTGTGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATGTGGTCACTTAAAAAAGGCCAGGGAATTATATGACGAAATGATATTACATGGATTTATCGCACATCCAAAGATTAAAACGCTTCTGAAGGAACCAGATTTAGGTAGCATTGACGAAGCAAGGCAGCAAGTGAGACACAACAACAAAGGTAAGTTCATTCCTCATAGGAAAGGGAGAACGATGAGGTGGAAATCACATAAACAACGATCTAAAGGGGCTGCATCATTTGAATAGGATGACTTAATAATCTTGCAGCCCATATAAAAGTTAGATAGATTTCTGATTGAATGATGGGGGAGGAACAAGAACTCATTGTGGAAGCAGGTGAAAAAGCATTCCTTCCTTCGCTTTGGCACAAAGCTCGAAGAGAACACTCGCATGGGCTAGAGAAAAGATACGACATCGTTTTCCTTTTTTTCTTTTTCCTCTTTTGAAGGTATAAGCTGTTTGGCCTGGCAGTTGGATGATCGAGAAAGTTTAATTCTCAGTTGTCCTTTTTGTAGCTTCATTTGTTGGAATTGGCCACACTATAAAAGGTTATAAATTCTCACCAGAAGGGCAACCTTTTCAAGCATGGACACAGTTTTAAGGATTTTTCTTTTCATTGGAAAGATAATTAAGAGAATATTTCTTGTTTGTATAAATTAAAGTTGGGAAGTTTTTGTTTGGATCTTATAAATATGAACAAGTGTCAAAAAAACAAAAGATTAAATTAGAGTGAAAATATTTGAACTACATATCTATTGATTTTTAGCTTGAGCTAAAACTTTAATTTGAC

Coding sequence (CDS)

ATGGCAAGCCTTTCTTCTATTGCCAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCATCTCCTTTATCTCAACCGTTTACGTATCCCCGATTCCCCCTTTCAGGCATTTCATCAAACATTTAGTCTGCATTCTTTCTTCGCTCGTCAATTCTCAGCTCTTCCATCTTTTTCTCAAAAACTTGGCGACCCATTTCTGTTTGACACAGGAAGATTCCAAAACTATCGCCAGAGTGACGCGTGTAATGCCCGATTCATCGAATTGTTCAAACGGGTCGCTCTTTTGCCATCGGAAGTGGAGGCTGTTGCTGCATTGGACGAGTTTGATGTCAAGGCAGATTTGGATTTGGTTTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCCCTTCTAGCATTCAAATGGGGTGAGAAAGTGGGAGCCATTGATGAAGAGATTTGTAATTTGATGATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTCTTTGATCAGAGAATTGCACGGATCCTTGCTGAATTCGATGCAAGCGATGCTTGTCATGATTGATAGGTATGCATATGCAAATGAGGCAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAGTTCAGATTGACACCGGATCAAGAGGCTTTTCACGTGCTTCTCAATTCTCTCTGTAAATATGGGAATATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGGAACAGAAAGCTTTAATATTATTCTCAACGGTTGGTGCAATGTAACTGTTGATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATGTCTAAATGTTGCATTTTACCAGATTCAACTTCTTATACCCACATGATTTCCTGTTTTTCGAAGAATGGGAACCTTTTCGACTCGCTTAGATTCTATGATCAGATGAAGAAAAGGGATTGGATTCCAAGCGTCGAAGTCTATAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAATGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAGTGGGCTTGCGGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGGCGGGAAAGCTGGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCAGTCCGACGATCGAAACCTACCATTCTTTTATTCAGGCTGCAGATTCTAAAATGAGCTTTGAACTTCTTAAGCGGATGAGACAAGATGGTTTGGGTCCTACAGAGGGTACCTTTCTTATCATGTTTAATAAGTCATTTGAATTAGAAGAACCGGAGTATGCATTGAATGTGTGGGTAGAAATGAAGCGGTACGAGGTATTTCCGAGTTGTGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATGTGGTCACTTAAAAAAGGCCAGGGAATTATATGACGAAATGATATTACATGGATTTATCGCACATCCAAAGATTAAAACGCTTCTGAAGGAACCAGATTTAGGTAGCATTGACGAAGCAAGGCAGCAAGTGAGACACAACAACAAAGGTAAGTTCATTCCTCATAGGAAAGGGAGAACGATGAGGTGGAAATCACATAAACAACGATCTAAAGGGGCTGCATCATTTGAATAG

Protein sequence

MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGPTEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSKGAASFE
BLAST of CsGy7G018360.1 vs. NCBI nr
Match: XP_011659500.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis sativus] >XP_011659501.1 PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis sativus] >KGN45215.1 hypothetical protein Csa_7G431960 [Cucumis sativus])

HSP 1 Score: 876.7 bits (2264), Expect = 4.1e-251
Identity = 546/546 (100.00%), Postives = 546/546 (100.00%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ
Sbjct: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS
Sbjct: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
           MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF
Sbjct: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
           TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY
Sbjct: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540

Query: 541 GAASFE 547
           GAASFE
Sbjct: 541 GAASFE 546

BLAST of CsGy7G018360.1 vs. NCBI nr
Match: XP_008461183.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis melo] >XP_016902675.1 PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis melo] >XP_016902676.1 PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis melo])

HSP 1 Score: 797.7 bits (2059), Expect = 2.4e-227
Identity = 511/546 (93.59%), Postives = 518/546 (94.87%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MA LSSIARRLCRIHPLPFHHLLYLNRL I DSPFQAF QT  L S FA QFSALPSFSQ
Sbjct: 1   MACLSSIARRLCRIHPLPFHHLLYLNRLSIRDSPFQAFRQTLCLRSLFAHQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           K+GD F FDTGRF+NYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDV+AD DLVYS
Sbjct: 61  KVGDQFQFDTGRFKNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVQADSDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSS LAFKWGEK GAIDEEICNLMIWVLGNHKKFSTAW LIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGAIDEEICNLMIWVLGNHKKFSTAWCLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFMF
Sbjct: 181 RQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPL TESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
            E TFLIMFNKSFELE+PEYALN WVEMKRY+VFPS EHYSVLIQGLATCGHLKKARELY
Sbjct: 421 IEVTFLIMFNKSFELEQPEYALNAWVEMKRYKVFPSSEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPD GSIDEARQQVRHN KGKF+ HRKG TMRWKSHKQ+SK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDSGSIDEARQQVRHNKKGKFLSHRKGSTMRWKSHKQQSK 540

Query: 541 GAASFE 547
             ASFE
Sbjct: 541 RDASFE 546

BLAST of CsGy7G018360.1 vs. NCBI nr
Match: XP_023547792.1 (pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023547793.1 pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 669.5 bits (1726), Expect = 9.9e-189
Identity = 452/546 (82.78%), Postives = 484/546 (88.64%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MA LSSIAR L R  PLPF  LL L   +IPDSPFQAFHQT  L S  ARQFSALP  SQ
Sbjct: 1   MACLSSIARGLSRNSPLPFRQLLQLINFQIPDSPFQAFHQTLHLPSQSARQFSALPFCSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           K+  PF FDT RFQN+R +DA +A+F+EL KR A LPSEVEA+AAL EFDV+AD +LVYS
Sbjct: 61  KVAHPFHFDTVRFQNHRPNDARSAQFVELLKRAARLPSEVEAIAALGEFDVQADPNLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSS LAFKWGEK G+IDEEICNLMIWVLGNHKKFSTAW LIRE HGSLLNS
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLMIWVLGNHKKFSTAWCLIREFHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYA+ANEASKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFM 
Sbjct: 181 RQAMLVMIDRYAHANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFML 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPL TESFNIILNGWCNV+VDVFEAKRIWREMSKCCILPDSTSYT MISCFS+ G
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVSVDVFEAKRIWREMSKCCILPDSTSYTLMISCFSRTG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ADS+ SFELLK+MRQ+GLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGADSETSFELLKQMRQNGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
           TE TF+IMFNKSFELE+P+YAL  W EMKRYE+ P+ EHY+VL+QGLAT G LK+ARELY
Sbjct: 421 TEATFVIMFNKSFELEQPDYALKAWAEMKRYEISPNSEHYAVLVQGLATYGLLKQARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           D+M  HG+I HPKIK L+KEP+L SI+EA QQVRHN KGKF  HRKG  MRWKSHKQ+S+
Sbjct: 481 DQMTSHGYILHPKIKMLVKEPELRSIEEATQQVRHNKKGKFF-HRKGSMMRWKSHKQQSR 540

Query: 541 GAASFE 547
             ASFE
Sbjct: 541 DDASFE 545

BLAST of CsGy7G018360.1 vs. NCBI nr
Match: XP_022992478.1 (pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita maxima])

HSP 1 Score: 663.3 bits (1710), Expect = 7.1e-187
Identity = 451/546 (82.60%), Postives = 482/546 (88.28%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MA LSSIAR L R  PLPF  LL L   +IP SP QAFHQT  L S  ARQFSALP  SQ
Sbjct: 1   MACLSSIARGLSRNTPLPFRQLLQLINFQIPHSPSQAFHQTLLLPSQSARQFSALPFCSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           K+  PF FDT RFQN+RQ+DA +A+F+EL KR A LPSEVEA+AAL EFDV+AD +LVYS
Sbjct: 61  KVAHPFHFDTLRFQNHRQNDARSAQFVELLKRAARLPSEVEAIAALGEFDVQADPNLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSS LAFKWGEK G+IDEEICNLMIWVLGNHKKFSTAW LIRE HGSLLNS
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLMIWVLGNHKKFSTAWCLIREFHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
            QAM VMIDRYA+ANEASKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFM 
Sbjct: 181 RQAMHVMIDRYAHANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFML 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPL TESFNIILNGWCNV+VDVFEAKRIWREMSKCCILPDSTSYT MISCFS+ G
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVSVDVFEAKRIWREMSKCCILPDSTSYTLMISCFSRTG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ADS+MSFELLK+MRQ+GLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGADSEMSFELLKQMRQNGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
           TE TF+IMFNKSFELE+P+YAL  W EMKRYE+ P+ EHY VLIQGLAT G LK+ARELY
Sbjct: 421 TEATFVIMFNKSFELEQPDYALKAWEEMKRYEILPNSEHYEVLIQGLATYGLLKQARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           D+M  HGFI HPKIK L+K+P+L SI+E+ QQVRHN KGKF  HRKG  MRWKSHKQ+S+
Sbjct: 481 DQMTSHGFILHPKIKMLVKQPELRSIEESTQQVRHNKKGKFF-HRKGSMMRWKSHKQQSR 540

Query: 541 GAASFE 547
             ASFE
Sbjct: 541 DDASFE 545

BLAST of CsGy7G018360.1 vs. NCBI nr
Match: XP_022953169.1 (pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita moschata] >XP_022953170.1 pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita moschata])

HSP 1 Score: 662.9 bits (1709), Expect = 9.3e-187
Identity = 451/546 (82.60%), Postives = 481/546 (88.10%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MA LSSIAR L R  PLPF  LL L   +IPDSP QAFHQT  L S  ARQFSALP  SQ
Sbjct: 1   MACLSSIARGLSRNTPLPFRQLLQLISFKIPDSPSQAFHQTLLLPSQSARQFSALPFCSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           K+  PF FDT RFQN+R +DA +A F+EL KR A LPSEVEA+AAL EFDV+AD +LVYS
Sbjct: 61  KVAHPFHFDTVRFQNHRPNDARSAEFVELLKRAARLPSEVEAIAALGEFDVQADSNLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSS LAFKWGEK G+IDEEICNLMIWVLGNHKKFSTAW LIRE HGSLLNS
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLMIWVLGNHKKFSTAWCLIREFHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYA+ANEASKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFM 
Sbjct: 181 RQAMLVMIDRYAHANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFML 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPL TESFNIILNGWCNV+VDVFEAKRIWREMSKCCILPDSTSYT MISCFS+ G
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVSVDVFEAKRIWREMSKCCILPDSTSYTLMISCFSRTG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ADS+MSFELLK++RQ+GLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGADSEMSFELLKQIRQNGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
           TE TF+IMFNKSFELE+P+YAL  W EMKRYE+ P+ EHY+VLIQGLAT G LK+ARELY
Sbjct: 421 TEATFIIMFNKSFELEQPDYALKAWEEMKRYEISPNSEHYAVLIQGLATYGLLKQARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           D+M  HGFI HPKIK L+KEP+L SI+EA QQV HN KGKF  HRKG  MRWKSHKQ+S+
Sbjct: 481 DQMTSHGFILHPKIKMLVKEPELRSIEEATQQVTHNKKGKFF-HRKGSMMRWKSHKQQSR 540

Query: 541 GAASFE 547
             AS E
Sbjct: 541 DDASIE 545

BLAST of CsGy7G018360.1 vs. TAIR10
Match: AT1G80880.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 298.1 bits (762), Expect = 1.1e-80
Identity = 266/477 (55.77%), Postives = 337/477 (70.65%), Query Frame = 0

Query: 37  AFHQTFSLHSFFARQFSALPSF--SQKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVA 96
           AFH+   +HS   +  S LP F  S +     + +T    N           I+L ++V+
Sbjct: 47  AFHRAGHVHS---QVLSYLPHFASSNRFSTKTISETFDI-NLTALAPLEKGLIDLIRQVS 106

Query: 97  LLPSEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMI 156
            L SE +A+A+L++     + D  YS IW LRD+W+ + LAFKWGEK G  D++ C+LMI
Sbjct: 107 ELESEADAMASLEDSSFDLNHDSFYSLIWELRDEWRLAFLAFKWGEKRGCDDQKSCDLMI 166

Query: 157 WVLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTP 216
           WVLGNH+KF+ AW LIR++     ++ +AM +M+DRYA AN+ S+AI+TF +M+KF+ TP
Sbjct: 167 WVLGNHQKFNIAWCLIRDMFNVSKDTRKAMFLMMDRYAAANDTSQAIRTFDIMDKFKHTP 226

Query: 217 DQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIW 276
             EAF  LL +LC++G+IE+AEEFM  +KKLFP+  E FN+ILNGWCN+  DV EAKRIW
Sbjct: 227 YDEAFQGLLCALCRHGHIEKAEEFMLASKKLFPVDVEGFNVILNGWCNIWTDVTEAKRIW 286

Query: 277 REMSKCCILPDSTSYTHMISCFSKNGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 336
           REM   CI P+  SY+HMISCFSK G   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 287 REMGNYCITPNKDSYSHMISCFSKVGNLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 346

Query: 337 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 396
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 347 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 406

Query: 397 XXXXXAADSKMSFELLKRMRQDGLGPTEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVF 456
           XXXXX                   GPTE TFL++  K F+ ++PE AL +W EM R+E+ 
Sbjct: 407 XXXXXXXXXXXXXXXXXXXXXXXXGPTEETFLLILGKLFKGKQPENALKIWAEMDRFEIV 466

Query: 457 PSCEHYSVLIQGLATCGHLKKARELYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQ 512
            +   Y   IQGL +CG L+KARE+Y EM   GF+ +P ++ LL+E  +  + ++++
Sbjct: 467 ANPALYLATIQGLLSCGWLEKAREIYSEMKSKGFVGNPMLQKLLEEQKVKGVRKSKR 519

BLAST of CsGy7G018360.1 vs. TAIR10
Match: AT5G15010.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 155.6 bits (392), Expect = 8.7e-38
Identity = 81/200 (40.50%), Postives = 119/200 (59.50%), Query Frame = 0

Query: 106 LDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEI--CNLMIWVLGNHKKF 165
           L+E DVK   +LV   +  +R+DW+++   F W  K       +   + MI +LG  +KF
Sbjct: 118 LEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKF 177

Query: 166 STAWSLI---RELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH 225
            TAW+LI   R+   SL+NS Q +L+MI +Y   ++  KAI TFH  ++F+L    + F 
Sbjct: 178 DTAWTLIDEMRKFSPSLVNS-QTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQ 237

Query: 226 VLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKC 285
            LL++LC+Y N+ +A   +F NK  +P   +SFNI+LNGWCNV     EA+R+W EM   
Sbjct: 238 SLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNV 297

Query: 286 CILPDSTSYTHMISCFSKNG 301
            +  D  SY+ MISC+SK G
Sbjct: 298 GVKHDVVSYSSMISCYSKGG 316

BLAST of CsGy7G018360.1 vs. TAIR10
Match: AT5G65820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 77.8 bits (190), Expect = 2.3e-14
Identity = 41/134 (30.60%), Postives = 71/134 (52.99%), Query Frame = 0

Query: 148 EICNLMIWVLGNHKKFSTAWSLIREL--HGSLLNSMQAMLVMIDRYAYANEASKAIKTFH 207
           E+   M+ +L   ++F   W LI E+      L   +  +V++ R+A A+   KAI+   
Sbjct: 148 EVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLD 207

Query: 208 MMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTV 267
            M KF   PD+  F  LL++LCK+G++++A +     +  FP+    F  +L GWC V  
Sbjct: 208 EMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVG- 267

Query: 268 DVFEAKRIWREMSK 280
            + EAK +  +M++
Sbjct: 268 KMMEAKYVLVQMNE 280

BLAST of CsGy7G018360.1 vs. TAIR10
Match: AT5G11310.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 70.1 bits (170), Expect = 4.8e-12
Identity = 54/189 (28.57%), Postives = 91/189 (48.15%), Query Frame = 0

Query: 104 AALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGE-KVG-AIDEEICNLMIWVLGNHK 163
           +ALDE  ++  ++LV++    L          FKW E K G  +   + + ++  L   +
Sbjct: 90  SALDETGIEPSVELVHALFDRLSSSPMLLHSVFKWAEMKPGFTLSPSLFDSVVNSLCKAR 149

Query: 164 KFSTAWSL----IRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKF----RLT 223
           +F  AWSL    +R   GS L S    +V+I RYA A    +AI+ F     +    +  
Sbjct: 150 EFEIAWSLVFDRVRSDEGSNLVSADTFIVLIRRYARAGMVQQAIRAFEFARSYEPVCKSA 209

Query: 224 PDQEAFHVLLNSLCKYGNIEEAEEFM-----FVNKKLFPLGTESFNIILNGWCNVTVDVF 278
            +     VLL++LCK G++ EA  ++      ++    P     FNI+LNGW   +  + 
Sbjct: 210 TELRLLEVLLDALCKEGHVREASMYLERIGGTMDSNWVP-SVRIFNILLNGWFR-SRKLK 269

BLAST of CsGy7G018360.1 vs. TAIR10
Match: AT1G20300.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 67.4 bits (163), Expect = 3.1e-11
Identity = 42/150 (28.00%), Postives = 74/150 (49.33%), Query Frame = 0

Query: 132 SLLAFKWGEKVGAIDEEI---CNLMIWVLGNHKKFSTAWSLIRELHGSLLN-SMQAMLVM 191
           SL  F W       D +     N MI + G  ++F  AW LI  +    +  S++   ++
Sbjct: 133 SLAFFNWATSRDDYDHKSPHPYNEMIDLSGKVRQFDLAWHLIDLMKSRNVEISIETFTIL 192

Query: 192 IDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFP 251
           I RY  A  AS+A+  F+ ME +   PD+ AF +++++L +     EA+ F    K  F 
Sbjct: 193 IRRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRFE 252

Query: 252 LGTESFNIILNGWCNVTVDVFEAKRIWREM 278
                +  ++ GWC    ++ EA+++++EM
Sbjct: 253 PDVIVYTNLVRGWCRAG-EISEAEKVFKEM 281

BLAST of CsGy7G018360.1 vs. Swiss-Prot
Match: sp|Q9SAH2|PP137_ARATH (Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g80880 PE=2 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 2.0e-79
Identity = 266/477 (55.77%), Postives = 337/477 (70.65%), Query Frame = 0

Query: 37  AFHQTFSLHSFFARQFSALPSF--SQKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVA 96
           AFH+   +HS   +  S LP F  S +     + +T    N           I+L ++V+
Sbjct: 47  AFHRAGHVHS---QVLSYLPHFASSNRFSTKTISETFDI-NLTALAPLEKGLIDLIRQVS 106

Query: 97  LLPSEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMI 156
            L SE +A+A+L++     + D  YS IW LRD+W+ + LAFKWGEK G  D++ C+LMI
Sbjct: 107 ELESEADAMASLEDSSFDLNHDSFYSLIWELRDEWRLAFLAFKWGEKRGCDDQKSCDLMI 166

Query: 157 WVLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTP 216
           WVLGNH+KF+ AW LIR++     ++ +AM +M+DRYA AN+ S+AI+TF +M+KF+ TP
Sbjct: 167 WVLGNHQKFNIAWCLIRDMFNVSKDTRKAMFLMMDRYAAANDTSQAIRTFDIMDKFKHTP 226

Query: 217 DQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIW 276
             EAF  LL +LC++G+IE+AEEFM  +KKLFP+  E FN+ILNGWCN+  DV EAKRIW
Sbjct: 227 YDEAFQGLLCALCRHGHIEKAEEFMLASKKLFPVDVEGFNVILNGWCNIWTDVTEAKRIW 286

Query: 277 REMSKCCILPDSTSYTHMISCFSKNGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 336
           REM   CI P+  SY+HMISCFSK G   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 287 REMGNYCITPNKDSYSHMISCFSKVGNLFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 346

Query: 337 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 396
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 347 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 406

Query: 397 XXXXXAADSKMSFELLKRMRQDGLGPTEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVF 456
           XXXXX                   GPTE TFL++  K F+ ++PE AL +W EM R+E+ 
Sbjct: 407 XXXXXXXXXXXXXXXXXXXXXXXXGPTEETFLLILGKLFKGKQPENALKIWAEMDRFEIV 466

Query: 457 PSCEHYSVLIQGLATCGHLKKARELYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQ 512
            +   Y   IQGL +CG L+KARE+Y EM   GF+ +P ++ LL+E  +  + ++++
Sbjct: 467 ANPALYLATIQGLLSCGWLEKAREIYSEMKSKGFVGNPMLQKLLEEQKVKGVRKSKR 519

BLAST of CsGy7G018360.1 vs. Swiss-Prot
Match: sp|Q9LFQ4|PP383_ARATH (Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g15010 PE=2 SV=2)

HSP 1 Score: 155.6 bits (392), Expect = 1.6e-36
Identity = 81/200 (40.50%), Postives = 119/200 (59.50%), Query Frame = 0

Query: 106 LDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEI--CNLMIWVLGNHKKF 165
           L+E DVK   +LV   +  +R+DW+++   F W  K       +   + MI +LG  +KF
Sbjct: 118 LEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKF 177

Query: 166 STAWSLI---RELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH 225
            TAW+LI   R+   SL+NS Q +L+MI +Y   ++  KAI TFH  ++F+L    + F 
Sbjct: 178 DTAWTLIDEMRKFSPSLVNS-QTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQ 237

Query: 226 VLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKC 285
            LL++LC+Y N+ +A   +F NK  +P   +SFNI+LNGWCNV     EA+R+W EM   
Sbjct: 238 SLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNV 297

Query: 286 CILPDSTSYTHMISCFSKNG 301
            +  D  SY+ MISC+SK G
Sbjct: 298 GVKHDVVSYSSMISCYSKGG 316

BLAST of CsGy7G018360.1 vs. Swiss-Prot
Match: sp|Q9FH87|PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana OX=3702 GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 77.8 bits (190), Expect = 4.2e-13
Identity = 41/134 (30.60%), Postives = 71/134 (52.99%), Query Frame = 0

Query: 148 EICNLMIWVLGNHKKFSTAWSLIREL--HGSLLNSMQAMLVMIDRYAYANEASKAIKTFH 207
           E+   M+ +L   ++F   W LI E+      L   +  +V++ R+A A+   KAI+   
Sbjct: 148 EVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLD 207

Query: 208 MMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTV 267
            M KF   PD+  F  LL++LCK+G++++A +     +  FP+    F  +L GWC V  
Sbjct: 208 EMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVG- 267

Query: 268 DVFEAKRIWREMSK 280
            + EAK +  +M++
Sbjct: 268 KMMEAKYVLVQMNE 280

BLAST of CsGy7G018360.1 vs. Swiss-Prot
Match: sp|Q9LFM6|PP375_ARATH (Pentatricopeptide repeat-containing protein At5g11310, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g11310 PE=2 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 8.7e-11
Identity = 54/189 (28.57%), Postives = 91/189 (48.15%), Query Frame = 0

Query: 104 AALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGE-KVG-AIDEEICNLMIWVLGNHK 163
           +ALDE  ++  ++LV++    L          FKW E K G  +   + + ++  L   +
Sbjct: 90  SALDETGIEPSVELVHALFDRLSSSPMLLHSVFKWAEMKPGFTLSPSLFDSVVNSLCKAR 149

Query: 164 KFSTAWSL----IRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKF----RLT 223
           +F  AWSL    +R   GS L S    +V+I RYA A    +AI+ F     +    +  
Sbjct: 150 EFEIAWSLVFDRVRSDEGSNLVSADTFIVLIRRYARAGMVQQAIRAFEFARSYEPVCKSA 209

Query: 224 PDQEAFHVLLNSLCKYGNIEEAEEFM-----FVNKKLFPLGTESFNIILNGWCNVTVDVF 278
            +     VLL++LCK G++ EA  ++      ++    P     FNI+LNGW   +  + 
Sbjct: 210 TELRLLEVLLDALCKEGHVREASMYLERIGGTMDSNWVP-SVRIFNILLNGWFR-SRKLK 269

BLAST of CsGy7G018360.1 vs. Swiss-Prot
Match: sp|Q9LN22|PPR54_ARATH (Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g20300 PE=2 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 5.6e-10
Identity = 42/150 (28.00%), Postives = 74/150 (49.33%), Query Frame = 0

Query: 132 SLLAFKWGEKVGAIDEEI---CNLMIWVLGNHKKFSTAWSLIRELHGSLLN-SMQAMLVM 191
           SL  F W       D +     N MI + G  ++F  AW LI  +    +  S++   ++
Sbjct: 133 SLAFFNWATSRDDYDHKSPHPYNEMIDLSGKVRQFDLAWHLIDLMKSRNVEISIETFTIL 192

Query: 192 IDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFP 251
           I RY  A  AS+A+  F+ ME +   PD+ AF +++++L +     EA+ F    K  F 
Sbjct: 193 IRRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRFE 252

Query: 252 LGTESFNIILNGWCNVTVDVFEAKRIWREM 278
                +  ++ GWC    ++ EA+++++EM
Sbjct: 253 PDVIVYTNLVRGWCRAG-EISEAEKVFKEM 281

BLAST of CsGy7G018360.1 vs. TrEMBL
Match: tr|A0A0A0K678|A0A0A0K678_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G431960 PE=4 SV=1)

HSP 1 Score: 876.7 bits (2264), Expect = 2.7e-251
Identity = 546/546 (100.00%), Postives = 546/546 (100.00%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ
Sbjct: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS
Sbjct: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
           MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF
Sbjct: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
           TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY
Sbjct: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540

Query: 541 GAASFE 547
           GAASFE
Sbjct: 541 GAASFE 546

BLAST of CsGy7G018360.1 vs. TrEMBL
Match: tr|A0A1S3CE43|A0A1S3CE43_CUCME (pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103499843 PE=4 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 1.6e-227
Identity = 511/546 (93.59%), Postives = 518/546 (94.87%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MA LSSIARRLCRIHPLPFHHLLYLNRL I DSPFQAF QT  L S FA QFSALPSFSQ
Sbjct: 1   MACLSSIARRLCRIHPLPFHHLLYLNRLSIRDSPFQAFRQTLCLRSLFAHQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120
           K+GD F FDTGRF+NYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDV+AD DLVYS
Sbjct: 61  KVGDQFQFDTGRFKNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVQADSDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSS LAFKWGEK GAIDEEICNLMIWVLGNHKKFSTAW LIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGAIDEEICNLMIWVLGNHKKFSTAWCLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFMF
Sbjct: 181 RQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPL TESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480
            E TFLIMFNKSFELE+PEYALN WVEMKRY+VFPS EHYSVLIQGLATCGHLKKARELY
Sbjct: 421 IEVTFLIMFNKSFELEQPEYALNAWVEMKRYKVFPSSEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPD GSIDEARQQVRHN KGKF+ HRKG TMRWKSHKQ+SK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDSGSIDEARQQVRHNKKGKFLSHRKGSTMRWKSHKQQSK 540

Query: 541 GAASFE 547
             ASFE
Sbjct: 541 RDASFE 546

BLAST of CsGy7G018360.1 vs. TrEMBL
Match: tr|A0A2I4GIU2|A0A2I4GIU2_9ROSI (pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Juglans regia OX=51240 GN=LOC109008254 PE=4 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 7.6e-121
Identity = 349/558 (62.54%), Postives = 421/558 (75.45%), Query Frame = 0

Query: 1   MASLSSIARRLCRIHP---LPFHHLLYLNRLRIPDSP-----------FQAFHQTFSLHS 60
           MA L S+ARRL R  P   LP   L  + R   P              F+AFHQT  L S
Sbjct: 1   MAHLPSLARRLPRTDPHFFLPLSMLHPITRSPCPSPATTILQPFLRYLFRAFHQTRHLPS 60

Query: 61  FFARQFSALPSFS-QKLGDPFLFDTGRFQ-NYRQSDACNARFIELFKRVALLPSEVEAVA 120
              R+FS    FS Q    PF FD  +F+  +   D    RF+E+ + V+  PSE +A+ 
Sbjct: 61  PQTRRFSTFQPFSAQNYHYPFDFDNYKFKVTHGTHDPGLPRFLEMLRGVSQCPSEAKAIE 120

Query: 121 ALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFS 180
           +LDE  ++A+ ++V SAIW  R++W+ + LAFKWGEK G  DE+ C LMIWVLGNH+KF+
Sbjct: 121 SLDESGIEANREIVCSAIWESREEWRLAFLAFKWGEKWGCTDEKACYLMIWVLGNHRKFN 180

Query: 181 TAWSLIREL--HGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVL 240
            AW LIR++  H S +++ +AML+MIDRYA AN+  KAI+TFH+ME FRLTPDQEAFH +
Sbjct: 181 IAWCLIRDMHTHRSKMDTRRAMLIMIDRYASANDPCKAIRTFHVMETFRLTPDQEAFHTV 240

Query: 241 LNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCI 300
           L +LCKYGNIEEAEEFMFVNKKLFPL TE FNIILN WCNV++DVFEAKR+WREMSKCCI
Sbjct: 241 LKALCKYGNIEEAEEFMFVNKKLFPLETEGFNIILNAWCNVSLDVFEAKRVWREMSKCCI 300

Query: 301 LPDSTSYTHMISCFSKNGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            PD+TSYT MISCFSK G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 TPDATSYTLMISCFSKVGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAD 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX +A 
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSAG 420

Query: 421 SKMSFELLKRMRQDGLGPTEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSV 480
            + + E+L RM++ GLGPT  TFLI+  K F+L +P++AL V++EMK+YE+ P+  HYSV
Sbjct: 421 FEGTLEVLNRMKKAGLGPTGDTFLIIIGKFFKLGQPDHALKVFMEMKQYEIVPNSLHYSV 480

Query: 481 LIQGLATCGHLKKARELYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFI 540
           L+QGLATCG L KARE Y EM  +GF+  PK+K LLKEP   S+ E ++QV+  N+ +  
Sbjct: 481 LVQGLATCGWLIKAREFYAEMRSNGFLEDPKLKKLLKEPVRCSVHEGKRQVQRVNRDERA 540

BLAST of CsGy7G018360.1 vs. TrEMBL
Match: tr|A0A2P4LEF9|A0A2P4LEF9_QUESU (Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=58331 GN=CFP56_34421 PE=4 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 1.1e-119
Identity = 337/547 (61.61%), Postives = 415/547 (75.87%), Query Frame = 0

Query: 4   LSSIARRLCRIHPLPF---HHLLYLNRLRIPDSP------FQAFHQTFSLHSFFARQFSA 63
           L S+ARRL R  P  F   H +  L+    P S       F A HQT  + +     FS 
Sbjct: 6   LPSLARRLQRTPPHLFLLSHIIPSLSSSPSPSSQLPSHFLFHALHQTSLIPTLQTHHFST 65

Query: 64  LPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKA 123
              FS   L DPF F+  RF+ Y   D    RF+EL + V   PS+ EA+ +LD+  ++ 
Sbjct: 66  FQPFSAHNLDDPFDFNHPRFRTYDPHDRFLLRFLELLREVPHYPSKAEAMTSLDDSGIEV 125

Query: 124 DLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIREL 183
           + +++YSA+W LR++W+ +LLAF+W E+ G  DE+  NLMIWVLGNH+KF+TAW LIR++
Sbjct: 126 NSEMIYSAVWELREEWRLALLAFRWSEQWGCADEKSSNLMIWVLGNHRKFNTAWCLIRDM 185

Query: 184 HGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIE 243
           H S +++ +AML+MIDRYA AN+  KAI TFH+MEKFRL+PDQ+AFH +LN+LCKYGN+E
Sbjct: 186 HQSKVDTRRAMLIMIDRYASANDPCKAIGTFHIMEKFRLSPDQQAFHTVLNALCKYGNVE 245

Query: 244 EAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMI 303
           EAEEFMF+NKKLFPL TE FNIILNGWCN+++D+FEAKR+WREMSKCCI PD+TSYT+MI
Sbjct: 246 EAEEFMFLNKKLFPLETEGFNIILNGWCNISLDIFEAKRVWREMSKCCITPDATSYTNMI 305

Query: 304 SCFSKNGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 363
           SCF+K GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 306 SCFAKVGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 365

Query: 364 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRM 423
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  A  + + E+L RM
Sbjct: 366 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEGAGFEGTLEILNRM 425

Query: 424 RQDGLGPTEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHL 483
           R+ GLGPT  + LI+ ++ F+L +PE AL +WVEMK+YEV  S  H SVL+QGLATCG L
Sbjct: 426 RKAGLGPTGDSLLIILSRFFKLGQPENALKIWVEMKQYEVELSHTHQSVLVQGLATCGWL 485

Query: 484 KKARELYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWK 541
            KARE Y +M   GF   PK+K LLKEP   S+ + +QQV+  N+  ++ H KG  +RWK
Sbjct: 486 TKAREFYADMRSKGFAEDPKLKKLLKEPARDSMHKGKQQVQRLNREGWLNHGKGTRVRWK 545

BLAST of CsGy7G018360.1 vs. TrEMBL
Match: tr|A0A2P4LEG7|A0A2P4LEG7_QUESU (Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=58331 GN=CFP56_34421 PE=4 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 2.2e-115
Identity = 331/533 (62.10%), Postives = 405/533 (75.98%), Query Frame = 0

Query: 4   LSSIARRLCRIHPLPF---HHLLYLNRLRIPDSP------FQAFHQTFSLHSFFARQFSA 63
           L S+ARRL R  P  F   H +  L+    P S       F A HQT  + +     FS 
Sbjct: 6   LPSLARRLQRTPPHLFLLSHIIPSLSSSPSPSSQLPSHFLFHALHQTSLIPTLQTHHFST 65

Query: 64  LPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKA 123
              FS   L DPF F+  RF+ Y   D    RF+EL + V   PS+ EA+ +LD+  ++ 
Sbjct: 66  FQPFSAHNLDDPFDFNHPRFRTYDPHDRFLLRFLELLREVPHYPSKAEAMTSLDDSGIEV 125

Query: 124 DLDLVYSAIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIREL 183
           + +++YSA+W LR++W+ +LLAF+W E+ G  DE+  NLMIWVLGNH+KF+TAW LIR++
Sbjct: 126 NSEMIYSAVWELREEWRLALLAFRWSEQWGCADEKSSNLMIWVLGNHRKFNTAWCLIRDM 185

Query: 184 HGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIE 243
           H S +++ +AML+MIDRYA AN+  KAI TFH+MEKFRL+PDQ+AFH +LN+LCKYGN+E
Sbjct: 186 HQSKVDTRRAMLIMIDRYASANDPCKAIGTFHIMEKFRLSPDQQAFHTVLNALCKYGNVE 245

Query: 244 EAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMI 303
           EAEEFMF+NKKLFPL TE FNIILNGWCN+++D+FEAKR+WREMSKCCI PD+TSYT+MI
Sbjct: 246 EAEEFMFLNKKLFPLETEGFNIILNGWCNISLDIFEAKRVWREMSKCCITPDATSYTNMI 305

Query: 304 SCFSKNGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 363
           SCF+K GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 306 SCFAKVGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 365

Query: 364 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAADSKMSFELLKRM 423
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  A  + + E+L RM
Sbjct: 366 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEGAGFEGTLEILNRM 425

Query: 424 RQDGLGPTEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHL 483
           R+ GLGPT  + LI+ ++ F+L +PE AL +WVEMK+YEV  S  H SVL+QGLATCG L
Sbjct: 426 RKAGLGPTGDSLLIILSRFFKLGQPENALKIWVEMKQYEVELSHTHQSVLVQGLATCGWL 485

Query: 484 KKARELYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRK 527
            KARE Y +M   GF   PK+K LLKEP   S+ + +QQV+  N+  ++ H K
Sbjct: 486 TKAREFYADMRSKGFAEDPKLKKLLKEPARDSMHKGKQQVQRLNREGWLNHGK 538

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011659500.14.1e-251100.00PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
XP_008461183.12.4e-22793.59PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
XP_023547792.19.9e-18982.78pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita ... [more]
XP_022992478.17.1e-18782.60pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita ... [more]
XP_022953169.19.3e-18782.60pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucurbita ... [more]
Match NameE-valueIdentityDescription
AT1G80880.11.1e-8055.77Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G15010.18.7e-3840.50Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65820.12.3e-1430.60Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G11310.14.8e-1228.57Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G20300.13.1e-1128.00Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9SAH2|PP137_ARATH2.0e-7955.77Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidop... [more]
sp|Q9LFQ4|PP383_ARATH1.6e-3640.50Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidop... [more]
sp|Q9FH87|PP447_ARATH4.2e-1330.60Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
sp|Q9LFM6|PP375_ARATH8.7e-1128.57Pentatricopeptide repeat-containing protein At5g11310, mitochondrial OS=Arabidop... [more]
sp|Q9LN22|PPR54_ARATH5.6e-1028.00Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0K678|A0A0A0K678_CUCSA2.7e-251100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G431960 PE=4 SV=1[more]
tr|A0A1S3CE43|A0A1S3CE43_CUCME1.6e-22793.59pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Cucumis ... [more]
tr|A0A2I4GIU2|A0A2I4GIU2_9ROSI7.6e-12162.54pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Juglans ... [more]
tr|A0A2P4LEF9|A0A2P4LEF9_QUESU1.1e-11961.61Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=5... [more]
tr|A0A2P4LEG7|A0A2P4LEG7_QUESU2.2e-11562.10Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=5... [more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsGy7G018360CsGy7G018360gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsGy7G018360.1CsGy7G018360.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy7G018360.1.five_prime_UTR.1CsGy7G018360.1.five_prime_UTR.1five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy7G018360.1.exon.1CsGy7G018360.1.exon.1exon
CsGy7G018360.1.exon.2CsGy7G018360.1.exon.2exon
CsGy7G018360.1.exon.3CsGy7G018360.1.exon.3exon
CsGy7G018360.1.exon.4CsGy7G018360.1.exon.4exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy7G018360.1.CDS.1CsGy7G018360.1.CDS.1CDS
CsGy7G018360.1.CDS.2CsGy7G018360.1.CDS.2CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy7G018360.1.three_prime_UTR.1CsGy7G018360.1.three_prime_UTR.1three_prime_UTR
CsGy7G018360.1.three_prime_UTR.2CsGy7G018360.1.three_prime_UTR.2three_prime_UTR


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 288..315
e-value: 9.5E-6
score: 25.4
coord: 219..238
e-value: 0.13
score: 12.5
coord: 459..488
e-value: 4.1E-5
score: 23.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 358..390
e-value: 1.5E-6
score: 26.0
coord: 460..488
e-value: 1.3E-4
score: 19.9
coord: 323..356
e-value: 1.9E-4
score: 19.4
coord: 288..320
e-value: 2.3E-7
score: 28.6
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 342..400
e-value: 7.3E-10
score: 38.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 320..354
score: 9.898
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 180..214
score: 7.289
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 146..176
score: 6.347
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 285..319
score: 11.323
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 249..284
score: 8.309
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 421..455
score: 6.533
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 215..245
score: 7.048
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 355..389
score: 12.682
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 456..490
score: 10.008
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 405..504
e-value: 1.2E-12
score: 49.5
coord: 118..242
e-value: 1.9E-11
score: 45.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 243..404
e-value: 9.6E-35
score: 122.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 146..511
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 526..546
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 30..517
NoneNo IPR availablePANTHERPTHR24015:SF602SUBFAMILY NOT NAMEDcoord: 30..517