CsaV3_6G008310 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_6G008310
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein, putative
Location: chr6 : 6684173 .. 6686304 (+)
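
The Location field above can be split into a sequence ID, start, end, and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr6 : 6684173 .. 6686304 (+)'.

    Returns (seqid, start, end, strand) with start and end as integers.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr6 : 6684173 .. 6686304 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 2132 bases on the plus strand of chr6.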
The following sequences are available for this feature:

Gene sequence (with intron)

GTTTAAATTATATGATAGTTTCGTAAAAATAAAAATGATATAAAAGCAAGTTGAAATTGAGTATTTAGGAGGCGCGCCCGAATTCCATTGAAGGGAATCAGCTGAAGAGTTTCCGTCTCCGGGAAAAGGAGAAGGCCATATTCTTAGCCAAATTCGATCCAAGATACCTTTTTTTCAAGCGCTGAAGTTGATTATAAGATAGGACAGATAATCCCACCTAGCTGGTAAATTTCAAAGCTCTCAACTTCCTGTCTGCCCATTAGGTGTTTGTCAAAAAGTCTTGCCTAAAATTCGTCAAATTTCTCTTCATTTTTTATGAAGTTCGATAAGATTTTGGTCACTTGTTTCAAATATAAGTATGTTATTGATTAAACATACCACCAATTAAGTAGGTATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAATATACTCGGAAACTCGGGCAGAGTTTCCGAGATGAGAGAAGTGATGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTTTTGGGAGATTTAACTGTACCAATAGAACACAAACTTTCAATACTCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCTTTTTCAGGAGTGTTCTTATGGTTGGGGAGTGAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTATTCCATGTTTTGGAGAATTTCCCGAAAGGGTGGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATCGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTAAAAGCCCCTAAACGGGCTCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTTAGCCACATGATAGCTAAAGGCTTCAGGCCACCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGAGCAAATAGTGGGAGGCTGTGTTCCAACTATTGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCCAACAAAGAAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTAATCAAATCGTTTTGCCCTTGCTCTAACACATTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTGTGTCTGGAATTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGATACTGGAGTATTTACTATAAAAATCTCATTCTGGTTCTTTCAAAATATTGAATGACACTTCATTTTGTTTTCATTTTTTGATATATTGATTAATGAGTTTAGTTAATTGATT
TTGCTGTTAGTTTGTAGCAACAAATCAATTATTTAATATATAAAGCAGGAGGAAAAAAAAGGGCCCAACCGGGTTCGAACCGGTGACCTCTTGATCTGCAGTCAAATGCTCTACCACTGAGCTATGGACCCG

mRNA sequence

ATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAATATACTCGGAAACTCGGGCAGAGTTTCCGAGATGAGAGAAGTGATGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTTTTGGGAGATTTAACTGTACCAATAGAACACAAACTTTCAATACTCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCTTTTTCAGGAGTGTTCTTATGGTTGGGGAGTGAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTATTCCATGTTTTGGAGAATTTCCCGAAAGGGTGGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATCGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTAAAAGCCCCTAAACGGGCTCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTTAGCCACATGATAGCTAAAGGCTTCAGGCCACCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGAGCAAATAGTGGGAGGCTGTGTTCCAACTATTGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCCAACAAAGAAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTAATCAAATCGTTTTGCCCTTGCTCTAACACATTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTGTGTCTGGAATTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGA

Coding sequence (CDS)

ATGAGTATTAGGTGGCCAAGGATTTTAACGCCCACATGTTTATCTCAGATTATTAGGAAGCAGAATAATCCCCAAACAGCTTACCAACTGTTTAAGGAAGCCAAATGTAGGTACCCAGATTATCGGCACAATGGTCCAGTGTATGCCACAATGATCAATATACTCGGAAACTCGGGCAGAGTTTCCGAGATGAGAGAAGTGATGGATCAGATGAGAGATGACTCTTGTGAATGCAAAGATTCTGTATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGATGGTATATCTCTTTTTAAAAGTTTTGGGAGATTTAACTGTACCAATAGAACACAAACTTTCAATACTCTTTTAGAAATCCTGTTGAAGGAATCTCAGCTTCATGCTGCTTGTCAGCTTTTTCAGGAGTGTTCTTATGGTTGGGGAGTGAAATCCAGGACTCAGTCCTTGAATTTGTTGATGCAATCTCTCTGCCAAAGAGGTCAGTCGGAGCTTGCTTTACATGTCTTTCAAGAAATGGATTACCAAAGTTGCTATCCAAATAGACTGAGTTATTTGATTGTAATGAAAGGACTGTGTCAAGATGGTAGGCTTAACGAGGCCATCCATTTGTTGTATTCCATGTTTTGGAGAATTTCCCGAAAGGGTGGTGGCGGGGACATAGTAATTTATAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATCGAGCAAGCTGTGGAAATACTTGGCAAGATCTTGAGGAAAGGACTAAAAGCCCCTAAACGGGCTCATTACCGAATCGACTTAGATCAATGCAGGAATAGCAACCTCACCATTGAGGAAATCAAGAGTTTAATCAATGAAGCTTTAATCAAAGGCGGAATTCCCAGTTCAGATAGCTATTGTGCCATGGCTGTTGATCTATATAACGAAAACAAGACTGATCAGGGAGATAAAGTGGTTAGCCACATGATAGCTAAAGGCTTCAGGCCACCATCCTTGATCTATGAAGCGAAAGCAGCTTCATTATGCAAAGAAGGCAAAGTTGACGATGCAGTCAAAGTAATTGAAGAGCAAATAGTGGGAGGCTGTGTTCCAACTATTGCATTGTACAACATCGTTCTGAAGGGTCTTTGTGATGATGGAAAATCAACAGTGGCTATGGAGTATTTGAAGAAAATGGCAAAGCAGGTCGGTCTTGTTGCCAACAAAGAAACTTACAGCACTTTAGTACATGGACTTTGTCTCGAAAATCGATATATTGAAGCATGTAAGGTTTTAGAGGAGATGGTAATCAAATCGTTTTGCCCTTGCTCTAACACATTCAATACACTTATCAAAGGTCTTTGCTCAGTTGGAAAACACTATGAAGCTGTGATGTGGTTGGAAGAAATGATTAGCCAAGGTCAATTGCCTCATGTTTGTGTCTGGAATTCTTTGGTTTCATCTTTGTGTTGCGATGTGGCTGGCATCGATATGTGTTCCAGGGTTTTATGA

Protein sequence

MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIVGGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSLVSSLCCDVAGIDMCSRVL
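
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a sketch only, assuming the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is a hypothetical helper written for illustration.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGAGTATTAGGTGGCCAAGGATT"))  # prints MSIRWPRI
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.
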
BLAST of CsaV3_6G008310 vs. NCBI nr
Match: XP_004140638.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis sativus] >KGN46470.1 hypothetical protein Csa_6G095860 [Cucumis sativus])

HSP 1 Score: 603.6 bits (1555), Expect = 6.1e-169
Identity = 497/497 (100.00%), Positives = 497/497 (100.00%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR
Sbjct: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI 420

Query: 421 EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXDVAGIDMCSRVL 498
           XXXXXDVAGIDMCSRVL
Sbjct: 481 XXXXXDVAGIDMCSRVL 497
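
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch of that arithmetic follows, assuming the header keeps the `count/length` layout seen in the BLAST reports on this page; `hsp_percentages` is a hypothetical helper, not a BLAST API.

```python
import re

def hsp_percentages(header_line):
    """Return the percentage for every 'count/length' pair in an HSP header line."""
    pairs = re.findall(r"=\s*(\d+)/(\d+)", header_line)
    return [round(100 * int(count) / int(length), 2) for count, length in pairs]

# Counts taken from the Cucumis melo isoform X1 hit reported on this page.
print(hsp_percentages("Identity = 477/497, Positives = 482/497"))  # prints [95.98, 96.98]
```

The helper ignores the field labels and reads only the fractions, so it works on any header line that follows this layout.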

BLAST of CsaV3_6G008310 vs. NCBI nr
Match: XP_016902498.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo] >XP_016902499.1 PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo])

HSP 1 Score: 562.4 bits (1448), Expect = 1.6e-156
Identity = 477/497 (95.98%), Positives = 482/497 (96.98%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MINILGNSGR
Sbjct: 1   MTVRWPRILTPTYLSQIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMINILGNSGR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFNCTNRTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFNCTNRTQTFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LLEILL ESQLHAACQLFQECSYGW VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 121 LLEILLNESQLHAACQLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLXXXXXXXXXXXXXXXXXXXX  FWRISRKG GGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           CAMAVDLYNENKTDQGD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 CAMAVDLYNENKTDQGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAK+VGLVANKETYSTLVHGLC ENRY 
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKKVGLVANKETYSTLVHGLCRENRYT 420

Query: 421 EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           EACKVLEEMVIKSF PCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 EACKVLEEMVIKSFWPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXDVAGIDMCSRVL 498
           XXXXXDVAGIDMCS+VL
Sbjct: 481 XXXXXDVAGIDMCSKVL 497

BLAST of CsaV3_6G008310 vs. NCBI nr
Match: XP_011656819.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis sativus])

HSP 1 Score: 495.7 bits (1275), Expect = 1.8e-136
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 51  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN 110
           MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN
Sbjct: 1   MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN 60

Query: 111 CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXX 170
           CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 171 XXXXXXXXXXSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRT 230
           XXXXXXXXXXSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRT
Sbjct: 121 XXXXXXXXXXSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRT 180

Query: 231 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALI 290
           LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALI
Sbjct: 181 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALI 240

Query: 291 KGGIPSSDSYCAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 350
           KGGIPSSDSYCAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 KGGIPSSDSYCAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 351 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLV 410
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLV 360

Query: 411 HGLCLENRYIEACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 470
           HGLCLENRYIEACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 HGLCLENRYIEACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 471 XXXXXXXXXXXXXXXDVAGIDMCSRVL 498
           XXXXXXXXXXXXXXXDVAGIDMCSRVL
Sbjct: 421 XXXXXXXXXXXXXXXDVAGIDMCSRVL 447

BLAST of CsaV3_6G008310 vs. NCBI nr
Match: XP_008459832.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis melo])

HSP 1 Score: 464.9 bits (1195), Expect = 3.4e-127
Identity = 432/447 (96.64%), Positives = 435/447 (97.32%), Query Frame = 0

Query: 51  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN 110
           MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFN
Sbjct: 1   MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFN 60

Query: 111 CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXX 170
           CTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  CTNRTQTFNTLLEILLNESQLHAACQLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 171 XXXXXXXXXXSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRT 230
           XXXXXXXXX SCYPNRLSYLXXXXXXXXXXXXXXXXXXXX  FWRISRKG GGDIVIYRT
Sbjct: 121 XXXXXXXXXQSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRT 180

Query: 231 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALI 290
           LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALI
Sbjct: 181 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALI 240

Query: 291 KGGIPSSDSYCAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 350
           KGGIPSSDSYCAMAVDLYNENKTDQGD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 KGGIPSSDSYCAMAVDLYNENKTDQGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 351 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLV 410
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAK+VGLVANKETYSTLV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKKVGLVANKETYSTLV 360

Query: 411 HGLCLENRYIEACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 470
           HGLC ENRY EACKVLEEMVIKSF PCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 HGLCRENRYTEACKVLEEMVIKSFWPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 471 XXXXXXXXXXXXXXXDVAGIDMCSRVL 498
           XXXXXXXXXXXXXXXDVAGIDMCS+VL
Sbjct: 421 XXXXXXXXXXXXXXXDVAGIDMCSKVL 447

BLAST of CsaV3_6G008310 vs. NCBI nr
Match: XP_022959953.1 (pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata] >XP_022959954.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata] >XP_022959955.1 pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata])

HSP 1 Score: 441.0 bits (1133), Expect = 5.2e-120
Identity = 271/318 (85.22%), Positives = 291/318 (91.51%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           M++RWPR+LTPT LSQIIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYA MINILGNSGR
Sbjct: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           +SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GISLFKS G FNCT+RTQTFNT
Sbjct: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LLEILL ESQL AACQLFQ+ S+    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   
Sbjct: 121 LLEILLNESQLDAACQLFQQSSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYQ 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSY XXXXXXXXXXXXXXXXXXXX  FWRISR+G GGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYXXXXXXXXXXXXXXXXXXXXXXXFWRISRRGSGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           IEQAVEILGKIL+KGLKAPKRAHY IDL+ CR S LT+ EIK LINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300

Query: 301 CAMAVDLYNENKTDQGDK 319
           CAMA+DLYNEN+TDQGDK
Sbjct: 301 CAMAIDLYNENETDQGDK 318

BLAST of CsaV3_6G008310 vs. TAIR10
Match: AT1G05600.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 239.6 bits (610), Expect = 4.2e-63
Identity = 129/316 (40.82%), Positives = 171/316 (54.11%), Query Frame = 0

Query: 3   IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVS 62
           +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI+ILG S RV 
Sbjct: 4   VRWPRVLTPSLLSQILKKQKNPVTALKLFEEAKERFPSYGHNGSVYATMIDILGKSNRVL 63

Query: 63  EMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLL 122
           EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISLFKS   FNC N + +F+TLL
Sbjct: 64  EMKYVIERMKEDSCECKDSVFASVIRTFSRAGRLEDAISLFKSLHEFNCVNWSLSFDTLL 123

Query: 123 EILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSC 182
           + ++KES+L AAC +F++  YGW V                                  C
Sbjct: 124 QEMVKESELEAACHIFRKYCYGWEVNSRITALNLLMKVLCQVNRSDLASQVFQEMNYQGC 183

Query: 183 YPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGEIE 242
           YP+R SY                     S                               
Sbjct: 184 YPDRDSYRILMKGFCLEGKLEEATHLLYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 243

Query: 243 QAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA 302
                 GKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Sbjct: 244 XXXXXXGKILRKGLKAPKRCYHHIEAGHWESSSEGIERVKRLLTETLIRGAIPCLDSYSA 303

Query: 303 MAVDLYNENKTDQGDK 319
           MA DL+ E K  +G++
Sbjct: 304 MATDLFEEGKLVEGEE 319

BLAST of CsaV3_6G008310 vs. TAIR10
Match: AT1G07740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 58.5 bits (140), Expect = 1.3e-08
Identity = 30/121 (24.79%), Positives = 61/121 (50.41%), Query Frame = 0

Query: 18  IRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCE 77
           +++  +P+ A  LF +   +   +RH+ P Y+++I  L  S     + +++  +R  +  
Sbjct: 56  LKEIEDPEEALSLFHQ--YQEMGFRHDYPSYSSLIYKLAKSRNFDAVDQILRLVRYRNVR 115

Query: 78  CKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQL 137
           C++S+F   I+ Y   G ++  I +F     F+C    Q+ NTL+ +L+   +L  A   
Sbjct: 116 CRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTIQSLNTLINVLVDNGELEKAKSF 174

Query: 138 F 139
           F
Sbjct: 176 F 174

BLAST of CsaV3_6G008310 vs. TAIR10
Match: AT3G04130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 1.2e-06
Identity = 25/94 (26.60%), Positives = 49/94 (52.13%), Query Frame = 0

Query: 41  YRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI 100
           ++H+   Y   ++ILG + +   M+E +++MR D     ++V    ++ +A  G  E+ +
Sbjct: 117 HKHSSDAYDMAVDILGKAKKWDRMKEFVERMRGDKLVTLNTVAKI-MRRFAGAGEWEEAV 176

Query: 101 SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAA 135
            +F   G F     T++ N LL+ L KE ++  A
Sbjct: 177 GIFDRLGEFGLEKNTESMNLLLDTLCKEKRVEQA 209

BLAST of CsaV3_6G008310 vs. TAIR10
Match: AT1G02060.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 49.7 bits (117), Expect = 6.1e-06
Identity = 28/103 (27.18%), Positives = 52/103 (50.49%), Query Frame = 0

Query: 41  YRHNGPVYATMINILGNSGRVSEMREVM--DQMRDDSC-ECKDSVFSFAIKTYASHGLLE 100
           + H    +  M+  LG +  ++  R  +   + R + C + +D  F+  I++Y + GL +
Sbjct: 96  FSHKEQSFFLMLEFLGRARNLNVARNFLFSIERRSNGCVKLQDRYFNSLIRSYGNAGLFQ 155

Query: 101 DGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQE 141
           + + LF++  +   +    TFN+LL ILLK  +   A  LF E
Sbjct: 156 ESVKLFQTMKQMGISPSVLTFNSLLSILLKRGRTGMAHDLFDE 198

BLAST of CsaV3_6G008310 vs. TAIR10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 49.3 bits (116), Expect = 8.0e-06
Identity = 31/123 (25.20%), Positives = 58/123 (47.15%), Query Frame = 0

Query: 10  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMD 69
           T   L   +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++++
Sbjct: 49  TDVKLLDSLRSQPDDSAALRLFNLAS-KKPNFSPEPALYEEILLRLGRSGSFDDMKKILE 108

Query: 70  QMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGRFNCTNRTQTFNTLLEILLKE 129
            M+   CE   S F   I++YA   L ++ +S+       F     T  +N +L +L+  
Sbjct: 109 DMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDG 168

Query: 130 SQL 132
           + L
Sbjct: 169 NSL 170

BLAST of CsaV3_6G008310 vs. Swiss-Prot
Match: sp|Q9SYK1|PPR11_ARATH (Pentatricopeptide repeat-containing protein At1g05600 OS=Arabidopsis thaliana OX=3702 GN=At1g05600 PE=2 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 7.6e-62
Identity = 129/316 (40.82%), Positives = 171/316 (54.11%), Query Frame = 0

Query: 3   IRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVS 62
           +RWPR+LTP+ LSQI++KQ NP TA +LF+EAK R+P Y HNG VYATMI+ILG S RV 
Sbjct: 4   VRWPRVLTPSLLSQILKKQKNPVTALKLFEEAKERFPSYGHNGSVYATMIDILGKSNRVL 63

Query: 63  EMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLL 122
           EM+ V+++M++DSCECKDSVF+  I+T++  G LED ISLFKS   FNC N + +F+TLL
Sbjct: 64  EMKYVIERMKEDSCECKDSVFASVIRTFSRAGRLEDAISLFKSLHEFNCVNWSLSFDTLL 123

Query: 123 EILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSC 182
           + ++KES+L AAC +F++  YGW V                                  C
Sbjct: 124 QEMVKESELEAACHIFRKYCYGWEVNSRITALNLLMKVLCQVNRSDLASQVFQEMNYQGC 183

Query: 183 YPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGEIE 242
           YP+R SY                     S                               
Sbjct: 184 YPDRDSYRILMKGFCLEGKLEEATHLLYSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 243

Query: 243 QAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSYCA 302
                 GKILRKGLKAPKR ++ I+     +S+  IE +K L+ E LI+G IP  DSY A
Sbjct: 244 XXXXXXGKILRKGLKAPKRCYHHIEAGHWESSSEGIERVKRLLTETLIRGAIPCLDSYSA 303

Query: 303 MAVDLYNENKTDQGDK 319
           MA DL+ E K  +G++
Sbjct: 304 MATDLFEEGKLVEGEE 319

BLAST of CsaV3_6G008310 vs. Swiss-Prot
Match: sp|Q9LQQ1|PPR20_ARATH (Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g07740 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 2.4e-07
Identity = 30/121 (24.79%), Positives = 61/121 (50.41%), Query Frame = 0

Query: 18  IRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCE 77
           +++  +P+ A  LF +   +   +RH+ P Y+++I  L  S     + +++  +R  +  
Sbjct: 56  LKEIEDPEEALSLFHQ--YQEMGFRHDYPSYSSLIYKLAKSRNFDAVDQILRLVRYRNVR 115

Query: 78  CKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQL 137
           C++S+F   I+ Y   G ++  I +F     F+C    Q+ NTL+ +L+   +L  A   
Sbjct: 116 CRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTIQSLNTLINVLVDNGELEKAKSF 174

Query: 138 F 139
           F
Sbjct: 176 F 174

BLAST of CsaV3_6G008310 vs. Swiss-Prot
Match: sp|Q9M8W9|PP211_ARATH (Pentatricopeptide repeat-containing protein At3g04130, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g04130 PE=2 SV=2)

HSP 1 Score: 52.0 bits (123), Expect = 2.2e-05
Identity = 25/94 (26.60%), Positives = 49/94 (52.13%), Query Frame = 0

Query: 41  YRHNGPVYATMINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGI 100
           ++H+   Y   ++ILG + +   M+E +++MR D     ++V    ++ +A  G  E+ +
Sbjct: 117 HKHSSDAYDMAVDILGKAKKWDRMKEFVERMRGDKLVTLNTVAKI-MRRFAGAGEWEEAV 176

Query: 101 SLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAA 135
            +F   G F     T++ N LL+ L KE ++  A
Sbjct: 177 GIFDRLGEFGLEKNTESMNLLLDTLCKEKRVEQA 209

BLAST of CsaV3_6G008310 vs. Swiss-Prot
Match: sp|O81908|PPR2_ARATH (Pentatricopeptide repeat-containing protein At1g02060, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g02060 PE=2 SV=2)

HSP 1 Score: 49.7 bits (117), Expect = 1.1e-04
Identity = 28/103 (27.18%), Positives = 52/103 (50.49%), Query Frame = 0

Query: 41  YRHNGPVYATMINILGNSGRVSEMREVM--DQMRDDSC-ECKDSVFSFAIKTYASHGLLE 100
           + H    +  M+  LG +  ++  R  +   + R + C + +D  F+  I++Y + GL +
Sbjct: 96  FSHKEQSFFLMLEFLGRARNLNVARNFLFSIERRSNGCVKLQDRYFNSLIRSYGNAGLFQ 155

Query: 101 DGISLFKSFGRFNCTNRTQTFNTLLEILLKESQLHAACQLFQE 141
           + + LF++  +   +    TFN+LL ILLK  +   A  LF E
Sbjct: 156 ESVKLFQTMKQMGISPSVLTFNSLLSILLKRGRTGMAHDLFDE 198

BLAST of CsaV3_6G008310 vs. Swiss-Prot
Match: sp|Q9LFF1|PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 49.3 bits (116), Expect = 1.4e-04
Identity = 31/123 (25.20%), Positives = 58/123 (47.15%), Query Frame = 0

Query: 10  TPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGRVSEMREVMD 69
           T   L   +R Q +   A +LF  A  + P++     +Y  ++  LG SG   +M+++++
Sbjct: 49  TDVKLLDSLRSQPDDSAALRLFNLAS-KKPNFSPEPALYEEILLRLGRSGSFDDMKKILE 108

Query: 70  QMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFK-SFGRFNCTNRTQTFNTLLEILLKE 129
            M+   CE   S F   I++YA   L ++ +S+       F     T  +N +L +L+  
Sbjct: 109 DMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDG 168

Query: 130 SQL 132
           + L
Sbjct: 169 NSL 170

BLAST of CsaV3_6G008310 vs. TrEMBL
Match: tr|A0A0A0K9Q4|A0A0A0K9Q4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G095860 PE=4 SV=1)

HSP 1 Score: 603.6 bits (1555), Expect = 4.0e-169
Identity = 497/497 (100.00%), Positives = 497/497 (100.00%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR
Sbjct: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI 420

Query: 421 EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXDVAGIDMCSRVL 498
           XXXXXDVAGIDMCSRVL
Sbjct: 481 XXXXXDVAGIDMCSRVL 497

BLAST of CsaV3_6G008310 vs. TrEMBL
Match: tr|A0A1S4E2N8|A0A1S4E2N8_CUCME (pentatricopeptide repeat-containing protein At1g05600 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498838 PE=4 SV=1)

HSP 1 Score: 562.4 bits (1448), Expect = 1.0e-156
Identity = 477/497 (95.98%), Positives = 482/497 (96.98%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           M++RWPRILTPT LSQIIRKQNNP TAYQLFKEAKCRYPDYRHNGPVYA MINILGNSGR
Sbjct: 1   MTVRWPRILTPTYLSQIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMINILGNSGR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFNCTNRTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFNCTNRTQTFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LLEILL ESQLHAACQLFQECSYGW VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 121 LLEILLNESQLHAACQLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLXXXXXXXXXXXXXXXXXXXX  FWRISRKG GGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           CAMAVDLYNENKTDQGD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 CAMAVDLYNENKTDQGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRYI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAK+VGLVANKETYSTLVHGLC ENRY 
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKKVGLVANKETYSTLVHGLCRENRYT 420

Query: 421 EACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480
           EACKVLEEMVIKSF PCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 421 EACKVLEEMVIKSFWPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 XXXXXDVAGIDMCSRVL 498
           XXXXXDVAGIDMCS+VL
Sbjct: 481 XXXXXDVAGIDMCSKVL 497

BLAST of CsaV3_6G008310 vs. TrEMBL
Match: tr|A0A1S3CB38|A0A1S3CB38_CUCME (pentatricopeptide repeat-containing protein At1g05600 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498838 PE=4 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 2.2e-127
Identity = 432/447 (96.64%), Positives = 435/447 (97.32%), Query Frame = 0

Query: 51  MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN 110
           MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKS GRFN
Sbjct: 1   MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFN 60

Query: 111 CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXX 170
           CTNRTQTFNTLLEILL ESQLHAACQLFQECSYGW VXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  CTNRTQTFNTLLEILLNESQLHAACQLFQECSYGWEVXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 171 XXXXXXXXXXSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRT 230
           XXXXXXXXX SCYPNRLSYLXXXXXXXXXXXXXXXXXXXX  FWRISRKG GGDIVIYRT
Sbjct: 121 XXXXXXXXXQSCYPNRLSYLXXXXXXXXXXXXXXXXXXXXXXFWRISRKGSGGDIVIYRT 180

Query: 231 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALI 290
           LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRN+ LTIEEIKSLINEALI
Sbjct: 181 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALI 240

Query: 291 KGGIPSSDSYCAMAVDLYNENKTDQGDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 350
           KGGIPSSDSYCAMAVDLYNENKTDQGD XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 241 KGGIPSSDSYCAMAVDLYNENKTDQGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 351 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLV 410
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAK+VGLVANKETYSTLV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKKVGLVANKETYSTLV 360

Query: 411 HGLCLENRYIEACKVLEEMVIKSFCPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 470
           HGLC ENRY EACKVLEEMVIKSF PCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 HGLCRENRYTEACKVLEEMVIKSFWPCSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 471 XXXXXXXXXXXXXXXDVAGIDMCSRVL 498
           XXXXXXXXXXXXXXXDVAGIDMCS+VL
Sbjct: 421 XXXXXXXXXXXXXXXDVAGIDMCSKVL 447

BLAST of CsaV3_6G008310 vs. TrEMBL
Match: tr|A0A2H5P127|A0A2H5P127_CITUN (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_092180 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.3e-95
Identity = 249/438 (56.85%), Positives = 292/438 (66.67%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           MS+RWPR+LTPT LSQII+KQ +P TA ++FKEAK +YP+YRHNGPVYA+MI IL  S R
Sbjct: 1   MSVRWPRLLTPTYLSQIIKKQKSPLTALKIFKEAKEKYPNYRHNGPVYASMIGILSESNR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           ++EM+EV+DQM+ DSCECKDSVF+ AI+TYA  G L + +SLFK+  +FNC N TQ+FNT
Sbjct: 61  ITEMKEVIDQMKGDSCECKDSVFATAIRTYARAGQLNEAVSLFKNLSQFNCVNWTQSFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LL+ ++KES+L AA  LF    YGW V                                 
Sbjct: 121 LLKEMVKESKLEAAHILFLRSCYGWEVKSRIQSLNLLMDVLCQRRRSDLALHVFQEMDFQ 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
            CYP+R SY                     SMFWRIS+KG G DIVIYRTLLFALCD G+
Sbjct: 181 GCYPDRESYHILMKGLCNDRRLNEATHLLYSMFWRISQKGSGEDIVIYRTLLFALCDQGK 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           I+ A++IL KILRKGLKAPK   +RIDL  C N    IE  KSLINEALI+GGIPS  SY
Sbjct: 241 IQDAMQILEKILRKGLKAPKSRRHRIDLCPC-NDGEDIEGAKSLINEALIRGGIPSLASY 300

Query: 301 CAMAVDLYNENKTDQGD-KXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            AMAVDLYNE +  +GD  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SAMAVDLYNEGRIVEGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRY 420
           XXXXX                           KM+KQVG VAN ETY  LV GLC + R+
Sbjct: 361 XXXXXPTVRVYNILLKGLCDAGNSAVAVMYLKKMSKQVGCVANGETYGILVDGLCRDGRF 420

Query: 421 IEACKVLEEMVIKSFCPC 438
           +EA +VLEEM+I+S+ PC
Sbjct: 421 LEASRVLEEMLIRSYWPC 437

BLAST of CsaV3_6G008310 vs. TrEMBL
Match: tr|V4VX73|V4VX73_9ROSI (Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10019806mg PE=4 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 1.7e-95
Identity = 248/438 (56.62%), Positives = 292/438 (66.67%), Query Frame = 0

Query: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60
           MS+RWPR+LTPT LSQII+KQ +P TA ++FKEAK +YP+YRHNGPVYA+MI IL  S R
Sbjct: 1   MSVRWPRLLTPTYLSQIIKKQKSPLTALKIFKEAKEKYPNYRHNGPVYASMIGILSESNR 60

Query: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120
           ++EM+EV+DQM+ DSCECKDSVF+ AI+TYA  G L + +SLFK+  +FNC N TQ+FNT
Sbjct: 61  ITEMKEVIDQMKGDSCECKDSVFATAIRTYARAGQLNEAVSLFKNLSQFNCVNWTQSFNT 120

Query: 121 LLEILLKESQLHAACQLFQECSYGWGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180
           LL+ ++KES+L AA  LF    YGW V                                 
Sbjct: 121 LLKEMVKESKLEAAHILFLRSCYGWEVKSRIQSLNLLMDVLCQRRRSDLALHVFQEMDFQ 180

Query: 181 SCYPNRLSYLXXXXXXXXXXXXXXXXXXXXSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240
            CYP+R SY                     SMFWRIS+KG G DIVIYRTLLFALCD G+
Sbjct: 181 GCYPDRESYHILMKGLCNDRRLNEATHLLYSMFWRISQKGSGEDIVIYRTLLFALCDQGK 240

Query: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300
           I+ A++IL KILRKGLKAPK   +RIDL  C N    IE  KSLINEALI+GGIPS  SY
Sbjct: 241 IQDAMQILEKILRKGLKAPKSRRHRIDLCPC-NDGEDIEGAKSLINEALIRGGIPSLASY 300

Query: 301 CAMAVDLYNENKTDQGD-KXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            AMA+DLYNE +  +GD  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SAMAIDLYNEGRIVEGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKMAKQVGLVANKETYSTLVHGLCLENRY 420
           XXXXX                           KM+KQVG VAN ETY  LV GLC + R+
Sbjct: 361 XXXXXPTVRVYNILLKGLCDAGNSAVAVMYLKKMSKQVGCVANGETYGILVDGLCRDGRF 420

Query: 421 IEACKVLEEMVIKSFCPC 438
           +EA +VLEEM+I+S+ PC
Sbjct: 421 LEASRVLEEMLIRSYWPC 437

The following BLAST results are available for this feature:
Match Name       E-value    Identity  Description
XP_004140638.1   6.1e-169   100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cuc...
XP_016902498.1   1.6e-156   95.98     PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cuc...
XP_011656819.1   1.8e-136   100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cuc...
XP_008459832.1   3.4e-127   96.64     PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cuc...
XP_022959953.1   5.2e-120   85.22     pentatricopeptide repeat-containing protein At1g05600 [Cucurbita moschata] >XP_0...
Match Name    E-value   Identity  Description
AT1G05600.1   4.2e-63   40.82     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G07740.1   1.3e-08   24.79     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G04130.1   1.2e-06   26.60     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02060.1   6.1e-06   27.18     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53700.1   8.0e-06   25.20     Pentatricopeptide repeat (PPR) superfamily protein
Match Name              E-value   Identity  Description
sp|Q9SYK1|PPR11_ARATH   7.6e-62   40.82     Pentatricopeptide repeat-containing protein At1g05600 OS=Arabidopsis thaliana OX...
sp|Q9LQQ1|PPR20_ARATH   2.4e-07   24.79     Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidop...
sp|Q9M8W9|PP211_ARATH   2.2e-05   26.60     Pentatricopeptide repeat-containing protein At3g04130, mitochondrial OS=Arabidop...
sp|O81908|PPR2_ARATH    1.1e-04   27.18     Pentatricopeptide repeat-containing protein At1g02060, chloroplastic OS=Arabidop...
sp|Q9LFF1|PP281_ARATH   1.4e-04   25.20     Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Match Name                        E-value    Identity  Description
tr|A0A0A0K9Q4|A0A0A0K9Q4_CUCSA    4.0e-169   100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G095860 PE=4 SV=1
tr|A0A1S4E2N8|A0A1S4E2N8_CUCME    1.0e-156   95.98     pentatricopeptide repeat-containing protein At1g05600 isoform X1 OS=Cucumis melo...
tr|A0A1S3CB38|A0A1S3CB38_CUCME    2.2e-127   96.64     pentatricopeptide repeat-containing protein At1g05600 isoform X2 OS=Cucumis melo...
tr|A0A2H5P127|A0A2H5P127_CITUN    1.3e-95    56.85     Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_092180 PE=4 SV=1
tr|V4VX73|V4VX73_9ROSI            1.7e-95    56.62     Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10019806mg PE=4 ...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_6G008310.1  CsaV3_6G008310.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 10..132, e-value: 2.2E-18, score: 68.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 382..487, e-value: 1.9E-22, score: 81.5
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 278..381, e-value: 7.4E-12, score: 46.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 142..276, e-value: 1.3E-18, score: 69.5
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 341..357, e-value: 1.3, score: 9.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 48..76, e-value: 0.0031, score: 17.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 226..256, e-value: 0.0029, score: 17.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 341..366, e-value: 0.0017, score: 16.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 154..185, e-value: 5.5E-5, score: 21.1
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 226..257, e-value: 0.0024, score: 15.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 47..78, e-value: 2.6E-6, score: 25.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 369..402, e-value: 7.9E-4, score: 17.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 405..436, e-value: 5.3E-6, score: 24.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 440..471, e-value: 2.7E-6, score: 25.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 188..212, e-value: 0.0024, score: 15.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 436..484, e-value: 2.4E-7, score: 30.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 153..198, e-value: 2.9E-10, score: 40.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 365..414, e-value: 4.6E-9, score: 36.2
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 150..184, score: 9.525
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 366..396, score: 8.013
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 331..365, score: 8.396
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 185..215, score: 7.903
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 44..78, score: 9.262
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 402..436, score: 10.676
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 437..471, score: 10.556
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 224..258, score: 10.446
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 114..149, score: 6.292
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 296..330, score: 7.662
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 79..113, score: 6.215
None       No IPR available                                   PANTHER  PTHR24015:SF542   SUBFAMILY NOT NAMED  coord: 9..480
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 9..480
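
The eleven PROSITE PS51375 (PPR) hits listed above are reported in no particular order. As a rough illustration, the sketch below sorts them and merges hits that touch or overlap, to show which stretches of the protein the PPR profile covers. It assumes the coordinates are 1-based and inclusive; the function name `merge_adjacent` is hypothetical and is not part of InterProScan.

```python
# Merge the PROSITE PS51375 hit coordinates into contiguous blocks.
# Assumption: coordinates are 1-based, inclusive residue positions.

prosite_ppr = [
    (150, 184), (366, 396), (331, 365), (185, 215), (44, 78), (402, 436),
    (437, 471), (224, 258), (114, 149), (296, 330), (79, 113),
]

def merge_adjacent(intervals):
    """Merge intervals that overlap or are directly adjacent (end + 1 == start)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

blocks = merge_adjacent(prosite_ppr)
covered = sum(end - start + 1 for start, end in blocks)

print(blocks)   # [(44, 215), (224, 258), (296, 396), (402, 471)]
print(covered)  # 378
```

Under that assumption the hits collapse into four blocks spanning 378 residues, with short gaps at 216..223 and 397..401 and a longer one at 259..295.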