CsaV3_UNG233330 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_UNG233330
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: scaffold119 : 88563 .. 91497 (+)
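The location string above can be turned into a feature length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'scaffold119 : 88563 .. 91497 (+)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("scaffold119 : 88563 .. 91497 (+)")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
length = end - start + 1
print(seqid, strand, length)  # scaffold119 + 2935
```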
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAGATGTGTTCCCAAGGTTGCTCTCGAATCAGTATTCATTCACCTTTCTCTTTCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTTTCTCATTTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTAATGGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGTGTCGAGATATACCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTGCTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGTATTGATGCTCTTCAGCTTTATGATCAAATGTTGGTGAGTTTTCTATATCCTTCTTCATTTGTTTTTGTTCTGAATTTTAGTTTCTAAATCACTAATGAACTTCAGAAGTATAAGTGAGTACATAGTTAATCTTTTGTTTATGTTCACATGTTTTGTATGTTTGGTAGCTGGTATAGGTTAGTGATAATAGAACACCTACTTTGGATGATTCTTGTTCAATGGAAACAACAAAAGGAATTACAAGGAAATGGTAGAGCACCCAGTCTTGAACAAGAACAAAGACAACTAACAAAGAACGGAAAGTAAACCCTACCCTTCCCAGGCCTTTTCTATCTCTCAAGGAATAAACAAATCCCTCACACAATTTTCATACCCTCACTCTCAGACTCCCTCACCATTTAAACCTTCTCCTTTCCCTAACTAACTGTGGGACCCGGCCTTAGCAGCTATAGTGCCCATTACACATGCACTTCCTTTGTTCCCTCTCCTACTGTATTACCATATAATAGGTGCCCTAACATTACGCCTTCTCTAAAATCACATTCTCCTTAAGGTGAAATCTGGAAATTGCTGTTGAAAATCATAACAAATCTCCTAGGCAGCTTTATAAGGAGGTAGCACTTGCCAACAAATCAAGACCTCCCACACTCCCGTTGTAGGGTGCCTTTGATATCCATAAACTTCCTCTAGCTTGGCTACTCATTCATAATTTTTCGACAAGTGACTAGTTGCTGCACCTCAGAATGCTCACTAACACTCCCTCAACTGTGAGACATGAGACACCGGATGAATAGAGGCCGATGGAGGTAATTCCAGTTTTTATGCCACTGGTCCAATTCTTTCAAGCAAAAATTTTGGAGATAATTTTTCATTTCTCCGTTTTCGCAATGACGAGTGCCCGTTTGAAGTTGGTCGCCTAGCTGTTTTAAGAGGAGCAAGTGCCTCTCACCTAAGGCGAGCCCTCGGTTGCACCTCGAAAACACTGGCTGCAGTATTATTTTCTCTTCAATTAAAATGATTTCCATTTGTTGAGTCTTCTACATATTTCAAGCCTACAAAGGAGAGGGAAATGTTAAAGACGATACAACATTAAATTTTCCTTCACCTTAAACTATAGCT
AAATCTACAAGGTTGAAGTTGTTGTGTTATTGTATGTTTAAGCATTTATATTAATAGTCTTGTTACATCTTTTGCATTTATACTGACGGTGGTGTTATATCTTTCTCACAACAGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGAAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCATGATCCAAACGAAGATGAAGAACTACTCTATTCTTCGTAATTATTTTACGGTTTGATTGAATCGTTATATTGCTTCATGATTATTATATTAGATATATGAGATTTAAAGAACTAGGAATTAATGTCTAGAATAGTTCATTAGGGAATAGCAAAATTGTATGTAGTTGAATTTTGCACTTAGTCGGTTTTTGTCCTAAAGTTTAAATACTTTTATACCTATGAATTCATTCAAAATGATAATTTTTTTTA

mRNA sequence

ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAGATGTGTTCCCAAGGTTGCTCTCGAATCAGTATTCATTCACCTTTCTCTTTCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTTTCTCATTTGTCCGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTGCTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGTATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGAAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCATGATCCAAACGAAGATGAAGAACTACTCTATTCTTCGTAA

Coding sequence (CDS)

ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAGATGTGTTCCCAAGGTTGCTCTCGAATCAGTATTCATTCACCTTTCTCTTTCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTTTCTCATTTGTCCGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTGCTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGTATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGAAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCATGATCCAAACGAAGATGAAGAACTACTCTATTCTTCGTAA

Protein sequence

MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSRISIHSPFSFACASLFNVYPGQMLFSFVRHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHHDPNEDEELLYSS
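The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and builds the codon table by hand rather than relying on a library. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGAACCAATTGAAGCAAATTCATGCT"))  # MNQLKQIHA
# Last four codons of the CDS, ending in the TAA stop.
print(translate("TATTCTTCGTAA"))  # YSS
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above if the assumed genetic code applies.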
BLAST of CsaV3_UNG233330 vs. NCBI nr
Match: XP_011660274.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativus] >KGN66775.1 hypothetical protein Csa_1G690140 [Cucumis sativus])

HSP 1 Score: 813.5 bits (2100), Expect = 3.6e-232
Identity = 415/521 (79.65%), Positives = 422/521 (81.00%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF-ACASLFNVYPGQMLFSFVRHMEAALELFNK 120
           GHPHRCWLLYCQMCSQGCS       F F ACASLFNVYPGQML S       A ++F  
Sbjct: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMF-- 120

Query: 121 MPVRNVISWTALISGYAQNGKYAKALEMF------------------------------- 180
                  + TAL+  YA+ G    A ++F                               
Sbjct: 121 -------AMTALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 -------------------------------IGLENEKGTKPNEVSIASVLPACSQLGAL 240
                                           GLENEKGTKPNEVSIASVLPACSQLGAL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGAL 240

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
           DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG
Sbjct: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300

Query: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360
           LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP
Sbjct: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360

Query: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420
           KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL
Sbjct: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420

Query: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 459
           FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE
Sbjct: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 480
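
The percentages in each HSP summary line are counts divided by the alignment length, which in BLAST output includes gap columns. A minimal sketch for recomputing them from the summary line is shown here; the function name `parse_hsp_stats` is hypothetical, and the regular expression only assumes the `label = hits/total` layout seen in this report.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (hits, alignment_length, percent)} for each 'x = a/b' field."""
    stats = {}
    for label, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[label] = (hits, total, round(100 * hits / total, 2))
    return stats

line = "Identity = 415/521 (79.65%), Positives = 422/521 (81.00%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"])   # (415, 521, 79.65)
print(stats["Positives"])  # (422, 521, 81.0)
```

Fields without a `hits/total` fraction, such as `Query Frame = 0`, are skipped by the pattern.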

BLAST of CsaV3_UNG233330 vs. NCBI nr
Match: XP_022979296.1 (pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Cucurbita maxima])

HSP 1 Score: 756.1 bits (1951), Expect = 6.8e-215
Identity = 375/462 (81.17%), Positives = 404/462 (87.45%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF---ACASLFNVYPGQMLFSFVRHMEAALELF 120
           GHPHRCWLLY QMC QGCS    H  F+F   ACAS  N YPG   ++   +M AALELF
Sbjct: 61  GHPHRCWLLYYQMCLQGCS--PNHHSFTFLFPACASFLNAYPG---YARSGYMGAALELF 120

Query: 121 NKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGAL 180
           +KMP+RNVISWTALISG            MF+ LENEKGTKPNEV+IASVLPAC+QLGAL
Sbjct: 121 DKMPIRNVISWTALISGXXXXXXXXXXXXMFLRLENEKGTKPNEVTIASVLPACAQLGAL 180

Query: 181 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 240
           DIGKRIE YAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMG
Sbjct: 181 DIGKRIEVYARKNGFFKNLYVSNAILEVHARCGNIEEARRVFDEIGSKRNLCSWNTMIMG 240

Query: 241 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 300
           LAVHGRC DALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+AP
Sbjct: 241 LAVHGRCPDALQLYDQMLIQRTRPDDVTFVGLLLACTHGGMVAKGRQIFESMEIKFQIAP 300

Query: 301 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 360
           KLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGNVELGEVAAESL
Sbjct: 301 KLEHYGCLVDLLGRAGEIEEAYSLIQSMPMFPDSVIWGALLGACSFHGNVELGEVAAESL 360

Query: 361 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 420
           FKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVE
Sbjct: 361 FKLEPWNPGNYVILSNIYASAGDWSGVARVRKMMKGGHIRKRAGCSYIEVGDGIHEFIVE 420

Query: 421 DRSHLKSGEIYALLHKIYDIIKLHKHVHHDPNE-DEELLYSS 459
           DRSH KS EIYALLH IY IIKLH       NE +EELLYSS
Sbjct: 421 DRSHPKSDEIYALLHAIYAIIKLHSQ-----NEGEEELLYSS 452

BLAST of CsaV3_UNG233330 vs. NCBI nr
Match: XP_022924285.1 (pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Cucurbita moschata])

HSP 1 Score: 741.9 bits (1914), Expect = 1.3e-210
Identity = 368/462 (79.65%), Positives = 399/462 (86.36%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNG+D+TKFLI+KLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGVDYTKFLIQKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF---ACASLFNVYPGQMLFSFVRHMEAALELF 120
           GH HRCWLLY QMC QGCS    H  F+F   ACAS  N YPG   ++   +M AALELF
Sbjct: 61  GHHHRCWLLYYQMCLQGCS--PNHHSFTFLFPACASFLNAYPG---YARSGYMGAALELF 120

Query: 121 NKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGAL 180
           +KMP RNVISWTALIS               + LENEKGTKPNEV+IASVLPAC+ LGAL
Sbjct: 121 DKMPTRNVISWTALISXXXXXXXXXXXXXXXLRLENEKGTKPNEVTIASVLPACAHLGAL 180

Query: 181 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 240
           DIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMG
Sbjct: 181 DIGKRIEAYARKNGFFKNLYVSNAILEVHARCGNIEEARRVFDEIGSKRNLCSWNTMIMG 240

Query: 241 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 300
           LAVHGRC DALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+AP
Sbjct: 241 LAVHGRCRDALQLYDQMLIQRTRPDDVTFVGLLLACTHGGMVAKGRQLFESMERKFQIAP 300

Query: 301 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 360
           KLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGNVELGEVAAESL
Sbjct: 301 KLEHYGCLVDLLGRAGEIEEAYSLIQSMPMLPDSVIWGALLGACSFHGNVELGEVAAESL 360

Query: 361 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 420
           FKLEPWNPGNYVILSNIYA AGDWSGVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VE
Sbjct: 361 FKLEPWNPGNYVILSNIYASAGDWSGVARARKMMKGGHMRKRAGCSYIEVGDGIHEFVVE 420

Query: 421 DRSHLKSGEIYALLHKIYDIIKLHKHVHHDPNE-DEELLYSS 459
           DRSH KS EIYALLH +Y IIKL     H+ NE +EELLYSS
Sbjct: 421 DRSHPKSDEIYALLHAVYAIIKL-----HNQNEGEEELLYSS 452

BLAST of CsaV3_UNG233330 vs. NCBI nr
Match: XP_022979295.1 (pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima])

HSP 1 Score: 700.7 bits (1807), Expect = 3.4e-198
Identity = 361/515 (70.10%), Positives = 393/515 (76.31%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF---ACASLFNVYPGQMLFSF----------- 120
           GHPHRCWLLY QMC QGCS    H  F+F   ACAS  N YPGQML S            
Sbjct: 61  GHPHRCWLLYYQMCLQGCS--PNHHSFTFLFPACASFLNAYPGQMLHSHFCKSGFASDVF 120

Query: 121 -----------VRHMEAALELFNKMPVRNV------------------------------ 180
                      +  +++A +LF++MPVR++                              
Sbjct: 121 ALTALLDMYGKLGILKSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 -ISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIE 240
                                      ENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXENEKGTKPNEVTIASVLPACAQLGALDIGKRIE 240

Query: 241 AYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRC 300
            YAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC
Sbjct: 241 VYARKNGFFKNLYVSNAILEVHARCGNIEEARRVFDEIGSKRNLCSWNTMIMGLAVHGRC 300

Query: 301 IDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGC 360
            DALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+APKLEHYGC
Sbjct: 301 PDALQLYDQMLIQRTRPDDVTFVGLLLACTHGGMVAKGRQIFESMEIKFQIAPKLEHYGC 360

Query: 361 LVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWN 420
           LVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGNVELGEVAAESLFKLEPWN
Sbjct: 361 LVDLLGRAGEIEEAYSLIQSMPMFPDSVIWGALLGACSFHGNVELGEVAAESLFKLEPWN 420

Query: 421 PGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS 459
           PGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS
Sbjct: 421 PGNYVILSNIYASAGDWSGVARVRKMMKGGHIRKRAGCSYIEVGDGIHEFIVEDRSHPKS 480

BLAST of CsaV3_UNG233330 vs. NCBI nr
Match: XP_023527365.1 (pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 696.0 bits (1795), Expect = 8.3e-197
Identity = 358/515 (69.51%), Positives = 392/515 (76.12%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGVDYTKFLIEKLLQIPNLPYACTLFDLIPKPSVFLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF---ACASLFNVYPGQMLFSF----------- 120
           GH HRCWLLY QMC QGCS    H  F+F   ACAS  N YPGQML S            
Sbjct: 61  GHHHRCWLLYYQMCLQGCS--PNHHSFTFLFPACASFLNAYPGQMLHSHFCKSGFASDVF 120

Query: 121 -----------VRHMEAALELFNKMPVRNV------------------------------ 180
                      +  +++A +LF++ PVR++                              
Sbjct: 121 ALTALLDMYGKLGILKSARQLFDEKPVRDIPTXXXXXXXXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 -ISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIE 240
                                      ENEKGTKPNEV+IASVLPAC+ LGALDIGKRIE
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXENEKGTKPNEVTIASVLPACAHLGALDIGKRIE 240

Query: 241 AYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRC 300
           AYAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC
Sbjct: 241 AYARKNGFFKNLYVSNAILEVHARCGNIEEARRVFDEIGSKRNLCSWNTMIMGLAVHGRC 300

Query: 301 IDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGC 360
            DALQLYDQML+++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGC
Sbjct: 301 CDALQLYDQMLMQRTRPDDVTFVGLLLACTHGGMVAKGRQLFESMERKFQIAPKLEHYGC 360

Query: 361 LVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWN 420
           LVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGNVELGEVAAESLFKLEPWN
Sbjct: 361 LVDLLGRAGEIEEAYSLIQSMPMLPDSVIWGALLGACSFHGNVELGEVAAESLFKLEPWN 420

Query: 421 PGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS 459
           PGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS
Sbjct: 421 PGNYVILSNIYASAGDWSGVARVRKMMKGGHIRKRAGCSYIEVGDGIHEFIVEDRSHPKS 480

BLAST of CsaV3_UNG233330 vs. TAIR10
Match: AT5G08510.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 424.1 bits (1089), Expect = 1.1e-118
Identity = 226/508 (44.49%), Positives = 306/508 (60.24%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +   
Sbjct: 1   MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF---ACASLFNVYPGQMLFSFVRHMEAALELF 120
             PH   +LY  +   G      H  F+F   A AS  +  P ++L S         + F
Sbjct: 61  HQPHESIVLYNLLSFDGLR--PSHHTFNFIFAASASFSSARPLRLLHS---------QFF 120

Query: 121 NKMPVRNVISWTALISGYAQNGKYAKALEMFIGLEN------------------------ 180
                 +    T LI+ YA+ G    A  +F  +                          
Sbjct: 121 RSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVWXXXXXXXXXXXXXXXXXX 180

Query: 181 --------------------------------------EKGTKPNEVSIASVLPACSQLG 240
                                                 +K  KPN +++ SVLPAC+ LG
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDKSVKPNHITVVSVLPACANLG 240

Query: 241 ALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMI 300
            L+IG+R+E YAR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI
Sbjct: 241 ELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMI 300

Query: 301 MGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQV 360
             LA HG+  +AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   ++
Sbjct: 301 GSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKI 360

Query: 361 APKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAE 420
           +PKLEHYGC++DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGNVE+ E+A+E
Sbjct: 361 SPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASE 420

Query: 421 SLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEF 443
           +LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F
Sbjct: 421 ALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYFVEVGVDVHKF 480

BLAST of CsaV3_UNG233330 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 312.4 bits (799), Expect = 4.7e-85
Identity = 153/334 (45.81%), Positives = 219/334 (65.57%), Query Frame = 0

Query: 103 LFSFVRHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEV 162
           L+S    +E A  LF ++P ++VISW  LI GY     Y +AL +F  +    G  PN+V
Sbjct: 310 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEM-LRSGETPNDV 369

Query: 163 SIASVLPACSQLGALDIGKRIEAY--ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFD 222
           ++ S+LPAC+ LGA+DIG+ I  Y   R  G    + +  ++++++A+CG+IE A QVF+
Sbjct: 370 TMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFN 429

Query: 223 EIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVA 282
            I  K +L SWN MI G A+HGR   +  L+ +M    ++PDD+TFVGLL AC+H GM+ 
Sbjct: 430 SILHK-SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLD 489

Query: 283 EGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGA 342
            GR +F +M   +++ PKLEHYGC++DLLG +G  +EA  +I  M M PD VIW +LL A
Sbjct: 490 LGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKA 549

Query: 343 CSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRA 402
           C  HGNVELGE  AE+L K+EP NPG+YV+LSNIYA AG W+ VA+ R ++    + K  
Sbjct: 550 CKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVP 609

Query: 403 GYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI 435
           G S IE+   +HEFI+ D+ H ++ EIY +L ++
Sbjct: 610 GCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEM 641

BLAST of CsaV3_UNG233330 vs. TAIR10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 292.4 bits (747), Expect = 5.0e-79
Identity = 168/481 (34.93%), Positives = 269/481 (55.93%), Query Frame = 0

Query: 7   IHAYSLRNGLDHTKFLIEKLLQL----PDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGH 66
           IH+YS+++GL+   F+  KL+ L      L     +FD++    +  +N  I+ +     
Sbjct: 269 IHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQ 328

Query: 67  PHRCWLLYCQM--------CSQGCSRISIHSPFS--FACASL--FNVYPGQML------- 126
           P R   L+ +M        C    S  SI S      AC S+  F +  G  L       
Sbjct: 329 PLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGN 388

Query: 127 -----FSFVRHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTK 186
                ++ +  +++A  +FN +P  +VISW  +ISGYAQNG  ++A+EM+  +E E    
Sbjct: 389 AVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIA 448

Query: 187 PNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQV 246
            N+ +  SVLPACSQ GAL  G ++      NG + + +V  ++ +++ +CG +E+A  +
Sbjct: 449 ANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSL 508

Query: 247 FDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGM 306
           F +I  + N   WNT+I     HG    A+ L+ +ML   ++PD +TFV LL AC+H G+
Sbjct: 509 FYQI-PRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGL 568

Query: 307 VAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL 366
           V EG+  FE M++ + + P L+HYGC+VD+ GRAG+L+ A   I++M + PD+ IWG LL
Sbjct: 569 VDEGQWCFEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALL 628

Query: 367 GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITK 426
            AC  HGNV+LG++A+E LF++EP + G +V+LSN+YA AG W GV  +R +  G  + K
Sbjct: 629 SACRVHGNVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRK 688

Query: 427 RAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKL------HKHVHHDPNEDE 454
             G+S +EV + +  F   +++H    E+Y  L  +   +K+      H+ V  D  +DE
Sbjct: 689 TPGWSSMEVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDE 748

BLAST of CsaV3_UNG233330 vs. TAIR10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 292.0 bits (746), Expect = 6.6e-79
Identity = 168/471 (35.67%), Positives = 263/471 (55.84%), Query Frame = 0

Query: 6   QIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIG 65
           ++H + L NG D   FL  KL+     L  + YA  +FD+  K ++Y++N   +  +  G
Sbjct: 98  RVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAG 157

Query: 66  HPHRCWLLYCQMCSQGCSRISIHSP-FSF-----AC----ASLFNVYPGQMLFSFVRH-- 125
           H      LY +M     +RI + S  F++     AC     ++ ++  G+ + + +    
Sbjct: 158 HGEEVLGLYWKM-----NRIGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRG 217

Query: 126 --------------------MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFI 185
                               ++ A  +F  MPVRNV+SW+A+I+ YA+NGK  +AL  F 
Sbjct: 218 YSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFR 277

Query: 186 GLENE-KGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHAR 245
            +  E K + PN V++ SVL AC+ L AL+ GK I  Y    G      V +A++ ++ R
Sbjct: 278 EMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPVISALVTMYGR 337

Query: 246 CGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVG 305
           CG +E  Q+VFD +   R++ SWN++I    VHG    A+Q++++ML     P  VTFV 
Sbjct: 338 CGKLEVGQRVFDRM-HDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVS 397

Query: 306 LLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMA 365
           +L AC+H G+V EG++LFE+M     + P++EHY C+VDLLGRA  L EA  ++Q+M   
Sbjct: 398 VLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDMRTE 457

Query: 366 PDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLR 425
           P   +WG+LLG+C  HGNVEL E A+  LF LEP N GNYV+L++IYA A  W  V R++
Sbjct: 458 PGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVK 517

Query: 426 KMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIK 440
           K+++   + K  G  ++EV   ++ F+  D  +    +I+A L K+ + +K
Sbjct: 518 KLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMK 562

BLAST of CsaV3_UNG233330 vs. TAIR10
Match: AT1G59720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 290.0 bits (741), Expect = 2.5e-78
Identity = 174/484 (35.95%), Positives = 271/484 (55.99%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDH---TKFLIEKLLQL----PDLPYACTLFDQIPKPSVYLYNKF 60
           M+QLKQ+HA++LR        T FL  K+LQL     D+ YA  +FD I   S +++N  
Sbjct: 61  MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSSFMWNTL 120

Query: 61  IQTFS-SIGHPHRCWLLYCQMCSQGCSRISIHS-PFSF-ACASLFNVYPG-QMLFSFVRH 120
           I+  +  +      ++LY +M  +G S    H+ PF   ACA +F    G Q+    V+H
Sbjct: 121 IRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKH 180

Query: 121 ---------------------MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF 180
                                ++ A ++F++MP R+++SW ++I    + G+Y  AL++F
Sbjct: 181 GFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLF 240

Query: 181 IGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNN---GFFKNAYVSNAVLEL 240
              E ++  +P+  ++ SVL AC+ LG+L +G    A+           +  V N+++E+
Sbjct: 241 --REMQRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEM 300

Query: 241 HARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRK--MRPDD 300
           + +CG++  A+QVF  +  KR+L SWN MI+G A HGR  +A+  +D+M+ ++  +RP+ 
Sbjct: 301 YCKCGSLRMAEQVFQGM-QKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNS 360

Query: 301 VTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQ 360
           VTFVGLL+AC H G V +GRQ F+ M   + + P LEHYGC+VDL+ RAG + EA +++ 
Sbjct: 361 VTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVM 420

Query: 361 NMPMAPDSVIWGTLLGACSFHG-NVELGEVAAESLFKLEPWN-------PGNYVILSNIY 420
           +MPM PD+VIW +LL AC   G +VEL E  A ++   +  N        G YV+LS +Y
Sbjct: 421 SMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVY 480

Query: 421 ALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIY 440
           A A  W+ V  +RK+M    I K  G S IE+    HEF   D SH ++ +IY  L  I 
Sbjct: 481 ASASRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVID 540

BLAST of CsaV3_UNG233330 vs. Swiss-Prot
Match: sp|Q9FNN7|PP371_ARATH (Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E20 PE=2 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 2.0e-117
Identity = 226/508 (44.49%), Positives = 306/508 (60.24%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +   
Sbjct: 1   MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF---ACASLFNVYPGQMLFSFVRHMEAALELF 120
             PH   +LY  +   G      H  F+F   A AS  +  P ++L S         + F
Sbjct: 61  HQPHESIVLYNLLSFDGLR--PSHHTFNFIFAASASFSSARPLRLLHS---------QFF 120

Query: 121 NKMPVRNVISWTALISGYAQNGKYAKALEMFIGLEN------------------------ 180
                 +    T LI+ YA+ G    A  +F  +                          
Sbjct: 121 RSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVWXXXXXXXXXXXXXXXXXX 180

Query: 181 --------------------------------------EKGTKPNEVSIASVLPACSQLG 240
                                                 +K  KPN +++ SVLPAC+ LG
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDKSVKPNHITVVSVLPACANLG 240

Query: 241 ALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMI 300
            L+IG+R+E YAR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI
Sbjct: 241 ELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMI 300

Query: 301 MGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQV 360
             LA HG+  +AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   ++
Sbjct: 301 GSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKI 360

Query: 361 APKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAE 420
           +PKLEHYGC++DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGNVE+ E+A+E
Sbjct: 361 SPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASE 420

Query: 421 SLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEF 443
           +LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F
Sbjct: 421 ALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYFVEVGVDVHKF 480

BLAST of CsaV3_UNG233330 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 8.5e-84
Identity = 153/334 (45.81%), Positives = 219/334 (65.57%), Query Frame = 0

Query: 103 LFSFVRHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEV 162
           L+S    +E A  LF ++P ++VISW  LI GY     Y +AL +F  +    G  PN+V
Sbjct: 310 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEM-LRSGETPNDV 369

Query: 163 SIASVLPACSQLGALDIGKRIEAY--ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFD 222
           ++ S+LPAC+ LGA+DIG+ I  Y   R  G    + +  ++++++A+CG+IE A QVF+
Sbjct: 370 TMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFN 429

Query: 223 EIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVA 282
            I  K +L SWN MI G A+HGR   +  L+ +M    ++PDD+TFVGLL AC+H GM+ 
Sbjct: 430 SILHK-SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLD 489

Query: 283 EGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGA 342
            GR +F +M   +++ PKLEHYGC++DLLG +G  +EA  +I  M M PD VIW +LL A
Sbjct: 490 LGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKA 549

Query: 343 CSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRA 402
           C  HGNVELGE  AE+L K+EP NPG+YV+LSNIYA AG W+ VA+ R ++    + K  
Sbjct: 550 CKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVP 609

Query: 403 GYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI 435
           G S IE+   +HEFI+ D+ H ++ EIY +L ++
Sbjct: 610 GCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEM 641

BLAST of CsaV3_UNG233330 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 292.4 bits (747), Expect = 9.0e-78
Identity = 168/481 (34.93%), Positives = 269/481 (55.93%), Query Frame = 0

Query: 7   IHAYSLRNGLDHTKFLIEKLLQL----PDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGH 66
           IH+YS+++GL+   F+  KL+ L      L     +FD++    +  +N  I+ +     
Sbjct: 269 IHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQ 328

Query: 67  PHRCWLLYCQM--------CSQGCSRISIHSPFS--FACASL--FNVYPGQML------- 126
           P R   L+ +M        C    S  SI S      AC S+  F +  G  L       
Sbjct: 329 PLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGN 388

Query: 127 -----FSFVRHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTK 186
                ++ +  +++A  +FN +P  +VISW  +ISGYAQNG  ++A+EM+  +E E    
Sbjct: 389 AVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIA 448

Query: 187 PNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQV 246
            N+ +  SVLPACSQ GAL  G ++      NG + + +V  ++ +++ +CG +E+A  +
Sbjct: 449 ANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSL 508

Query: 247 FDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGM 306
           F +I  + N   WNT+I     HG    A+ L+ +ML   ++PD +TFV LL AC+H G+
Sbjct: 509 FYQI-PRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGL 568

Query: 307 VAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL 366
           V EG+  FE M++ + + P L+HYGC+VD+ GRAG+L+ A   I++M + PD+ IWG LL
Sbjct: 569 VDEGQWCFEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALL 628

Query: 367 GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITK 426
            AC  HGNV+LG++A+E LF++EP + G +V+LSN+YA AG W GV  +R +  G  + K
Sbjct: 629 SACRVHGNVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRK 688

Query: 427 RAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKL------HKHVHHDPNEDE 454
             G+S +EV + +  F   +++H    E+Y  L  +   +K+      H+ V  D  +DE
Sbjct: 689 TPGWSSMEVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDE 748

BLAST of CsaV3_UNG233330 vs. Swiss-Prot
Match: sp|Q9STF3|PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 1.2e-77
Identity = 168/471 (35.67%), Positives = 263/471 (55.84%), Query Frame = 0

Query: 6   QIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIG 65
           ++H + L NG D   FL  KL+     L  + YA  +FD+  K ++Y++N   +  +  G
Sbjct: 98  RVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAG 157

Query: 66  HPHRCWLLYCQMCSQGCSRISIHSP-FSF-----AC----ASLFNVYPGQMLFSFVRH-- 125
           H      LY +M     +RI + S  F++     AC     ++ ++  G+ + + +    
Sbjct: 158 HGEEVLGLYWKM-----NRIGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRG 217

Query: 126 --------------------MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFI 185
                               ++ A  +F  MPVRNV+SW+A+I+ YA+NGK  +AL  F 
Sbjct: 218 YSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFR 277

Query: 186 GLENE-KGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHAR 245
            +  E K + PN V++ SVL AC+ L AL+ GK I  Y    G      V +A++ ++ R
Sbjct: 278 EMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPVISALVTMYGR 337

Query: 246 CGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVG 305
           CG +E  Q+VFD +   R++ SWN++I    VHG    A+Q++++ML     P  VTFV 
Sbjct: 338 CGKLEVGQRVFDRM-HDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVS 397

Query: 306 LLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMA 365
           +L AC+H G+V EG++LFE+M     + P++EHY C+VDLLGRA  L EA  ++Q+M   
Sbjct: 398 VLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDMRTE 457

Query: 366 PDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLR 425
           P   +WG+LLG+C  HGNVEL E A+  LF LEP N GNYV+L++IYA A  W  V R++
Sbjct: 458 PGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVK 517

Query: 426 KMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIK 440
           K+++   + K  G  ++EV   ++ F+  D  +    +I+A L K+ + +K
Sbjct: 518 KLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMK 562

BLAST of CsaV3_UNG233330 vs. Swiss-Prot
Match: sp|Q0WQW5|PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H51 PE=1 SV=2)

HSP 1 Score: 290.0 bits (741), Expect = 4.5e-77
Identity = 174/484 (35.95%), Positives = 271/484 (55.99%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDH---TKFLIEKLLQL----PDLPYACTLFDQIPKPSVYLYNKF 60
           M+QLKQ+HA++LR        T FL  K+LQL     D+ YA  +FD I   S +++N  
Sbjct: 61  MSQLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSSFMWNTL 120

Query: 61  IQTFS-SIGHPHRCWLLYCQMCSQGCSRISIHS-PFSF-ACASLFNVYPG-QMLFSFVRH 120
           I+  +  +      ++LY +M  +G S    H+ PF   ACA +F    G Q+    V+H
Sbjct: 121 IRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKH 180

Query: 121 ---------------------MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF 180
                                ++ A ++F++MP R+++SW ++I    + G+Y  AL++F
Sbjct: 181 GFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLF 240

Query: 181 IGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNN---GFFKNAYVSNAVLEL 240
              E ++  +P+  ++ SVL AC+ LG+L +G    A+           +  V N+++E+
Sbjct: 241 --REMQRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEM 300

Query: 241 HARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRK--MRPDD 300
           + +CG++  A+QVF  +  KR+L SWN MI+G A HGR  +A+  +D+M+ ++  +RP+ 
Sbjct: 301 YCKCGSLRMAEQVFQGM-QKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNS 360

Query: 301 VTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQ 360
           VTFVGLL+AC H G V +GRQ F+ M   + + P LEHYGC+VDL+ RAG + EA +++ 
Sbjct: 361 VTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVM 420

Query: 361 NMPMAPDSVIWGTLLGACSFHG-NVELGEVAAESLFKLEPWN-------PGNYVILSNIY 420
           +MPM PD+VIW +LL AC   G +VEL E  A ++   +  N        G YV+LS +Y
Sbjct: 421 SMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVY 480

Query: 421 ALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIY 440
           A A  W+ V  +RK+M    I K  G S IE+    HEF   D SH ++ +IY  L  I 
Sbjct: 481 ASASRWNDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVID 540

BLAST of CsaV3_UNG233330 vs. TrEMBL
Match: tr|A0A0A0LY28|A0A0A0LY28_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690140 PE=4 SV=1)

HSP 1 Score: 813.5 bits (2100), Expect = 2.4e-232
Identity = 415/521 (79.65%), Positives = 422/521 (81.00%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF-ACASLFNVYPGQMLFSFVRHMEAALELFNK 120
           GHPHRCWLLYCQMCSQGCS       F F ACASLFNVYPGQML S       A ++F  
Sbjct: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMF-- 120

Query: 121 MPVRNVISWTALISGYAQNGKYAKALEMF------------------------------- 180
                  + TAL+  YA+ G    A ++F                               
Sbjct: 121 -------AMTALLDMYAKLGMLRSARQLFDEMPVRDIPTXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 -------------------------------IGLENEKGTKPNEVSIASVLPACSQLGAL 240
                                           GLENEKGTKPNEVSIASVLPACSQLGAL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLENEKGTKPNEVSIASVLPACSQLGAL 240

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
           DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG
Sbjct: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300

Query: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360
           LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP
Sbjct: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360

Query: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420
           KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL
Sbjct: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420

Query: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 459
           FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE
Sbjct: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 480

BLAST of CsaV3_UNG233330 vs. TrEMBL
Match: tr|A0A2N9IZX4|A0A2N9IZX4_FAGSY (RING-type E3 ubiquitin transferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57682 PE=4 SV=1)

HSP 1 Score: 540.4 bits (1391), Expect = 3.9e-150
Identity = 283/516 (54.84%), Positives = 343/516 (66.47%), Query Frame = 0

Query: 1    MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
            MNQLKQIHAY+LRNG+D+TK LI   LQ+P+L YA  +FD IPKP+V+LYNK IQ +S  
Sbjct: 1053 MNQLKQIHAYTLRNGIDYTKTLIVNSLQIPNLSYARKVFDLIPKPTVFLYNKLIQAYSCH 1112

Query: 61   GHPHRCWLLYCQMCSQGCSRISIHSPFSF-ACASLFNVYPGQMLFSFVRHMEAALELFNK 120
            G  ++C  LY QMC QGC        F F  CAS  ++  GQML +    +++  E    
Sbjct: 1113 GQYYQCMSLYSQMCIQGCPPNQHTFTFLFVTCASHSSLRHGQMLHA--HFVKSGFEF--- 1172

Query: 121  MPVRNVISWTALISGYAQNGKYAKALEMF------------------------------- 180
                +V + TAL+  YA+ G  A A + F                               
Sbjct: 1173 ----DVFALTALVDMYAKLGMLASARQKFDEIKVKDIPTWXXXXXXXXXXXXXXXXXXXX 1232

Query: 181  -------------------------------IGLENEKGTKPNEVSIASVLPACSQLGAL 240
                                              E EK  +PNEV+IAS+LPAC+ LGAL
Sbjct: 1233 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKEKDVRPNEVTIASILPACANLGAL 1292

Query: 241  DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
            ++G+RIEAYAR NGFFKN+YV+NAVLE++ARCG I+ A  VFDEIG +RNLCSWN+MIMG
Sbjct: 1293 EVGERIEAYARRNGFFKNSYVANAVLEMYARCGKIDVAWHVFDEIGRRRNLCSWNSMIMG 1352

Query: 301  LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360
            LAVHGRC +AL+LYD+ML     PDDVT VGLLLACTHGGMV +GRQLFESME+   + P
Sbjct: 1353 LAVHGRCNEALELYDKMLGEGNAPDDVTLVGLLLACTHGGMVVKGRQLFESMETNLHITP 1412

Query: 361  KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420
            KLEHYGC+VDLLGR GELQEA++LIQNMPM PDSV+WG LLGACSFHGNVEL E+AAESL
Sbjct: 1413 KLEHYGCMVDLLGRCGELQEAFDLIQNMPMKPDSVVWGALLGACSFHGNVELAEIAAESL 1472

Query: 421  FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 454
            FKLEPWNPGN+VILSNIYA AG W GVA+LRK+MKGG ITK AGYS+IE G  IH+FIV 
Sbjct: 1473 FKLEPWNPGNFVILSNIYASAGQWDGVAKLRKLMKGGQITKAAGYSFIEEGGQIHKFIVG 1532

BLAST of CsaV3_UNG233330 vs. TrEMBL
Match: tr|A0A2I4GS89|A0A2I4GS89_9ROSI (pentatricopeptide repeat-containing protein At5g08510 OS=Juglans regia OX=51240 GN=LOC109010380 PE=4 SV=1)

HSP 1 Score: 536.6 bits (1381), Expect = 5.6e-149
Identity = 278/505 (55.05%), Positives = 342/505 (67.72%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAY+LRNG+DH K LI  LL +P+L YA  LFD IP P+V+LYNK IQT+SS 
Sbjct: 1   MNQLKQIHAYTLRNGIDHAKTLIVGLLNIPNLSYARKLFDLIPNPTVFLYNKLIQTYSSH 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSFA-CASLFNVYPGQMLFSFVRHMEAALELFNK 120
           G  +RC  LY QMC QGC        F FA CA+L +   GQML +    +++  E    
Sbjct: 61  GQYYRCMSLYSQMCLQGCPPNQHTFTFVFATCAALSSPCHGQMLHT--HFVKSGFE---- 120

Query: 121 MPVRNVISWTALISGYAQNGKYAKALEMFIGLE--------------------------- 180
               +V + TAL+  YA+ G  A A + F  ++                           
Sbjct: 121 ---SDVFALTALVDMYAKLGMLASARQKFDEIKVRDIPTXXXXXXXXXXXXXXXXXXXXX 180

Query: 181 -----------------------------------NEKGTKPNEVSIASVLPACSQLGAL 240
                                               EK  +PNEV+IASVLPAC+ LGAL
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKEKDMRPNEVTIASVLPACANLGAL 240

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
           ++G+RIEAYAR NGFFKN YVSNAVLE++ RCG I+ A +VFDEIG  R+LCSWN+MI+G
Sbjct: 241 EVGERIEAYARKNGFFKNLYVSNAVLEMYVRCGKIDIAWRVFDEIGGCRSLCSWNSMIVG 300

Query: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360
           LAVHG+C +AL+LYDQML   + PDDVTFVGLLLACTHGG+V +GRQLF+ M + F +AP
Sbjct: 301 LAVHGQCNEALELYDQMLREGIAPDDVTFVGLLLACTHGGLVIKGRQLFQLMLTNFHIAP 360

Query: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420
           KLEHYGC+VDLLGR+G+LQEAY+LI+ MPM PDSV+WG LLGACSFHGN+EL E+AAESL
Sbjct: 361 KLEHYGCMVDLLGRSGDLQEAYDLIKGMPMKPDSVVWGALLGACSFHGNIELAEIAAESL 420

Query: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 443
           F+LEPWNPGNYVILSNIYA AG W+GVA+LRK+MKGG +TK AGYS+IE G  IH+FIVE
Sbjct: 421 FQLEPWNPGNYVILSNIYASAGQWAGVAKLRKLMKGGLVTKAAGYSFIEEGGQIHKFIVE 480

BLAST of CsaV3_UNG233330 vs. TrEMBL
Match: tr|A0A2P5DQ78|A0A2P5DQ78_PARAD (Tetratricopeptide-like helical domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_040790 PE=4 SV=1)

HSP 1 Score: 525.4 bits (1352), Expect = 1.3e-145
Identity = 277/520 (53.27%), Positives = 350/520 (67.31%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAY+LRNG+D+T  L+ KLL++P++PYA  LFD IPKP+V+LYNK I+ +S  
Sbjct: 27  MNQLKQIHAYTLRNGIDYTDTLVLKLLEIPNIPYAHNLFDLIPKPTVFLYNKLIKAYSFH 86

Query: 61  GHPHRCWLLYCQMCSQGC--SRISIHSPFSFACASLFNVYPGQMLFSFVRHMEAALELFN 120
           G  H+C  LY +M  Q C  +  S    FS ACAS+ +   GQM+ S  R +++ L L  
Sbjct: 87  GQHHQCLSLYTRMSLQRCVPNERSFTLLFS-ACASISSPRLGQMIHS--RFVKSGLAL-- 146

Query: 121 KMPVRNVISWTALISGYAQNGKYAKALEMFIGLE-------------------------- 180
                +V S TAL+  YA+ G  A A + F  +                           
Sbjct: 147 -----DVFSETALVDMYAKLGMLACARQQFDEMRVRDXXXXXXXXXXXXXXXXXXXXXXX 206

Query: 181 ------------------------------------NEKGTKPNEVSIASVLPACSQLGA 240
                                                E+  KPNEV+IASVLPAC+ LGA
Sbjct: 207 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERDVKPNEVTIASVLPACANLGA 266

Query: 241 LDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIM 300
           L+IG+RIE Y+R +GFFKN++V NA+LE++ARCG I+ A++VFDEIGS+RNLCSWN+MIM
Sbjct: 267 LEIGERIEEYSRRSGFFKNSHVGNAILEMYARCGKIDIARRVFDEIGSRRNLCSWNSMIM 326

Query: 301 GLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVA 360
           GLAVHGRC +AL LY+QML   +RPDDVTFVGL+LACTHGGMV++G QLF SME  F +A
Sbjct: 327 GLAVHGRCREALNLYEQMLTVGIRPDDVTFVGLILACTHGGMVSKGHQLFRSMEPNFSIA 386

Query: 361 PKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAES 420
           PKLEHYGC+VDLLGRAGEL+EAY+LIQ+MP+ PDSVIWG LLGACSFHGN++  E AAES
Sbjct: 387 PKLEHYGCMVDLLGRAGELEEAYDLIQDMPIKPDSVIWGALLGACSFHGNIKFAEKAAES 446

Query: 421 LFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIV 457
           LF+LEPWNP NYVILSNIYA  G W GVA+LRK+MKGG ITK AGYS+IE    +H FIV
Sbjct: 447 LFELEPWNPANYVILSNIYASGGRWDGVAKLRKVMKGGKITKAAGYSFIEERGQVHMFIV 506

BLAST of CsaV3_UNG233330 vs. TrEMBL
Match: tr|F6I6G7|F6I6G7_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_15s0046g00030 PE=4 SV=1)

HSP 1 Score: 524.2 bits (1349), Expect = 2.9e-145
Identity = 279/516 (54.07%), Positives = 342/516 (66.28%), Query Frame = 0

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN+LKQI AY+LRNG++HTK LI  LLQ+P +PYA  LFD IPKP+V+LYNK IQ +SS 
Sbjct: 1   MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 61  GHPHRCWLLYCQMCSQGCSRISIHSPFSF-ACASLFNVYPGQMLFS-FVRHMEAALELFN 120
           G  H+C+ LY QMC QGCS       F F ACASL +   G+ML + FV+          
Sbjct: 61  GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGC----- 120

Query: 121 KMPVRNVISWTALISGYAQNGKYAKALEMF------------------------------ 180
                +V + TAL+  YA+ G  + A + F                              
Sbjct: 121 -----DVFALTALVDMYAKLGLLSLARKQFDEMTVRDVPTXXXXXXXXXXXXXXXXXXXX 180

Query: 181 --------------------------------IGLENEKGTKPNEVSIASVLPACSQLGA 240
                                                E   +PNEV++ASVLPAC+ LGA
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEETEMRPNEVTLASVLPACANLGA 240

Query: 241 LDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIM 300
           L++G+RIE YAR NG+FKN YVSNA+LE++ARCG I++A  VF+EI  +RNLCSWN+MIM
Sbjct: 241 LEVGERIEVYARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIM 300

Query: 301 GLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVA 360
           GLAVHGRC +A++L+ +ML     PDDVTFVG+LLACTHGGMV EG+  FESME  F +A
Sbjct: 301 GLAVHGRCDEAIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIA 360

Query: 361 PKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAES 420
           PKLEHYGC+VDLLGRAGEL+EA++LI  MPM PDSV+WGTLLGACSFHG+VEL E AA +
Sbjct: 361 PKLEHYGCMVDLLGRAGELREAHDLILRMPMEPDSVVWGTLLGACSFHGHVELAEKAAGA 420

Query: 421 LFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIV 453
           LF+LEP NPGNYVILSNIYA AG W GVARLRK+MKGG ITK AGYS+IE G  IH+FIV
Sbjct: 421 LFELEPSNPGNYVILSNIYATAGRWDGVARLRKLMKGGKITKAAGYSFIEEGGHIHKFIV 480
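
The identity and positives percentages printed in each HSP summary line above are simple ratios over the alignment length, so they can be rechecked from the raw counts. The following is a minimal sketch in Python, assuming the summary line keeps the exact layout used in this report; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as printed in this report.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling
# that some renderings of this page use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper: returns the raw counts, the percentages
    recomputed from them, and the percentages as reported in the line.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 279/516 (54.07%), Positives = 342/516 (66.28%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Comparing `identity_pct` with `reported_identity_pct` for each hit is a quick way to catch a line that was damaged during copying.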

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_011660274.1  3.6e-232  79.65         PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativu...
XP_022979296.1  6.8e-215  81.17         pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Cucurbita maxi...
XP_022924285.1  1.3e-210  79.65         pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Cucurbita mosc...
XP_022979295.1  3.4e-198  70.10         pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxi...
XP_023527365.1  8.3e-197  69.51         pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pep...

Match Name   E-value   Identity (%)  Description
AT5G08510.1  1.1e-118  44.49         Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1  4.7e-85   45.81         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33990.1  5.0e-79   34.93         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G46790.1  6.6e-79   35.67         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G59720.1  2.5e-78   35.95         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name             E-value   Identity (%)  Description
sp|Q9FNN7|PP371_ARATH  2.0e-117  44.49         Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana OX...
sp|Q9LN01|PPR21_ARATH  8.5e-84   45.81         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|O81767|PP348_ARATH  9.0e-78   34.93         Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...
sp|Q9STF3|PP265_ARATH  1.2e-77   35.67         Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
sp|Q0WQW5|PPR85_ARATH  4.5e-77   35.95         Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0LY28|A0A0A0LY28_CUCSA  2.4e-232  79.65         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690140 PE=4 SV=1
tr|A0A2N9IZX4|A0A2N9IZX4_FAGSY  3.9e-150  54.84         RING-type E3 ubiquitin transferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57682...
tr|A0A2I4GS89|A0A2I4GS89_9ROSI  5.6e-149  55.05         pentatricopeptide repeat-containing protein At5g08510 OS=Juglans regia OX=51240 ...
tr|A0A2P5DQ78|A0A2P5DQ78_PARAD  1.3e-145  53.27         Tetratricopeptide-like helical domain containing protein OS=Parasponia andersoni...
tr|F6I6G7|F6I6G7_VITVI          2.9e-145  54.07         Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_15s0046g00030 PE=4 SV=...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding

Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0009451      RNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CsaV3_UNG233330.1  CsaV3_UNG233330.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source       Source Term       Source Description   Coord     E-value  Score
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10  -                    226..406  2.4E-29  104.7
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  SSF48452          TPR-like             57..258   -        -
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                  200..224  0.045    13.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                  301..325  0.22     11.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            48..79    0.0014   16.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            126..160  3.9E-5   21.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            230..262  3.3E-6   24.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                123..171  2.4E-7   30.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                227..273  5.6E-8   32.7
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  227..261  -        10.424
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  46..80    -        7.991
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  298..328  -        6.73
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  124..159  -        10.468
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  195..225  -        8.122
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  262..292  -        7.815
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  330..360  -        5.119
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  364..398  -        5.821
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  160..194  -        5.382
None       No IPR available                                   PANTHER      PTHR24015:SF148   SUBFAMILY NOT NAMED  107..428  -        -
None       No IPR available                                   PANTHER      PTHR24015         FAMILY NOT NAMED     107..428  -        -
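
The PROSITE PS51375 hits above are listed out of positional order, which makes it hard to see how the repeats are arranged along the protein. The sketch below, in Python, sorts the reported coordinates and derives each repeat's length and the gap to the next repeat. It assumes the `coord: start..end` values are 1-based, inclusive residue positions, which this page does not state; the helper name `parse_coords` is hypothetical.

```python
import re

# "coord: start..end" as printed in the InterPro table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return all (start, end) pairs found in text, sorted by start."""
    return sorted((int(a), int(b)) for a, b in COORD.findall(text))


# PROSITE PS51375 (PPR) coordinates copied from the table, in page order.
PROSITE_PPR = """
coord: 227..261
coord: 46..80
coord: 298..328
coord: 124..159
coord: 195..225
coord: 262..292
coord: 330..360
coord: 364..398
coord: 160..194
"""

repeats = parse_coords(PROSITE_PPR)

# Assumption: coordinates are inclusive, so length = end - start + 1.
lengths = [end - start + 1 for start, end in repeats]

# Residues between the end of one repeat and the start of the next.
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(repeats, repeats[1:])]

if __name__ == "__main__":
    for (start, end), length in zip(repeats, lengths):
        print(f"{start:>4}..{end:<4} length {length}")
    print("gaps between consecutive repeats:", gaps)
```

Under the inclusive-coordinate assumption, most of the predicted repeats sit directly next to one another, with the largest gap between the first repeat (46..80) and the second (124..159).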

The following gene(s) are orthologous to this gene:
Gene             Orthologue    Organism                   Block
CsaV3_UNG233330  CSPI01G33460  Wild cucumber (PI 183967)  cpicucB027
CsaV3_UNG233330  Cucsa.291540  Cucumber (Gy14) v1         cgycucB452
CsaV3_UNG233330  CsGy1G032120  Cucumber (Gy14) v2         cgybcucB024

The following gene(s) are paralogous to this gene:
Gene             Paralogue       Organism                    Block
CsaV3_UNG233330  CsaV3_1G045420  Cucumber (Chinese Long) v3  cuccucB016

The following block(s) cover this gene:

None