Sgr029763 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029763
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153449: 2671216 .. 2679622 (-)
RNA-Seq Expression: Sgr029763
Synteny: Sgr029763
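
The Location field can be turned into a contig, a coordinate pair, and a strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for GFF-derived displays, but not stated on this page). The names `parse_location` and `span_length` are hypothetical helpers introduced for illustration.

```python
import re

# Assumed layout of the Location field: "<contig>: <start> .. <end> (<strand>)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153449: 2671216 .. 2679622 (-)")
print(contig, strand, span_length(start, end))  # tig00153449 - 8407
```

Under that assumption the gene spans 8407 bp on the minus strand of tig00153449.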
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAGGAATGGATCGATTAAGCATCTCATGGGTGACCTTTTATTTCTTGATTCATCACCTTTTGCCAAGCTCTTGAATCGGTGTGCTCGCTCGACGTCAGCTAGAGACACGAGTTGTGTACATGCTTGCATAATTAAATCACCCTTTGCGTCTGAAATTTTTATCCAAAATAGGCTCATTGATGTATATGGTAAATGTGGATGTGTGGATGTTGCTCGCAAGTTGTTTGATAGATTGCTTGAGAGAAATATTTTCTCTTGGAACTCCATCACTTGTGCATTCACTAAGTCCGGATTTCTTGATGACGCTGTCCACATCTTTGAGAAGATGCCTGAAGTTGACCAGTGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACACAATCGCTTTGATGAAGCTTTAAATTATTTTGCTCAAATGCATAGTCATGGTTTTTTGATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATCTAAAATTGGGTTCCCAAATCCACAGTTTAATATATCGGTCAAATTATTTATCAGATGTGTATATAGGCTCTGCCCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGACTGTGCTTATAGTGTTTTTTATGGAATGACTGCAAGAAGTAGAGTTTCCTGGAACAGCTTGATTACGTGTTATGAACAGAATGGTCCAGTTGATGAGGCTCTTGTTATTTTTGTTGAGATGATCAAATGCAGGGTTGAACCTGATGAGGTAACGCTTGCTAGTGTGGTTAGTGCATGTGCAACTATCTCGGCGATCAAAGAAGGTCGGCAGATTCATGCTCGTGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCATTGGTTGATATGTATGCTAAATGTAAAAGGATCAACGAGGCTAGAATGGTTTTCGATCGGATGCCAATTAGGAATGTGGTGTCTGAAACCTCAATGGTAAGTGGGTATGCGAAAGCATCAAGTGTTAAAGCTGCAAGATATATGTTTTCAAATATGATGGTGAAAGACGTAATTACATGGAATGCACTTATTTCAGGGTGTACACAGAATGGAGAGAATGAAGAGGCACTTATACTCTTCTGTGATTTGAAGAGGGAGTCTGTTTGGCCTACACACTACACCTTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCAACAGGCTCACTCTCATGTTATAAAGCATGGATTTCGATTCCAATATGGAGAAGAGTCGGATATTTTTGTTGGGAATTCTCTGATAGATATGTATATGAAATGTGGATCAGTTGAGAATGGTTGTCAGGTATTTGAACATATGGTGGAAAGGGATTGTGTCTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAATGAGGCCCTTGGAGTTTTCAATAAAATGTTAGAATTGGGAGAGAAACCAGATCATGTCACAATGATTGGTGTTCTTTGCGCTTGTAGTCATGCTGGGCTGCTTGATGAAGGTCGTCATTACTTTCGATCAATGAGTGCACAACATGGTTTGGTGCCATTAAAGGACCATTATACATGTATGGTTGATTTACTTGGCCGAGCTGGCTGCCTTGAAGAAGCAAAAAATCTAATAGAGGAAATGCCGATGCAGCCTGATGCTGTTGTCTGGGGATCCTTGCTTGCTGCTTGTAAAGTCCATCGGAACATCGAATTGGGGGAATATGTAGTCGAGAAGCTTTTAGAGGTAGATCCCGAGAATTCTGGGCCATATGTTCTTCTCTCGAATATGTATGCTGAACGTGGAAATTGGGGGAATGTTGTGAGAATAAGAAAGCTGATGAGAAAGAGAGGAGTGATTAAACAACCAGGTTGCAGTTGGATTGAAATTCAAGGTCAGTTGAATGTTTTTATGGTTAAGGATAAAAGGCATTCAAAGAAGAAAGAAATCTACATGCTTTTGAGAACACTTTTACAACAGAT
GAAACGAGCTGGATATGTCCCATATGTTGGCAGCAATGAGATTGATGAAGAACAGCAGGAAGAACACGACATATCCTCATCTTACCAAATTAAAATGCCAGATACAGGTGACTAGATTCCTCTAGCTAATCTAAGTAAGACTAGATTTTAAGGATATTGATAATCTAAGTAAGAATAGATTTTAAGGATATTGGGAAAAGTCAAACCACTGTCATATGTTCATTTTTTTTAGCTAAAATTCGAAGTAAAGAAGACATAGATTCCTCTAGCTTTTTAATCAATACATTGTTAGGCTAGATTCTTAAATTCTATATTGGCTCTAATATATTCTCATATACTTCCATTGTGTCCAATAACGTGAAAATATATCTCGGTTACCCCCTATTTGTCTATCATATTTGTTTTCATTTAGGATAATAGTATTCTTTCAAATAATTTTTTTTTTATAGAAAATTGAACTTTCTTTCAGAAAAAATGAAAGAATACAAAGGGGCATACAAAAAGACAGCTCAACAGAAGGAGCCAAAACAAAGTCAATATTCTTTCAAATAATAAAACTACATGGTCTCAATTCAATCATGTAACGATCAGTTGTGAGAAAATGTCCCTTTTTTGTACCTTTTTTATTAATGATCCATGTTATGTATGCACTTGGGGAAAAGCATAAATCACTTGCTTCCTCATTGCGGATGGGTCTCAATCAATCATGTAAACGGCTTAGATCTTTTCATTTTGTTTGGGTTTTGCAAACGTTTGAGGATGGTATTTTCCTTGATTTTAAAGGGAAAATTGAGGTGATAGTACTATATCCGAATTGCCTGTGATTTTGGGAGATTTAAGAAGAAACAATTGGAAACATCTTTAAATATAGAATATAGGGTTGGTTGTGTAGATAACTTGTGGGAGAATTATTTAATAGCTTCATTATGACTTCCAAATGAAAAAATTTGTAGCATACTAATATGTCCCAACTTATTTTAGTAAACTGTATTCTTCTCATTATTTATGTTACTAGATTTTGAGATGAGAAGAAAATTATATTATTTCTCTTATAATAAAATGAAATACAAAGTCTTCTAATACAGGAGAAAGACCATCTACTAATAAAGAAAATCTAAACATAAATATAGACAACTAATATTTACATCATGAAACTCCTCAACCTTGTCGGAACCTATGAACTCGCAGTAGAAATAACGTGACTGAAAGTCATAGAAGATTACCAGAGACTTGTTGTGGATGACAGTAAGGAACATGGTTTTGGTGGTTAAGGTTTGGCGGTAGTGGCGACGACTAGGGCTTTTAGTGGCAGCTAAGTTTCTAGTAGCAACCAGGATTTTTCTGCGCTGATACCAACTAGATTTGAGAAGAGAAGAAAACTATATCATTTCTCTTGTAATGAAAGTAAATTCAAAGTTTTCTTGTATACCAGAAAAACCATCTACCAATAAGAAAAACCTAAAAAATAAAGATAACGTAATATTAACGTAAATATGAACAACTAATATTTACATTAAATATAAATACTAATCTAACAAAATATGCATAAAATCCAACCATGTTAGTAAAATTTGTTTCTTATTGGAGAAAACAAATTGAAACAACAATTAGCAGACAATATTTGCTTGAAATAATGAATTTGGTAGTCGGACACCCAGATAACTGATTAGATTCTCCCCCTTTGTGCTATGTTCCTTACTACTTGTACCATTTCCAACACAGAAGGAAAGTTGGTTAGAGATGTTTGGAGTGAAACCACTCAAACTTGGAACTTATGCCTAAGAAGGAATCTCTTCTCGTCGAGAGAGAGTCAAAAGAAGGGGCTTGGTTTTCTCTAAAGATGGAGCTTGTTTTTGTTAAACAGAACGATGATAAAGCTTGCTGGAAATTTGAGAGTTCAGGTCAATATTCCTGCAAATTTGCAGACTTCAACTAATCAGATCAAGAATCTTGCTAATTTACAATCTTCAACTAAATCAGATCGAGATTCTTGCAAATTT
GTAATCTTCAACTAAATCAGGGTGAAGGAAGTTGGATCAGAATTTGGGCAAGTCGATTTGGGATAAAAAATTTCTGGAAACAGTAAGAAAAAATTTACTTTGTACATTGGCTGGCAATAGCCTCATATTCCCTTCACAGATTTTGTATTTATGTTTTTTTTTCTCCCTTTTTAATGCAACTTTGTTTCTTATAAAAAAGTCTGTCATCGAGTGTTTTGCAACCATGTAGATTTTGCTTGTCTGCTAGGAGAAGTATCATCTTTTAACTTGCAGCTCAGATTTAATACATTGGACTTTTTGGTGAGACTTAGAGGGAAAATATTGATGCTTGATGGAGATTCTATGAACCAAAACCAGTTTGAGTCAATCCTTCGTCTCCTTAAAGAAGGCCTTCAAGATAAGAGGACAATGTATGAGACACAGGGCTCCAAAATAACTAAGGGAAGTGGTTACTTTGTCTTCAGTTTCAAGGTAATAAAAAAGATTGCGATAATATTACTTCTTTAGGTTAACATAGATTACTTATCATGAATATATTCTTCTGTGGAGGAATGATTCAATGTTAACTACGATTATAGTAGTGTATTATATTTAAATATTGTTTGTTACAGGTCTTCGGTATTCAAGAAACAAATGGTGCAGGATGAGTTTTCGCGAGGTTATGACTTTCACCTTGCTGGGGTTTAATTTGGTGAAATGGTTCATTTGCATTTTTCAGAACTTTATACTACTTATATAAGAATGTTCCTGAATGTTCTTGAGTTTGAATCTGTTTGATTTATTTCAAGTTTTTCTTGCTTCTGTTGAACACATGTTTATATAGGTGCTTATTAGCACCACAACAGTATTTATTTCCTATGCTTTATTCATGACAACTTTTACTATAATAATGAAGTACAATCACTTCATTCAATTCAGCTTTTGTTACTACTGATAAGTAAATACCTTATCTATCATTTGGCTAAATATGTGCTGTTGGTCACTGCGAATCGTCTGATTTAGATTTGTGATTTAAGGTGATGCATTTTCAAAGTTGGGGCAAATTCTGGAGGAAATTTTTACAAGAAAACTATGGGTAGTTTGATTTTCGGGCTGAAATAGTGTTTACTTGTGGAGAAATATGCTATTTTATTATCTTTCCTATAAACTAGACTCAAACATGAAATATGTAGAAATGAGAATCACAGAATTTGGATGTGTAGGAAAAAAAGATAATAATCTTAGAAGTTACTGCAGAATTTTGAAAATTCTGCAGAATCCGGAAAATCCTACAGAGTCATCCAAATTATCATCTAATTTTGAGCTAAAAAGTCTACCCAGACTTTAAAATATGATGAGTTTATGATATTTAGTAAAAGAATGAAGTCATTTGGAGCTTTACCCAAAAAGTTGTGCACTTATTAAGATTAGGTCTTGGAAACTACAAATTGTCCAAAATGAAAAACTTTGGTCGTTAAATGCTAAAGATTTGTTATTCTAGCATTTTAAGCAAGAAATGCTTAAACACGCTTAAAAATGCATAAGAAAGTTGTCCAAGTTTCCTGTGTAGGATTTTCATCCGTGAAGTGTGAATTTCTGCTTCTTATTTTTAACTTATGGTTGTCTCGTATAAAAAGTAAGTTAAACGATATTTTGTCTATTTTTTCCTAAATTTATGATAATGTAAGTGAATTCATATGGTCTCTACTAAAGTATTGTTAGAAATTAATTCTCTCTTGTTACAAACAATTATTCCAGTTATCTTACTATTGTGCGTAATGACATTGTTTATAGGCTAAATATATCAAATGGTTCCTAATTTTTTAGGTTATTCTCAATTTGGTCTTTAATCCTTTAAAATTCTCAATTTTATCCCTAATATATAGAAAAATTCTCAATTATGTCCTTCCATAAAGAAATTGTTAAAATTTATTAACAAAAAATTGATATATCATCAAATTATTGTTTTATTAACATATTTGGACTCATGAAACCTAGGGGTGTTCAAATAACCCGACAAC
CCGAAAAAACCGAACGACCCAACCCAAACCGTAAGGGTTGGGTTGGGTTGGGTTTAGTAATTTTTCGGTTTGGGTTGGGTTTTTTTTTTTGCAGAACCGAATTTTCGGTTCGGTTCGCGGGTTAGAAAAAATTGGGGTCGGTTCAACCCGAACCAAACCGGTTATAACATATTAAATAAAAAATATATTTTTTTACTTAAGTTAGTGGGTTGGGCTGCACTCCTACTCCACCTTCCCCTCCTCCACGTGTATTTCATGCCTTGCCCATTCTCTCTCCTCAAACTTTTTCACACTTGAAGGTAGGGTTTTACATCTTTTTCTCTCTTTTCCTTCTCCCACTCTCCAGCCGCCTCTCTTTCACTTTCTCTCTCTCTTCCTCTTTCTCCCAATCTCCGCTCTCTCTCTCTTTCTCTTCTCCTCCTCCCAATCTCCAACCAGCTCTCTCTCTCTCTCTCTCTTTCTCTCTTCTCCTTCCTCCAATCTCCAGCCAGCTACATATTTTTCTTGCTCTTCTTTCCTTCGCCCAATCTCCAGCCACCTAAATCTTCATATCTCTACTCTTGATTATCAACTTGCAATTAAATTTTGTTTTTCAAATTTTATTTTAGATTTTGATCAATCAAACACACATTTCACAAGATATATCAGCAAAGAACAATGTAAAGGAAAATGTACCTCTTTCACAATGCGTATATCAGCAAAAATTTCGAAGATGAATATATTAGCAAGGTAGGGTAAAAGAAAATCAAAGGAAATAAAAGGTGAAATGAGACAATCGAGTTATAGAATGATAAATTGAAATAAATAGAAGAAATTAGGGTAAACGACAATAGACAAATCACAAGAAAACATAAGAAATTAGGATAAAAATCGAAAGAGAAATAATAATAAGTAAGGGTAAACGTGTTTAAGGAGAAGAAAAAGGTTCCGTGCAGTTGAAGTTATTGATATACCATTTATGTGGTAAAATATATCACTGATATACATATATATAGTGTATCACTTATATATTATTTTTGAATCATCAATAAAGTATATCACCAATAAACCTAAGGGTATTTTAGACTTTTTACATATTCTATTTGGGCTGGGCTAATTTTTTTTGCTATATTTGCAATTTTTGTAATATGAGTGCTAAATTTACTATTTATTTGGTTTTTTTTTTTGCTAGATTTGCAAGTGCCCCTATAAAAAATAGAATATTTAAAAAAGGAAAAGTTTTATTTTTTTGGAAAAAAATCAATTTATACCCCTGAACTTTATAGTTTTAGTCAATTAAAACCATAAACTAATAATTTCATCAATGAACATCCTGAACTTTGTAAGTTGAGTCAATTAAAATCCTAAACTATTAATTATCAATTAAACCTCTAAATTTGTCTTAATTTATATTCTATGTTAAATTTTGTTAAAAGACTATTAGTTAAATTGGCAGTGCACATGCGAAGGCTGTCATTTTATATTTTATATTACAAAAGAATGAAAAAAACGGTCAAATGCATGTTGCTTTTGCCTTGTTTATATTTAACTTAATATAGAAAATCATCGGGAGAAGACTCAGAAAAAGTGTCGAAACAATTTGAGAAATCGAATTTCAAAAGTGAAAAAGTCATAGCGGGTGTAACTCTTGAGGTAGCTCCACAAACAGAGCTAGAGATGAAAAAGCCTCTCTGACGAGGACAAAATATAAAATATAAAACAATAGCTTTCACATGTGCGCCCCAATTTAAATAACAGTTTTCTAATAGAATTAACGAAAAGAATAAATTGACATAAGTTTAAGATTTTAATTGATAATTATTAGTTTAAGATTTTAATTGATAAAATTATTAGTTTAGAATTTTAATGGACTCAACAAAGAGTATAAATTGATTTTGTTTCCTTTTTTTTTTTTTTTTAAACTGCTACTACCGTATTCTTCCTAAAAAAATTCGAACACTTCTTAAAAAAACTAGAACATTTTAATAAGAATTGAAGTTTAGCCCAATGAACAATTGCCA
CAATCCTACTCGCCCCCTTTCTTTCTCCCATCCTGCGTTCTGCTTTGATTTCTCCTCCCCTTTTTGCCGCCGCCGCACGCCAGCATACCCCCACTGGCCCACACCCTTTTCTTTCTTCTCTCCTACTCTCCCTCTCTGCCGCAGCCGCACGCCGGCACGCCCCCACCGGCCCACCCCATTATCTTTCTTCTCTCCCAATCTCCCTCTCTGCTGAAGCCGCACGCCGGCTCGCCAACCCCCGTGTTCTGCCACAGCCGCCCGACCCCCACCGCTTTCTTCTGCAGCGGCTCGTCGTCGAGGTCCACGCTCAGTCTTCCGCTGTCCCTACCATCTCTCAGTCGCTCGCTGTCGAGGTCCGCACGCTCTCTCTCTCAGTGGAACTGCAGCGTCTCTTCCGGCATTCTTGA

mRNA sequence

ATGGCAAGGAATGGATCGATTAAGCATCTCATGGGTGACCTTTTATTTCTTGATTCATCACCTTTTGCCAAGCTCTTGAATCGGTGTGCTCGCTCGACGTCAGCTAGAGACACGAGTTGTGTACATGCTTGCATAATTAAATCACCCTTTGCGTCTGAAATTTTTATCCAAAATAGGCTCATTGATGTATATGGTAAATGTGGATGTGTGGATGTTGCTCGCAAGTTGTTTGATAGATTGCTTGAGAGAAATATTTTCTCTTGGAACTCCATCACTTGTGCATTCACTAAGTCCGGATTTCTTGATGACGCTGTCCACATCTTTGAGAAGATGCCTGAAGTTGACCAGTGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACACAATCGCTTTGATGAAGCTTTAAATTATTTTGCTCAAATGCATAGTCATGGTTTTTTGATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATCTAAAATTGGGTTCCCAAATCCACAGTTTAATATATCGGTCAAATTATTTATCAGATGTGTATATAGGCTCTGCCCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGACTGTGCTTATAGTGTTTTTTATGGAATGACTGCAAGAAGTAGAGTTTCCTGGAACAGCTTGATTACGTGTTATGAACAGAATGGTCCAGTTGATGAGGCTCTTGTTATTTTTGTTGAGATGATCAAATGCAGGGTTGAACCTGATGAGGTAACGCTTGCTAGTGTGGTTAGTGCATGTGCAACTATCTCGGCGATCAAAGAAGGTCGGCAGATTCATGCTCGTGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCATTGGTTGATATGTATGCTAAATGTAAAAGGATCAACGAGGCTAGAATGGTTTTCGATCGGATGCCAATTAGGAATGTGGTGTCTGAAACCTCAATGGTAAGTGGGTATGCGAAAGCATCAAGTGTTAAAGCTGCAAGATATATGTTTTCAAATATGATGGTGAAAGACGTAATTACATGGAATGCACTTATTTCAGGGTGTACACAGAATGGAGAGAATGAAGAGGCACTTATACTCTTCTGTGATTTGAAGAGGGAGTCTGTTTGGCCTACACACTACACCTTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCAACAGGCTCACTCTCATGTTATAAAGCATGGATTTCGATTCCAATATGGAGAAGAGTCGGATATTTTTGTTGGGAATTCTCTGATAGATATGTATATGAAATGTGGATCAGTTGAGAATGGTTGTCAGGTATTTGAACATATGGTGGAAAGGGATTGTGTCTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAATGAGGCCCTTGGAGTTTTCAATAAAATGTTAGAATTGGGAGAGAAACCAGATCATGTCACAATGATTGGTGTTCTTTGCGCTTGTAGTCATGCTGGGCTGCTTGATGAAGGTCGTCATTACTTTCGATCAATGAGTGCACAACATGGTTTGGTGCCATTAAAGGACCATTATACATGTATGGTTGATTTACTTGGCCGAGCTGGCTGCCTTGAAGAAGCAAAAAATCTAATAGAGGAAATGCCGATGCAGCCTGATGCTGTTGTCTGGGGATCCTTGCTTGCTGCTTGTAAAGTCCATCGGAACATCGAATTGGGGGAATATGTAGTCGAGAAGCTTTTAGAGGTAGATCCCGAGAATTCTGGGCCATATGTTCTTCTCTCGAATATGTATGCTGAACGTGGAAATTGGGGGAATGTTGTGAGAATAAGAAAGCTGATGAGAAAGAGAGGAGTGATTAAACAACCAGGTTGCAGTTGGATTGAAATTCAAGGTCAGTTGAATGTTTTTATGGTTAAGGATAAAAGGCATTCAAAGAAGAAAGAAATCTACATGCTTTTGAGAACACTTTTACAACAGAT
GAAACGAGCTGGATATGTCCCATATGTTGGCAGCAATGAGATTGATGAAGAACAGCAGGAAGAACACGACATATCCTCATCTTACCAAATTAAAATGCCAGATACAGCCGCACGCCGGCACGCCCCCACCGGCCCACCCCATTATCTTTCTTCTCTCCCAATCTCCCTCTCTGCTGAAGCCGCACGCCGGCTCGCCAACCCCCGTGTTCTGCCACAGCCGCCCGACCCCCACCGCTTTCTTCTGCAGCGGCTCGTCGTCGAGGTCCACGCTCAGTCTTCCGCTGTCCCTACCATCTCTCAGTCGCTCGCTGTCGAGGTCCGCACGCTCTCTCTCTCAGTGGAACTGCAGCGTCTCTTCCGGCATTCTTGA

Coding sequence (CDS)

ATGGCAAGGAATGGATCGATTAAGCATCTCATGGGTGACCTTTTATTTCTTGATTCATCACCTTTTGCCAAGCTCTTGAATCGGTGTGCTCGCTCGACGTCAGCTAGAGACACGAGTTGTGTACATGCTTGCATAATTAAATCACCCTTTGCGTCTGAAATTTTTATCCAAAATAGGCTCATTGATGTATATGGTAAATGTGGATGTGTGGATGTTGCTCGCAAGTTGTTTGATAGATTGCTTGAGAGAAATATTTTCTCTTGGAACTCCATCACTTGTGCATTCACTAAGTCCGGATTTCTTGATGACGCTGTCCACATCTTTGAGAAGATGCCTGAAGTTGACCAGTGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACACAATCGCTTTGATGAAGCTTTAAATTATTTTGCTCAAATGCATAGTCATGGTTTTTTGATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATCTAAAATTGGGTTCCCAAATCCACAGTTTAATATATCGGTCAAATTATTTATCAGATGTGTATATAGGCTCTGCCCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGACTGTGCTTATAGTGTTTTTTATGGAATGACTGCAAGAAGTAGAGTTTCCTGGAACAGCTTGATTACGTGTTATGAACAGAATGGTCCAGTTGATGAGGCTCTTGTTATTTTTGTTGAGATGATCAAATGCAGGGTTGAACCTGATGAGGTAACGCTTGCTAGTGTGGTTAGTGCATGTGCAACTATCTCGGCGATCAAAGAAGGTCGGCAGATTCATGCTCGTGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCATTGGTTGATATGTATGCTAAATGTAAAAGGATCAACGAGGCTAGAATGGTTTTCGATCGGATGCCAATTAGGAATGTGGTGTCTGAAACCTCAATGGTAAGTGGGTATGCGAAAGCATCAAGTGTTAAAGCTGCAAGATATATGTTTTCAAATATGATGGTGAAAGACGTAATTACATGGAATGCACTTATTTCAGGGTGTACACAGAATGGAGAGAATGAAGAGGCACTTATACTCTTCTGTGATTTGAAGAGGGAGTCTGTTTGGCCTACACACTACACCTTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCAACAGGCTCACTCTCATGTTATAAAGCATGGATTTCGATTCCAATATGGAGAAGAGTCGGATATTTTTGTTGGGAATTCTCTGATAGATATGTATATGAAATGTGGATCAGTTGAGAATGGTTGTCAGGTATTTGAACATATGGTGGAAAGGGATTGTGTCTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAATGAGGCCCTTGGAGTTTTCAATAAAATGTTAGAATTGGGAGAGAAACCAGATCATGTCACAATGATTGGTGTTCTTTGCGCTTGTAGTCATGCTGGGCTGCTTGATGAAGGTCGTCATTACTTTCGATCAATGAGTGCACAACATGGTTTGGTGCCATTAAAGGACCATTATACATGTATGGTTGATTTACTTGGCCGAGCTGGCTGCCTTGAAGAAGCAAAAAATCTAATAGAGGAAATGCCGATGCAGCCTGATGCTGTTGTCTGGGGATCCTTGCTTGCTGCTTGTAAAGTCCATCGGAACATCGAATTGGGGGAATATGTAGTCGAGAAGCTTTTAGAGGTAGATCCCGAGAATTCTGGGCCATATGTTCTTCTCTCGAATATGTATGCTGAACGTGGAAATTGGGGGAATGTTGTGAGAATAAGAAAGCTGATGAGAAAGAGAGGAGTGATTAAACAACCAGGTTGCAGTTGGATTGAAATTCAAGGTCAGTTGAATGTTTTTATGGTTAAGGATAAAAGGCATTCAAAGAAGAAAGAAATCTACATGCTTTTGAGAACACTTTTACAACAGAT
GAAACGAGCTGGATATGTCCCATATGTTGGCAGCAATGAGATTGATGAAGAACAGCAGGAAGAACACGACATATCCTCATCTTACCAAATTAAAATGCCAGATACAGCCGCACGCCGGCACGCCCCCACCGGCCCACCCCATTATCTTTCTTCTCTCCCAATCTCCCTCTCTGCTGAAGCCGCACGCCGGCTCGCCAACCCCCGTGTTCTGCCACAGCCGCCCGACCCCCACCGCTTTCTTCTGCAGCGGCTCGTCGTCGAGGTCCACGCTCAGTCTTCCGCTGTCCCTACCATCTCTCAGTCGCTCGCTGTCGAGGTCCGCACGCTCTCTCTCTCAGTGGAACTGCAGCGTCTCTTCCGGCATTCTTGA

Protein sequence

MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIFVEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGCTQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQMKRAGYVPYVGSNEIDEEQQEEHDISSSYQIKMPDTAARRHAPTGPPHYLSSLPISLSAEAARRLANPRVLPQPPDPHRFLLQRLVVEVHAQSSAVPTISQSLAVEVRTLSLSVELQRLFRHS
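
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies to this gene; `translate` is a hypothetical helper written for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS shown above.
print(translate("ATGGCAAGGAATGGATCG"))  # MARNGS
# Last three codons of the CDS shown above, ending in the TGA stop codon.
print(translate("CATTCTTGA"))  # HS
```

The two outputs agree with the start (MARNGS) and the end (HS) of the listed protein sequence.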
Homology
BLAST of Sgr029763 vs. NCBI nr
Match: XP_022159229.1 (pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia])

HSP 1 Score: 1290.8 bits (3339), Expect = 0.0e+00
Identity = 628/690 (91.01%), Positives = 655/690 (94.93%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MARNG IKHL  D LFLDSS FAKLLN+C  S SARDTSCVHACIIK PFASE FIQNRL
Sbjct: 1   MARNGLIKHLTSDFLFLDSSYFAKLLNQCTSSKSARDTSCVHACIIKLPFASETFIQNRL 60

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARKLFD LLERNIFSWNSI CA+TK GFLDDAV IFE+MPEVDQCSWN
Sbjct: 61  IDVYGKCGCVDVARKLFDGLLERNIFSWNSIICAYTKFGFLDDAVDIFERMPEVDQCSWN 120

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEALNYFAQMHSHGFLMNEYSFGSALSACAGL+D KLGSQIHSLIYRS
Sbjct: 121 SMISGFEQHDRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLRDFKLGSQIHSLIYRS 180

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSDVY+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEALVIF
Sbjct: 181 NYLSDVYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIF 240

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
            EMIKCRVE DEVTLASVVSACATISAI EG+QIHARVVKCDEFRNDLILGNALVDMYAK
Sbjct: 241 FEMIKCRVEADEVTLASVVSACATISAINEGQQIHARVVKCDEFRNDLILGNALVDMYAK 300

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RIN+AR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 301 CNRINKARIVFDRMPIRSVVSETSMVSGYAKASSVKAARCMFSNMMVKDVITWNALIAGC 360

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEALILF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRFQ 
Sbjct: 361 TQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQS 420

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           GEESDIFVGNSLIDMYMKCGSVENGC+VFEHMV+RDCVSWNAMIVGYAQNGFGNEALGVF
Sbjct: 421 GEESDIFVGNSLIDMYMKCGSVENGCKVFEHMVQRDCVSWNAMIVGYAQNGFGNEALGVF 480

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           +KMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYF+SMSAQHGLV LKDHYTCMVDLLGR
Sbjct: 481 SKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFQSMSAQHGLVSLKDHYTCMVDLLGR 540

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAK+LIEEMPMQPDAV+WGSLLAACKVHRNI+LGEYVVEKLLEVD E SGPYVLL
Sbjct: 541 AGCLEEAKDLIEEMPMQPDAVIWGSLLAACKVHRNIKLGEYVVEKLLEVDAETSGPYVLL 600

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMRKRGV+K PGCSWIEIQGQLNVFMVKDK+H KKKEIY LL
Sbjct: 601 SNMYAERGDWGNVVRIRKLMRKRGVVKHPGCSWIEIQGQLNVFMVKDKKHPKKKEIYTLL 660

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQQEEHD 691
           RTLL+QM+R GYVPYV SNEIDEE+ +EHD
Sbjct: 661 RTLLKQMRRVGYVPYVVSNEIDEEEMKEHD 690
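
The percentages on each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch that recomputes them from such a line; `hsp_percentages` is a hypothetical helper, and the pattern is an assumption based only on the summary lines shown on this page (it accepts either spelling of the positives label).

```python
import re

# Matches "Identity = a/b (...), Positives = c/d (...)"; the optional "i"
# also accepts the misspelled label "Postives".
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(summary_line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_RE.search(summary_line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {summary_line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 628/690 (91.01%), Positives = 655/690 (94.93%), Query Frame = 0"
print(hsp_percentages(line))  # (91.01, 94.93)
```

For the Momordica charantia hit above, 628 identical and 655 similar positions over 690 aligned columns reproduce the reported 91.01% and 94.93%.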

BLAST of Sgr029763 vs. NCBI nr
Match: XP_022923215.1 (pentatricopeptide repeat-containing protein At2g13600 [Cucurbita moschata])

HSP 1 Score: 1289.2 bits (3335), Expect = 0.0e+00
Identity = 619/685 (90.36%), Positives = 660/685 (96.35%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG I+ L GDLLFLDSSP +KLLN+CARS SARDTS VHACIIKSPFASE+FIQNRL
Sbjct: 3   MAGNGFIRRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRL 62

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARK+FDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 63  IDVYGKCGCVDVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 122

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEAL YF QMH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRS
Sbjct: 123 SMISGFEQHDRFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRS 182

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSD+Y+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 183 NYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIF 242

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACAT+SAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 243 VEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 302

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 303 CNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGC 362

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEAL LF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRF+Y
Sbjct: 363 TQNGENEEALALFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRY 422

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           G+ESDIFVGNSLIDMYMKCGSVENGC+VFEHM+ERDCVSWNAMIVGYAQNGFGN+ALG+F
Sbjct: 423 GDESDIFVGNSLIDMYMKCGSVENGCRVFEHMLERDCVSWNAMIVGYAQNGFGNKALGIF 482

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           ++MLE GEKPDHVTMIGVL ACSHAGLL+EGRHYFRSM A+HGLVPLKDHYTCMVDLLGR
Sbjct: 483 SEMLESGEKPDHVTMIGVLSACSHAGLLNEGRHYFRSMRARHGLVPLKDHYTCMVDLLGR 542

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAKNLIEEMPMQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 543 AGCLEEAKNLIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLL 602

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH++K+EIYMLL
Sbjct: 603 SNMYAERGDWGNVVRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLL 662

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQ 686
           RTLLQQMKRAGYVPYVG++EIDEEQ
Sbjct: 663 RTLLQQMKRAGYVPYVGNDEIDEEQ 687

BLAST of Sgr029763 vs. NCBI nr
Match: KAG7015869.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1289.2 bits (3335), Expect = 0.0e+00
Identity = 618/685 (90.22%), Positives = 660/685 (96.35%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG ++ L GDLLFLDSSP +KLLN+CARS SARDTS VHACIIKSPFASE+FIQNRL
Sbjct: 1   MAGNGFVRRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRL 60

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARK+FDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 61  IDVYGKCGCVDVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 120

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEAL YF QMH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRS
Sbjct: 121 SMISGFEQHDRFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRS 180

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSD+Y+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 181 NYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIF 240

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACAT+SAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 241 VEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 300

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 301 CNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGC 360

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEAL LF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRF+Y
Sbjct: 361 TQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRY 420

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           G+ESDIFVGNSLIDMYMKCGSVENGC+VFEHM+ERDCVSWNAMIVGYAQNGFGN+ALG+F
Sbjct: 421 GDESDIFVGNSLIDMYMKCGSVENGCRVFEHMLERDCVSWNAMIVGYAQNGFGNKALGIF 480

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           ++MLE GEKPDHVTMIGVL ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGR
Sbjct: 481 SEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMHARHGLVPLKDHYTCMVDLLGR 540

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAKNLIEEMPMQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 541 AGCLEEAKNLIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLL 600

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH++K+EIYMLL
Sbjct: 601 SNMYAERGDWGNVVRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLL 660

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQ 686
           RTLLQQMKRAGYVP+VG++EIDEEQ
Sbjct: 661 RTLLQQMKRAGYVPFVGNDEIDEEQ 685

BLAST of Sgr029763 vs. NCBI nr
Match: KAG6577831.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1289.2 bits (3335), Expect = 0.0e+00
Identity = 618/685 (90.22%), Positives = 660/685 (96.35%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG ++ L GDLLFLDSSP +KLLN+CARS SARDTS VHACIIKSPFASE+FIQNRL
Sbjct: 3   MAGNGFVRRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRL 62

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARK+FDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 63  IDVYGKCGCVDVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 122

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEAL YF QMH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRS
Sbjct: 123 SMISGFEQHDRFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRS 182

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSD+Y+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 183 NYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIF 242

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACAT+SAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 243 VEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 302

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 303 CNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGC 362

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEAL LF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRF+Y
Sbjct: 363 TQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRY 422

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           G+ESDIFVGNSLIDMYMKCGSVENGC+VFEHM+ERDCVSWNAMIVGYAQNGFGN+ALG+F
Sbjct: 423 GDESDIFVGNSLIDMYMKCGSVENGCRVFEHMLERDCVSWNAMIVGYAQNGFGNKALGIF 482

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           ++MLE GEKPDHVTMIGVL ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGR
Sbjct: 483 SEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGR 542

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAKNLIEEMPMQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 543 AGCLEEAKNLIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLL 602

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH++K+EIYMLL
Sbjct: 603 SNMYAERGDWGNVVRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLL 662

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQ 686
           RTLLQQMKRAGYVP+VG++EIDEEQ
Sbjct: 663 RTLLQQMKRAGYVPFVGNDEIDEEQ 687

BLAST of Sgr029763 vs. NCBI nr
Match: XP_023552034.1 (pentatricopeptide repeat-containing protein At2g13600 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1281.5 bits (3315), Expect = 0.0e+00
Identity = 615/685 (89.78%), Positives = 656/685 (95.77%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG ++ L GDLLFLDSSP +KLLN+CARS SARDTS VHACIIKSPFASE+FIQNRL
Sbjct: 3   MAGNGFVRRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRL 62

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARK+FDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 63  IDVYGKCGCVDVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 122

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEAL YF QMH HGF MNEYSFGSALSACA LQDLK+GSQIHSLIYRS
Sbjct: 123 SMISGFEQHDRFDEALKYFVQMHGHGFFMNEYSFGSALSACAALQDLKMGSQIHSLIYRS 182

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSD+Y+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 183 NYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIF 242

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACAT+SAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 243 VEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 302

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 303 CNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGC 362

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEAL LF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRF+Y
Sbjct: 363 TQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRY 422

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           G+ESDIFVGNSLIDMYMKCGSVENGC+VFEHM+ERDCVSWNAMIVGYAQNGFGN+ LG+F
Sbjct: 423 GDESDIFVGNSLIDMYMKCGSVENGCRVFEHMLERDCVSWNAMIVGYAQNGFGNKVLGIF 482

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           ++MLE GEKPDHVTMIGVL ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGR
Sbjct: 483 SEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGR 542

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAKNLIEEMPMQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 543 AGCLEEAKNLIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLL 602

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMR+RGV+K PGCSWIEIQG+LNVFMVKDKRH++K+EIYMLL
Sbjct: 603 SNMYAERGDWGNVVRIRKLMRQRGVVKHPGCSWIEIQGELNVFMVKDKRHARKQEIYMLL 662

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQ 686
           RTLLQQMKRAGYVP VG++EIDEEQ
Sbjct: 663 RTLLQQMKRAGYVPCVGNDEIDEEQ 687

BLAST of Sgr029763 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 936.8 bits (2420), Expect = 1.6e-271
Identity = 453/680 (66.62%), Positives = 543/680 (79.85%), Query Frame = 0

Query: 16  FLDSSPFAKLLNRCARS-TSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKCGCVDVAR 75
           F DSSPFAKLL+ C +S  SA     VHA +IKS F++EIFIQNRLID Y KCG ++  R
Sbjct: 16  FTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGR 75

Query: 76  KLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFEQHNRFDE 135
           ++FD++ +RNI++WNS+    TK GFLD+A  +F  MPE DQC+WNSM+SGF QH+R +E
Sbjct: 76  QVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEE 135

Query: 136 ALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYIGSALVD 195
           AL YFA MH  GF++NEYSF S LSAC+GL D+  G Q+HSLI +S +LSDVYIGSALVD
Sbjct: 136 ALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVD 195

Query: 196 MYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIFVEMIKCRVEPDEVT 255
           MYSKCG V+ A  VF  M  R+ VSWNSLITC+EQNGP  EAL +F  M++ RVEPDEVT
Sbjct: 196 MYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVT 255

Query: 256 LASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEARMVFDRM 315
           LASV+SACA++SAIK G+++H RVVK D+ RND+IL NA VDMYAKC RI EAR +FD M
Sbjct: 256 LASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSM 315

Query: 316 PIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGCTQNGENEEALILFC 375
           PIRNV++ETSM+SGYA A+S KAAR MF+ M  ++V++WNALI+G TQNGENEEAL LFC
Sbjct: 316 PIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFC 375

Query: 376 DLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIFVGNSLID 435
            LKRESV PTHY+F N+L ACA+LA+L LG QAH HV+KHGF+FQ GEE DIFVGNSLID
Sbjct: 376 LLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLID 435

Query: 436 MYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELGEKPDHVT 495
           MY+KCG VE G  VF  M+ERDCVSWNAMI+G+AQNG+GNEAL +F +MLE GEKPDH+T
Sbjct: 436 MYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHIT 495

Query: 496 MIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKNLIEEM 555
           MIGVL AC HAG ++EGRHYF SM+   G+ PL+DHYTCMVDLLGRAG LEEAK++IEEM
Sbjct: 496 MIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEM 555

Query: 556 PMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGNWGNVV 615
           PMQPD+V+WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVLLSNMYAE G W +V+
Sbjct: 556 PMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVM 615

Query: 616 RIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQMKRAGYVP 675
            +RK MRK GV KQPGCSWI+IQG  +VFMVKDK H +KK+I+ LL  L+ +M+     P
Sbjct: 616 NVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR-----P 675

Query: 676 YVGSNEIDEEQQEEHDISSS 695
                EI     EE D SS+
Sbjct: 676 EQDHTEIGSLSSEEMDYSSN 690

BLAST of Sgr029763 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 2.0e-133
Identity = 251/702 (35.75%), Positives = 406/702 (57.83%), Query Frame = 0

Query: 25  LLNRCARSTSARDTS-CVHACIIKSPFASEIFIQNRLIDVYGKCGCVDVARKLFDRLLER 84
           LL +    ++ R T+  VH  +IKS     +++ N L++VY K G    ARKLFD +  R
Sbjct: 19  LLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMPLR 78

Query: 85  NIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFEQHNRFDEALNYFAQMH 144
             FSWN++  A++K G +D     F+++P+ D  SW +MI G++   ++ +A+     M 
Sbjct: 79  TAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMV 138

Query: 145 SHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYIGSALVDMYSKC---- 204
             G    +++  + L++ A  + ++ G ++HS I +     +V + ++L++MY+KC    
Sbjct: 139 KEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPM 198

Query: 205 ---------------------------GRVDCAYSVFYGMTARSRVSWNSLITCYEQNGP 264
                                      G++D A + F  M  R  V+WNS+I+ + Q G 
Sbjct: 199 MAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGY 258

Query: 265 VDEALVIFVEMIK-CRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILG 324
              AL IF +M++   + PD  TLASV+SACA +  +  G+QIH+ +V      + ++L 
Sbjct: 259 DLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVL- 318

Query: 325 NALVDMYAKCKRINEARMVFDRMPIRNVVSE--TSMVSGYAKASSVKAARYMFSNMMVKD 384
           NAL+ MY++C  +  AR + ++   +++  E  T+++ GY K   +  A+ +F ++  +D
Sbjct: 319 NALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRD 378

Query: 385 VITWNALISGCTQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHS 444
           V+ W A+I G  Q+G   EA+ LF  +      P  YT   +L+  ++LA L  G+Q H 
Sbjct: 379 VVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHG 438

Query: 445 HVIKHGFRFQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEHM-VERDCVSWNAMIVGYA 504
             +K       GE   + V N+LI MY K G++ +  + F+ +  ERD VSW +MI+  A
Sbjct: 439 SAVKS------GEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALA 498

Query: 505 QNGFGNEALGVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLK 564
           Q+G   EAL +F  ML  G +PDH+T +GV  AC+HAGL+++GR YF  M     ++P  
Sbjct: 499 QHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTL 558

Query: 565 DHYTCMVDLLGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLE 624
            HY CMVDL GRAG L+EA+  IE+MP++PD V WGSLL+AC+VH+NI+LG+   E+LL 
Sbjct: 559 SHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLL 618

Query: 625 VDPENSGPYVLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDK 684
           ++PENSG Y  L+N+Y+  G W    +IRK M+   V K+ G SWIE++ +++VF V+D 
Sbjct: 619 LEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDG 678

Query: 685 RHSKKKEIYMLLRTLLQQMKRAGYVPYVGS--NEIDEEQQEE 689
            H +K EIYM ++ +  ++K+ GYVP   S  ++++EE +E+
Sbjct: 679 THPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQ 713
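
Every hit on this page opens with the same two summary lines (bit score, raw score, E-value, then identity and positive counts), so they can be read mechanically. The following is a minimal Python sketch under that assumption, not a tool provided by the database; the name `parse_hsp_header` and the returned dictionary keys are hypothetical. The percentages are recomputed from the counts rather than copied from the text.

```python
import re

# Patterns for the two summary lines that precede each alignment.
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# "Posi?tives" tolerates a dropped "i" in the label.
STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_header(score_line, stats_line):
    """Extract the scores and counts from one hit's summary lines."""
    s = SCORE_RE.search(score_line)
    t = STATS_RE.search(stats_line)
    if s is None or t is None:
        raise ValueError("unrecognised HSP summary lines")
    identical, aligned, positive, _ = (int(x) for x in t.groups())
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positive": round(100 * positive / aligned, 2),
    }


hit = parse_hsp_header(
    "HSP 1 Score: 478.0 bits (1229), Expect = 2.0e-133",
    "Identity = 251/702 (35.75%), Positives = 406/702 (57.83%), Query Frame = 0",
)
print(hit["pct_identity"], hit["pct_positive"])  # 35.75 57.83
```

Recomputing the percentages gives a quick consistency check between the counts and the figures printed in parentheses.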

BLAST of Sgr029763 vs. ExPASy Swiss-Prot
Match: Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 2.7e-130
Identity = 251/632 (39.72%), Positives = 385/632 (60.92%), Query Frame = 0

Query: 58  NRLIDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQC 117
           N +I  Y + G  ++ARKLFD + ER++ SWN +   + ++  L  A  +FE MPE D C
Sbjct: 99  NGMISGYLRNGEFELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVC 158

Query: 118 SWNSMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLI 177
           SWN+M+SG+ Q+   D+A + F +M       N+ S+ + LSA   +Q+ K+  +   ++
Sbjct: 159 SWNTMLSGYAQNGCVDDARSVFDRMPE----KNDVSWNALLSAY--VQNSKM--EEACML 218

Query: 178 YRSNYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEAL 237
           ++S     +   + L+  + K  ++  A   F  M  R  VSWN++IT Y Q+G +DEA 
Sbjct: 219 FKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEAR 278

Query: 238 VIFVEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDM 297
            +F E        D  T  ++VS       ++E R++  ++ + +E     +  NA++  
Sbjct: 279 QLFDE----SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNE-----VSWNAMLAG 338

Query: 298 YAKCKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALI 357
           Y + +R+  A+ +FD MP RNV +  +M++GYA+   +  A+ +F  M  +D ++W A+I
Sbjct: 339 YVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMI 398

Query: 358 SGCTQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFR 417
           +G +Q+G + EAL LF  ++RE       +F + L+ CA++  L+LG+Q H  ++K G+ 
Sbjct: 399 AGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGY- 458

Query: 418 FQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEAL 477
                E+  FVGN+L+ MY KCGS+E    +F+ M  +D VSWN MI GY+++GFG  AL
Sbjct: 459 -----ETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVAL 518

Query: 478 GVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDL 537
             F  M   G KPD  TM+ VL ACSH GL+D+GR YF +M+  +G++P   HY CMVDL
Sbjct: 519 RFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDL 578

Query: 538 LGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPY 597
           LGRAG LE+A NL++ MP +PDA +WG+LL A +VH N EL E   +K+  ++PENSG Y
Sbjct: 579 LGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMY 638

Query: 598 VLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIY 657
           VLLSN+YA  G WG+V ++R  MR +GV K PG SWIEIQ + + F V D+ H +K EI+
Sbjct: 639 VLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIF 698

Query: 658 MLLRTLLQQMKRAGYV--PYVGSNEIDEEQQE 688
             L  L  +MK+AGYV    V  ++++EE++E
Sbjct: 699 AFLEELDLRMKKAGYVSKTSVVLHDVEEEEKE 707

BLAST of Sgr029763 vs. ExPASy Swiss-Prot
Match: Q9FRI5 (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 467.2 bits (1201), Expect = 3.6e-130
Identity = 262/703 (37.27%), Positives = 393/703 (55.90%), Query Frame = 0

Query: 13  DLLFLDSSPFAKLLNRC--ARSTSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKCGCV 72
           DL+   ++ +A  L  C   R TS +    VH  II   F     I NRLIDVY K   +
Sbjct: 6   DLVRAIANRYAANLRLCLPLRRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSEL 65

Query: 73  DVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEV--DQCSWNSMISGFEQ 132
           + AR+LFD + E +  +  ++   +  SG +  A  +FEK P    D   +N+MI+GF  
Sbjct: 66  NYARQLFDEISEPDKIARTTMVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSH 125

Query: 133 HNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGL-QDLKLGSQIHSLIYRSNYLSDVY 192
           +N    A+N F +M   GF  + ++F S L+  A +  D K   Q H+   +S       
Sbjct: 126 NNDGYSAINLFCKMKHEGFKPDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITS 185

Query: 193 IGSALVDMYSKCGR----VDCAYSVFYGMTARSRVSWNSLITCYEQNGPVD--------- 252
           + +ALV +YSKC      +  A  VF  +  +   SW +++T Y +NG  D         
Sbjct: 186 VSNALVSVYSKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGM 245

Query: 253 -----------------------EALVIFVEMIKCRVEPDEVTLASVVSACATISAIKEG 312
                                  EAL +   M+   +E DE T  SV+ ACAT   ++ G
Sbjct: 246 DDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLG 305

Query: 313 RQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEARMVFDRMPIRNVVSETSMVSGYAK 372
           +Q+HA V++ ++F       N+LV +Y KC + +EAR +F++MP +++VS  +++SGY  
Sbjct: 306 KQVHAYVLRREDF--SFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVS 365

Query: 373 ASSVKAARYMFSNMMVKDVITWNALISGCTQNGENEEALILFCDLKRESVWPTHYTFGNL 432
           +  +  A+ +F  M  K++++W  +ISG  +NG  EE L LF  +KRE   P  Y F   
Sbjct: 366 SGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSGA 425

Query: 433 LNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEH 492
           + +CA L     GQQ H+ ++K GF      +S +  GN+LI MY KCG VE   QVF  
Sbjct: 426 IKSCAVLGAYCNGQQYHAQLLKIGF------DSSLSAGNALITMYAKCGVVEEARQVFRT 485

Query: 493 MVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEG 552
           M   D VSWNA+I    Q+G G EA+ V+ +ML+ G +PD +T++ VL ACSHAGL+D+G
Sbjct: 486 MPCLDSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQG 545

Query: 553 RHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACK 612
           R YF SM   + + P  DHY  ++DLL R+G   +A+++IE +P +P A +W +LL+ C+
Sbjct: 546 RKYFDSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCR 605

Query: 613 VHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGC 672
           VH N+ELG    +KL  + PE+ G Y+LLSNM+A  G W  V R+RKLMR RGV K+  C
Sbjct: 606 VHGNMELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVAC 665

Query: 673 SWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQMKRAGYVP 675
           SWIE++ Q++ F+V D  H + + +Y+ L+ L ++M+R GYVP
Sbjct: 666 SWIEMETQVHTFLVDDTSHPEAEAVYIYLQDLGKEMRRLGYVP 700

BLAST of Sgr029763 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 447.6 bits (1150), Expect = 2.9e-124
Identity = 248/682 (36.36%), Positives = 379/682 (55.57%), Query Frame = 0

Query: 8   KHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKC 67
           K +  D L  DS+  A L+  C+   +      +HA   K  FAS   I+  L+++Y KC
Sbjct: 378 KRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKC 437

Query: 68  GCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFE 127
             ++ A   F      N+  WN +  A+   G LDD  + F                   
Sbjct: 438 ADIETALDYFLETEVENVVLWNVMLVAY---GLLDDLRNSF------------------- 497

Query: 128 QHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVY 187
                      F QM     + N+Y++ S L  C  L DL+LG QIHS I ++N+  + Y
Sbjct: 498 ---------RIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAY 557

Query: 188 IGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIFVEMIKCR 247
           + S L+DMY+K G++D A+ +      +  VSW ++I  Y Q    D+AL  F +M+   
Sbjct: 558 VCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRG 617

Query: 248 VEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEA 307
           +  DEV L + VSACA + A+KEG+QIHA+      F +DL   NALV +Y++C +I E+
Sbjct: 618 IRSDEVGLTNAVSACAGLQALKEGQQIHAQAC-VSGFSSDLPFQNALVTLYSRCGKIEES 677

Query: 308 RMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGCTQNGENE 367
            + F++                                   D I WNAL+SG  Q+G NE
Sbjct: 678 YLAFEQTE-------------------------------AGDNIAWNALVSGFQQSGNNE 737

Query: 368 EALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIF 427
           EAL +F  + RE +   ++TFG+ + A +  A+++ G+Q H+ + K G+      +S+  
Sbjct: 738 EALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGY------DSETE 797

Query: 428 VGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELG 487
           V N+LI MY KCGS+ +  + F  +  ++ VSWNA+I  Y+++GFG+EAL  F++M+   
Sbjct: 798 VCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSN 857

Query: 488 EKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEA 547
            +P+HVT++GVL ACSH GL+D+G  YF SM++++GL P  +HY C+VD+L RAG L  A
Sbjct: 858 VRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRA 917

Query: 548 KNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAER 607
           K  I+EMP++PDA+VW +LL+AC VH+N+E+GE+    LLE++PE+S  YVLLSN+YA  
Sbjct: 918 KEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVS 977

Query: 608 GNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQM 667
             W      R+ M+++GV K+PG SWIE++  ++ F V D+ H    EI+   + L ++ 
Sbjct: 978 KKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRA 990

Query: 668 KRAGYVPYVGS--NEIDEEQQE 688
              GYV    S  NE+  EQ++
Sbjct: 1038 SEIGYVQDCFSLLNELQHEQKD 990

BLAST of Sgr029763 vs. ExPASy TrEMBL
Match: A0A6J1E1T7 (pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charantia OX=3673 GN=LOC111025648 PE=4 SV=1)

HSP 1 Score: 1290.8 bits (3339), Expect = 0.0e+00
Identity = 628/690 (91.01%), Positives = 655/690 (94.93%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MARNG IKHL  D LFLDSS FAKLLN+C  S SARDTSCVHACIIK PFASE FIQNRL
Sbjct: 1   MARNGLIKHLTSDFLFLDSSYFAKLLNQCTSSKSARDTSCVHACIIKLPFASETFIQNRL 60

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARKLFD LLERNIFSWNSI CA+TK GFLDDAV IFE+MPEVDQCSWN
Sbjct: 61  IDVYGKCGCVDVARKLFDGLLERNIFSWNSIICAYTKFGFLDDAVDIFERMPEVDQCSWN 120

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEALNYFAQMHSHGFLMNEYSFGSALSACAGL+D KLGSQIHSLIYRS
Sbjct: 121 SMISGFEQHDRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLRDFKLGSQIHSLIYRS 180

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSDVY+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEALVIF
Sbjct: 181 NYLSDVYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIF 240

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
            EMIKCRVE DEVTLASVVSACATISAI EG+QIHARVVKCDEFRNDLILGNALVDMYAK
Sbjct: 241 FEMIKCRVEADEVTLASVVSACATISAINEGQQIHARVVKCDEFRNDLILGNALVDMYAK 300

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RIN+AR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 301 CNRINKARIVFDRMPIRSVVSETSMVSGYAKASSVKAARCMFSNMMVKDVITWNALIAGC 360

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEALILF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRFQ 
Sbjct: 361 TQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQS 420

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           GEESDIFVGNSLIDMYMKCGSVENGC+VFEHMV+RDCVSWNAMIVGYAQNGFGNEALGVF
Sbjct: 421 GEESDIFVGNSLIDMYMKCGSVENGCKVFEHMVQRDCVSWNAMIVGYAQNGFGNEALGVF 480

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           +KMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYF+SMSAQHGLV LKDHYTCMVDLLGR
Sbjct: 481 SKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFQSMSAQHGLVSLKDHYTCMVDLLGR 540

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAK+LIEEMPMQPDAV+WGSLLAACKVHRNI+LGEYVVEKLLEVD E SGPYVLL
Sbjct: 541 AGCLEEAKDLIEEMPMQPDAVIWGSLLAACKVHRNIKLGEYVVEKLLEVDAETSGPYVLL 600

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMRKRGV+K PGCSWIEIQGQLNVFMVKDK+H KKKEIY LL
Sbjct: 601 SNMYAERGDWGNVVRIRKLMRKRGVVKHPGCSWIEIQGQLNVFMVKDKKHPKKKEIYTLL 660

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQQEEHD 691
           RTLL+QM+R GYVPYV SNEIDEE+ +EHD
Sbjct: 661 RTLLKQMRRVGYVPYVVSNEIDEEEMKEHD 690

BLAST of Sgr029763 vs. ExPASy TrEMBL
Match: A0A6J1EB52 (pentatricopeptide repeat-containing protein At2g13600 OS=Cucurbita moschata OX=3662 GN=LOC111430973 PE=4 SV=1)

HSP 1 Score: 1289.2 bits (3335), Expect = 0.0e+00
Identity = 619/685 (90.36%), Positives = 660/685 (96.35%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG I+ L GDLLFLDSSP +KLLN+CARS SARDTS VHACIIKSPFASE+FIQNRL
Sbjct: 3   MAGNGFIRRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRL 62

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARK+FDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 63  IDVYGKCGCVDVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 122

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+RFDEAL YF QMH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRS
Sbjct: 123 SMISGFEQHDRFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRS 182

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSD+Y+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 183 NYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIF 242

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACAT+SAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 243 VEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 302

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 303 CNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGC 362

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEAL LF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRF+Y
Sbjct: 363 TQNGENEEALALFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRY 422

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           G+ESDIFVGNSLIDMYMKCGSVENGC+VFEHM+ERDCVSWNAMIVGYAQNGFGN+ALG+F
Sbjct: 423 GDESDIFVGNSLIDMYMKCGSVENGCRVFEHMLERDCVSWNAMIVGYAQNGFGNKALGIF 482

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           ++MLE GEKPDHVTMIGVL ACSHAGLL+EGRHYFRSM A+HGLVPLKDHYTCMVDLLGR
Sbjct: 483 SEMLESGEKPDHVTMIGVLSACSHAGLLNEGRHYFRSMRARHGLVPLKDHYTCMVDLLGR 542

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAKNLIEEMPMQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 543 AGCLEEAKNLIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLL 602

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH++K+EIYMLL
Sbjct: 603 SNMYAERGDWGNVVRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLL 662

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQ 686
           RTLLQQMKRAGYVPYVG++EIDEEQ
Sbjct: 663 RTLLQQMKRAGYVPYVGNDEIDEEQ 687

BLAST of Sgr029763 vs. ExPASy TrEMBL
Match: A0A6J1HNH1 (pentatricopeptide repeat-containing protein At2g13600 OS=Cucurbita maxima OX=3661 GN=LOC111465245 PE=4 SV=1)

HSP 1 Score: 1279.2 bits (3309), Expect = 0.0e+00
Identity = 614/685 (89.64%), Positives = 657/685 (95.91%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG +K L GDLLFLDSSP +KLLN+CARS SARDTS VHACIIKSPFASE+FIQNRL
Sbjct: 3   MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRL 62

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCV VARK+FDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 63  IDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 122

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH+ FDEAL YF QMH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRS
Sbjct: 123 SMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRS 182

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSD+Y+GSALVDMYSKCGRVDCA SVF GMT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 183 NYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIF 242

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACAT+SAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 243 VEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 302

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR+VFDRMPIR+VVSETSMVSGYAKASSVKAAR MFSNMMVKDVITWNALI+GC
Sbjct: 303 CNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGC 362

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEAL LF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRF+Y
Sbjct: 363 TQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRY 422

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           G+ESDIFVGNSLIDMYMKCGSVE+GC+VFE M+ERDCVSWNAMIVGYAQNGFGN+ALG+F
Sbjct: 423 GDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIF 482

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           ++MLE GEKPDHVTMIGVL ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGR
Sbjct: 483 SEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGR 542

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AGCLEEAKN+IEEMPMQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 543 AGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLL 602

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAERG+WGNV+RIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH++K+EIYMLL
Sbjct: 603 SNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLL 662

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQ 686
           RTLLQQMKRAGYVPYVG++EIDEEQ
Sbjct: 663 RTLLQQMKRAGYVPYVGNDEIDEEQ 687

BLAST of Sgr029763 vs. ExPASy TrEMBL
Match: A0A0A0KJ63 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G003610 PE=4 SV=1)

HSP 1 Score: 1269.2 bits (3283), Expect = 0.0e+00
Identity = 610/687 (88.79%), Positives = 654/687 (95.20%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MA NG +KHL GDLLFLDSSPF+KLLN+CARS SARDTS VHACIIKSPFASE FIQNRL
Sbjct: 1   MAGNGLVKHLKGDLLFLDSSPFSKLLNQCARSRSARDTSRVHACIIKSPFASETFIQNRL 60

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARKLFDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMP+VDQCSWN
Sbjct: 61  IDVYGKCGCVDVARKLFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWN 120

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH RFDEAL YFAQMH HGFL+NEYSFGSALSACAGLQDLKLGSQIHSL+YRS
Sbjct: 121 SMISGFEQHGRFDEALVYFAQMHGHGFLVNEYSFGSALSACAGLQDLKLGSQIHSLVYRS 180

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSDVY+GSALVDMYSKCGRV+ A SVF  MT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 181 NYLSDVYMGSALVDMYSKCGRVEYAQSVFDEMTVRSRVSWNSLITCYEQNGPVDEALKIF 240

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMIKC VEPDEVTLASVVSACATISAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 241 VEMIKCGVEPDEVTLASVVSACATISAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 300

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR++FD MPIR+VVSETSMVSGYAKAS VK ARYMFSNMMVKDVITWNALI+GC
Sbjct: 301 CNRINEARIIFDMMPIRSVVSETSMVSGYAKASKVKVARYMFSNMMVKDVITWNALIAGC 360

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEALILF  LKRESVWPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRFQY
Sbjct: 361 TQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQY 420

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           GE+SD+FVGNSLIDMYMKCGSVENGC+VF+HM+E+DCVSWNAMIVGYAQNGFGN+AL VF
Sbjct: 421 GEDSDVFVGNSLIDMYMKCGSVENGCRVFQHMLEKDCVSWNAMIVGYAQNGFGNKALEVF 480

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
            KMLE GE PDHVTMIGVLCACSHAGLLDEGR+YFRSM+AQHGL+PLKDHYTCMVDLLGR
Sbjct: 481 CKMLESGEAPDHVTMIGVLCACSHAGLLDEGRYYFRSMTAQHGLMPLKDHYTCMVDLLGR 540

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AG LEEAKNLIEEM MQPDA+VWGSLLAACKVHRNI+LGEYVV+KLLEVDPENSGPYVLL
Sbjct: 541 AGYLEEAKNLIEEMSMQPDAIVWGSLLAACKVHRNIQLGEYVVKKLLEVDPENSGPYVLL 600

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAE  +W NVVR+RKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH++KKEIYM+L
Sbjct: 601 SNMYAENRDWKNVVRVRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKKEIYMVL 660

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQQE 688
           RT+LQQMK+AGYVPYVGSNE DE++++
Sbjct: 661 RTILQQMKQAGYVPYVGSNEFDEDEEQ 687

BLAST of Sgr029763 vs. ExPASy TrEMBL
Match: A0A5A7VCZ2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold76G00590 PE=4 SV=1)

HSP 1 Score: 1253.8 bits (3243), Expect = 0.0e+00
Identity = 605/686 (88.19%), Positives = 646/686 (94.17%), Query Frame = 0

Query: 1   MARNGSIKHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRL 60
           MARNG +KHL GD LFLDSSPF+KLLN+C RS SARDTS VHACIIKSPFASE FIQNRL
Sbjct: 1   MARNGLVKHLKGDFLFLDSSPFSKLLNQCVRSRSARDTSRVHACIIKSPFASETFIQNRL 60

Query: 61  IDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWN 120
           IDVYGKCGCVDVARKLFDR+LERNIFSWNSI CAFTKSGFLDDAVHIFEKMPEVDQCSWN
Sbjct: 61  IDVYGKCGCVDVARKLFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPEVDQCSWN 120

Query: 121 SMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRS 180
           SMISGFEQH RF EAL YFAQMH HGFL+NEYSFGSALSACAGLQDLKLGSQIHSL+YRS
Sbjct: 121 SMISGFEQHGRFYEALVYFAQMHGHGFLVNEYSFGSALSACAGLQDLKLGSQIHSLVYRS 180

Query: 181 NYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIF 240
           NYLSDVY+GSALVDMYSKCGRV+ A S F  MT RSRVSWNSLITCYEQNGPVDEAL IF
Sbjct: 181 NYLSDVYMGSALVDMYSKCGRVEYAQSAFDEMTVRSRVSWNSLITCYEQNGPVDEALKIF 240

Query: 241 VEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAK 300
           VEMI+C VEPDEVTLASVVSACATISAIKEG+QIHARVVKCDEFRNDLILGNAL+DMYAK
Sbjct: 241 VEMIECGVEPDEVTLASVVSACATISAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAK 300

Query: 301 CKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGC 360
           C RINEAR++FD MPIR+VVSETSMVSGYAKAS VK AR MFSNMMVKDVITWNALI+GC
Sbjct: 301 CNRINEARIIFDMMPIRSVVSETSMVSGYAKASKVKVARSMFSNMMVKDVITWNALIAGC 360

Query: 361 TQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQY 420
           TQNGENEEALILF  LKRES+WPTHYTFGNLLNACANLADLQLG+QAHSHV+KHGFRFQY
Sbjct: 361 TQNGENEEALILFRLLKRESIWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQY 420

Query: 421 GEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVF 480
           GE+SD+FVGNSLIDMYMKCGSVENGC+VF+HM+ERDCVSWNAMIVGYAQNGFGN+AL VF
Sbjct: 421 GEDSDVFVGNSLIDMYMKCGSVENGCRVFQHMLERDCVSWNAMIVGYAQNGFGNKALEVF 480

Query: 481 NKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGR 540
           +KMLE GE PDHVTMIGVL ACSHAGLLDEGR+YFRSM+AQHGL+PLKDHYTCMVDLLGR
Sbjct: 481 SKMLESGEGPDHVTMIGVLSACSHAGLLDEGRYYFRSMTAQHGLMPLKDHYTCMVDLLGR 540

Query: 541 AGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL 600
           AG LEEAKNLIEEM MQPDA+VWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Sbjct: 541 AGYLEEAKNLIEEMSMQPDAIVWGSLLAACKVHRNIQLGEYVVEKLLEVDPENSGPYVLL 600

Query: 601 SNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLL 660
           SNMYAE  +W NVVR+RKLMR+RGVIKQPGCSWIEIQG+LNVFMVKDKRH++KKEI M+L
Sbjct: 601 SNMYAENRDWKNVVRVRKLMRQRGVIKQPGCSWIEIQGELNVFMVKDKRHARKKEICMVL 660

Query: 661 RTLLQQMKRAGYVPYVGSNEIDEEQQ 687
           RT+L QMK+AGYVPY GSNE DE++Q
Sbjct: 661 RTILHQMKQAGYVPYAGSNEFDEDEQ 686

BLAST of Sgr029763 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 936.8 bits (2420), Expect = 1.1e-272
Identity = 453/680 (66.62%), Positives = 543/680 (79.85%), Query Frame = 0

Query: 16  FLDSSPFAKLLNRCARS-TSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKCGCVDVAR 75
           F DSSPFAKLL+ C +S  SA     VHA +IKS F++EIFIQNRLID Y KCG ++  R
Sbjct: 16  FTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGR 75

Query: 76  KLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFEQHNRFDE 135
           ++FD++ +RNI++WNS+    TK GFLD+A  +F  MPE DQC+WNSM+SGF QH+R +E
Sbjct: 76  QVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEE 135

Query: 136 ALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYIGSALVD 195
           AL YFA MH  GF++NEYSF S LSAC+GL D+  G Q+HSLI +S +LSDVYIGSALVD
Sbjct: 136 ALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVD 195

Query: 196 MYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIFVEMIKCRVEPDEVT 255
           MYSKCG V+ A  VF  M  R+ VSWNSLITC+EQNGP  EAL +F  M++ RVEPDEVT
Sbjct: 196 MYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVT 255

Query: 256 LASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEARMVFDRM 315
           LASV+SACA++SAIK G+++H RVVK D+ RND+IL NA VDMYAKC RI EAR +FD M
Sbjct: 256 LASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSM 315

Query: 316 PIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGCTQNGENEEALILFC 375
           PIRNV++ETSM+SGYA A+S KAAR MF+ M  ++V++WNALI+G TQNGENEEAL LFC
Sbjct: 316 PIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFC 375

Query: 376 DLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIFVGNSLID 435
            LKRESV PTHY+F N+L ACA+LA+L LG QAH HV+KHGF+FQ GEE DIFVGNSLID
Sbjct: 376 LLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLID 435

Query: 436 MYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELGEKPDHVT 495
           MY+KCG VE G  VF  M+ERDCVSWNAMI+G+AQNG+GNEAL +F +MLE GEKPDH+T
Sbjct: 436 MYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHIT 495

Query: 496 MIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKNLIEEM 555
           MIGVL AC HAG ++EGRHYF SM+   G+ PL+DHYTCMVDLLGRAG LEEAK++IEEM
Sbjct: 496 MIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEM 555

Query: 556 PMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGNWGNVV 615
           PMQPD+V+WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVLLSNMYAE G W +V+
Sbjct: 556 PMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVM 615

Query: 616 RIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQMKRAGYVP 675
            +RK MRK GV KQPGCSWI+IQG  +VFMVKDK H +KK+I+ LL  L+ +M+     P
Sbjct: 616 NVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR-----P 675

Query: 676 YVGSNEIDEEQQEEHDISSS 695
                EI     EE D SS+
Sbjct: 676 EQDHTEIGSLSSEEMDYSSN 690

BLAST of Sgr029763 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 478.0 bits (1229), Expect = 1.4e-134
Identity = 251/702 (35.75%), Positives = 406/702 (57.83%), Query Frame = 0

Query: 25  LLNRCARSTSARDTS-CVHACIIKSPFASEIFIQNRLIDVYGKCGCVDVARKLFDRLLER 84
           LL +    ++ R T+  VH  +IKS     +++ N L++VY K G    ARKLFD +  R
Sbjct: 19  LLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMPLR 78

Query: 85  NIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFEQHNRFDEALNYFAQMH 144
             FSWN++  A++K G +D     F+++P+ D  SW +MI G++   ++ +A+     M 
Sbjct: 79  TAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMV 138

Query: 145 SHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYIGSALVDMYSKC---- 204
             G    +++  + L++ A  + ++ G ++HS I +     +V + ++L++MY+KC    
Sbjct: 139 KEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPM 198

Query: 205 ---------------------------GRVDCAYSVFYGMTARSRVSWNSLITCYEQNGP 264
                                      G++D A + F  M  R  V+WNS+I+ + Q G 
Sbjct: 199 MAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQRGY 258

Query: 265 VDEALVIFVEMIK-CRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILG 324
              AL IF +M++   + PD  TLASV+SACA +  +  G+QIH+ +V      + ++L 
Sbjct: 259 DLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIVL- 318

Query: 325 NALVDMYAKCKRINEARMVFDRMPIRNVVSE--TSMVSGYAKASSVKAARYMFSNMMVKD 384
           NAL+ MY++C  +  AR + ++   +++  E  T+++ GY K   +  A+ +F ++  +D
Sbjct: 319 NALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRD 378

Query: 385 VITWNALISGCTQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHS 444
           V+ W A+I G  Q+G   EA+ LF  +      P  YT   +L+  ++LA L  G+Q H 
Sbjct: 379 VVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHG 438

Query: 445 HVIKHGFRFQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEHM-VERDCVSWNAMIVGYA 504
             +K       GE   + V N+LI MY K G++ +  + F+ +  ERD VSW +MI+  A
Sbjct: 439 SAVKS------GEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIALA 498

Query: 505 QNGFGNEALGVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLK 564
           Q+G   EAL +F  ML  G +PDH+T +GV  AC+HAGL+++GR YF  M     ++P  
Sbjct: 499 QHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIPTL 558

Query: 565 DHYTCMVDLLGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLE 624
            HY CMVDL GRAG L+EA+  IE+MP++PD V WGSLL+AC+VH+NI+LG+   E+LL 
Sbjct: 559 SHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERLLL 618

Query: 625 VDPENSGPYVLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDK 684
           ++PENSG Y  L+N+Y+  G W    +IRK M+   V K+ G SWIE++ +++VF V+D 
Sbjct: 619 LEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVEDG 678

Query: 685 RHSKKKEIYMLLRTLLQQMKRAGYVPYVGS--NEIDEEQQEE 689
            H +K EIYM ++ +  ++K+ GYVP   S  ++++EE +E+
Sbjct: 679 THPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQ 713

BLAST of Sgr029763 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 467.6 bits (1202), Expect = 2.0e-131
Identity = 251/632 (39.72%), Positives = 385/632 (60.92%), Query Frame = 0

Query: 58  NRLIDVYGKCGCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQC 117
           N +I  Y + G  ++ARKLFD + ER++ SWN +   + ++  L  A  +FE MPE D C
Sbjct: 99  NGMISGYLRNGEFELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVC 158

Query: 118 SWNSMISGFEQHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLI 177
           SWN+M+SG+ Q+   D+A + F +M       N+ S+ + LSA   +Q+ K+  +   ++
Sbjct: 159 SWNTMLSGYAQNGCVDDARSVFDRMPE----KNDVSWNALLSAY--VQNSKM--EEACML 218

Query: 178 YRSNYLSDVYIGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEAL 237
           ++S     +   + L+  + K  ++  A   F  M  R  VSWN++IT Y Q+G +DEA 
Sbjct: 219 FKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEAR 278

Query: 238 VIFVEMIKCRVEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDM 297
            +F E        D  T  ++VS       ++E R++  ++ + +E     +  NA++  
Sbjct: 279 QLFDE----SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNE-----VSWNAMLAG 338

Query: 298 YAKCKRINEARMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALI 357
           Y + +R+  A+ +FD MP RNV +  +M++GYA+   +  A+ +F  M  +D ++W A+I
Sbjct: 339 YVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMI 398

Query: 358 SGCTQNGENEEALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFR 417
           +G +Q+G + EAL LF  ++RE       +F + L+ CA++  L+LG+Q H  ++K G+ 
Sbjct: 399 AGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGY- 458

Query: 418 FQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEAL 477
                E+  FVGN+L+ MY KCGS+E    +F+ M  +D VSWN MI GY+++GFG  AL
Sbjct: 459 -----ETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVAL 518

Query: 478 GVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDL 537
             F  M   G KPD  TM+ VL ACSH GL+D+GR YF +M+  +G++P   HY CMVDL
Sbjct: 519 RFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDL 578

Query: 538 LGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPY 597
           LGRAG LE+A NL++ MP +PDA +WG+LL A +VH N EL E   +K+  ++PENSG Y
Sbjct: 579 LGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMY 638

Query: 598 VLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIY 657
           VLLSN+YA  G WG+V ++R  MR +GV K PG SWIEIQ + + F V D+ H +K EI+
Sbjct: 639 VLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIF 698

Query: 658 MLLRTLLQQMKRAGYV--PYVGSNEIDEEQQE 688
             L  L  +MK+AGYV    V  ++++EE++E
Sbjct: 699 AFLEELDLRMKKAGYVSKTSVVLHDVEEEEKE 707

BLAST of Sgr029763 vs. TAIR 10
Match: AT1G25360.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 467.2 bits (1201), Expect = 2.6e-131
Identity = 262/703 (37.27%), Positives = 393/703 (55.90%), Query Frame = 0

Query: 13  DLLFLDSSPFAKLLNRC--ARSTSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKCGCV 72
           DL+   ++ +A  L  C   R TS +    VH  II   F     I NRLIDVY K   +
Sbjct: 6   DLVRAIANRYAANLRLCLPLRRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSEL 65

Query: 73  DVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEV--DQCSWNSMISGFEQ 132
           + AR+LFD + E +  +  ++   +  SG +  A  +FEK P    D   +N+MI+GF  
Sbjct: 66  NYARQLFDEISEPDKIARTTMVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSH 125

Query: 133 HNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGL-QDLKLGSQIHSLIYRSNYLSDVY 192
           +N    A+N F +M   GF  + ++F S L+  A +  D K   Q H+   +S       
Sbjct: 126 NNDGYSAINLFCKMKHEGFKPDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITS 185

Query: 193 IGSALVDMYSKCGR----VDCAYSVFYGMTARSRVSWNSLITCYEQNGPVD--------- 252
           + +ALV +YSKC      +  A  VF  +  +   SW +++T Y +NG  D         
Sbjct: 186 VSNALVSVYSKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGM 245

Query: 253 -----------------------EALVIFVEMIKCRVEPDEVTLASVVSACATISAIKEG 312
                                  EAL +   M+   +E DE T  SV+ ACAT   ++ G
Sbjct: 246 DDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLG 305

Query: 313 RQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEARMVFDRMPIRNVVSETSMVSGYAK 372
           +Q+HA V++ ++F       N+LV +Y KC + +EAR +F++MP +++VS  +++SGY  
Sbjct: 306 KQVHAYVLRREDF--SFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVS 365

Query: 373 ASSVKAARYMFSNMMVKDVITWNALISGCTQNGENEEALILFCDLKRESVWPTHYTFGNL 432
           +  +  A+ +F  M  K++++W  +ISG  +NG  EE L LF  +KRE   P  Y F   
Sbjct: 366 SGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSGA 425

Query: 433 LNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIFVGNSLIDMYMKCGSVENGCQVFEH 492
           + +CA L     GQQ H+ ++K GF      +S +  GN+LI MY KCG VE   QVF  
Sbjct: 426 IKSCAVLGAYCNGQQYHAQLLKIGF------DSSLSAGNALITMYAKCGVVEEARQVFRT 485

Query: 493 MVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELGEKPDHVTMIGVLCACSHAGLLDEG 552
           M   D VSWNA+I    Q+G G EA+ V+ +ML+ G +PD +T++ VL ACSHAGL+D+G
Sbjct: 486 MPCLDSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQG 545

Query: 553 RHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKNLIEEMPMQPDAVVWGSLLAACK 612
           R YF SM   + + P  DHY  ++DLL R+G   +A+++IE +P +P A +W +LL+ C+
Sbjct: 546 RKYFDSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCR 605

Query: 613 VHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGNWGNVVRIRKLMRKRGVIKQPGC 672
           VH N+ELG    +KL  + PE+ G Y+LLSNM+A  G W  V R+RKLMR RGV K+  C
Sbjct: 606 VHGNMELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVAC 665

Query: 673 SWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQMKRAGYVP 675
           SWIE++ Q++ F+V D  H + + +Y+ L+ L ++M+R GYVP
Sbjct: 666 SWIEMETQVHTFLVDDTSHPEAEAVYIYLQDLGKEMRRLGYVP 700

BLAST of Sgr029763 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 447.6 bits (1150), Expect = 2.1e-125
Identity = 248/682 (36.36%), Positives = 379/682 (55.57%), Query Frame = 0

Query: 8   KHLMGDLLFLDSSPFAKLLNRCARSTSARDTSCVHACIIKSPFASEIFIQNRLIDVYGKC 67
           K +  D L  DS+  A L+  C+   +      +HA   K  FAS   I+  L+++Y KC
Sbjct: 378 KRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKC 437

Query: 68  GCVDVARKLFDRLLERNIFSWNSITCAFTKSGFLDDAVHIFEKMPEVDQCSWNSMISGFE 127
             ++ A   F      N+  WN +  A+   G LDD  + F                   
Sbjct: 438 ADIETALDYFLETEVENVVLWNVMLVAY---GLLDDLRNSF------------------- 497

Query: 128 QHNRFDEALNYFAQMHSHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVY 187
                      F QM     + N+Y++ S L  C  L DL+LG QIHS I ++N+  + Y
Sbjct: 498 ---------RIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAY 557

Query: 188 IGSALVDMYSKCGRVDCAYSVFYGMTARSRVSWNSLITCYEQNGPVDEALVIFVEMIKCR 247
           + S L+DMY+K G++D A+ +      +  VSW ++I  Y Q    D+AL  F +M+   
Sbjct: 558 VCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRG 617

Query: 248 VEPDEVTLASVVSACATISAIKEGRQIHARVVKCDEFRNDLILGNALVDMYAKCKRINEA 307
           +  DEV L + VSACA + A+KEG+QIHA+      F +DL   NALV +Y++C +I E+
Sbjct: 618 IRSDEVGLTNAVSACAGLQALKEGQQIHAQAC-VSGFSSDLPFQNALVTLYSRCGKIEES 677

Query: 308 RMVFDRMPIRNVVSETSMVSGYAKASSVKAARYMFSNMMVKDVITWNALISGCTQNGENE 367
            + F++                                   D I WNAL+SG  Q+G NE
Sbjct: 678 YLAFEQTE-------------------------------AGDNIAWNALVSGFQQSGNNE 737

Query: 368 EALILFCDLKRESVWPTHYTFGNLLNACANLADLQLGQQAHSHVIKHGFRFQYGEESDIF 427
           EAL +F  + RE +   ++TFG+ + A +  A+++ G+Q H+ + K G+      +S+  
Sbjct: 738 EALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGY------DSETE 797

Query: 428 VGNSLIDMYMKCGSVENGCQVFEHMVERDCVSWNAMIVGYAQNGFGNEALGVFNKMLELG 487
           V N+LI MY KCGS+ +  + F  +  ++ VSWNA+I  Y+++GFG+EAL  F++M+   
Sbjct: 798 VCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSN 857

Query: 488 EKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEA 547
            +P+HVT++GVL ACSH GL+D+G  YF SM++++GL P  +HY C+VD+L RAG L  A
Sbjct: 858 VRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRA 917

Query: 548 KNLIEEMPMQPDAVVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAER 607
           K  I+EMP++PDA+VW +LL+AC VH+N+E+GE+    LLE++PE+S  YVLLSN+YA  
Sbjct: 918 KEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVS 977

Query: 608 GNWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHSKKKEIYMLLRTLLQQM 667
             W      R+ M+++GV K+PG SWIE++  ++ F V D+ H    EI+   + L ++ 
Sbjct: 978 KKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRA 990

Query: 668 KRAGYVPYVGS--NEIDEEQQE 688
              GYV    S  NE+  EQ++
Sbjct: 1038 SEIGYVQDCFSLLNELQHEQKD 990
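
The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length. A minimal sketch that recomputes them follows; the helper name `percent` and the dictionary layout are illustrative assumptions, and the counts are copied from the three TAIR 10 HSP headers.

```python
def percent(matches: int, length: int) -> float:
    """Return matches / length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# (identical residues, positive residues, alignment length),
# copied from the HSP headers of the three TAIR 10 matches.
hsps = {
    "AT4G02750.1": (251, 385, 632),
    "AT1G25360.1": (262, 393, 703),
    "AT4G13650.1": (248, 379, 682),
}

for name, (identical, positive, length) in hsps.items():
    # Print the recomputed identity and positives percentages per match.
    print(name, percent(identical, length), percent(positive, length))
```

The recomputed identity values can be compared with the Identity column of the summary tables that follow.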

The following BLAST results are available for this feature:
Match Name        E-value    Identity  Description
XP_022159229.1    0.0e+00    91.01     pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia]
XP_022923215.1    0.0e+00    90.36     pentatricopeptide repeat-containing protein At2g13600 [Cucurbita moschata]
KAG7015869.1      0.0e+00    90.22     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG6577831.1      0.0e+00    90.22     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023552034.1    0.0e+00    89.78     pentatricopeptide repeat-containing protein At2g13600 [Cucurbita pepo subsp. pep...

Match Name        E-value    Identity  Description
Q9SIT7            1.6e-271   66.62     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SHZ8            2.0e-133   35.75     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9SY02            2.7e-130   39.72     Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
Q9FRI5            3.6e-130   37.27     Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX...
Q9SVP7            2.9e-124   36.36     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...

Match Name        E-value    Identity  Description
A0A6J1E1T7        0.0e+00    91.01     pentatricopeptide repeat-containing protein At2g13600-like OS=Momordica charanti...
A0A6J1EB52        0.0e+00    90.36     pentatricopeptide repeat-containing protein At2g13600 OS=Cucurbita moschata OX=3...
A0A6J1HNH1        0.0e+00    89.64     pentatricopeptide repeat-containing protein At2g13600 OS=Cucurbita maxima OX=366...
A0A0A0KJ63        0.0e+00    88.79     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G003610 PE=4 SV=1
A0A5A7VCZ2        0.0e+00    88.19     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name        E-value    Identity  Description
AT2G13600.1       1.1e-272   66.62     Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1       1.4e-134   35.75     pentatricopeptide (PPR) repeat-containing protein
AT4G02750.1       2.0e-131   39.72     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G25360.1       2.6e-131   37.27     Pentatricopeptide repeat (PPR) superfamily protein
AT4G13650.1       2.1e-125   36.36     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 418..521  e-value: 4.7E-24  score: 87.4
    coord: 522..701  e-value: 2.8E-13  score: 52.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 167..267  e-value: 4.1E-21  score: 77.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 14..166  e-value: 1.3E-25  score: 92.4
    coord: 268..417  e-value: 3.4E-25  score: 91.1
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 190..213  e-value: 0.18  score: 12.1
    coord: 58..85  e-value: 0.0012  score: 18.9
    coord: 86..113  e-value: 0.0013  score: 18.8
    coord: 292..317  e-value: 0.0062  score: 16.7
    coord: 530..555  e-value: 3.8E-4  score: 20.5
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 115..162  e-value: 3.4E-8  score: 33.6
    coord: 218..263  e-value: 9.3E-9  score: 35.4
    coord: 456..502  e-value: 3.2E-8  score: 33.7
    coord: 348..397  e-value: 6.4E-12  score: 45.5
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 118..150  e-value: 2.1E-7  score: 28.7
    coord: 531..554  e-value: 0.0011  score: 17.0
    coord: 458..491  e-value: 1.7E-7  score: 29.0
    coord: 58..85  e-value: 9.9E-4  score: 17.1
    coord: 323..349  e-value: 0.0026  score: 15.8
    coord: 218..252  e-value: 5.5E-7  score: 27.4
    coord: 430..458  e-value: 1.5E-4  score: 19.7
    coord: 292..318  e-value: 0.0013  score: 16.8
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 425..455  score: 8.692369
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 216..250  score: 11.213468
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 84..114  score: 9.13082
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 349..383  score: 11.41077
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 287..321  score: 9.054091
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 115..149  score: 11.103854
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 456..490  score: 11.8273
None  No IPR available  PANTHER  PTHR47926:SF125  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 118..231
None  No IPR available  PANTHER  PTHR47926:SF125  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 23..113
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 118..231
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 145..704
    coord: 96..144
    coord: 23..113
None  No IPR available  PANTHER  PTHR47926:SF125  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 96..144
    coord: 145..704
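
The five GENE3D 1.25.40.10 segments listed above appear to tile the protein without gaps. A minimal sketch that checks this follows, assuming the coordinates are 1-based and inclusive; the function names `is_contiguous` and `covered_span` are illustrative and not part of InterPro or any InterProScan tool.

```python
from typing import List, Tuple

Segment = Tuple[int, int]  # (start, end), assumed 1-based and inclusive

# GENE3D 1.25.40.10 coordinates copied from the InterPro table above.
gene3d_segments: List[Segment] = [
    (418, 521), (522, 701), (167, 267), (14, 166), (268, 417),
]


def is_contiguous(segments: List[Segment]) -> bool:
    """True if, once sorted, each segment starts right after the previous one ends."""
    ordered = sorted(segments)
    return all(nxt[0] == prev[1] + 1 for prev, nxt in zip(ordered, ordered[1:]))


def covered_span(segments: List[Segment]) -> Segment:
    """Return the smallest start and the largest end over all segments."""
    return min(s for s, _ in segments), max(e for _, e in segments)


print(is_contiguous(gene3d_segments), covered_span(gene3d_segments))
```

The same check can be applied to the PANTHER coordinates to see whether their segments overlap or leave gaps.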

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr029763.1  Sgr029763.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
biological_process GO:0008380 RNA splicing
cellular_component GO:0009507 chloroplast
molecular_function GO:0003729 mRNA binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding