Sgr016825.1 (mRNA) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr016825.1
Type: mRNA
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153010: 1458092 .. 1463870 (+)
Sequence length: 2166
RNA-Seq Expression: Sgr016825.1
Synteny: Sgr016825.1
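
The Location field can be compared with the sequence length by simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any database tool.

```python
import re

# Location string copied from the Overview above.
LOCATION = "tig00153010: 1458092 .. 1463870 (+)"


def parse_location(text):
    """Split 'contig: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location(LOCATION)
genomic_span = span_length(start, end)
mrna_length = 2166  # "Sequence length" from the Overview

# Difference between the genomic span and the spliced mRNA length.
print(contig, strand, genomic_span, genomic_span - mrna_length)
```

Under that assumption the genomic span is 5779 bp, 3613 bp longer than the 2166 bp mRNA, which is consistent with the gene sequence being listed "with intron".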
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTAAACAGGTAATTTTGACTCAAAAACCCCAAGCTTGCATTTTGCGCTCTTCTTGCTTGCTCAACCTCAATGGCTCAACCCCATCGTCGAAGCATTCCTCTGGAACGCTGAGCTGCCCAAATGTGAATCTTAGGATTCTAGCTACCATGGAGATCCCAGGCCTTTCCCGCAAGCTCCGTAATTTCTCGAACCCTTTCTCTATTTACTCTTTTGCTTCTCAATCTCCATATCCGTCATGCTCGCCTTCTTCCCATCCTAAAACCCTAGACCCCACCGATCCCGCTTCAAAAATTTTGTACCATTTTAGTCGATTTTATACAAGAACAGCATCCAATTCTGTTGCCTCTCCGATTAACGCTGTTGATTCTTCCTTAATCTGCAATACAACTTCGCAATCTGCACCTTCGGTATGTTTGTTTGAGAGATGTAATGGTGGAAGCTTGGATTTGAATCTTGGATTGCTTGGACCGCATTCCTATACAAGCTGCCATTCATTCTCGTCCTCTTCGTTTAATGGGCCGGAGCTGGGGACGACTTCGACGGAGAAGTCCCAATGTGGGACTGGTAATTTAGATGTGGCTAAAACGAATGCGACCTCGAATCAATTTTGGGAAATTATTAATATCATTCGGAGAAATGAGGAGGATTTGGAATCCAAGTTGGTCTCGTTGAATGTAAGTTTAACTAATGCTTTAGTTGCTCAGATTTTTCGGGTGCTGAATAATCAAAAGGTTTCTGCCTTTCGTTTTTTCAACTGGATTAGGGTTCAGTCGTGGAAATTTCCTTGTAACTCTGATATTTATAGTCTACTTATTGATAATTTTGGTAGATTAGATGATTATGAGGGGATGCTTCCTGTTCTGACTGAATTCCGGCGGAAAGGAATTGATCTAAACCACAAAGCATTTGGGTTTGTACTTGTTCAACTATCAAACGAAGCTTCAATTAAGATTTCTGTGGAAAAAGTGGTCAAGTTATTGAATGAAGTAGGAGGATCCTGTCGAATATCTGGGATCATGGCATTGATTGAGATGTTCTGCTCTCTTGGTTCTTTTGGAATGGCAAAGTTTGTAATCGAGATAAATGAGAGAAGGCCGTCTTTCTACAATATCATTGTCCGGGATCAATGTCGCCGATGTGATTTTGAAGGAGCTAGATGTACACTTAACGAGATGAGGCAAGTTGGTTGTAGCCCAGATGCGGGCATTCTCAATTATCTGCTCAGTAGTCTGTGCAAGAACAACAAATTTGATGAAGCTCAGAATATGTTTGAAGAAATGCTCGAACGAGATTGTCCTCTGAATTCATTGACATTTGAAGTCATTATCTGCCACTTGTGTGAAATTGGTAAGATTGAATCAGCACTCGGCTTTCTTGACATGATGGTGTCAAGGGGTCTTGATCCTCGCCTTTCAACACACGCTGCCTTTGTGAAAAGCTACTTCAATTCACGGAGATATGAGGAGGCATATCGGTACGCTGTTGATTCTAGCTCGAAACATGCCATGGCACAAAATGCAACATATAGCTTGCTTGCAAGTCTTCATGAAAAGAGAGGAAACTTAGTTGATGCTCAAAAAATTCTGTCTGAATTGATAGATGCAGGTCTCAGACCAAACTTTCCAGTGTATATGAGAGTTTTGAAGAAGCTTCAGCTTCAGGGCAAGGAAGAGTTGGTAAATGATTTGAAGGGCAAATTCTCTAATGTAAGTTTGCAATGTGGCATTCAAACTGGATAATGGCTTCCTTTTTTAATCTTTCCTTATAGAATAACAAAGTGTTATGGAAGTCTGGTGCCTGCCTTGGTTTGATTGTTTTGATTTTGTATGCCAACTTGGAGTGGAAAATAAATGAAGAGCTCAGTAGTTTGTTTTATTAAGGGCCTTTAAGCATGTTTCTGTAAAATGCTTTCATTCTTCTCTCTTTAGAAAATAGTTTTTGGACACTGTTTATAATTGTTAAAGACTATTTGCATAAAATTGACAGTTTTATTAAT
ATCATTATTGATTGTATTTTAGCTACTTTGGCAAAGTAACAAGGAAAGTAGTCTCTGTTATTTTTTTGGTTGTCATCGCTCTCTCTCTCTCTCTCTCTCTAAGTGCCTGAGTCCCCAGTTTCATACTGAATCTGTTGGAAATGCAACTGCTCTAAAAGTGTTCACAGTAATTTTTTCTCTACCAACTCTGACATGTTACACGCTGGGGAGACATTTGACTCGGTTTTAGTAAGTTTGGATATGTGTAGTATATAGTAGGGAAGCTTGAATTTGTATAGTTTATGTTTGAGTTGGATTAAAAAAAAATGAGTTATAGAAATTTGTGTTTATAAAATCAGTATTTATTTTCTGGATTTGTGATATGTAAAAGTAAGTATAATCTAAGAGGTAAGTTTTAAAAGTGTAGGTTTATATGGCTCATAGAATTAGCTAGAGGTGCACATTGGTGGGCGTTGGTTTTTCTGTGGCTCGAACTGACATTGAAAATTGACGATTTTTTTAGTTGCATGTGAAAACCGATCTGTTGGTCGGTTTAGGTCAAAATTGAAAATATTTTTTCTTTTTGTGTTCGGTCGGCCAGTCAATTTTATTTTATTTTGTTTATTATGACAATATAATGGATCCATACTTTTTGTGTGTAAAATGATGTCGTATTCCATTAGAGTTCACTACCTTTTGCACTACAAAATGACATTTTATAACAACTAATTTTTTTTTAAAGTCAATCGGTTGGTCAGTCAGACAATGCATTTTTGCCACTCCGATCGATCAATTAGTAAAATCAATTTTGATCGCACCCTGACTAACCAACATCGATTTAGTCCATTTTTCAAAATTTAATATGTACTATTGGAATTAGCAAACTTTCACCAATTAAAAAAAAGAGTTTTGAGAGAGAGATTGACACTGCAAATTTTGAGTTGTTCCAAAGAATAATGTTAATGTAAAAACAACCTTGAATTCATCAAACTGAAACGTGGGTTTTGCTTTTGGGGTGTGAGTTTGGACTTTTGAAGTCCATACACAACTTCATATTCATCATAAAATAATCCATCATTTTTCTTAAGTCTTAACCTGCATCGGCATGCTCTTTTTTTCAATAGAATTGGTAAATGGAAAGTCAAAGGTTTGACCTCTTCAAATCCCCCCTGCTTGTTGTTCTCACAGCACATGGAATGGGACCTCAAATCGAGACCTCATAAACAAGTTTTTTCTTTTTTCAGGGAAATGTGTATTTAGTAGTTTTTGGCAAAAAAAAAAAAAAATTAATTTTTACCCCTAAAGTTGGCAAAATGTATCATTTTTTACCATAAACTTTTAATTTAATCAAATTGGATCTGAGACTTAAATAAATGTTGAAATCTTTACCTTAAACTTGAATATGTGTTGTAATTTTTACCTTCTGATAGACTTTTGTTTGAAGAAGCGTTCAAAAAGTATATTTTAACGTACATTTAATGAATTATAAAACATATGTCTAGTACGAAATTGATAGAACTTCAGGGAAAGATGATACTAGAGCTAGATTTTTGTTGAAATTTGTGAGTTTTAAAAAATTTATTCAATAGTTGTGCAATAATGGAGCTTATTTTTGTGCAATGTTTGATTTTAGTTGAAATTAATGAATTTTGAGCGATTTTTCCTTTACTTTTTGTTAAATTAGTGGGTAAATCAACGGAAATTGAACAAAAGGTAAAAATTACAATATTTGTCTAAGTTTAGGATTCAATTTGATAAAATTAAAAGTTTCAAGTGAAAGTTGATACAAGTACCGTGGAAAAATTGTTTTTTTTCTCCTTAAAAAACTAAAATTAAAAAATATTGCTTTTTTTTACCATAATTTAAAATTATAATCAATATTAGTTGATATATTAATTAACCAATCAGATTAACTATATTCAAAACATTGATTAACTAATATATCCAATCTCAATTATTGATTTACTATTATACCTTCAGTCAATCAAATTTTACTTCGGTCATTTTTGCAACGGTCTAAT
TTTTTTTTTTAACCGTAGACATAAGCAAGTGTTGTAATTTTTATCTTCTGTTCAATTTTCGCTAATTTACCCATTAATTTTAACAAAAAAAAAAACATGGTAAAAATCTCTCAAAACTCTCTAATTTCAATGAAAATCAAGTCTTATACAAAATTAAATCTAATCAATGCACAAGATCGATTGTTGAATAAATCTTTCGAAACTTTCAAATTTCAACCAAAAATCAACTCTAATGTCTTTTTTCTTGGAAGTCTCATCAATTTCATACTAGACATATGTTTTTTAATTCTTTAAATGTACATGATAAGATTTTTTTTAATTATATTTAAAACGAAAGTCTATCAGAAGGTAAAAATTGCAGATACTTATTCAAGTTCAAAGTAAAGATTGCAACATATATTCAAGCCTAAGGTGCAATTTGATGAAATTGAATTTTATGATAAAAATTGACATACGGTGCTAAATTTAAGAATAAAAATTGATTTTCCTAATCAATTTTAGTTGATATACTAATTAACTAATCAAATTGACTATATTTAAATCTATGATTATTTCCTATATCAATCAGTACATCGTCCATCAATCAAATTTTACTTCGGTCATTTTTGCAACTGTCTAATTTTCTTTTTTTAATAATTTGAGGAAGACATACGAAGGTTCTCCCAAAAGTCTTCAGATCTTTGGCCCACTTTCAAGATTGGGCGGCTGAAACAGTTACACCATTTGAACCCCTCACTCGTCCCCTCCACCTACCCAACAGCACCCATTTACACAAATTTTTTCCTCAATTTCTAAACCCAAAATAATTTTAATCGCTAATAATACCACTCCTTAATTACAATCATAAAACCTTAATACTTCATAATATTAATTAATTACATTCACTTTTATTTTTTCGAGAATATAATATTACAAATATAAAGTAAATATACTAACTTTTTTTTTTGAGTTCAACAAGAGGTAAAGTAGATATTCAAACTTATAACATTTAGAAAGGTTACTAGGTATATTAACTAATTGAACTATGCTCTCTAAACATACTAACTTTCAAGCCTTATGAAAAAAAAGGTTACATTTTCATCTCAAACCTTGAAATGAAAAAGTCATGGTAAAAAATATATATTGTTTATTTTTCTTGAAAAAATTAAGAGTTGAGTGTAATATATTTAAAATTAGTTAATAATTTAGCCAAAGGGGGAAAAAAAGGGGCTAAATAATTAAAAGGAAATTTGAGTTGAGTTAAGAATCCCAAATGGATTCATGCTGCATGTCCTCCCACTCGTCAAACTGATCATATAAAACAATGGAGTCTTCTAAACTAAGCGCCTCCGCCATTTGCATCCACATCGTCGGAGAGTCCAACGGCGAGTCGTTGATTGCCTGAATTTGGCTCGGCGACAGCCCCACTGTAATCGGCGCCGCCCCTCCATTGGCGGAGGAGCATCCGCCTCCGGAGTCCGCCGGCCTCTTCAACCTCAATGCCGCGAGCTGTGCCGCCATCTGAATGTCCTCGGGGCTGGAACTCGCCGGCCTCGGCAAAGTATCCACCAACTCCGGGAAGTTGAGGCGGGCGTCGCGGCCTCGAAGGTGCAGGGCCGCGACGTCGTAGGCCGCGGCCGCCATCTCCGGCGCTTCGTAGCTTCCAAGCCATATTCTCGTCTTCTTCCCCGGCTCCCGGATTTCAGAGACCCATTTCCCCCATTTTCGCTTGCGGACGCCTCTGTAGGTCTGGGGAAGAGCGAGGATGGAGGTGGGTTCCATGGAAATCAATGGAGATGA

mRNA sequence

ATGATTAAACAGGTAATTTTGACTCAAAAACCCCAAGCTTGCATTTTGCGCTCTTCTTGCTTGCTCAACCTCAATGGCTCAACCCCATCGTCGAAGCATTCCTCTGGAACGCTGAGCTGCCCAAATGTGAATCTTAGGATTCTAGCTACCATGGAGATCCCAGGCCTTTCCCGCAAGCTCCGTAATTTCTCGAACCCTTTCTCTATTTACTCTTTTGCTTCTCAATCTCCATATCCGTCATGCTCGCCTTCTTCCCATCCTAAAACCCTAGACCCCACCGATCCCGCTTCAAAAATTTTGTACCATTTTAGTCGATTTTATACAAGAACAGCATCCAATTCTGTTGCCTCTCCGATTAACGCTGTTGATTCTTCCTTAATCTGCAATACAACTTCGCAATCTGCACCTTCGGTATGTTTGTTTGAGAGATGTAATGGTGGAAGCTTGGATTTGAATCTTGGATTGCTTGGACCGCATTCCTATACAAGCTGCCATTCATTCTCGTCCTCTTCGTTTAATGGGCCGGAGCTGGGGACGACTTCGACGGAGAAGTCCCAATGTGGGACTGGTAATTTAGATGTGGCTAAAACGAATGCGACCTCGAATCAATTTTGGGAAATTATTAATATCATTCGGAGAAATGAGGAGGATTTGGAATCCAAGTTGGTCTCGTTGAATGTAAGTTTAACTAATGCTTTAGTTGCTCAGATTTTTCGGGTGCTGAATAATCAAAAGGTTTCTGCCTTTCGTTTTTTCAACTGGATTAGGGTTCAGTCGTGGAAATTTCCTTGTAACTCTGATATTTATAGTCTACTTATTGATAATTTTGGTAGATTAGATGATTATGAGGGGATGCTTCCTGTTCTGACTGAATTCCGGCGGAAAGGAATTGATCTAAACCACAAAGCATTTGGGTTTGTACTTGTTCAACTATCAAACGAAGCTTCAATTAAGATTTCTGTGGAAAAAGTGGTCAAGTTATTGAATGAAGTAGGAGGATCCTGTCGAATATCTGGGATCATGGCATTGATTGAGATGTTCTGCTCTCTTGGTTCTTTTGGAATGGCAAAGTTTGTAATCGAGATAAATGAGAGAAGGCCGTCTTTCTACAATATCATTGTCCGGGATCAATGTCGCCGATGTGATTTTGAAGGAGCTAGATGTACACTTAACGAGATGAGGCAAGTTGGTTGTAGCCCAGATGCGGGCATTCTCAATTATCTGCTCAGTAGTCTGTGCAAGAACAACAAATTTGATGAAGCTCAGAATATGTTTGAAGAAATGCTCGAACGAGATTGTCCTCTGAATTCATTGACATTTGAAGTCATTATCTGCCACTTGTGTGAAATTGGTAAGATTGAATCAGCACTCGGCTTTCTTGACATGATGGTGTCAAGGGGTCTTGATCCTCGCCTTTCAACACACGCTGCCTTTGTGAAAAGCTACTTCAATTCACGGAGATATGAGGAGGCATATCGGTACGCTGTTGATTCTAGCTCGAAACATGCCATGGCACAAAATGCAACATATAGCTTGCTTGCAAGTCTTCATGAAAAGAGAGGAAACTTAGTTGATGCTCAAAAAATTCTGTCTGAATTGATAGATGCAGGTCTCAGACCAAACTTTCCAGTGTATATGAGAGTTTTGAAGAAGCTTCAGCTTCAGGGCAAGGAAGAGTTGGTAAATGATTTGAAGGGCAAATTCTCTAATCGCCTCCGCCATTTGCATCCACATCGTCGGAGAGTCCAACGGCGAGTCGTTGATTGCCTGAATTTGGCTCGGCGACAGCCCCACTGTAATCGGCGCCGCCCCTCCATTGGCGGAGGAGCATCCGCCTCCGGAGTCCGCCGGCCTCTTCAACCTCAATGCCGCGAGCTGTGCCGCCATCTGAATGTCCTCGGGGCTGGAACTCGCCGGCCTCGGCAAAGTATCCACCAACTCCGGGAAGTTGAGGCGGGCGTCGCGGCCTCGAAGGTGCAGGGCCGCGACGTCGTAGGCCG
CGGCCGCCATCTCCGGCGCTTCGTAGCTTCCAAGCCATATTCTCGTCTTCTTCCCCGGCTCCCGGATTTCAGAGACCCATTTCCCCCATTTTCGCTTGCGGACGCCTCTGTAGGTCTGGGGAAGAGCGAGGATGGAGGTGGGTTCCATGGAAATCAATGGAGATGA

Coding sequence (CDS)

ATGATTAAACAGGTAATTTTGACTCAAAAACCCCAAGCTTGCATTTTGCGCTCTTCTTGCTTGCTCAACCTCAATGGCTCAACCCCATCGTCGAAGCATTCCTCTGGAACGCTGAGCTGCCCAAATGTGAATCTTAGGATTCTAGCTACCATGGAGATCCCAGGCCTTTCCCGCAAGCTCCGTAATTTCTCGAACCCTTTCTCTATTTACTCTTTTGCTTCTCAATCTCCATATCCGTCATGCTCGCCTTCTTCCCATCCTAAAACCCTAGACCCCACCGATCCCGCTTCAAAAATTTTGTACCATTTTAGTCGATTTTATACAAGAACAGCATCCAATTCTGTTGCCTCTCCGATTAACGCTGTTGATTCTTCCTTAATCTGCAATACAACTTCGCAATCTGCACCTTCGGTATGTTTGTTTGAGAGATGTAATGGTGGAAGCTTGGATTTGAATCTTGGATTGCTTGGACCGCATTCCTATACAAGCTGCCATTCATTCTCGTCCTCTTCGTTTAATGGGCCGGAGCTGGGGACGACTTCGACGGAGAAGTCCCAATGTGGGACTGGTAATTTAGATGTGGCTAAAACGAATGCGACCTCGAATCAATTTTGGGAAATTATTAATATCATTCGGAGAAATGAGGAGGATTTGGAATCCAAGTTGGTCTCGTTGAATGTAAGTTTAACTAATGCTTTAGTTGCTCAGATTTTTCGGGTGCTGAATAATCAAAAGGTTTCTGCCTTTCGTTTTTTCAACTGGATTAGGGTTCAGTCGTGGAAATTTCCTTGTAACTCTGATATTTATAGTCTACTTATTGATAATTTTGGTAGATTAGATGATTATGAGGGGATGCTTCCTGTTCTGACTGAATTCCGGCGGAAAGGAATTGATCTAAACCACAAAGCATTTGGGTTTGTACTTGTTCAACTATCAAACGAAGCTTCAATTAAGATTTCTGTGGAAAAAGTGGTCAAGTTATTGAATGAAGTAGGAGGATCCTGTCGAATATCTGGGATCATGGCATTGATTGAGATGTTCTGCTCTCTTGGTTCTTTTGGAATGGCAAAGTTTGTAATCGAGATAAATGAGAGAAGGCCGTCTTTCTACAATATCATTGTCCGGGATCAATGTCGCCGATGTGATTTTGAAGGAGCTAGATGTACACTTAACGAGATGAGGCAAGTTGGTTGTAGCCCAGATGCGGGCATTCTCAATTATCTGCTCAGTAGTCTGTGCAAGAACAACAAATTTGATGAAGCTCAGAATATGTTTGAAGAAATGCTCGAACGAGATTGTCCTCTGAATTCATTGACATTTGAAGTCATTATCTGCCACTTGTGTGAAATTGGTAAGATTGAATCAGCACTCGGCTTTCTTGACATGATGGTGTCAAGGGGTCTTGATCCTCGCCTTTCAACACACGCTGCCTTTGTGAAAAGCTACTTCAATTCACGGAGATATGAGGAGGCATATCGGTACGCTGTTGATTCTAGCTCGAAACATGCCATGGCACAAAATGCAACATATAGCTTGCTTGCAAGTCTTCATGAAAAGAGAGGAAACTTAGTTGATGCTCAAAAAATTCTGTCTGAATTGATAGATGCAGGTCTCAGACCAAACTTTCCAGTGTATATGAGAGTTTTGAAGAAGCTTCAGCTTCAGGGCAAGGAAGAGTTGGTAAATGATTTGAAGGGCAAATTCTCTAATCGCCTCCGCCATTTGCATCCACATCGTCGGAGAGTCCAACGGCGAGTCGTTGATTGCCTGAATTTGGCTCGGCGACAGCCCCACTGTAATCGGCGCCGCCCCTCCATTGGCGGAGGAGCATCCGCCTCCGGAGTCCGCCGGCCTCTTCAACCTCAATGCCGCGAGCTGTGCCGCCATCTGAATGTCCTCGGGGCTGGAACTCGCCGGCCTCGGCAAAGTATCCACCAACTCCGGGAAGTTGAGGCGGGCGTCGCGGCCTCGAAGGTGCAGGGCCGCGACGTCGTAGGCCG
CGGCCGCCATCTCCGGCGCTTCGTAGCTTCCAAGCCATATTCTCGTCTTCTTCCCCGGCTCCCGGATTTCAGAGACCCATTTCCCCCATTTTCGCTTGCGGACGCCTCTGTAGGTCTGGGGAAGAGCGAGGATGGAGGTGGGTTCCATGGAAATCAATGGAGATGA

Protein sequence

MIKQVILTQKPQACILRSSCLLNLNGSTPSSKHSSGTLSCPNVNLRILATMEIPGLSRKLRNFSNPFSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYTRTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSCHSFSSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNVSLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMFCSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSNRLRHLHPHRRRVQRRVVDCLNLARRQPHCNRRRPSIGGGASASGVRRPLQPQCRELCRHLNVLGAGTRRPRQSIHQLREVEAGVAASKVQGRDVVGRGRHLRRFVASKPYSRLLPRLPDFRDPFPPFSLADASVGLGKSEDGGGFHGNQWR
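
The protein sequence above appears to be the CDS translated codon by codon. A minimal sketch of that translation follows, assuming the standard genetic code; `translate` is a hypothetical helper written for illustration and is not the pipeline that produced this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the polypeptide
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGATTAAACAGGTAATTTTGACT"))
print(translate("AATCAATGGAGATGA"))
```

The two fragments give `MIKQVILT` and `NQWR`, matching the start and end of the listed protein. Passing the full CDS, with its line break removed, should reproduce the whole sequence if the assumption holds.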
Homology
BLAST of Sgr016825.1 vs. NCBI nr
Match: XP_022141785.1 (pentatricopeptide repeat-containing protein At1g62930, chloroplastic-like [Momordica charantia])

HSP 1 Score: 846.7 bits (2186), Expect = 1.5e-241
Identity = 434/520 (83.46%), Positives = 465/520 (89.42%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNPFSIYSFASQSPYPSCSPSSHPKTL-DPTDPASKILYHFSRFYTR 110
           MEI GL RKLRNFSNP+ IYSF+SQSPYPSCS  SHP+TL  PTDP+S IL++FS FY+ 
Sbjct: 1   MEISGLPRKLRNFSNPYCIYSFSSQSPYPSCSTFSHPRTLATPTDPSSIILFNFSPFYSI 60

Query: 111 TASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSCHSFSS 170
           +AS+S  SPINA   SLICN  S SAPS+CLF RCNGG LDLNLGLL   SYT+C SF S
Sbjct: 61  SASDSAESPINAAGCSLICNAISHSAPSLCLFGRCNGGRLDLNLGLLQRRSYTTCRSFLS 120

Query: 171 SSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNVSL 230
           SSFN P    TSTEK QCGTGNLDV+K NA  NQFW+II IIRRNEEDLESKL SLN+SL
Sbjct: 121 SSFNQP----TSTEKPQCGTGNLDVSKPNARQNQFWDIIKIIRRNEEDLESKLNSLNLSL 180

Query: 231 TNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPVL 290
           TN LVAQIFRVLNN KVSAFRFFNWIRVQS KFP NSD+YSLLIDNFGRLDDYEGMLPVL
Sbjct: 181 TNVLVAQIFRVLNNDKVSAFRFFNWIRVQSCKFPGNSDVYSLLIDNFGRLDDYEGMLPVL 240

Query: 291 TEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMFCS 350
           TEFRRKGIDLNHKAF F+ VQLSNEASIKISVE+V+KLLNEVGGSCRISG+M+LIEMFCS
Sbjct: 241 TEFRRKGIDLNHKAFVFLHVQLSNEASIKISVERVIKLLNEVGGSCRISGVMSLIEMFCS 300

Query: 351 LGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLL 410
            GS+GMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTLNEMRQVGCSPD GILNYLL
Sbjct: 301 FGSYGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLNEMRQVGCSPDVGILNYLL 360

Query: 411 SSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLD 470
           S LCKN++FDEAQ+MFE ML++DCP NSLTFEVIICHLCEIGKIESAL FLDMMVSRGL+
Sbjct: 361 SCLCKNDRFDEAQSMFEAMLQQDCPPNSLTFEVIICHLCEIGKIESALSFLDMMVSRGLE 420

Query: 471 PRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKI 530
           PRLSTHAAFVKSYFNS+RYEEAYRYAVDSSSKHA AQNATYSLLA+LHEKRGNLVDAQKI
Sbjct: 421 PRLSTHAAFVKSYFNSQRYEEAYRYAVDSSSKHATAQNATYSLLATLHEKRGNLVDAQKI 480

Query: 531 LSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFS 570
           LSELIDAGLRPNFPVY RV KKLQLQGKE+L NDLKGKFS
Sbjct: 481 LSELIDAGLRPNFPVYTRVFKKLQLQGKEDLANDLKGKFS 516
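
The percentages in the HSP summary are the ratio of matching columns to alignment length. The sketch below recomputes them and counts identities in the last alignment block shown above. The helper names are hypothetical, and the reading of the two ratios as identical and similar columns over total columns is the usual BLAST convention, assumed here.

```python
def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


def count_identities(query, sbjct):
    """Count columns where both aligned strings carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Values from the HSP summary line: 434/520 and 465/520.
print(percent(434, 520), percent(465, 520))

# Last alignment block (query 531-570) copied from the report above.
last_query = "LSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFS"
last_sbjct = "LSELIDAGLRPNFPVYTRVFKKLQLQGKEDLANDLKGKFS"
print(count_identities(last_query, last_sbjct), len(last_query))
```

For this block, 36 of 40 columns are identical. Summing such counts over all blocks of the HSP should recover the 434 identities reported in the summary.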

BLAST of Sgr016825.1 vs. NCBI nr
Match: XP_022932243.1 (pentatricopeptide repeat-containing protein At1g05670, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 755.0 bits (1948), Expect = 6.0e-214
Identity = 398/523 (76.10%), Positives = 431/523 (82.41%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           M I  LSR LR+FS+P  +SIYSFA  SP+PS SPS H                F RFYT
Sbjct: 1   MGITSLSRNLRDFSHPYFYSIYSFAPHSPFPSSSPSKH-------------FGRFIRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSC-HSF 170
              S+S A       S L+CN+TSQS PS+CLFERCNG + DLNLGL  PH Y SC  SF
Sbjct: 61  APTSDSAAR------SPLLCNSTSQSVPSLCLFERCNGVTKDLNLGLFRPH-YRSCRRSF 120

Query: 171 SSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNV 230
           SS SF          EK Q GTG L+V K N TSNQFW+IINIIR N+EDLESKL SLNV
Sbjct: 121 SSDSF----------EKPQFGTGELNVFKPNVTSNQFWDIINIIRANQEDLESKLDSLNV 180

Query: 231 SLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLP 290
           S TNALVAQIFRVLNN KVSAFRFFNW+RVQS KFPCNSDIYSLLIDNFGRLDDYEG+LP
Sbjct: 181 SFTNALVAQIFRVLNNHKVSAFRFFNWVRVQSCKFPCNSDIYSLLIDNFGRLDDYEGILP 240

Query: 291 VLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMF 350
           VL EFR+KG+ LNHKAF F+ V LS+E SIKI VE++VKLLNEVGGSCRISG+MALIEMF
Sbjct: 241 VLNEFRQKGVGLNHKAFEFLHVHLSDEDSIKICVERLVKLLNEVGGSCRISGVMALIEMF 300

Query: 351 CSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNY 410
           CSLGSFGMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTL+EMRQ GCSPD GILNY
Sbjct: 301 CSLGSFGMAKFVIEITERRTSFYNIIVREQCRRNDFEGARCTLDEMRQAGCSPDVGILNY 360

Query: 411 LLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRG 470
           LLSSLCKN+K  EAQN+FEEMLERDCP NSLTFEVIICHLCEIG IESAL FLDMMVSRG
Sbjct: 361 LLSSLCKNDKLSEAQNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALNFLDMMVSRG 420

Query: 471 LDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQ 530
           L+PRLSTHAAFVKSYFNS+RYEEAYRY +DSS KH MAQNATYSLLA+LHEKRGNLVDAQ
Sbjct: 421 LEPRLSTHAAFVKSYFNSQRYEEAYRYTIDSSLKHGMAQNATYSLLATLHEKRGNLVDAQ 480

Query: 531 KILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           K+L ELIDAGLRPNFPVYMRVLKKLQ+QG+E+L NDLKGKFSN
Sbjct: 481 KVLCELIDAGLRPNFPVYMRVLKKLQVQGREDLANDLKGKFSN 493

BLAST of Sgr016825.1 vs. NCBI nr
Match: KAG7015541.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 753.4 bits (1944), Expect = 1.7e-213
Identity = 397/523 (75.91%), Positives = 430/523 (82.22%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           M I  LSR LR+FS+P  +SIYSFA  SP+PS SPS H                F RFYT
Sbjct: 1   MGITSLSRNLRDFSHPYFYSIYSFAPHSPFPSSSPSKH-------------FGRFIRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSC-HSF 170
              S+S A       S L+CN+TSQS PS+CLFERCNG + DLNLGL  PH Y SC  SF
Sbjct: 61  APTSDSAAR------SPLLCNSTSQSVPSLCLFERCNGVTKDLNLGLFRPH-YRSCRRSF 120

Query: 171 SSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNV 230
           SS SF          EK Q GTG L+V K N TSNQFW+IINIIR N+EDLESKL SLNV
Sbjct: 121 SSDSF----------EKPQFGTGELNVFKPNVTSNQFWDIINIIRANQEDLESKLDSLNV 180

Query: 231 SLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLP 290
           S TNALVAQIFRVLNN KVSAFRFFNW+RVQS KFPCNSDIYSLLIDNFGRLDDYEG+LP
Sbjct: 181 SFTNALVAQIFRVLNNHKVSAFRFFNWVRVQSCKFPCNSDIYSLLIDNFGRLDDYEGILP 240

Query: 291 VLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMF 350
           VL EFR+KG+ LNHKAF F+ V LS+E SIKI VE++VKLLNEVGGSCRISG+MALIEMF
Sbjct: 241 VLNEFRQKGVGLNHKAFEFLHVHLSDEDSIKICVERLVKLLNEVGGSCRISGVMALIEMF 300

Query: 351 CSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNY 410
           CSLGSFGMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTL+EMRQ GCSPD GILNY
Sbjct: 301 CSLGSFGMAKFVIEITERRTSFYNIIVREQCRRNDFEGARCTLDEMRQAGCSPDVGILNY 360

Query: 411 LLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRG 470
           LLSSLCKN+K  EAQN+FEEMLERDCP NSLTFEVIICH CEIG IESAL FLDMMVSRG
Sbjct: 361 LLSSLCKNDKLSEAQNLFEEMLERDCPPNSLTFEVIICHFCEIGNIESALNFLDMMVSRG 420

Query: 471 LDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQ 530
           L+PRLSTHAAFVKSYFNS+RYEEAYRY +DSS KH MAQNATYSLLA+LHEKRGNLVDAQ
Sbjct: 421 LEPRLSTHAAFVKSYFNSQRYEEAYRYTIDSSLKHGMAQNATYSLLATLHEKRGNLVDAQ 480

Query: 531 KILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           K+L ELIDAGLRPNFPVYMRVLKKLQ+QG+E+L NDLKGKFSN
Sbjct: 481 KVLCELIDAGLRPNFPVYMRVLKKLQVQGREDLANDLKGKFSN 493

BLAST of Sgr016825.1 vs. NCBI nr
Match: XP_023552923.1 (pentatricopeptide repeat-containing protein At1g09820-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 750.7 bits (1937), Expect = 1.1e-212
Identity = 396/523 (75.72%), Positives = 430/523 (82.22%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           M I  LSR LR+FS+P  +SIYSFA  SP+PS SPS H                F RFYT
Sbjct: 1   MGITSLSRNLRDFSHPYFYSIYSFAPPSPFPSSSPSKH-------------FGRFIRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSC-HSF 170
              S+S A       S L+CN+TSQS PS+CL ERCNG + DLNLGL  PH Y SC  SF
Sbjct: 61  APTSHSAAR------SPLLCNSTSQSVPSLCLLERCNGVAKDLNLGLFRPH-YRSCRRSF 120

Query: 171 SSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNV 230
           SS SF          EK Q GTG L+V K N TSNQFW+IINIIR N+EDLESKL SLNV
Sbjct: 121 SSDSF----------EKPQFGTGELNVFKPNVTSNQFWDIINIIRENQEDLESKLDSLNV 180

Query: 231 SLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLP 290
           S TNALVAQIFRVLNN KVSAFRFFNW+RVQS KFPCNSDIYSLLIDNFGRLDDYEG+LP
Sbjct: 181 SFTNALVAQIFRVLNNHKVSAFRFFNWVRVQSCKFPCNSDIYSLLIDNFGRLDDYEGILP 240

Query: 291 VLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMF 350
           VL EFR+KG+ LNHKAF F+ V LS+E SIKISVE++VKLLNEVGGSCRISG+MALIEMF
Sbjct: 241 VLNEFRQKGVGLNHKAFEFLHVHLSDEDSIKISVERLVKLLNEVGGSCRISGVMALIEMF 300

Query: 351 CSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNY 410
           CSLGSFGMAKFV EI E+R SFYNIIVR+QCRR DFEGARCTL+EMRQ GCSPD GILNY
Sbjct: 301 CSLGSFGMAKFVTEITEKRTSFYNIIVREQCRRNDFEGARCTLDEMRQAGCSPDVGILNY 360

Query: 411 LLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRG 470
           LLSSLCKN+KF EAQN+FEEMLERDCP NSLTFEVIICHLCEIG IESAL FLDMMVSRG
Sbjct: 361 LLSSLCKNDKFSEAQNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALNFLDMMVSRG 420

Query: 471 LDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQ 530
           L+PRLSTHAAFVKSYFNS+RYEEAY Y +DSS KH MAQNATYSLLA+LHEKRGNLVDAQ
Sbjct: 421 LEPRLSTHAAFVKSYFNSQRYEEAYWYTIDSSLKHGMAQNATYSLLATLHEKRGNLVDAQ 480

Query: 531 KILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           K+L ELIDAGLRPNFPVYMRVLKKLQ+QG+E+L NDLKGKFSN
Sbjct: 481 KVLCELIDAGLRPNFPVYMRVLKKLQVQGREDLANDLKGKFSN 493

BLAST of Sgr016825.1 vs. NCBI nr
Match: XP_023007595.1 (pentatricopeptide repeat-containing protein At1g09820-like [Cucurbita maxima])

HSP 1 Score: 748.8 bits (1932), Expect = 4.3e-212
Identity = 395/523 (75.53%), Positives = 429/523 (82.03%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           M I  LSR LR+FS+P  +SIYSFA  SP+PS SPS H                F RFYT
Sbjct: 1   MGITSLSRNLRDFSHPYFYSIYSFAPHSPFPSSSPSKH-------------FGRFIRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSC-HSF 170
              S+S A       S L+CN+T QS PS+CLFERCNG + DLNLGL  PH Y SC  SF
Sbjct: 61  APTSDSAAR------SPLLCNSTPQSVPSLCLFERCNGVTKDLNLGLFRPH-YRSCRRSF 120

Query: 171 SSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNV 230
           SS SF          EK Q GTG L+V   N TSNQFW+IINIIR N+E+LESKL SLNV
Sbjct: 121 SSDSF----------EKPQFGTGELNVFNPNVTSNQFWDIINIIRANQENLESKLDSLNV 180

Query: 231 SLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLP 290
           S TNALVAQIFRVLNN KVSAFRFFNW++VQS KFPCNSDIYSLLIDNFGRLDDYEG++P
Sbjct: 181 SFTNALVAQIFRVLNNHKVSAFRFFNWVKVQSCKFPCNSDIYSLLIDNFGRLDDYEGIIP 240

Query: 291 VLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMF 350
           VL EFR+KG+ LNHKAF F+ V LSN+ SIKISVE++VKLLNEVGGSCRISG+MALIEMF
Sbjct: 241 VLNEFRQKGVGLNHKAFEFLHVNLSNDDSIKISVERLVKLLNEVGGSCRISGVMALIEMF 300

Query: 351 CSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNY 410
           CSLGSFGMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTL+EMRQ GCSPD GILNY
Sbjct: 301 CSLGSFGMAKFVIEITERRTSFYNIIVREQCRRNDFEGARCTLDEMRQAGCSPDVGILNY 360

Query: 411 LLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRG 470
           LLSSLCKN+KF EA N+FEEMLERDCP NSLTFEVIICHLCEIG IESAL FLD MVSRG
Sbjct: 361 LLSSLCKNDKFSEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALNFLDTMVSRG 420

Query: 471 LDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQ 530
           L+PRLSTHAAFVKSYFNS+RYEEAYRY VDSS KH MAQNATYSLLA+LHEKRGNLVDAQ
Sbjct: 421 LEPRLSTHAAFVKSYFNSQRYEEAYRYTVDSSLKHGMAQNATYSLLATLHEKRGNLVDAQ 480

Query: 531 KILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           KIL ELIDAGLRPNFPVYMRVLKKLQ+QG+E+L NDLKGKFSN
Sbjct: 481 KILCELIDAGLRPNFPVYMRVLKKLQVQGREDLANDLKGKFSN 493

BLAST of Sgr016825.1 vs. ExPASy Swiss-Prot
Match: Q9SSR6 (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 1.1e-19
Identity = 92/378 (24.34%), Positives = 163/378 (43.12%), Query Frame = 0

Query: 202 NQFWEIINIIRRNEEDLESKLVSLNVSLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWK 261
           N+   +++  R  ++DLE  LV+ +  +++ LV Q+ +   N    A RFF W R +   
Sbjct: 39  NEISRVLSDHRNPKDDLEHTLVAYSPRVSSNLVEQVLKRCKNLGFPAHRFFLWAR-RIPD 98

Query: 262 FPCNSDIYSLLIDNFGRLDDYEGMLPVLTEFRRKG-IDLNHKAFGFVLVQLSNEASIKIS 321
           F  + + Y +L++  G    +  +   L E R     +++ K F +++ +  + A++   
Sbjct: 99  FAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFEISSKVF-WIVFRAYSRANLPSE 158

Query: 322 VEKVVKLLNEVGGSCRISGIMALIEMFC-------SLGSFGMAKFVIEINERRPSFYNII 381
             +    + E G    +  +  L+   C       +   FG AK    +   +   Y+I+
Sbjct: 159 ACRAFNRMVEFGIKPCVDDLDQLLHSLCDKKHVNHAQEFFGKAKGFGIVPSAKT--YSIL 218

Query: 382 VRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDC 441
           VR   R  D  GAR   +EM +  C  D    N LL +LCK+   D    MF+EM     
Sbjct: 219 VRGWARIRDASGARKVFDEMLERNCVVDLLAYNALLDALCKSGDVDGGYKMFQEMGNLGL 278

Query: 442 PLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYR 501
             ++ +F + I   C+ G + SA   LD M    L P + T    +K+   + + ++AY 
Sbjct: 279 KPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDLVPNVYTFNHIIKTLCKNEKVDDAYL 338

Query: 502 YAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQ 561
              +   K A     TY+ + + H     +  A K+LS +      P+   Y  VLK L 
Sbjct: 339 LLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATKLLSRMDRTKCLPDRHTYNMVLKLLI 398

Query: 562 LQGKEELVNDLKGKFSNR 572
             G+ +   ++    S R
Sbjct: 399 RIGRFDRATEIWEGMSER 412

BLAST of Sgr016825.1 vs. ExPASy Swiss-Prot
Match: Q9CA58 (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana OX=3702 GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 3.1e-19
Identity = 72/292 (24.66%), Positives = 131/292 (44.86%), Query Frame = 0

Query: 269 YSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLL 328
           Y  LID      +    L +  E   KGI  N   +  ++  LSN+  I +   ++   +
Sbjct: 359 YRSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMI-LEAAQLANEM 418

Query: 329 NEVGGSCRISGIMALIEMFCSLGSFGMAKFVIEINERRPSF-----YNIIVRDQCRRCDF 388
           +E G    +     L+   C +G    A  ++++   +  F     +NI++     +   
Sbjct: 419 SEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKM 478

Query: 389 EGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVI 448
           E A   L+ M   G  PD    N LL+ LCK +KF++    ++ M+E+ C  N  TF ++
Sbjct: 479 ENALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNIL 538

Query: 449 ICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNS----------RRYEEAYR 508
           +  LC   K++ ALG L+ M ++ ++P   T    +  +  +          R+ EEAY+
Sbjct: 539 LESLCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYK 598

Query: 509 YAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVY 546
             V SS+        TY+++     ++ N+  A+K+  E++D  L P+   Y
Sbjct: 599 --VSSST-------PTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLGPDGYTY 640

BLAST of Sgr016825.1 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 93.2 bits (230), Expect = 1.3e-17
Identity = 74/296 (25.00%), Positives = 135/296 (45.61%), Query Frame = 0

Query: 264 CNSDI--YSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISV 323
           C  D+  Y++LI+   R       + +L E R +G   +   +  ++  +  E  +    
Sbjct: 235 CYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRL---- 294

Query: 324 EKVVKLLNEVGGS-CRISGIM--ALIEMFCSLGSFGMAKFVIEINERR---PSF--YNII 383
           ++ +K LN++  S C+ + I    ++   CS G +  A+ ++    R+   PS   +NI+
Sbjct: 295 DEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNIL 354

Query: 384 VRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDC 443
           +   CR+     A   L +M Q GC P++   N LL   CK  K D A    E M+ R C
Sbjct: 355 INFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGC 414

Query: 444 PLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYR 503
             + +T+  ++  LC+ GK+E A+  L+ + S+G  P L T+   +     + +  +A +
Sbjct: 415 YPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIK 474

Query: 504 YAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVL 550
              +  +K       TYS L     + G + +A K   E    G+RPN   +  ++
Sbjct: 475 LLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIM 526

BLAST of Sgr016825.1 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 4.9e-17
Identity = 68/290 (23.45%), Positives = 130/290 (44.83%), Query Frame = 0

Query: 269 YSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLL 328
           Y+ LI+ + +    E  + V+     + +  N + +  ++     +   K +V K + +L
Sbjct: 396 YNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELI-----KGYCKSNVHKAMGVL 455

Query: 329 NEVGGSCRISGIM---ALIEMFCSLGSFGMAKFVIEINERRPSF-----YNIIVRDQCRR 388
           N++     +  ++   +LI+  C  G+F  A  ++ +   R        Y  ++   C+ 
Sbjct: 456 NKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKS 515

Query: 389 CDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTF 448
              E A    + + Q G +P+  +   L+   CK  K DEA  M E+ML ++C  NSLTF
Sbjct: 516 KRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTF 575

Query: 449 EVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSS 508
             +I  LC  GK++ A    + MV  GL P +ST    +        ++ AY       S
Sbjct: 576 NALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLS 635

Query: 509 KHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLK 551
                   TY+     + + G L+DA+ +++++ + G+ P+   Y  ++K
Sbjct: 636 SGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLIK 680

BLAST of Sgr016825.1 vs. ExPASy Swiss-Prot
Match: Q940A6 (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 90.5 bits (223), Expect = 8.4e-17
Identity = 85/379 (22.43%), Positives = 156/379 (41.16%), Query Frame = 0

Query: 194 VAKTNATSNQFWEIINIIRRNEED--LESKLVSLNVSLTNALVAQIFRVLNNQKVSAFRF 253
           VA    T N   + + +  R +E    + K+V   +  T    + + + L   K     +
Sbjct: 304 VAPNVVTFNTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKRIGDAY 363

Query: 254 FNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQL 313
           F    +    FP N  +Y+ LID+F         + +      KG+ L    +  ++   
Sbjct: 364 FVLKEMTKKGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNTLIKGY 423

Query: 314 SNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMFCSLGSFGMA-KFVIEINERRPS-- 373
                   + E+++K +  +G +       ++I + CS   F  A +FV E+  R  S  
Sbjct: 424 CKNGQAD-NAERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRNMSPG 483

Query: 374 --FYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFE 433
                 ++   C+      A     +    G   D    N LL  LC+  K DEA  + +
Sbjct: 484 GGLLTTLISGLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAFRIQK 543

Query: 434 EMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSR 493
           E+L R C ++ +++  +I   C   K++ A  FLD MV RGL P   T++  +   FN  
Sbjct: 544 EILGRGCVMDRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGLFNMN 603

Query: 494 RYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYM 553
           + EEA ++  D      +    TYS++     K     + Q+   E++   ++PN  VY 
Sbjct: 604 KVEEAIQFWDDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNTVVYN 663

Query: 554 RVLKKLQLQGKEELVNDLK 566
            +++     G+  +  +L+
Sbjct: 664 HLIRAYCRSGRLSMALELR 681

BLAST of Sgr016825.1 vs. ExPASy TrEMBL
Match: A0A6J1CJ38 (pentatricopeptide repeat-containing protein At1g62930, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111012070 PE=4 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 7.3e-242
Identity = 434/520 (83.46%), Positives = 465/520 (89.42%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNPFSIYSFASQSPYPSCSPSSHPKTL-DPTDPASKILYHFSRFYTR 110
           MEI GL RKLRNFSNP+ IYSF+SQSPYPSCS  SHP+TL  PTDP+S IL++FS FY+ 
Sbjct: 1   MEISGLPRKLRNFSNPYCIYSFSSQSPYPSCSTFSHPRTLATPTDPSSIILFNFSPFYSI 60

Query: 111 TASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSCHSFSS 170
           +AS+S  SPINA   SLICN  S SAPS+CLF RCNGG LDLNLGLL   SYT+C SF S
Sbjct: 61  SASDSAESPINAAGCSLICNAISHSAPSLCLFGRCNGGRLDLNLGLLQRRSYTTCRSFLS 120

Query: 171 SSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNVSL 230
           SSFN P    TSTEK QCGTGNLDV+K NA  NQFW+II IIRRNEEDLESKL SLN+SL
Sbjct: 121 SSFNQP----TSTEKPQCGTGNLDVSKPNARQNQFWDIIKIIRRNEEDLESKLNSLNLSL 180

Query: 231 TNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPVL 290
           TN LVAQIFRVLNN KVSAFRFFNWIRVQS KFP NSD+YSLLIDNFGRLDDYEGMLPVL
Sbjct: 181 TNVLVAQIFRVLNNDKVSAFRFFNWIRVQSCKFPGNSDVYSLLIDNFGRLDDYEGMLPVL 240

Query: 291 TEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMFCS 350
           TEFRRKGIDLNHKAF F+ VQLSNEASIKISVE+V+KLLNEVGGSCRISG+M+LIEMFCS
Sbjct: 241 TEFRRKGIDLNHKAFVFLHVQLSNEASIKISVERVIKLLNEVGGSCRISGVMSLIEMFCS 300

Query: 351 LGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLL 410
            GS+GMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTLNEMRQVGCSPD GILNYLL
Sbjct: 301 FGSYGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLNEMRQVGCSPDVGILNYLL 360

Query: 411 SSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLD 470
           S LCKN++FDEAQ+MFE ML++DCP NSLTFEVIICHLCEIGKIESAL FLDMMVSRGL+
Sbjct: 361 SCLCKNDRFDEAQSMFEAMLQQDCPPNSLTFEVIICHLCEIGKIESALSFLDMMVSRGLE 420

Query: 471 PRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKI 530
           PRLSTHAAFVKSYFNS+RYEEAYRYAVDSSSKHA AQNATYSLLA+LHEKRGNLVDAQKI
Sbjct: 421 PRLSTHAAFVKSYFNSQRYEEAYRYAVDSSSKHATAQNATYSLLATLHEKRGNLVDAQKI 480

Query: 531 LSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFS 570
           LSELIDAGLRPNFPVY RV KKLQLQGKE+L NDLKGKFS
Sbjct: 481 LSELIDAGLRPNFPVYTRVFKKLQLQGKEDLANDLKGKFS 516

BLAST of Sgr016825.1 vs. ExPASy TrEMBL
Match: A0A6J1F148 (pentatricopeptide repeat-containing protein At1g05670, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111438608 PE=4 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 2.9e-214
Identity = 398/523 (76.10%), Positives = 431/523 (82.41%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           M I  LSR LR+FS+P  +SIYSFA  SP+PS SPS H                F RFYT
Sbjct: 1   MGITSLSRNLRDFSHPYFYSIYSFAPHSPFPSSSPSKH-------------FGRFIRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSC-HSF 170
              S+S A       S L+CN+TSQS PS+CLFERCNG + DLNLGL  PH Y SC  SF
Sbjct: 61  APTSDSAAR------SPLLCNSTSQSVPSLCLFERCNGVTKDLNLGLFRPH-YRSCRRSF 120

Query: 171 SSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNV 230
           SS SF          EK Q GTG L+V K N TSNQFW+IINIIR N+EDLESKL SLNV
Sbjct: 121 SSDSF----------EKPQFGTGELNVFKPNVTSNQFWDIINIIRANQEDLESKLDSLNV 180

Query: 231 SLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLP 290
           S TNALVAQIFRVLNN KVSAFRFFNW+RVQS KFPCNSDIYSLLIDNFGRLDDYEG+LP
Sbjct: 181 SFTNALVAQIFRVLNNHKVSAFRFFNWVRVQSCKFPCNSDIYSLLIDNFGRLDDYEGILP 240

Query: 291 VLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMF 350
           VL EFR+KG+ LNHKAF F+ V LS+E SIKI VE++VKLLNEVGGSCRISG+MALIEMF
Sbjct: 241 VLNEFRQKGVGLNHKAFEFLHVHLSDEDSIKICVERLVKLLNEVGGSCRISGVMALIEMF 300

Query: 351 CSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNY 410
           CSLGSFGMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTL+EMRQ GCSPD GILNY
Sbjct: 301 CSLGSFGMAKFVIEITERRTSFYNIIVREQCRRNDFEGARCTLDEMRQAGCSPDVGILNY 360

Query: 411 LLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRG 470
           LLSSLCKN+K  EAQN+FEEMLERDCP NSLTFEVIICHLCEIG IESAL FLDMMVSRG
Sbjct: 361 LLSSLCKNDKLSEAQNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALNFLDMMVSRG 420

Query: 471 LDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQ 530
           L+PRLSTHAAFVKSYFNS+RYEEAYRY +DSS KH MAQNATYSLLA+LHEKRGNLVDAQ
Sbjct: 421 LEPRLSTHAAFVKSYFNSQRYEEAYRYTIDSSLKHGMAQNATYSLLATLHEKRGNLVDAQ 480

Query: 531 KILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           K+L ELIDAGLRPNFPVYMRVLKKLQ+QG+E+L NDLKGKFSN
Sbjct: 481 KVLCELIDAGLRPNFPVYMRVLKKLQVQGREDLANDLKGKFSN 493

BLAST of Sgr016825.1 vs. ExPASy TrEMBL
Match: A0A6J1L3E8 (pentatricopeptide repeat-containing protein At1g09820-like OS=Cucurbita maxima OX=3661 GN=LOC111500167 PE=4 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 2.1e-212
Identity = 395/523 (75.53%), Positives = 429/523 (82.03%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           M I  LSR LR+FS+P  +SIYSFA  SP+PS SPS H                F RFYT
Sbjct: 1   MGITSLSRNLRDFSHPYFYSIYSFAPHSPFPSSSPSKH-------------FGRFIRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSC-HSF 170
              S+S A       S L+CN+T QS PS+CLFERCNG + DLNLGL  PH Y SC  SF
Sbjct: 61  APTSDSAAR------SPLLCNSTPQSVPSLCLFERCNGVTKDLNLGLFRPH-YRSCRRSF 120

Query: 171 SSSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNV 230
           SS SF          EK Q GTG L+V   N TSNQFW+IINIIR N+E+LESKL SLNV
Sbjct: 121 SSDSF----------EKPQFGTGELNVFNPNVTSNQFWDIINIIRANQENLESKLDSLNV 180

Query: 231 SLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLP 290
           S TNALVAQIFRVLNN KVSAFRFFNW++VQS KFPCNSDIYSLLIDNFGRLDDYEG++P
Sbjct: 181 SFTNALVAQIFRVLNNHKVSAFRFFNWVKVQSCKFPCNSDIYSLLIDNFGRLDDYEGIIP 240

Query: 291 VLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMF 350
           VL EFR+KG+ LNHKAF F+ V LSN+ SIKISVE++VKLLNEVGGSCRISG+MALIEMF
Sbjct: 241 VLNEFRQKGVGLNHKAFEFLHVNLSNDDSIKISVERLVKLLNEVGGSCRISGVMALIEMF 300

Query: 351 CSLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNY 410
           CSLGSFGMAKFVIEI ERR SFYNIIVR+QCRR DFEGARCTL+EMRQ GCSPD GILNY
Sbjct: 301 CSLGSFGMAKFVIEITERRTSFYNIIVREQCRRNDFEGARCTLDEMRQAGCSPDVGILNY 360

Query: 411 LLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRG 470
           LLSSLCKN+KF EA N+FEEMLERDCP NSLTFEVIICHLCEIG IESAL FLD MVSRG
Sbjct: 361 LLSSLCKNDKFSEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALNFLDTMVSRG 420

Query: 471 LDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQ 530
           L+PRLSTHAAFVKSYFNS+RYEEAYRY VDSS KH MAQNATYSLLA+LHEKRGNLVDAQ
Sbjct: 421 LEPRLSTHAAFVKSYFNSQRYEEAYRYTVDSSLKHGMAQNATYSLLATLHEKRGNLVDAQ 480

Query: 531 KILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           KIL ELIDAGLRPNFPVYMRVLKKLQ+QG+E+L NDLKGKFSN
Sbjct: 481 KILCELIDAGLRPNFPVYMRVLKKLQVQGREDLANDLKGKFSN 493

BLAST of Sgr016825.1 vs. ExPASy TrEMBL
Match: A0A0A0L3E7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G637180 PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 9.7e-194
Identity = 365/522 (69.92%), Positives = 415/522 (79.50%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           MEIP LSR LRNFSNP  +SIYSFA +SP+                    +LY FSRFYT
Sbjct: 1   MEIPCLSRNLRNFSNPYFYSIYSFAPRSPF--------------------LLYRFSRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSCHSFS 170
             AS+S A       SSLICN+TSQS PS+CLFERCNGG+ DLNL L   H Y SC +FS
Sbjct: 61  ILASDSAAG------SSLICNSTSQSVPSLCLFERCNGGTSDLNLALF-RHHYRSCRAFS 120

Query: 171 SSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNVS 230
           S S           EK QCGTGNL+V+K N TSNQ   IINIIR N+EDLESKL S NV 
Sbjct: 121 SFSL----------EKRQCGTGNLNVSKRNVTSNQLSNIINIIRENQEDLESKLDSPNVR 180

Query: 231 LTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPV 290
           LTN LV QI  +LN  K+SA RFFNW+ VQS KFPCNSD+YSLLIDNFGRLDDYEG+LPV
Sbjct: 181 LTNVLVGQILEMLNKHKISASRFFNWVSVQSCKFPCNSDVYSLLIDNFGRLDDYEGILPV 240

Query: 291 LTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMFC 350
           L EF  KGI+LNHKAFGF L+ LSNE S+K+SV K+VKLLNE GG+CR+SGIMALIEMFC
Sbjct: 241 LIEFGLKGIELNHKAFGF-LLPLSNEHSMKLSVVKLVKLLNEAGGTCRLSGIMALIEMFC 300

Query: 351 SLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYL 410
           SLGSFGMAKFVIEI E+R SFY IIVR++C++ DFEGARCTL+EMRQVGC PDAGILNYL
Sbjct: 301 SLGSFGMAKFVIEITEKRSSFYYIIVREKCKQKDFEGARCTLDEMRQVGCIPDAGILNYL 360

Query: 411 LSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGL 470
           LSSLCKN+KF EA N+ EEMLE++C  NSLTFE+IICHLC+IG IESALG+LDMMV+ GL
Sbjct: 361 LSSLCKNDKFGEAHNLLEEMLEQNCSPNSLTFEIIICHLCKIGNIESALGYLDMMVAGGL 420

Query: 471 DPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQK 530
            PRLSTHAAFVKSYF+S+RYEEAY+YAVDSS K+   QNATYSLLA+LHEKRGNLVDAQK
Sbjct: 421 MPRLSTHAAFVKSYFSSQRYEEAYQYAVDSSLKYVTTQNATYSLLATLHEKRGNLVDAQK 480

Query: 531 ILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           ILSEL+DAGL+P+F VY R+LKKLQ+QG+ +L NDLK K SN
Sbjct: 481 ILSELMDAGLKPHFHVYTRLLKKLQVQGRGDLANDLKRKISN 484

BLAST of Sgr016825.1 vs. ExPASy TrEMBL
Match: A0A5D3D9T8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G003980 PE=4 SV=1)

HSP 1 Score: 665.2 bits (1715), Expect = 3.0e-187
Identity = 356/522 (68.20%), Positives = 409/522 (78.35%), Query Frame = 0

Query: 51  MEIPGLSRKLRNFSNP--FSIYSFASQSPYPSCSPSSHPKTLDPTDPASKILYHFSRFYT 110
           MEIP LSR LRNFSNP  +SIYSFA  SP+P                   +   FSRFYT
Sbjct: 1   MEIPCLSRNLRNFSNPYFYSIYSFAPYSPFP-------------------LYRFFSRFYT 60

Query: 111 RTASNSVASPINAVDSSLICNTTSQSAPSVCLFERCNGGSLDLNLGLLGPHSYTSCHSFS 170
             AS+S A       SSLICN+TSQS  S+CLFERCNG + DLNL L   H Y SC SFS
Sbjct: 61  TPASDSAAG------SSLICNSTSQSVSSLCLFERCNGRTSDLNLALF-RHHYRSCRSFS 120

Query: 171 SSSFNGPELGTTSTEKSQCGTGNLDVAKTNATSNQFWEIINIIRRNEEDLESKLVSLNVS 230
           S S           +K Q GTGNL+++K  ATS +FW+II+II+ N+EDLESKL SLNV 
Sbjct: 121 SFSL----------KKLQRGTGNLNLSKPIATSEKFWQIIDIIQINQEDLESKLDSLNVR 180

Query: 231 LTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPV 290
           LTN LV +I  +LN +K+SAFRFFNW+ VQ  +FP NSD+YSLLIDNFGRLDDYEG+LP 
Sbjct: 181 LTNVLVVEILGMLNKRKISAFRFFNWVSVQWCRFPSNSDVYSLLIDNFGRLDDYEGILPF 240

Query: 291 LTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLLNEVGGSCRISGIMALIEMFC 350
           L EF RKGI+LNHKAFGF L+ L+NE S+K SV K+VK+LN+  G+CRISG+ ALIEMFC
Sbjct: 241 LIEFSRKGIELNHKAFGF-LLPLANEDSMKSSVIKLVKMLNKAEGTCRISGVKALIEMFC 300

Query: 351 SLGSFGMAKFVIEINERRPSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYL 410
           S+GS  MAKFVIEI E+R SFY IIVR+QC+R DFEGARCTL+EMRQ GC PDAGI NYL
Sbjct: 301 SVGSSEMAKFVIEITEKRSSFYYIIVREQCQRKDFEGARCTLDEMRQAGCLPDAGIFNYL 360

Query: 411 LSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGL 470
           LSSLCKN+KF EA N+FEEMLE++CP NSL+FEVIICHLC+IG IESALGFLD MV+RGL
Sbjct: 361 LSSLCKNDKFGEAHNLFEEMLEQNCPPNSLSFEVIICHLCKIGNIESALGFLDTMVARGL 420

Query: 471 DPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQK 530
            PRLSTHA FVKSYF SRRYEEAY+YAVDSSSK+ M QNATYSLLA+LHEKRGNLVDAQK
Sbjct: 421 QPRLSTHATFVKSYFYSRRYEEAYQYAVDSSSKYVMTQNATYSLLATLHEKRGNLVDAQK 480

Query: 531 ILSELIDAGLRPNFPVYMRVLKKLQLQGKEELVNDLKGKFSN 571
           ILSEL+DAGL+PNF V  RVLKKLQ+QG+E+L NDLKGK SN
Sbjct: 481 ILSELVDAGLKPNFHVCKRVLKKLQVQGREDLANDLKGKLSN 485

BLAST of Sgr016825.1 vs. TAIR 10
Match: AT1G52640.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 100.1 bits (248), Expect = 7.5e-21
Identity = 92/378 (24.34%), Positives = 163/378 (43.12%), Query Frame = 0

Query: 202 NQFWEIINIIRRNEEDLESKLVSLNVSLTNALVAQIFRVLNNQKVSAFRFFNWIRVQSWK 261
           N+   +++  R  ++DLE  LV+ +  +++ LV Q+ +   N    A RFF W R +   
Sbjct: 39  NEISRVLSDHRNPKDDLEHTLVAYSPRVSSNLVEQVLKRCKNLGFPAHRFFLWAR-RIPD 98

Query: 262 FPCNSDIYSLLIDNFGRLDDYEGMLPVLTEFRRKG-IDLNHKAFGFVLVQLSNEASIKIS 321
           F  + + Y +L++  G    +  +   L E R     +++ K F +++ +  + A++   
Sbjct: 99  FAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFEISSKVF-WIVFRAYSRANLPSE 158

Query: 322 VEKVVKLLNEVGGSCRISGIMALIEMFC-------SLGSFGMAKFVIEINERRPSFYNII 381
             +    + E G    +  +  L+   C       +   FG AK    +   +   Y+I+
Sbjct: 159 ACRAFNRMVEFGIKPCVDDLDQLLHSLCDKKHVNHAQEFFGKAKGFGIVPSAKT--YSIL 218

Query: 382 VRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDC 441
           VR   R  D  GAR   +EM +  C  D    N LL +LCK+   D    MF+EM     
Sbjct: 219 VRGWARIRDASGARKVFDEMLERNCVVDLLAYNALLDALCKSGDVDGGYKMFQEMGNLGL 278

Query: 442 PLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYR 501
             ++ +F + I   C+ G + SA   LD M    L P + T    +K+   + + ++AY 
Sbjct: 279 KPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDLVPNVYTFNHIIKTLCKNEKVDDAYL 338

Query: 502 YAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQ 561
              +   K A     TY+ + + H     +  A K+LS +      P+   Y  VLK L 
Sbjct: 339 LLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATKLLSRMDRTKCLPDRHTYNMVLKLLI 398

Query: 562 LQGKEELVNDLKGKFSNR 572
             G+ +   ++    S R
Sbjct: 399 RIGRFDRATEIWEGMSER 412

BLAST of Sgr016825.1 vs. TAIR 10
Match: AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 98.6 bits (244), Expect = 2.2e-20
Identity = 72/292 (24.66%), Positives = 131/292 (44.86%), Query Frame = 0

Query: 269 YSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLL 328
           Y  LID      +    L +  E   KGI  N   +  ++  LSN+  I +   ++   +
Sbjct: 359 YRSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMI-LEAAQLANEM 418

Query: 329 NEVGGSCRISGIMALIEMFCSLGSFGMAKFVIEINERRPSF-----YNIIVRDQCRRCDF 388
           +E G    +     L+   C +G    A  ++++   +  F     +NI++     +   
Sbjct: 419 SEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKM 478

Query: 389 EGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTFEVI 448
           E A   L+ M   G  PD    N LL+ LCK +KF++    ++ M+E+ C  N  TF ++
Sbjct: 479 ENALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNIL 538

Query: 449 ICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNS----------RRYEEAYR 508
           +  LC   K++ ALG L+ M ++ ++P   T    +  +  +          R+ EEAY+
Sbjct: 539 LESLCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYK 598

Query: 509 YAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVY 546
             V SS+        TY+++     ++ N+  A+K+  E++D  L P+   Y
Sbjct: 599 --VSSST-------PTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLGPDGYTY 640

BLAST of Sgr016825.1 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 93.2 bits (230), Expect = 9.2e-19
Identity = 74/296 (25.00%), Positives = 135/296 (45.61%), Query Frame = 0

Query: 264 CNSDI--YSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISV 323
           C  D+  Y++LI+   R       + +L E R +G   +   +  ++  +  E  +    
Sbjct: 235 CYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRL---- 294

Query: 324 EKVVKLLNEVGGS-CRISGIM--ALIEMFCSLGSFGMAKFVIEINERR---PSF--YNII 383
           ++ +K LN++  S C+ + I    ++   CS G +  A+ ++    R+   PS   +NI+
Sbjct: 295 DEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNIL 354

Query: 384 VRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDC 443
           +   CR+     A   L +M Q GC P++   N LL   CK  K D A    E M+ R C
Sbjct: 355 INFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGC 414

Query: 444 PLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYR 503
             + +T+  ++  LC+ GK+E A+  L+ + S+G  P L T+   +     + +  +A +
Sbjct: 415 YPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIK 474

Query: 504 YAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVL 550
              +  +K       TYS L     + G + +A K   E    G+RPN   +  ++
Sbjct: 475 LLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIM 526

BLAST of Sgr016825.1 vs. TAIR 10
Match: AT5G61990.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 92.0 bits (227), Expect = 2.0e-18
Identity = 72/325 (22.15%), Positives = 143/325 (44.00%), Query Frame = 0

Query: 255 IRVQSWKFPCNSDIYSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNE 314
           + + S     ++  YSLLID   +  + +    ++ E    GI++    +   +  +S E
Sbjct: 301 VEMDSLGVSLDNHTYSLLIDGLLKGRNADAAKGLVHEMVSHGINIKPYMYDCCICVMSKE 360

Query: 315 ASIKISVEKVVKLLNEVGGSCRISGIMA---LIEMFCSLGSFGMA-KFVIEINERR---- 374
                 +EK   L + +  S  I    A   LIE +C   +     + ++E+ +R     
Sbjct: 361 G----VMEKAKALFDGMIASGLIPQAQAYASLIEGYCREKNVRQGYELLVEMKKRNIVIS 420

Query: 375 PSFYNIIVRDQCRRCDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFE 434
           P  Y  +V+  C   D +GA   + EM   GC P+  I   L+ +  +N++F +A  + +
Sbjct: 421 PYTYGTVVKGMCSSGDLDGAYNIVKEMIASGCRPNVVIYTTLIKTFLQNSRFGDAMRVLK 480

Query: 435 EMLERDCPLNSLTFEVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSR 494
           EM E+    +   +  +I  L +  +++ A  FL  MV  GL P   T+ AF+  Y  + 
Sbjct: 481 EMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENGLKPNAFTYGAFISGYIEAS 540

Query: 495 RYEEAYRYAVDSSSKHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYM 554
            +  A +Y  +      +      + L + + K+G +++A      ++D G+  +   Y 
Sbjct: 541 EFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEACSAYRSMVDQGILGDAKTYT 600

Query: 555 RVLKKL----QLQGKEELVNDLKGK 568
            ++  L    ++   EE+  +++GK
Sbjct: 601 VLMNGLFKNDKVDDAEEIFREMRGK 621

BLAST of Sgr016825.1 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 91.3 bits (225), Expect = 3.5e-18
Identity = 68/290 (23.45%), Positives = 130/290 (44.83%), Query Frame = 0

Query: 269 YSLLIDNFGRLDDYEGMLPVLTEFRRKGIDLNHKAFGFVLVQLSNEASIKISVEKVVKLL 328
           Y+ LI+ + +    E  + V+     + +  N + +  ++     +   K +V K + +L
Sbjct: 396 YNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELI-----KGYCKSNVHKAMGVL 455

Query: 329 NEVGGSCRISGIM---ALIEMFCSLGSFGMAKFVIEINERRPSF-----YNIIVRDQCRR 388
           N++     +  ++   +LI+  C  G+F  A  ++ +   R        Y  ++   C+ 
Sbjct: 456 NKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKS 515

Query: 389 CDFEGARCTLNEMRQVGCSPDAGILNYLLSSLCKNNKFDEAQNMFEEMLERDCPLNSLTF 448
              E A    + + Q G +P+  +   L+   CK  K DEA  M E+ML ++C  NSLTF
Sbjct: 516 KRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTF 575

Query: 449 EVIICHLCEIGKIESALGFLDMMVSRGLDPRLSTHAAFVKSYFNSRRYEEAYRYAVDSSS 508
             +I  LC  GK++ A    + MV  GL P +ST    +        ++ AY       S
Sbjct: 576 NALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLS 635

Query: 509 KHAMAQNATYSLLASLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLK 551
                   TY+     + + G L+DA+ +++++ + G+ P+   Y  ++K
Sbjct: 636 SGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLIK 680

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022141785.1  1.5e-241  83.46         pentatricopeptide repeat-containing protein At1g62930, chloroplastic-like [Momor...
XP_022932243.1  6.0e-214  76.10         pentatricopeptide repeat-containing protein At1g05670, mitochondrial-like [Cucur...
KAG7015541.1    1.7e-213  75.91         Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_023552923.1  1.1e-212  75.72         pentatricopeptide repeat-containing protein At1g09820-like [Cucurbita pepo subsp...
XP_023007595.1  4.3e-212  75.53         pentatricopeptide repeat-containing protein At1g09820-like [Cucurbita maxima]

Match Name  E-value   Identity (%)  Description
Q9SSR6      1.1e-19   24.34         Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop...
Q9CA58      3.1e-19   24.66         Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th...
Q3EDF8      1.3e-17   25.00         Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...
Q9LSL9      4.9e-17   23.45         Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Q940A6      8.4e-17   22.43         Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...

Match Name  E-value   Identity (%)  Description
A0A6J1CJ38  7.3e-242  83.46         pentatricopeptide repeat-containing protein At1g62930, chloroplastic-like OS=Mom...
A0A6J1F148  2.9e-214  76.10         pentatricopeptide repeat-containing protein At1g05670, mitochondrial-like OS=Cuc...
A0A6J1L3E8  2.1e-212  75.53         pentatricopeptide repeat-containing protein At1g09820-like OS=Cucurbita maxima O...
A0A0A0L3E7  9.7e-194  69.92         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G637180 PE=4 SV=1
A0A5D3D9T8  3.0e-187  68.20         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name   E-value   Identity (%)  Description
AT1G52640.1  7.5e-21   24.34         Pentatricopeptide repeat (PPR) superfamily protein
AT1G74580.1  2.2e-20   24.66         Pentatricopeptide repeat (PPR) superfamily protein
AT1G09900.1  9.2e-19   25.00         Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G61990.1  2.0e-18   22.15         Pentatricopeptide repeat (PPR) superfamily protein
AT5G65560.1  3.5e-18   23.45         Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term    Source Description                           Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756      TIGR00756                                    coord: 439..470; e-value: 8.8E-4; score: 17.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756      TIGR00756                                    coord: 406..436; e-value: 1.5E-6; score: 26.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756      TIGR00756                                    coord: 370..402; e-value: 2.6E-4; score: 18.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041        PPR_2                                        coord: 400..448; e-value: 1.9E-10; score: 40.8
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375        PPR                                          coord: 366..400; score: 8.758137
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375        PPR                                          coord: 401..435; score: 11.158661
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375        PPR                                          coord: 436..470; score: 10.347525
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375        PPR                                          coord: 506..540; score: 8.560833
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10     Tetratricopeptide repeat domain              coord: 199..518; e-value: 6.9E-28; score: 99.9
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452          TPR-like                                     coord: 403..548
None       No IPR available                                   PANTHER      PTHR47936:SF3  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED  coord: 53..570
None       No IPR available                                   PANTHER      PTHR47936      FAMILY NOT NAMED                             coord: 53..570
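
Each `coord: start..end` entry in the InterPro table gives the span of a domain hit on the polypeptide. As a hedged sketch, assuming the coordinates are 1-based and inclusive at both ends (the usual convention for protein feature ranges, not stated on this page), the span lengths can be computed as follows. The helper name `span_length` is illustrative.

```python
import re

# Captures the start and end of a "coord: start..end" alignment entry.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_length(alignment):
    """Length in residues of a 'coord: start..end' range (both ends inclusive)."""
    match = COORD.search(alignment)
    if match is None:
        raise ValueError(f"no coordinate range found in: {alignment!r}")
    start, end = map(int, match.groups())
    return end - start + 1


# PROSITE PS51375 (PPR) hits copied from the InterPro table above.
ppr_hits = [
    "coord: 366..400",
    "coord: 401..435",
    "coord: 436..470",
    "coord: 506..540",
]
lengths = [span_length(hit) for hit in ppr_hits]
print(lengths)  # [35, 35, 35, 35]
```

Under that assumption every PROSITE hit spans 35 residues, which matches the 35-amino-acid motif that gives the pentatricopeptide repeat its name.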

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Sgr016825     Sgr016825    gene


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name        Type
Sgr016825.1.exon1  Sgr016825.1.exon1  exon
Sgr016825.1.exon2  Sgr016825.1.exon2  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name     Unique Name        Type
cds.Sgr016825.1  cds.Sgr016825.1    CDS
cds.Sgr016825.1  cds.Sgr016825.1_2  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name          Type
Sgr016825.1   Sgr016825.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding