Sgr024324 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr024324
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationtig00001291: 1785045 .. 1788939 (+)
RNA-Seq ExpressionSgr024324
SyntenySgr024324
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCCTATGATTACTAAGGAAGCAGACACGTGTACGGCAAAGATGAGGTGGCTGACATTTTCTCATGATAGCCAACAGTAGAAAAGCCGTAAAAAGTCCAGCAAATGATTAAAATAAAAGAATCACTTTTTTTACAAATTTGGTACCTTTTTTTATTTTATTAAAGTTTTTTTTAATTTTAATTTTATTAAAGTTAGGCGGTAATAATTAGGGGAAAAAAGATTATAAAGGGAATATTGGAAATGGACATGGAACAGCTCGTTTGTTAAATCGATGTACAATTCATGGGCCCACCATCCACTAGTTTTTTACCACGTGGTTACCATCCCACATCATAATTACATCAACTACTGTTCAGCTTACGTTTTTTCTTTTGTCGAATCCCTTGCTTATTGGGACATTTTATTTGATAAAAAAAAAATCTTTTACTACTTATTTGGTTTGAGAAAACATGTTTATTTTATTTTATTTTTTTAATAAGGTTTCATTTACTTAATAAGAATAACAAGAGTAAGTCTTGTTTCTAAAACGAAAGAAAAAGGAAGAATAAAAATAGAATAAAATTTTGGTTTATATTTTATTTTGGTCTCTTTTAACTTTTAGGTTTGTTTTATTTTAATATTTAAACTTTTAAAATATTTATTTTAGTTCATATATTTTTAAGTTTGTTTAATTTGACTCTTTGAATTTTTAAAATATTTATTTAGTCCATAAATTTTTAAAAAATAGTATTTTGGTCATCGTCATTAGTTTGTATCAATTATTTTAACAAATGAATAAAATGGCTTCAATTATATTTGTTAATGTATTAACATGAAAATTTATTGGAGCCACATAGATTGGCTGAGTTTTATTGGGTGATGGGTGATAGCTATTTGATCGCTATATTCTTGCATGGAGAGGGATAAAAAACAATTAGGTAAAAAAAAAAAGGCAAAGTCCAACATATTTAGGAATCAAAATAATATTTCTTATATAGTTATTTGATCTTTTATTCAAAACTATGAATTTGTTTTATATTGTATATTGAGAATTATTTCAATTTCCACCGAGAGATATCTCAAAGAACTCTAACTTTGCAAAGTTAGACCCCTCCCCTCGTGGAGCTCTAAAAATCAATATGGATGCTGTTGCTATAATGAGGAGCGAAGATCGAAATTGGAGATGTTTAGGAATCGTGAAGGAACCATTTTAGCAACCCCTCACAAAATATCGATGTTGCTTATGTTCCTCTCATGGCGGGCGGCAACACTAAAAATCTTAGAATGAATCAAAGATATGACACATATTGAAGTGTTCTTTTTTGACATTGAAATCTGATTGAATAATGCAATCTCCTCAAACTTGTTAAATGAATCGTCCTTTATTGTCGAAGGAGTGTTGATTGAATAGATCGTTGGATTAGCCAAGTCACTCGAGTATGTTAGTTGTGAGAGTGTGAGAGAGAGTGCAATGTGATAATAGCCCATAAACAAATCACCATATTTGGGTATTTGATTTCTCTACTTCACTTGGCTAGACTATAAATCATAATACTATTTTATCTCATGATTAATAAGAAGCTCTCATCCAATTTACATCCAATTTTGTTTTTAAGAAATTGGTTTTCGATATTTTGTATAAATTGCATTTTGTTCCCTAAACACCAAATTTATTTTATATTTTTCTTTAATAAGTAGATATTTGGTGGTCTCTACTCTCCATTAGGATTTAGGATAACGTTGTTTTATAAGGTCGACTCCATGATTTTATATCGGATCAGCGAGGCCCAACTTAAAAAGTAAAGGCCCAAAAGGATTTTAAAAATACAAAAAAATTGAACGCCCACCGTGGGGCTCGAACCCACGACCACAAGGTTAAGAGCCTTGCGCTCTACCAACTGAGCTAGACGGGCCTGCTGATGAAAGTCATACAATTTTTTACTTAGAATCAAAAAATTTCTCAAAAATGGAAACATTATTGTTGCGTTTTGTTTGCCATTATTTTTCCTTTGCTCCAGTAACATTTCTTGTTTTATTTTATTATAAATTAAATTTATATATCGAGCTAATCAATATTTTGTCTTATGAATAAAATTATTTATATTATTTTAATACTTCAACTAATAAATTAATTTGAGATTTTCTTATCTAATATAATTTAATATTTATTTATACATATATATATTTTAGTACAACGATGGCTGGGTGAAAAGATTTGAACTTCTGTCTTTTAAAAGGTAATTTGTGCCTTAACCAATTGATTTATGTTTATGTTTGCACATTTATATAATCATTTCATATGAATTTAATTTACTAAGTTTGTCTAAAAAAAATCACTAAATTAAATATCATTAATTTTTCTAGAAGTAATTAGCTAAAAAGATTTTTTTCCCCTTTTGATAGTGCTCCTTTGGAATAAAAAGGGTATCCTGTAATTGTATTCTTGAGAACTCAATATATATATATATATTATATATATATATATTCTTGAAACTATTTTTATAATTTTATCAGTTGCATGAATCAAAAGAGAAAACGGGGGAAGAGAAGAACTAAGAAGGCTGAGGATATTTTCTCTCATCTGTCTTATAAATCCTTGTTTTTTAATGTGATAAATTTCGAATTTATGTAATAAATACTCATGTTTGTGAATTTGGGACGTAAATTCAAGAAGCGGTTCGAATTCTGATGCTTGCTTCCGAGCTAAAATATTGTTGAACAACTCTGTAAATGTAAAGCAGGCAACTCAAATTCATGCCCACATTCTCGTCAATGGGTTCCGACATCTCGAGTCTCTCTTGGTTCGTCAAATCACTCGCTCTGAGTTCACTTGCGCCAGAATTGTATCCCGTTATCTCCGACAAATCCTTCACTATTCGCAAAACCCAGTTGCTTTCTCATGGGGTTGCGCCGTTCGATTCTTTTCCCAGAACGGTCAATTCATGGAAGCTTTCTCTCATTATGTTCAGATGCAGAGATTGGGACTGCGTCCAAGCACATTTGCTGTATCTTCGACTTTGAGAGCTTGCGGTAGAATTATGTGCAAGTTTGGAGGGAGTTATGTTCACTCTCAGGTTTATAAGTTGGGGTTCTGTCGGTGTGTTACGTGCAGACTGCCCTTGTGGATTTTTATTCAAAACTGGGTGACATGGGTTTTGCACAGAAGGTTTTTGATGATCTGATCGAGAAAAATGTGGTTTCGTGGAATTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGACGCTCAGAAAGTGTTCGATGAAATGCCTGAGAAAGATGTCATATCTTGGAATTCGATGTTGTCGGGATTCGCCAACTCTGGAAATTTGGATCAAGCGTCCTGTTTATTCCAACAAATGCGGGAGAAAAGTTCAGCTTCTTGGAACGCAATGATCAGTGGTTACGTGAACTGTGGAGACATAAAGTCTGCAAGAAACCTGTTTGATGCAATGCCTCAAAGAAATAATGTTTCCTGGATTACATTGATTGCTGGGTATTCGAAGCTTGGGGAGTTGGTGCTGCTCGCGAGCTCTTTGACAAGATGGGTGTGAAAGAGCTTCTCTCTTTTAACGCCATGATTGCTTGCTATTCACAAAATAGCCTACCCAACGAAGCATTGGAGTTGTTCAACCGAATGCTTCAACCCCATGTGAATATCCAACCCGATGAGATGACTTTGCTAGCATTATATCTGCTTGTTCGCAGCTGGGGAATTTGAATTATGGTACTTGGATTGAATCGTATATGGAAAAACTTGGGATTGAATTGGATGAATATTTGGCCACTGCATTGGTAGACTTGTATGCAAAATCCGGGAACATCGAAAGGGCGTTTGAGCTGTTCAATGGTCTGAAAAAGAGGGATTTAGTTGCTTATTCAGCTATGATCTTTGGATGTGGATAA

mRNA sequence

ATGGCGCCTATGATTACTAAGGAAGCAGACACGTGTACGGCAAAGATGAGAAGCGGTTCGAATTCTGATGCTTGCTTCCGAGCTAAAATATTGTTGAACAACTCTGTAAATGTAAAGCAGGCAACTCAAATTCATGCCCACATTCTCGTCAATGGGTTCCGACATCTCGAGTCTCTCTTGGTTCGTCAAATCACTCGCTCTGAGTTCACTTGCGCCAGAATTGTATCCCGTTATCTCCGACAAATCCTTCACTATTCGCAAAACCCAGTTGCTTTCTCATGGGGTTGCGCCGTTCGATTCTTTTCCCAGAACGGTCAATTCATGGAAGCTTTCTCTCATTATGTTCAGATGCAGAGATTGGGACTGCGTCCAAGCACATTTGCTGTATCTTCGACTTTGAGAGCTTGCGGTAGAATTATGTGCAAGTTTGGAGGGAGTTATGTTCACTCTCAGAAGGTTTTTGATGATCTGATCGAGAAAAATGTGGTTTCGTGGAATTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGACGCTCAGAAAGTGTTCGATGAAATGCCTGAGAAAGATGTCATATCTTGGAATTCGATGTTGTCGGGATTCGCCAACTCTGGAAATTTGGATCAAGCGTCCTGTTTATTCCAACAAATGCGGGAGAAAAGTTCAGCTTCTTGGAACGCAATGATCAGTGGTTACGTGAACTGTGGAGACATAAAGTCTGCAAGAAACCTGTTTGATGCAATGCCTCAAAGAAATAATGTTTCCTGGATTACATTGATTGCTGGGTATTCGAAGCTTGGGGAGTTGGTGCTGCTCGCGAGCTCTTTGACAAGATGGCTGGGGAATTTGAATTATGGTACTTGGATTGAATCGTATATGGAAAAACTTGGGATTGAATTGGATGAATATTTGGCCACTGCATTGGTAGACTTGTATGCAAAATCCGGGAACATCGAAAGGGCGTTTGAGCTGTTCAATGGTCTGAAAAAGAGGGATTTAGTTGCTTATTCAGCTATGATCTTTGGATGTGGATAA

Coding sequence (CDS)

ATGGCGCCTATGATTACTAAGGAAGCAGACACGTGTACGGCAAAGATGAGAAGCGGTTCGAATTCTGATGCTTGCTTCCGAGCTAAAATATTGTTGAACAACTCTGTAAATGTAAAGCAGGCAACTCAAATTCATGCCCACATTCTCGTCAATGGGTTCCGACATCTCGAGTCTCTCTTGGTTCGTCAAATCACTCGCTCTGAGTTCACTTGCGCCAGAATTGTATCCCGTTATCTCCGACAAATCCTTCACTATTCGCAAAACCCAGTTGCTTTCTCATGGGGTTGCGCCGTTCGATTCTTTTCCCAGAACGGTCAATTCATGGAAGCTTTCTCTCATTATGTTCAGATGCAGAGATTGGGACTGCGTCCAAGCACATTTGCTGTATCTTCGACTTTGAGAGCTTGCGGTAGAATTATGTGCAAGTTTGGAGGGAGTTATGTTCACTCTCAGAAGGTTTTTGATGATCTGATCGAGAAAAATGTGGTTTCGTGGAATTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGACGCTCAGAAAGTGTTCGATGAAATGCCTGAGAAAGATGTCATATCTTGGAATTCGATGTTGTCGGGATTCGCCAACTCTGGAAATTTGGATCAAGCGTCCTGTTTATTCCAACAAATGCGGGAGAAAAGTTCAGCTTCTTGGAACGCAATGATCAGTGGTTACGTGAACTGTGGAGACATAAAGTCTGCAAGAAACCTGTTTGATGCAATGCCTCAAAGAAATAATGTTTCCTGGATTACATTGATTGCTGGGTATTCGAAGCTTGGGGAGTTGGTGCTGCTCGCGAGCTCTTTGACAAGATGGCTGGGGAATTTGAATTATGGTACTTGGATTGAATCGTATATGGAAAAACTTGGGATTGAATTGGATGAATATTTGGCCACTGCATTGGTAGACTTGTATGCAAAATCCGGGAACATCGAAAGGGCGTTTGAGCTGTTCAATGGTCTGAAAAAGAGGGATTTAGTTGCTTATTCAGCTATGATCTTTGGATGTGGATAA

Protein sequence

MAPMITKEADTCTAKMRSGSNSDACFRAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFGCG
Homology
BLAST of Sgr024324 vs. NCBI nr
Match: XP_038874892.1 (pentatricopeptide repeat-containing protein At4g22760-like [Benincasa hispida])

HSP 1 Score: 520.0 bits (1338), Expect = 1.6e-143
Identity = 277/402 (68.91%), Postives = 294/402 (73.13%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           K+ LN+S+N KQATQIHAHILVNG  +LES LVRQITRSEFTCARIVS YL+QILH+SQN
Sbjct: 7   KVFLNSSINFKQATQIHAHILVNGLPNLESCLVRQITRSEFTCARIVSLYLQQILHHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P AF+W CAVRFFSQNGQFMEA SHYVQMQRLGL P TFAVSSTLRACGRIMCKFGGSYV
Sbjct: 67  PDAFTWACAVRFFSQNGQFMEAISHYVQMQRLGLHPGTFAVSSTLRACGRIMCKFGGSYV 126

Query: 149 H------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H                              +QKVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKFGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
           VDAQKVFDEMP KDVISWNSML+GFANSGN+D+A CLFQQM EKSSASWNAMISGYVNCG
Sbjct: 187 VDAQKVFDEMPLKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           DIKSARNLFD MP RNNVSWITLIAGYSKLGE+                           
Sbjct: 247 DIKSARNLFDVMPNRNNVSWITLIAGYSKLGEVNSAYELFDKMGEKELLSFNAMIACYSQ 306

Query: 329 -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDE 347
                                    +  AS  S    LGNLN GTWIESYMEKLGIELD+
Sbjct: 307 NSLPNKALELFNQMLQPDVNIQPDEMTFASIISACTQLGNLNCGTWIESYMEKLGIELDD 366

BLAST of Sgr024324 vs. NCBI nr
Match: XP_038906935.1 (pentatricopeptide repeat-containing protein At4g22760 [Benincasa hispida])

HSP 1 Score: 520.0 bits (1338), Expect = 1.6e-143
Identity = 277/402 (68.91%), Postives = 294/402 (73.13%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           K+ LN+S+N KQATQIHAHILVNG  +LES LVRQITRSEFTCARIVS YL+QILH+SQN
Sbjct: 7   KVFLNSSINFKQATQIHAHILVNGLPNLESCLVRQITRSEFTCARIVSLYLQQILHHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P AF+W CAVRFFSQNGQFMEA SHYVQMQRLGL P TFAVSSTLRACGRIMCKFGGSYV
Sbjct: 67  PDAFTWACAVRFFSQNGQFMEAISHYVQMQRLGLHPGTFAVSSTLRACGRIMCKFGGSYV 126

Query: 149 H------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H                              +QKVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKFGFCRCVYVQTALVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
           VDAQKVFDEMP KDVISWNSML+GFANSGN+D+A CLFQQM EKSSASWNAMISGYVNCG
Sbjct: 187 VDAQKVFDEMPLKDVISWNSMLTGFANSGNMDRAWCLFQQMGEKSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           DIKSARNLFD MP RNNVSWITLIAGYSKLGE+                           
Sbjct: 247 DIKSARNLFDVMPNRNNVSWITLIAGYSKLGEVNSAYELFDKMGEKELLSFNAMIACYSQ 306

Query: 329 -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDE 347
                                    +  AS  S    LGNLN GTWIESYMEKLGIELD+
Sbjct: 307 NSLPNKALELFNQMLQPDVNIQPDEMTFASIISACTQLGNLNCGTWIESYMEKLGIELDD 366

BLAST of Sgr024324 vs. NCBI nr
Match: XP_022155871.1 (pentatricopeptide repeat-containing protein At4g22760 [Momordica charantia])

HSP 1 Score: 509.6 bits (1311), Expect = 2.1e-140
Identity = 272/404 (67.33%), Postives = 299/404 (74.01%), Query Frame = 0

Query: 29  KILLNN--SVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYS 88
           +ILLN   SVNVKQA+QIH+ I+VNG RHLE+LLVRQITRSEFTCARIVS YL++IL +S
Sbjct: 7   QILLNKKCSVNVKQASQIHSRIIVNGLRHLETLLVRQITRSEFTCARIVSCYLQRILRHS 66

Query: 89  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGS 148
           QNP AFSWGCAVRFFSQNGQF+EAFSHYVQMQ LGL PSTF VSSTLRAC RIMCKFGGS
Sbjct: 67  QNPDAFSWGCAVRFFSQNGQFVEAFSHYVQMQTLGLHPSTFGVSSTLRACARIMCKFGGS 126

Query: 149 YVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIG 208
           +VH                              +QKVFDD+ EKNVVSWNSILSG+VKIG
Sbjct: 127 FVHAQVYKFGFCRCVYVQTALVDFYSKLGDMSFAQKVFDDMTEKNVVSWNSILSGHVKIG 186

Query: 209 NLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVN 268
           NLVDAQKVFDEMPEKDVISWNSMLSGFANSGN+D+ASCLFQQMREKSSASWNAMISGY+N
Sbjct: 187 NLVDAQKVFDEMPEKDVISWNSMLSGFANSGNMDEASCLFQQMREKSSASWNAMISGYMN 246

Query: 269 CGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL------------------------- 328
            GDIKSARNLFD MPQRNNVSWITLIAGYSKLG++                         
Sbjct: 247 YGDIKSARNLFDVMPQRNNVSWITLIAGYSKLGDVGSARELFDKMGEKELLSFNAMIACY 306

Query: 329 ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIEL 347
                                      +  AS  S    LGNLNYG+W+ESYMEKLGI+L
Sbjct: 307 SQNSLPNEALELFDQMLQPHENIQPDEMTFASIISACSQLGNLNYGSWVESYMEKLGIQL 366

BLAST of Sgr024324 vs. NCBI nr
Match: KAG6579446.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7016919.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 506.1 bits (1302), Expect = 2.3e-139
Identity = 270/402 (67.16%), Postives = 291/402 (72.39%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           KI LN  VNVKQATQIHAHILVNG R+LES LVRQITRSEFTCARIVSRYL++IL +S+N
Sbjct: 7   KIFLNKPVNVKQATQIHAHILVNGLRNLESCLVRQITRSEFTCARIVSRYLQRILRHSKN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P  FSWGCAVRFFSQNGQFMEA SHYVQMQRLGL PSTFAVSSTLRACGRI+CKF GS V
Sbjct: 67  PDYFSWGCAVRFFSQNGQFMEAISHYVQMQRLGLHPSTFAVSSTLRACGRIICKFSGSSV 126

Query: 149 HSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H+Q                              KVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDMTEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
            DAQKVFDEMP KDVISWNSML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCG
Sbjct: 187 DDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           D+KSARN+FD MP RNNVSWITLIAGYSKLGE+                           
Sbjct: 247 DMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNAMIACYSQ 306

Query: 329 -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDE 347
                                    +  AS  S    LGNLNYG WIESYMEKLGIELD+
Sbjct: 307 NGMPNEALKLFNQMLQPHVNIQPDEMTFASIISACTQLGNLNYGAWIESYMEKLGIELDD 366

BLAST of Sgr024324 vs. NCBI nr
Match: XP_022922292.1 (pentatricopeptide repeat-containing protein At4g22760 [Cucurbita moschata])

HSP 1 Score: 503.8 bits (1296), Expect = 1.2e-138
Identity = 268/402 (66.67%), Postives = 290/402 (72.14%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           KI LN  VNVKQA QIHAHILVNG R+LES LVRQITRSEFTCARIVSRYL++IL +SQN
Sbjct: 7   KIFLNKPVNVKQAAQIHAHILVNGLRNLESCLVRQITRSEFTCARIVSRYLQRILRHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P +FSWGCAVRFFS+NGQFMEA SHYVQMQRLGL PSTFAVSSTLRACGRI+CKF GS V
Sbjct: 67  PDSFSWGCAVRFFSRNGQFMEAISHYVQMQRLGLHPSTFAVSSTLRACGRIICKFSGSSV 126

Query: 149 HSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H+Q                              KVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDITEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
            DAQKVFDEMP KDVISWNSML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCG
Sbjct: 187 DDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           D+KSARN+FD MP RNNVSWITLIAGYSKLGE+                           
Sbjct: 247 DMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNAMIACYSQ 306

Query: 329 -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDE 347
                                    +  AS  S    LGNLNYG WIESYMEKLGIELD+
Sbjct: 307 NGMPNEALKLFNQMLQPHVNIQPDEMTFASIISACTQLGNLNYGAWIESYMEKLGIELDD 366

BLAST of Sgr024324 vs. ExPASy Swiss-Prot
Match: P0C8Q5 (Pentatricopeptide repeat-containing protein At4g22760 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E6 PE=2 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 1.0e-76
Identity = 155/404 (38.37%), Postives = 221/404 (54.70%), Query Frame = 0

Query: 27  RAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYS 86
           + +  L   V ++QA Q+HA ++VN + HLE +LV Q        +R +  Y+++IL   
Sbjct: 5   KLRFFLQRCVVLEQAKQVHAQLVVNRYNHLEPILVHQTLHFTKEFSRNIVTYVKRILKGF 64

Query: 87  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGS 146
               +FSWGC VRF SQ+ +F E    Y+ M   G+ PS+ AV+S LRACG++     G 
Sbjct: 65  NGHDSFSWGCLVRFLSQHRKFKETVDVYIDMHNSGIPPSSHAVTSVLRACGKMENMVDGK 124

Query: 147 YVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIG 206
            +H+Q                              K FDD+ EKN VSWNS+L GY++ G
Sbjct: 125 PIHAQALKNGLCGCVYVQTGLVGLYSRLGYIELAKKAFDDIAEKNTVSWNSLLHGYLESG 184

Query: 207 NLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVN 266
            L +A++VFD++PEKD +SWN ++S +A  G++  A  LF  M  KS ASWN +I GYVN
Sbjct: 185 ELDEARRVFDKIPEKDAVSWNLIISSYAKKGDMGNACSLFSAMPLKSPASWNILIGGYVN 244

Query: 267 CGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE-------------------------- 326
           C ++K AR  FDAMPQ+N VSWIT+I+GY+KLG+                          
Sbjct: 245 CREMKLARTYFDAMPQKNGVSWITMISGYTKLGDVQSAEELFRLMSKKDKLVYDAMIACY 304

Query: 327 ---------LVLLASSLTR-------------------WLGNLNYGTWIESYMEKLGIEL 347
                    L L A  L R                    LGN ++GTW+ESY+ + GI++
Sbjct: 305 TQNGKPKDALKLFAQMLERNSYIQPDEITLSSVVSANSQLGNTSFGTWVESYITEHGIKI 364

BLAST of Sgr024324 vs. ExPASy Swiss-Prot
Match: Q9LS72 (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 3.5e-45
Identity = 119/370 (32.16%), Postives = 189/370 (51.08%), Query Frame = 0

Query: 32  LNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVA 91
           L    N+ Q  Q+HA I+       E L +     S  +  R  +  +R + +  Q P  
Sbjct: 26  LPKCANLNQVKQLHAQIIRRNLH--EDLHIAPKLISALSLCRQTNLAVR-VFNQVQEPNV 85

Query: 92  FSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRAC--------------- 151
                 +R  +QN Q  +AF  + +MQR GL    F     L+AC               
Sbjct: 86  HLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNH 145

Query: 152 --------------GRIMC--KFGGSYVH-SQKVFDDLIEKNVVSWNSILSGYVKIGNLV 211
                           I C  + GG  V  + K+F+ + E++ VSWNS+L G VK G L 
Sbjct: 146 IEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELR 205

Query: 212 DAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGD 271
           DA+++FDEMP++D+ISWN+ML G+A    + +A  LF++M E+++ SW+ M+ GY   GD
Sbjct: 206 DARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGD 265

Query: 272 IKSARNLFDAM--PQRNNVSWITLIAGYSKLGEL--------VLLASSL----------- 331
           ++ AR +FD M  P +N V+W  +IAGY++ G L         ++AS L           
Sbjct: 266 MEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVASGLKFDAAAVISIL 325

Query: 332 --TRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLV 347
                 G L+ G  I S +++  +  + Y+  AL+D+YAK GN+++AF++FN + K+DLV
Sbjct: 326 AACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCGNLKKAFDVFNDIPKKDLV 385

BLAST of Sgr024324 vs. ExPASy Swiss-Prot
Match: O49399 (Pentatricopeptide repeat-containing protein At4g18840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E101 PE=3 SV=2)

HSP 1 Score: 164.9 bits (416), Expect = 1.7e-39
Identity = 102/333 (30.63%), Postives = 171/333 (51.35%), Query Frame = 0

Query: 37  NVKQATQIHAHILVNGFRH----LESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAF 96
           ++ +  Q HA +L  G  H       L+    T  E    + VS Y   IL+   +P  F
Sbjct: 51  SLTEIQQAHAFMLKTGLFHDTFSASKLVAFAATNPE---PKTVS-YAHSILNRIGSPNGF 110

Query: 97  SWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQK 156
           +    +R ++ +     A + + +M    + P  ++ +  L+AC        G  +H   
Sbjct: 111 THNSVIRAYANSSTPEVALTVFREMLLGPVFPDKYSFTFVLKACAAFCGFEEGRQIHGLF 170

Query: 157 VFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQA 216
           +   L+  +V   N++++ Y + G    A+KV D MP +D +SWNS+LS +   G +D+A
Sbjct: 171 IKSGLV-TDVFVENTLVNVYGRSGYFEIARKVLDRMPVRDAVSWNSLLSAYLEKGLVDEA 230

Query: 217 SCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLG--- 276
             LF +M E++  SWN MISGY   G +K A+ +FD+MP R+ VSW  ++  Y+ +G   
Sbjct: 231 RALFDEMEERNVESWNFMISGYAAAGLVKEAKEVFDSMPVRDVVSWNAMVTAYAHVGCYN 290

Query: 277 --------------------ELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATA 336
                                LV + S+    LG+L+ G W+  Y++K GIE++ +LATA
Sbjct: 291 EVLEVFNKMLDDSTEKPDGFTLVSVLSACAS-LGSLSQGEWVHVYIDKHGIEIEGFLATA 350

Query: 337 LVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI 343
           LVD+Y+K G I++A E+F    KRD+  ++++I
Sbjct: 351 LVDMYSKCGKIDKALEVFRATSKRDVSTWNSII 377

BLAST of Sgr024324 vs. ExPASy Swiss-Prot
Match: Q9MAT2 (Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H64 PE=2 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 3.5e-37
Identity = 94/334 (28.14%), Postives = 170/334 (50.90%), Query Frame = 0

Query: 44  IHAHILVNGFRHLESLLVRQITRSEFTCARIVS--------RYLRQILHYSQNPVAFSWG 103
           IHA       RH+ + ++R+   S    A++VS         Y   I   S+    F   
Sbjct: 36  IHACKDTASLRHVHAQILRRGVLSSRVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLN 95

Query: 104 CAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQKVFD 163
             +R  ++N +F  +  H++ M RLG++P        L++  ++  ++ G  +H+     
Sbjct: 96  ALIRGLTENARFESSVRHFILMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHA-ATLK 155

Query: 164 DLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEK----DVISWNSMLSGFANSGNLDQ 223
           + ++ +     S++  Y K G L  A +VF+E P++     ++ WN +++G+  + ++  
Sbjct: 156 NFVDCDSFVRLSLVDMYAKTGQLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHM 215

Query: 224 ASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL 283
           A+ LF+ M E++S SW+ +I GYV+ G++  A+ LF+ MP++N VSW TLI G+S+ G+ 
Sbjct: 216 ATTLFRSMPERNSGSWSTLIKGYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDY 275

Query: 284 VLLASSLTRWL---------------------GNLNYGTWIESYMEKLGIELDEYLATAL 343
               S+    L                     G L  G  I  Y+   GI+LD  + TAL
Sbjct: 276 ETAISTYFEMLEKGLKPNEYTIAAVLSACSKSGALGSGIRIHGYILDNGIKLDRAIGTAL 335

Query: 344 VDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG 345
           VD+YAK G ++ A  +F+ +  +D+++++AMI G
Sbjct: 336 VDMYAKCGELDCAATVFSNMNHKDILSWTAMIQG 368

BLAST of Sgr024324 vs. ExPASy Swiss-Prot
Match: Q1PEU4 (Pentatricopeptide repeat-containing protein At2g44880 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E9 PE=2 SV=2)

HSP 1 Score: 156.4 bits (394), Expect = 5.9e-37
Identity = 88/312 (28.21%), Postives = 154/312 (49.36%), Query Frame = 0

Query: 87  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQR-LGLRPSTFAVSSTLRACGRIMCKFGG 146
           Q   +F     ++ + +  Q+ ++F+ Y  +++     P  F  ++  ++C   MC + G
Sbjct: 38  QRDDSFLSNSMIKAYLETRQYPDSFALYRDLRKETCFAPDNFTFTTLTKSCSLSMCVYQG 97

Query: 147 SYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKI 206
             +HSQ                                FD++  ++ VSW +++SGY++ 
Sbjct: 98  LQLHSQIWRFGFCADMYVSTGVVDMYAKFGKMGCARNAFDEMPHRSEVSWTALISGYIRC 157

Query: 207 GNLVDAQKVFDEMPE-KDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGY 266
           G L  A K+FD+MP  KDV+ +N+M+ GF  SG++  A  LF +M  K+  +W  MI GY
Sbjct: 158 GELDLASKLFDQMPHVKDVVIYNAMMDGFVKSGDMTSARRLFDEMTHKTVITWTTMIHGY 217

Query: 267 VNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE----------------------LV 326
            N  DI +AR LFDAMP+RN VSW T+I GY +  +                       +
Sbjct: 218 CNIKDIDAARKLFDAMPERNLVSWNTMIGGYCQNKQPQEGIRLFQEMQATTSLDPDDVTI 277

Query: 327 LLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKK 345
           L         G L+ G W   ++++  ++    + TA++D+Y+K G IE+A  +F+ + +
Sbjct: 278 LSVLPAISDTGALSLGEWCHCFVQRKKLDKKVKVCTAILDMYSKCGEIEKAKRIFDEMPE 337

BLAST of Sgr024324 vs. ExPASy TrEMBL
Match: A0A6J1DT09 (pentatricopeptide repeat-containing protein At4g22760 OS=Momordica charantia OX=3673 GN=LOC111022884 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 1.0e-140
Identity = 272/404 (67.33%), Postives = 299/404 (74.01%), Query Frame = 0

Query: 29  KILLNN--SVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYS 88
           +ILLN   SVNVKQA+QIH+ I+VNG RHLE+LLVRQITRSEFTCARIVS YL++IL +S
Sbjct: 7   QILLNKKCSVNVKQASQIHSRIIVNGLRHLETLLVRQITRSEFTCARIVSCYLQRILRHS 66

Query: 89  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGS 148
           QNP AFSWGCAVRFFSQNGQF+EAFSHYVQMQ LGL PSTF VSSTLRAC RIMCKFGGS
Sbjct: 67  QNPDAFSWGCAVRFFSQNGQFVEAFSHYVQMQTLGLHPSTFGVSSTLRACARIMCKFGGS 126

Query: 149 YVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIG 208
           +VH                              +QKVFDD+ EKNVVSWNSILSG+VKIG
Sbjct: 127 FVHAQVYKFGFCRCVYVQTALVDFYSKLGDMSFAQKVFDDMTEKNVVSWNSILSGHVKIG 186

Query: 209 NLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVN 268
           NLVDAQKVFDEMPEKDVISWNSMLSGFANSGN+D+ASCLFQQMREKSSASWNAMISGY+N
Sbjct: 187 NLVDAQKVFDEMPEKDVISWNSMLSGFANSGNMDEASCLFQQMREKSSASWNAMISGYMN 246

Query: 269 CGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL------------------------- 328
            GDIKSARNLFD MPQRNNVSWITLIAGYSKLG++                         
Sbjct: 247 YGDIKSARNLFDVMPQRNNVSWITLIAGYSKLGDVGSARELFDKMGEKELLSFNAMIACY 306

Query: 329 ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIEL 347
                                      +  AS  S    LGNLNYG+W+ESYMEKLGI+L
Sbjct: 307 SQNSLPNEALELFDQMLQPHENIQPDEMTFASIISACSQLGNLNYGSWVESYMEKLGIQL 366

BLAST of Sgr024324 vs. ExPASy TrEMBL
Match: A0A6J1E2U4 (pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita moschata OX=3662 GN=LOC111430313 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 5.6e-139
Identity = 268/402 (66.67%), Postives = 290/402 (72.14%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           KI LN  VNVKQA QIHAHILVNG R+LES LVRQITRSEFTCARIVSRYL++IL +SQN
Sbjct: 7   KIFLNKPVNVKQAAQIHAHILVNGLRNLESCLVRQITRSEFTCARIVSRYLQRILRHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P +FSWGCAVRFFS+NGQFMEA SHYVQMQRLGL PSTFAVSSTLRACGRI+CKF GS V
Sbjct: 67  PDSFSWGCAVRFFSRNGQFMEAISHYVQMQRLGLHPSTFAVSSTLRACGRIICKFSGSSV 126

Query: 149 HSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H+Q                              KVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDITEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
            DAQKVFDEMP KDVISWNSML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCG
Sbjct: 187 DDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           D+KSARN+FD MP RNNVSWITLIAGYSKLGE+                           
Sbjct: 247 DMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNAMIACYSQ 306

Query: 329 -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDE 347
                                    +  AS  S    LGNLNYG WIESYMEKLGIELD+
Sbjct: 307 NGMPNEALKLFNQMLQPHVNIQPDEMTFASIISACTQLGNLNYGAWIESYMEKLGIELDD 366

BLAST of Sgr024324 vs. ExPASy TrEMBL
Match: A0A5A7THR1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G003540 PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 9.6e-139
Identity = 268/404 (66.34%), Postives = 294/404 (72.77%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           K  LN+SV+VKQATQIHAHILVNG  +LES LVRQITRS+FTCARIVSRYL++ILH+SQN
Sbjct: 7   KFFLNSSVHVKQATQIHAHILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRILHHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P AF+W CAVRFFS+NGQFMEA +HYVQMQRLGL PSTFAVSSTLRACGRIMCKFGG  +
Sbjct: 67  PDAFTWSCAVRFFSKNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGRCI 126

Query: 149 H------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H                              +QKVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKLGFCRCVYVQTSLVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
           VDAQKVFDEMP KDVISWNSML+GF+NSGN+D+A CLFQQMREKSSASWNAMISGYVNCG
Sbjct: 187 VDAQKVFDEMPVKDVISWNSMLTGFSNSGNMDRALCLFQQMREKSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           D+K+ARNLFD MP RNNV+ ITLIAGYSKLGE+                           
Sbjct: 247 DMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSACELFDKMGENEKELFSFNAMIACY 306

Query: 329 ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIEL 347
                                      +  AS  S    LGNL+YGTWIESYMEKLGIEL
Sbjct: 307 SQNGLPNKALELFNQMLQPHLNIQPDEMTFASVISACTQLGNLSYGTWIESYMEKLGIEL 366

BLAST of Sgr024324 vs. ExPASy TrEMBL
Match: A0A1S3ATB6 (pentatricopeptide repeat-containing protein At4g22760 OS=Cucumis melo OX=3656 GN=LOC103482665 PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 9.6e-139
Identity = 268/404 (66.34%), Postives = 294/404 (72.77%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           K  LN+SV+VKQATQIHAHILVNG  +LES LVRQITRS+FTCARIVSRYL++ILH+SQN
Sbjct: 7   KFFLNSSVHVKQATQIHAHILVNGLPNLESCLVRQITRSQFTCARIVSRYLQRILHHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P AF+W CAVRFFS+NGQFMEA +HYVQMQRLGL PSTFAVSSTLRACGRIMCKFGG  +
Sbjct: 67  PDAFTWSCAVRFFSKNGQFMEAIAHYVQMQRLGLHPSTFAVSSTLRACGRIMCKFGGRCI 126

Query: 149 H------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H                              +QKVFDD+ EKNVVSWNSILSGYVKIGNL
Sbjct: 127 HAQVYKLGFCRCVYVQTSLVDFYSKLGDMGFAQKVFDDMTEKNVVSWNSILSGYVKIGNL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
           VDAQKVFDEMP KDVISWNSML+GF+NSGN+D+A CLFQQMREKSSASWNAMISGYVNCG
Sbjct: 187 VDAQKVFDEMPVKDVISWNSMLTGFSNSGNMDRALCLFQQMREKSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL--------------------------- 328
           D+K+ARNLFD MP RNNV+ ITLIAGYSKLGE+                           
Sbjct: 247 DMKAARNLFDVMPNRNNVTRITLIAGYSKLGEVNSACELFDKMGENEKELFSFNAMIACY 306

Query: 329 ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIEL 347
                                      +  AS  S    LGNL+YGTWIESYMEKLGIEL
Sbjct: 307 SQNGLPNKALELFNQMLQPHLNIQPDEMTFASVISACTQLGNLSYGTWIESYMEKLGIEL 366

BLAST of Sgr024324 vs. ExPASy TrEMBL
Match: A0A6J1HXF1 (pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita maxima OX=3661 GN=LOC111468920 PE=4 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 1.5e-136
Identity = 266/402 (66.17%), Postives = 289/402 (71.89%), Query Frame = 0

Query: 29  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQN 88
           KI LN  VNVKQATQIHA ILVNG R+LES LVRQITRSEF+ ARIVSRYL++IL +SQN
Sbjct: 7   KIFLNKPVNVKQATQIHAQILVNGLRNLESCLVRQITRSEFSRARIVSRYLQRILRHSQN 66

Query: 89  PVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYV 148
           P +FSWGCAVRFFSQNGQFME  SHYVQMQRLGL PSTFAVSSTLRACGRI+CKFGGS V
Sbjct: 67  PDSFSWGCAVRFFSQNGQFMETISHYVQMQRLGLHPSTFAVSSTLRACGRIICKFGGSSV 126

Query: 149 HSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNL 208
           H+Q                              KVFDD+ EKNVVSWNSILSGYVKIG L
Sbjct: 127 HAQVYKLGFCRCVYVQTALVDFYSKLGDMGFARKVFDDMTEKNVVSWNSILSGYVKIGIL 186

Query: 209 VDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCG 268
            DAQKVFDEMP KDVISWNSML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCG
Sbjct: 187 DDAQKVFDEMPVKDVISWNSMLTGFANSGNMDRASCLFQQLGERSSASWNAMISGYVNCG 246

Query: 269 DIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------VLLAS--- 328
           D+KSARN+FD MP RNNVSWITLIAGYSKLGE+                    L+A    
Sbjct: 247 DMKSARNMFDEMPNRNNVSWITLIAGYSKLGEVGSACELFNNMGEKEILSYNALIACYSQ 306

Query: 329 --------------------------------SLTRWLGNLNYGTWIESYMEKLGIELDE 347
                                           S    LGNLNYG WIESYMEKLGIELD+
Sbjct: 307 NGMPNEALKLFNQMLQPHVDIQPDEMTFASIISACTQLGNLNYGAWIESYMEKLGIELDD 366

BLAST of Sgr024324 vs. TAIR 10
Match: AT4G22760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 288.5 bits (737), Expect = 7.1e-78
Identity = 155/404 (38.37%), Postives = 221/404 (54.70%), Query Frame = 0

Query: 27  RAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYS 86
           + +  L   V ++QA Q+HA ++VN + HLE +LV Q        +R +  Y+++IL   
Sbjct: 5   KLRFFLQRCVVLEQAKQVHAQLVVNRYNHLEPILVHQTLHFTKEFSRNIVTYVKRILKGF 64

Query: 87  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGS 146
               +FSWGC VRF SQ+ +F E    Y+ M   G+ PS+ AV+S LRACG++     G 
Sbjct: 65  NGHDSFSWGCLVRFLSQHRKFKETVDVYIDMHNSGIPPSSHAVTSVLRACGKMENMVDGK 124

Query: 147 YVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIG 206
            +H+Q                              K FDD+ EKN VSWNS+L GY++ G
Sbjct: 125 PIHAQALKNGLCGCVYVQTGLVGLYSRLGYIELAKKAFDDIAEKNTVSWNSLLHGYLESG 184

Query: 207 NLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVN 266
            L +A++VFD++PEKD +SWN ++S +A  G++  A  LF  M  KS ASWN +I GYVN
Sbjct: 185 ELDEARRVFDKIPEKDAVSWNLIISSYAKKGDMGNACSLFSAMPLKSPASWNILIGGYVN 244

Query: 267 CGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE-------------------------- 326
           C ++K AR  FDAMPQ+N VSWIT+I+GY+KLG+                          
Sbjct: 245 CREMKLARTYFDAMPQKNGVSWITMISGYTKLGDVQSAEELFRLMSKKDKLVYDAMIACY 304

Query: 327 ---------LVLLASSLTR-------------------WLGNLNYGTWIESYMEKLGIEL 347
                    L L A  L R                    LGN ++GTW+ESY+ + GI++
Sbjct: 305 TQNGKPKDALKLFAQMLERNSYIQPDEITLSSVVSANSQLGNTSFGTWVESYITEHGIKI 364

BLAST of Sgr024324 vs. TAIR 10
Match: AT3G29230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 183.7 bits (465), Expect = 2.5e-46
Identity = 119/370 (32.16%), Postives = 189/370 (51.08%), Query Frame = 0

Query: 32  LNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVA 91
           L    N+ Q  Q+HA I+       E L +     S  +  R  +  +R + +  Q P  
Sbjct: 26  LPKCANLNQVKQLHAQIIRRNLH--EDLHIAPKLISALSLCRQTNLAVR-VFNQVQEPNV 85

Query: 92  FSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRAC--------------- 151
                 +R  +QN Q  +AF  + +MQR GL    F     L+AC               
Sbjct: 86  HLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNH 145

Query: 152 --------------GRIMC--KFGGSYVH-SQKVFDDLIEKNVVSWNSILSGYVKIGNLV 211
                           I C  + GG  V  + K+F+ + E++ VSWNS+L G VK G L 
Sbjct: 146 IEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAGELR 205

Query: 212 DAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGD 271
           DA+++FDEMP++D+ISWN+ML G+A    + +A  LF++M E+++ SW+ M+ GY   GD
Sbjct: 206 DARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKMPERNTVSWSTMVMGYSKAGD 265

Query: 272 IKSARNLFDAM--PQRNNVSWITLIAGYSKLGEL--------VLLASSL----------- 331
           ++ AR +FD M  P +N V+W  +IAGY++ G L         ++AS L           
Sbjct: 266 MEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVASGLKFDAAAVISIL 325

Query: 332 --TRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLV 347
                 G L+ G  I S +++  +  + Y+  AL+D+YAK GN+++AF++FN + K+DLV
Sbjct: 326 AACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAKCGNLKKAFDVFNDIPKKDLV 385

BLAST of Sgr024324 vs. TAIR 10
Match: AT1G13410.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 169.5 bits (428), Expect = 4.8e-42
Identity = 84/223 (37.67%), Postives = 130/223 (58.30%), Query Frame = 0

Query: 145 GSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFA 204
           G    + KVF +++EKNVV W S+++GY+   +LV A++ FD  PE+D++ WN+M+SG+ 
Sbjct: 42  GVIASANKVFCEMVEKNVVLWTSMINGYLLNKDLVSARRYFDLSPERDIVLWNTMISGYI 101

Query: 205 NSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAG 264
             GN+ +A  LF QM  +   SWN ++ GY N GD+++   +FD MP+RN  SW  LI G
Sbjct: 102 EMGNMLEARSLFDQMPCRDVMSWNTVLEGYANIGDMEACERVFDDMPERNVFSWNGLIKG 161

Query: 265 YSKLGELVLLASSLTRW----------------------LGNLNYGTWIESYMEKLGI-E 324
           Y++ G +  +  S  R                       LG  ++G W+  Y E LG  +
Sbjct: 162 YAQNGRVSEVLGSFKRMVDEGSVVPNDATMTLVLSACAKLGAFDFGKWVHKYGETLGYNK 221

Query: 325 LDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG 345
           +D  +  AL+D+Y K G IE A E+F G+K+RDL++++ MI G
Sbjct: 222 VDVNVKNALIDMYGKCGAIEIAMEVFKGIKRRDLISWNTMING 264

BLAST of Sgr024324 vs. TAIR 10
Match: AT4G18840.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 164.9 bits (416), Expect = 1.2e-40
Identity = 102/333 (30.63%), Postives = 171/333 (51.35%), Query Frame = 0

Query: 37  NVKQATQIHAHILVNGFRH----LESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAF 96
           ++ +  Q HA +L  G  H       L+    T  E    + VS Y   IL+   +P  F
Sbjct: 51  SLTEIQQAHAFMLKTGLFHDTFSASKLVAFAATNPE---PKTVS-YAHSILNRIGSPNGF 110

Query: 97  SWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQK 156
           +    +R ++ +     A + + +M    + P  ++ +  L+AC        G  +H   
Sbjct: 111 THNSVIRAYANSSTPEVALTVFREMLLGPVFPDKYSFTFVLKACAAFCGFEEGRQIHGLF 170

Query: 157 VFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQA 216
           +   L+  +V   N++++ Y + G    A+KV D MP +D +SWNS+LS +   G +D+A
Sbjct: 171 IKSGLV-TDVFVENTLVNVYGRSGYFEIARKVLDRMPVRDAVSWNSLLSAYLEKGLVDEA 230

Query: 217 SCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLG--- 276
             LF +M E++  SWN MISGY   G +K A+ +FD+MP R+ VSW  ++  Y+ +G   
Sbjct: 231 RALFDEMEERNVESWNFMISGYAAAGLVKEAKEVFDSMPVRDVVSWNAMVTAYAHVGCYN 290

Query: 277 --------------------ELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATA 336
                                LV + S+    LG+L+ G W+  Y++K GIE++ +LATA
Sbjct: 291 EVLEVFNKMLDDSTEKPDGFTLVSVLSACAS-LGSLSQGEWVHVYIDKHGIEIEGFLATA 350

Query: 337 LVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI 343
           LVD+Y+K G I++A E+F    KRD+  ++++I
Sbjct: 351 LVDMYSKCGKIDKALEVFRATSKRDVSTWNSII 377

BLAST of Sgr024324 vs. TAIR 10
Match: AT1G04840.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 157.1 bits (396), Expect = 2.5e-38
Identity = 94/334 (28.14%), Postives = 170/334 (50.90%), Query Frame = 0

Query: 44  IHAHILVNGFRHLESLLVRQITRSEFTCARIVS--------RYLRQILHYSQNPVAFSWG 103
           IHA       RH+ + ++R+   S    A++VS         Y   I   S+    F   
Sbjct: 36  IHACKDTASLRHVHAQILRRGVLSSRVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLN 95

Query: 104 CAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQKVFD 163
             +R  ++N +F  +  H++ M RLG++P        L++  ++  ++ G  +H+     
Sbjct: 96  ALIRGLTENARFESSVRHFILMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHA-ATLK 155

Query: 164 DLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEK----DVISWNSMLSGFANSGNLDQ 223
           + ++ +     S++  Y K G L  A +VF+E P++     ++ WN +++G+  + ++  
Sbjct: 156 NFVDCDSFVRLSLVDMYAKTGQLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHM 215

Query: 224 ASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL 283
           A+ LF+ M E++S SW+ +I GYV+ G++  A+ LF+ MP++N VSW TLI G+S+ G+ 
Sbjct: 216 ATTLFRSMPERNSGSWSTLIKGYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDY 275

Query: 284 VLLASSLTRWL---------------------GNLNYGTWIESYMEKLGIELDEYLATAL 343
               S+    L                     G L  G  I  Y+   GI+LD  + TAL
Sbjct: 276 ETAISTYFEMLEKGLKPNEYTIAAVLSACSKSGALGSGIRIHGYILDNGIKLDRAIGTAL 335

Query: 344 VDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG 345
           VD+YAK G ++ A  +F+ +  +D+++++AMI G
Sbjct: 336 VDMYAKCGELDCAATVFSNMNHKDILSWTAMIQG 368

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038874892.11.6e-14368.91pentatricopeptide repeat-containing protein At4g22760-like [Benincasa hispida][more]
XP_038906935.11.6e-14368.91pentatricopeptide repeat-containing protein At4g22760 [Benincasa hispida][more]
XP_022155871.12.1e-14067.33pentatricopeptide repeat-containing protein At4g22760 [Momordica charantia][more]
KAG6579446.12.3e-13967.16Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022922292.11.2e-13866.67pentatricopeptide repeat-containing protein At4g22760 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
P0C8Q51.0e-7638.37Pentatricopeptide repeat-containing protein At4g22760 OS=Arabidopsis thaliana OX... [more]
Q9LS723.5e-4532.16Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana OX... [more]
O493991.7e-3930.63Pentatricopeptide repeat-containing protein At4g18840 OS=Arabidopsis thaliana OX... [more]
Q9MAT23.5e-3728.14Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana OX... [more]
Q1PEU45.9e-3728.21Pentatricopeptide repeat-containing protein At2g44880 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1DT091.0e-14067.33pentatricopeptide repeat-containing protein At4g22760 OS=Momordica charantia OX=... [more]
A0A6J1E2U45.6e-13966.67pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita moschata OX=3... [more]
A0A5A7THR19.6e-13966.34Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3ATB69.6e-13966.34pentatricopeptide repeat-containing protein At4g22760 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1HXF11.5e-13666.17pentatricopeptide repeat-containing protein At4g22760 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT4G22760.17.1e-7838.37Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G29230.12.5e-4632.16Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G13410.14.8e-4237.67Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G18840.11.2e-4030.63Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G04840.12.5e-3828.14Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 194..222
e-value: 1.3E-6
score: 28.3
coord: 226..254
e-value: 1.4E-6
score: 28.1
coord: 93..122
e-value: 0.12
score: 12.6
coord: 163..192
e-value: 1.5E-7
score: 31.2
coord: 308..334
e-value: 6.6E-4
score: 19.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 194..222
e-value: 1.6E-6
score: 25.9
coord: 308..336
e-value: 5.4E-4
score: 18.0
coord: 226..255
e-value: 1.0E-5
score: 23.3
coord: 163..193
e-value: 8.8E-6
score: 23.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 303..337
score: 9.361008
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 90..124
score: 8.692369
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 223..257
score: 10.413293
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 161..195
score: 11.619036
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 142..223
e-value: 9.3E-21
score: 76.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 24..141
e-value: 3.1E-6
score: 28.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 224..273
e-value: 2.3E-9
score: 39.1
coord: 274..346
e-value: 6.3E-10
score: 41.0
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 221..271
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 78..152
NoneNo IPR availablePANTHERPTHR24015:SF1771OS08G0162200 PROTEINcoord: 156..221
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 156..221
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 281..346
NoneNo IPR availablePANTHERPTHR24015:SF1771OS08G0162200 PROTEINcoord: 78..152
NoneNo IPR availablePANTHERPTHR24015:SF1771OS08G0162200 PROTEINcoord: 221..271
coord: 281..346

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr024324.1Sgr024324.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding