Sgr015661 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr015661
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: protein THYLAKOID ASSEMBLY 8, chloroplastic
Location: tig00004836: 778969 .. 779772 (-)
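
The Location field can be split into contig, start, end and strand to derive the feature length. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state. Under that assumption the span is 804 bp, i.e. 268 codons, which is consistent with the 267-residue protein plus a stop codon.

```python
import re

def parse_location(location):
    """Split a 'contig: start .. end (strand)' string into its parts.

    Hypothetical helper; the field layout is taken from this record only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00004836: 778969 .. 779772 (-)")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
length = end - start + 1
print(contig, strand, length)  # tig00004836 - 804
```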
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTAATTTCCCAACGCCCGGCGAGGGGTATAATAGTCATTGCGAAATCGCCCACCCAATCCACTACCACGGTCTGCGCTCCGCCATGGCTTTTTCTTCCGCTCTTCGCCCCAATCCTTCCAGTACCTTCCTCAAATCTCAAATCCCAATTCCGAGGCCCGTGCCCCCCGCCGCCGCCGCCGCCGTCGCCGTCTTTTTCTCCGTTCCAGTACGCTGCGGCCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCATCGAGGCAATCCAAGCCATTCAATCACTGAAACGGGCACAAAGATCCGACCCGGCAAGGCTCCAACACGTCCTCTCCAATACCCTCTCGCGATTGCTCAAAGCAGACCTCGTCGCGACGTTGAAGGAGCTCCTCCGGCAGGACCAGTGCGCCCTCGCCTTGGAGGTTTTCGCCGTCGTCCGATCGGAGTTCGGAGCCGACCTGGGGTTGTACGCGGAGCTGGCTGCGGCACTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGGTGGGTGATTTGGATGGAGAGGGGAAGATCCAGATCCAGTGTGACGATAAGGGTTTGATTAAGTTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCAACGGTCAGGATTTATAGGCTGATGAGGAGGAGCGGTTGGGGGTCCACCATCAAAGCTGATGATTACATGGTTAAGGTTTTGAGCGAGGGTTTAAGGAGACTTGGAGAAATGGACTTGGCTGATGAGATCAATAGGGAATTTCAAAATTTAGTGGGCACTTATTGA

mRNA sequence

ATGTTTAATTTCCCAACGCCCGGCGAGGGGTATAATAGTCATTGCGAAATCGCCCACCCAATCCACTACCACGGTCTGCGCTCCGCCATGGCTTTTTCTTCCGCTCTTCGCCCCAATCCTTCCAGTACCTTCCTCAAATCTCAAATCCCAATTCCGAGGCCCGTGCCCCCCGCCGCCGCCGCCGCCGTCGCCGTCTTTTTCTCCGTTCCAGTACGCTGCGGCCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCATCGAGGCAATCCAAGCCATTCAATCACTGAAACGGGCACAAAGATCCGACCCGGCAAGGCTCCAACACGTCCTCTCCAATACCCTCTCGCGATTGCTCAAAGCAGACCTCGTCGCGACGTTGAAGGAGCTCCTCCGGCAGGACCAGTGCGCCCTCGCCTTGGAGGTTTTCGCCGTCGTCCGATCGGAGTTCGGAGCCGACCTGGGGTTGTACGCGGAGCTGGCTGCGGCACTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGGTGGGTGATTTGGATGGAGAGGGGAAGATCCAGATCCAGTGTGACGATAAGGGTTTGATTAAGTTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCAACGGTCAGGATTTATAGGCTGATGAGGAGGAGCGGTTGGGGGTCCACCATCAAAGCTGATGATTACATGGTTAAGGTTTTGAGCGAGGGTTTAAGGAGACTTGGAGAAATGGACTTGGCTGATGAGATCAATAGGGAATTTCAAAATTTAGTGGGCACTTATTGA

Coding sequence (CDS)

ATGTTTAATTTCCCAACGCCCGGCGAGGGGTATAATAGTCATTGCGAAATCGCCCACCCAATCCACTACCACGGTCTGCGCTCCGCCATGGCTTTTTCTTCCGCTCTTCGCCCCAATCCTTCCAGTACCTTCCTCAAATCTCAAATCCCAATTCCGAGGCCCGTGCCCCCCGCCGCCGCCGCCGCCGTCGCCGTCTTTTTCTCCGTTCCAGTACGCTGCGGCCCACGCGACAACCGAGGACCTCTGGTGAAAGGCAGAACCCTAAGCATCGAGGCAATCCAAGCCATTCAATCACTGAAACGGGCACAAAGATCCGACCCGGCAAGGCTCCAACACGTCCTCTCCAATACCCTCTCGCGATTGCTCAAAGCAGACCTCGTCGCGACGTTGAAGGAGCTCCTCCGGCAGGACCAGTGCGCCCTCGCCTTGGAGGTTTTCGCCGTCGTCCGATCGGAGTTCGGAGCCGACCTGGGGTTGTACGCGGAGCTGGCTGCGGCACTGTCGAGGAACGGGATGGCGGAGGAAATCGACCGGCTGGTGGGTGATTTGGATGGAGAGGGGAAGATCCAGATCCAGTGTGACGATAAGGGTTTGATTAAGTTGATCAGGGCGGTGATTGGTGGGGACAGAAGGGAATCAACGGTCAGGATTTATAGGCTGATGAGGAGGAGCGGTTGGGGGTCCACCATCAAAGCTGATGATTACATGGTTAAGGTTTTGAGCGAGGGTTTAAGGAGACTTGGAGAAATGGACTTGGCTGATGAGATCAATAGGGAATTTCAAAATTTAGTGGGCACTTATTGA

Protein sequence

MFNFPTPGEGYNSHCEIAHPIHYHGLRSAMAFSSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY
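
The relationship between the coding sequence and the protein sequence can be spot-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the pipeline used to build this record: the function name `translate` is hypothetical, the standard nuclear code is assumed, and only the first 30 and last 12 nucleotides of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGTTTAATTTCCCAACGCCCGGCGAGGGG"))  # MFNFPTPGEG
print(translate("GGCACTTATTGA"))                    # GTY*
```

Both fragments agree with the start (MFNFPTPGEG) and end (GTY) of the protein sequence; passing the full CDS to the same function would extend the check to the whole sequence.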
Homology
BLAST of Sgr015661 vs. NCBI nr
Match: XP_022150715.1 (uncharacterized protein LOC111018772 [Momordica charantia])

HSP 1 Score: 366.7 bits (940), Expect = 1.7e-97
Identity = 195/235 (82.98%), Positives = 214/235 (91.06%), Query Frame = 0

Query: 33  SSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEA 92
           +S+LRPNP S FLKS+IP PR    +AAA  AV FSVPVRCGPRDNRGPLVKGRTLS EA
Sbjct: 2   ASSLRPNP-SPFLKSEIPTPR----SAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEA 61

Query: 93  IQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE 152
           IQAIQSLKRAQRSDP +L HVLS+TLSRLLKADLVATLKELLRQ+QC LALEVF VVRSE
Sbjct: 62  IQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE 121

Query: 153 FGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRE 212
           +GADLGLYAELAAALSRNGMAEEIDRL+ +L+GEG  +I+CDDKGLIKLIRAVIGGDRRE
Sbjct: 122 YGADLGLYAELAAALSRNGMAEEIDRLLCELEGEG--EIECDDKGLIKLIRAVIGGDRRE 181

Query: 213 STVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           STVRIYR+MRRSGWGST KADD+ VK+LS+GLRRLGE++LADEINREFQNLV TY
Sbjct: 182 STVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNLVATY 229
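
The percentages on each HSP statistics line are the match counts divided by the alignment length (the number after the slash, which counts gap columns as well). As a rough sketch, they can be recomputed from the raw counts; the function name `hsp_percentages` and the regular expression are assumptions based only on the line layout shown in this record, not on any official BLAST parser.

```python
import re

# Matches the counts on an HSP statistics line as printed in this record.
# "Posi?tives" also accepts the misspelling "Postives" in case the input carries it.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(stats_line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {stats_line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 195/235 (82.98%), Positives = 214/235 (91.06%), Query Frame = 0"
print(hsp_percentages(line))  # (82.98, 91.06)
```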

BLAST of Sgr015661 vs. NCBI nr
Match: XP_038898717.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida])

HSP 1 Score: 345.9 bits (886), Expect = 3.1e-91
Identity = 187/224 (83.48%), Positives = 207/224 (92.41%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQI I  P+P +AAAAVAV  S+PVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR
Sbjct: 7   STFLKSQISI--PIPVSAAAAVAV--SLPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A+RSDP +LQ VLS TLSRLLKADLVA+LKELLRQD+CALALEVFAV+RSE+GADLG+YA
Sbjct: 67  AERSDPTKLQQVLSTTLSRLLKADLVASLKELLRQDRCALALEVFAVIRSEYGADLGMYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLM 221
           E+AAALSRNG AEEIDRLV DLDG G + IQ DDKGLIKLI+AVI GDRRESTVRIYR+M
Sbjct: 127 EVAAALSRNGAAEEIDRLVCDLDG-GDVLIQWDDKGLIKLIKAVISGDRRESTVRIYRMM 186

Query: 222 RRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVG 266
           RR+GWGSTIKADDY+V+VLS+GLRR GEM+LADEINREFQ+LVG
Sbjct: 187 RRNGWGSTIKADDYLVRVLSKGLRRFGEMELADEINREFQDLVG 225

BLAST of Sgr015661 vs. NCBI nr
Match: XP_022954347.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata])

HSP 1 Score: 344.4 bits (882), Expect = 9.1e-91
Identity = 179/226 (79.20%), Positives = 205/226 (90.71%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQIPIP P+    +AA  V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKR
Sbjct: 7   STFLKSQIPIPIPI----SAAATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CALALEVFAVVRSE+GADLG+YA
Sbjct: 67  AEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGADLGMYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLM 221
           E+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M
Sbjct: 127 EVAAALSRNGAMEEIDRLVCDLEDEDRV-IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM 186

Query: 222 RRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           +RSGWGSTIKADDY V+VLS+GLRRLGEM++ADE+N +FQ+LVG++
Sbjct: 187 KRSGWGSTIKADDYTVRVLSKGLRRLGEMEMADEVNMQFQDLVGSF 227

BLAST of Sgr015661 vs. NCBI nr
Match: XP_022992386.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima])

HSP 1 Score: 344.0 bits (881), Expect = 1.2e-90
Identity = 179/226 (79.20%), Positives = 204/226 (90.27%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQIPIP P+    +A   V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKR
Sbjct: 7   STFLKSQIPIPIPI----SATATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CALALEVFAVVRSE+G DLG+YA
Sbjct: 67  AEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGVDLGMYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLM 221
           E+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M
Sbjct: 127 EVAAALSRNGAMEEIDRLVCDLEDEDRV-IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM 186

Query: 222 RRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           +RSGWGSTIKADDYMV+VLS+GLRRLGEM++ADEIN +FQ+LVG++
Sbjct: 187 KRSGWGSTIKADDYMVRVLSKGLRRLGEMEMADEINMQFQDLVGSF 227

BLAST of Sgr015661 vs. NCBI nr
Match: XP_023522652.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023548438.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023548440.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 343.2 bits (879), Expect = 2.0e-90
Identity = 179/226 (79.20%), Positives = 204/226 (90.27%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQIPIP P+    +A   V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKR
Sbjct: 7   STFLKSQIPIPIPI----SATATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CALALEVFAVVRSE+GADLG+YA
Sbjct: 67  AEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGADLGMYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLM 221
           E+AAALSRNG AEEIDRLV DL+ E +  IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M
Sbjct: 127 EVAAALSRNGAAEEIDRLVCDLEDEDR-AIQCDDKGLIKLIKAVIGGDRRESTVRIYRMM 186

Query: 222 RRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           +RSGWGSTIKADDY V+VLS+GLRRLGEM++ADE+N +FQ+LVG++
Sbjct: 187 KRSGWGSTIKADDYTVRVLSKGLRRLGEMEMADEVNMQFQDLVGSF 227

BLAST of Sgr015661 vs. ExPASy Swiss-Prot
Match: Q9LVW6 (Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8 PE=2 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 8.9e-49
Identity = 107/191 (56.02%), Positives = 143/191 (74.87%), Query Frame = 0

Query: 69  VPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVA 128
           V +RCGPRDNRGPL+KGR LS EAIQ+IQSLKRA R+  +    +    L RL+K+DL++
Sbjct: 29  VSIRCGPRDNRGPLLKGRILSTEAIQSIQSLKRAHRTGVS--LSLTLRPLRRLIKSDLIS 88

Query: 129 TLKELLRQDQCALALEVFAVVRSEF-GADLGLYAELAAALSRNGMAEEIDRLVGDLDGEG 188
            L+ELLRQD C LA+ V + +R+E+   DL LYA++  AL+RN   +EIDRL+G++DG  
Sbjct: 89  VLRELLRQDYCTLAVHVLSTLRTEYPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDG-- 148

Query: 189 KIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGS-TIKADDYMVKVLSEGLRR 248
            I  + DDK L KLIRAV+G +RRES VR+Y LMR SGWGS + +AD+Y+ +VLS+GL R
Sbjct: 149 -IDQRSDDKALAKLIRAVVGAERRESVVRVYTLMRESGWGSESWEADEYVAEVLSKGLLR 208

Query: 249 LGEMDLADEIN 258
           LGE DLA +++
Sbjct: 209 LGEPDLASQVS 214

BLAST of Sgr015661 vs. ExPASy Swiss-Prot
Match: Q1PFH7 (Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX=3702 GN=At1g62350 PE=2 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 5.8e-08
Identity = 44/139 (31.65%), Positives = 73/139 (52.52%), Query Frame = 0

Query: 88  LSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFA 147
           +S E + A + LKR Q +   RL   + + +SRLLK+DLV+ L E  RQ+Q  L ++++ 
Sbjct: 1   MSKEGLIAAKELKRLQ-TQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYE 60

Query: 148 VVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAV 207
           VVR E  +  D+  Y ++   L+RN   +E  ++  DL  E   ++  D      L+R  
Sbjct: 61  VVRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKE---EVLFDQHTFGDLVRGF 120

Query: 208 IGGDRRESTVRIYRLMRRS 225
           +  +     +R+Y  MR S
Sbjct: 121 LDNELPLEAMRLYGEMRES 135

BLAST of Sgr015661 vs. ExPASy Swiss-Prot
Match: Q9STF9 (Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8L PE=1 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 1.9e-06
Identity = 50/188 (26.60%), Positives = 95/188 (50.53%), Query Frame = 0

Query: 79  RGPLVKGRTL-SIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQD 138
           RGPL +G+ L   EA+  I  LKR  + D  +L   +   + RLLK D++A + EL RQ+
Sbjct: 63  RGPLWRGKKLIGKEALFVILGLKRL-KEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQE 122

Query: 139 QCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDD 198
           + ALA+++F V++ +  +  D+ +Y +L  +L+++   +E   L   +  E       D 
Sbjct: 123 ETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFP---DS 182

Query: 199 KGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADE 258
           +   ++IR  +        + +Y  M +    S    ++   +VL +GL  L    L ++
Sbjct: 183 QTYTEVIRGFLRDGCPADAMNVYEDMLK----SPDPPEELPFRVLLKGL--LPHPLLRNK 240

Query: 259 INREFQNL 264
           + ++F+ L
Sbjct: 243 VKKDFEEL 240

BLAST of Sgr015661 vs. ExPASy TrEMBL
Match: A0A6J1D998 (uncharacterized protein LOC111018772 OS=Momordica charantia OX=3673 GN=LOC111018772 PE=4 SV=1)

HSP 1 Score: 366.7 bits (940), Expect = 8.3e-98
Identity = 195/235 (82.98%), Positives = 214/235 (91.06%), Query Frame = 0

Query: 33  SSALRPNPSSTFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEA 92
           +S+LRPNP S FLKS+IP PR    +AAA  AV FSVPVRCGPRDNRGPLVKGRTLS EA
Sbjct: 2   ASSLRPNP-SPFLKSEIPTPR----SAAAVAAVVFSVPVRCGPRDNRGPLVKGRTLSTEA 61

Query: 93  IQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSE 152
           IQAIQSLKRAQRSDP +L HVLS+TLSRLLKADLVATLKELLRQ+QC LALEVF VVRSE
Sbjct: 62  IQAIQSLKRAQRSDPTKLHHVLSSTLSRLLKADLVATLKELLRQEQCGLALEVFTVVRSE 121

Query: 153 FGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRE 212
           +GADLGLYAELAAALSRNGMAEEIDRL+ +L+GEG  +I+CDDKGLIKLIRAVIGGDRRE
Sbjct: 122 YGADLGLYAELAAALSRNGMAEEIDRLLCELEGEG--EIECDDKGLIKLIRAVIGGDRRE 181

Query: 213 STVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           STVRIYR+MRRSGWGST KADD+ VK+LS+GLRRLGE++LADEINREFQNLV TY
Sbjct: 182 STVRIYRMMRRSGWGSTTKADDHTVKILSKGLRRLGEVELADEINREFQNLVATY 229

BLAST of Sgr015661 vs. ExPASy TrEMBL
Match: A0A6J1GQS0 (protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111456620 PE=4 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 4.4e-91
Identity = 179/226 (79.20%), Positives = 205/226 (90.71%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQIPIP P+    +AA  V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKR
Sbjct: 7   STFLKSQIPIPIPI----SAAATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CALALEVFAVVRSE+GADLG+YA
Sbjct: 67  AEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGADLGMYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLM 221
           E+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M
Sbjct: 127 EVAAALSRNGAMEEIDRLVCDLEDEDRV-IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM 186

Query: 222 RRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           +RSGWGSTIKADDY V+VLS+GLRRLGEM++ADE+N +FQ+LVG++
Sbjct: 187 KRSGWGSTIKADDYTVRVLSKGLRRLGEMEMADEVNMQFQDLVGSF 227

BLAST of Sgr015661 vs. ExPASy TrEMBL
Match: A0A6J1JTE8 (protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111488709 PE=4 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 5.7e-91
Identity = 179/226 (79.20%), Positives = 204/226 (90.27%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQIPIP P+    +A   V  S+PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKR
Sbjct: 7   STFLKSQIPIPIPI----SATATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A++SDP +L+ VLS TLSRLLKADLVATLKELLRQD+CALALEVFAVVRSE+G DLG+YA
Sbjct: 67  AEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGVDLGMYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLM 221
           E+AAALSRNG  EEIDRLV DL+ E ++ IQCDDKGLIKLI+AVIGGDRRESTVRIYR+M
Sbjct: 127 EVAAALSRNGAMEEIDRLVCDLEDEDRV-IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM 186

Query: 222 RRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGTY 268
           +RSGWGSTIKADDYMV+VLS+GLRRLGEM++ADEIN +FQ+LVG++
Sbjct: 187 KRSGWGSTIKADDYMVRVLSKGLRRLGEMEMADEINMQFQDLVGSF 227

BLAST of Sgr015661 vs. ExPASy TrEMBL
Match: A0A1S3CGD3 (uncharacterized protein LOC103500597 OS=Cucumis melo OX=3656 GN=LOC103500597 PE=4 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 8.3e-90
Identity = 194/268 (72.39%), Positives = 220/268 (82.09%), Query Frame = 0

Query: 1   MFNF-PTPGEGYNSHCEIAHPIHYHGLRSAMAFSSALRPNPSSTFLKSQIPIPRPVPPAA 60
           +F+F P   +GY SHCE   P     L +AMA S        STFLKSQI IP P    A
Sbjct: 2   IFDFPPNLSKGYYSHCEKKKP----NLHTAMASSL------HSTFLKSQISIPIPT-STA 61

Query: 61  AAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLS 120
            AAVAV F   VRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRA+RSDP +LQ VLS TLS
Sbjct: 62  TAAVAVSFR--VRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAERSDPTKLQQVLSTTLS 121

Query: 121 RLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYAELAAALSRNGMAEEIDRL 180
           RLLKADLVATLKELLRQ++CALALEVFAV+RSE+ A+LGLYAE+AAALSRNG AEEIDRL
Sbjct: 122 RLLKADLVATLKELLRQERCALALEVFAVIRSEYRAELGLYAEVAAALSRNGAAEEIDRL 181

Query: 181 VGDLDG-EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVK 240
           V DLDG +G I+   DDKGLIKLI+AVI G+RRESTVRIYR+MRR+GWGS IK DDYM+K
Sbjct: 182 VCDLDGRDGVIEWGDDDKGLIKLIKAVISGNRRESTVRIYRMMRRNGWGSMIKGDDYMIK 241

Query: 241 VLSEGLRRLGEMDLADEINREFQNLVGT 267
           V+S+GLRR+GE++LADEINREFQ+LVG+
Sbjct: 242 VMSKGLRRVGEIELADEINREFQDLVGS 256

BLAST of Sgr015661 vs. ExPASy TrEMBL
Match: A0A5A7V0E2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G002110 PE=4 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 3.3e-86
Identity = 179/226 (79.20%), Positives = 201/226 (88.94%), Query Frame = 0

Query: 42  STFLKSQIPIPRPVPPAAAAAVAVFFSVPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 101
           STFLKSQI IP P    A AAVAV F   VRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR
Sbjct: 7   STFLKSQISIPIPT-STATAAVAVSFR--VRCGPRDNRGPLVKGRTLSIEAIQAIQSLKR 66

Query: 102 AQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFAVVRSEFGADLGLYA 161
           A+RSDP +LQ VLS TLSRLLKADLVATLKELLRQ++CALALEVFAV+RSE+ A+LGLYA
Sbjct: 67  AERSDPTKLQQVLSTTLSRLLKADLVATLKELLRQERCALALEVFAVIRSEYRAELGLYA 126

Query: 162 ELAAALSRNGMAEEIDRLVGDLDG-EGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRL 221
           E+AAALSRNG AEEIDRLV DLDG +G I+   DDKGLIKLI+AVI G+RRESTVRIYR+
Sbjct: 127 EVAAALSRNGAAEEIDRLVCDLDGRDGVIEWGDDDKGLIKLIKAVISGNRRESTVRIYRM 186

Query: 222 MRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADEINREFQNLVGT 267
           MRR+GWGS IK DDYM+KV+S+GLRR+GE++LADEINREFQ+LVG+
Sbjct: 187 MRRNGWGSMIKGDDYMIKVMSKGLRRVGELELADEINREFQDLVGS 229

BLAST of Sgr015661 vs. TAIR 10
Match: AT3G27750.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: Vacuolar sorting protein 9 (VPS9) domain (TAIR:AT5G09320.1); Has 106 Blast hits to 106 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 4; Plants - 102; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 195.3 bits (495), Expect = 6.3e-50
Identity = 107/191 (56.02%), Positives = 143/191 (74.87%), Query Frame = 0

Query: 69  VPVRCGPRDNRGPLVKGRTLSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVA 128
           V +RCGPRDNRGPL+KGR LS EAIQ+IQSLKRA R+  +    +    L RL+K+DL++
Sbjct: 29  VSIRCGPRDNRGPLLKGRILSTEAIQSIQSLKRAHRTGVS--LSLTLRPLRRLIKSDLIS 88

Query: 129 TLKELLRQDQCALALEVFAVVRSEF-GADLGLYAELAAALSRNGMAEEIDRLVGDLDGEG 188
            L+ELLRQD C LA+ V + +R+E+   DL LYA++  AL+RN   +EIDRL+G++DG  
Sbjct: 89  VLRELLRQDYCTLAVHVLSTLRTEYPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDG-- 148

Query: 189 KIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGS-TIKADDYMVKVLSEGLRR 248
            I  + DDK L KLIRAV+G +RRES VR+Y LMR SGWGS + +AD+Y+ +VLS+GL R
Sbjct: 149 -IDQRSDDKALAKLIRAVVGAERRESVVRVYTLMRESGWGSESWEADEYVAEVLSKGLLR 208

Query: 249 LGEMDLADEIN 258
           LGE DLA +++
Sbjct: 209 LGEPDLASQVS 214

BLAST of Sgr015661 vs. TAIR 10
Match: AT5G09320.1 (Vacuolar sorting protein 9 (VPS9) domain )

HSP 1 Score: 87.8 bits (216), Expect = 1.4e-17
Identity = 64/204 (31.37%), Positives = 103/204 (50.49%), Query Frame = 0

Query: 78  NRGPLVKGRTLSIEAIQAIQSLKRAQ--------------RSDPARLQHVLSNTLSRLLK 137
           NR PL +GR LSIEAIQA+Q+LKRA                S  A L  V+ +   RLLK
Sbjct: 495 NRKPLQRGRMLSIEAIQAVQALKRANPLLPPPPVPSTSTTSSSSALLDRVIISKFRRLLK 554

Query: 138 ADLVATLKELLRQDQCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVG 197
            D+VA L+ELLRQ++C+LAL+VF  +R E  +   + +Y ++   ++ N + EE++ L  
Sbjct: 555 FDMVAVLRELLRQNECSLALKVFEEIRKEYWYKPQVRMYTDMITVMADNSLMEEVNYLYS 614

Query: 198 DLDGEGKIQIQCDDKGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLS 257
            +  E  +  + +      L+  ++     +  +  Y  M+  G+    + D    +VL 
Sbjct: 615 AMKSEKGLMAEIE--WFNTLLTILLNHKLFDLVMDCYAFMQSIGY----EPDRASFRVLV 674

Query: 258 EGLRRLGEMDLADEINREFQNLVG 266
            GL   GEM L+  + ++     G
Sbjct: 675 LGLESNGEMGLSAIVRQDAHEYYG 692

BLAST of Sgr015661 vs. TAIR 10
Match: AT1G62350.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 59.7 bits (143), Expect = 4.2e-09
Identity = 44/139 (31.65%), Positives = 73/139 (52.52%), Query Frame = 0

Query: 88  LSIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQDQCALALEVFA 147
           +S E + A + LKR Q +   RL   + + +SRLLK+DLV+ L E  RQ+Q  L ++++ 
Sbjct: 1   MSKEGLIAAKELKRLQ-TQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYE 60

Query: 148 VVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDDKGLIKLIRAV 207
           VVR E  +  D+  Y ++   L+RN   +E  ++  DL  E   ++  D      L+R  
Sbjct: 61  VVRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKE---EVLFDQHTFGDLVRGF 120

Query: 208 IGGDRRESTVRIYRLMRRS 225
           +  +     +R+Y  MR S
Sbjct: 121 LDNELPLEAMRLYGEMRES 135

BLAST of Sgr015661 vs. TAIR 10
Match: AT3G46870.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 54.7 bits (130), Expect = 1.3e-07
Identity = 50/188 (26.60%), Positives = 95/188 (50.53%), Query Frame = 0

Query: 79  RGPLVKGRTL-SIEAIQAIQSLKRAQRSDPARLQHVLSNTLSRLLKADLVATLKELLRQD 138
           RGPL +G+ L   EA+  I  LKR  + D  +L   +   + RLLK D++A + EL RQ+
Sbjct: 63  RGPLWRGKKLIGKEALFVILGLKRL-KEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQE 122

Query: 139 QCALALEVFAVVRSE--FGADLGLYAELAAALSRNGMAEEIDRLVGDLDGEGKIQIQCDD 198
           + ALA+++F V++ +  +  D+ +Y +L  +L+++   +E   L   +  E       D 
Sbjct: 123 ETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKENLFP---DS 182

Query: 199 KGLIKLIRAVIGGDRRESTVRIYRLMRRSGWGSTIKADDYMVKVLSEGLRRLGEMDLADE 258
           +   ++IR  +        + +Y  M +    S    ++   +VL +GL  L    L ++
Sbjct: 183 QTYTEVIRGFLRDGCPADAMNVYEDMLK----SPDPPEELPFRVLLKGL--LPHPLLRNK 240

Query: 259 INREFQNL 264
           + ++F+ L
Sbjct: 243 VKKDFEEL 240

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022150715.1  1.7e-97  82.98         uncharacterized protein LOC111018772 [Momordica charantia]
XP_038898717.1  3.1e-91  83.48         protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida]
XP_022954347.1  9.1e-91  79.20         protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata]
XP_022992386.1  1.2e-90  79.20         protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima]
XP_023522652.1  2.0e-90  79.20         protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] >X...

vs. ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q9LVW6          8.9e-49  56.02         Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=T...
Q1PFH7          5.8e-08  31.65         Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX...
Q9STF9          1.9e-06  26.60         Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702...

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1D998      8.3e-98  82.98         uncharacterized protein LOC111018772 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1GQS0      4.4e-91  79.20         protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1JTE8      5.7e-91  79.20         protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A1S3CGD3      8.3e-90  72.39         uncharacterized protein LOC103500597 OS=Cucumis melo OX=3656 GN=LOC103500597 PE=...
A0A5A7V0E2      3.3e-86  79.20         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT3G27750.1     6.3e-50  56.02         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT5G09320.1     1.4e-17  31.37         Vacuolar sorting protein 9 (VPS9) domain
AT1G62350.1     4.2e-09  31.65         Pentatricopeptide repeat (PPR) superfamily protein
AT3G46870.1     1.3e-07  26.60         Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01

IPR Term: IPR011990
IPR Description: Tetratricopeptide-like helical domain superfamily
Source: GENE3D
Source Term: 1.25.40.10
Source Description: Tetratricopeptide repeat domain
Alignment: coord: 87..264; e-value: 2.7E-30; score: 107.3

IPR Term: IPR044190
IPR Description: Protein THYLAKOID ASSEMBLY 8-like
Source: PANTHER
Source Term: PTHR47594
Source Description: PPR CONTAINING PLANT-LIKE PROTEIN
Alignment: coord: 55..264

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR47594:SF3
Source Description: PROTEIN THYLAKOID ASSEMBLY 8, CHLOROPLASTIC
Alignment: coord: 55..264

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr015661.1   Sgr015661.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
biological_process  GO:0000373      Group II intron splicing
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003723      RNA binding