Sgr014939 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr014939
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Location: tig00002486: 360640 .. 363760 (+)
RNA-Seq Expression: Sgr014939
Synteny: Sgr014939
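
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'contig: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span under the stated assumption
    }

loc = parse_location("tig00002486: 360640 .. 363760 (+)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 3121 bp on the plus strand of contig tig00002486.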
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTACCTCAGGACGGCCATGGACTCCGCCTTCTGGGATTTGAACCTTTCGTCCCCTCAAACCCTCGCCGGCACTGCCAAGGCCGTCCCTGGCGAACCATTCCCACTCGACGGAGCTCGAGCCAGCCGCGCCCTGCGGATTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCGCTCGGAATCATTCCTTCTTACTCTCCCACTGCACACAAGGAGCTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAGGTTGCCTGCCGCCGACTGGTAAGTAGCAATGGCTTCTTATGTGGATTGTTTGGCTGCTGAGAAAATGTCAGGGAAAAAATTACGTGTTCATGTGTGTCGTCGGAAAATACTCTTTGTTTTATGCTTGGTTTAGTTGCTGGAACAATCAACTGCAAGAAAAACATCTTGAAATTGAGAAATGAACTTTAGCTTCATAGAGTCATTTGGTATTAACATAGATAACTAGCTTTATGATCATAAATTCCTAGCTTACAGCAGGTTCTTTCTTTATGCTCTAACATTTCTTTTATCACCTGCTACCGTTTTTCTTTTTTGGGTCTGTCCTTGATAATGTTGACTCCTGGTGTTTTAAGTGCTTCATTTCATTGCAGTTCTCAATTAGTAAAGTCAGACCCTCATCCAGTTCCATATGACACATGCTATAAAATCCCTTGTATATTTAGTGCGTTTTCTTGTTTGAATTTTCTTGACAGCAGAGGTGAATCAAATAGCTGATTGACTCCCCTAATTATCTCAGTTATCAACTCATGCTGAAAATTCCCAAGGTTTAGTATTGGGATTTTCACACAGAATTGAAGTTTGGCTATCCCATTATTTCAAGCAAAATTTTCTGAGGGAAAAAGGTCCAACGTGACCATATTTACCTCTGACATCATTTAGATTGAATTTTACTTCACAAATCTCAGATGTACCATCTAATGGAATCTTTATCCCTATAGGTGGGTTGGATTAGTTGGCCAATTTCGTCCGAAGAAACTGATATCTGCTATAAAAGCTGAACTTTCTGCTGCGGATAGCCTTGAGCTCCCTGTTTTGAAAGATGTTGCTAGACAGTTTCTAGACAAGTCACTCTATTCATATGGATTATGCTCACAGTTTTCTCCTAGTCCCTTTTCATCTGTATTTTTCAGCACAGAAGAGCATGGCGAGAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAAGGTGAGGATTGTAGTCTTCTAAGTTACATTAAATGATAATTATAGGAAAAGAGAAATATGAATGCTGTTCATCAAGTTCTTTAAATAATGATATCTGCCCTTTGTTTGGAAACTGCATGAGTTGCAAGTTCTTTTTTTTTGTATTTTTTTTTTGTAAATTTAAATATTAGTGCTGACTGGATTCTATCATCGATCTCTTTGTAACGCTGTAAGTACCTTATGCCTGAATATGATGAGGTTACGATCCCAAGTGTGGCTCTTGCAATAGAATCTGGCAGGATGAAAGCGCCATTTTGAAACTTCTCTTCACATTGCAAATTTAGAAGGATTGATATTGGGTATATTTGCATTTGTAGGTTAAGCTACACGTCTGGAGTTCTTTAGCTTTCATATCATTCATGCCTCATTACTATTTAGCGTTTGAGAACTTCATTTTAAGGAATATATTGTTCTATAACAGACACGTATTTTGCTCGACTCCAAAAGATTGAATTATATTGCAACTTATATCTGCAGCTTCCTAATCACGACATTAATCTGGAAGCAGCTTGGCCTGAGCTCTTCATTGATCATAAAGGACAGTATTGGGATGTGCCTGAGTCTATATCTTTGGATCTTTCCTCTCTTATGTCTGTCTCTGGTTTGCGATACCGGTTTGGGTTGCATAAGAATGGTGGCCTTCCCCGGGCTCTTAATTCTACAAACGATGACCCACCTCTTGCTCTTCTGCCTGGATTATGTGCAAAGGCTGCATTTTCTTTAGAAAAGAACAGGTACA
TTTGGAGGATAAAAGAAAGGAAGCAAGACATGATGGAGAAGACAGACAAGGGGGAATGGTATTGGAGGCCATCATATGATGTGCGTCTCAAAGAACCTCACTCAGCGATATCTGGAATTGTCGGTCAGTCTGCTGCCTTCTAAATTTGGTTTGCACTCAAATCAAAAGTTGAGCAGCTTTACTCAATTTTGTTGATACACACACACACACACACAAGTGTCCTTCCCATCCCCCTGCCCCGACGTTGCCTTTGGGAAGTTTGATTTTAAAGGTTCATTTTTGCTGCTTGTTTCTTACTAAGTATTTGCATATCAGGTGGCACCTTTAGCACCTGGTTTGGAGGCAGCGACATGGTTGGGTCCAATGGAGATGGAAACTTATCCATTGGCCCTAAGAAAAGAAGTCCATTGAATGCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTTAGAAAACAGTTTGGTGACCTTACGAGGATAGATGCTCGCTTAGATATTTCTTCAGCTTCAGGGTTTGCCAAAAGAGTTTTTAATGGTTTCAAGAAATCTGTTGATGATTTAGAGAGATCAGAATCTTCCCCCAGACTCAATTTGATCTTTCAACAGCAGGTAAATACCTTTTAAGTTTTGTGATACTAATGTAGGTCTCCATTCACATATCATCTAGTCTGTTATGCATGATGGAAGAAATCAGTTCATAAACTAAACTCAAGATGAATTTATCTAATCAACTACACTTCAAAAGACATGAATGATGATAAGAAGAAATGTCGTTTGCAGGTTGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGCTTAAGCTTGATTCTTCATCTGGCAAGCGCAGTCCCCATGTCGAGGACACAATATACAGCCTGAATTATTCTTTTAGGCTTCTTCAATCAGGCAAAGCCGTTTTCTGGTATTCTCCCAAAAGAAAAGAGGGGATGGTCGAGTTGCGCCTGTTTGAATTTTGATAAACCATGGTTTTAGTTCAGTTCATGCATTTGATTCTTATCTTTTTGACAACGAAATCGGCGACTTAGAGTTAGTTATAGCACTTGAGGCCTTTCTTCCCATCTATTTAGTGTTGCACAACTCTTTCAGAT

mRNA sequence

ATGGCGTACCTCAGGACGGCCATGGACTCCGCCTTCTGGGATTTGAACCTTTCGTCCCCTCAAACCCTCGCCGGCACTGCCAAGGCCGTCCCTGGCGAACCATTCCCACTCGACGGAGCTCGAGCCAGCCGCGCCCTGCGGATTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCGCTCGGAATCATTCCTTCTTACTCTCCCACTGCACACAAGGAGCTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAGGTTGCCTGCCGCCGACTGGTGGGTTGGATTAGTTGGCCAATTTCGTCCGAAGAAACTGATATCTGCTATAAAAGCTGAACTTTCTGCTGCGGATAGCCTTGAGCTCCCTGTTTTGAAAGATGTTGCTAGACAGTTTCTAGACAAGTCACTCTATTCATATGGATTATGCTCACAGTTTTCTCCTAGTCCCTTTTCATCTGTATTTTTCAGCACAGAAGAGCATGGCGAGAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAAGCTTCCTAATCACGACATTAATCTGGAAGCAGCTTGGCCTGAGCTCTTCATTGATCATAAAGGACAGTATTGGGATGTGCCTGAGTCTATATCTTTGGATCTTTCCTCTCTTATGTCTGTCTCTGGTTTGCGATACCGGTTTGGGTTGCATAAGAATGGTGGCCTTCCCCGGGCTCTTAATTCTACAAACGATGACCCACCTCTTGCTCTTCTGCCTGGATTATGTGCAAAGGCTGCATTTTCTTTAGAAAAGAACAGGTACATTTGGAGGATAAAAGAAAGGAAGCAAGACATGATGGAGAAGACAGACAAGGGGGAATGGTATTGGAGGCCATCATATGATGTGCGTCTCAAAGAACCTCACTCAGCGATATCTGGAATTGTCGGTGGCACCTTTAGCACCTGGTTTGGAGGCAGCGACATGGTTGGGTCCAATGGAGATGGAAACTTATCCATTGGCCCTAAGAAAAGAAGTCCATTGAATGCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTTAGAAAACAGTTTGGTGACCTTACGAGGATAGATGCTCGCTTAGATATTTCTTCAGCTTCAGGGTTTGCCAAAAGAGTTTTTAATGGTTTCAAGAAATCTGTTGATGATTTAGAGAGATCAGAATCTTCCCCCAGACTCAATTTGATCTTTCAACAGCAGGTTGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGCTTAAGCTTGATTCTTCATCTGGCAAGCGCAGTCCCCATGTCGAGGACACAATATACAGCCTGAATTATTCTTTTAGGCTTCTTCAATCAGGCAAAGCCGTTTTCTGTTCAGTTCATGCATTTGATTCTTATCTTTTTGACAACGAAATCGGCGACTTAGAGTTAGTTATAGCACTTGAGGCCTTTCTTCCCATCTATTTAGTGTTGCACAACTCTTTCAGAT

Coding sequence (CDS)

ATGGCGTACCTCAGGACGGCCATGGACTCCGCCTTCTGGGATTTGAACCTTTCGTCCCCTCAAACCCTCGCCGGCACTGCCAAGGCCGTCCCTGGCGAACCATTCCCACTCGACGGAGCTCGAGCCAGCCGCGCCCTGCGGATTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCGCTCGGAATCATTCCTTCTTACTCTCCCACTGCACACAAGGAGCTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAGGTTGCCTGCCGCCGACTGGTGGGTTGGATTAGTTGGCCAATTTCGTCCGAAGAAACTGATATCTGCTATAAAAGCTGAACTTTCTGCTGCGGATAGCCTTGAGCTCCCTGTTTTGAAAGATGTTGCTAGACAGTTTCTAGACAAGTCACTCTATTCATATGGATTATGCTCACAGTTTTCTCCTAGTCCCTTTTCATCTGTATTTTTCAGCACAGAAGAGCATGGCGAGAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAAGCTTCCTAATCACGACATTAATCTGGAAGCAGCTTGGCCTGAGCTCTTCATTGATCATAAAGGACAGTATTGGGATGTGCCTGAGTCTATATCTTTGGATCTTTCCTCTCTTATGTCTGTCTCTGGTTTGCGATACCGGTTTGGGTTGCATAAGAATGGTGGCCTTCCCCGGGCTCTTAATTCTACAAACGATGACCCACCTCTTGCTCTTCTGCCTGGATTATGTGCAAAGGCTGCATTTTCTTTAGAAAAGAACAGGTACATTTGGAGGATAAAAGAAAGGAAGCAAGACATGATGGAGAAGACAGACAAGGGGGAATGGTATTGGAGGCCATCATATGATGTGCGTCTCAAAGAACCTCACTCAGCGATATCTGGAATTGTCGGTGGCACCTTTAGCACCTGGTTTGGAGGCAGCGACATGGTTGGGTCCAATGGAGATGGAAACTTATCCATTGGCCCTAAGAAAAGAAGTCCATTGAATGCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTTAGAAAACAGTTTGGTGACCTTACGAGGATAGATGCTCGCTTAGATATTTCTTCAGCTTCAGGGTTTGCCAAAAGAGTTTTTAATGGTTTCAAGAAATCTGTTGATGATTTAGAGAGATCAGAATCTTCCCCCAGACTCAATTTGATCTTTCAACAGCAGGTTGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGCTTAAGCTTGATTCTTCATCTGGCAAGCGCAGTCCCCATGTCGAGGACACAATATACAGCCTGAATTATTCTTTTAGGCTTCTTCAATCAGGCAAAGCCGTTTTCTGTTCAGTTCATGCATTTGATTCTTATCTTTTTGACAACGAAATCGGCGACTTAGAGTTAGTTATAGCACTTGAGGCCTTTCTTCCCATCTATTTAGTGTTGCACAACTCTTTCAGAT

Protein sequence

MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLELPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNSTNDDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAISGIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVFCSVHAFDSYLFDNEIGDLELVIALEAFLPIYLVLHNSFRX
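
The relationship between the coding sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard nuclear genetic code and reading frame 0, and it assumes that any incomplete or unrecognized trailing codon is reported as `X`; that would be consistent with the final `X` in the protein above, but the page does not document how the `X` was produced. The function name `translate` is hypothetical.

```python
# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0.

    Assumption: partial or unrecognized codons are written as 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds), 3):
        codon = cds[pos:pos + 3]
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)

# First 21 bases and last 19 bases of the CDS listed above.
print(translate("ATGGCGTACCTCAGGACGGCC"))
print(translate("TTGCACAACTCTTTCAGAT"))
```

The first call reproduces the opening residues of the protein (`MAYLRTA`). The second covers the last 19 bases of the CDS, which leave a single unpaired `T` after six full codons, so the sketch emits `LHNSFRX`.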
Homology
BLAST of Sgr014939 vs. NCBI nr
Match: XP_038875801.1 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida])

HSP 1 Score: 837.8 bits (2163), Expect = 4.8e-239
Identity = 402/452 (88.94%), Positives = 434/452 (96.02%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MAYLRTAMDSAFWDLN+SSPQTLAGTAKAVPGEPFPLDGARASR+LRIQQISLLGNGFPL
Sbjct: 1   MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRSLRIQQISLLGNGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSP++ KELGSFSLQSLL RLPAADWWVGL+GQFRPKKLIS+IKAELSAADSLEL
Sbjct: 61  GIIPSYSPSSQKELGSFSLQSLLFRLPAADWWVGLIGQFRPKKLISSIKAELSAADSLEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
           PVLKDVARQFLDKSLY+YGLCSQFSP+PFSSV+ STE HGERKG RHKAMFYHKLP+HDI
Sbjct: 121 PVLKDVARQFLDKSLYTYGLCSQFSPNPFSSVYVSTEAHGERKGCRHKAMFYHKLPHHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNSTN-DD 240
           N++AAWPELFIDHKGQYWDVPESISLDLSSL S SGLRYR GLHKNGG+PRALNSTN +D
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSTNSND 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPLAL+PGLCAKAAFS EKNRY+WR+KERKQD++EKTDK EWYW+PSYDVRLKEPH+AIS
Sbjct: 241 PPLALMPGLCAKAAFSFEKNRYLWRVKERKQDLIEKTDKREWYWKPSYDVRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GI+GGTFS+WFGG+D  GSNGDGNL++G KKRSPLNADLFGSICYTFQHGRF+KQFGDLT
Sbjct: 301 GIIGGTFSSWFGGNDTAGSNGDGNLTMGHKKRSPLNADLFGSICYTFQHGRFKKQFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDARLDISSASGFAKRVF GFKKSVDDLERS+SSPRLNL+FQQQVAGPIVFRVDSRL L
Sbjct: 361 RIDARLDISSASGFAKRVFRGFKKSVDDLERSKSSPRLNLVFQQQVAGPIVFRVDSRLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           DS+SGK  PH+E+TIYSLNYSFRLLQSGKAVF
Sbjct: 421 DSASGKHGPHIENTIYSLNYSFRLLQSGKAVF 452
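
The percentages in an HSP summary line are the matched counts divided by the alignment length. The sketch below recomputes them from a summary line as a consistency check. The regular expression and the function name `check_hsp_stats` are illustrative assumptions about this page's text layout, not part of any BLAST tool; the pattern accepts both "Positives" and the "Postives" spelling so it works on unedited report text.

```python
import re

# Matches e.g. "Identity = 402/452 (88.94%)"; tolerates the "Postives" misspelling.
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_hsp_stats(line, tol=0.01):
    """Return {label: (recomputed_percent, matches_reported)} for one HSP line."""
    results = {}
    for label, num, den, pct in HSP_STATS.findall(line):
        recomputed = 100.0 * int(num) / int(den)
        key = "Positives" if label.startswith("Pos") else label
        results[key] = (round(recomputed, 2), abs(recomputed - float(pct)) <= tol)
    return results

line = "Identity = 402/452 (88.94%), Positives = 434/452 (96.02%), Query Frame = 0"
print(check_hsp_stats(line))
```

For the Benincasa hispida hit above, 402 identical and 434 positive positions over a 452-column alignment recompute to 88.94% and 96.02%, matching the reported values.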

BLAST of Sgr014939 vs. NCBI nr
Match: XP_022146920.1 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia])

HSP 1 Score: 804.3 bits (2076), Expect = 5.9e-229
Identity = 398/452 (88.05%), Positives = 420/452 (92.92%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MAYLRTAMDSAF DLNLSSPQTLAGTAKAVPG+PFPLDGARASR LR+QQISLLGNGFPL
Sbjct: 1   MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSPT HKELGSFSLQSLLL+LPAADWWVGLVGQFRPKKLIS+IKAELSA DSLEL
Sbjct: 61  GIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFRPKKLISSIKAELSAVDSLEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
           PVLKDVA QFLDKSLY+YGLCSQFSPSPFSS+FFSTEEHGE+KGRRHKAMFYHKLPNHDI
Sbjct: 121 PVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNSTN-DD 240
            LEAAWPELF+DHKGQYWDVPESISLDLSSL S SGLRYR GLHKNGGLPRAL+ TN D+
Sbjct: 181 FLEAAWPELFLDHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDN 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPLAL+PGLCAKAAFS EKNRY+WR++ERK+DMMEKTDKGE  WR SYDVRLKEPH+AIS
Sbjct: 241 PPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GIVGGTFSTWF GS  +GSNGDGN      KRSPLNADLFGSICYTFQ GRFRKQFGDLT
Sbjct: 301 GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDARLDISSASGFAKRVFN FK+S+DDLERS+SSPRLNLIFQQQVAGPIVFRVDS L L
Sbjct: 361 RIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           D  SG+  PHVEDTIYSLNYSFRLL+SGKAVF
Sbjct: 421 DPPSGQYRPHVEDTIYSLNYSFRLLKSGKAVF 446

BLAST of Sgr014939 vs. NCBI nr
Match: KAA0049109.1 (protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa] >TYK17454.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa])

HSP 1 Score: 764.6 bits (1973), Expect = 5.2e-217
Identity = 374/452 (82.74%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA LRTAMDSAFWD NLSSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPL
Sbjct: 1   MANLRTAMDSAFWDFNLSSPQTLAGTAKSVPGEPFPLNGARASRALRIQQLSLLGDGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSPTAHKELGSFSLQSLLLRL  A WWVGLVGQFRPKKLIS +KA+LS  D  EL
Sbjct: 61  GIIPSYSPTAHKELGSFSLQSLLLRLSGARWWVGLVGQFRPKKLISLLKAKLSDEDGFEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
             LKDVAR  LDKS Y+YG+CSQFSPSPFSSV+ STE+HGERKGRRHKAMFYH+LP HDI
Sbjct: 121 SDLKDVARLLLDKSHYTYGICSQFSPSPFSSVYVSTEKHGERKGRRHKAMFYHRLPQHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNST-NDD 240
           N++AAWPELFIDHKGQYWDVPESISLDLSS+ S SGLRYR GLHKNGG+PRALNST NDD
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSVKSKSGLRYRVGLHKNGGVPRALNSTNNDD 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPL L+PGLCAKAAFS+EK RY+WR++E+KQD  +KT +GE     SYD+RLKEPH+AIS
Sbjct: 241 PPLTLMPGLCAKAAFSIEKKRYLWRVEEKKQDKTKKTGEGELDEMTSYDMRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GIVGGTFS+WFGGS+MVGSNGDGNL++G KKRSPLNADLFGS+CYTFQ G F K FGDLT
Sbjct: 301 GIVGGTFSSWFGGSNMVGSNGDGNLTMGHKKRSPLNADLFGSVCYTFQRGSFIKDFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDA+LDISSASGFAKRVF+GFKKSVDDLERS+SSPRLNLIFQQQVAGPIVFR+DS+L L
Sbjct: 361 RIDAQLDISSASGFAKRVFHGFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLDSKLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           DS+SGK  PHVEDTIYSL YSF+LL SGKAVF
Sbjct: 421 DSASGKIGPHVEDTIYSLTYSFKLLDSGKAVF 452

BLAST of Sgr014939 vs. NCBI nr
Match: XP_008438274.1 (PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo])

HSP 1 Score: 763.8 bits (1971), Expect = 8.8e-217
Identity = 374/452 (82.74%), Positives = 410/452 (90.71%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA LRTAMDSAFWD NLSSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPL
Sbjct: 1   MANLRTAMDSAFWDFNLSSPQTLAGTAKSVPGEPFPLNGARASRALRIQQLSLLGDGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSPTAHKELGSFSLQSLLLRL  A WWVGLVGQFRPKKLIS +KA+LS  D  EL
Sbjct: 61  GIIPSYSPTAHKELGSFSLQSLLLRLSGARWWVGLVGQFRPKKLISLLKAKLSDEDGFEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
             LKDVAR  LDKS Y+YG+CSQFSPSPFSSV+ STE+HGERKGRRHKAMFYH+LP HDI
Sbjct: 121 SDLKDVARLLLDKSHYTYGICSQFSPSPFSSVYVSTEKHGERKGRRHKAMFYHRLPQHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNST-NDD 240
           N++AAWPELFIDHKGQYWDVPESISLDLSS+ S SGLRYR GLHKNGG+PRALNST NDD
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSVKSKSGLRYRVGLHKNGGVPRALNSTNNDD 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPL L+PGLCAKAAFS+EK RY+WR++E+KQD  EKT +GE     SYD+RLKEPH+AIS
Sbjct: 241 PPLTLMPGLCAKAAFSIEKKRYLWRVEEKKQDKTEKTGEGELDEMTSYDMRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GIVGGTFS+WFGGS+ VGSNGDGNL++G KKRSPLNADLFGS+CYTFQ G F K FGDLT
Sbjct: 301 GIVGGTFSSWFGGSNTVGSNGDGNLTMGHKKRSPLNADLFGSVCYTFQRGSFIKDFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDA+LDISSASGFAKRVF+GFKKSVDDLERS+SSPRLNLIFQQQVAGPIVFR+DS+L L
Sbjct: 361 RIDAQLDISSASGFAKRVFHGFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLDSKLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           DS+SGK  PHVEDTIYSL YSF+LL SGKAVF
Sbjct: 421 DSASGKIGPHVEDTIYSLTYSFKLLDSGKAVF 452

BLAST of Sgr014939 vs. NCBI nr
Match: XP_023538749.1 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 737.6 bits (1903), Expect = 6.8e-209
Identity = 365/454 (80.40%), Positives = 404/454 (88.99%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA+LRTAMDSAFWD ++SS QTL GTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPL
Sbjct: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAEL-SAADSLE 120
           GI+PS+SPTAHKELGSFSLQSLLL+ PAADWWVGLVGQFRPKK+IS+IK +L S  D+LE
Sbjct: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDLVSDLDNLE 120

Query: 121 -LPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNH 180
            LP LKDVA  FLDK+LYSYGLCSQFSP+PFSSVF STEEHG+RKGRRHKAMFYH+LP+H
Sbjct: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180

Query: 181 DINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRAL-NSTN 240
           DINLEAAWPELFIDHKGQYW+VPES+SLDLSSL S SGLRYR GLHKNGG+PRAL N+  
Sbjct: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYNTDG 240

Query: 241 DDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSA 300
            DPPL L+PGLCAKAAFSLEKNRY+W  KE+KQ + E  D  E    PSYDVRLK+PH+A
Sbjct: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETIDDAE----PSYDVRLKDPHAA 300

Query: 301 ISGIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGD 360
           ISGIVGGTFS WFGGSD VG+NGDGNL+I   KRSPLNADLFGS+C T+QHG FRK F D
Sbjct: 301 ISGIVGGTFSAWFGGSDTVGTNGDGNLAI-HNKRSPLNADLFGSLCCTYQHGSFRKDFRD 360

Query: 361 LTRIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRL 420
           LTR+DARLDISS S F+KRVFNGFKKS+DDLERS+S+PRLNLIFQQQ+AGPIVFRVDSRL
Sbjct: 361 LTRLDARLDISSGSAFSKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRL 420

Query: 421 KLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
            LDS+S KR PHVEDTI SLNYSF+LL+SGKAVF
Sbjct: 421 MLDSTSVKRGPHVEDTILSLNYSFKLLESGKAVF 449

BLAST of Sgr014939 vs. ExPASy Swiss-Prot
Match: Q9M903 (Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TGD4 PE=1 SV=1)

HSP 1 Score: 268.5 bits (685), Expect = 1.5e-70
Identity = 168/473 (35.52%), Positives = 253/473 (53.49%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+         
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  GIIPSYSP----TAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAI---KAELS 120
            +IPS+SP    T     G FSLQ +L    + +W V L+GQF  ++ ++ I   KA   
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 121 AADSLELPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEH-GE-RKGRRHKAMF 180
            + S     L  + +   DKSLY+ G CS+F  SP  ++  S + + G+  K  R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 181 YHKLPNHDINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPR 240
            H+ P H++  EA WP LF+D  G+YWDVP S+++DL+SL + SG  Y   LH N G P+
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 241 ALNS-TNDDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVR 300
            L+S T + PP +LLPGL  K+A S   N  +WR    K +  +            YDV 
Sbjct: 241 KLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCK-----------PYDVF 300

Query: 301 LKEPHSAISGIVGGTFSTWFGGSDMVG-----SNGDGNLSIG-PKKRSPLNADLFGSICY 360
           L  PH A+SGI+G   +  FG + +       S G G  S+  P   S   AD  G    
Sbjct: 301 LSSPHVAVSGIIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASL 360

Query: 361 TFQHGRFRKQFGDLTRIDARLD-------ISSASGFAKRVFNGFKKSVDDLERSESSPRL 420
           T Q+G F+K F DLTR  ARLD       ++ A+  A+ + N  + S++  ++    P +
Sbjct: 361 TAQYGNFQKFFFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQK--ICPEV 420

Query: 421 NLIFQQQVAGPIVFRVDSRLKLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAV 451
            +  QQQ+ GP  F+V+S +++D  +G     V+ T++++ Y+ ++L S KAV
Sbjct: 421 LVSLQQQIVGPFSFKVESGIEIDLRNGANPVTVDKTVFAIEYALQVLLSAKAV 460

BLAST of Sgr014939 vs. ExPASy TrEMBL
Match: A0A6J1CYP7 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111016004 PE=4 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 2.8e-229
Identity = 398/452 (88.05%), Positives = 420/452 (92.92%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MAYLRTAMDSAF DLNLSSPQTLAGTAKAVPG+PFPLDGARASR LR+QQISLLGNGFPL
Sbjct: 1   MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSPT HKELGSFSLQSLLL+LPAADWWVGLVGQFRPKKLIS+IKAELSA DSLEL
Sbjct: 61  GIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFRPKKLISSIKAELSAVDSLEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
           PVLKDVA QFLDKSLY+YGLCSQFSPSPFSS+FFSTEEHGE+KGRRHKAMFYHKLPNHDI
Sbjct: 121 PVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNSTN-DD 240
            LEAAWPELF+DHKGQYWDVPESISLDLSSL S SGLRYR GLHKNGGLPRAL+ TN D+
Sbjct: 181 FLEAAWPELFLDHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDN 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPLAL+PGLCAKAAFS EKNRY+WR++ERK+DMMEKTDKGE  WR SYDVRLKEPH+AIS
Sbjct: 241 PPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GIVGGTFSTWF GS  +GSNGDGN      KRSPLNADLFGSICYTFQ GRFRKQFGDLT
Sbjct: 301 GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDARLDISSASGFAKRVFN FK+S+DDLERS+SSPRLNLIFQQQVAGPIVFRVDS L L
Sbjct: 361 RIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           D  SG+  PHVEDTIYSLNYSFRLL+SGKAVF
Sbjct: 421 DPPSGQYRPHVEDTIYSLNYSFRLLKSGKAVF 446

BLAST of Sgr014939 vs. ExPASy TrEMBL
Match: A0A5D3D2D9 (Protein TRIGALACTOSYLDIACYLGLYCEROL 4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G003180 PE=4 SV=1)

HSP 1 Score: 764.6 bits (1973), Expect = 2.5e-217
Identity = 374/452 (82.74%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA LRTAMDSAFWD NLSSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPL
Sbjct: 1   MANLRTAMDSAFWDFNLSSPQTLAGTAKSVPGEPFPLNGARASRALRIQQLSLLGDGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSPTAHKELGSFSLQSLLLRL  A WWVGLVGQFRPKKLIS +KA+LS  D  EL
Sbjct: 61  GIIPSYSPTAHKELGSFSLQSLLLRLSGARWWVGLVGQFRPKKLISLLKAKLSDEDGFEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
             LKDVAR  LDKS Y+YG+CSQFSPSPFSSV+ STE+HGERKGRRHKAMFYH+LP HDI
Sbjct: 121 SDLKDVARLLLDKSHYTYGICSQFSPSPFSSVYVSTEKHGERKGRRHKAMFYHRLPQHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNST-NDD 240
           N++AAWPELFIDHKGQYWDVPESISLDLSS+ S SGLRYR GLHKNGG+PRALNST NDD
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSVKSKSGLRYRVGLHKNGGVPRALNSTNNDD 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPL L+PGLCAKAAFS+EK RY+WR++E+KQD  +KT +GE     SYD+RLKEPH+AIS
Sbjct: 241 PPLTLMPGLCAKAAFSIEKKRYLWRVEEKKQDKTKKTGEGELDEMTSYDMRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GIVGGTFS+WFGGS+MVGSNGDGNL++G KKRSPLNADLFGS+CYTFQ G F K FGDLT
Sbjct: 301 GIVGGTFSSWFGGSNMVGSNGDGNLTMGHKKRSPLNADLFGSVCYTFQRGSFIKDFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDA+LDISSASGFAKRVF+GFKKSVDDLERS+SSPRLNLIFQQQVAGPIVFR+DS+L L
Sbjct: 361 RIDAQLDISSASGFAKRVFHGFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLDSKLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           DS+SGK  PHVEDTIYSL YSF+LL SGKAVF
Sbjct: 421 DSASGKIGPHVEDTIYSLTYSFKLLDSGKAVF 452

BLAST of Sgr014939 vs. ExPASy TrEMBL
Match: A0A1S3AWM5 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103483435 PE=4 SV=1)

HSP 1 Score: 763.8 bits (1971), Expect = 4.3e-217
Identity = 374/452 (82.74%), Positives = 410/452 (90.71%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA LRTAMDSAFWD NLSSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPL
Sbjct: 1   MANLRTAMDSAFWDFNLSSPQTLAGTAKSVPGEPFPLNGARASRALRIQQLSLLGDGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPSYSPTAHKELGSFSLQSLLLRL  A WWVGLVGQFRPKKLIS +KA+LS  D  EL
Sbjct: 61  GIIPSYSPTAHKELGSFSLQSLLLRLSGARWWVGLVGQFRPKKLISLLKAKLSDEDGFEL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
             LKDVAR  LDKS Y+YG+CSQFSPSPFSSV+ STE+HGERKGRRHKAMFYH+LP HDI
Sbjct: 121 SDLKDVARLLLDKSHYTYGICSQFSPSPFSSVYVSTEKHGERKGRRHKAMFYHRLPQHDI 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNST-NDD 240
           N++AAWPELFIDHKGQYWDVPESISLDLSS+ S SGLRYR GLHKNGG+PRALNST NDD
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSVKSKSGLRYRVGLHKNGGVPRALNSTNNDD 240

Query: 241 PPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSAIS 300
           PPL L+PGLCAKAAFS+EK RY+WR++E+KQD  EKT +GE     SYD+RLKEPH+AIS
Sbjct: 241 PPLTLMPGLCAKAAFSIEKKRYLWRVEEKKQDKTEKTGEGELDEMTSYDMRLKEPHAAIS 300

Query: 301 GIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGDLT 360
           GIVGGTFS+WFGGS+ VGSNGDGNL++G KKRSPLNADLFGS+CYTFQ G F K FGDLT
Sbjct: 301 GIVGGTFSSWFGGSNTVGSNGDGNLTMGHKKRSPLNADLFGSVCYTFQRGSFIKDFGDLT 360

Query: 361 RIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLKL 420
           RIDA+LDISSASGFAKRVF+GFKKSVDDLERS+SSPRLNLIFQQQVAGPIVFR+DS+L L
Sbjct: 361 RIDAQLDISSASGFAKRVFHGFKKSVDDLERSKSSPRLNLIFQQQVAGPIVFRLDSKLML 420

Query: 421 DSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
           DS+SGK  PHVEDTIYSL YSF+LL SGKAVF
Sbjct: 421 DSASGKIGPHVEDTIYSLTYSFKLLDSGKAVF 452

BLAST of Sgr014939 vs. ExPASy TrEMBL
Match: A0A6J1IIJ0 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473485 PE=4 SV=1)

HSP 1 Score: 735.7 bits (1898), Expect = 1.2e-208
Identity = 364/454 (80.18%), Positives = 405/454 (89.21%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA+LRTAMDSAFW+ ++SS QTL GTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPL
Sbjct: 1   MAHLRTAMDSAFWNFDVSSSQTLVGTAKAVPGEPFPLDGARASRTLRIQQVSFLGNGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAEL-SAADSLE 120
           GI+PS+SPTAHKELGSFSLQSLLL+ PAADWWVGLVGQFRPKK+IS IK +L S  D+LE
Sbjct: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISNIKEDLISDLDNLE 120

Query: 121 -LPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNH 180
            LP LKDVA  FLDK+LYSYGLCSQFSP+PFSSVF STEEHG+RKGRRHKAMFYH+LP+H
Sbjct: 121 LLPALKDVATMFLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180

Query: 181 DINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNSTN- 240
           DINLEAAWPELFIDHKGQYW+VPES+SLDLSSL S SGLRYR GLHKNGG+PRAL  T+ 
Sbjct: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDG 240

Query: 241 DDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSA 300
            +PPL L+PGLCAKAAFSLEKNRY+W  KE+KQ + E TD+ E    PSYDVRLK+PH+A
Sbjct: 241 GEPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGLTETTDEAE----PSYDVRLKDPHAA 300

Query: 301 ISGIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGD 360
           ISGIVGGTFS+WFGGSD VG+NGDGNL+I   KRSPLNADLFGS+CYT+QHG FRK F D
Sbjct: 301 ISGIVGGTFSSWFGGSDTVGTNGDGNLAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFRD 360

Query: 361 LTRIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRL 420
           LTR+DARLDISS S FAKRVFNGFKKS+DDLERS+S+PRLNLIFQQQ+AGPIVFRVDSRL
Sbjct: 361 LTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRL 420

Query: 421 KLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
            L S+S K  PHVEDTI SLNYSF+LL+SGKAVF
Sbjct: 421 MLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVF 449

BLAST of Sgr014939 vs. ExPASy TrEMBL
Match: A0A6J1FCB0 (protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111442798 PE=4 SV=1)

HSP 1 Score: 734.9 bits (1896), Expect = 2.1e-208
Identity = 364/454 (80.18%), Positives = 404/454 (88.99%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA+LRTAMDSAFWD ++SS QTL GTAKAVPG PFPLDGARASR LRIQQ+S LGNGFPL
Sbjct: 1   MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAE-LSAADSLE 120
           GI+PS+SPTAHKELGSFSLQSLLL+ PAADWWVGLVGQFRPKK+IS+IK + +S  D+LE
Sbjct: 61  GILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKEDIISDLDNLE 120

Query: 121 -LPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNH 180
            LP LKDVA   LDK+LYSYGLCSQFSP+PFSSVF STEEHG+RKGRRHKAMFYH+LP+H
Sbjct: 121 LLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHH 180

Query: 181 DINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNSTN- 240
           DINLEAAWPELFIDHKGQYW+VPES+SLDLSSL S SGLRYR GLHKNGG+PRAL  T+ 
Sbjct: 181 DINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDG 240

Query: 241 DDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHSA 300
            DPPL L+PGLCAKAAFSLEKNRY+W  KE+KQ + E TD+ E    PSYDVRLK+PH+A
Sbjct: 241 GDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA 300

Query: 301 ISGIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFGD 360
           ISGIVGGTFS WFGGSD VG+NGDGNL+I   KRSPLNADLFGS+CYT+QHG FRK F D
Sbjct: 301 ISGIVGGTFSAWFGGSDTVGTNGDGNLAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCD 360

Query: 361 LTRIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRL 420
           LTR+DARLDISS S FAKRVFNGFKKS+DDLERS+S+PRLNLIFQQQ+AGPIVFRVDSRL
Sbjct: 361 LTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRL 420

Query: 421 KLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAVF 452
            L S+S KR PHVEDTI SLNYSF+LL+SGKAVF
Sbjct: 421 MLGSTSVKRGPHVEDTILSLNYSFKLLESGKAVF 449

BLAST of Sgr014939 vs. TAIR 10
Match: AT2G44640.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, chloroplast, plasma membrane, plastid, chloroplast envelope; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3769 (InterPro:IPR022244); BEST Arabidopsis thaliana protein match is: pigment defective 320 (TAIR:AT3G06960.1); Has 49 Blast hits to 48 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 491.9 bits (1265), Expect = 6.0e-139
Identity = 251/454 (55.29%), Positives = 330/454 (72.69%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           MA L +A+DS FWD N+SSPQTL GTA++VPGEPFPLDGARASR+ RIQQ+SLL  GFPL
Sbjct: 1   MANLNSAIDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60

Query: 61  GIIPSYSPTAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAIKAELSAADSLEL 120
           GIIPS +P + K LGSFSL SLLL   + +WW+GLVGQF+PKKL + IKA++S A+  +L
Sbjct: 61  GIIPSLAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120

Query: 121 PVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEHGERKGRRHKAMFYHKLPNHDI 180
            V+KD A+  +DKSLYS GL +Q +    SS+  STE  G++ G R+K M  H L  HD+
Sbjct: 121 QVVKDTAKHIVDKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKHDL 180

Query: 181 NLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPRALNS----T 240
            +EAAWP+LF+D+KG++WDVPES+++D+SSL+  SG+RYRFGLHK+ G P+ +N+    +
Sbjct: 181 TVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGVES 240

Query: 241 NDDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVRLKEPHS 300
             D P +L+PGLCAKAA S + NR +WR +E K+   E+ DK  +     YD+RLKEPH+
Sbjct: 241 GSDAPTSLMPGLCAKAAVSYKVNRDLWRPQE-KEGNTEEEDKPVFL---PYDLRLKEPHA 300

Query: 301 AISGIVGGTFSTWFGGSDMVGSNGDGNLSIGPKKRSPLNADLFGSICYTFQHGRFRKQFG 360
           AISGIVG + + W  G  M+         +  KKRSP++AD+FGS CYTFQ GRF K +G
Sbjct: 301 AISGIVGSSLAAWITGRGML---------VNGKKRSPISADVFGSACYTFQKGRFSKLYG 360

Query: 361 DLTRIDARLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSR 420
           DLTR+DAR+D+ SA   AK++F+    + DD   +  SPRLNLIFQQQVAGPIVF+VDS+
Sbjct: 361 DLTRVDARVDLPSAFALAKKLFHASSNNSDD---TLWSPRLNLIFQQQVAGPIVFKVDSQ 420

Query: 421 LKLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAV 451
            ++ ++       +ED IYSLNYS RLL+SGK V
Sbjct: 421 FQVGAA------RMEDVIYSLNYSLRLLESGKIV 432

BLAST of Sgr014939 vs. TAIR 10
Match: AT3G06960.1 (pigment defective 320 )

HSP 1 Score: 268.5 bits (685), Expect = 1.1e-71
Identity = 168/473 (35.52%), Positives = 253/473 (53.49%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+         
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  GIIPSYSP----TAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAI---KAELS 120
            +IPS+SP    T     G FSLQ +L    + +W V L+GQF  ++ ++ I   KA   
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 121 AADSLELPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEH-GE-RKGRRHKAMF 180
            + S     L  + +   DKSLY+ G CS+F  SP  ++  S + + G+  K  R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 181 YHKLPNHDINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPR 240
            H+ P H++  EA WP LF+D  G+YWDVP S+++DL+SL + SG  Y   LH N G P+
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 241 ALNS-TNDDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVR 300
            L+S T + PP +LLPGL  K+A S   N  +WR    K +  +            YDV 
Sbjct: 241 KLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCK-----------PYDVF 300

Query: 301 LKEPHSAISGIVGGTFSTWFGGSDMVG-----SNGDGNLSIG-PKKRSPLNADLFGSICY 360
           L  PH A+SGI+G   +  FG + +       S G G  S+  P   S   AD  G    
Sbjct: 301 LSSPHVAVSGIIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASL 360

Query: 361 TFQHGRFRKQFGDLTRIDARLD-------ISSASGFAKRVFNGFKKSVDDLERSESSPRL 420
           T Q+G F+K F DLTR  ARLD       ++ A+  A+ + N  + S++  ++    P +
Sbjct: 361 TAQYGNFQKFFFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQK--ICPEV 420

Query: 421 NLIFQQQVAGPIVFRVDSRLKLDSSSGKRSPHVEDTIYSLNYSFRLLQSGKAV 451
            +  QQQ+ GP  F+V+S +++D  +G     V+ T++++ Y+ ++L S KAV
Sbjct: 421 LVSLQQQIVGPFSFKVESGIEIDLRNGANPVTVDKTVFAIEYALQVLLSAKAV 460

BLAST of Sgr014939 vs. TAIR 10
Match: AT3G06960.2 (pigment defective 320 )

HSP 1 Score: 199.9 bits (507), Expect = 4.7e-51
Identity = 120/313 (38.34%), Positives = 172/313 (54.95%), Query Frame = 0

Query: 1   MAYLRTAMDSAFWDLNLSSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPL 60
           M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+         
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  GIIPSYSP----TAHKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISAI---KAELS 120
            +IPS+SP    T     G FSLQ +L    + +W V L+GQF  ++ ++ I   KA   
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 121 AADSLELPVLKDVARQFLDKSLYSYGLCSQFSPSPFSSVFFSTEEH-GE-RKGRRHKAMF 180
            + S     L  + +   DKSLY+ G CS+F  SP  ++  S + + G+  K  R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 181 YHKLPNHDINLEAAWPELFIDHKGQYWDVPESISLDLSSLMSVSGLRYRFGLHKNGGLPR 240
            H+ P H++  EA WP LF+D  G+YWDVP S+++DL+SL + SG  Y   LH N G P+
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 241 ALNS-TNDDPPLALLPGLCAKAAFSLEKNRYIWRIKERKQDMMEKTDKGEWYWRPSYDVR 300
            L+S T + PP +LLPGL  K+A S   N  +WR    K +  +            YDV 
Sbjct: 241 KLHSDTMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCK-----------PYDVF 300

Query: 301 LKEPHSAISGIVG 304
           L  PH A+SGI+G
Sbjct: 301 LSSPHVAVSGIIG 302
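
The percentages on each HSP statistics line are the identity and positive counts divided by the alignment length. As a minimal sketch (the function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool), the following Python recomputes them from a statistics line in the format shown above:

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" portion of an HSP
# statistics line. "Posi?tives" also accepts the misspelling "Postives",
# which some database pages emit. This pattern is an assumption based on
# the lines shown in this record, not an official BLAST format definition.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity and positive percentages from an HSP stats line.

    Returns a dict with the recomputed percentages (rounded to two decimals)
    and a flag saying whether they agree with the percentages printed in the
    line itself.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)

    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()

    # Percentage = count / alignment length * 100.
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)

    return {
        "identity_pct": identity,
        "positives_pct": positives,
        "consistent": identity == float(ident_pct)
        and positives == float(pos_pct),
    }


if __name__ == "__main__":
    stats = "Identity = 168/473 (35.52%), Positives = 253/473 (53.49%), Query Frame = 0"
    print(check_hsp_stats(stats))
```

Applied to the two AT3G06960 HSPs above, this gives 168/473 = 35.52% and 120/313 = 38.34% identity, in agreement with the printed values.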

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038875801.1  4.8e-239  88.94         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]
XP_022146920.1  5.9e-229  88.05         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia]
KAA0049109.1    5.2e-217  82.74         protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa] >TYK17454.1 pro...
XP_008438274.1  8.8e-217  82.74         PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]
XP_023538749.1  6.8e-209  80.40         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo...

Match Name      E-value   Identity (%)  Description
Q9M903          1.5e-70   35.52         Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Arabidopsis thaliana OX=...

Match Name      E-value   Identity (%)  Description
A0A6J1CYP7      2.8e-229  88.05         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Momordica charantia OX=3...
A0A5D3D2D9      2.5e-217  82.74         Protein TRIGALACTOSYLDIACYLGLYCEROL 4 OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A1S3AWM5      4.3e-217  82.74         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Cucumis melo OX=3656 GN=...
A0A6J1IIJ0      1.2e-208  80.18         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 OS=Cucurbit...
A0A6J1FCB0      2.1e-208  80.18         protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic OS=Cucurbita moschata OX=36...

Match Name      E-value   Identity (%)  Description
AT2G44640.1     6.0e-139  55.29         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT3G06960.1     1.1e-71   35.52         pigment defective 320
AT3G06960.2     4.7e-51   38.34         pigment defective 320
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                             Source   Source Term    Source Description  Alignment
None       No IPR available                            PANTHER  PTHR34954:SF3  EXPRESSED PROTEIN   coord: 1..451
IPR044160  Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like  PANTHER  PTHR34954      EXPRESSED PROTEIN   coord: 1..451

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr014939.1   Sgr014939.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0034196 acylglycerol transport
biological_process GO:1990052 ER to chloroplast lipid transport
cellular_component GO:0009941 chloroplast envelope
molecular_function GO:0070300 phosphatidic acid binding