HG10023363 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023363
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein THYLAKOID ASSEMBLY 8, chloroplastic
Location: Chr05: 33389064 .. 33389747 (-)
RNA-Seq Expression: HG10023363
Synteny: HG10023363
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTTGTCTCTTCACTCCACATTTCTCAAATCCCAAATCTCGATTCCGATCCCCGTCTCCGGCGCGGCTTCCGTCGCCGTTTCCCTTCCGGTACGCTGCGGCCCTCGGGACAACCGAGGACCGCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTAAAACGAGCCGAGAGATCCGACCCGACGAAGCTCCAACAAGTGCTCTCCACTACGCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGACCGGTGCGCCCTTGCTTTGGAGGTTTTCGCCGTAATCCGATCCGAGTACGGCGCCGAATTGGGGCTGTACGCGGAGGTGGCAGTGGCGCTGTCAAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTAGACGGCGGAGATGGGCTGATTCAGTGGGATGATAAGGGTTTGATTAAGTTGATTAAGGCGGTTATTAGTGGGGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGAAGGAACGGTTGGGGATCCACCATTAAAGCTGATGATTATATGGTTAGGGTTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTTCAAGATTTAGTGGGCAGTTTATGA

mRNA sequence

ATGGCTTTGTCTCTTCACTCCACATTTCTCAAATCCCAAATCTCGATTCCGATCCCCGTCTCCGGCGCGGCTTCCGTCGCCGTTTCCCTTCCGGTACGCTGCGGCCCTCGGGACAACCGAGGACCGCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTAAAACGAGCCGAGAGATCCGACCCGACGAAGCTCCAACAAGTGCTCTCCACTACGCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGACCGGTGCGCCCTTGCTTTGGAGGTTTTCGCCGTAATCCGATCCGAGTACGGCGCCGAATTGGGGCTGTACGCGGAGGTGGCAGTGGCGCTGTCAAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTAGACGGCGGAGATGGGCTGATTCAGTGGGATGATAAGGGTTTGATTAAGTTGATTAAGGCGGTTATTAGTGGGGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGAAGGAACGGTTGGGGATCCACCATTAAAGCTGATGATTATATGGTTAGGGTTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTTCAAGATTTAGTGGGCAGTTTATGA

Coding sequence (CDS)

ATGGCTTTGTCTCTTCACTCCACATTTCTCAAATCCCAAATCTCGATTCCGATCCCCGTCTCCGGCGCGGCTTCCGTCGCCGTTTCCCTTCCGGTACGCTGCGGCCCTCGGGACAACCGAGGACCGCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTAAAACGAGCCGAGAGATCCGACCCGACGAAGCTCCAACAAGTGCTCTCCACTACGCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGACCGGTGCGCCCTTGCTTTGGAGGTTTTCGCCGTAATCCGATCCGAGTACGGCGCCGAATTGGGGCTGTACGCGGAGGTGGCAGTGGCGCTGTCAAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTAGACGGCGGAGATGGGCTGATTCAGTGGGATGATAAGGGTTTGATTAAGTTGATTAAGGCGGTTATTAGTGGGGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGAAGGAACGGTTGGGGATCCACCATTAAAGCTGATGATTATATGGTTAGGGTTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTTCAAGATTTAGTGGGCAGTTTATGA

Protein sequence

MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGSL
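
The sequences above can be cross-checked against each other. The following is a minimal sketch, not part of the database record. It assumes that the Location coordinates are 1-based and inclusive, and that the CDS spans the whole locus (the gene, mRNA, and CDS sequences listed here are identical). The names `translate`, `cds_head`, and `cds_tail` are illustrative; only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Locus length from the Location field (assumed 1-based, inclusive).
start, end = 33389064, 33389747
locus_length = end - start + 1

# Assuming the CDS covers the whole locus, one codon is the stop codon.
expected_residues = locus_length // 3 - 1

cds_head = "ATGGCTTTGTCTCTTCACTCCACATTTCTC"  # first 30 nt of the CDS above
cds_tail = "GGCAGTTTATGA"                    # last 12 nt of the CDS above

print(locus_length, expected_residues)
print(translate(cds_head))
print(translate(cds_tail))
```

Under these assumptions the locus length is a multiple of three, and the translated head and tail can be compared directly with the start and end of the protein sequence listed above.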
Homology
BLAST of HG10023363 vs. NCBI nr
Match: XP_038898717.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida])

HSP 1 Score: 406.8 bits (1044), Expect = 1.3e-109
Identity = 213/227 (93.83%), Positives = 221/227 (97.36%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60
           MA S+HSTFLKSQISIPIPVS AA+VAVSLPVRCGPRDNRGPLVKGRTLS EAIQAIQSL
Sbjct: 1   MASSIHSTFLKSQISIPIPVSAAAAVAVSLPVRCGPRDNRGPLVKGRTLSIEAIQAIQSL 60

Query: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGL 120
           KRAERSDPTKLQQVLSTTLSRLLKADLVA+LKELLRQDRCALALEVFAVIRSEYGA+LG+
Sbjct: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVASLKELLRQDRCALALEVFAVIRSEYGADLGM 120

Query: 121 YAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180
           YAEVA ALSRNGAAEEIDRLVCDLDGGD LIQWDDKGLIKLIKAVISGDRRESTVRIYRM
Sbjct: 121 YAEVAAALSRNGAAEEIDRLVCDLDGGDVLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180

Query: 181 MRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGSL 228
           MRRNGWGSTIKADDY+VRVLSKGLRR GEMELADEINREFQDLVG++
Sbjct: 181 MRRNGWGSTIKADDYLVRVLSKGLRRFGEMELADEINREFQDLVGNV 227
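
The Identity and Positives figures reported for an HSP can be recomputed from the middle line of the alignment. The sketch below is illustrative only. It relies on the usual BLAST midline convention (a letter marks an identical residue, `+` marks a positive-scoring substitution, a space marks everything else); the function name `midline_counts` is hypothetical. The four midline strings are copied from the Benincasa hispida alignment above.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus '+' (positive-scoring substitutions).
    positives = identities + midline.count("+")
    return identities, positives


# Midlines of the four alignment blocks for XP_038898717.1.
midlines = [
    "MA S+HSTFLKSQISIPIPVS AA+VAVSLPVRCGPRDNRGPLVKGRTLS EAIQAIQSL",
    "KRAERSDPTKLQQVLSTTLSRLLKADLVA+LKELLRQDRCALALEVFAVIRSEYGA+LG+",
    "YAEVA ALSRNGAAEEIDRLVCDLDGGD LIQWDDKGLIKLIKAVISGDRRESTVRIYRM",
    "MRRNGWGSTIKADDY+VRVLSKGLRR GEMELADEINREFQDLVG++",
]

aligned_length = sum(len(m) for m in midlines)
identities = sum(midline_counts(m)[0] for m in midlines)
positives = sum(midline_counts(m)[1] for m in midlines)

print(f"Identity = {identities}/{aligned_length} "
      f"({100 * identities / aligned_length:.2f}%)")
print(f"Positives = {positives}/{aligned_length} "
      f"({100 * positives / aligned_length:.2f}%)")
```

The same counting applies to the other HSPs on this page, provided gap characters in the query or subject lines are included in the aligned length.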

BLAST of HG10023363 vs. NCBI nr
Match: XP_022954347.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata])

HSP 1 Score: 380.6 bits (976), Expect = 9.7e-102
Identity = 198/226 (87.61%), Positives = 212/226 (93.81%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60
           MA SLHSTFLKSQI IPIP+S AA+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL
Sbjct: 1   MAFSLHSTFLKSQIPIPIPISAAATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60

Query: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGL 120
           KRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRCALALEVFAV+RSEYGA+LG+
Sbjct: 61  KRAEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGADLGM 120

Query: 121 YAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180
           YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRM
Sbjct: 121 YAEVAAALSRNGAMEEIDRLVCDLEDEDRVIQCDDKGLIKLIKAVIGGDRRESTVRIYRM 180

Query: 181 MRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS 227
           M+R+GWGSTIKADDY VRVLSKGLRRLGEME+ADE+N +FQDLVGS
Sbjct: 181 MKRSGWGSTIKADDYTVRVLSKGLRRLGEMEMADEVNMQFQDLVGS 226

BLAST of HG10023363 vs. NCBI nr
Match: XP_023522652.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023548438.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023548440.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 380.2 bits (975), Expect = 1.3e-101
Identity = 198/226 (87.61%), Positives = 211/226 (93.36%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60
           MA SLHSTFLKSQI IPIP+S  A+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL
Sbjct: 1   MAFSLHSTFLKSQIPIPIPISATATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60

Query: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGL 120
           KRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRCALALEVFAV+RSEYGA+LG+
Sbjct: 61  KRAEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGADLGM 120

Query: 121 YAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180
           YAEVA ALSRNGAAEEIDRLVCDL+  D  IQ DDKGLIKLIKAVI GDRRESTVRIYRM
Sbjct: 121 YAEVAAALSRNGAAEEIDRLVCDLEDEDRAIQCDDKGLIKLIKAVIGGDRRESTVRIYRM 180

Query: 181 MRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS 227
           M+R+GWGSTIKADDY VRVLSKGLRRLGEME+ADE+N +FQDLVGS
Sbjct: 181 MKRSGWGSTIKADDYTVRVLSKGLRRLGEMEMADEVNMQFQDLVGS 226

BLAST of HG10023363 vs. NCBI nr
Match: XP_022992386.1 (protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima])

HSP 1 Score: 380.2 bits (975), Expect = 1.3e-101
Identity = 198/226 (87.61%), Positives = 211/226 (93.36%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60
           MA SLHSTFLKSQI IPIP+S  A+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL
Sbjct: 1   MAFSLHSTFLKSQIPIPIPISATATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60

Query: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGL 120
           KRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRCALALEVFAV+RSEYG +LG+
Sbjct: 61  KRAEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGVDLGM 120

Query: 121 YAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180
           YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRM
Sbjct: 121 YAEVAAALSRNGAMEEIDRLVCDLEDEDRVIQCDDKGLIKLIKAVIGGDRRESTVRIYRM 180

Query: 181 MRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS 227
           M+R+GWGSTIKADDYMVRVLSKGLRRLGEME+ADEIN +FQDLVGS
Sbjct: 181 MKRSGWGSTIKADDYMVRVLSKGLRRLGEMEMADEINMQFQDLVGS 226

BLAST of HG10023363 vs. NCBI nr
Match: XP_008462172.1 (PREDICTED: uncharacterized protein LOC103500597 [Cucumis melo])

HSP 1 Score: 379.8 bits (974), Expect = 1.7e-101
Identity = 205/230 (89.13%), Positives = 215/230 (93.48%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQS 60
           MA SLHSTFLKSQISIPIP S A A+VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQS
Sbjct: 28  MASSLHSTFLKSQISIPIPTSTATAAVAVSFRVRCGPRDNRGPLVKGRTLSIEAIQAIQS 87

Query: 61  LKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELG 120
           LKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+RCALALEVFAVIRSEY AELG
Sbjct: 88  LKRAERSDPTKLQQVLSTTLSRLLKADLVATLKELLRQERCALALEVFAVIRSEYRAELG 147

Query: 121 LYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRI 180
           LYAEVA ALSRNGAAEEIDRLVCDLDG DG+I+W  DDKGLIKLIKAVISG+RRESTVRI
Sbjct: 148 LYAEVAAALSRNGAAEEIDRLVCDLDGRDGVIEWGDDDKGLIKLIKAVISGNRRESTVRI 207

Query: 181 YRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGSL 228
           YRMMRRNGWGS IK DDYM++V+SKGLRR+GE+ELADEINREFQDLVGSL
Sbjct: 208 YRMMRRNGWGSMIKGDDYMIKVMSKGLRRVGEIELADEINREFQDLVGSL 257

BLAST of HG10023363 vs. ExPASy Swiss-Prot
Match: Q9LVW6 (Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8 PE=2 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 3.5e-46
Identity = 115/228 (50.44%), Positives = 158/228 (69.30%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLP------VRCGPRDNRGPLVKGRTLSTEAI 60
           MALSL  T        P  +S + +++V +P      +RCGPRDNRGPL+KGR LSTEAI
Sbjct: 1   MALSLSQT-------RPPSLSHSHTLSVIVPKRTFVSIRCGPRDNRGPLLKGRILSTEAI 60

Query: 61  QAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAALKELLRQDRCALALEVFAVIR 120
           Q+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQD C LA+ V + +R
Sbjct: 61  QSIQSLKRAHRTGVS-----LSLTLRPLRRLIKSDLISVLRELLRQDYCTLAVHVLSTLR 120

Query: 121 SEY-GAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDR 180
           +EY   +L LYA++  AL+RN   +EIDRL+ ++DG D   + DDK L KLI+AV+  +R
Sbjct: 121 TEYPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDGIDQ--RSDDKALAKLIRAVVGAER 180

Query: 181 RESTVRIYRMMRRNGWGS-TIKADDYMVRVLSKGLRRLGEMELADEIN 218
           RES VR+Y +MR +GWGS + +AD+Y+  VLSKGL RLGE +LA +++
Sbjct: 181 RESVVRVYTLMRESGWGSESWEADEYVAEVLSKGLLRLGEPDLASQVS 214

BLAST of HG10023363 vs. ExPASy Swiss-Prot
Match: Q9STF9 (Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THA8L PE=1 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 1.9e-07
Identity = 53/187 (28.34%), Positives = 99/187 (52.94%), Query Frame = 0

Query: 40  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQD 99
           RGPL +G+ L   EA+  I  LKR  + D  KL + + T + RLLK D++A + EL RQ+
Sbjct: 63  RGPLWRGKKLIGKEALFVILGLKRL-KEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQE 122

Query: 100 RCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDK 159
             ALA+++F VI+ +  Y  ++ +Y ++ V+L+++   +E   L   +   +  +  D +
Sbjct: 123 ETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKEN--LFPDSQ 182

Query: 160 GLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEI 219
              ++I+  +        + +Y  M +    S    ++   RVL KGL  L    L +++
Sbjct: 183 TYTEVIRGFLRDGCPADAMNVYEDMLK----SPDPPEELPFRVLLKGL--LPHPLLRNKV 240

Query: 220 NREFQDL 224
            ++F++L
Sbjct: 243 KKDFEEL 240

BLAST of HG10023363 vs. ExPASy Swiss-Prot
Match: Q1PFH7 (Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX=3702 GN=At1g62350 PE=2 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 7.2e-07
Identity = 47/177 (26.55%), Positives = 92/177 (51.98%), Query Frame = 0

Query: 49  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFA 108
           +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ 
Sbjct: 1   MSKEGLIAAKELKRLQ-TQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYE 60

Query: 109 VIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVI 168
           V+R E  Y  ++  Y ++ + L+RN   +E  ++  DL   +  + +D      L++  +
Sbjct: 61  VVRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEE--VLFDQHTFGDLVRGFL 120

Query: 169 SGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL 224
             +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Sbjct: 121 DNELPLEAMRLYGEMRE----SPDRPLSLPFRVILKGL--VPYPELREKVKDDFLEL 168

BLAST of HG10023363 vs. ExPASy TrEMBL
Match: A0A6J1GQS0 (protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111456620 PE=4 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 4.7e-102
Identity = 198/226 (87.61%), Positives = 212/226 (93.81%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60
           MA SLHSTFLKSQI IPIP+S AA+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL
Sbjct: 1   MAFSLHSTFLKSQIPIPIPISAAATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60

Query: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGL 120
           KRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRCALALEVFAV+RSEYGA+LG+
Sbjct: 61  KRAEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGADLGM 120

Query: 121 YAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180
           YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRM
Sbjct: 121 YAEVAAALSRNGAMEEIDRLVCDLEDEDRVIQCDDKGLIKLIKAVIGGDRRESTVRIYRM 180

Query: 181 MRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS 227
           M+R+GWGSTIKADDY VRVLSKGLRRLGEME+ADE+N +FQDLVGS
Sbjct: 181 MKRSGWGSTIKADDYTVRVLSKGLRRLGEMEMADEVNMQFQDLVGS 226

BLAST of HG10023363 vs. ExPASy TrEMBL
Match: A0A6J1JTE8 (protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111488709 PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 6.1e-102
Identity = 198/226 (87.61%), Positives = 211/226 (93.36%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60
           MA SLHSTFLKSQI IPIP+S  A+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL
Sbjct: 1   MAFSLHSTFLKSQIPIPIPISATATVVVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSL 60

Query: 61  KRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELGL 120
           KRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRCALALEVFAV+RSEYG +LG+
Sbjct: 61  KRAEKSDPTKLEQVLSTTLSRLLKADLVATLKELLRQDRCALALEVFAVVRSEYGVDLGM 120

Query: 121 YAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRM 180
           YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRM
Sbjct: 121 YAEVAAALSRNGAMEEIDRLVCDLEDEDRVIQCDDKGLIKLIKAVIGGDRRESTVRIYRM 180

Query: 181 MRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS 227
           M+R+GWGSTIKADDYMVRVLSKGLRRLGEME+ADEIN +FQDLVGS
Sbjct: 181 MKRSGWGSTIKADDYMVRVLSKGLRRLGEMEMADEINMQFQDLVGS 226

BLAST of HG10023363 vs. ExPASy TrEMBL
Match: A0A1S3CGD3 (uncharacterized protein LOC103500597 OS=Cucumis melo OX=3656 GN=LOC103500597 PE=4 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 8.0e-102
Identity = 205/230 (89.13%), Positives = 215/230 (93.48%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQS 60
           MA SLHSTFLKSQISIPIP S A A+VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQS
Sbjct: 28  MASSLHSTFLKSQISIPIPTSTATAAVAVSFRVRCGPRDNRGPLVKGRTLSIEAIQAIQS 87

Query: 61  LKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELG 120
           LKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+RCALALEVFAVIRSEY AELG
Sbjct: 88  LKRAERSDPTKLQQVLSTTLSRLLKADLVATLKELLRQERCALALEVFAVIRSEYRAELG 147

Query: 121 LYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRI 180
           LYAEVA ALSRNGAAEEIDRLVCDLDG DG+I+W  DDKGLIKLIKAVISG+RRESTVRI
Sbjct: 148 LYAEVAAALSRNGAAEEIDRLVCDLDGRDGVIEWGDDDKGLIKLIKAVISGNRRESTVRI 207

Query: 181 YRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGSL 228
           YRMMRRNGWGS IK DDYM++V+SKGLRR+GE+ELADEINREFQDLVGSL
Sbjct: 208 YRMMRRNGWGSMIKGDDYMIKVMSKGLRRVGEIELADEINREFQDLVGSL 257

BLAST of HG10023363 vs. ExPASy TrEMBL
Match: A0A5A7V0E2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G002110 PE=4 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 5.2e-101
Identity = 204/230 (88.70%), Positives = 214/230 (93.04%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQS 60
           MA SL STFLKSQISIPIP S A A+VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQS
Sbjct: 1   MASSLRSTFLKSQISIPIPTSTATAAVAVSFRVRCGPRDNRGPLVKGRTLSIEAIQAIQS 60

Query: 61  LKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELG 120
           LKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+RCALALEVFAVIRSEY AELG
Sbjct: 61  LKRAERSDPTKLQQVLSTTLSRLLKADLVATLKELLRQERCALALEVFAVIRSEYRAELG 120

Query: 121 LYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRI 180
           LYAEVA ALSRNGAAEEIDRLVCDLDG DG+I+W  DDKGLIKLIKAVISG+RRESTVRI
Sbjct: 121 LYAEVAAALSRNGAAEEIDRLVCDLDGRDGVIEWGDDDKGLIKLIKAVISGNRRESTVRI 180

Query: 181 YRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGSL 228
           YRMMRRNGWGS IK DDYM++V+SKGLRR+GE+ELADEINREFQDLVGSL
Sbjct: 181 YRMMRRNGWGSMIKGDDYMIKVMSKGLRRVGELELADEINREFQDLVGSL 230

BLAST of HG10023363 vs. ExPASy TrEMBL
Match: A0A0A0K6N6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447050 PE=4 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 1.2e-100
Identity = 204/229 (89.08%), Positives = 213/229 (93.01%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAAS-VAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQS 60
           MA SLHSTFLKSQISIPIP S A S VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQS
Sbjct: 1   MASSLHSTFLKSQISIPIPASTATSPVAVSFRVRCGPRDNRGPLVKGRTLSIEAIQAIQS 60

Query: 61  LKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSEYGAELG 120
           LKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+RCALALEVFAVI+SEY AELG
Sbjct: 61  LKRAERSDPTKLQQVLSTTLSRLLKADLVATLKELLRQERCALALEVFAVIKSEYRAELG 120

Query: 121 LYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRI 180
           LYAEVA ALSRNGAAEEIDRLV DLDGGDG+I+W  DDKGLIKLIKAVISG+RRESTVRI
Sbjct: 121 LYAEVAAALSRNGAAEEIDRLVSDLDGGDGVIEWGDDDKGLIKLIKAVISGNRRESTVRI 180

Query: 181 YRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS 227
           YRMMRR GWGS IKADDYM++VLSKGLRRLGE+ELADEINREF+DLVGS
Sbjct: 181 YRMMRRKGWGSMIKADDYMIKVLSKGLRRLGEIELADEINREFEDLVGS 229

BLAST of HG10023363 vs. TAIR 10
Match: AT3G27750.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: Vacuolar sorting protein 9 (VPS9) domain (TAIR:AT5G09320.1); Has 106 Blast hits to 106 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 4; Plants - 102; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 186.4 bits (472), Expect = 2.5e-47
Identity = 115/228 (50.44%), Positives = 158/228 (69.30%), Query Frame = 0

Query: 1   MALSLHSTFLKSQISIPIPVSGAASVAVSLP------VRCGPRDNRGPLVKGRTLSTEAI 60
           MALSL  T        P  +S + +++V +P      +RCGPRDNRGPL+KGR LSTEAI
Sbjct: 1   MALSLSQT-------RPPSLSHSHTLSVIVPKRTFVSIRCGPRDNRGPLLKGRILSTEAI 60

Query: 61  QAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAALKELLRQDRCALALEVFAVIR 120
           Q+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQD C LA+ V + +R
Sbjct: 61  QSIQSLKRAHRTGVS-----LSLTLRPLRRLIKSDLISVLRELLRQDYCTLAVHVLSTLR 120

Query: 121 SEY-GAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDR 180
           +EY   +L LYA++  AL+RN   +EIDRL+ ++DG D   + DDK L KLI+AV+  +R
Sbjct: 121 TEYPPLDLVLYADIVNALTRNKEFDEIDRLIGEIDGIDQ--RSDDKALAKLIRAVVGAER 180

Query: 181 RESTVRIYRMMRRNGWGS-TIKADDYMVRVLSKGLRRLGEMELADEIN 218
           RES VR+Y +MR +GWGS + +AD+Y+  VLSKGL RLGE +LA +++
Sbjct: 181 RESVVRVYTLMRESGWGSESWEADEYVAEVLSKGLLRLGEPDLASQVS 214

BLAST of HG10023363 vs. TAIR 10
Match: AT5G09320.1 (Vacuolar sorting protein 9 (VPS9) domain )

HSP 1 Score: 86.7 bits (213), Expect = 2.7e-17
Identity = 68/206 (33.01%), Positives = 105/206 (50.97%), Query Frame = 0

Query: 39  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSTTLSRLLK 98
           NR PL +GR LS EAIQA+Q+LKRA                S    L +V+ +   RLLK
Sbjct: 495 NRKPLQRGRMLSIEAIQAVQALKRANPLLPPPPVPSTSTTSSSSALLDRVIISKFRRLLK 554

Query: 99  ADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVC 158
            D+VA L+ELLRQ+ C+LAL+VF  IR E  Y  ++ +Y ++   ++ N   EE++ L  
Sbjct: 555 FDMVAVLRELLRQNECSLALKVFEEIRKEYWYKPQVRMYTDMITVMADNSLMEEVNYLYS 614

Query: 159 DLDGGDGL---IQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRV 218
            +    GL   I+W +     L+  +++    +  +  Y  M+  G+    + D    RV
Sbjct: 615 AMKSEKGLMAEIEWFN----TLLTILLNHKLFDLVMDCYAFMQSIGY----EPDRASFRV 674

Query: 219 LSKGLRRLGEMELADEINREFQDLVG 226
           L  GL   GEM L+  + ++  +  G
Sbjct: 675 LVLGLESNGEMGLSAIVRQDAHEYYG 692

BLAST of HG10023363 vs. TAIR 10
Match: AT3G46870.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 57.8 bits (138), Expect = 1.3e-08
Identity = 53/187 (28.34%), Positives = 99/187 (52.94%), Query Frame = 0

Query: 40  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQD 99
           RGPL +G+ L   EA+  I  LKR  + D  KL + + T + RLLK D++A + EL RQ+
Sbjct: 63  RGPLWRGKKLIGKEALFVILGLKRL-KEDDEKLDKFIKTHVFRLLKLDMLAVIGELERQE 122

Query: 100 RCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDK 159
             ALA+++F VI+ +  Y  ++ +Y ++ V+L+++   +E   L   +   +  +  D +
Sbjct: 123 ETALAIKMFEVIQKQEWYQPDVFMYKDLIVSLAKSKRMDEAMALWEKMKKEN--LFPDSQ 182

Query: 160 GLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEI 219
              ++I+  +        + +Y  M +    S    ++   RVL KGL  L    L +++
Sbjct: 183 TYTEVIRGFLRDGCPADAMNVYEDMLK----SPDPPEELPFRVLLKGL--LPHPLLRNKV 240

Query: 220 NREFQDL 224
            ++F++L
Sbjct: 243 KKDFEEL 240

BLAST of HG10023363 vs. TAIR 10
Match: AT1G62350.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 55.8 bits (133), Expect = 5.1e-08
Identity = 47/177 (26.55%), Positives = 92/177 (51.98%), Query Frame = 0

Query: 49  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFA 108
           +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ 
Sbjct: 1   MSKEGLIAAKELKRLQ-TQSVRLDRFIGSHVSRLLKSDLVSVLAEFQRQNQVFLCMKLYE 60

Query: 109 VIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVI 168
           V+R E  Y  ++  Y ++ + L+RN   +E  ++  DL   +  + +D      L++  +
Sbjct: 61  VVRREIWYRPDMFFYRDMLMMLARNKKVDETKKVWEDLKKEE--VLFDQHTFGDLVRGFL 120

Query: 169 SGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL 224
             +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Sbjct: 121 DNELPLEAMRLYGEMRE----SPDRPLSLPFRVILKGL--VPYPELREKVKDDFLEL 168

The following BLAST results are available for this feature:

BLAST vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038898717.1 | 1.3e-109 | 93.83 | protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida]
XP_022954347.1 | 9.7e-102 | 87.61 | protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata]
XP_023522652.1 | 1.3e-101 | 87.61 | protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] >X...
XP_022992386.1 | 1.3e-101 | 87.61 | protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima]
XP_008462172.1 | 1.7e-101 | 89.13 | PREDICTED: uncharacterized protein LOC103500597 [Cucumis melo]

BLAST vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9LVW6 | 3.5e-46 | 50.44 | Protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=T...
Q9STF9 | 1.9e-07 | 28.34 | Protein THYLAKOID ASSEMBLY 8-like, chloroplastic OS=Arabidopsis thaliana OX=3702...
Q1PFH7 | 7.2e-07 | 26.55 | Pentatricopeptide repeat-containing protein At1g62350 OS=Arabidopsis thaliana OX...

BLAST vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GQS0 | 4.7e-102 | 87.61 | protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1JTE8 | 6.1e-102 | 87.61 | protein THYLAKOID ASSEMBLY 8, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A1S3CGD3 | 8.0e-102 | 89.13 | uncharacterized protein LOC103500597 OS=Cucumis melo OX=3656 GN=LOC103500597 PE=...
A0A5A7V0E2 | 5.2e-101 | 88.70 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0K6N6 | 1.2e-100 | 89.08 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447050 PE=4 SV=1

BLAST vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G27750.1 | 2.5e-47 | 50.44 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT5G09320.1 | 2.7e-17 | 33.01 | Vacuolar sorting protein 9 (VPS9) domain
AT3G46870.1 | 1.3e-08 | 28.34 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G62350.1 | 5.1e-08 | 26.55 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 48..223; e-value: 1.5E-29; score: 104.9
None | No IPR available | PANTHER | PTHR47594:SF3 | PROTEIN THYLAKOID ASSEMBLY 8, CHLOROPLASTIC | coord: 19..224
IPR044190 | Protein THYLAKOID ASSEMBLY 8-like | PANTHER | PTHR47594 | PPR CONTAINING PLANT-LIKE PROTEIN | coord: 19..224
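
To put the InterPro coordinates in context, the span of each hit can be expressed as a fraction of the protein. This is a rough sketch rather than anything produced by the InterPro analysis itself. It assumes the `coord` values are 1-based, inclusive residue positions and that the protein is 227 residues long (the length of the protein sequence listed for this gene); the names `parse_coord` and `hits` are illustrative.

```python
import re

PROTEIN_LENGTH = 227  # residues in the HG10023363 protein sequence

# Alignment coordinates as shown in the InterPro table.
hits = {
    "GENE3D 1.25.40.10": "coord: 48..223",
    "PANTHER PTHR47594:SF3": "coord: 19..224",
    "PANTHER PTHR47594": "coord: 19..224",
}


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a string such as 'coord: 48..223'."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))


for name, text in hits.items():
    start, end = parse_coord(text)
    length = end - start + 1  # inclusive span
    coverage = 100 * length / PROTEIN_LENGTH
    print(f"{name}: residues {start}-{end}, {length} aa, {coverage:.1f}% of protein")
```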

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10023363.1 | HG10023363.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009658 | chloroplast organization
biological_process | GO:0000373 | Group II intron splicing
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003723 | RNA binding