Cp4.1LG04g01840 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG04g01840
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: sec-independent protein translocase protein TATA, chloroplastic
Location: Cp4.1LG04: 230256 .. 237179 (+)
RNA-Seq Expression: Cp4.1LG04g01840
Synteny: Cp4.1LG04g01840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CATCAAACCGATAAACTTTCCAATTCCCAGAATCAAGATTTTCCCCTCAACGACGCCCGCCATGGTAATCTCATCTGCAACTTTATCCTTCACATCTTCCATTCCCAGGCTTCCATCGTTTTCCCCTCCGTTTTCCTCCTCCAAATCCTCCTTCTTCGCCAACAATGCCACCGCCAATTCCTTCAACAAAGGCGACCATAGCCATTCTTTGCTATTGCGAACCACCAGGTCCAGAAAGACGATGGATAGAGCTCATAAGGGTCTCACTTGTAACGCTCTGTTTGGACTTGGAGTGCCTGAGCTTGTTGTAATTGCCGGCGTTGCCGCTCTGGTTTTTGGCCCGAAGAAGTTGCCTGAAGTTGGGAAGAGTATTGGAAAAACGGTCAAGAGCTTCCAACAGGTATGTTTATGTGTGGTTTTAATCAATTGTGTCAGTTTGGTTGCTCGGAAATTGTTGATTGTTAGGGATTTGATATCGGGAAGAGTTTTTTTCCACTGGATTTGGATAGTATGTGGTGTAGATGTTTGGAAATTGATTATTTGATGATTGTTGTTTCGTATAGTCGGATTAGGGAAGATTTTTACTGAATTTGAAATGAGATTGAATTGATGCAAAGTACGTTTTAGCTAACTAGAAAGCTTATCTACTTTTTCATGGGACTTGAACAACAATAGTTTGGAATTAGATGAAGCCTTGGTTTCGTGCCTTTTTCTGGATGTAAATAGAAATGCTTCACTTAAAAACAGTCCTATCAAAATCTGTTGTAAGTTTTCATGAAGATGTATTGTCATACCACTCCTTTAGTTCATAACCCTTTTAGCTCATATTCTCATTTATTTGGTTCATGTATGAGAATTAGTATTGATATGCCTATATAAAGGCTTGTCTAGCATTGCTTTGAGGTGTGGAGTCATTGTTTGAAGTCCCACATTGGTTGGAGAGGAGAACAAAACACCCTTTATAAGGGTGTGGAAACCTTCCCCTAGCAGATGCGTTTTAAAGCCTTGAGGGGAAACCCGAAAGGGAGAGCTCAAAGAGGACAATATCTGCTAGCGGTGGATCTGGGCTGTTACAAATGGTATCAGAGCCAGGCACCGGACTATGTGCTAGCGAGGAGGCTGTTCCCCAAAGGGGGGTAGACACGAGGCGGTGTGCCAGTAAGGACGCTGGCCCCAAAGGGGGTGGATTTGGTGGCGGTCCCACATCGATTGAAGAAAGGAACGAGTGCCAGTGAGGACGCTGGGCCCCGAAGGGGGGTGGATTGTGATGTCCCACATTGGTTGGAGAGGAGAACAAAATACCCTTTATAAGGGCGTGGAAACCTTCCCCTAACAAACACGTTTTAAAGCCTTGAGGGGAATCCCGGAGAGCTCAAAGAGGACAATATCTGCTAGCGGTAGATCTGGGCTATTACACATTGTCATTTTGTATTTGCCTAGAGGCAATTGGGAGGAACTTTGNGAATTAGTATTGATATGCCTATATAAAGGCTTGTCTAGCATTGCTTTGAGGTGTGGAGTCATTGTTTGAAGTCCCACATTGGTTGGAGAGGAGAACAAAACACCCTTTATAAGGGTGTGGAAACCTTCCCCTAGCAGATGCGTTTTAAAGCCTTGAGGGGAAACCCGAAAGGGAGAGCTCAAAGAGGACAATATCTGCTAGCGGTGGATCTGGGCTGTTACAAATGGTATCAGAGCCAGGCACCGGACTATGTGCTAGCGAGGAGGCTGTTCCCCAAAGGGGGGTAGACACGAGGCGGTGTGCCAGTAAGGACGCTGGCCCCAAAGGGGGTGGATTTGGTGGCGGTCCCACATCGATTGAAGAAAGGAACGAGTGCCAGTGAGGACGCTGGGCCCCGAAGGGGGGTGGATTGTGATGTCCCACATTGGTTGGAGAGGAGAACAAAATACCCTTTATAAGGGCGTGGAAACCTTCCCCTAACAAACACGTTTTAAAGCCTTGAGGGGAATCCCGGAGAGCTCAAAGAGGACAATATCTG
CTAGCGGTAGATCTGGGCTATTACACATTGTCATTTTGTATTTGCCTAGAGGCAATTGGGAGGAACTTTGTTGAGAGACTTTGTTGTAGTGTGAGAGTGTGAGAGATACACTTGTAAACACTTTGGTTATAGTGTGATTGACTACCAACCGTGGATGTAGGAGAATTTATTCCCTCCAATCCACGTAAATCTTTGTGTCTCGTTGTTTAATTTTCTGTTTGTGAGTGTGGGTGCAACCGATCTTGCTTTCGCTTACGCTACAACAAATGGGTATCAAAACATTTTGGTTGTGGCGCTTGGGTACAACAAGTGGTATCATAGAATTGAAAGTTGTGGCATTACGGCACAACAAAGTCATTCCGAACATTTAAAATGAATCAAATAATACTTCTGTAACTGTCTATGAACCTGTTGTTCTGGCTCAGCTATCGTGATAAAAATGGACTTAAATAGGTGGGAACTCCACCCTCTTTCTTGAGAGGACCTTTAGCATGTTGAATAAGAAAAGAAGAAAAGTTCTTGGAAATCTCAATGGCATGATCGTAAGCTGTCGTACAACCTACCCTGTGTGCTTCTCACATAGTTTGTGTATTTTGTGCAAAAAAGTTGGAAGACCATACCCATCTGTTCTGTTCCTTTAGTCAGAAGCCTTGATGTTCTTGTGCTGAAGTGTTCAATCTGGTGGAGGGTCTTTTAGTGTCTATCACACGGAACCTTTGCCAACCCCATCCTTTTAAGCGGAAAGTAAATATGTTTTGGAATTATATTATCAAAGCGGTATGTGGGAGGTGTGGTTGGACACTGGTTGGTGTAGTATGGTAAAGGGTGAAGTATGACAATTAGAATCTATAAAACAAAGGAGTGGGAAATAAAGATTCAACGAGATAGTAAACAAACGGAATCTTTAAGGGTCGGGAATCATAGATTCAACCAGATAAACCGGGGAAACATTATTTTTACTTGGTTCCGTTTTCTTAAAGAAGCAATTTCATTCACAAAAACAGAGGAGAAGTGCCTTGATCCGAAGCATTTGCAATAAACATTTCCATTTGGACTAATGATTGTTGGTATGATATTCTTGATACATTGAAGAAGACGATTAAGGAGGATTTCATGTTGGATCCTATCCATGTAGAAATGGGGAGATATTGAGATGTCTTGCTGCAGAATTGGCAAAAGATTTGTGCGTCTACGTTGGATGGTGACTGTAGGTGACTTTGCTATGAACTTCGAAGATTGGAATGAAAAAAAAAAATTATTAGCCAGCGGAGATGTTACTTTGTAGGTGATTAAAAGTGTTACCTTGTTATGGAGGTTTATTCATTAATAGGTGGATGAAGGAGGTCTCAAACCATCGGAGAATAACGTGCATATTCTATTTCTCATGCCGTGCTATGAAAAGGAGAAACAATGCAAAACGTTCACCCTATCTTTTCTCCAATCTTCGGGGGTAGTTGCTGCTGAAAGTCAGGAAGGGTGTTTGATATAATGCCACTACAAGTGTACACACGTTACACTCATGATAAAAAGAAGCAGTCACGTATGGAACCCATTAGTGGAATTGTCTTCCACCTTCATTTTGATTTTTAGTTATTCCCTTGTTTTAGATTAAGAAAATTTCTCAAATATTGGCTAATCCTTACTCCCTCCCACAACAACACTGTTAAGGGCTCAAGTTTGGGATCGAGTTATGGATCTCTCGTAAGTGGAGGTAGACAGTGTTTGTGTGGGGGAAGAAGTAGAATTCTTCTTTCGACCTATTTTGAATAATTGTAAGGCTCAAATAGAGGGAACCTTAAACCTTTGAGTTAAAGTACTCAAGACTAAATTGGAAATTTTTAGGTACCATCTTCATTGATGTTTATGACTATATGTACCAAGTTGTATAATAATCATTGTTACATCAATGGAACCATGACTCAATCATGGAGGATCATCCTCCTAGAACAAAGGAAATAACTAAAAATAGAAATGAAAATGGAATCCTTACAACCATTGTT
ATATCAATAGAGCCATGACTCAATCATGGAGGATCATCTTCAACTAACACCAGCTCTTCAGGAAAGGAGGAGAAAAAGAGGAGGAAAATGAAGGAAGAAAATGAAGAACAGTGTTCTTCAAAAGTTTTAATAAAGAACATAAACCTGCATATGTTCTGCAAAAATCAAACTTGGAAGTAATGGATAAAAGGGCAGTAAAATCTTTGTATACCTTTATTCATATTGGCTGGTGGGAGGTGTTCAGAAGATCAGTGGGATTGTTAACATTATGTATGAAAAGATAATAAGACTATAAGAGTTACAATTGAGACGTTTGTTGTAATAACGTTGAACCACAAAGGTTGCAGGAAAAATTTAAGAGTATCTTAGCAGTTACTCTTAGCTTGAATGTGAGCGGTCGAACATCAATGAGGTTAAGAGTATCTTAGCAGTTACTAATTCAACCATCGAAGTAATAACCAAATGCCTCAAAAGTGCTTTTTTCGGTACTTAAACCGTTAGTTTTTAATCGAAATCCATCCCCCATATCCTTCAATGAACTTTTGATGACTATGCTTGTATCAAACCAGTTCTCAATCTTCAGATTTAAATCACCAACCAATTGCCATTTTCCCAACACGCAGAAATCAATGCTTTATCAGTTATTAAGGAATTGATCTGCACTTAAATTTTAAAGTATGCTTCTGCTGAAGTTTTGATATCCTTCCAAGAAATATGCACAAACAGCTGACTTACCACCAAGAAACTTTCAAAATCCATTCTAACCACTTCATGAGGTTTTGTGTAAGACTATTGGATATTTCTGATGTTGTTTGTTAATTGTTATCTTATTTATTTATTTAATGAATTGATTGGATGTTCTTTTTATGTTTAAAGGTAGAAATTTACAATAAAGGTATCAAATATTCATAATTTGTCGAAATATATGCATATGTAAGGGGGGAAAAATTAACATGTCGTGTCCTACTTTTTTGAAAATCGATGTAATGATATTTCCATGTCGTATAGTTTCCAATGTCGTGTGTCCATGTCTGTGCTGCTCTATAGGATGGAAGTTGGTCATTTCAAGTTATTTTTCGCCAAAATCGATGCTTGATAAAGAATTAAAAAAAATGACACTGAAACCGAAGATTCGGAAGTTGTGTGAATCACACATTTAGTTTACACTAACCAATCGAGTTTGTTAATTCAGTTTCGTTGTTGTCTTATTTTTTCAAATTTTGGATTAGACCCCTATTGCTCTTAGGCCAACCTTCGGAGCCATTTACGTTTGAGCATAACATTCTTTCTTATCAAATTGTGTATCCCTAAATAACGAATAGCCGGAATCATGGCATCATTTACCCAAAGTACATTGGAGCCCCCATCTCAGAGACAAAACAAACTAGAATTATTCCAATTATGATTATCTGAATTGAATTGAAGTCTAGCTTTCTTTCTTTCTTTCTTTCTTTTTTTGTAAAATTATCAATTGCTCTGAAGTTTTATATTTCCCAAGGAATTTTCATTTCTTTCTAAACTTAGTAGTCTATATGTTATCGATATTTTATGATTACAAAGATGAAGTTTATGTAATACCTTTTTTATTTTAGAGTTATGAAAGGGAACTTCATATCTTGGACAAATTGAATATTGAGGCCTTGTAGTTTGTGTGCTACGTACCATATGAACCAACATTTTAAGTTTGCTGTTGTTATGAGTTCCCACAGGGTTCACTAGTTAGGTTCTTGCTAATTTCATTAACCTTGGCTATATTTGGAAGTGTCAAATGCACTTTTGACATGGACAAAAGAACTTTTGACTCGAAATGTTTAAAAAAAATCATGTAGGAGAGTAATATAAAATGCTTTTTGAAAAATTGGAAAGTGCCTTGAAATACTTTGTTGTAGAGCGTTATGGTCTATGTGCTTGATTGGGAAGTCCTTCCAAAGTACTTTTAAGAATGCATAACCAATTTTACTTTACTTTTGCTCATTGTACTTTTTTAAAACCATTTCCAA
ATACGCTTTTAAGCATTTGTTAGTGAGTTAGAACATAATATGGGCAACGTAGGAGACATCTCTATTTTTTCCCTCGGGAATCCATAGAAGTACCATTGCTTAGTTTTCAATTTACCTCTGACCATCAAATGAAATGTGAGTAACCCGTCTTCCTCTCTTTGAAACAAGGTACGTTTCTAGTTCTGTCGGTGCTTGAAACAATTTAGTTTTCTCCATATTACTATATCTCTAGAGTCAATAATTTTATATGAGGAACTACTGAACTCAATCACTTGCTGCCGTTACTNTTCTCTTTGAGGTCTATTATGTGGAGGTACTCAAATTGGATCAATGATTGATTTCTAGGTTTCGTCGTAAACCTAATTGGTTACTTCTAATAGTTCAAGTGAATCAAGGATAAAGTAGCTTCAAGTTCGTTTATGAAACATGTTTTCAATCTTTCTTATTGATTTTCAAGCTCGAGGAAAACTTCAATTGCTCAATGGTTAAAGAATCAGCTAAACGAGGTTATCTGCTTTTGAGTTTCCTAGTAGGACTTATGTTTCTAATATCATTGCAGGCAGCTAAGGAGTTCGAGTCCGAGCTGAAGAAAGAGCCCGAGTCAATCGGAGAGACCGCAGTAGAAAAGCCTACATCAGTAGATGATGAGGAGAAGCAAGATTTGAAGGTATCAAACCAAAAGGAGAGTGTATGAAGCGGTAGGTAGTACCATCAATTTCATGTCCACTTGTTCCTATCATTGTACTTTATGTATGTGAATGTGTCTTTGGAATCTGAAGATCACTTAATGAAGAAAATTAGTTTTCCAGTGCTGGTCTCTAATAGCTATCATATTCCACTCTAATTAAATCAAAAGGAATTATGAGCTACAAGCTCAAGAACAACCAGCAAGGCCGATCCAAGAGGGCAGCTTTTTGCCTTACT

mRNA sequence

CATCAAACCGATAAACTTTCCAATTCCCAGAATCAAGATTTTCCCCTCAACGACGCCCGCCATGGTAATCTCATCTGCAACTTTATCCTTCACATCTTCCATTCCCAGGCTTCCATCGTTTTCCCCTCCGTTTTCCTCCTCCAAATCCTCCTTCTTCGCCAACAATGCCACCGCCAATTCCTTCAACAAAGGCGACCATAGCCATTCTTTGCTATTGCGAACCACCAGGTCCAGAAAGACGATGGATAGAGCTCATAAGGGTCTCACTTGTAACGCTCTGTTTGGACTTGGAGTGCCTGAGCTTGTTGTAATTGCCGGCGTTGCCGCTCTGGTTTTTGGCCCGAAGAAGTTGCCTGAAGTTGGGAAGAGTATTGGAAAAACGGTCAAGAGCTTCCAACAGGCAGCTAAGGAGTTCGAGTCCGAGCTGAAGAAAGAGCCCGAGTCAATCGGAGAGACCGCAGTAGAAAAGCCTACATCAGTAGATGATGAGGAGAAGCAAGATTTGAAGGTATCAAACCAAAAGGAGAGTGTATGAAGCGGTAGGTAGTACCATCAATTTCATGTCCACTTGTTCCTATCATTGTACTTTATGTATGTGAATGTGTCTTTGGAATCTGAAGATCACTTAATGAAGAAAATTAGTTTTCCAGTGCTGGTCTCTAATAGCTATCATATTCCACTCTAATTAAATCAAAAGGAATTATGAGCTACAAGCTCAAGAACAACCAGCAAGGCCGATCCAAGAGGGCAGCTTTTTGCCTTACT

Coding sequence (CDS)

ATGGTAATCTCATCTGCAACTTTATCCTTCACATCTTCCATTCCCAGGCTTCCATCGTTTTCCCCTCCGTTTTCCTCCTCCAAATCCTCCTTCTTCGCCAACAATGCCACCGCCAATTCCTTCAACAAAGGCGACCATAGCCATTCTTTGCTATTGCGAACCACCAGGTCCAGAAAGACGATGGATAGAGCTCATAAGGGTCTCACTTGTAACGCTCTGTTTGGACTTGGAGTGCCTGAGCTTGTTGTAATTGCCGGCGTTGCCGCTCTGGTTTTTGGCCCGAAGAAGTTGCCTGAAGTTGGGAAGAGTATTGGAAAAACGGTCAAGAGCTTCCAACAGGCAGCTAAGGAGTTCGAGTCCGAGCTGAAGAAAGAGCCCGAGTCAATCGGAGAGACCGCAGTAGAAAAGCCTACATCAGTAGATGATGAGGAGAAGCAAGATTTGAAGGTATCAAACCAAAAGGAGAGTGTATGA

Protein sequence

MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKTMDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFESELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV
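
The protein sequence above should correspond to a frame-0 translation of the CDS under the standard genetic code. The following is a minimal sketch of that check, assuming NCBI translation table 1 applies to this nuclear gene; the function name `translate` and the two short test fragments (the first 30 and last 15 nucleotides of the CDS) are illustrative choices, not part of the database record.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Unknown codons (e.g. containing N) are reported as X.
        aa = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS above (hypothetical variable names).
cds_start = "ATGGTAATCTCATCTGCAACTTTATCCTTC"  # first 10 codons
cds_end = "AAGGAGAGTGTATGA"                   # last 4 codons plus stop

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in the same way would allow the complete 157-residue protein to be compared against the listed sequence.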
Homology
BLAST of Cp4.1LG04g01840 vs. ExPASy Swiss-Prot
Match: Q9XH46 (Sec-independent protein translocase protein TATA, chloroplastic OS=Pisum sativum OX=3888 GN=TATA PE=1 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 2.5e-27
Identity = 87/154 (56.49%), Positives = 107/154 (69.48%), Query Frame = 0

Query: 7   TLSFTSS--IP-RLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKTMDR 66
           TLS +SS  IP RLP+ S     S  SF ++N+         ++ SLLL+  R +    R
Sbjct: 4   TLSISSSSVIPTRLPNSS---CYSNLSFLSSNS---------NTSSLLLKKARIK---TR 63

Query: 67  AHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFESELK 126
             KG TCNA FGLGVPELVVIAGVAALVFGPKKLPEVG+SIG+TVKSFQQAAKEFE+ELK
Sbjct: 64  TTKGFTCNAFFGLGVPELVVIAGVAALVFGPKKLPEVGRSIGQTVKSFQQAAKEFETELK 123

Query: 127 KEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 158
           KEP    E +V       ++EKQ++KVS+ K++V
Sbjct: 124 KEPNPTEEISV-----ASEQEKQEIKVSSTKDNV 137
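
The percentages in each HSP header are the matched column counts divided by the alignment length (gaps included). A minimal sketch of that arithmetic, using the counts reported for the Q9XH46 hit; the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of alignment columns, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the Q9XH46 HSP header: 87 identical and
# 107 similar columns out of 154 aligned columns.
identity = percent(87, 154)
similarity = percent(107, 154)

print(identity, similarity)
```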

BLAST of Cp4.1LG04g01840 vs. ExPASy Swiss-Prot
Match: Q9LKU2 (Sec-independent protein translocase protein TATA, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TATA PE=1 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 9.6e-27
Identity = 90/157 (57.32%), Positives = 105/157 (66.88%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           M  S ATL   SS P  P   P  SSS+SSFF+N  T  +     ++ SL+    R R+ 
Sbjct: 1   MATSVATL---SSPP--PVSLPLLSSSRSSFFSNCFTVTT---RPNTRSLVAIGRRIRQE 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
             R  K LTCNALFGLGVPEL VIAGVAAL+FGPKKLPE+GKSIGKTVKSFQQAAKEFES
Sbjct: 61  PTR--KPLTCNALFGLGVPELAVIAGVAALLFGPKKLPEIGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 158
           ELK EPE     + +  TS  +EEK+    S+ KE+V
Sbjct: 121 ELKTEPEESVAESSQVATSNKEEEKKTEVSSSSKENV 147

BLAST of Cp4.1LG04g01840 vs. ExPASy Swiss-Prot
Match: Q9XFJ8 (Sec-independent protein translocase protein TATA, chloroplastic OS=Zea mays OX=4577 GN=TATA PE=1 SV=2)

HSP 1 Score: 103.6 bits (257), Expect = 2.1e-21
Identity = 70/133 (52.63%), Positives = 83/133 (62.41%), Query Frame = 0

Query: 25  SSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKTMDRAHKGLTCNALFGLGVPELVVI 84
           SS  SSF   +        GD +        R R+        L C  LFGLGVPEL VI
Sbjct: 44  SSRASSFVGGSG-------GDLAAVAASVAARPRRAGSGGGGALGCKCLFGLGVPELAVI 103

Query: 85  AGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFESELKKEP-ESIGETAVEKPTSVDDE 144
           AGVAALVFGPK+LPE+G+SIGKTVKSFQQAAKEFE+ELKKEP E   +     PT+V   
Sbjct: 104 AGVAALVFGPKQLPEIGRSIGKTVKSFQQAAKEFETELKKEPGEGGDQPPPATPTAVSGG 163

Query: 145 EKQDLKVSNQKES 157
           E++ L+ S+ KES
Sbjct: 164 EEKGLEASSSKES 169

BLAST of Cp4.1LG04g01840 vs. ExPASy Swiss-Prot
Match: Q75GK3 (Sec-independent protein translocase protein TATA, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=TATA PE=2 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 7.9e-21
Identity = 70/135 (51.85%), Positives = 85/135 (62.96%), Query Frame = 0

Query: 25  SSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKTMDRAHKGLTCNALFGLGVPELVVI 84
           SS  SSF    A       G  + ++  RT        R    + C  LFGLGVPELVVI
Sbjct: 43  SSRASSFVTGGA-------GGLAVAVAARTRAGSGAGSRGGGAMGCKCLFGLGVPELVVI 102

Query: 85  AGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFESELKKEPESIGETAVEKPTSV---D 144
           AGVAALVFGPK+LPE+G+SIGKTVKSFQQAAKEFE+ELKKE +  G+     PT     D
Sbjct: 103 AGVAALVFGPKQLPEIGRSIGKTVKSFQQAAKEFETELKKESDDGGDQP-PPPTETAVSD 162

Query: 145 DEEKQDLKVSNQKES 157
             E+++L+ S+ KES
Sbjct: 163 GGEEKELEASSSKES 169

BLAST of Cp4.1LG04g01840 vs. ExPASy Swiss-Prot
Match: Q31RR1 (Sec-independent protein translocase protein TatA OS=Synechococcus elongatus (strain PCC 7942 / FACHB-805) OX=1140 GN=tatA PE=3 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 1.8e-12
Identity = 41/87 (47.13%), Positives = 60/87 (68.97%), Query Frame = 0

Query: 74  FGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFESELKKE----PESI 133
           FG+G+PE++VI  +A LVFGPKKLPE+G+S+GK ++ FQ A++EFESE+K+E    P + 
Sbjct: 4   FGIGLPEMLVILAIALLVFGPKKLPEIGRSLGKALRGFQDASREFESEIKREIDRTPATP 63

Query: 134 GETAVEKPTSVDDEEKQDLKVSNQKES 157
            E  VE P  +D    + + V  Q E+
Sbjct: 64  AEATVEPPV-LDSAPTEAVTVEKQTET 89

BLAST of Cp4.1LG04g01840 vs. NCBI nr
Match: XP_023529222.1 (sec-independent protein translocase protein TATA, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 290 bits (741), Expect = 2.87e-98
Identity = 157/157 (100.00%), Positives = 157/157 (100.00%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT
Sbjct: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157

BLAST of Cp4.1LG04g01840 vs. NCBI nr
Match: XP_022932561.1 (sec-independent protein translocase protein TATA, chloroplastic [Cucurbita moschata])

HSP 1 Score: 287 bits (735), Expect = 2.36e-97
Identity = 155/157 (98.73%), Positives = 156/157 (99.36%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT
Sbjct: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKP S+DDEEKQDLKVSNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPASIDDEEKQDLKVSNQKESV 157

BLAST of Cp4.1LG04g01840 vs. NCBI nr
Match: KAG6588320.1 (Sec-independent protein translocase protein TATA, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 285 bits (728), Expect = 2.75e-96
Identity = 154/157 (98.09%), Positives = 155/157 (98.73%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISSATLSFTSSIPRLPSFSPPFSSSKSSFF+NN TANSFNKGDHSHS LLRTTRSRKT
Sbjct: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFSNNVTANSFNKGDHSHSSLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157

BLAST of Cp4.1LG04g01840 vs. NCBI nr
Match: KAG7020882.1 (Sec-independent protein translocase protein TATA, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 284 bits (727), Expect = 3.91e-96
Identity = 154/157 (98.09%), Positives = 154/157 (98.09%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISS TLSFTSSIPRLPSFSPPFSSSKSSFFANN TANSFNKGDHSHS LLRTTRSRKT
Sbjct: 1   MVISSVTLSFTSSIPRLPSFSPPFSSSKSSFFANNVTANSFNKGDHSHSSLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157

BLAST of Cp4.1LG04g01840 vs. NCBI nr
Match: XP_022969040.1 (sec-independent protein translocase protein TATA, chloroplastic [Cucurbita maxima])

HSP 1 Score: 283 bits (725), Expect = 7.89e-96
Identity = 154/157 (98.09%), Positives = 154/157 (98.09%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISSATLSFTSSIPRLPSFSPP SSSKSSFFANNATANSFNKGDHSHS LLRTTRSRKT
Sbjct: 1   MVISSATLSFTSSIPRLPSFSPPLSSSKSSFFANNATANSFNKGDHSHSSLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKPTSVDDEEKQDLK SNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKASNQKESV 157

BLAST of Cp4.1LG04g01840 vs. ExPASy TrEMBL
Match: A0A6J1EWP2 (sec-independent protein translocase protein TATA, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111439076 PE=3 SV=1)

HSP 1 Score: 287 bits (735), Expect = 1.14e-97
Identity = 155/157 (98.73%), Positives = 156/157 (99.36%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT
Sbjct: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKP S+DDEEKQDLKVSNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPASIDDEEKQDLKVSNQKESV 157

BLAST of Cp4.1LG04g01840 vs. ExPASy TrEMBL
Match: A0A6J1HYU6 (sec-independent protein translocase protein TATA, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111468153 PE=3 SV=1)

HSP 1 Score: 283 bits (725), Expect = 3.82e-96
Identity = 154/157 (98.09%), Positives = 154/157 (98.09%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           MVISSATLSFTSSIPRLPSFSPP SSSKSSFFANNATANSFNKGDHSHS LLRTTRSRKT
Sbjct: 1   MVISSATLSFTSSIPRLPSFSPPLSSSKSSFFANNATANSFNKGDHSHSSLLRTTRSRKT 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
           MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES
Sbjct: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           ELKKEPESIGETAVEKPTSVDDEEKQDLK SNQKESV
Sbjct: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKASNQKESV 157

BLAST of Cp4.1LG04g01840 vs. ExPASy TrEMBL
Match: A0A0A0LWL1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G573670 PE=3 SV=1)

HSP 1 Score: 225 bits (573), Expect = 5.75e-73
Identity = 127/158 (80.38%), Positives = 135/158 (85.44%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLP-SFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRK 60
           MVISS TLSFTSSIP+LP S SP FSSSKS+FFANNAT + F  GDH+HS LLR T  R 
Sbjct: 1   MVISSPTLSFTSSIPKLPPSLSPSFSSSKSAFFANNATTSFFTYGDHNHSSLLRLTSFRT 60

Query: 61  TMDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE 120
           T    HKG TCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE
Sbjct: 61  TTKTTHKGFTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE 120

Query: 121 SELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           SELKKEPE   ET+VEKPTS + EE+QDLKVSNQKE+V
Sbjct: 121 SELKKEPEPTEETSVEKPTSTEAEERQDLKVSNQKETV 158

BLAST of Cp4.1LG04g01840 vs. ExPASy TrEMBL
Match: A0A1S3BPB0 (sec-independent protein translocase protein TATA, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103492233 PE=3 SV=1)

HSP 1 Score: 224 bits (572), Expect = 8.16e-73
Identity = 126/158 (79.75%), Positives = 136/158 (86.08%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLP-SFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRK 60
           MVIS+ TLSFTSSIP+LP S SP FSSSKS+FFANNAT + F  GDH+HS LLR T  R 
Sbjct: 1   MVISTPTLSFTSSIPKLPPSLSPSFSSSKSAFFANNATVSFFTNGDHNHSSLLRLTSFRT 60

Query: 61  TMDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE 120
           T    HKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE
Sbjct: 61  TTKTTHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE 120

Query: 121 SELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           SELKKEPE   ET+VEKPTS + E++QDLKVSNQKE+V
Sbjct: 121 SELKKEPEPTEETSVEKPTSTEAEKRQDLKVSNQKETV 158

BLAST of Cp4.1LG04g01840 vs. ExPASy TrEMBL
Match: A0A6J1DX34 (sec-independent protein translocase protein TATA, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111024879 PE=3 SV=1)

HSP 1 Score: 214 bits (545), Expect = 6.43e-69
Identity = 128/158 (81.01%), Positives = 135/158 (85.44%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTR-SRK 60
           MVISSATLSFTSS+PR P  SPPFSSSKS+FFAN             H+LLLRTT  +  
Sbjct: 1   MVISSATLSFTSSLPR-PPLSPPFSSSKSAFFAN-------------HALLLRTTTPTTT 60

Query: 61  TMDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE 120
           T  RAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE
Sbjct: 61  THTRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFE 120

Query: 121 SELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 157
           SELKKEPE+IGETAVEKPTSVD EE+QD+KVSNQKESV
Sbjct: 121 SELKKEPEAIGETAVEKPTSVD-EERQDVKVSNQKESV 143

BLAST of Cp4.1LG04g01840 vs. TAIR 10
Match: AT5G28750.1 (Bacterial sec-independent translocation protein mttA/Hcf106 )

HSP 1 Score: 121.3 bits (303), Expect = 6.8e-28
Identity = 90/157 (57.32%), Positives = 105/157 (66.88%), Query Frame = 0

Query: 1   MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT 60
           M  S ATL   SS P  P   P  SSS+SSFF+N  T  +     ++ SL+    R R+ 
Sbjct: 1   MATSVATL---SSPP--PVSLPLLSSSRSSFFSNCFTVTT---RPNTRSLVAIGRRIRQE 60

Query: 61  MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES 120
             R  K LTCNALFGLGVPEL VIAGVAAL+FGPKKLPE+GKSIGKTVKSFQQAAKEFES
Sbjct: 61  PTR--KPLTCNALFGLGVPELAVIAGVAALLFGPKKLPEIGKSIGKTVKSFQQAAKEFES 120

Query: 121 ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV 158
           ELK EPE     + +  TS  +EEK+    S+ KE+V
Sbjct: 121 ELKTEPEESVAESSQVATSNKEEEKKTEVSSSSKENV 147

BLAST of Cp4.1LG04g01840 vs. TAIR 10
Match: AT5G52440.1 (Bacterial sec-independent translocation protein mttA/Hcf106 )

HSP 1 Score: 52.8 bits (125), Expect = 3.0e-07
Identity = 28/61 (45.90%), Positives = 43/61 (70.49%), Query Frame = 0

Query: 72  ALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSF-------QQAAKEFESELKK 126
           +LFG+G PE +VI  VA LVFGPK L EV +++GKT+++F       Q  +++F+S L++
Sbjct: 85  SLFGVGAPEALVIGVVALLVFGPKGLAEVARNLGKTLRTFQPTIRELQDVSRDFKSTLER 144

The following BLAST results are available for this feature:
vs. ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q9XH46          2.5e-27   56.49     Sec-independent protein translocase protein TATA, chloroplastic OS=Pisum sativum...
Q9LKU2          9.6e-27   57.32     Sec-independent protein translocase protein TATA, chloroplastic OS=Arabidopsis t...
Q9XFJ8          2.1e-21   52.63     Sec-independent protein translocase protein TATA, chloroplastic OS=Zea mays OX=4...
Q75GK3          7.9e-21   51.85     Sec-independent protein translocase protein TATA, chloroplastic OS=Oryza sativa ...
Q31RR1          1.8e-12   47.13     Sec-independent protein translocase protein TatA OS=Synechococcus elongatus (str...

vs. NCBI nr
Match Name      E-value   Identity  Description
XP_023529222.1  2.87e-98  100.00    sec-independent protein translocase protein TATA, chloroplastic [Cucurbita pepo ...
XP_022932561.1  2.36e-97  98.73     sec-independent protein translocase protein TATA, chloroplastic [Cucurbita mosch...
KAG6588320.1    2.75e-96  98.09     Sec-independent protein translocase protein TATA, chloroplastic, partial [Cucurb...
KAG7020882.1    3.91e-96  98.09     Sec-independent protein translocase protein TATA, chloroplastic, partial [Cucurb...
XP_022969040.1  7.89e-96  98.09     sec-independent protein translocase protein TATA, chloroplastic [Cucurbita maxim...

vs. ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1EWP2      1.14e-97  98.73     sec-independent protein translocase protein TATA, chloroplastic OS=Cucurbita mos...
A0A6J1HYU6      3.82e-96  98.09     sec-independent protein translocase protein TATA, chloroplastic OS=Cucurbita max...
A0A0A0LWL1      5.75e-73  80.38     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G573670 PE=3 SV=1
A0A1S3BPB0      8.16e-73  79.75     sec-independent protein translocase protein TATA, chloroplastic OS=Cucumis melo ...
A0A6J1DX34      6.43e-69  81.01     sec-independent protein translocase protein TATA, chloroplastic OS=Momordica cha...

vs. TAIR 10
Match Name      E-value   Identity  Description
AT5G28750.1     6.8e-28   57.32     Bacterial sec-independent translocation protein mttA/Hcf106
AT5G52440.1     3.0e-07   45.90     Bacterial sec-independent translocation protein mttA/Hcf106
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                       Source       Source Term    Source Description                                               Alignment
None       No IPR available                                      PRINTS       PR01506        TATBPROTEIN                                                      coord: 73..93, score: 35.53; coord: 93..112, score: 27.65
None       No IPR available                                      GENE3D       1.20.5.3310    -                                                                coord: 75..126, e-value: 1.4E-18, score: 68.1
None       No IPR available                                      MOBIDB_LITE  mobidb-lite    disorder_prediction                                              coord: 119..157
None       No IPR available                                      PANTHER      PTHR33162:SF6  BNAC07G27450D PROTEIN                                            coord: 15..152
None       No IPR available                                      PANTHER      PTHR33162      SEC-INDEPENDENT PROTEIN TRANSLOCASE PROTEIN TATA, CHLOROPLASTIC  coord: 15..152
IPR006312  Sec-independent protein translocase protein TatA/E    TIGRFAM      TIGR01411      TIGR01411                                                        coord: 75..121, e-value: 5.0E-20, score: 68.8
IPR006312  Sec-independent protein translocase protein TatA/E    HAMAP        MF_00236       TatA_E                                                           coord: 73..156, score: 18.540413
IPR003369  Sec-independent protein translocase protein TatA/B/E  PFAM         PF02416        TatA_B_E                                                         coord: 76..125, e-value: 8.0E-19, score: 66.7
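
The `coord` values in the InterPro table can be mapped back onto the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for such annotations, but not stated on this page); the helper name `region` is hypothetical, and the protein string is copied from the Protein sequence section.

```python
# Protein sequence of Cp4.1LG04g01840, split into chunks of up to 60 residues.
PROTEIN = (
    "MVISSATLSFTSSIPRLPSFSPPFSSSKSSFFANNATANSFNKGDHSHSLLLRTTRSRKT"
    "MDRAHKGLTCNALFGLGVPELVVIAGVAALVFGPKKLPEVGKSIGKTVKSFQQAAKEFES"
    "ELKKEPESIGETAVEKPTSVDDEEKQDLKVSNQKESV"
)


def region(seq: str, start: int, end: int) -> str:
    """Return the region start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# PFAM PF02416 (TatA_B_E), coord: 76..125
pfam_domain = region(PROTEIN, 76, 125)
# MOBIDB_LITE disorder prediction, coord: 119..157
disordered_tail = region(PROTEIN, 119, 157)

print(len(PROTEIN), len(pfam_domain), len(disordered_tail))
print(pfam_domain)
```

Under that assumption, the Pfam region covers the stretch that is most strongly conserved in the BLAST alignments above, while the predicted disordered region spans the C-terminal tail.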

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g01840.1  Cp4.1LG04g01840.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051260 protein homooligomerization
biological_process GO:0043953 protein transport by the Tat complex
biological_process GO:0032594 protein transport within lipid bilayer
biological_process GO:0015031 protein transport
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0033281 TAT protein transport complex