Tan0011430 (gene) Snake gourd v1

Overview
Name: Tan0011430
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein GLE1
Location: LG04: 798402 .. 802508 (+)
RNA-Seq Expression: Tan0011430
Synteny: Tan0011430
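The location field packs the linkage group, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# Matches strings such as "LG04: 798402 .. 802508 (+)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse a 'SEQID: START .. END (STRAND)' string into a Location."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("LG04: 798402 .. 802508 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene span works out to 802508 - 798402 + 1 = 4107 bp.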
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAGGGAAAATGCACAGTACTATGGGAGTATAGAACCTTTAATTAGATTATCATCGTAGTTCCAAACCAAATCGAAGTACATAAATAAATACACATTGAAAAAGAAACATTTCCTTTTCCGTTACCTCCCGCCCTTTTTTTATTTATTTATTTTGTTTGAACACTCCCGCGCTATTTGCTTTGTCGCTCGCGCTGCCCGAATCGATCTCTCTCTCCCGTCGCCTTCTCCGTTTCGGTTCGATTCGATTTTAACTAAATCGTTCAATTTGTAAGCTTTAGGGTTTTGCTCGTGCGTTGGCCGTTGCGTTCCTTCCCTTCCCTTCCTTTCCTTTCGTCCGAAGCTAGGTCTCGCGAGTTTCTTGTCCTTCCAGCCGCCGTCGCGAGTTTATTGTCTCCGTCTCTCCAGATCTCCCTACTGTCTATCTTCTGGTAACAATTACTTAAATTCCAATTTTTCTCACGTTCATTTGGAAGTTTGCTTACACTAATTTCAATTAATGTGTTACTTCTGCTTTTCCCTTCCTTGGCGTTCAAAAATCCTTTTGAGCAGAAATTTAAGAATGAACATAGGACATGAGTTCATATAATACCTCTGTACAAAAAAAAAAAAAAAACTAGATTAACTGACAAGTTTAGTCATGAAGTTGTTTTCGTGAACTTTAAAAACGATTCTTATATGTTCTTGAACTTTTAATGTTTTATATTTAATAGGTCACAAACTTAAAAAAATATCAAATAGGTCTTGAATTTTAATATTGTGTCTATAAATACCTGCCATATTCAACATTTTTTTTAATTAACCAATCTATTATAAATAAAATTGATTTGTGTGTCACTAAACTTTTAAATTTTTTGTCCATTTGTAAATTTTAAAAAAGGTCGAAGAGGTCATAGATCTATTAGGCATATAACTTATTCTTCTCTCTTGTGCACAGCCCCAGATCCACCAAGGGTCTTCCTCTGATTCTTGAAGCTGTTCAAAGTTTACTTATTGAAGCCATGTACGTTTAATATTCTTTGATATTGCAATTCTATTTTTTTCACGTTGATTCTATGGTAGGAGCAAAAATTTAGAAGCTTTTTTTCCTATTCTTTTGGGCATTTTATTGCGTTGTAGACATAGCACCTTTGTCTACATGCGTATTATAGTTGATTTCAAACTAGCATATGTTTATCAAGCATAGGTTAGCTCATTCATTTTATAATGTTGAATGACAGTCCCTCAACAGGATCCTAAGTATCACTTGTGTTAATCTTTTGTTTGTGTTTCATTTTCAGGAGTCCTGTGAAATTGACTCTTAGCTGTCCTTCTAGGGTAGGTCAAGTTACAGTTGACCCACAGCCTGATTTCAGTTTTGACGATTTGCGAGGGGAACTTTATTCCTTGGAGGAGAAGTTAAAGTTGTCTTCAACGCCTTTCACAAAAACATCTTTACGGTATCATATTTTTCTATATTTAAAAAAAACACAACAAATTTATGTGTGCTTCTAGTTTTCTTTGTTAAAGCATTTTTGGTTTATATGTTATCAGAGATTTCCCTGTCACAAAAAATGTGAAGAGAAGTTCCAAGCCTTTTATAATGGGTTTGTATGAGGAGGATGAACTACAAAACAATGCTAAGCGCTTTAATTGTGATGAAATTTTTCTCAGGTATTCTATTTTACTGTCGCTTCCAACTTTCTTTATTTCTTCTCGTAAAGTTACATCTGATTTTATTCAACTTTTGATGACTTTTTCATGTTTTTTAACTTTTCTTCGTCATGTGATGTGCAAAATTACTATAAGCTACTATCTTTTGATTTAAACTATTACATATTTGGCTTTCTGCCATGAAGTTTTAAGGGTTTTCTTGCCTTCTCTAGTGATAGTGAAGATTCTGACAAGGAATCCACTCTTGAAGCATGGTCCTACCGAATGGAGGACAGTTCTTTAGCCAGGCTAACATGTGATCGTCTGCTTAATGTGAAGGTATTTAGGAGAGTATATAATTTATAT
TTCTTTTATTGACGTTTACACATTTTTTAGCATCTAAATATGAATGCAACATACATTTCATTCTTTCTCATGAAATTTTATGTTTATCCCTCGTGTCACTTGTCAACATTTTTTTAGTTAGTTTGTTATTTTGCTTTAGCTGTTACTCGGTTAGTAGATAATACTTTGGGAGGGAAGCTCTTGTAAAAAAGCAACACATCAATATTGAATAAAGATCTTCTTTGATAGTTCTTGGAGTCTTAGATTCTGTATTGCGTCATTTAAGATACTGTGGATGAGATGAATGTGAAATTAGTTAAACTGACGTTTTTTTGGTACAAAAGCTCATTGATTGTATGAATGTAATTTGGATCTTGTCTCCTGAGATGTTTGAAACTACGAAGAATTTTTAAGTCATCGAGTAACAAAGTAAACGGATATCTAGCTAATCTTCCATTCTGTTCTAATGCTTTTCTAAATTTGTTTATAACAGGAGGAAATTAGGAACCAACACGGAAGGTTGGAAACCGACTTTACTACTTTAAATGAAAAGTCAAATGCTGCATTCTCTCAAATTGAGAAACACTATGAGGCAAGACAGCTGATCGGAGACTTGATAGTCAATATCAACGCCAGATGTATGCTTTTCTATTAGTTTTGTTTGGTCATCTTCCTTAATTTTTAATTTAGGAAATCTTTTTCTGTCATATACCTTTATTTAATCTTTCATAAGCTCCTATTGGTAACAAAAGATTAATATGTCATATATTCTATTGTATATAATTTTAAAATAAGAGTTTAGGTAATGAATTCCACTAATGTATTTCCTGTGTTTCTTCTCTTTCTAACAATGATATCTACTTTCTTCTTTTTTCCCCCTAGTCAATTGTATGACACCATTCTAACCTTGAATTGGCAATTTTTGCTTCTCCATACTCGAACCTAACCAACTAGATTTTTAAAATCATTTTTTACAATTTAGATTTTTTTTTAATTAGTTCTTTGTGCTTTTCTATTCTTAGCCACTTATGAATGATTGCGTAGAAGAAGCCTCTGCTAACATTGCAATTTTACTTGTGCTTATCCTTTACCTTCGATTTTAGTGCTGAAGGTCTCGATAAGTACTTGTCTACTGTTCAACGGCACCACGAACAAATATCACAAAGAGAAGAGAGAAAAATTAGAAGTGATGCGGCTTTTGAAGAAGCCAAAAGAAAGGAGAAGATCATTCTAGAAGAAAAAAGCATCAAGAAAAGGTTAAAGCAGAAGTCGAGGTTTGTTGTCAACTATTTGAAAATGATGAATGTTTTTTCTTGTTTGTTATTTGTTTTTGTGTTTGTCTTTTTTTTTCATAAGTGCGTTGCTCATTTTACTGGTACATTTGGCTTTCGATAAAAAAATAATAATAATAAAACAATTTTTTCTTGGGTTGTTGGCTTGTAATCATTTCTACTGGCACGACTCTATCAACTTTCCTTTGATTTGGTATGAGGATCATTGATCGATAATCTATCATGTTTTTAAATAATTTTATCAGGGGGTTAATTCCACCTCCCAATAGTTTCGGTTTGCTGGACATTATGTGTCCTTGATAATGAAAGGACTTCATCATTTACTTTTTGAGTCATCTGGACATTTTGTGCTTCTTGGAAAAGCCCACCATTGAAGCAAGGCGTCTAAGCCCACCTATTTATTATTTTTATCGTTGGAAAAGCTAAGAGATTTCTAAAAAAATGTTCATTTTTCTTTGCAATCTCTAATCAAAGCTCCCATTTCTCAAAACTTTTCATTATCTTATTATTCATAATATTATTTTTAATCATATATAAAAATTTATTTATATTTTTCTATGTGCACCTGACAAAATATAAAGCGTTTGCATTTTTTTCCACCTTGCACTTAAGCCCCGAAGGAGTGTTGCACTTTACTGCATCTCACACATTTAAAAACACTGGTGCAGGCAAAAGCCAGAGCTGAGGAGGCAAAGAAAGTTGCCATAGAAGCTGAGAAGAGAGCAATGA
AAGAAGCAGCTGAAAGGGAAGCTACTGAAAACTTAAAAAAGGTTGATGTTGTACAAGCACAAGAAACTACTGTTGGGGCTGCTAAGTACCAAACCAGTAAACTGTAA

mRNA sequence

ATGGATAGGGAAAATGCACACCCCAGATCCACCAAGGGTCTTCCTCTGATTCTTGAAGCTGTTCAAAGTTTACTTATTGAAGCCATGAGTCCTGTGAAATTGACTCTTAGCTGTCCTTCTAGGGTAGGTCAAGTTACAGTTGACCCACAGCCTGATTTCAGTTTTGACGATTTGCGAGGGGAACTTTATTCCTTGGAGGAGAAGTTAAAGTTGTCTTCAACGCCTTTCACAAAAACATCTTTACGAGATTTCCCTGTCACAAAAAATGTGAAGAGAAGTTCCAAGCCTTTTATAATGGGTTTGTATGAGGAGGATGAACTACAAAACAATGCTAAGCGCTTTAATTGTGATGAAATTTTTCTCAGTGATAGTGAAGATTCTGACAAGGAATCCACTCTTGAAGCATGGTCCTACCGAATGGAGGACAGTTCTTTAGCCAGGCTAACATGTGATCGTCTGCTTAATGTGAAGGAGGAAATTAGGAACCAACACGGAAGGTTGGAAACCGACTTTACTACTTTAAATGAAAAGTCAAATGCTGCATTCTCTCAAATTGAGAAACACTATGAGGCAAGACAGCTGATCGGAGACTTGATAGTCAATATCAACGCCAGATGTCTCGATAAGTACTTGTCTACTGTTCAACGGCACCACGAACAAATATCACAAAGAGAAGAGAGAAAAATTAGAAGTGATGCGGCTTTTGAAGAAGCCAAAAGAAAGGAGAAGATCATTCTAGAAGAAAAAAGCATCAAGAAAAGGTTAAAGCAGAAGTCGAGAGCTGAGGAGGCAAAGAAAGTTGCCATAGAAGCTGAGAAGAGAGCAATGAAAGAAGCAGCTGAAAGGGAAGCTACTGAAAACTTAAAAAAGGTTGATGTTGTACAAGCACAAGAAACTACTGTTGGGGCTGCTAAGTACCAAACCAGTAAACTGTAA

Coding sequence (CDS)

ATGGATAGGGAAAATGCACACCCCAGATCCACCAAGGGTCTTCCTCTGATTCTTGAAGCTGTTCAAAGTTTACTTATTGAAGCCATGAGTCCTGTGAAATTGACTCTTAGCTGTCCTTCTAGGGTAGGTCAAGTTACAGTTGACCCACAGCCTGATTTCAGTTTTGACGATTTGCGAGGGGAACTTTATTCCTTGGAGGAGAAGTTAAAGTTGTCTTCAACGCCTTTCACAAAAACATCTTTACGAGATTTCCCTGTCACAAAAAATGTGAAGAGAAGTTCCAAGCCTTTTATAATGGGTTTGTATGAGGAGGATGAACTACAAAACAATGCTAAGCGCTTTAATTGTGATGAAATTTTTCTCAGTGATAGTGAAGATTCTGACAAGGAATCCACTCTTGAAGCATGGTCCTACCGAATGGAGGACAGTTCTTTAGCCAGGCTAACATGTGATCGTCTGCTTAATGTGAAGGAGGAAATTAGGAACCAACACGGAAGGTTGGAAACCGACTTTACTACTTTAAATGAAAAGTCAAATGCTGCATTCTCTCAAATTGAGAAACACTATGAGGCAAGACAGCTGATCGGAGACTTGATAGTCAATATCAACGCCAGATGTCTCGATAAGTACTTGTCTACTGTTCAACGGCACCACGAACAAATATCACAAAGAGAAGAGAGAAAAATTAGAAGTGATGCGGCTTTTGAAGAAGCCAAAAGAAAGGAGAAGATCATTCTAGAAGAAAAAAGCATCAAGAAAAGGTTAAAGCAGAAGTCGAGAGCTGAGGAGGCAAAGAAAGTTGCCATAGAAGCTGAGAAGAGAGCAATGAAAGAAGCAGCTGAAAGGGAAGCTACTGAAAACTTAAAAAAGGTTGATGTTGTACAAGCACAAGAAACTACTGTTGGGGCTGCTAAGTACCAAACCAGTAAACTGTAA

Protein sequence

MDRENAHPRSTKGLPLILEAVQSLLIEAMSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTKNVKRSSKPFIMGLYEEDELQNNAKRFNCDEIFLSDSEDSDKESTLEAWSYRMEDSSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYEARQLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIILEEKSIKKRLKQKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVGAAKYQTSKL
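The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the use of `X` for unrecognised codons are illustrative choices, not something the page specifies.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0.

    Whitespace is ignored. Codons containing characters other than
    A/C/G/T are rendered as 'X'. If to_stop is True, translation ends
    at the first stop codon and the stop is not included.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGATAGGGAAAATGCACACCCCAGATCC"))
print(translate("AGTAAACTGTAA"))
```

Passing the full CDS string to `translate` should reproduce the listed protein if the annotation uses the standard code and frame 0.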
Homology
BLAST of Tan0011430 vs. ExPASy Swiss-Prot
Match: Q0WPZ7 (Protein GLE1 OS=Arabidopsis thaliana OX=3702 GN=GLE1 PE=1 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.7e-22
Identity = 96/284 (33.80%), Positives = 149/284 (52.46%), Query Frame = 0

Query: 38  CPSRVGQVTVDPQPDFSFDDLRGELYSLEEKL---KLSSTPFTKTSLRDFPVTKNVKRSS 97
           CP  V  +++DP+P+++F+ L  E+ S+E+KL    +   P T T+LR       + R  
Sbjct: 9   CPKSVDGISIDPEPNWNFESLVAEIASVEKKLNGFSMYPQPITNTTLR-------MGRRG 68

Query: 98  KPFIMGLYEEDELQNN--------------------AKRFNCDEIFLSDSEDS--DKEST 157
             F+M +  EDE++++                     KRF CDE++LSD  D   D E  
Sbjct: 69  GGFVMHV-SEDEMESDEGEESDDEEEEEDHSQICTAGKRFACDELYLSDESDEEFDHEPE 128

Query: 158 LEAWSYRMEDSSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYEAR 217
                  + +S+L  +  D    +K++IRNQ   +ET+     E S +A +++EK+ E R
Sbjct: 129 YMMNKLGLAESALYEVINDHQTEIKDDIRNQVSVVETEIMNEIETSLSAIARVEKYSETR 188

Query: 218 QLIG---DLIVNIN-ARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIILEE 277
           + +    DL      A  LD +L+ VQR H+  SQ EERKIRS+ A EEA+RKE+   EE
Sbjct: 189 KEVERKLDLQYQRKVAEALDTHLTAVQREHKIKSQIEERKIRSEEAQEEARRKERAHQEE 248

Query: 278 KSIKKRLK------QKSRAEEAKKVAIEAEKRAMKEAAEREATE 287
           K  +++ +       K RAEE KK   E E++A +E AE+E  +
Sbjct: 249 KIRQEKARAEAQMLAKIRAEEEKK---EVERKAAREVAEKEVAD 281

BLAST of Tan0011430 vs. NCBI nr
Match: KAG6601557.1 (Protein GLE1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 356.3 bits (913), Expect = 2.7e-94
Identity = 221/329 (67.17%), Positives = 241/329 (73.25%), Query Frame = 0

Query: 9   RSTKGLPLILEAVQSLLIEAMSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEK 68
           R  KG  L+   ++SLLIEAMSPVKLTL CPSRVGQVT DPQPDFSFDDLR ELYSLEEK
Sbjct: 3   RRVKGTGLLSICIRSLLIEAMSPVKLTLRCPSRVGQVTADPQPDFSFDDLRVELYSLEEK 62

Query: 69  LKLSSTPFTKTSL--------RDFPVTKNVKRSSKPFIMGLYEEDELQN----------- 128
           LK S+TPFTKT          RDFP+ K  KRSSKPF+MG+Y EDELQN           
Sbjct: 63  LKTSTTPFTKTCSRICGLYVNRDFPLIKTTKRSSKPFVMGVY-EDELQNIFNDGEVVCDQ 122

Query: 129 --NAKRFNCDEIFLSDSEDSDKESTLEAWSYRMED-----SSLARLTCDRLLNVKEEIRN 188
             NAKRFNCD  FLSDSEDSD ESTLE  ++ MED     SSLA+LT D LLN KEEIRN
Sbjct: 123 GSNAKRFNCDGCFLSDSEDSDNESTLETRAHLMEDVDLVESSLAQLTDDHLLNTKEEIRN 182

Query: 189 QHGRLETDFTTLNEKSNAAFSQIEKHYEAR----QLIGDLIVNINARCLDKYLSTVQRHH 248
           Q GRLET+ TTLNEKS+AA SQIEK+YEAR    + +        A  LDKYL+TVQRHH
Sbjct: 183 QLGRLETNLTTLNEKSSAAASQIEKYYEARREADRRLDTQYQREIAEGLDKYLTTVQRHH 242

Query: 249 EQISQREERKIRSDAAFEEAKRKEKIILEEK----SIKKRLKQKSRAEEAKKVAIEAEKR 304
           EQISQREERKIRSDAAFEEAKRKEK +LEEK      K   + K++AEEA K AIEAE R
Sbjct: 243 EQISQREERKIRSDAAFEEAKRKEKAMLEEKKRIEKAKAEAEAKAKAEEAMKAAIEAESR 302

BLAST of Tan0011430 vs. NCBI nr
Match: KAA0034248.1 (protein GLE1 [Cucumis melo var. makuwa] >TYK15672.1 protein GLE1 [Cucumis melo var. makuwa])

HSP 1 Score: 353.2 bits (905), Expect = 2.3e-93
Identity = 212/314 (67.52%), Positives = 240/314 (76.43%), Query Frame = 0

Query: 15  PLILEAVQSLLIEAMSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSST 74
           P ++EAVQSLLI+ MSPVKLTL CPS++GQV VDP PDFSFDDLR EL+SLEEKL  S+ 
Sbjct: 33  PPVIEAVQSLLIDTMSPVKLTLRCPSKIGQVIVDPDPDFSFDDLRLELHSLEEKLNKSTM 92

Query: 75  PFTKTSLRDFPVTKNVKRSSKPFIMGLYEEDELQ------------NNAKRFNCDEIFLS 134
           PF KT  RDFPVTK +KRSSKPFIMG+Y EDEL+            +NA RFNCD IFLS
Sbjct: 93  PFKKTCSRDFPVTKTLKRSSKPFIMGVY-EDELEEIFSDEVVCDPSSNANRFNCDGIFLS 152

Query: 135 DSEDSDKESTLEAWSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEK 194
           DSEDSD ESTLEA +Y  ED     SSLA+LT D +LN+KEEIRNQ GRLETD TTLNEK
Sbjct: 153 DSEDSDNESTLEAQAYLKEDMDLVESSLAKLTHDHMLNIKEEIRNQLGRLETDLTTLNEK 212

Query: 195 SNAAFSQIEKHYEAR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDA 254
           S+AA SQIEK+YEAR    + +        A  LDKYL+TVQ HHEQISQREERKIRSDA
Sbjct: 213 SSAAISQIEKYYEARREADRRLDTQYQREIAEGLDKYLTTVQHHHEQISQREERKIRSDA 272

Query: 255 AFEEAKRKEKIILEEKSIKKRLK----QKSRAEEAKKVAIEAEKRAMKEAAEREATENLK 304
           AFEEAKRKEK ILE+K  +++LK     K++AEEA K AIEAE+RA KEAAE EA ENLK
Sbjct: 273 AFEEAKRKEKAILEDKKRQEKLKAEAEAKAKAEEAMKAAIEAERRAAKEAAETEAAENLK 332

BLAST of Tan0011430 vs. NCBI nr
Match: XP_022997611.1 (protein GLE1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 350.5 bits (898), Expect = 1.5e-92
Identity = 211/301 (70.10%), Positives = 229/301 (76.08%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPSRVGQVT DPQPDFSFDDLRGELYSLEEKLK S+TPFTKT  RDFP+ K
Sbjct: 1   MSPVKLTLRCPSRVGQVTADPQPDFSFDDLRGELYSLEEKLKTSTTPFTKTCSRDFPLIK 60

Query: 89  NVKRSSKPFIMGLYEEDELQN-------------NAKRFNCDEIFLSDSEDSDKESTLEA 148
             KRSSKPF+MG+Y EDELQN             NAKRFNCD  FLSDSEDSD E+TLE 
Sbjct: 61  TTKRSSKPFVMGVY-EDELQNIFSDGEVVCDQGSNAKRFNCDGCFLSDSEDSDNEATLET 120

Query: 149 WSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYE 208
            ++ MED     SSLA+LT D LLN KEEIRNQ GRLET+ TTLNEKS+AA SQIEK+YE
Sbjct: 121 RAHLMEDVDLVESSLAQLTYDHLLNTKEEIRNQLGRLETNLTTLNEKSSAAASQIEKYYE 180

Query: 209 AR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIIL 268
           AR    + +        A  LDKYL+TVQRHHEQISQREERKIRSDAAFEEAKRKEK +L
Sbjct: 181 ARREADRRLDTQYQREIAEGLDKYLTTVQRHHEQISQREERKIRSDAAFEEAKRKEKAML 240

Query: 269 EEK----SIKKRLKQKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVG 304
           EEK      K   + K++AEEA K AIEAE RAMKE AEREA ENLKKVD VQAQET VG
Sbjct: 241 EEKKRLEKAKAEAEAKAKAEEAMKAAIEAESRAMKEVAEREAVENLKKVDAVQAQETIVG 300

BLAST of Tan0011430 vs. NCBI nr
Match: XP_023544608.1 (protein GLE1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 347.8 bits (891), Expect = 9.5e-92
Identity = 211/301 (70.10%), Positives = 228/301 (75.75%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPSRVGQVT DPQPDFSFDDLR ELYSLEEKLK S+TPFTKT  RDFP+ K
Sbjct: 1   MSPVKLTLRCPSRVGQVTADPQPDFSFDDLRVELYSLEEKLKTSTTPFTKTCSRDFPLIK 60

Query: 89  NVKRSSKPFIMGLYEEDELQN-------------NAKRFNCDEIFLSDSEDSDKESTLEA 148
             KRSSKPF+MG+Y EDELQN             NAKRFNCD  FLSDSEDSD ESTLE 
Sbjct: 61  TTKRSSKPFVMGVY-EDELQNIFNDGEVVCDQGSNAKRFNCDGCFLSDSEDSDNESTLET 120

Query: 149 WSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYE 208
            ++ MED     SSLA+LT D LLN KEEIRNQ GRLET+ TTLNEKS+AA SQIEK+YE
Sbjct: 121 RAHLMEDVDLVESSLAQLTDDHLLNTKEEIRNQLGRLETNLTTLNEKSSAAASQIEKYYE 180

Query: 209 AR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIIL 268
           AR    + +        A  LDKYL+TVQRHHEQISQREERKIRSDAAFEEAKRKEK +L
Sbjct: 181 ARREADRRLDTQYQREIAEGLDKYLTTVQRHHEQISQREERKIRSDAAFEEAKRKEKAML 240

Query: 269 EEK----SIKKRLKQKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVG 304
           EEK      K   + K++AEEA K AIEAE RAMKE AEREA ENLKKVD VQAQET VG
Sbjct: 241 EEKKRLEKAKAEAEAKAKAEEAMKAAIEAESRAMKEVAEREAAENLKKVDAVQAQETIVG 300

BLAST of Tan0011430 vs. NCBI nr
Match: XP_022957181.1 (protein GLE1 isoform X1 [Cucurbita moschata])

HSP 1 Score: 345.1 bits (884), Expect = 6.2e-91
Identity = 210/301 (69.77%), Positives = 227/301 (75.42%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPSRVGQVT DPQPDFSFDDLR ELYSLEEKLK S+TPFTKT  RDFP+ K
Sbjct: 1   MSPVKLTLRCPSRVGQVTADPQPDFSFDDLRVELYSLEEKLKTSTTPFTKTCSRDFPLIK 60

Query: 89  NVKRSSKPFIMGLYEEDELQN-------------NAKRFNCDEIFLSDSEDSDKESTLEA 148
             KRSSKPF+MG+Y EDELQN             NAKRFNCD  FLSDSEDSD ESTL  
Sbjct: 61  TTKRSSKPFVMGVY-EDELQNIFNDGEVVCDQGSNAKRFNCDGCFLSDSEDSDNESTLGT 120

Query: 149 WSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYE 208
            ++ MED     SSLA+LT D LLN KEEIRNQ GRLET+ TTLNEKS+AA SQIEK+YE
Sbjct: 121 RAHLMEDVDLVESSLAQLTDDHLLNTKEEIRNQLGRLETNLTTLNEKSSAAASQIEKYYE 180

Query: 209 AR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIIL 268
           AR    + +        A  LDKYL+TVQRHHEQISQREERKIRSDAAFEEAKRKEK +L
Sbjct: 181 ARREADRRLDTQYQREIAEGLDKYLTTVQRHHEQISQREERKIRSDAAFEEAKRKEKAML 240

Query: 269 EEK----SIKKRLKQKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVG 304
           EEK      K   + K++AEEA K AIEAE RAMKE AEREA ENLKKVD VQAQET VG
Sbjct: 241 EEKKRIEKAKAEAEAKAKAEEAMKAAIEAESRAMKEVAEREAAENLKKVDAVQAQETIVG 300

BLAST of Tan0011430 vs. ExPASy TrEMBL
Match: A0A5A7SUC8 (Protein GLE1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G001690 PE=3 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 1.1e-93
Identity = 212/314 (67.52%), Positives = 240/314 (76.43%), Query Frame = 0

Query: 15  PLILEAVQSLLIEAMSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSST 74
           P ++EAVQSLLI+ MSPVKLTL CPS++GQV VDP PDFSFDDLR EL+SLEEKL  S+ 
Sbjct: 33  PPVIEAVQSLLIDTMSPVKLTLRCPSKIGQVIVDPDPDFSFDDLRLELHSLEEKLNKSTM 92

Query: 75  PFTKTSLRDFPVTKNVKRSSKPFIMGLYEEDELQ------------NNAKRFNCDEIFLS 134
           PF KT  RDFPVTK +KRSSKPFIMG+Y EDEL+            +NA RFNCD IFLS
Sbjct: 93  PFKKTCSRDFPVTKTLKRSSKPFIMGVY-EDELEEIFSDEVVCDPSSNANRFNCDGIFLS 152

Query: 135 DSEDSDKESTLEAWSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEK 194
           DSEDSD ESTLEA +Y  ED     SSLA+LT D +LN+KEEIRNQ GRLETD TTLNEK
Sbjct: 153 DSEDSDNESTLEAQAYLKEDMDLVESSLAKLTHDHMLNIKEEIRNQLGRLETDLTTLNEK 212

Query: 195 SNAAFSQIEKHYEAR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDA 254
           S+AA SQIEK+YEAR    + +        A  LDKYL+TVQ HHEQISQREERKIRSDA
Sbjct: 213 SSAAISQIEKYYEARREADRRLDTQYQREIAEGLDKYLTTVQHHHEQISQREERKIRSDA 272

Query: 255 AFEEAKRKEKIILEEKSIKKRLK----QKSRAEEAKKVAIEAEKRAMKEAAEREATENLK 304
           AFEEAKRKEK ILE+K  +++LK     K++AEEA K AIEAE+RA KEAAE EA ENLK
Sbjct: 273 AFEEAKRKEKAILEDKKRQEKLKAEAEAKAKAEEAMKAAIEAERRAAKEAAETEAAENLK 332

BLAST of Tan0011430 vs. ExPASy TrEMBL
Match: A0A6J1K7Y4 (protein GLE1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492491 PE=3 SV=1)

HSP 1 Score: 350.5 bits (898), Expect = 7.1e-93
Identity = 211/301 (70.10%), Positives = 229/301 (76.08%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPSRVGQVT DPQPDFSFDDLRGELYSLEEKLK S+TPFTKT  RDFP+ K
Sbjct: 1   MSPVKLTLRCPSRVGQVTADPQPDFSFDDLRGELYSLEEKLKTSTTPFTKTCSRDFPLIK 60

Query: 89  NVKRSSKPFIMGLYEEDELQN-------------NAKRFNCDEIFLSDSEDSDKESTLEA 148
             KRSSKPF+MG+Y EDELQN             NAKRFNCD  FLSDSEDSD E+TLE 
Sbjct: 61  TTKRSSKPFVMGVY-EDELQNIFSDGEVVCDQGSNAKRFNCDGCFLSDSEDSDNEATLET 120

Query: 149 WSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYE 208
            ++ MED     SSLA+LT D LLN KEEIRNQ GRLET+ TTLNEKS+AA SQIEK+YE
Sbjct: 121 RAHLMEDVDLVESSLAQLTYDHLLNTKEEIRNQLGRLETNLTTLNEKSSAAASQIEKYYE 180

Query: 209 AR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIIL 268
           AR    + +        A  LDKYL+TVQRHHEQISQREERKIRSDAAFEEAKRKEK +L
Sbjct: 181 ARREADRRLDTQYQREIAEGLDKYLTTVQRHHEQISQREERKIRSDAAFEEAKRKEKAML 240

Query: 269 EEK----SIKKRLKQKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVG 304
           EEK      K   + K++AEEA K AIEAE RAMKE AEREA ENLKKVD VQAQET VG
Sbjct: 241 EEKKRLEKAKAEAEAKAKAEEAMKAAIEAESRAMKEVAEREAVENLKKVDAVQAQETIVG 300

BLAST of Tan0011430 vs. ExPASy TrEMBL
Match: A0A6J1GYI2 (protein GLE1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458652 PE=3 SV=1)

HSP 1 Score: 345.1 bits (884), Expect = 3.0e-91
Identity = 210/301 (69.77%), Positives = 227/301 (75.42%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPSRVGQVT DPQPDFSFDDLR ELYSLEEKLK S+TPFTKT  RDFP+ K
Sbjct: 1   MSPVKLTLRCPSRVGQVTADPQPDFSFDDLRVELYSLEEKLKTSTTPFTKTCSRDFPLIK 60

Query: 89  NVKRSSKPFIMGLYEEDELQN-------------NAKRFNCDEIFLSDSEDSDKESTLEA 148
             KRSSKPF+MG+Y EDELQN             NAKRFNCD  FLSDSEDSD ESTL  
Sbjct: 61  TTKRSSKPFVMGVY-EDELQNIFNDGEVVCDQGSNAKRFNCDGCFLSDSEDSDNESTLGT 120

Query: 149 WSYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYE 208
            ++ MED     SSLA+LT D LLN KEEIRNQ GRLET+ TTLNEKS+AA SQIEK+YE
Sbjct: 121 RAHLMEDVDLVESSLAQLTDDHLLNTKEEIRNQLGRLETNLTTLNEKSSAAASQIEKYYE 180

Query: 209 AR----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIIL 268
           AR    + +        A  LDKYL+TVQRHHEQISQREERKIRSDAAFEEAKRKEK +L
Sbjct: 181 ARREADRRLDTQYQREIAEGLDKYLTTVQRHHEQISQREERKIRSDAAFEEAKRKEKAML 240

Query: 269 EEK----SIKKRLKQKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVG 304
           EEK      K   + K++AEEA K AIEAE RAMKE AEREA ENLKKVD VQAQET VG
Sbjct: 241 EEKKRIEKAKAEAEAKAKAEEAMKAAIEAESRAMKEVAEREAAENLKKVDAVQAQETIVG 300

BLAST of Tan0011430 vs. ExPASy TrEMBL
Match: A0A1S3BEB8 (protein GLE1 OS=Cucumis melo OX=3656 GN=LOC103488947 PE=3 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 1.1e-88
Identity = 203/300 (67.67%), Positives = 228/300 (76.00%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPS++GQV VDP PDFSFDDLR EL+SLEEKL  S+ PF KT  RDFPVTK
Sbjct: 1   MSPVKLTLRCPSKIGQVIVDPDPDFSFDDLRLELHSLEEKLNKSTMPFKKTCSRDFPVTK 60

Query: 89  NVKRSSKPFIMGLYEEDELQ------------NNAKRFNCDEIFLSDSEDSDKESTLEAW 148
            +KRSSKPFIMG+Y EDEL+            +NA RFNCD IFLSDSEDSD ESTLEA 
Sbjct: 61  TLKRSSKPFIMGVY-EDELEEIFSDEVVCDPSSNANRFNCDGIFLSDSEDSDNESTLEAQ 120

Query: 149 SYRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYEA 208
           +Y  ED     SSLA+LT D +LN+KEEIRNQ GRLETD TTLNEKS+AA SQIEK+YEA
Sbjct: 121 AYLKEDMDLVESSLAKLTHDHMLNIKEEIRNQLGRLETDLTTLNEKSSAAISQIEKYYEA 180

Query: 209 R----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIILE 268
           R    + +        A  LDKYL+TVQ HHEQISQREERKIRSDAAFEEAKRKEK ILE
Sbjct: 181 RREADRRLDTQYQREIAEGLDKYLTTVQHHHEQISQREERKIRSDAAFEEAKRKEKAILE 240

Query: 269 EKSIKKRLK----QKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVGA 304
           +K  +++LK     K++AEEA K AIEAE+RA KEAAE EA ENLKKVD VQ QET VG+
Sbjct: 241 DKKRQEKLKAEAEAKAKAEEAMKAAIEAERRAAKEAAETEAAENLKKVDTVQVQETMVGS 299

BLAST of Tan0011430 vs. ExPASy TrEMBL
Match: A0A0A0KS46 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G593410 PE=3 SV=1)

HSP 1 Score: 332.4 bits (851), Expect = 2.0e-87
Identity = 199/299 (66.56%), Positives = 226/299 (75.59%), Query Frame = 0

Query: 29  MSPVKLTLSCPSRVGQVTVDPQPDFSFDDLRGELYSLEEKLKLSSTPFTKTSLRDFPVTK 88
           MSPVKLTL CPS++GQVTVDP PDFSFDDLR EL+SLEEKL  S+ PF KT  RDFPVTK
Sbjct: 1   MSPVKLTLRCPSKIGQVTVDPDPDFSFDDLRVELHSLEEKLNKSTMPFKKTCSRDFPVTK 60

Query: 89  NVKRSSKPFIMGLYEED-----------ELQNNAKRFNCDEIFLSDSEDSDKESTLEAWS 148
            +KRS KPFIMG+YE++           E  +NA RFNCD IFLSDSEDSD +ST EA +
Sbjct: 61  TLKRSFKPFIMGVYEDELKEIFNDEVVREPSSNANRFNCDGIFLSDSEDSDNDSTPEAQA 120

Query: 149 YRMED-----SSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYEAR 208
           Y  ED     SSLA LT D +LN+KEEIRNQ GRLETD TTLNEKS+AA SQIEK+YEAR
Sbjct: 121 YLKEDMDLVESSLAELTHDHMLNIKEEIRNQLGRLETDLTTLNEKSSAAISQIEKYYEAR 180

Query: 209 ----QLIGDLIVNINARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIILEE 268
               + +        A  LDKYL+TVQ HHEQISQREERKIRSDAAFEEAKRKEK ILE+
Sbjct: 181 READRRLDTQYQREIAEGLDKYLTTVQHHHEQISQREERKIRSDAAFEEAKRKEKAILED 240

Query: 269 KSIKKRLK----QKSRAEEAKKVAIEAEKRAMKEAAEREATENLKKVDVVQAQETTVGA 304
           K  +++LK     K++AEEA K AIEAE+RA KEAAEREA ENLKKV+ VQ QET VG+
Sbjct: 241 KKRQEKLKAEAEAKAKAEEAMKAAIEAERRATKEAAEREAAENLKKVNNVQVQETMVGS 299

BLAST of Tan0011430 vs. TAIR 10
Match: AT1G13120.1 (no description available)

HSP 1 Score: 108.2 bits (269), Expect = 1.2e-23
Identity = 96/284 (33.80%), Positives = 149/284 (52.46%), Query Frame = 0

Query: 38  CPSRVGQVTVDPQPDFSFDDLRGELYSLEEKL---KLSSTPFTKTSLRDFPVTKNVKRSS 97
           CP  V  +++DP+P+++F+ L  E+ S+E+KL    +   P T T+LR       + R  
Sbjct: 9   CPKSVDGISIDPEPNWNFESLVAEIASVEKKLNGFSMYPQPITNTTLR-------MGRRG 68

Query: 98  KPFIMGLYEEDELQNN--------------------AKRFNCDEIFLSDSEDS--DKEST 157
             F+M +  EDE++++                     KRF CDE++LSD  D   D E  
Sbjct: 69  GGFVMHV-SEDEMESDEGEESDDEEEEEDHSQICTAGKRFACDELYLSDESDEEFDHEPE 128

Query: 158 LEAWSYRMEDSSLARLTCDRLLNVKEEIRNQHGRLETDFTTLNEKSNAAFSQIEKHYEAR 217
                  + +S+L  +  D    +K++IRNQ   +ET+     E S +A +++EK+ E R
Sbjct: 129 YMMNKLGLAESALYEVINDHQTEIKDDIRNQVSVVETEIMNEIETSLSAIARVEKYSETR 188

Query: 218 QLIG---DLIVNIN-ARCLDKYLSTVQRHHEQISQREERKIRSDAAFEEAKRKEKIILEE 277
           + +    DL      A  LD +L+ VQR H+  SQ EERKIRS+ A EEA+RKE+   EE
Sbjct: 189 KEVERKLDLQYQRKVAEALDTHLTAVQREHKIKSQIEERKIRSEEAQEEARRKERAHQEE 248

Query: 278 KSIKKRLK------QKSRAEEAKKVAIEAEKRAMKEAAEREATE 287
           K  +++ +       K RAEE KK   E E++A +E AE+E  +
Sbjct: 249 KIRQEKARAEAQMLAKIRAEEEKK---EVERKAAREVAEKEVAD 281

The following BLAST results are available for this feature:

vs. ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q0WPZ7          1.7e-22  33.80         Protein GLE1 OS=Arabidopsis thaliana OX=3702 GN=GLE1 PE=1 SV=1

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
KAG6601557.1    2.7e-94  67.17         Protein GLE1, partial [Cucurbita argyrosperma subsp. sororia]
KAA0034248.1    2.3e-93  67.52         protein GLE1 [Cucumis melo var. makuwa] >TYK15672.1 protein GLE1 [Cucumis melo v...
XP_022997611.1  1.5e-92  70.10         protein GLE1 isoform X1 [Cucurbita maxima]
XP_023544608.1  9.5e-92  70.10         protein GLE1 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022957181.1  6.2e-91  69.77         protein GLE1 isoform X1 [Cucurbita moschata]

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5A7SUC8      1.1e-93  67.52         Protein GLE1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G001690 P...
A0A6J1K7Y4      7.1e-93  70.10         protein GLE1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492491 PE=3 SV=1
A0A6J1GYI2      3.0e-91  69.77         protein GLE1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458652 PE=3 SV=1
A0A1S3BEB8      1.1e-88  67.67         protein GLE1 OS=Cucumis melo OX=3656 GN=LOC103488947 PE=3 SV=1
A0A0A0KS46      2.0e-87  66.56         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G593410 PE=3 SV=1

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G13120.1     1.2e-23  33.80         null

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term  Source Description  Alignment
None       No IPR available  COILS    Coil         Coil                coord: 252..281
None       No IPR available  COILS    Coil         Coil                coord: 157..177
IPR012476  GLE1-like         PANTHER  PTHR12960    GLE-1-RELATED       coord: 50..298
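The alignment column gives residue ranges on the protein sequence. A minimal sketch of turning a `coord:` entry into a subsequence follows, assuming the ranges are 1-based and inclusive (an assumption; the page does not say). `parse_coord` and `extract` are hypothetical helper names.

```python
import re
from typing import Tuple


def parse_coord(text: str) -> Tuple[int, int]:
    """Parse 'coord: START..END' into a (start, end) pair of integers."""
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", text)
    if match is None:
        raise ValueError(f"no START..END range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def extract(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq, treating both ends as 1-based inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("range falls outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


start, end = parse_coord("coord: 252..281")
print(start, end, end - start + 1)      # span length of the first coil region
print(extract("ABCDEFGHIJ", 2, 4))      # toy sequence, residues 2..4
```

With the protein sequence listed earlier bound to a variable, `extract(protein, *parse_coord("coord: 50..298"))` would return the region assigned to PTHR12960 under the same coordinate assumption.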

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0011430.1  Tan0011430.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016973      poly(A)+ mRNA export from nucleus
biological_process  GO:0006446      regulation of translational initiation
biological_process  GO:0006449      regulation of translational termination
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0016020      membrane
cellular_component  GO:0044614      nuclear pore cytoplasmic filaments
cellular_component  GO:0005643      nuclear pore
molecular_function  GO:0000822      inositol hexakisphosphate binding
molecular_function  GO:0005543      phospholipid binding
molecular_function  GO:0031369      translation initiation factor binding