Tan0016971 (gene) Snake gourd v1

Overview
Name: Tan0016971
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LEA_2 domain-containing protein
Location: LG07: 67202681 .. 67203733 (-)
RNA-Seq Expression: Tan0016971
Synteny: Tan0016971
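The Location field packs the linkage group, start, end, and strand into a single string. As a minimal sketch, the string can be unpacked and the span length computed as shown here. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG07: 67202681 .. 67203733 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    # Fields: sequence id, start, end, strand
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

seqid, start, end, strand = parse_location("LG07: 67202681 .. 67203733 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 1053 positions on the minus strand of LG07.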
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAGAGCCAAAGCTTAAAACAGACCAACGCCGGCGCATTCCAAAACTGACTCCTTCCATTCACACGAATTTCACTCCAAATCAATTCACTAACCGCTCTTCCACTCTCTCCTATAAAATCCTCTCTTTCTCTCTCTAAAACCAAACCCTCTTCTTCTTCTTCCCCTCCCCTCTGTAACAAACAATGGCCGATGACAACCAGAGCTTTCCCTTAGCGCACTACCAAGCTCACCACAAATCTGACGAAGAGCAACAAAAACTCTCCACATTCAAAACTCTCAAAAAGGAACGATCCAACAAATGCTTCATCTACGTTTTCTCCGCCTTCGTCTTCCTCAGCGTCGCCGTTTTGATCTTCGCTCTCATCGTCCTGCGCGTCAATTCCCCTACCATCCACTTCTCTTCCATCTCCGTCGCTAAGTTTTCAATCTCCAACACGAATTCCTCTTCTCCTTCCATGAATCTGACCTTGATCGCGGAATTCGCCGTCGACAATTCGAACTTCGGTCCGTTTAATTTCGATAACGGCACCGTCGGTCTCATGTACGGCGGCGCCATCGTTGGCGAGAGGAGTACCGGCGGAGGTAGGGCTGAGGCGAAAGGGACGAAGAGTATGAATGTTACTGTTGAAGGTTTCGCGAAGAATGTTAGCGGCGATTTGAATAGTACTTCGGGGATTTTGAATCTGAGTAGCTTCGCGAATTTGAGAGGCAGAGTTCGTTTGATTCATGTTTTCAGGAAGAGGATTTCGTCGGCGATTACTTGTTCGATTAATCTCGATTTGAATACTCATCAAATTCAGCCTGATTGGGTTTGTGAGTGACTAGATTAGAAGAATCGGAATATCATCAATGAACTCAAGAACACTAATTAGTTTACTATTTTTTAAATTCGTTTATTTTTTTTTTCTAATGTAGTCTGAAGTTTGTTACAAACTTACAATTTTGGAGGGGTTTTTTTTCTTTTTCTTTTTCTGCATACGATGTAAAAATATGCATGTTTTAAATTTTGTGCAAGATATTTCAATTAAAAAAATACTCTCAATTCAATCTCTA

mRNA sequence

CAGAGCCAAAGCTTAAAACAGACCAACGCCGGCGCATTCCAAAACTGACTCCTTCCATTCACACGAATTTCACTCCAAATCAATTCACTAACCGCTCTTCCACTCTCTCCTATAAAATCCTCTCTTTCTCTCTCTAAAACCAAACCCTCTTCTTCTTCTTCCCCTCCCCTCTGTAACAAACAATGGCCGATGACAACCAGAGCTTTCCCTTAGCGCACTACCAAGCTCACCACAAATCTGACGAAGAGCAACAAAAACTCTCCACATTCAAAACTCTCAAAAAGGAACGATCCAACAAATGCTTCATCTACGTTTTCTCCGCCTTCGTCTTCCTCAGCGTCGCCGTTTTGATCTTCGCTCTCATCGTCCTGCGCGTCAATTCCCCTACCATCCACTTCTCTTCCATCTCCGTCGCTAAGTTTTCAATCTCCAACACGAATTCCTCTTCTCCTTCCATGAATCTGACCTTGATCGCGGAATTCGCCGTCGACAATTCGAACTTCGGTCCGTTTAATTTCGATAACGGCACCGTCGGTCTCATGTACGGCGGCGCCATCGTTGGCGAGAGGAGTACCGGCGGAGGTAGGGCTGAGGCGAAAGGGACGAAGAGTATGAATGTTACTGTTGAAGGTTTCGCGAAGAATGTTAGCGGCGATTTGAATAGTACTTCGGGGATTTTGAATCTGAGTAGCTTCGCGAATTTGAGAGGCAGAGTTCGTTTGATTCATGTTTTCAGGAAGAGGATTTCGTCGGCGATTACTTGTTCGATTAATCTCGATTTGAATACTCATCAAATTCAGCCTGATTGGGTTTGTGAGTGACTAGATTAGAAGAATCGGAATATCATCAATGAACTCAAGAACACTAATTAGTTTACTATTTTTTAAATTCGTTTATTTTTTTTTTCTAATGTAGTCTGAAGTTTGTTACAAACTTACAATTTTGGAGGGGTTTTTTTTCTTTTTCTTTTTCTGCATACGATGTAAAAATATGCATGTTTTAAATTTTGTGCAAGATATTTCAATTAAAAAAATACTCTCAATTCAATCTCTA

Coding sequence (CDS)

ATGGCCGATGACAACCAGAGCTTTCCCTTAGCGCACTACCAAGCTCACCACAAATCTGACGAAGAGCAACAAAAACTCTCCACATTCAAAACTCTCAAAAAGGAACGATCCAACAAATGCTTCATCTACGTTTTCTCCGCCTTCGTCTTCCTCAGCGTCGCCGTTTTGATCTTCGCTCTCATCGTCCTGCGCGTCAATTCCCCTACCATCCACTTCTCTTCCATCTCCGTCGCTAAGTTTTCAATCTCCAACACGAATTCCTCTTCTCCTTCCATGAATCTGACCTTGATCGCGGAATTCGCCGTCGACAATTCGAACTTCGGTCCGTTTAATTTCGATAACGGCACCGTCGGTCTCATGTACGGCGGCGCCATCGTTGGCGAGAGGAGTACCGGCGGAGGTAGGGCTGAGGCGAAAGGGACGAAGAGTATGAATGTTACTGTTGAAGGTTTCGCGAAGAATGTTAGCGGCGATTTGAATAGTACTTCGGGGATTTTGAATCTGAGTAGCTTCGCGAATTTGAGAGGCAGAGTTCGTTTGATTCATGTTTTCAGGAAGAGGATTTCGTCGGCGATTACTTGTTCGATTAATCTCGATTTGAATACTCATCAAATTCAGCCTGATTGGGTTTGTGAGTGA

Protein sequence

MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFALIVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGLMYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVRLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE
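The protein sequence corresponds to the CDS read in frame from its ATG start codon. The sketch below illustrates that relationship on short fragments of the CDS, assuming the standard nuclear genetic code (the page does not say which table was used). The helper `translate` is a hypothetical name, not part of any database tooling.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCGATGACAACCAG"))  # first six codons of the CDS
print(translate("TGTGAGTGA"))           # last three codons, ending in TGA
```

The two fragments translate to MADDNQ and CE, matching the first and last residues of the protein sequence listed above.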
Homology
BLAST of Tan0016971 vs. ExPASy Swiss-Prot
Match: Q6DST1 (Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana OX=3702 GN=At1g64065 PE=2 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 2.5e-14
Identity = 68/215 (31.63%), Positives = 111/215 (51.63%), Query Frame = 0

Query: 4   DNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFALIVL 63
           D     LA  + + +SDEEQ     ++   +E   KC +Y  +  V +    LI + I L
Sbjct: 3   DEDRITLAPTEIYGRSDEEQSGPRIWRRKTEEPPGKCLVYSLTIIVIIFALCLILSSIFL 62

Query: 64  RVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGLMYGG 123
           R++ P I   SIS      S  NS++P  N TL+++ ++ NSNFG F F++ T+ ++Y  
Sbjct: 63  RISKPEIETRSISTRDLR-SGGNSTNPYFNATLVSDISIRNSNFGAFEFEDSTLRVVYAD 122

Query: 124 -AIVGERSTGGGRAEAKGTKSMN---VTVEGFAKNVSGDLNS--TSGILNLSSFANLRGR 183
             +VGE    G R EA  T  +    V +  F    + DL+     G L L S A +RGR
Sbjct: 123 HGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLLDTKDLDKDLRLGFLELRSVAEVRGR 182

Query: 184 VRLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           ++++   R ++ S ++C++ L+L    IQ + +CE
Sbjct: 183 IKVLGRKRWKV-SVMSCTMRLNLTGRFIQ-NLLCE 214
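Each HSP statistics line reports identity and positives as counts over the alignment length, followed by the percentage. A minimal sketch of recomputing those percentages from the counts is given here. The function name `hsp_percentages` is hypothetical, and the pattern also accepts the spelling "Postives", which appears in some exports of these reports.

```python
import re

# Matches the count/length pairs; "Posi?tives" tolerates the "Postives" spelling.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 68/215 (31.63%), Positives = 111/215 (51.63%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Swiss-Prot hit above, 68/215 and 111/215 reproduce the reported 31.63% and 51.63%.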

BLAST of Tan0016971 vs. NCBI nr
Match: XP_038875090.1 (late embryogenesis abundant protein At1g64065 [Benincasa hispida])

HSP 1 Score: 325.1 bits (832), Expect = 4.5e-85
Identity = 170/212 (80.19%), Positives = 191/212 (90.09%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M +D+QSFPLAHYQAHHKSDEEQQ L+TFKTL+KERSNKCFIYVFS FVFLSVAVLIFAL
Sbjct: 1   MVEDSQSFPLAHYQAHHKSDEEQQ-LATFKTLRKERSNKCFIYVFSTFVFLSVAVLIFAL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGLM 120
           IVLRVNSP+I  SS+S+ KFSI+N NSSSPS+NLT+IAEF VDNSNFGPFNFDNGTVGLM
Sbjct: 61  IVLRVNSPSIQLSSVSIPKFSITNANSSSPSLNLTMIAEFTVDNSNFGPFNFDNGTVGLM 120

Query: 121 YGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVRL 180
           YGGAIVGE+STG GRAEAKG+K MNVT+E  AKN+S D N+  GILNL+SF  LRGRVRL
Sbjct: 121 YGGAIVGEKSTGAGRAEAKGSKRMNVTMEASAKNISSDSNNL-GILNLNSFVKLRGRVRL 180

Query: 181 IHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           IH+FR+R SS ITCS+NLD+NTHQIQ +WVCE
Sbjct: 181 IHIFRRRTSSEITCSMNLDMNTHQIQYNWVCE 210

BLAST of Tan0016971 vs. NCBI nr
Match: KAG6579535.1 (Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 312.8 bits (800), Expect = 2.3e-81
Identity = 165/213 (77.46%), Positives = 188/213 (88.26%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQK-LSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFA 60
           M DD+QSFP+AHYQAHHKSDEEQ + L+TFK LKKERSNKCFIYVFSAFVFLSVAVLIFA
Sbjct: 1   MVDDSQSFPIAHYQAHHKSDEEQHRHLTTFKALKKERSNKCFIYVFSAFVFLSVAVLIFA 60

Query: 61  LIVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGL 120
           LIVLRVNSP +HFSS+SVAKFS+SNTNSSSPS+NLTL A+ AVDNSNFGPFNFD+ +VG 
Sbjct: 61  LIVLRVNSPALHFSSLSVAKFSLSNTNSSSPSLNLTLTAQLAVDNSNFGPFNFDHASVGF 120

Query: 121 MYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVR 180
           +Y GAIVG+ +TG GR +AKGTK+MNVTV   AKN+S D N+ S +LNLSSFANLRGRVR
Sbjct: 121 IYAGAIVGQTTTGAGRTKAKGTKTMNVTVHASAKNISADYNN-SRLLNLSSFANLRGRVR 180

Query: 181 LIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           LIH+FR+R SS I CS+ LDLNTHQIQ +WVCE
Sbjct: 181 LIHIFRRRASSEIGCSMILDLNTHQIQHNWVCE 212

BLAST of Tan0016971 vs. NCBI nr
Match: XP_022969774.1 (uncharacterized protein LOC111468875 [Cucurbita maxima])

HSP 1 Score: 310.8 bits (795), Expect = 8.8e-81
Identity = 163/213 (76.53%), Positives = 191/213 (89.67%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQK-LSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFA 60
           MADD+QSFP+AHY+AHHKSDEEQ++ L+TFK L+KERSNKCFIYVFSAFVFLSVAVLIFA
Sbjct: 1   MADDSQSFPIAHYKAHHKSDEEQRRHLTTFKALQKERSNKCFIYVFSAFVFLSVAVLIFA 60

Query: 61  LIVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGL 120
           LIVLRVNSP +HFSS+SVAKFS+SNTNSSSPS+NLT+ A+ AVDNSNFGPFNFD  +VG 
Sbjct: 61  LIVLRVNSPALHFSSLSVAKFSLSNTNSSSPSLNLTVTAQLAVDNSNFGPFNFDYASVGF 120

Query: 121 MYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVR 180
           +Y GAIVG+ +TG GRA+AKGTK+MNVTV+  A N+S D N+ S +LNLSSFANLRGRVR
Sbjct: 121 IYAGAIVGQSTTGAGRAKAKGTKTMNVTVQASANNISADYNN-SRLLNLSSFANLRGRVR 180

Query: 181 LIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           LIH+FR+R SS I+CS+ LDLNTHQIQ +WVCE
Sbjct: 181 LIHIFRRRASSEISCSMILDLNTHQIQHNWVCE 212

BLAST of Tan0016971 vs. NCBI nr
Match: XP_008437349.1 (PREDICTED: late embryogenesis abundant protein At1g64065 [Cucumis melo])

HSP 1 Score: 303.1 bits (775), Expect = 1.8e-78
Identity = 164/214 (76.64%), Positives = 189/214 (88.32%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M +D+QSFPLAHYQAHHK+DEEQQ L+TFKTL KERSNKCFIY+FS FVFLSVA+LIFAL
Sbjct: 1   MGEDSQSFPLAHYQAHHKTDEEQQ-LATFKTLHKERSNKCFIYIFSTFVFLSVALLIFAL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSISN-TNSSSP-SMNLTLIAEFAVDNSNFGPFNFDNGTVG 120
           IVLRVNSP+I+ S++S+ KFS+SN  NSSSP S++L+  A F VDNSNFGPFNFDNGTVG
Sbjct: 61  IVLRVNSPSINLSAVSIPKFSLSNANNSSSPNSLDLSFSAVFTVDNSNFGPFNFDNGTVG 120

Query: 121 LMYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRV 180
           L+YGG I GERSTGGGRAEAKG+K MNVTVEG AKNVSG    ++GIL+LSSF  LRGRV
Sbjct: 121 LVYGGMIFGERSTGGGRAEAKGSKRMNVTVEGSAKNVSG----SNGILSLSSFVKLRGRV 180

Query: 181 RLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           RLIHVFR+R+SS I+CS+NLDLNTHQIQ +WVCE
Sbjct: 181 RLIHVFRRRVSSEISCSMNLDLNTHQIQHNWVCE 209

BLAST of Tan0016971 vs. NCBI nr
Match: KAA0042716.1 (late embryogenesis abundant protein [Cucumis melo var. makuwa])

HSP 1 Score: 302.8 bits (774), Expect = 2.4e-78
Identity = 164/214 (76.64%), Positives = 189/214 (88.32%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M +D+QSFPLAHYQAHHK+DEEQQ L+TFKTL KERSNKCFIY+FS FVFLSVA+LIFAL
Sbjct: 1   MGEDSQSFPLAHYQAHHKTDEEQQ-LATFKTLHKERSNKCFIYIFSTFVFLSVALLIFAL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSISN-TNSSSP-SMNLTLIAEFAVDNSNFGPFNFDNGTVG 120
           IVLRVNSP+I+ S++S+ KFS+SN  NSSSP S++L+  A F VDNSNFGPFNFDNGTVG
Sbjct: 61  IVLRVNSPSINLSAVSIPKFSLSNANNSSSPNSLDLSFSAVFIVDNSNFGPFNFDNGTVG 120

Query: 121 LMYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRV 180
           L+YGG I GERSTGGGRAEAKG+K MNVTVEG AKNVSG    ++GIL+LSSF  LRGRV
Sbjct: 121 LVYGGMIFGERSTGGGRAEAKGSKRMNVTVEGSAKNVSG----SNGILSLSSFVKLRGRV 180

Query: 181 RLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           RLIHVFR+R+SS I+CS+NLDLNTHQIQ +WVCE
Sbjct: 181 RLIHVFRRRVSSEISCSMNLDLNTHQIQHNWVCE 209

BLAST of Tan0016971 vs. ExPASy TrEMBL
Match: A0A6J1I3M2 (uncharacterized protein LOC111468875 OS=Cucurbita maxima OX=3661 GN=LOC111468875 PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 4.3e-81
Identity = 163/213 (76.53%), Positives = 191/213 (89.67%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQK-LSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFA 60
           MADD+QSFP+AHY+AHHKSDEEQ++ L+TFK L+KERSNKCFIYVFSAFVFLSVAVLIFA
Sbjct: 1   MADDSQSFPIAHYKAHHKSDEEQRRHLTTFKALQKERSNKCFIYVFSAFVFLSVAVLIFA 60

Query: 61  LIVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGL 120
           LIVLRVNSP +HFSS+SVAKFS+SNTNSSSPS+NLT+ A+ AVDNSNFGPFNFD  +VG 
Sbjct: 61  LIVLRVNSPALHFSSLSVAKFSLSNTNSSSPSLNLTVTAQLAVDNSNFGPFNFDYASVGF 120

Query: 121 MYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVR 180
           +Y GAIVG+ +TG GRA+AKGTK+MNVTV+  A N+S D N+ S +LNLSSFANLRGRVR
Sbjct: 121 IYAGAIVGQSTTGAGRAKAKGTKTMNVTVQASANNISADYNN-SRLLNLSSFANLRGRVR 180

Query: 181 LIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           LIH+FR+R SS I+CS+ LDLNTHQIQ +WVCE
Sbjct: 181 LIHIFRRRASSEISCSMILDLNTHQIQHNWVCE 212

BLAST of Tan0016971 vs. ExPASy TrEMBL
Match: A0A1S3ATY3 (late embryogenesis abundant protein At1g64065 OS=Cucumis melo OX=3656 GN=LOC103482793 PE=4 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 8.9e-79
Identity = 164/214 (76.64%), Positives = 189/214 (88.32%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M +D+QSFPLAHYQAHHK+DEEQQ L+TFKTL KERSNKCFIY+FS FVFLSVA+LIFAL
Sbjct: 1   MGEDSQSFPLAHYQAHHKTDEEQQ-LATFKTLHKERSNKCFIYIFSTFVFLSVALLIFAL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSISN-TNSSSP-SMNLTLIAEFAVDNSNFGPFNFDNGTVG 120
           IVLRVNSP+I+ S++S+ KFS+SN  NSSSP S++L+  A F VDNSNFGPFNFDNGTVG
Sbjct: 61  IVLRVNSPSINLSAVSIPKFSLSNANNSSSPNSLDLSFSAVFTVDNSNFGPFNFDNGTVG 120

Query: 121 LMYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRV 180
           L+YGG I GERSTGGGRAEAKG+K MNVTVEG AKNVSG    ++GIL+LSSF  LRGRV
Sbjct: 121 LVYGGMIFGERSTGGGRAEAKGSKRMNVTVEGSAKNVSG----SNGILSLSSFVKLRGRV 180

Query: 181 RLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           RLIHVFR+R+SS I+CS+NLDLNTHQIQ +WVCE
Sbjct: 181 RLIHVFRRRVSSEISCSMNLDLNTHQIQHNWVCE 209

BLAST of Tan0016971 vs. ExPASy TrEMBL
Match: A0A5A7TL68 (Late embryogenesis abundant protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G002140 PE=4 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 1.2e-78
Identity = 164/214 (76.64%), Positives = 189/214 (88.32%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M +D+QSFPLAHYQAHHK+DEEQQ L+TFKTL KERSNKCFIY+FS FVFLSVA+LIFAL
Sbjct: 1   MGEDSQSFPLAHYQAHHKTDEEQQ-LATFKTLHKERSNKCFIYIFSTFVFLSVALLIFAL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSISN-TNSSSP-SMNLTLIAEFAVDNSNFGPFNFDNGTVG 120
           IVLRVNSP+I+ S++S+ KFS+SN  NSSSP S++L+  A F VDNSNFGPFNFDNGTVG
Sbjct: 61  IVLRVNSPSINLSAVSIPKFSLSNANNSSSPNSLDLSFSAVFIVDNSNFGPFNFDNGTVG 120

Query: 121 LMYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRV 180
           L+YGG I GERSTGGGRAEAKG+K MNVTVEG AKNVSG    ++GIL+LSSF  LRGRV
Sbjct: 121 LVYGGMIFGERSTGGGRAEAKGSKRMNVTVEGSAKNVSG----SNGILSLSSFVKLRGRV 180

Query: 181 RLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           RLIHVFR+R+SS I+CS+NLDLNTHQIQ +WVCE
Sbjct: 181 RLIHVFRRRVSSEISCSMNLDLNTHQIQHNWVCE 209

BLAST of Tan0016971 vs. ExPASy TrEMBL
Match: A0A0A0KQT7 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G152160 PE=4 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 4.9e-77
Identity = 163/214 (76.17%), Positives = 185/214 (86.45%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M +D+QSFPLAHYQAHHK +EEQQ L+TFK L+KERSNKCFIY+FS FVFLSVA+LIFAL
Sbjct: 1   MGEDSQSFPLAHYQAHHKPNEEQQ-LATFKILRKERSNKCFIYIFSTFVFLSVALLIFAL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSIS-NTNSSSP-SMNLTLIAEFAVDNSNFGPFNFDNGTVG 120
           IVLRVNSP+I  SSIS  + S+S NTNSSSP S+NL+  AEF VDNSNFGPFNFDNGTVG
Sbjct: 61  IVLRVNSPSISLSSISNPRVSLSNNTNSSSPNSLNLSFNAEFTVDNSNFGPFNFDNGTVG 120

Query: 121 LMYGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRV 180
           L+YGG I GERSTGGGRA AKG+K MNVTVEG AKNVSG    ++GILN SSF  LRGRV
Sbjct: 121 LVYGGMIFGERSTGGGRAGAKGSKRMNVTVEGSAKNVSG----SNGILNFSSFVKLRGRV 180

Query: 181 RLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           RLIH+FR+R+SS I+CS+NLDLNTHQIQ +WVCE
Sbjct: 181 RLIHIFRRRVSSEISCSMNLDLNTHQIQHNWVCE 209

BLAST of Tan0016971 vs. ExPASy TrEMBL
Match: A0A6J1H2C0 (uncharacterized protein LOC111459739 OS=Cucurbita moschata OX=3662 GN=LOC111459739 PE=4 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 7.1e-68
Identity = 150/212 (70.75%), Positives = 165/212 (77.83%), Query Frame = 0

Query: 1   MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL 60
           M + + SFPL H QAHH            KT K E SNKCFIY+FS+FVFL VA+LIF+L
Sbjct: 1   MGEHSHSFPLPHSQAHH------------KTPKNEPSNKCFIYIFSSFVFLCVALLIFSL 60

Query: 61  IVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGLM 120
           IVLRVNSPTI  SSISV KFSISNTNSSS S+NLTLIAEF++DNSNFGPF FD  TV  M
Sbjct: 61  IVLRVNSPTIDLSSISVRKFSISNTNSSSSSLNLTLIAEFSLDNSNFGPFIFDYVTVVFM 120

Query: 121 YGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVRL 180
           YGG IVGERSTGGGRAEAKGT  MNV+VE   +NVS DLN  SGILN+SSFA   GR+ L
Sbjct: 121 YGGVIVGERSTGGGRAEAKGTTRMNVSVEASVENVSSDLNG-SGILNMSSFAKFGGRIHL 180

Query: 181 IHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           IHV RKRI S I+CSINLDLNTHQIQP WVC+
Sbjct: 181 IHVLRKRIWSEISCSINLDLNTHQIQPRWVCD 199

BLAST of Tan0016971 vs. TAIR 10
Match: AT1G64065.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 80.5 bits (197), Expect = 1.8e-15
Identity = 68/215 (31.63%), Positives = 111/215 (51.63%), Query Frame = 0

Query: 4   DNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFALIVL 63
           D     LA  + + +SDEEQ     ++   +E   KC +Y  +  V +    LI + I L
Sbjct: 3   DEDRITLAPTEIYGRSDEEQSGPRIWRRKTEEPPGKCLVYSLTIIVIIFALCLILSSIFL 62

Query: 64  RVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGLMYGG 123
           R++ P I   SIS      S  NS++P  N TL+++ ++ NSNFG F F++ T+ ++Y  
Sbjct: 63  RISKPEIETRSISTRDLR-SGGNSTNPYFNATLVSDISIRNSNFGAFEFEDSTLRVVYAD 122

Query: 124 -AIVGERSTGGGRAEAKGTKSMN---VTVEGFAKNVSGDLNS--TSGILNLSSFANLRGR 183
             +VGE    G R EA  T  +    V +  F    + DL+     G L L S A +RGR
Sbjct: 123 HGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLLDTKDLDKDLRLGFLELRSVAEVRGR 182

Query: 184 VRLIHVFRKRISSAITCSINLDLNTHQIQPDWVCE 213
           ++++   R ++ S ++C++ L+L    IQ + +CE
Sbjct: 183 IKVLGRKRWKV-SVMSCTMRLNLTGRFIQ-NLLCE 214

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q6DST1 | 2.5e-14 | 31.63 | Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana OX=3702 GN...

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038875090.1 | 4.5e-85 | 80.19 | late embryogenesis abundant protein At1g64065 [Benincasa hispida]
KAG6579535.1 | 2.3e-81 | 77.46 | Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. soro...
XP_022969774.1 | 8.8e-81 | 76.53 | uncharacterized protein LOC111468875 [Cucurbita maxima]
XP_008437349.1 | 1.8e-78 | 76.64 | PREDICTED: late embryogenesis abundant protein At1g64065 [Cucumis melo]
KAA0042716.1 | 2.4e-78 | 76.64 | late embryogenesis abundant protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1I3M2 | 4.3e-81 | 76.53 | uncharacterized protein LOC111468875 OS=Cucurbita maxima OX=3661 GN=LOC111468875...
A0A1S3ATY3 | 8.9e-79 | 76.64 | late embryogenesis abundant protein At1g64065 OS=Cucumis melo OX=3656 GN=LOC1034...
A0A5A7TL68 | 1.2e-78 | 76.64 | Late embryogenesis abundant protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A0A0KQT7 | 4.9e-77 | 76.17 | LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G152160 PE=4 ...
A0A6J1H2C0 | 7.1e-68 | 70.75 | uncharacterized protein LOC111459739 OS=Cucurbita moschata OX=3662 GN=LOC1114597...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G64065.1 | 1.8e-15 | 31.63 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004864 | Late embryogenesis abundant protein, LEA_2 subgroup | PFAM | PF03168 | LEA_2 | coord: 100..190; e-value: 8.3E-8; score: 32.7
None | No IPR available | PANTHER | PTHR31852 | LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY | coord: 7..212
None | No IPR available | PANTHER | PTHR31852:SF192 | LATE EMBRYOGENESIS ABUNDANT PROTEIN | coord: 7..212
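
The alignment coordinates give the residue range of each annotated region on the protein. As a rough sketch, the PFAM LEA_2 region (coord: 100..190) can be cut out of the protein sequence listed earlier on this page. The helper `region` is a hypothetical name, and the coordinates are assumed to be 1-based and inclusive, which the InterPro table does not state.

```python
# Protein sequence of Tan0016971, copied from the Sequences section.
PROTEIN = (
    "MADDNQSFPLAHYQAHHKSDEEQQKLSTFKTLKKERSNKCFIYVFSAFVFLSVAVLIFAL"
    "IVLRVNSPTIHFSSISVAKFSISNTNSSSPSMNLTLIAEFAVDNSNFGPFNFDNGTVGLM"
    "YGGAIVGERSTGGGRAEAKGTKSMNVTVEGFAKNVSGDLNSTSGILNLSSFANLRGRVRL"
    "IHVFRKRISSAITCSINLDLNTHQIQPDWVCE"
)

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

lea2 = region(PROTEIN, 100, 190)
print(len(lea2), lea2)
```

Under that assumption the LEA_2 match covers 91 residues, while both PANTHER matches (coord: 7..212) cover nearly the whole protein.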

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0016971.1 | Tan0016971.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane