Tan0009126 (gene) Snake gourd v1

Overview
NameTan0009126
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionlate embryogenesis abundant protein At5g17165-like
LocationLG01: 4342729 .. 4348747 (-)
RNA-Seq ExpressionTan0009126
SyntenyTan0009126
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTAGAATCGAAATCAATAGGAATTAATCGCCGTTTTTCTTCTTCGTTCTCTCTCTCTTTCTGCTTAGGCGTTTTTCTGGTTCTTCGTTTTTGTTCTCTCTCTCTCTCTCTTTAAGCCTGAATTGTATGGCCGCTAACTCGAGGAGCGCAGGAGCGATCGCCGGCTTGGGGAAACGAATCACTAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCTCTGCTCTGAACTTCAGGTTCTTCTTCTTCTTCGATTTCTTCTTGTGTGAATATTCATCTGGCAATCTACTTGAAGTTTAGGTGCTGTGTTACTTCGTTTTGTGTTGTTATTCTACTTTTATTGCATTCTTATTGCGAAATTTTCTTTGAAAAAGTCGATTGTTGTTGCATTTCTTCTTCTTCACGCTGATTGCTTTTTTGAACAACTTCAAACTGACGCAGTAAGATGAACGTCGTCAGTTTTAGGTCCTCTGCTTTTTTATCATATACTTTTCTCGGTTCTTCTGATTTCTGTCTTTTCTTCTTTTAGGATTCTAATTCGCCCCCCGATTTTCCCGCCATATACAGTACTATGACTAGCTTGCCTAGAAAATTGATAATTACGCTTTCCTTCACTCTATGTTTCTTTTTTCCTCTCTTCCTTCTGTTTTTAGTTCAATTTACTGGATGTTTTGTCAGTTTATTTTATTCAGTGTTTGTTTGTTCAATCACGCTGTGGACAAGTAGAAACTGCTTGCTGTTGAAGAGGACTTGGAATTAGAGAGGGATAATTCTTATTCCTTCGAATCTTTGTTGAAATTTGAGGAAGAGATGGAGTGAATCGCGGCTATTTCGTGCTCTTTAGATTCTTTTTTTGGTAAGATGCTCAGATGATAAGAAATCAGAAAAATCGAAAAGTCGTGAATATTAAACTTAAATAACTAATTTATTTTTTAATATATTAAATAATTCATTCATTAAAAGATCAATCGATCAATCTTTTAACATTTTAACATTTTTATTGTTTTAAAGAAAAAATAATAATTCTTATAATAAATGAAATAAAATTTTAAAAAGTAATTGAGCTATATGTTTAATATACTTAATACATTTCAGCTTTACAATTATTTTAATTTAGTTATAGCAAAAAATGAATATCGCCAAGGTTTTAACATTTAATTAATTAACATAATTAAATATCTTCAGACCACAAATCTTCAATTATTTTTCTTTTTTGTTTTTTGAGACGAATCTTTAATTTTCAAGTCCTAAACCTATAAATTCATTCTTACTTTTCAATCCATTGACCAATGAAATACAAAATTAAAAGTTTTGTGATTTATTAAATTATTTTTTAGTTTTTAAAAAAAAAAAAGTAATAAATAAATATAAATTTATAAATTTAGAAACTAAACTTATAATTTAAGTTTGTAAATATTTTAAAGGAAGTCTCAAAAGAAATACTGAAATCCCTAGGGCCAGTCTACCTGTATTAATAGAACTTATTTAAATTTATTATTATACATGATTCAAAATTCACCCATTGAATTTCGACACCAAACAAAAAAAAACCTCGTTGAACTTATTTTCTGAGGATAAAGATTAGTTTATCGGTTAGTTGACTGTACTCCATTCAACCTTAGACTAGAAATTCTGTAAAAATAAAAGTAAGACACACTGGTTAGAATCCATAAATGGAAACCTCATAACTTCTTTGTTGACTAGATAGTTAAACGTCAATTTTACGCTCTGAAACTGGGGAACGTCGAAATTGTTTTTGTGGACCTTTTGGCTCACTACAACTATTTCTATTCTACATATCCCTGACCACTATTCACATTCTGCACTAAAAGTCTATTCATATTAAAACGTCTGAAAAAATATATATAGTTTGTTTCGAGACATTAAAATAGGATGCACATATGTATTTGATATCGTATGTAGTTTATTTCTATGATGAAAATGTCATCCATAAATTAAAAAATTTGACATTAATATAGAGGATTAATCGATAATATCCTCGATAGCCATAAAATATATTCCTGAAATAAAAAATACTATTTATTAACTTCAATATAAATATAAATACTAATAACAATATCTATAATTTAGTTTGAAAATAAAAATAACTTAATTATTAATTTAATAAATTTTGATTTTTTTTTAATTATAGAAAGAAAGATAGGTTAATATCAATATTTTACCAATATATAACTATAAGGGCCTGTTTGGCCCACTGGTTATTATTACTCGTGTTATAATAACTCGTATTATAATAATTTATGAAATATTCTACTATTATAATTTCAGACTGCAATTTAAATAATATTATATATTTTATAGATTATTATAACTCATTTAGTATCTACTAAATTTAGATTAAATATCTATAGCATAGCATCTCCCTATAGACGCACTATATTTCTGTAGGGTACGGTTTGGGTATAAATATCTATAGTATCTCCCTCTTTTTCATGCTAAAAAAGCTCAAAACAAAATCACTCACTACTATATAGTCTGTAGTTAGTCGACGGGGTTGCTGACATTTAAAGCTGACTGTCGATGTGGCTGATGGTGATCCACTCGTTTCTTTCAGCTGACTGTTTTTTTTTTCTTTTTAATTCTTCTTTGTCCAGTCCCACAGTGGATAGCGACGGAATGGCTTTAAATAGGATAGAATATAGGGGGCCAGTAAGGGGACAGCGTAGGGTTGGCGACTTAACATCTCTGCTCGTTTACTCCAGTACTCTTTCACTTGAAAAAATTATAGTATTCCAGTGGTAGTGTTTAAAATTTTATATACTATTTCAATTTCATTGTTTATAATATTTTCAAAAAAAAAAAATCGTTGTTTATAATATATAGTAATATTTAAGAAAGGTATGAGTTCAAGAAATAAATTTCTGAATGAAAGGATCCACTATCAAAATATTTTCCACGTCCATGTCATGCCACTTGGCTTTTAAAATTCACATCTAGTCAAATTAATGACAATATTTATTTGCTAATTAAGTTGATTTCTGCTGTGTTTTAATTTTTAAAAATTGTTTGAATAAGTTTTTTTTTTTGTCATCTTGTTTTCTTTTATTTCTTCTTTTTATCAATTTAAAGACACTACGTTTAGATAAAGTAATACTTTGAAAATGAACAATAATTATGAGTTAATAATTGATGGTTAAAAATCTATATTTTAAGAATTATAGTGCTGTATAACTTTAGTGCATAAAAATTTGTTCCTTTTTCCTTTTGTAGAAAATTGAAAACCATCTTTGAACTTTTAACTTTTATACAGAAAAATCTCATGTAGTTCGATCAATTATAAATGCACCGCTAACTATTATATTTGATTGAACTAGTTTATATTGAATAAAAATCTTGCAGGCTTTTCTGTTCACAAATTTTTTTATATAGACACCACTAATTACTTTATTATTTTATCTGTAAATCCCACACAGATTTAAATATTAGAGTAGTTCAATTTATGTTTTCTTGCACCCAACGTGAAACATGCGTGAAATCCAATCTGTTTGTAACCGATAAGTTAGCTGGCTCTTATATCATTTATCCTGTACGATCCACTTGTTGATCACCAGATGTTTGGAATAGGAACTGGTCCATCACCTATATATATATATATGTATGTATGTATACGTATCATTCCAATAGTCACAATGTACTTGTCACACTGAATAATTTTTCAATTTTTTACATCTTTCCTTCGAAATATCTCTTTTATCCCCTTTTCTGGGATGAGATAAGCATGATTCCCTTTCACGTCACCTTTGTTTGATAAATTTTCCTTAGAAGTTATCATCTCTATGAGACCTTTATTGGAGTGGCATGATGGGATATTTGTCTGGAATTATTGAATTGATCGTTACTATTCTTGGACTATTCTGCAAGTTTATGATATTTATCATGGCTCAATTCTTCCATTTTTAGTTTTTTTCCGTAGGTATATTCCATTTTTCATAATTTTGTTCCTAATTTTTTTTACTGCATTTTCGTGTCAATCAAATCACTATTTAATAATGTTCTTGAAATTAAACAAATGAGCAATGATCTGGTGTTTTGAAAATATCATTATCTATGAATAGAAGTGAGAAAAAAAGATTAGATATCGACTCAGAATAATCGAACCAAATCAAGATAGGGACAACAGCCCCAAATCAACTGAATATATATATAACAGAAACTGTAAAAAATCTATATATTTATATTATATTCTATAAATATATTATAAAAAATTATAAACCGACCAAACTGACCCGACGGTTGGTTAAAGGCACTAGAAAAAAAAAACCAATCAATCGAGTTTGGTCGAGACATCAGTTAGAGTTGAAAACTCACCTCTATCGATAGATGATCAACCCTATCTCTGAGTTGATGTTAAATATTTATCATCCAATATTAGATTGAAAAAAGACTTGATTAACCCTACCGGATACAAATGTTATATAAAAATAATGGTTGTTTTACTTTTAAGTTTTAACCAATAAATGTCGGAGCAGGTAAGGTTAGATATTTTTACCATTTAAATAACATTGGCTGATGTGATTCCACATCGCCATTTCTTTATTTGGTCTCAAATTTAATGGAGTTATTAAACAAATATTTTATTGAAACTTTTCTTATTTTGTAAGATTAATAGTGAACACTTTTCGAAATATTAGATATTAAATTTAAGAAAACAGTGTCATCTCAGAAGTTAAATAATAAACTTATTTAGTGGGCATTTTTTGGGAAAGGGCAAGAGCTTGAAATTTGATTTTTTTTTATGCGACTTTCCATCAAGTTGCTCACCACGTGGTGGGACTGTGTTGTTTATCATTTGGAACTGATCTATTAGTTATTATTCCAAAAGTCAAATCATAAATATAATTGCTTTTCCTCCATGTCTTTTCTTAGGCTGATCCTTTTATCCTTCCCCACCTGTGGTCCATGCAGTCGATTCGGATTCGAATCACATTTATTAATTATAATTCTTTACACTAATTATAAATTTAAACCTCATTTTGGAATAAAGTTTTTTATTTATTTTACTTTTGTCTACAAACCACCTTTGAAAACTTAGTTTACTCTCTCACCTTTTCAATATTCTATTTCAATTCTTAAATTCTAGTAGAAAACTATTGTAATAGACATTCTTGAATCAAATGGTTCCTCCTAAAGTAACTACTCATCAAGTAGAATTATCTCTATTAATTTTCCAAATAGAATTTAAAAAATTAATCACAAGGGTTTTTTATTTAAAAAAAATAGAATTCAGAAACAAAAATCAGAAGTTCAAAAGTTGAAAAATTAAAATATGTGTGAGTTTAGGAATGAAATAAAGATTTAAATCTTAAATTAGAGCATAAGATATAAAAAAAAAACAATATATATTATGGATTGCTTCAATATTTTGTTTTTGGTTAAATGAAGTACTAACGGGAAAAATACACAAACCTGAAGCAGGAGGGCAGCTCACACCTCAGTATATGACAAGAACCCAGACGAGCAAATCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCTGATAAATACTGGGCTCCTCATCCACAGACAGGAGTTTTCGGGCCGGCCACCAACCACCCTGCTGCAGCGGCGGCGAGCCGTTCTGCAGATGGTGGCAACCACGCTGCTGTGGAGGAGGAAAGGGCTTGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAGCCGCACGGGTTCTAGGCTATATTATAAGGCATTGTTGTTCATTTTGGCCATCGGACATTAGACAACAAATAGCCACCTAAAAAGTTACTAAATAAAGGAGATGGTTGTTTTTATTGTTGTACTAAGGGTTTGGAGTTCAGAGTTGCTTGAGGCTGTGGTAATCTGGCTAAGGATGCTTTGCCTCAGCTTGTTATACGGTATTGTAGCAACTTTATTACAGTGTCGGACGTTTTGTTGTTCACGTTTGACTATGCAAAATAGTGTGGTTTTATAACAATATATTTCTCCATCCTACAGGAGAACAAAAAAAAAATTGTCTGTTCTTCA

mRNA sequence

ATTAGAATCGAAATCAATAGGAATTAATCGCCGTTTTTCTTCTTCGTTCTCTCTCTCTTTCTGCTTAGGCGTTTTTCTGGTTCTTCGTTTTTGTTCTCTCTCTCTCTCTCTTTAAGCCTGAATTGTATGGCCGCTAACTCGAGGAGCGCAGGAGCGATCGCCGGCTTGGGGAAACGAATCACTAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCTCTGCTCTGAACTTCAGGAGGGCAGCTCACACCTCAGTATATGACAAGAACCCAGACGAGCAAATCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCTGATAAATACTGGGCTCCTCATCCACAGACAGGAGTTTTCGGGCCGGCCACCAACCACCCTGCTGCAGCGGCGGCGAGCCGTTCTGCAGATGGTGGCAACCACGCTGCTGTGGAGGAGGAAAGGGCTTGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAGCCGCACGGGTTCTAGGCTATATTATAAGGCATTGTTGTTCATTTTGGCCATCGGACATTAGACAACAAATAGCCACCTAAAAAGTTACTAAATAAAGGAGATGGTTGTTTTTATTGTTGTACTAAGGGTTTGGAGTTCAGAGTTGCTTGAGGCTGTGGTAATCTGGCTAAGGATGCTTTGCCTCAGCTTGTTATACGGTATTGTAGCAACTTTATTACAGTGTCGGACGTTTTGTTGTTCACGTTTGACTATGCAAAATAGTGTGGTTTTATAACAATATATTTCTCCATCCTACAGGAGAACAAAAAAAAAATTGTCTGTTCTTCA

Coding sequence (CDS)

ATGGCCGCTAACTCGAGGAGCGCAGGAGCGATCGCCGGCTTGGGGAAACGAATCACTAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCTCTGCTCTGAACTTCAGGAGGGCAGCTCACACCTCAGTATATGACAAGAACCCAGACGAGCAAATCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCTGATAAATACTGGGCTCCTCATCCACAGACAGGAGTTTTCGGGCCGGCCACCAACCACCCTGCTGCAGCGGCGGCGAGCCGTTCTGCAGATGGTGGCAACCACGCTGCTGTGGAGGAGGAAAGGGCTTGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAGCCGCACGGGTTCTAG

Protein sequence

MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDSEKPHGF
Homology
BLAST of Tan0009126 vs. ExPASy Swiss-Prot
Match: F4KFM8 (Late embryogenesis abundant protein At5g17165 OS=Arabidopsis thaliana OX=3702 GN=At5g17165 PE=3 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 5.4e-20
Identity = 59/124 (47.58%), Postives = 80/124 (64.52%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAA S++   I  +G+ I N +    S   +  L   R  HTS YDKN +E+++PS VPD
Sbjct: 1   MAAKSKN---IQVVGRHIVNGV---RSRAVAYGLFTSRNDHTSAYDKNVEEELQPSQVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           ++I+P  +DKYW+PHPQTGVFGP+++   A    R   GG   +V EE+AWFRPTSLED 
Sbjct: 61  EMIKPD-SDKYWSPHPQTGVFGPSSSSTNAKDEFR---GGQEDSVMEEKAWFRPTSLEDL 114

Query: 121 EKPH 125
           +K H
Sbjct: 121 DKTH 114

BLAST of Tan0009126 vs. NCBI nr
Match: XP_022941802.1 (late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 229.2 bits (583), Expect = 2.0e-56
Identity = 112/126 (88.89%), Postives = 117/126 (92.86%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGGNHAA EEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAAAEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 126

BLAST of Tan0009126 vs. NCBI nr
Match: KAG6600216.1 (Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 224.2 bits (570), Expect = 6.4e-55
Identity = 110/126 (87.30%), Postives = 115/126 (91.27%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQIWT DSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQIWTCDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP  +   AAA SR+ADGGNHAA EEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEADQSTAAAGSRAADGGNHAAAEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 126

BLAST of Tan0009126 vs. NCBI nr
Match: XP_022990289.1 (late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 224.2 bits (570), Expect = 6.4e-55
Identity = 111/126 (88.10%), Postives = 116/126 (92.06%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQI TSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQICTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP    P AA A+R+ADGGNHAAVEEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEAEQPTAAVATRAADGGNHAAVEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 126

BLAST of Tan0009126 vs. NCBI nr
Match: XP_022941803.1 (late embryogenesis abundant protein At5g17165-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 222.6 bits (566), Expect = 1.9e-54
Identity = 111/126 (88.10%), Postives = 116/126 (92.06%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKF-RAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGGNHAA EEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAAAEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 125

BLAST of Tan0009126 vs. NCBI nr
Match: XP_023512363.1 (late embryogenesis abundant protein At5g17165-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 219.5 bits (558), Expect = 1.6e-53
Identity = 109/126 (86.51%), Postives = 114/126 (90.48%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKF-RAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQ A+KYWAPHP TGVFGP  + P A A SR+ADGGNHAA EEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQVAEKYWAPHPHTGVFGPEVDQPTAVAGSRAADGGNHAAAEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 125

BLAST of Tan0009126 vs. ExPASy TrEMBL
Match: A0A6J1FNH5 (late embryogenesis abundant protein At5g17165-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447057 PE=4 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 9.7e-57
Identity = 112/126 (88.89%), Postives = 117/126 (92.86%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGGNHAA EEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAAAEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 126

BLAST of Tan0009126 vs. ExPASy TrEMBL
Match: A0A6J1JRN3 (late embryogenesis abundant protein At5g17165-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487206 PE=4 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 3.1e-55
Identity = 111/126 (88.10%), Postives = 116/126 (92.06%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQI TSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQICTSDSAISSSALKFRRAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP    P AA A+R+ADGGNHAAVEEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEAEQPTAAVATRAADGGNHAAVEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 126

BLAST of Tan0009126 vs. ExPASy TrEMBL
Match: A0A6J1FT47 (late embryogenesis abundant protein At5g17165-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111447057 PE=4 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 9.1e-55
Identity = 111/126 (88.10%), Postives = 116/126 (92.06%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALKF-RAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGGNHAA EEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEADQPTAAAGSRAADGGNHAAAEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 125

BLAST of Tan0009126 vs. ExPASy TrEMBL
Match: A0A6J1JSV5 (late embryogenesis abundant protein At5g17165-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111487206 PE=4 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 2.9e-53
Identity = 110/126 (87.30%), Postives = 115/126 (91.27%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAANSRSAGAIAGLGKRITNQI TSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPD
Sbjct: 1   MAANSRSAGAIAGLGKRITNQICTSDSAISSSALKF-RAAHTSAYDKNPDEQVRPSIVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           DVIQPQAA+KYWAPHP TGVFGP    P AA A+R+ADGGNHAAVEEE+AWFRPTSLEDS
Sbjct: 61  DVIQPQAAEKYWAPHPHTGVFGPEAEQPTAAVATRAADGGNHAAVEEEKAWFRPTSLEDS 120

Query: 121 EKPHGF 127
           EKPHGF
Sbjct: 121 EKPHGF 125

BLAST of Tan0009126 vs. ExPASy TrEMBL
Match: A0A1S3BYE5 (uncharacterized protein LOC103494437 OS=Cucumis melo OX=3656 GN=LOC103494437 PE=4 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 4.0e-50
Identity = 107/131 (81.68%), Postives = 119/131 (90.84%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTS-----DSAISSSALNFRRAAHTSVYDKNPDEQIRP 60
           MAANSRSAGAIAGLGKRIT+QIWTS     +S ISSSA  FRRAAHTSVYDKNP+EQ+RP
Sbjct: 1   MAANSRSAGAIAGLGKRITDQIWTSSDSLRNSVISSSAPKFRRAAHTSVYDKNPEEQVRP 60

Query: 61  SIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAA-AAASRSADGGNHAAVEEERAWFRP 120
           SIVPDDVIQPQAADKYWAPHPQTGVFGP +++PAA AAA+R+AD GN++A EEE+AWFRP
Sbjct: 61  SIVPDDVIQPQAADKYWAPHPQTGVFGPTSDNPAAVAAANRAADVGNYSAAEEEKAWFRP 120

Query: 121 TSLEDSEKPHG 126
           TSLEDSEKPHG
Sbjct: 121 TSLEDSEKPHG 131

BLAST of Tan0009126 vs. TAIR 10
Match: AT3G03150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G17165.1); Has 39 Blast hits to 39 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 101.7 bits (252), Expect = 4.5e-22
Identity = 58/120 (48.33%), Postives = 76/120 (63.33%), Query Frame = 0

Query: 5   SRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQ 64
           S+S   I  L K + N    S  A +S+    RR+ H+S YDKN ++++  S VPD+VI+
Sbjct: 6   SKSFQLITSLRKHLVN-TRASTRATASALFPSRRSGHSSAYDKNVEDELHASAVPDEVIK 65

Query: 65  PQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDSEKPH 124
           P  +DKYW+PHP+TGVFGP+T   +A A     D     AV EE AWFRPTSLEDS+K H
Sbjct: 66  PD-SDKYWSPHPKTGVFGPSTTEHSATAEGAHQD----TAVLEETAWFRPTSLEDSDKTH 119

BLAST of Tan0009126 vs. TAIR 10
Match: AT5G17165.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G03150.1); Has 39 Blast hits to 39 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 98.6 bits (244), Expect = 3.8e-21
Identity = 59/124 (47.58%), Postives = 80/124 (64.52%), Query Frame = 0

Query: 1   MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPD 60
           MAA S++   I  +G+ I N +    S   +  L   R  HTS YDKN +E+++PS VPD
Sbjct: 1   MAAKSKN---IQVVGRHIVNGV---RSRAVAYGLFTSRNDHTSAYDKNVEEELQPSQVPD 60

Query: 61  DVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERAWFRPTSLEDS 120
           ++I+P  +DKYW+PHPQTGVFGP+++   A    R   GG   +V EE+AWFRPTSLED 
Sbjct: 61  EMIKPD-SDKYWSPHPQTGVFGPSSSSTNAKDEFR---GGQEDSVMEEKAWFRPTSLEDL 114

Query: 121 EKPH 125
           +K H
Sbjct: 121 DKTH 114

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
F4KFM85.4e-2047.58Late embryogenesis abundant protein At5g17165 OS=Arabidopsis thaliana OX=3702 GN... [more]
Match NameE-valueIdentityDescription
XP_022941802.12.0e-5688.89late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita moschat... [more]
KAG6600216.16.4e-5587.30Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. soro... [more]
XP_022990289.16.4e-5588.10late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita maxima][more]
XP_022941803.11.9e-5488.10late embryogenesis abundant protein At5g17165-like isoform X2 [Cucurbita moschat... [more]
XP_023512363.11.6e-5386.51late embryogenesis abundant protein At5g17165-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1FNH59.7e-5788.89late embryogenesis abundant protein At5g17165-like isoform X1 OS=Cucurbita mosch... [more]
A0A6J1JRN33.1e-5588.10late embryogenesis abundant protein At5g17165-like isoform X1 OS=Cucurbita maxim... [more]
A0A6J1FT479.1e-5588.10late embryogenesis abundant protein At5g17165-like isoform X2 OS=Cucurbita mosch... [more]
A0A6J1JSV52.9e-5387.30late embryogenesis abundant protein At5g17165-like isoform X2 OS=Cucurbita maxim... [more]
A0A1S3BYE54.0e-5081.68uncharacterized protein LOC103494437 OS=Cucumis melo OX=3656 GN=LOC103494437 PE=... [more]
Match NameE-valueIdentityDescription
AT3G03150.14.5e-2248.33unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G17165.13.8e-2147.58unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 101..126
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 71..126
NoneNo IPR availablePANTHERPTHR35122:SF2OSJNBA0093F12.14 PROTEINcoord: 1..124
IPR039291Late embryogenesis abundant protein At5g17165-likePANTHERPTHR35122OSJNBA0093F12.14 PROTEINcoord: 1..124

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009126.1Tan0009126.1mRNA