Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATAATCCCTTCCCTCAGCGCGGGCAATATCAACGTGCCGTGATGAAACAGAGAACGAACTGATCATGGAGTATCCCAACTTTAAAGGAGGCCCATAACTTCGGCCCCTATCCCTAGTCATGACTCAGTCATAACTATCTCAATTCCCATTTCCTCGGAGTTCAGTGGCCGACGCCACCGTACGGCAATCTTCTTTGATGTTAAAACCCTTTCTCTCCCCCTCCATCTCCGGCTGCCACCATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAACCTTATCCAAATATTAATAGGTTTTTTGGGTATAAAACCGACCTTATCGGTGGTTGTAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGACCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCCGGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGTTGCGCATGCCTAATTTATGTATGATTGTTAATTCTGTAGTAGCTTCGTAGTTTGACAATTTATTTTCATGGGTTTTCAGGCCATTTATCGATGCCGATTCATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCTGTACTCTGTTTCGTCAAGGTACGATGATTAACTTCCAGCTCTGTATCACATTATTCTGTTGTCACAGGCCAGGATTTGTGTTTCCTTTGTCTGTGTTATGAGAACTTCATTCTACTTGCTTCAATTGCTTATTACACTATCTTCATTCTAAGAATGCAATTTAGTTGACGAATTTGGTCTGCGTTACAAAGCTATGATGGTATCAGAGCTCCTTGAAATTCCATTTAAGAGCAGTGAACTAAGGATGTAGCTAGCCTAGACCAATTGACATTTGTGAATATCTCTTTCCAACAAAATTGTTGACAATAAAAGCGTTCTTCCTCTCTGACTTCCCTTCGCTTGAAGTTTGTTTTGACAAGGATTGTGGATGGATTGTTTACAGCTAATTTGTTTTAGGTTTACTCTTTGTACGAGGCCGTTATTTTAAGATACTTGGACGGTCTCACTTACAAAAATATACATGCACGTGTTGATCGATAATGTTGTCGAGGATAAAAATTTTAGGATTTAGAACTGTTGGGAATTCATGAACTGAGATCTCTTTTCAAGTTTTTGTAATTCTGAGCCTTGGTTGGACAATATTATATCGCCTTTATGTTCTTGTAGATTGGTGTACTTGAATAACCATGACCACCACAAGCTGATTGATGCTCCATTAATTTAAACTTTCATTAAAAATATGACACGTGCTCTTTATACTATGTGACTGTGAAATGCAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGGTACTTACTGTGAAGTACTCTCGAATTTATGTCTTTGTCTCTATTTTCTATTTGGCAAATCTGTGTTCAATGCTCATAATTTTGTTACCAAAAATAGCTTCGAATCTTTATAGAAAAGAATTTGATCCAGTCTCAAGTTTAGGTTGAGTAGTGGAGATCTGTGGAAATAAATACACGCTTTCATTGGGTGTTTTGGTACATTTTTGCTGTATTAGAGGTGTTCTCATGTTTTTTTTTTTTTTTGCTTCTAAGTTGACTTATTTTTATTAAGAAACCAAGATAGCCACTTGTAGTACTGGACTCACTGAACTGAACCAATTTTGTTCAAGTGCATAAAACTCAAATGAGATATGTGAAACTGTTTCCATTGCATACTCTATTTTAGGCAGAGTATCCTTGTCGGACATGGACATGCTCCGGACACATGTCCGACACACTAATTTACGAGTTTCTTCTTCTTCTTCTTATTTTTTAATTTTGGGCATGCTATGGACACACATAACAAAATTTTAACCAAGAAAAGGGATTGGCTGGTTTATTTTGAGCCCAAAGTTAAAAGCCCATATAATCTTAAAATTTAAACACGAAGAGGACATTAAAAAAACATAAATAATTAAATTAAAAAGTAAAAAATCTTCTGTCATTCACGAACAGGACATGTTTTCCATTTCCTTTTTGTGTGTAAAATTGTTTTCCATTTCCTTTTAGTGTATTTACTCTTGTTTGTATAAAAAACTTTGTATTGTTGAATGTTGGATAATGTATTTTGTACTACTACTACTATCATTTTGACATTTTATGCTTTTTAGTGGATTATATTTAGTATTTTATGTTAAATCTACCTATATCCTAAAAAAATAATTTTAAAAAAATCGTATCCCCAATGTGTTTGTGTCCTAGTTTTTTAGAAATTGATGTATTGTCGTGTCTGAGTCTGTGTCCGTGTCCGTGTCCGTGCTTCTTAGACACCTGTTGATATGTAAATGGCATGCCTAGATGTGAATATTCTACATAATTTCAATTCTCAACGTACTTGATTTTTGCTTCAAATGGTATGCAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATCAGTCAGCTTGGAAGTGAACGCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACCTATTTGGCTTGTTCACTTTAAAGGTGGGTCGATCTTACCCGTGTAATTGTGATATGGTTTTTCTACTGTTATGCTGATTGTTGGTCCTTATTGCGGTTGTAGTTCTTCGGTAGAGTCTATTTGTTTATGGGTTAAAAATTACATTTTGTCAATGAGTTTTAATATGCAATGAATTTAGTCTCGTGTCTTTGAACATTAACAGGTAATGACTTCGTTCTTGTAGTTTAAAATTTGGGATGACTTAGTCCTTATTGTGAAAAGTCTATTAAAGCGAATCCATGTGTAAAAACAACAAAGGTCATGTGTGCAAAAAAGAAATGATGGAATGGTGGGATCTAGCGACAATCCCATCAAACCTACATCACACGTTAATGCATTTTTTCATGATAGGAACTAAACTGTTACGTACTAAAGGTTAGTGATTAAATTGTTACAGGTTTTGAACAATAGAGACTTAATTGTTACATACTAAAGTTTAAAGACTAAATTGTTATTTGAATGAAAACACATGGATTGAATGTGTCTTTTAACCTTGTTTATTCAGATGTCTCTTTTTCTGAGGGCTGAATAACATCCTAGTCTTGTTGGATTGATGTTTGGACTATAATTGATCTGGTTGTGCTACTCAAGCCTAAAATGGATTTTGATAATCACATATCCAAAGTATCAGGTTTGTAGTGGTTCAATAGAATCCAAGTGACATTACTTGCACAACAGAACACATAGGGGCTGTTTGGAGCGCTGGGTGAGTTATAATAGTCTGAGGAGTTATATAATTTGTGTTTAGGAGGCATGAGTTATGTAGCATGAATTATATAGTTTGCGTTTGGGTAATATAACCCATGCTCCAAACATCTCCATATAGATTGGCTCATATTCCTCGATCTTGTTACAATTTTATATCATTGAATTTTCATAGGAACGACCAAAATGGATGGACGTAACGACCGTTAACGAGCTGAAAACAAAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTCGACAAGAGTAAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAATATTTCTTTCCTCTGGTAGCCTGTTTCTGCTGTCTAAACTAACCGAATAACAGTAATAAGTTATGTACAAATATAAATATGTAATACACCTTTTTTTTCACCTTTTTTGGCCCTCCTCTGAGAACGGTTGAAACCGAGAAATGTTAGTTCTTGTAAATGGATTTGTAAGAAGATGTTACTGGAGTGTAGAATGACTGCAAAGTAGTAAATAAATAAGGCAATAAAGCTTTAGAAAGCT
mRNA sequence
ATAATCCCTTCCCTCAGCGCGGGCAATATCAACGTGCCGTGATGAAACAGAGAACGAACTGATCATGGAGTATCCCAACTTTAAAGGAGGCCCATAACTTCGGCCCCTATCCCTAGTCATGACTCAGTCATAACTATCTCAATTCCCATTTCCTCGGAGTTCAGTGGCCGACGCCACCGTACGGCAATCTTCTTTGATGTTAAAACCCTTTCTCTCCCCCTCCATCTCCGGCTGCCACCATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAACCTTATCCAAATATTAATAGGTTTTTTGGGTATAAAACCGACCTTATCGGTGGTTGTAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGACCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCCGGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTCATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCTGTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATCAGTCAGCTTGGAAGTGAACGCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACCTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGGACGTAACGACCGTTAACGAGCTGAAAACAAAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTCGACAAGAGTAAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAATATTTCTTTCCTCTGGTAGCCTGTTTCTGCTGTCTAAACTAACCGAATAACAGTAATAAGTTATGTACAAATATAAATATGTAATACACCTTTTTTTTCACCTTTTTTGGCCCTCCTCTGAGAACGGTTGAAACCGAGAAATGTTAGTTCTTGTAAATGGATTTGTAAGAAGATGTTACTGGAGTGTAGAATGACTGCAAAGTAGTAAATAAATAAGGCAATAAAGCTTTAGAAAGCT
Coding sequence (CDS)
ATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAACCTTATCCAAATATTAATAGGTTTTTTGGGTATAAAACCGACCTTATCGGTGGTTGTAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGACCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCCGGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTCATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCTGTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATCAGTCAGCTTGGAAGTGAACGCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACCTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGGACGTAACGACCGTTAACGAGCTGAAAACAAAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTCGACAAGAGTAAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAATATTTCTTTCCTCTGGTAGCCTGTTTCTGCTGTCTAAACTAACCGAATAA
Protein sequence
MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
Homology
BLAST of Tan0021720 vs. NCBI nr
Match:
XP_022937012.1 (uncharacterized protein LOC111443436 [Cucurbita moschata])
HSP 1 Score: 446.8 bits (1148), Expect = 1.3e-121
Identity = 231/271 (85.24%), Postives = 248/271 (91.51%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSP LIT RT+TTTAR PST+IIQAYQ++QP P N FGY+ DL+GGC RRFPA
Sbjct: 1 MQPSPSLITGPIRTLTTTAR--PSTIIIQAYQHQQPNPKFNGIFGYRADLVGGCGRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASPSSGPQVPAASAPFVQSDVGAASRTSALEKLDTVEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASLSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISNL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
GS R+ S+R+V HRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK
Sbjct: 181 GSARSFSERSVLHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSIFLSSATLFLLSKLTE 269
BLAST of Tan0021720 vs. NCBI nr
Match:
XP_023536456.1 (uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 444.9 bits (1143), Expect = 5.0e-121
Identity = 230/271 (84.87%), Postives = 247/271 (91.14%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSP LIT RT+ TTAR PST+IIQAYQ++QP P N FGY+ DL+GGC RRFPA
Sbjct: 1 MQPSPSLITGPIRTLATTAR--PSTIIIQAYQHQQPNPKFNGIFGYRADLVGGCGRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASPSSGPQVPAASAPFVQSDVGAASRTSALEKLDTVEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASLSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISNL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
GS R+ S+R+V HRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK
Sbjct: 181 GSARSFSERSVLHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSIFLSSATLFLLSKLTE 269
BLAST of Tan0021720 vs. NCBI nr
Match:
XP_022976828.1 (uncharacterized protein LOC111477089 [Cucurbita maxima])
HSP 1 Score: 443.4 bits (1139), Expect = 1.5e-120
Identity = 230/271 (84.87%), Postives = 246/271 (90.77%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSP LIT RT+TTTAR PST+IIQAYQ++QP N FGY+ DL+GGC RRFPA
Sbjct: 1 MQPSPSLITGPIRTLTTTAR--PSTIIIQAYQHQQPNSKFNGIFGYRADLVGGCGRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASPSSGPQVPAASAPFVQSDVGAASRTSALEKLDTVEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASLSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISNL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
GS R+ S+R V HRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK
Sbjct: 181 GSARSFSERGVLHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSIFLSSATLFLLSKLTE 269
BLAST of Tan0021720 vs. NCBI nr
Match:
KAG6591856.1 (hypothetical protein SDJN03_14202, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 442.2 bits (1136), Expect = 3.2e-120
Identity = 229/271 (84.50%), Postives = 247/271 (91.14%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSP LIT RT+TTTAR PST+IIQAYQ++QP P + FGY+ DL+GGC R FPA
Sbjct: 1 MQPSPSLITGPIRTLTTTAR--PSTIIIQAYQHQQPNPKFSGIFGYRADLVGGCGRGFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASPSSGPQVPAASAPFVQSDVGAASRTSALEKLDTVEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASLSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISNL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
GS R+ S+R+V HRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK
Sbjct: 181 GSARSFSERSVLHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSIFLSSATLFLLSKLTE 269
BLAST of Tan0021720 vs. NCBI nr
Match:
XP_022140712.1 (uncharacterized protein LOC111011276 [Momordica charantia])
HSP 1 Score: 441.4 bits (1134), Expect = 5.5e-120
Identity = 228/271 (84.13%), Postives = 246/271 (90.77%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSPPLIT RT+TTT R PST+I+QAY Y+Q P +RFFGY TDL+GGCSRRFPA
Sbjct: 1 MQPSPPLITGPIRTLTTTVR--PSTIIVQAYHYQQSNPKFSRFFGYTTDLVGGCSRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAPLIQSD AA RTSALEKL+TIEEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
G+ ++ S +N EHRSNLFGLFTLKERPKW+ + TVNELKTKLGHVIVMLLLIGFF+K+KK
Sbjct: 181 GTAQSPSMKNAEHRSNLFGLFTLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA S+FLSSGSLFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE 269
BLAST of Tan0021720 vs. ExPASy TrEMBL
Match:
A0A6J1F9X5 (uncharacterized protein LOC111443436 OS=Cucurbita moschata OX=3662 GN=LOC111443436 PE=4 SV=1)
HSP 1 Score: 446.8 bits (1148), Expect = 6.4e-122
Identity = 231/271 (85.24%), Postives = 248/271 (91.51%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSP LIT RT+TTTAR PST+IIQAYQ++QP P N FGY+ DL+GGC RRFPA
Sbjct: 1 MQPSPSLITGPIRTLTTTAR--PSTIIIQAYQHQQPNPKFNGIFGYRADLVGGCGRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASPSSGPQVPAASAPFVQSDVGAASRTSALEKLDTVEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASLSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISNL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
GS R+ S+R+V HRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK
Sbjct: 181 GSARSFSERSVLHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSIFLSSATLFLLSKLTE 269
BLAST of Tan0021720 vs. ExPASy TrEMBL
Match:
A0A6J1INA6 (uncharacterized protein LOC111477089 OS=Cucurbita maxima OX=3661 GN=LOC111477089 PE=4 SV=1)
HSP 1 Score: 443.4 bits (1139), Expect = 7.0e-121
Identity = 230/271 (84.87%), Postives = 246/271 (90.77%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSP LIT RT+TTTAR PST+IIQAYQ++QP N FGY+ DL+GGC RRFPA
Sbjct: 1 MQPSPSLITGPIRTLTTTAR--PSTIIIQAYQHQQPNSKFNGIFGYRADLVGGCGRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASPSSGPQVPAASAPFVQSDVGAASRTSALEKLDTVEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASLSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISNL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
GS R+ S+R V HRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK
Sbjct: 181 GSARSFSERGVLHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSIFLSSATLFLLSKLTE 269
BLAST of Tan0021720 vs. ExPASy TrEMBL
Match:
A0A6J1CHU0 (uncharacterized protein LOC111011276 OS=Momordica charantia OX=3673 GN=LOC111011276 PE=4 SV=1)
HSP 1 Score: 441.4 bits (1134), Expect = 2.7e-120
Identity = 228/271 (84.13%), Postives = 246/271 (90.77%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPA 60
MQPSPPLIT RT+TTT R PST+I+QAY Y+Q P +RFFGY TDL+GGCSRRFPA
Sbjct: 1 MQPSPPLITGPIRTLTTTVR--PSTIIVQAYHYQQSNPKFSRFFGYTTDLVGGCSRRFPA 60
Query: 61 CASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSL 120
CAS SSGPQVPAASAPLIQSD AA RTSALEKL+TIEEGLEKAIYRCRFMAFLGVLGSL
Sbjct: 61 CASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLEKAIYRCRFMAFLGVLGSL 120
Query: 121 VGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQL 180
+GSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS L
Sbjct: 121 IGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHL 180
Query: 181 GSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKK 240
G+ ++ S +N EHRSNLFGLFTLKERPKW+ + TVNELKTKLGHVIVMLLLIGFF+K+KK
Sbjct: 181 GTAQSPSMKNAEHRSNLFGLFTLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKK 240
Query: 241 AVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
VIQ+PGDLLCLA S+FLSSGSLFLLSKLTE
Sbjct: 241 VVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE 269
BLAST of Tan0021720 vs. ExPASy TrEMBL
Match:
A0A6J1J2R4 (uncharacterized protein LOC111480745 OS=Cucurbita maxima OX=3661 GN=LOC111480745 PE=4 SV=1)
HSP 1 Score: 436.8 bits (1122), Expect = 6.6e-119
Identity = 233/272 (85.66%), Postives = 246/272 (90.44%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNINRFFGYKTDLIGGCSRRFP 60
MQPSPPLI+ +R++TTT R PSTMIIQAY QY Q YP N F GYKT LI GC RRFP
Sbjct: 1 MQPSPPLISGPSRSLTTTVR--PSTMIIQAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFP 60
Query: 61 ACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGS 120
A A+ASSGP VPAASAP IQSDIG ASRTSALEK IEE LEKAIYRCRFMAFLGV GS
Sbjct: 61 AFATASSGPHVPAASAPSIQSDIGMASRTSALEKSGIIEEDLEKAIYRCRFMAFLGVFGS 120
Query: 121 LVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQ 180
LVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS
Sbjct: 121 LVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISH 180
Query: 181 LGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSK 240
LG+ERTLSKRN+EHRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK
Sbjct: 181 LGTERTLSKRNIEHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSK 240
Query: 241 KAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
KA IQ+PGDLLCLAAS+FLSSGSLFLLSKLTE
Sbjct: 241 KAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE 269
BLAST of Tan0021720 vs. ExPASy TrEMBL
Match:
A0A6J1FPY6 (uncharacterized protein LOC111445906 OS=Cucurbita moschata OX=3662 GN=LOC111445906 PE=4 SV=1)
HSP 1 Score: 434.5 bits (1116), Expect = 3.3e-118
Identity = 231/272 (84.93%), Postives = 244/272 (89.71%), Query Frame = 0
Query: 1 MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNINRFFGYKTDLIGGCSRRFP 60
MQPSPPLI+ +RT+TTT R PSTMII AY QY Q YP N F GYKT LI GC RRFP
Sbjct: 1 MQPSPPLISGPSRTLTTTVR--PSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFP 60
Query: 61 ACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGS 120
A A+ASSGP VPAASAP IQSD+G ASRTS LEK IEE LEKAIYRCRFMAFLGV GS
Sbjct: 61 AFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLEKAIYRCRFMAFLGVFGS 120
Query: 121 LVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQ 180
LVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS
Sbjct: 121 LVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISH 180
Query: 181 LGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSK 240
LG+ERTLSKRN+EHRSNLFGLFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK
Sbjct: 181 LGTERTLSKRNIEHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSK 240
Query: 241 KAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE 272
KA IQ+PGDLLCLAAS+FLSSGSLFLLSKLTE
Sbjct: 241 KAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE 269
BLAST of Tan0021720 vs. TAIR 10
Match:
AT4G19390.1 (Uncharacterised protein family (UPF0114) )
HSP 1 Score: 241.1 bits (614), Expect = 1.0e-63
Identity = 142/274 (51.82%), Postives = 181/274 (66.06%), Query Frame = 0
Query: 8 ITDLTRTITTT--ARPGPSTMIIQAYQYRQPYPN---INRFFGYKTDLIGGCSRRFPACA 67
+T RTI A P PS +I ++ P I+ F G K SR
Sbjct: 1 MTTPCRTINANAIAAPSPSGLIFNGFRDFVPIEKRLVISSFRGLKLP-----SRTTKTIT 60
Query: 68 SA-------SSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLG 127
S+ S G A+++ + AA +++ + + +EEG+EK IY CRFM FLG
Sbjct: 61 SSDWSWSYRSPGRLASASTSTSASTSTSAAVTSNSTNRFEALEEGIEKVIYSCRFMTFLG 120
Query: 128 VLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYEL 187
LGSL+GSVLCF+KGC++V SF +Y VNRGKVI LLVEAID+YLLGTVMLVFG GLYEL
Sbjct: 121 TLGSLLGSVLCFIKGCMYVVDSFLQYSVNRGKVIFLLVEAIDIYLLGTVMLVFGLGLYEL 180
Query: 188 FISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFF 247
FIS L + + + V +RS+LFG+FTLKERP+W++V +V+ELKTKLGHVIVMLLLIG F
Sbjct: 181 FISNLDTSESRTHDIVSNRSSLFGMFTLKERPQWLEVKSVSELKTKLGHVIVMLLLIGLF 240
Query: 248 DKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKL 270
DKSK+ VI + DLLC++ SIF SS LFLLS+L
Sbjct: 241 DKSKRVVITSVTDLLCISVSIFFSSACLFLLSRL 269
BLAST of Tan0021720 vs. TAIR 10
Match:
AT5G13720.1 (Uncharacterised protein family (UPF0114) )
HSP 1 Score: 160.2 bits (404), Expect = 2.3e-39
Identity = 93/218 (42.66%), Postives = 133/218 (61.01%), Query Frame = 0
Query: 59 PACASASSGPQVPAASAPLIQSDIGAASRTSALEK-LDTIEEGLEKAIYRCRFMAFLGVL 118
P +++SS P + + S G S + + E +E+ I+ RF+A L V
Sbjct: 40 PESSASSSIPTSIPVNGNTLPSSYGTRKDDSPFAQFFRSTESNVERIIFDFRFLALLAVG 99
Query: 119 GSLVGSVLCFVKGCVHVAASFSEYFVN------RGKVIMLLVEAIDVYLLGTVMLVFGTG 178
GSL GS+LCF+ GCV++ ++ Y+ N G++++ LVEAIDVYL GTVML+F G
Sbjct: 100 GSLAGSLLCFLNGCVYIVEAYKVYWTNCSKGIHTGQMVLRLVEAIDVYLAGTVMLIFSMG 159
Query: 179 LYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLL 238
LY LFIS + S+LFG+F +KERPKWM +++++ELKTK+GHVIVM+LL
Sbjct: 160 LYGLFISHSPHDVPPESDRALRSSSLFGMFAMKERPKWMKISSLDELKTKVGHVIVMILL 219
Query: 239 IGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKL 270
+ F++SK I T DLL + IFLSS SL++L L
Sbjct: 220 VKMFERSKMVTIATGLDLLSYSVCIFLSSASLYILHNL 257
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022937012.1 | 1.3e-121 | 85.24 | uncharacterized protein LOC111443436 [Cucurbita moschata] | [more] |
XP_023536456.1 | 5.0e-121 | 84.87 | uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo] | [more] |
XP_022976828.1 | 1.5e-120 | 84.87 | uncharacterized protein LOC111477089 [Cucurbita maxima] | [more] |
KAG6591856.1 | 3.2e-120 | 84.50 | hypothetical protein SDJN03_14202, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022140712.1 | 5.5e-120 | 84.13 | uncharacterized protein LOC111011276 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1F9X5 | 6.4e-122 | 85.24 | uncharacterized protein LOC111443436 OS=Cucurbita moschata OX=3662 GN=LOC1114434... | [more] |
A0A6J1INA6 | 7.0e-121 | 84.87 | uncharacterized protein LOC111477089 OS=Cucurbita maxima OX=3661 GN=LOC111477089... | [more] |
A0A6J1CHU0 | 2.7e-120 | 84.13 | uncharacterized protein LOC111011276 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
A0A6J1J2R4 | 6.6e-119 | 85.66 | uncharacterized protein LOC111480745 OS=Cucurbita maxima OX=3661 GN=LOC111480745... | [more] |
A0A6J1FPY6 | 3.3e-118 | 84.93 | uncharacterized protein LOC111445906 OS=Cucurbita moschata OX=3662 GN=LOC1114459... | [more] |
Match Name | E-value | Identity | Description | |
AT4G19390.1 | 1.0e-63 | 51.82 | Uncharacterised protein family (UPF0114) | [more] |
AT5G13720.1 | 2.3e-39 | 42.66 | Uncharacterised protein family (UPF0114) | [more] |