Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CTAAGTTTGAAATTGCAGGAACCACTGCTAGGAAATTCCAGAGAACTTCTCCCTGAAAATGTCGATTTCGAATGCAATTATACAGTTATACTTTCACATACAGAAAATTACAGAGCGTTGGGGTTCTTGTGAGCTCTGAGTTGTTCTATTCTCTGTAAGTTGTCTTGAAATTTTCATACTATTATTGCCTTTACGATTTTCTCAATTACCGAAGTTTCAATGCTTTGCTGAATAATTTTTCAGAAGATTTGTTTCATGCCAGAGTTGGTATCAATTCGAGCTTATGTCATCAAACCCTTCTTACACGTATCGATTACTAGACTTTGATCAACCAGGTATTGTTGTTGTCTCTGATTTATGTTTGAATTCATTTTCGTAGGTTTTTTGGTTCTTAATCGAGAAAGAAGCGTAATCTAGGTGTAAAATGATCGATCTGTTTCTAGTAGAGCCCATCTTCAACGAAGAAGAGGATGCTGGCTCCGCGAAGTCGAGAATTTCTCTGTTAAGTAGATTAGAATCTGTTTTACGGAAATTGATGGCTTCTGGAGGACGGTCGGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGCATGACTTCCATCAGTCCCCAGCATCAACGAGACCTGTTCATGACCTTTCTGAGACTGAAGCCACTGAAGTGGGCCTTCGCATCTCAATTACTGCAAATGTTGTTTGAAAAGAGACAACGAGAGGCAGGAATTCTCATTGCCAAGAGAAGCTACATCATGGAAAAGTTTTTTGAAGGTAATAGTTCTCTCTACTCATTGTCAAATTTAGCTTTGGCAGTTCTATCTGATTGCAAGAACACTTAACCAAATCTTCATTCTTTCAAATTGCTTTTATAGATAAAATATTATTTGGCTGGAAAATAATAATCCGAACCTTGTTCTTAAGGATCCACTATGTATTTGTTGGGTGTAATGTGGATTTAAGAAGTATGGACATGGATATGAGACATGGACATGACACAACATGGACATGTTATTAGAATATAAATTTTTTAAATATATACATCTTAGGCCTTGTTCAATAACCAAGGTTTTTGGATTTTGAAAATTAAGCTTGTTTTCTCACACCTTCTCTACAAAAGTTTTCATCTCTATTAAGGAAACATTTGAATGCTTACCCAAATTTCAAAAACGAAAACAAGTTTTTGAAAACTAAAAAAAGTAGTTTTTGGAATTTTGAATTTGGCTAAGAACTCAAATGTTTCCTTAAAAGAGAACCGTTGTAAATTGGGAGAAAACAAGCTTAATTTTCAAAAATCGAAGGGATATCAAATGAAGCCTTAATGATTCTCTTGCTAAAAGTTTGTTTTAGTTGATGATGCTTTTGCATTTATGGCACATCGCTTTATGTTATCAATTTTGTAAATGAAATATATGCTTAACAAAAAGGAAGTTTAAAGTCAATATGCTTATGCACTTATATGTATTTCACTTTATGTTCCTAAGTGCTTAATGTTAATTGTGCTTAACAAGTATTCCAACAAGTATTCAAGTATTAGACTCATATTCAATTTGTTCAACTAGCGCTGGACACATGTCAATTATGTTTAATAAGTATCCGACATTTGTGCAACAAATGTTAGAGTGCCAAATATTGTATCGGACACAGGCATGTTGCTCAAACTGAAGTCTTTGTTCTTCGTAGATGTAAACTGCAGCCTCTCTAAACAGTAGTTTCACATGAACTTGACTATTGGAGACCGGGGATGTAACTATGTCTGGTGTGGAATGTTGCTGGATGTGTCCTTTTTCTTATTCCTTTTGTTATTCATATGCCATAATTTATCTCGTGTTTAGTTTCCTTGTCTATCAATGGATTTTAACTTTAACTTAAGACACTGAAACTGCTTCCCTTGACAGGGGAGTTAGTTTATTAATGACCCAACAGACTATTCAATTACAATAGAATGGTTTGCGCTAGTTCTGTCTTTGATGATTTGCTTTCACTTTCCATGCAGATTCCTGATCA
TGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACAAATGGTGCATCAGATCATGGGAAAGGTGCCAAGGCCCTGGCACAGTTTGCTTTTGTAAATCGAGACATTTGCTGGGAGGAGCTTGAGTGGAAAGGGAAGCACGGGCAATCGCCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTAAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCCGAATCACTAAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTCGTGACATATCTCATCGATCTGATGCTTAAAGATGATTCAAGAGATGTTTTGGATGTCATTAATGAGTTCCTAATGCAGGAGTCATTTTCTTCGTTGTGTCAACACCTCCTTATTACTCTTGAAGAGGCGGATTTCTGCTACTTTTTAAAAATTCTTTGTAAACTTCTCAGCCCCAGAATAGAAACCAAGGATTTTGACAATTCATCTTTACTGTTTGAGGTCATACTTTCTAAATATGGTGACTCTGAATCTATAGATGAGATTTTACTATTAAATGCTGTCATGAATCAAGGACGCCAACTTCTACGGTTTTTACAAGATGAAGACGCAGAGGAAGAATTGTATGAGATCAAGGCTATTGTCTATGAGATTTCAGCAATCTCAAGCGACACTCATAGCTTATCCATATCCCCATTATTGAAAGAGTGTGACAGGAGAAAAAAGACAATAGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGTGGAATGTCTGACTCCTGAGTTGTGGGAATCCTTGTTTGTTGATAACGGCATAGGCTTCCGAAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTGTCAGAAGAGGATGGTTTCGAACTGTGTAATACTGCATCGGCTAAATCTAAGAAGCGAAAAAGAGGGGGCAAAGGTAGAAAAAGAAGAAAAAGGAACTTTGACGAAGAGGATAGCTGTGACGATGAGCTGTTGGACTTTGATATTAAAAATGATAGAACAGATTTGAAGTTAAAATCTGGGAGTTGGTTGCTTTCCATTGACAACTATACTGTACCATGGAATGCTGTAAGTACACTAGTCATCAGCCCACATTTCCGTTACTGATTGTAGATTAAAGTAAATAGTCATGGTATGAGGATACAAGCTTGTGAAGTGTCTGTTAAATATGTTTACTTACTAATTTATCTGGGGAATCATTTTATTTCTTTGGTAATAGGATCTTCCTTGGTTGTGGTTTGTGTACCTCTAATAGTTTAAATGAAGGTCCTGTGTCAATTCTTTGTTTACTGCCACATTAGTTCATATCAAACTTTAATGACTAGTTCCTAAAGAATTGAAATTTCTAATAGAAGAGCTCTTCTAATATGCAACCTGGCTTAGGAGGTGTTGTTCCATCCCCGTTCAAAGAAAAAGGGGAGTTTTGTGGAGGTCGATATGTTAGAAATTTTGTGGAACATCTAACTTGAGAGAAACAAGAGGATATTTAAGGGGAGGATGAAAGGTTTGGTCCACAATTAGATCAATACGTCCTTCTGTTGTTGAAAAAGTTTTGTATTTATCCTCTGTTGCTTATTCTAAACAATTGGAGCTCTTTCTTTTAGTTTTTTTGGCCTTTTCCATTGTTTCTGTTTCAGCCGCCGTGGAATGCCTTTTCATTTGCTCTATATGGCTGTTTCATTTAAATAAATAAAAAATAATAATAATGCTTGTTATCCGAAGCTTGGCATACTACAAGTCTGTGTTTGATCTTATGGTTATCATGAAATACTTGATTACATACAATTTTATTACTTGAAGAAGTAAAGGTTCAGTGTTTCGTCCTATTGATTCTGCTCTTCTGCATATTCCATGTATTCAATGAAATTAGTTACCTGGGAAAAAAAAAAGGTTCTGCAG
TTTTCTGATTATATTCCTCTCTTGTAGATGGATCTACCAGAACACCTATCGAAGCATTGTATGGCTTCATGGATGAAATGGCTCTTTGCTAAGCGGGAATGAAGAATGAAGCTTTGCTAGTGTGAAAGATATTCAGCATTGGAAGGATACTTGATTCGACATTCTGAAACCGTGATGGCCGCAATAAGATTACTGTATCGATAGATATATGCACGATCAAATGTATTGGTAAGGCAAGGAATGATGTGATGGCCTCAGGTTCATCATACAATTCATGGTATCAAGTTTTCTCACAGCCTTCTGTTATATAATATTTCTGTCATAGAATACAATTAATCATATGTCAACTAACGAATTACAACTGAACTGTATCATTGTTCTTTGATTGTAAGTTGTTGCCGATCAGGAAAATACGAGGATTAGACACTCAGGTTAGTTGCCACTTTTACTGCCTTTTTTGGTAGAATCAATGTGTTTAGTTTTGGTACTGTTTGTATATATAAAATTCTGAATCTAGTTCACAAAATTGCTGGTTCTTGTTTCTCTTGTGCTAGTGATAAAAATTTCTCGTTCATGCATCACATTCTGTAGATGTAGAATTCATGGAATAATATGTTTCACTAAAATTGGGAGTGATAGAAGCAAGATTCAAAATCTCTTAACCCAC
mRNA sequence
CTAAGTTTGAAATTGCAGGAACCACTGCTAGGAAATTCCAGAGAACTTCTCCCTGAAAATGTCGATTTCGAATGCAATTATACAGTTATACTTTCACATACAGAAAATTACAGAGCGTTGGGGTTCTTGTGAGCTCTGAGTTGTTCTATTCTCTGTTTTTTGGTTCTTAATCGAGAAAGAAGCGTAATCTAGGTGTAAAATGATCGATCTGTTTCTAGTAGAGCCCATCTTCAACGAAGAAGAGGATGCTGGCTCCGCGAAGTCGAGAATTTCTCTGTTAAGTAGATTAGAATCTGTTTTACGGAAATTGATGGCTTCTGGAGGACGGTCGGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGCATGACTTCCATCAGTCCCCAGCATCAACGAGACCTGTTCATGACCTTTCTGAGACTGAAGCCACTGAAGTGGGCCTTCGCATCTCAATTACTGCAAATGTTGTTTGAAAAGAGACAACGAGAGGCAGGAATTCTCATTGCCAAGAGAAGCTACATCATGGAAAAGTTTTTTGAAGATTCCTGATCATGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACAAATGGTGCATCAGATCATGGGAAAGGTGCCAAGGCCCTGGCACAGTTTGCTTTTGTAAATCGAGACATTTGCTGGGAGGAGCTTGAGTGGAAAGGGAAGCACGGGCAATCGCCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTAAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCCGAATCACTAAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTCGTGACATATCTCATCGATCTGATGCTTAAAGATGATTCAAGAGATGTTTTGGATGTCATTAATGAGTTCCTAATGCAGGAGTCATTTTCTTCGTTGTGTCAACACCTCCTTATTACTCTTGAAGAGGCGGATTTCTGCTACTTTTTAAAAATTCTTTGTAAACTTCTCAGCCCCAGAATAGAAACCAAGGATTTTGACAATTCATCTTTACTGTTTGAGGTCATACTTTCTAAATATGGTGACTCTGAATCTATAGATGAGATTTTACTATTAAATGCTGTCATGAATCAAGGACGCCAACTTCTACGGTTTTTACAAGATGAAGACGCAGAGGAAGAATTGTATGAGATCAAGGCTATTGTCTATGAGATTTCAGCAATCTCAAGCGACACTCATAGCTTATCCATATCCCCATTATTGAAAGAGTGTGACAGGAGAAAAAAGACAATAGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGTGGAATGTCTGACTCCTGAGTTGTGGGAATCCTTGTTTGTTGATAACGGCATAGGCTTCCGAAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTGTCAGAAGAGGATGGTTTCGAACTGTGTAATACTGCATCGGCTAAATCTAAGAAGCGAAAAAGAGGGGGCAAAGGTAGAAAAAGAAGAAAAAGGAACTTTGACGAAGAGGATAGCTGTGACGATGAGCTGTTGGACTTTGATATTAAAAATGATAGAACAGATTTGAAGTTAAAATCTGGGAGTTGGTTGCTTTCCATTGACAACTATACTGTACCATGGAATGCTATGGATCTACCAGAACACCTATCGAAGCATTGTATGGCTTCATGGATGAAATGGCTCTTTGCTAAGCGGGAATGAAGAATGAAGCTTTGCTAGTGTGAAAGATATTCAGCATTGGAAGGATACTTGATTCGACATTCTGAAACCGTGATGGCCGCAATAAGATTACTGTATCGATAGATATATGCACGATCAAATGTATTGGTAAGGCAAGGAATGATGTGATGGCCTCAGGTTCATCATACAATTCATGGTATCAAGTTTTCTCACAGCCTTCTGTT
ATATAATATTTCTGTCATAGAATACAATTAATCATATGTCAACTAACGAATTACAACTGAACTGTATCATTGTTCTTTGATTGTAAGTTGTTGCCGATCAGGAAAATACGAGGATTAGACACTCAGGTTAGTTGCCACTTTTACTGCCTTTTTTGGTAGAATCAATGTGTTTAGTTTTGGTACTGTTTGTATATATAAAATTCTGAATCTAGTTCACAAAATTGCTGGTTCTTGTTTCTCTTGTGCTAGTGATAAAAATTTCTCGTTCATGCATCACATTCTGTAGATGTAGAATTCATGGAATAATATGTTTCACTAAAATTGGGAGTGATAGAAGCAAGATTCAAAATCTCTTAACCCAC
Coding sequence (CDS)
ATGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACAAATGGTGCATCAGATCATGGGAAAGGTGCCAAGGCCCTGGCACAGTTTGCTTTTGTAAATCGAGACATTTGCTGGGAGGAGCTTGAGTGGAAAGGGAAGCACGGGCAATCGCCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTAAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCCGAATCACTAAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTCGTGACATATCTCATCGATCTGATGCTTAAAGATGATTCAAGAGATGTTTTGGATGTCATTAATGAGTTCCTAATGCAGGAGTCATTTTCTTCGTTGTGTCAACACCTCCTTATTACTCTTGAAGAGGCGGATTTCTGCTACTTTTTAAAAATTCTTTGTAAACTTCTCAGCCCCAGAATAGAAACCAAGGATTTTGACAATTCATCTTTACTGTTTGAGGTCATACTTTCTAAATATGGTGACTCTGAATCTATAGATGAGATTTTACTATTAAATGCTGTCATGAATCAAGGACGCCAACTTCTACGGTTTTTACAAGATGAAGACGCAGAGGAAGAATTGTATGAGATCAAGGCTATTGTCTATGAGATTTCAGCAATCTCAAGCGACACTCATAGCTTATCCATATCCCCATTATTGAAAGAGTGTGACAGGAGAAAAAAGACAATAGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGTGGAATGTCTGACTCCTGAGTTGTGGGAATCCTTGTTTGTTGATAACGGCATAGGCTTCCGAAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTGTCAGAAGAGGATGGTTTCGAACTGTGTAATACTGCATCGGCTAAATCTAAGAAGCGAAAAAGAGGGGGCAAAGGTAGAAAAAGAAGAAAAAGGAACTTTGACGAAGAGGATAGCTGTGACGATGAGCTGTTGGACTTTGATATTAAAAATGATAGAACAGATTTGAAGTTAAAATCTGGGAGTTGGTTGCTTTCCATTGACAACTATACTGTACCATGGAATGCTATGGATCTACCAGAACACCTATCGAAGCATTGTATGGCTTCATGGATGAAATGGCTCTTTGCTAAGCGGGAATGA
Protein sequence
MLSLLDSAGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE
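The protein sequence above can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code (the page does not state which translation table was used). The names `translate`, `cds_start` and `cds_end` are illustrative, and only the first 39 and last 21 nucleotides of the CDS are used so that the example stays short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the page does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 39 and last 21 nucleotides of the CDS listed above.
cds_start = "ATGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGA"
cds_end = "CTCTTTGCTAAGCGGGAATGA"

print(translate(cds_start))  # MLSLLDSAGNPRR
print(translate(cds_end))    # LFAKRE
```

Both fragments agree with the start (MLSLLDSAGNPRR) and end (LFAKRE) of the protein sequence listed above. Translating the full CDS the same way would allow a residue-by-residue comparison.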
Homology
BLAST of Tan0022110 vs. NCBI nr
Match:
XP_023540456.1 (uncharacterized protein LOC111800821 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 916.4 bits (2367), Expect = 1.1e-262
Identity = 461/520 (88.65%), Positives = 481/520 (92.50%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EP+FNEEED GSAK RISLLSRLESVL KL+ASGGRSEVRLWL NTIASMTSI
Sbjct: 1 MIDLFLAEPVFNEEEDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLYNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQR+LFMTFLR KPL W FAS LLQMLFEKR REAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61 SPQHQRELFMTFLRSKPLNWDFASHLLQMLFEKRPREAGVLIAKRSYIMEKFFEGNPRRI 120
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFIKNVPEFW SNEFAESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFAESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFL QESFSSLCQHLLITLEEADFC FLK+LCKLL PR+ETKDF NSSLLFEVILSKYGD
Sbjct: 241 EFLTQESFSSLCQHLLITLEEADFCCFLKMLCKLLRPRMETKDFGNSSLLFEVILSKYGD 300
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
+ES+D+ILLLNAV+NQGRQLLRF+QDEDAEEEL EIK I+YEISAISSDTHSL SPLLK
Sbjct: 301 AESLDQILLLNAVINQGRQLLRFVQDEDAEEELDEIKTIIYEISAISSDTHSL--SPLLK 360
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWESLFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWESLFVDNGICFRKSNEYALLDHSC 420
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
LSE+DGFE CNTAS KSKKRKR KGRKRRKRN D+EDSCDDELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRVKKGRKRRKRNSDDEDSCDDELLDFDIKRDKTDLKLNT 480
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
GSWLLSIDNYTVPWNA+DLPE+LSK CMASWMKWLF+ RE
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKQCMASWMKWLFSNRE 518
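The two percentages reported for the hit above (XP_023540456.1) are plain ratios of matching alignment columns to the alignment length. The short Python sketch below recomputes them from the counts on the Identity line; the helper name `percent` is illustrative and not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length."""
    return 100.0 * matches / aligned_length


# Counts reported for XP_023540456.1: 461 identical and 481 positive
# (identical or conservatively substituted) columns out of 520.
identity = percent(461, 520)
positives = percent(481, 520)

print(f"identity = {identity:.2f}%, positives = {positives:.2f}%")
# identity = 88.65%, positives = 92.50%
```

The denominator is the alignment length including gap columns, not the query length, which is why hits with long inserted segments report lower percentages for a similar number of identical residues.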
BLAST of Tan0022110 vs. NCBI nr
Match:
XP_022948246.1 (uncharacterized protein LOC111451855 [Cucurbita moschata])
HSP 1 Score: 913.7 bits (2360), Expect = 7.3e-262
Identity = 457/520 (87.88%), Positives = 480/520 (92.31%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1 MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61 SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFIKNVPEFW SNEF+ESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFSESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFL QE FSSLCQHLLITLEEADFC FLK+LCKLL P ETKDF NSS LFEV+LSKYGD
Sbjct: 241 EFLTQEPFSSLCQHLLITLEEADFCCFLKMLCKLLRPSRETKDFGNSSFLFEVVLSKYGD 300
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
+ES+D+ILLLNAV+NQGRQLLRF+QDEDAEEEL EIK I+YEISAISS+THSL SPLLK
Sbjct: 301 AESLDQILLLNAVINQGRQLLRFVQDEDAEEELDEIKTIIYEISAISSNTHSL--SPLLK 360
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWESLFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWESLFVDNGICFRKSNEYALLDHSC 420
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
LSE+DGFE CNTAS KSKKRKRG KGRKRRKR+FD+EDSCDDELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRDFDDEDSCDDELLDFDIKRDKTDLKLNT 480
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
GSWLLSIDNYTVPWNA+DLPE+LSK CMASWMKWLF RE
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKQCMASWMKWLFTNRE 518
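Each HSP line gives a bit score followed by the raw alignment score in parentheses. The two are related by the Karlin-Altschul normalization S' = (lambda * S - ln K) / ln 2. The Python sketch below applies it under a stated assumption: the page does not report the scoring parameters, so the gapped values commonly cited for BLASTP defaults (BLOSUM62, gap open 11, gap extend 1: lambda = 0.267, K = 0.041) are used here. The function name `bit_score` is illustrative.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap costs 11/1).
# These are NOT stated on the page; they are the values commonly
# reported for BLASTP defaults.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: float) -> float:
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


# Raw scores reported for the two hits above.
print(round(bit_score(2360), 1))  # 913.7
print(round(bit_score(2367), 1))  # 916.4
```

Under these assumed parameters the computed values match the bit scores reported for the two hits above (913.7 and 916.4), which suggests, but does not confirm, that default BLASTP scoring was used.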
BLAST of Tan0022110 vs. NCBI nr
Match:
XP_023005966.1 (uncharacterized protein LOC111498825 [Cucurbita maxima])
HSP 1 Score: 910.6 bits (2352), Expect = 6.2e-261
Identity = 459/518 (88.61%), Positives = 478/518 (92.28%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EPIFNEEED GSAK RISLLSRLE+VL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1 MIDLFLAEPIFNEEEDVGSAKLRISLLSRLETVLWKLLASGGRSEVRLWLSNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61 SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFIKNVPEFW SNEFAESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFAESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFL QESFSSLCQHLLITLEEADFC FLK+LCKLL P +ETKDF NSS LFEVILSKYGD
Sbjct: 241 EFLTQESFSSLCQHLLITLEEADFCCFLKMLCKLLRPSLETKDFGNSSFLFEVILSKYGD 300
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
SES+D+ILLLNAV+N+GRQLLRF+QDEDAEEEL EIK I+YEISAISSDTHSL SPLLK
Sbjct: 301 SESLDQILLLNAVINRGRQLLRFVQDEDAEEELDEIKNIIYEISAISSDTHSL--SPLLK 360
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWE LFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWEFLFVDNGICFRKSNEYALLDHSC 420
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
LSE+DGFE CNTAS KSKKRKRG KGRKRRKRN D+EDSCD ELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRNSDDEDSCDYELLDFDIKRDKTDLKLNT 480
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAK 519
GSWLLSIDNYTVPWNA+DLPE+LSKHCMASWMKWL K
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKHCMASWMKWLRRK 516
BLAST of Tan0022110 vs. NCBI nr
Match:
KAG6596711.1 (hypothetical protein SDJN03_09891, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 902.5 bits (2331), Expect = 1.7e-258
Identity = 460/556 (82.73%), Positives = 484/556 (87.05%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1 MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFE------ 120
SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFE
Sbjct: 61 SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEESVLVD 120
Query: 121 ------------------------------GNPRRISQWFSNFATNGASDHGKGAKALAQ 180
GNPRRISQWFSNFATNGASDHGKGAKALAQ
Sbjct: 121 DALAFYDTMLYVISVVNEIFLIMLSLLDSAGNPRRISQWFSNFATNGASDHGKGAKALAQ 180
Query: 181 FAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAE 240
F+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFW SNEFAE
Sbjct: 181 FSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWYSNEFAE 240
Query: 241 SLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADF 300
SLKDGEILFLDTKFFV YL D MLKDDSRDV D INEFL QESFSSLCQHL+ITLEEADF
Sbjct: 241 SLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAINEFLTQESFSSLCQHLIITLEEADF 300
Query: 301 CYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFL 360
C FLK+LCKLL PR+ETKDF NSS LFEVILSKYGD+ES+D+ILLLNAV+NQGRQLLRF+
Sbjct: 301 CCFLKMLCKLLRPRMETKDFGNSSFLFEVILSKYGDAESLDQILLLNAVINQGRQLLRFV 360
Query: 361 QDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHY 420
QDEDAEEEL EIK I+YEISAISS+THSL SPLLKEC RRKKTIEVIKWLGLQSWVLHY
Sbjct: 361 QDEDAEEELDEIKTIIYEISAISSNTHSL--SPLLKECYRRKKTIEVIKWLGLQSWVLHY 420
Query: 421 RMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGG 480
RMS EC T ELWESLFVDNGI FRKSNEYALLDHSCLSE+DGFE CNTAS KSKKRKRG
Sbjct: 421 RMSDECQTSELWESLFVDNGICFRKSNEYALLDHSCLSEDDGFEPCNTASVKSKKRKRGK 480
Query: 481 KGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLS 521
KGRKRRKR+FD+EDSCDDELLDFDIK D+TDLKL +GSWLLSIDNYTVPWNA+DLPE+LS
Sbjct: 481 KGRKRRKRDFDDEDSCDDELLDFDIKRDKTDLKLNTGSWLLSIDNYTVPWNAIDLPEYLS 540
BLAST of Tan0022110 vs. NCBI nr
Match:
KAG7028247.1 (hypothetical protein SDJN02_09428, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 901.0 bits (2327), Expect = 4.9e-258
Identity = 459/552 (83.15%), Positives = 483/552 (87.50%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1 MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFE------ 120
SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFE
Sbjct: 61 SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEVDDALA 120
Query: 121 --------------------------GNPRRISQWFSNFATNGASDHGKGAKALAQFAFV 180
GNPRRISQWFSNFATNGASDHGKGAKALAQF+FV
Sbjct: 121 FYDTMLYVISVVNEIFLIMLSLLDSAGNPRRISQWFSNFATNGASDHGKGAKALAQFSFV 180
Query: 181 NRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAESLKD 240
NRDICWEELEW GKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFW SNEFAESLKD
Sbjct: 181 NRDICWEELEWNGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWYSNEFAESLKD 240
Query: 241 GEILFLDTKFFVTYLIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADFCYFL 300
GEILFLDTKFFV YL D MLKDDSRDV D INEFL QESFSSLCQHL+ITLEEADFC FL
Sbjct: 241 GEILFLDTKFFVKYLFDQMLKDDSRDVWDAINEFLTQESFSSLCQHLIITLEEADFCCFL 300
Query: 301 KILCKLLSPRIETKDFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFLQDED 360
K+LCKLL P +ETKDF NSS LFEVILSKYGD+ES+D+ILLLNAV+NQGRQLLRF+QDED
Sbjct: 301 KMLCKLLRPIMETKDFGNSSFLFEVILSKYGDAESLDQILLLNAVINQGRQLLRFVQDED 360
Query: 361 AEEELYEIKAIVYEISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHYRMSV 420
AEEEL EIK I+YEISAISS+THSL SPLLKEC RRKKTIEVIKWLGLQSWVLHYRMS
Sbjct: 361 AEEELDEIKTIIYEISAISSNTHSL--SPLLKECYRRKKTIEVIKWLGLQSWVLHYRMSD 420
Query: 421 ECLTPELWESLFVDNGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGGKGRK 480
EC T ELWESLFVDNGI FRKSNEYALLDHSCLSE+DGFE CNTAS KSKKRKRG KGRK
Sbjct: 421 ECQTSELWESLFVDNGICFRKSNEYALLDHSCLSEDDGFEPCNTASVKSKKRKRGKKGRK 480
Query: 481 RRKRNFDEEDSCDDELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLSKHCM 521
RRKR+FD+EDSCDDELLDFDIK D+TDLKL +GSWLLSIDNYTVPWNA+DLPE+LSK CM
Sbjct: 481 RRKRDFDDEDSCDDELLDFDIKRDKTDLKLNTGSWLLSIDNYTVPWNAIDLPEYLSKQCM 540
BLAST of Tan0022110 vs. ExPASy TrEMBL
Match:
A0A6J1G8R1 (uncharacterized protein LOC111451855 OS=Cucurbita moschata OX=3662 GN=LOC111451855 PE=4 SV=1)
HSP 1 Score: 913.7 bits (2360), Expect = 3.5e-262
Identity = 457/520 (87.88%), Positives = 480/520 (92.31%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1 MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61 SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFIKNVPEFW SNEF+ESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFSESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFL QE FSSLCQHLLITLEEADFC FLK+LCKLL P ETKDF NSS LFEV+LSKYGD
Sbjct: 241 EFLTQEPFSSLCQHLLITLEEADFCCFLKMLCKLLRPSRETKDFGNSSFLFEVVLSKYGD 300
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
+ES+D+ILLLNAV+NQGRQLLRF+QDEDAEEEL EIK I+YEISAISS+THSL SPLLK
Sbjct: 301 AESLDQILLLNAVINQGRQLLRFVQDEDAEEELDEIKTIIYEISAISSNTHSL--SPLLK 360
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWESLFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWESLFVDNGICFRKSNEYALLDHSC 420
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
LSE+DGFE CNTAS KSKKRKRG KGRKRRKR+FD+EDSCDDELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRDFDDEDSCDDELLDFDIKRDKTDLKLNT 480
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
GSWLLSIDNYTVPWNA+DLPE+LSK CMASWMKWLF RE
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKQCMASWMKWLFTNRE 518
BLAST of Tan0022110 vs. ExPASy TrEMBL
Match:
A0A6J1KWG9 (uncharacterized protein LOC111498825 OS=Cucurbita maxima OX=3661 GN=LOC111498825 PE=4 SV=1)
HSP 1 Score: 910.6 bits (2352), Expect = 3.0e-261
Identity = 459/518 (88.61%), Positives = 478/518 (92.28%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL EPIFNEEED GSAK RISLLSRLE+VL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1 MIDLFLAEPIFNEEEDVGSAKLRISLLSRLETVLWKLLASGGRSEVRLWLSNTIASMTSI 60
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61 SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFIKNVPEFW SNEFAESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFAESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFL QESFSSLCQHLLITLEEADFC FLK+LCKLL P +ETKDF NSS LFEVILSKYGD
Sbjct: 241 EFLTQESFSSLCQHLLITLEEADFCCFLKMLCKLLRPSLETKDFGNSSFLFEVILSKYGD 300
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
SES+D+ILLLNAV+N+GRQLLRF+QDEDAEEEL EIK I+YEISAISSDTHSL SPLLK
Sbjct: 301 SESLDQILLLNAVINRGRQLLRFVQDEDAEEELDEIKNIIYEISAISSDTHSL--SPLLK 360
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWE LFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWEFLFVDNGICFRKSNEYALLDHSC 420
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
LSE+DGFE CNTAS KSKKRKRG KGRKRRKRN D+EDSCD ELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRNSDDEDSCDYELLDFDIKRDKTDLKLNT 480
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAK 519
GSWLLSIDNYTVPWNA+DLPE+LSKHCMASWMKWL K
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKHCMASWMKWLRRK 516
BLAST of Tan0022110 vs. ExPASy TrEMBL
Match:
A0A1S3BKS3 (uncharacterized protein LOC103490747 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490747 PE=4 SV=1)
HSP 1 Score: 892.9 bits (2306), Expect = 6.4e-256
Identity = 443/520 (85.19%), Positives = 476/520 (91.54%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLF++E FN+E+D SAK RISLLS LESVL KL+ GGRSEVRLWLSNTIAS+TSI
Sbjct: 21 MIDLFILESTFNDEQDVSSAKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSI 80
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQRDLFMT LR KPLKWAFASQLLQMLFEKR REAGILIAKRSYIMEKFFEGNPRRI
Sbjct: 81 SPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRI 140
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Sbjct: 141 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 200
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFI+NVPEFWSSNEFAESLKDGEILFLDTKFFV + IDLMLKDDS+DV +VIN
Sbjct: 201 DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKFFIDLMLKDDSKDVWEVIN 260
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFLM ESFSSLCQHLL+TLE+ADFC FLK+LCKLL PRIETKDF NSS +FEVIL+KYGD
Sbjct: 261 EFLMHESFSSLCQHLLVTLEDADFCNFLKVLCKLLRPRIETKDFGNSSFMFEVILAKYGD 320
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
SESID+ILLLNAV+NQGRQLLR L+DED EE+L EIKAI+++ISAISS++H L PLLK
Sbjct: 321 SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIIHKISAISSNSHCL--FPLLK 380
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
ECD RKKTIE+IKWLGLQSWVLHYR S EC TPELWESLFVDNGIGFRKSNEY LLDHSC
Sbjct: 381 ECDGRKKTIEMIKWLGLQSWVLHYRTSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSC 440
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
SE+DGFE CN A AKSKKRK+G KGRKRRKRNFD ++SCDDELLD DI+NDR DLKL +
Sbjct: 441 SSEDDGFEPCNRARAKSKKRKKGEKGRKRRKRNFDSQESCDDELLDLDIRNDRMDLKLNT 500
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
GSW LS D+YTVPWNA DLPEHLSK+CMASWMKWLFAKRE
Sbjct: 501 GSWFLSTDDYTVPWNAKDLPEHLSKYCMASWMKWLFAKRE 538
BLAST of Tan0022110 vs. ExPASy TrEMBL
Match:
A0A0A0L6D1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006760 PE=4 SV=1)
HSP 1 Score: 892.5 bits (2305), Expect = 8.4e-256
Identity = 446/520 (85.77%), Positives = 474/520 (91.15%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLFL E FN+E+D S K RISLLS LESVL KL+ GGRSEVRLWLSNTIAS+TSI
Sbjct: 21 MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSI 80
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
SPQHQRDLFMT LR KPLKWAFASQLLQMLFEKR REAGILIAKRSYIMEKFFEGNPRRI
Sbjct: 81 SPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRI 140
Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Sbjct: 141 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 200
Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
DVHQTVKNFI+NVPEFWSSNEFAESLKDGEILFLDTKFFV Y +DLMLKDD +DV +VIN
Sbjct: 201 DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVIN 260
Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
EFL ESFSSLCQHLL+TLEEADFC FLK+LCKLL PRIETKDF NSS +FEVIL+KYGD
Sbjct: 261 EFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD 320
Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
SESID+ILLLNAV+NQGRQLLR L+DED EE+L EIKAIV++IS+ISS+ H L PLLK
Sbjct: 321 SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCL--FPLLK 380
Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
ECD RKKTIE+IKWLGLQSWVLHYRMS EC TPELWESLFVDNGIGFRKSNEY LLDHSC
Sbjct: 381 ECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSC 440
Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
SE+DGFEL N A A+SKKRK+GGKGRKRRK NFD +DSCDDELLDFDIKNDR DLKL +
Sbjct: 441 SSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNT 500
Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
GSWLLS D+YTVPWNA DLPEHLSK+CMASWMKWLFAKRE
Sbjct: 501 GSWLLSTDDYTVPWNAKDLPEHLSKYCMASWMKWLFAKRE 538
BLAST of Tan0022110 vs. ExPASy TrEMBL
Match:
A0A1S3BK58 (uncharacterized protein LOC103490747 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490747 PE=4 SV=1)
HSP 1 Score: 881.7 bits (2277), Expect = 1.5e-252
Identity = 443/538 (82.34%), Positives = 476/538 (88.48%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
MIDLF++E FN+E+D SAK RISLLS LESVL KL+ GGRSEVRLWLSNTIAS+TSI
Sbjct: 21 MIDLFILESTFNDEQDVSSAKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSI 80
Query: 61 SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFE------ 120
SPQHQRDLFMT LR KPLKWAFASQLLQMLFEKR REAGILIAKRSYIMEKFFE
Sbjct: 81 SPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGDCSVC 140
Query: 121 ------------GNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK 180
GNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK
Sbjct: 141 LLMSNLAIEILAGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK 200
Query: 181 HGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTY 240
HGQSPAVVATKPHYFLDLDVHQTVKNFI+NVPEFWSSNEFAESLKDGEILFLDTKFFV +
Sbjct: 201 HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKF 260
Query: 241 LIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETK 300
IDLMLKDDS+DV +VINEFLM ESFSSLCQHLL+TLE+ADFC FLK+LCKLL PRIETK
Sbjct: 261 FIDLMLKDDSKDVWEVINEFLMHESFSSLCQHLLVTLEDADFCNFLKVLCKLLRPRIETK 320
Query: 301 DFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYE 360
DF NSS +FEVIL+KYGDSESID+ILLLNAV+NQGRQLLR L+DED EE+L EIKAI+++
Sbjct: 321 DFGNSSFMFEVILAKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIIHK 380
Query: 361 ISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVD 420
ISAISS++H L PLLKECD RKKTIE+IKWLGLQSWVLHYR S EC TPELWESLFVD
Sbjct: 381 ISAISSNSHCL--FPLLKECDGRKKTIEMIKWLGLQSWVLHYRTSEECQTPELWESLFVD 440
Query: 421 NGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDD 480
NGIGFRKSNEY LLDHSC SE+DGFE CN A AKSKKRK+G KGRKRRKRNFD ++SCDD
Sbjct: 441 NGIGFRKSNEYLLLDHSCSSEDDGFEPCNRARAKSKKRKKGEKGRKRRKRNFDSQESCDD 500
Query: 481 ELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
ELLD DI+NDR DLKL +GSW LS D+YTVPWNA DLPEHLSK+CMASWMKWLFAKRE
Sbjct: 501 ELLDLDIRNDRMDLKLNTGSWFLSTDDYTVPWNAKDLPEHLSKYCMASWMKWLFAKRE 556
BLAST of Tan0022110 vs. TAIR 10
Match:
AT5G48340.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archaea - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).)
HSP 1 Score: 520.0 bits (1338), Expect = 2.2e-147
Identity = 273/523 (52.20%), Positives = 373/523 (71.32%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEE-EDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTS 60
M++LFL EP +N++ + + + + LL++L S ++ L+ G RSE RLWL + ++++ S
Sbjct: 1 MVNLFLSEPKWNDDAQKSSNINVILPLLNKLGSQIQSLVTHGARSEARLWLCSALSTI-S 60
Query: 61 ISPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRR 120
ISP Q ++FM LR KP K F SQ+L M+FEKR R+ G L+AKRSYI+EKFFEGN +R
Sbjct: 61 ISPSKQLNIFMKLLRSKPRKMQFLSQVLTMMFEKRPRKLGFLLAKRSYILEKFFEGNQKR 120
Query: 121 ISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLD 180
I +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LD
Sbjct: 121 ILEWFSEFAYDGGSDHKRGAKALAQFAFANRDICWEELEWRGKHGQSPAVVATKPHYLLD 180
Query: 181 LDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVI 240
LDV +T++NF+ NVPEFWSSNEFAESLKDG+ILFLDTKFF+ I M ++D DV D +
Sbjct: 181 LDVERTIQNFLDNVPEFWSSNEFAESLKDGQILFLDTKFFIDLFIRFMYEEDMYDVWDAV 240
Query: 241 NEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYG 300
EFL +ESFSSL QHLLITLEE D C FL++L P IE+ D +SS V+LS+Y
Sbjct: 241 EEFLREESFSSLTQHLLITLEERDLCRFLELLGNYFEPGIESWDSGDSSRWLGVLLSRYV 300
Query: 301 DSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLL 360
D+ESIDE+LLL++++NQGRQLLR ++DE+ +E +K + EI + S S+ +L
Sbjct: 301 DTESIDELLLLSSIINQGRQLLRLVRDENGNDEGELLKETMAEICRGLENESSFSV--IL 360
Query: 361 KECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHS 420
+E + K I+VIK LGL SW +H+R+S EC TP+ WE LF +NGI FR+S++++LL ++
Sbjct: 361 RELSKMKH-IQVIKLLGLLSWTIHFRLSEECQTPDSWELLFRENGIEFRRSSDHSLLSYN 420
Query: 421 CLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNF--DEEDSCDDELLDFDIKNDRTDLK 480
SEE + + + K+ KR K RK++K+ D++D DDELL DL
Sbjct: 421 GFSEESESDSDSRSRVSKKRHKREKKKRKKKKKRAFDDDDDRGDDELL---------DLH 480
Query: 481 LKSGSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
S SWLLS D ++ W ++DLPE+++K+C+++WMK L A+++
Sbjct: 481 SISRSWLLSTDGFSATWTSVDLPEYIAKYCLSTWMKGLLARQK 510
BLAST of Tan0022110 vs. TAIR 10
Match:
AT5G48340.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages. )
HSP 1 Score: 489.2 bits (1258), Expect = 4.1e-138
Identity = 262/499 (52.51%), Positives = 351/499 (70.34%), Query Frame = 0
Query: 1 MIDLFLVEPIFNEE-EDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTS 60
M++LFL EP +N++ + + + + LL++L S ++ L+ G RSE RLWL + ++++ S
Sbjct: 1 MVNLFLSEPKWNDDAQKSSNINVILPLLNKLGSQIQSLVTHGARSEARLWLCSALSTI-S 60
Query: 61 ISPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRR 120
ISP Q ++FM LR KP K F SQ+L M+FEKR R+ G L+AKRSYI+EKFFEGN +R
Sbjct: 61 ISPSKQLNIFMKLLRSKPRKMQFLSQVLTMMFEKRPRKLGFLLAKRSYILEKFFEGNQKR 120
Query: 121 ISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLD 180
I +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LD
Sbjct: 121 ILEWFSEFAYDGGSDHKRGAKALAQFAFANRDICWEELEWRGKHGQSPAVVATKPHYLLD 180
Query: 181 LDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVI 240
LDV +T++NF+ NVPEFWSSNEFAESLKDG+ILFLDTKFF+ I M ++D DV D +
Sbjct: 181 LDVERTIQNFLDNVPEFWSSNEFAESLKDGQILFLDTKFFIDLFIRFMYEEDMYDVWDAV 240
Query: 241 NEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYG 300
EFL +ESFSSL QHLLITLEE D C FL++L P IE+ D +SS V+LS+Y
Sbjct: 241 EEFLREESFSSLTQHLLITLEERDLCRFLELLGNYFEPGIESWDSGDSSRWLGVLLSRYV 300
Query: 301 DSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLL 360
D+ESIDE+LLL++++NQGRQLLR ++DE+ +E +K + EI + S S+ +L
Sbjct: 301 DTESIDELLLLSSIINQGRQLLRLVRDENGNDEGELLKETMAEICRGLENESSFSV--IL 360
Query: 361 KECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHS 420
+E + K I+VIK LGL SW +H+R+S EC TP+ WE LF +NGI FR+S++++LL ++
Sbjct: 361 RELSKMKH-IQVIKLLGLLSWTIHFRLSEECQTPDSWELLFRENGIEFRRSSDHSLLSYN 420
Query: 421 CLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNF--DEEDSCDDELLDFDIKNDRTDLK 480
SEE + + + K+ KR K RK++K+ D++D DDELL DL
Sbjct: 421 GFSEESESDSDSRSRVSKKRHKREKKKRKKKKKRAFDDDDDRGDDELL---------DLH 480
Query: 481 LKSGSWLLSIDNYTVPWNA 497
S SWLLS D ++ W +
Sbjct: 481 SISRSWLLSTDGFSATWTS 486
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023540456.1 | 1.1e-262 | 88.65 | uncharacterized protein LOC111800821 [Cucurbita pepo subsp. pepo]
XP_022948246.1 | 7.3e-262 | 87.88 | uncharacterized protein LOC111451855 [Cucurbita moschata]
XP_023005966.1 | 6.2e-261 | 88.61 | uncharacterized protein LOC111498825 [Cucurbita maxima]
KAG6596711.1 | 1.7e-258 | 82.73 | hypothetical protein SDJN03_09891, partial [Cucurbita argyrosperma subsp. sorori...
KAG7028247.1 | 4.9e-258 | 83.15 | hypothetical protein SDJN02_09428, partial [Cucurbita argyrosperma subsp. argyro...
Match Name | E-value | Identity (%) | Description
A0A6J1G8R1 | 3.5e-262 | 87.88 | uncharacterized protein LOC111451855 OS=Cucurbita moschata OX=3662 GN=LOC1114518...
A0A6J1KWG9 | 3.0e-261 | 88.61 | uncharacterized protein LOC111498825 OS=Cucurbita maxima OX=3661 GN=LOC111498825...
A0A1S3BKS3 | 6.4e-256 | 85.19 | uncharacterized protein LOC103490747 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0L6D1 | 8.4e-256 | 85.77 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006760 PE=4 SV=1
A0A1S3BK58 | 1.5e-252 | 82.34 | uncharacterized protein LOC103490747 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
Match Name | E-value | Identity (%) | Description
AT5G48340.1 | 2.2e-147 | 52.20 | unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0...
AT5G48340.2 | 4.1e-138 | 52.51 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...