Tan0022110 (gene) Snake gourd v1

Overview
Name: Tan0022110
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG01: 26566770 .. 26571436 (+)
RNA-Seq Expression: Tan0022110
Synteny: Tan0022110
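
The location field packs the linkage group, start, end, and strand into one string. A minimal sketch of how it could be split apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the usual GFF convention), which this record does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'LG01: 26566770 .. 26571436 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("LG01: 26566770 .. 26571436 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 4667 bp on the forward strand of LG01.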
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTAAGTTTGAAATTGCAGGAACCACTGCTAGGAAATTCCAGAGAACTTCTCCCTGAAAATGTCGATTTCGAATGCAATTATACAGTTATACTTTCACATACAGAAAATTACAGAGCGTTGGGGTTCTTGTGAGCTCTGAGTTGTTCTATTCTCTGTAAGTTGTCTTGAAATTTTCATACTATTATTGCCTTTACGATTTTCTCAATTACCGAAGTTTCAATGCTTTGCTGAATAATTTTTCAGAAGATTTGTTTCATGCCAGAGTTGGTATCAATTCGAGCTTATGTCATCAAACCCTTCTTACACGTATCGATTACTAGACTTTGATCAACCAGGTATTGTTGTTGTCTCTGATTTATGTTTGAATTCATTTTCGTAGGTTTTTTGGTTCTTAATCGAGAAAGAAGCGTAATCTAGGTGTAAAATGATCGATCTGTTTCTAGTAGAGCCCATCTTCAACGAAGAAGAGGATGCTGGCTCCGCGAAGTCGAGAATTTCTCTGTTAAGTAGATTAGAATCTGTTTTACGGAAATTGATGGCTTCTGGAGGACGGTCGGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGCATGACTTCCATCAGTCCCCAGCATCAACGAGACCTGTTCATGACCTTTCTGAGACTGAAGCCACTGAAGTGGGCCTTCGCATCTCAATTACTGCAAATGTTGTTTGAAAAGAGACAACGAGAGGCAGGAATTCTCATTGCCAAGAGAAGCTACATCATGGAAAAGTTTTTTGAAGGTAATAGTTCTCTCTACTCATTGTCAAATTTAGCTTTGGCAGTTCTATCTGATTGCAAGAACACTTAACCAAATCTTCATTCTTTCAAATTGCTTTTATAGATAAAATATTATTTGGCTGGAAAATAATAATCCGAACCTTGTTCTTAAGGATCCACTATGTATTTGTTGGGTGTAATGTGGATTTAAGAAGTATGGACATGGATATGAGACATGGACATGACACAACATGGACATGTTATTAGAATATAAATTTTTTAAATATATACATCTTAGGCCTTGTTCAATAACCAAGGTTTTTGGATTTTGAAAATTAAGCTTGTTTTCTCACACCTTCTCTACAAAAGTTTTCATCTCTATTAAGGAAACATTTGAATGCTTACCCAAATTTCAAAAACGAAAACAAGTTTTTGAAAACTAAAAAAAGTAGTTTTTGGAATTTTGAATTTGGCTAAGAACTCAAATGTTTCCTTAAAAGAGAACCGTTGTAAATTGGGAGAAAACAAGCTTAATTTTCAAAAATCGAAGGGATATCAAATGAAGCCTTAATGATTCTCTTGCTAAAAGTTTGTTTTAGTTGATGATGCTTTTGCATTTATGGCACATCGCTTTATGTTATCAATTTTGTAAATGAAATATATGCTTAACAAAAAGGAAGTTTAAAGTCAATATGCTTATGCACTTATATGTATTTCACTTTATGTTCCTAAGTGCTTAATGTTAATTGTGCTTAACAAGTATTCCAACAAGTATTCAAGTATTAGACTCATATTCAATTTGTTCAACTAGCGCTGGACACATGTCAATTATGTTTAATAAGTATCCGACATTTGTGCAACAAATGTTAGAGTGCCAAATATTGTATCGGACACAGGCATGTTGCTCAAACTGAAGTCTTTGTTCTTCGTAGATGTAAACTGCAGCCTCTCTAAACAGTAGTTTCACATGAACTTGACTATTGGAGACCGGGGATGTAACTATGTCTGGTGTGGAATGTTGCTGGATGTGTCCTTTTTCTTATTCCTTTTGTTATTCATATGCCATAATTTATCTCGTGTTTAGTTTCCTTGTCTATCAATGGATTTTAACTTTAACTTAAGACACTGAAACTGCTTCCCTTGACAGGGGAGTTAGTTTATTAATGACCCAACAGACTATTCAATTACAATAGAATGGTTTGCGCTAGTTCTGTCTTTGATGATTTGCTTTCACTTTCCATGCAGATTCCTGATCA
TGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACAAATGGTGCATCAGATCATGGGAAAGGTGCCAAGGCCCTGGCACAGTTTGCTTTTGTAAATCGAGACATTTGCTGGGAGGAGCTTGAGTGGAAAGGGAAGCACGGGCAATCGCCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTAAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCCGAATCACTAAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTCGTGACATATCTCATCGATCTGATGCTTAAAGATGATTCAAGAGATGTTTTGGATGTCATTAATGAGTTCCTAATGCAGGAGTCATTTTCTTCGTTGTGTCAACACCTCCTTATTACTCTTGAAGAGGCGGATTTCTGCTACTTTTTAAAAATTCTTTGTAAACTTCTCAGCCCCAGAATAGAAACCAAGGATTTTGACAATTCATCTTTACTGTTTGAGGTCATACTTTCTAAATATGGTGACTCTGAATCTATAGATGAGATTTTACTATTAAATGCTGTCATGAATCAAGGACGCCAACTTCTACGGTTTTTACAAGATGAAGACGCAGAGGAAGAATTGTATGAGATCAAGGCTATTGTCTATGAGATTTCAGCAATCTCAAGCGACACTCATAGCTTATCCATATCCCCATTATTGAAAGAGTGTGACAGGAGAAAAAAGACAATAGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGTGGAATGTCTGACTCCTGAGTTGTGGGAATCCTTGTTTGTTGATAACGGCATAGGCTTCCGAAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTGTCAGAAGAGGATGGTTTCGAACTGTGTAATACTGCATCGGCTAAATCTAAGAAGCGAAAAAGAGGGGGCAAAGGTAGAAAAAGAAGAAAAAGGAACTTTGACGAAGAGGATAGCTGTGACGATGAGCTGTTGGACTTTGATATTAAAAATGATAGAACAGATTTGAAGTTAAAATCTGGGAGTTGGTTGCTTTCCATTGACAACTATACTGTACCATGGAATGCTGTAAGTACACTAGTCATCAGCCCACATTTCCGTTACTGATTGTAGATTAAAGTAAATAGTCATGGTATGAGGATACAAGCTTGTGAAGTGTCTGTTAAATATGTTTACTTACTAATTTATCTGGGGAATCATTTTATTTCTTTGGTAATAGGATCTTCCTTGGTTGTGGTTTGTGTACCTCTAATAGTTTAAATGAAGGTCCTGTGTCAATTCTTTGTTTACTGCCACATTAGTTCATATCAAACTTTAATGACTAGTTCCTAAAGAATTGAAATTTCTAATAGAAGAGCTCTTCTAATATGCAACCTGGCTTAGGAGGTGTTGTTCCATCCCCGTTCAAAGAAAAAGGGGAGTTTTGTGGAGGTCGATATGTTAGAAATTTTGTGGAACATCTAACTTGAGAGAAACAAGAGGATATTTAAGGGGAGGATGAAAGGTTTGGTCCACAATTAGATCAATACGTCCTTCTGTTGTTGAAAAAGTTTTGTATTTATCCTCTGTTGCTTATTCTAAACAATTGGAGCTCTTTCTTTTAGTTTTTTTGGCCTTTTCCATTGTTTCTGTTTCAGCCGCCGTGGAATGCCTTTTCATTTGCTCTATATGGCTGTTTCATTTAAATAAATAAAAAATAATAATAATGCTTGTTATCCGAAGCTTGGCATACTACAAGTCTGTGTTTGATCTTATGGTTATCATGAAATACTTGATTACATACAATTTTATTACTTGAAGAAGTAAAGGTTCAGTGTTTCGTCCTATTGATTCTGCTCTTCTGCATATTCCATGTATTCAATGAAATTAGTTACCTGGGAAAAAAAAAAGGTTCTGCAG
TTTTCTGATTATATTCCTCTCTTGTAGATGGATCTACCAGAACACCTATCGAAGCATTGTATGGCTTCATGGATGAAATGGCTCTTTGCTAAGCGGGAATGAAGAATGAAGCTTTGCTAGTGTGAAAGATATTCAGCATTGGAAGGATACTTGATTCGACATTCTGAAACCGTGATGGCCGCAATAAGATTACTGTATCGATAGATATATGCACGATCAAATGTATTGGTAAGGCAAGGAATGATGTGATGGCCTCAGGTTCATCATACAATTCATGGTATCAAGTTTTCTCACAGCCTTCTGTTATATAATATTTCTGTCATAGAATACAATTAATCATATGTCAACTAACGAATTACAACTGAACTGTATCATTGTTCTTTGATTGTAAGTTGTTGCCGATCAGGAAAATACGAGGATTAGACACTCAGGTTAGTTGCCACTTTTACTGCCTTTTTTGGTAGAATCAATGTGTTTAGTTTTGGTACTGTTTGTATATATAAAATTCTGAATCTAGTTCACAAAATTGCTGGTTCTTGTTTCTCTTGTGCTAGTGATAAAAATTTCTCGTTCATGCATCACATTCTGTAGATGTAGAATTCATGGAATAATATGTTTCACTAAAATTGGGAGTGATAGAAGCAAGATTCAAAATCTCTTAACCCAC

mRNA sequence

CTAAGTTTGAAATTGCAGGAACCACTGCTAGGAAATTCCAGAGAACTTCTCCCTGAAAATGTCGATTTCGAATGCAATTATACAGTTATACTTTCACATACAGAAAATTACAGAGCGTTGGGGTTCTTGTGAGCTCTGAGTTGTTCTATTCTCTGTTTTTTGGTTCTTAATCGAGAAAGAAGCGTAATCTAGGTGTAAAATGATCGATCTGTTTCTAGTAGAGCCCATCTTCAACGAAGAAGAGGATGCTGGCTCCGCGAAGTCGAGAATTTCTCTGTTAAGTAGATTAGAATCTGTTTTACGGAAATTGATGGCTTCTGGAGGACGGTCGGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGCATGACTTCCATCAGTCCCCAGCATCAACGAGACCTGTTCATGACCTTTCTGAGACTGAAGCCACTGAAGTGGGCCTTCGCATCTCAATTACTGCAAATGTTGTTTGAAAAGAGACAACGAGAGGCAGGAATTCTCATTGCCAAGAGAAGCTACATCATGGAAAAGTTTTTTGAAGATTCCTGATCATGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACAAATGGTGCATCAGATCATGGGAAAGGTGCCAAGGCCCTGGCACAGTTTGCTTTTGTAAATCGAGACATTTGCTGGGAGGAGCTTGAGTGGAAAGGGAAGCACGGGCAATCGCCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTAAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCCGAATCACTAAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTCGTGACATATCTCATCGATCTGATGCTTAAAGATGATTCAAGAGATGTTTTGGATGTCATTAATGAGTTCCTAATGCAGGAGTCATTTTCTTCGTTGTGTCAACACCTCCTTATTACTCTTGAAGAGGCGGATTTCTGCTACTTTTTAAAAATTCTTTGTAAACTTCTCAGCCCCAGAATAGAAACCAAGGATTTTGACAATTCATCTTTACTGTTTGAGGTCATACTTTCTAAATATGGTGACTCTGAATCTATAGATGAGATTTTACTATTAAATGCTGTCATGAATCAAGGACGCCAACTTCTACGGTTTTTACAAGATGAAGACGCAGAGGAAGAATTGTATGAGATCAAGGCTATTGTCTATGAGATTTCAGCAATCTCAAGCGACACTCATAGCTTATCCATATCCCCATTATTGAAAGAGTGTGACAGGAGAAAAAAGACAATAGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGTGGAATGTCTGACTCCTGAGTTGTGGGAATCCTTGTTTGTTGATAACGGCATAGGCTTCCGAAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTGTCAGAAGAGGATGGTTTCGAACTGTGTAATACTGCATCGGCTAAATCTAAGAAGCGAAAAAGAGGGGGCAAAGGTAGAAAAAGAAGAAAAAGGAACTTTGACGAAGAGGATAGCTGTGACGATGAGCTGTTGGACTTTGATATTAAAAATGATAGAACAGATTTGAAGTTAAAATCTGGGAGTTGGTTGCTTTCCATTGACAACTATACTGTACCATGGAATGCTATGGATCTACCAGAACACCTATCGAAGCATTGTATGGCTTCATGGATGAAATGGCTCTTTGCTAAGCGGGAATGAAGAATGAAGCTTTGCTAGTGTGAAAGATATTCAGCATTGGAAGGATACTTGATTCGACATTCTGAAACCGTGATGGCCGCAATAAGATTACTGTATCGATAGATATATGCACGATCAAATGTATTGGTAAGGCAAGGAATGATGTGATGGCCTCAGGTTCATCATACAATTCATGGTATCAAGTTTTCTCACAGCCTTCTGTT
ATATAATATTTCTGTCATAGAATACAATTAATCATATGTCAACTAACGAATTACAACTGAACTGTATCATTGTTCTTTGATTGTAAGTTGTTGCCGATCAGGAAAATACGAGGATTAGACACTCAGGTTAGTTGCCACTTTTACTGCCTTTTTTGGTAGAATCAATGTGTTTAGTTTTGGTACTGTTTGTATATATAAAATTCTGAATCTAGTTCACAAAATTGCTGGTTCTTGTTTCTCTTGTGCTAGTGATAAAAATTTCTCGTTCATGCATCACATTCTGTAGATGTAGAATTCATGGAATAATATGTTTCACTAAAATTGGGAGTGATAGAAGCAAGATTCAAAATCTCTTAACCCAC

Coding sequence (CDS)

ATGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACAAATGGTGCATCAGATCATGGGAAAGGTGCCAAGGCCCTGGCACAGTTTGCTTTTGTAAATCGAGACATTTGCTGGGAGGAGCTTGAGTGGAAAGGGAAGCACGGGCAATCGCCTGCCGTGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCATCAAACTGTGAAGAATTTCATTAAGAATGTACCTGAGTTTTGGTCTTCCAATGAGTTTGCCGAATCACTAAAAGATGGTGAAATTTTGTTCCTTGATACGAAATTCTTCGTGACATATCTCATCGATCTGATGCTTAAAGATGATTCAAGAGATGTTTTGGATGTCATTAATGAGTTCCTAATGCAGGAGTCATTTTCTTCGTTGTGTCAACACCTCCTTATTACTCTTGAAGAGGCGGATTTCTGCTACTTTTTAAAAATTCTTTGTAAACTTCTCAGCCCCAGAATAGAAACCAAGGATTTTGACAATTCATCTTTACTGTTTGAGGTCATACTTTCTAAATATGGTGACTCTGAATCTATAGATGAGATTTTACTATTAAATGCTGTCATGAATCAAGGACGCCAACTTCTACGGTTTTTACAAGATGAAGACGCAGAGGAAGAATTGTATGAGATCAAGGCTATTGTCTATGAGATTTCAGCAATCTCAAGCGACACTCATAGCTTATCCATATCCCCATTATTGAAAGAGTGTGACAGGAGAAAAAAGACAATAGAGGTGATAAAATGGCTAGGGCTTCAGTCTTGGGTTCTTCACTATAGAATGTCAGTGGAATGTCTGACTCCTGAGTTGTGGGAATCCTTGTTTGTTGATAACGGCATAGGCTTCCGAAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTGTCAGAAGAGGATGGTTTCGAACTGTGTAATACTGCATCGGCTAAATCTAAGAAGCGAAAAAGAGGGGGCAAAGGTAGAAAAAGAAGAAAAAGGAACTTTGACGAAGAGGATAGCTGTGACGATGAGCTGTTGGACTTTGATATTAAAAATGATAGAACAGATTTGAAGTTAAAATCTGGGAGTTGGTTGCTTTCCATTGACAACTATACTGTACCATGGAATGCTATGGATCTACCAGAACACCTATCGAAGCATTGTATGGCTTCATGGATGAAATGGCTCTTTGCTAAGCGGGAATGA

Protein sequence

MLSLLDSAGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE
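
The protein sequence above should correspond to a translation of the listed CDS. The sketch below checks this for the first and last few codons, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` is illustrative, and only short excerpts of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_head = "ATGCTATCACTCCTTGATTCTGCAGGAAACCCTAGACGA"  # first 13 codons of the CDS
cds_tail = "GCTAAGCGGGAATGA"                          # last 5 codons, including TGA stop

print(translate(cds_head))
print(translate(cds_tail))
```

The two excerpts translate to `MLSLLDSAGNPRR` and `AKRE`, which match the start and end of the protein sequence listed above.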
Homology
BLAST of Tan0022110 vs. NCBI nr
Match: XP_023540456.1 (uncharacterized protein LOC111800821 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 916.4 bits (2367), Expect = 1.1e-262
Identity = 461/520 (88.65%), Positives = 481/520 (92.50%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EP+FNEEED GSAK RISLLSRLESVL KL+ASGGRSEVRLWL NTIASMTSI
Sbjct: 1   MIDLFLAEPVFNEEEDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLYNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQR+LFMTFLR KPL W FAS LLQMLFEKR REAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61  SPQHQRELFMTFLRSKPLNWDFASHLLQMLFEKRPREAGVLIAKRSYIMEKFFEGNPRRI 120

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFIKNVPEFW SNEFAESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFAESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFL QESFSSLCQHLLITLEEADFC FLK+LCKLL PR+ETKDF NSSLLFEVILSKYGD
Sbjct: 241 EFLTQESFSSLCQHLLITLEEADFCCFLKMLCKLLRPRMETKDFGNSSLLFEVILSKYGD 300

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           +ES+D+ILLLNAV+NQGRQLLRF+QDEDAEEEL EIK I+YEISAISSDTHSL  SPLLK
Sbjct: 301 AESLDQILLLNAVINQGRQLLRFVQDEDAEEELDEIKTIIYEISAISSDTHSL--SPLLK 360

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWESLFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWESLFVDNGICFRKSNEYALLDHSC 420

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
           LSE+DGFE CNTAS KSKKRKR  KGRKRRKRN D+EDSCDDELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRVKKGRKRRKRNSDDEDSCDDELLDFDIKRDKTDLKLNT 480

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
           GSWLLSIDNYTVPWNA+DLPE+LSK CMASWMKWLF+ RE
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKQCMASWMKWLFSNRE 518
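
The percentages on each HSP statistics line are the match count divided by the alignment length. The sketch below recomputes them from a statistics line; the function name `hsp_percentages` is hypothetical, and the regular expression simply picks up every `label = n/d` pair, so it ignores the `Query Frame` field.

```python
import re

def hsp_percentages(stat_line):
    """Recompute percentages for every 'label = n/d' pair on a BLAST HSP line."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", stat_line)
    return {label: round(100.0 * int(n) / int(d), 2) for label, n, d in pairs}

line = "Identity = 461/520 (88.65%), Positives = 481/520 (92.50%), Query Frame = 0"
for label, pct in hsp_percentages(line).items():
    print(f"{label}: {pct:.2f}%")
```

For the first hit this reproduces the reported 88.65% identity and 92.50% positives over a 520-column alignment.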

BLAST of Tan0022110 vs. NCBI nr
Match: XP_022948246.1 (uncharacterized protein LOC111451855 [Cucurbita moschata])

HSP 1 Score: 913.7 bits (2360), Expect = 7.3e-262
Identity = 457/520 (87.88%), Positives = 480/520 (92.31%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1   MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61  SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFIKNVPEFW SNEF+ESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFSESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFL QE FSSLCQHLLITLEEADFC FLK+LCKLL P  ETKDF NSS LFEV+LSKYGD
Sbjct: 241 EFLTQEPFSSLCQHLLITLEEADFCCFLKMLCKLLRPSRETKDFGNSSFLFEVVLSKYGD 300

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           +ES+D+ILLLNAV+NQGRQLLRF+QDEDAEEEL EIK I+YEISAISS+THSL  SPLLK
Sbjct: 301 AESLDQILLLNAVINQGRQLLRFVQDEDAEEELDEIKTIIYEISAISSNTHSL--SPLLK 360

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWESLFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWESLFVDNGICFRKSNEYALLDHSC 420

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
           LSE+DGFE CNTAS KSKKRKRG KGRKRRKR+FD+EDSCDDELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRDFDDEDSCDDELLDFDIKRDKTDLKLNT 480

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
           GSWLLSIDNYTVPWNA+DLPE+LSK CMASWMKWLF  RE
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKQCMASWMKWLFTNRE 518

BLAST of Tan0022110 vs. NCBI nr
Match: XP_023005966.1 (uncharacterized protein LOC111498825 [Cucurbita maxima])

HSP 1 Score: 910.6 bits (2352), Expect = 6.2e-261
Identity = 459/518 (88.61%), Positives = 478/518 (92.28%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EPIFNEEED GSAK RISLLSRLE+VL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1   MIDLFLAEPIFNEEEDVGSAKLRISLLSRLETVLWKLLASGGRSEVRLWLSNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61  SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFIKNVPEFW SNEFAESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFAESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFL QESFSSLCQHLLITLEEADFC FLK+LCKLL P +ETKDF NSS LFEVILSKYGD
Sbjct: 241 EFLTQESFSSLCQHLLITLEEADFCCFLKMLCKLLRPSLETKDFGNSSFLFEVILSKYGD 300

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           SES+D+ILLLNAV+N+GRQLLRF+QDEDAEEEL EIK I+YEISAISSDTHSL  SPLLK
Sbjct: 301 SESLDQILLLNAVINRGRQLLRFVQDEDAEEELDEIKNIIYEISAISSDTHSL--SPLLK 360

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWE LFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWEFLFVDNGICFRKSNEYALLDHSC 420

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
           LSE+DGFE CNTAS KSKKRKRG KGRKRRKRN D+EDSCD ELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRNSDDEDSCDYELLDFDIKRDKTDLKLNT 480

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAK 519
           GSWLLSIDNYTVPWNA+DLPE+LSKHCMASWMKWL  K
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKHCMASWMKWLRRK 516

BLAST of Tan0022110 vs. NCBI nr
Match: KAG6596711.1 (hypothetical protein SDJN03_09891, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 902.5 bits (2331), Expect = 1.7e-258
Identity = 460/556 (82.73%), Positives = 484/556 (87.05%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1   MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFE------ 120
           SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFE      
Sbjct: 61  SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEESVLVD 120

Query: 121 ------------------------------GNPRRISQWFSNFATNGASDHGKGAKALAQ 180
                                         GNPRRISQWFSNFATNGASDHGKGAKALAQ
Sbjct: 121 DALAFYDTMLYVISVVNEIFLIMLSLLDSAGNPRRISQWFSNFATNGASDHGKGAKALAQ 180

Query: 181 FAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAE 240
           F+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFW SNEFAE
Sbjct: 181 FSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWYSNEFAE 240

Query: 241 SLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADF 300
           SLKDGEILFLDTKFFV YL D MLKDDSRDV D INEFL QESFSSLCQHL+ITLEEADF
Sbjct: 241 SLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAINEFLTQESFSSLCQHLIITLEEADF 300

Query: 301 CYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFL 360
           C FLK+LCKLL PR+ETKDF NSS LFEVILSKYGD+ES+D+ILLLNAV+NQGRQLLRF+
Sbjct: 301 CCFLKMLCKLLRPRMETKDFGNSSFLFEVILSKYGDAESLDQILLLNAVINQGRQLLRFV 360

Query: 361 QDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHY 420
           QDEDAEEEL EIK I+YEISAISS+THSL  SPLLKEC RRKKTIEVIKWLGLQSWVLHY
Sbjct: 361 QDEDAEEELDEIKTIIYEISAISSNTHSL--SPLLKECYRRKKTIEVIKWLGLQSWVLHY 420

Query: 421 RMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGG 480
           RMS EC T ELWESLFVDNGI FRKSNEYALLDHSCLSE+DGFE CNTAS KSKKRKRG 
Sbjct: 421 RMSDECQTSELWESLFVDNGICFRKSNEYALLDHSCLSEDDGFEPCNTASVKSKKRKRGK 480

Query: 481 KGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLS 521
           KGRKRRKR+FD+EDSCDDELLDFDIK D+TDLKL +GSWLLSIDNYTVPWNA+DLPE+LS
Sbjct: 481 KGRKRRKRDFDDEDSCDDELLDFDIKRDKTDLKLNTGSWLLSIDNYTVPWNAIDLPEYLS 540

BLAST of Tan0022110 vs. NCBI nr
Match: KAG7028247.1 (hypothetical protein SDJN02_09428, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 901.0 bits (2327), Expect = 4.9e-258
Identity = 459/552 (83.15%), Positives = 483/552 (87.50%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1   MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFE------ 120
           SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFE      
Sbjct: 61  SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEVDDALA 120

Query: 121 --------------------------GNPRRISQWFSNFATNGASDHGKGAKALAQFAFV 180
                                     GNPRRISQWFSNFATNGASDHGKGAKALAQF+FV
Sbjct: 121 FYDTMLYVISVVNEIFLIMLSLLDSAGNPRRISQWFSNFATNGASDHGKGAKALAQFSFV 180

Query: 181 NRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAESLKD 240
           NRDICWEELEW GKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFW SNEFAESLKD
Sbjct: 181 NRDICWEELEWNGKHGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWYSNEFAESLKD 240

Query: 241 GEILFLDTKFFVTYLIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADFCYFL 300
           GEILFLDTKFFV YL D MLKDDSRDV D INEFL QESFSSLCQHL+ITLEEADFC FL
Sbjct: 241 GEILFLDTKFFVKYLFDQMLKDDSRDVWDAINEFLTQESFSSLCQHLIITLEEADFCCFL 300

Query: 301 KILCKLLSPRIETKDFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFLQDED 360
           K+LCKLL P +ETKDF NSS LFEVILSKYGD+ES+D+ILLLNAV+NQGRQLLRF+QDED
Sbjct: 301 KMLCKLLRPIMETKDFGNSSFLFEVILSKYGDAESLDQILLLNAVINQGRQLLRFVQDED 360

Query: 361 AEEELYEIKAIVYEISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHYRMSV 420
           AEEEL EIK I+YEISAISS+THSL  SPLLKEC RRKKTIEVIKWLGLQSWVLHYRMS 
Sbjct: 361 AEEELDEIKTIIYEISAISSNTHSL--SPLLKECYRRKKTIEVIKWLGLQSWVLHYRMSD 420

Query: 421 ECLTPELWESLFVDNGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGGKGRK 480
           EC T ELWESLFVDNGI FRKSNEYALLDHSCLSE+DGFE CNTAS KSKKRKRG KGRK
Sbjct: 421 ECQTSELWESLFVDNGICFRKSNEYALLDHSCLSEDDGFEPCNTASVKSKKRKRGKKGRK 480

Query: 481 RRKRNFDEEDSCDDELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLSKHCM 521
           RRKR+FD+EDSCDDELLDFDIK D+TDLKL +GSWLLSIDNYTVPWNA+DLPE+LSK CM
Sbjct: 481 RRKRDFDDEDSCDDELLDFDIKRDKTDLKLNTGSWLLSIDNYTVPWNAIDLPEYLSKQCM 540

BLAST of Tan0022110 vs. ExPASy TrEMBL
Match: A0A6J1G8R1 (uncharacterized protein LOC111451855 OS=Cucurbita moschata OX=3662 GN=LOC111451855 PE=4 SV=1)

HSP 1 Score: 913.7 bits (2360), Expect = 3.5e-262
Identity = 457/520 (87.88%), Positives = 480/520 (92.31%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EP+FNEE+D GSAK RISLLSRLESVL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1   MIDLFLAEPVFNEEDDVGSAKLRISLLSRLESVLWKLLASGGRSEVRLWLSNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61  SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFIKNVPEFW SNEF+ESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFSESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFL QE FSSLCQHLLITLEEADFC FLK+LCKLL P  ETKDF NSS LFEV+LSKYGD
Sbjct: 241 EFLTQEPFSSLCQHLLITLEEADFCCFLKMLCKLLRPSRETKDFGNSSFLFEVVLSKYGD 300

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           +ES+D+ILLLNAV+NQGRQLLRF+QDEDAEEEL EIK I+YEISAISS+THSL  SPLLK
Sbjct: 301 AESLDQILLLNAVINQGRQLLRFVQDEDAEEELDEIKTIIYEISAISSNTHSL--SPLLK 360

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWESLFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWESLFVDNGICFRKSNEYALLDHSC 420

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
           LSE+DGFE CNTAS KSKKRKRG KGRKRRKR+FD+EDSCDDELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRDFDDEDSCDDELLDFDIKRDKTDLKLNT 480

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
           GSWLLSIDNYTVPWNA+DLPE+LSK CMASWMKWLF  RE
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKQCMASWMKWLFTNRE 518

BLAST of Tan0022110 vs. ExPASy TrEMBL
Match: A0A6J1KWG9 (uncharacterized protein LOC111498825 OS=Cucurbita maxima OX=3661 GN=LOC111498825 PE=4 SV=1)

HSP 1 Score: 910.6 bits (2352), Expect = 3.0e-261
Identity = 459/518 (88.61%), Positives = 478/518 (92.28%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL EPIFNEEED GSAK RISLLSRLE+VL KL+ASGGRSEVRLWLSNTIASMTSI
Sbjct: 1   MIDLFLAEPIFNEEEDVGSAKLRISLLSRLETVLWKLLASGGRSEVRLWLSNTIASMTSI 60

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQR+LFMTFLR KPLKW FAS LLQM FEKRQREAG+LIAKRSYIMEKFFEGNPRRI
Sbjct: 61  SPQHQRELFMTFLRSKPLKWDFASHLLQMFFEKRQREAGVLIAKRSYIMEKFFEGNPRRI 120

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQF+FVNRDICWEELEW GKHGQSPAVVATKPHYFLDL
Sbjct: 121 SQWFSNFATNGASDHGKGAKALAQFSFVNRDICWEELEWNGKHGQSPAVVATKPHYFLDL 180

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFIKNVPEFW SNEFAESLKDGEILFLDTKFFV YL D MLKDDSRDV D IN
Sbjct: 181 DVHQTVKNFIKNVPEFWYSNEFAESLKDGEILFLDTKFFVKYLFDQMLKDDSRDVWDAIN 240

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFL QESFSSLCQHLLITLEEADFC FLK+LCKLL P +ETKDF NSS LFEVILSKYGD
Sbjct: 241 EFLTQESFSSLCQHLLITLEEADFCCFLKMLCKLLRPSLETKDFGNSSFLFEVILSKYGD 300

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           SES+D+ILLLNAV+N+GRQLLRF+QDEDAEEEL EIK I+YEISAISSDTHSL  SPLLK
Sbjct: 301 SESLDQILLLNAVINRGRQLLRFVQDEDAEEELDEIKNIIYEISAISSDTHSL--SPLLK 360

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           EC RRKKTIEVIKWLGLQSWVLHYRMS EC T ELWE LFVDNGI FRKSNEYALLDHSC
Sbjct: 361 ECYRRKKTIEVIKWLGLQSWVLHYRMSDECQTSELWEFLFVDNGICFRKSNEYALLDHSC 420

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
           LSE+DGFE CNTAS KSKKRKRG KGRKRRKRN D+EDSCD ELLDFDIK D+TDLKL +
Sbjct: 421 LSEDDGFEPCNTASVKSKKRKRGKKGRKRRKRNSDDEDSCDYELLDFDIKRDKTDLKLNT 480

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAK 519
           GSWLLSIDNYTVPWNA+DLPE+LSKHCMASWMKWL  K
Sbjct: 481 GSWLLSIDNYTVPWNAIDLPEYLSKHCMASWMKWLRRK 516

BLAST of Tan0022110 vs. ExPASy TrEMBL
Match: A0A1S3BKS3 (uncharacterized protein LOC103490747 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490747 PE=4 SV=1)

HSP 1 Score: 892.9 bits (2306), Expect = 6.4e-256
Identity = 443/520 (85.19%), Positives = 476/520 (91.54%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLF++E  FN+E+D  SAK RISLLS LESVL KL+  GGRSEVRLWLSNTIAS+TSI
Sbjct: 21  MIDLFILESTFNDEQDVSSAKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSI 80

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQRDLFMT LR KPLKWAFASQLLQMLFEKR REAGILIAKRSYIMEKFFEGNPRRI
Sbjct: 81  SPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRI 140

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Sbjct: 141 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 200

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFI+NVPEFWSSNEFAESLKDGEILFLDTKFFV + IDLMLKDDS+DV +VIN
Sbjct: 201 DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKFFIDLMLKDDSKDVWEVIN 260

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFLM ESFSSLCQHLL+TLE+ADFC FLK+LCKLL PRIETKDF NSS +FEVIL+KYGD
Sbjct: 261 EFLMHESFSSLCQHLLVTLEDADFCNFLKVLCKLLRPRIETKDFGNSSFMFEVILAKYGD 320

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           SESID+ILLLNAV+NQGRQLLR L+DED EE+L EIKAI+++ISAISS++H L   PLLK
Sbjct: 321 SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIIHKISAISSNSHCL--FPLLK 380

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           ECD RKKTIE+IKWLGLQSWVLHYR S EC TPELWESLFVDNGIGFRKSNEY LLDHSC
Sbjct: 381 ECDGRKKTIEMIKWLGLQSWVLHYRTSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSC 440

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
            SE+DGFE CN A AKSKKRK+G KGRKRRKRNFD ++SCDDELLD DI+NDR DLKL +
Sbjct: 441 SSEDDGFEPCNRARAKSKKRKKGEKGRKRRKRNFDSQESCDDELLDLDIRNDRMDLKLNT 500

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
           GSW LS D+YTVPWNA DLPEHLSK+CMASWMKWLFAKRE
Sbjct: 501 GSWFLSTDDYTVPWNAKDLPEHLSKYCMASWMKWLFAKRE 538

BLAST of Tan0022110 vs. ExPASy TrEMBL
Match: A0A0A0L6D1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006760 PE=4 SV=1)

HSP 1 Score: 892.5 bits (2305), Expect = 8.4e-256
Identity = 446/520 (85.77%), Positives = 474/520 (91.15%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLFL E  FN+E+D  S K RISLLS LESVL KL+  GGRSEVRLWLSNTIAS+TSI
Sbjct: 21  MIDLFLQESTFNDEQDVSSEKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSI 80

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRRI 120
           SPQHQRDLFMT LR KPLKWAFASQLLQMLFEKR REAGILIAKRSYIMEKFFEGNPRRI
Sbjct: 81  SPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGNPRRI 140

Query: 121 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 180
           SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL
Sbjct: 141 SQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDL 200

Query: 181 DVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVIN 240
           DVHQTVKNFI+NVPEFWSSNEFAESLKDGEILFLDTKFFV Y +DLMLKDD +DV +VIN
Sbjct: 201 DVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKYFVDLMLKDDPKDVWEVIN 260

Query: 241 EFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYGD 300
           EFL  ESFSSLCQHLL+TLEEADFC FLK+LCKLL PRIETKDF NSS +FEVIL+KYGD
Sbjct: 261 EFLTHESFSSLCQHLLVTLEEADFCNFLKMLCKLLRPRIETKDFGNSSFMFEVILTKYGD 320

Query: 301 SESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLLK 360
           SESID+ILLLNAV+NQGRQLLR L+DED EE+L EIKAIV++IS+ISS+ H L   PLLK
Sbjct: 321 SESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIVHKISSISSNCHCL--FPLLK 380

Query: 361 ECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHSC 420
           ECD RKKTIE+IKWLGLQSWVLHYRMS EC TPELWESLFVDNGIGFRKSNEY LLDHSC
Sbjct: 381 ECDGRKKTIEMIKWLGLQSWVLHYRMSEECQTPELWESLFVDNGIGFRKSNEYLLLDHSC 440

Query: 421 LSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDDELLDFDIKNDRTDLKLKS 480
            SE+DGFEL N A A+SKKRK+GGKGRKRRK NFD +DSCDDELLDFDIKNDR DLKL +
Sbjct: 441 SSEDDGFELYNRARAQSKKRKKGGKGRKRRKGNFDSQDSCDDELLDFDIKNDRMDLKLNT 500

Query: 481 GSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
           GSWLLS D+YTVPWNA DLPEHLSK+CMASWMKWLFAKRE
Sbjct: 501 GSWLLSTDDYTVPWNAKDLPEHLSKYCMASWMKWLFAKRE 538

BLAST of Tan0022110 vs. ExPASy TrEMBL
Match: A0A1S3BK58 (uncharacterized protein LOC103490747 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490747 PE=4 SV=1)

HSP 1 Score: 881.7 bits (2277), Expect = 1.5e-252
Identity = 443/538 (82.34%), Positives = 476/538 (88.48%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEEEDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTSI 60
           MIDLF++E  FN+E+D  SAK RISLLS LESVL KL+  GGRSEVRLWLSNTIAS+TSI
Sbjct: 21  MIDLFILESTFNDEQDVSSAKLRISLLSELESVLWKLLTCGGRSEVRLWLSNTIASVTSI 80

Query: 61  SPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFE------ 120
           SPQHQRDLFMT LR KPLKWAFASQLLQMLFEKR REAGILIAKRSYIMEKFFE      
Sbjct: 81  SPQHQRDLFMTLLRRKPLKWAFASQLLQMLFEKRSREAGILIAKRSYIMEKFFEGDCSVC 140

Query: 121 ------------GNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK 180
                       GNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK
Sbjct: 141 LLMSNLAIEILAGNPRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGK 200

Query: 181 HGQSPAVVATKPHYFLDLDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTY 240
           HGQSPAVVATKPHYFLDLDVHQTVKNFI+NVPEFWSSNEFAESLKDGEILFLDTKFFV +
Sbjct: 201 HGQSPAVVATKPHYFLDLDVHQTVKNFIQNVPEFWSSNEFAESLKDGEILFLDTKFFVKF 260

Query: 241 LIDLMLKDDSRDVLDVINEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETK 300
            IDLMLKDDS+DV +VINEFLM ESFSSLCQHLL+TLE+ADFC FLK+LCKLL PRIETK
Sbjct: 261 FIDLMLKDDSKDVWEVINEFLMHESFSSLCQHLLVTLEDADFCNFLKVLCKLLRPRIETK 320

Query: 301 DFDNSSLLFEVILSKYGDSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYE 360
           DF NSS +FEVIL+KYGDSESID+ILLLNAV+NQGRQLLR L+DED EE+L EIKAI+++
Sbjct: 321 DFGNSSFMFEVILAKYGDSESIDQILLLNAVINQGRQLLRLLRDEDGEEQLDEIKAIIHK 380

Query: 361 ISAISSDTHSLSISPLLKECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVD 420
           ISAISS++H L   PLLKECD RKKTIE+IKWLGLQSWVLHYR S EC TPELWESLFVD
Sbjct: 381 ISAISSNSHCL--FPLLKECDGRKKTIEMIKWLGLQSWVLHYRTSEECQTPELWESLFVD 440

Query: 421 NGIGFRKSNEYALLDHSCLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNFDEEDSCDD 480
           NGIGFRKSNEY LLDHSC SE+DGFE CN A AKSKKRK+G KGRKRRKRNFD ++SCDD
Sbjct: 441 NGIGFRKSNEYLLLDHSCSSEDDGFEPCNRARAKSKKRKKGEKGRKRRKRNFDSQESCDD 500

Query: 481 ELLDFDIKNDRTDLKLKSGSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
           ELLD DI+NDR DLKL +GSW LS D+YTVPWNA DLPEHLSK+CMASWMKWLFAKRE
Sbjct: 501 ELLDLDIRNDRMDLKLNTGSWFLSTDDYTVPWNAKDLPEHLSKYCMASWMKWLFAKRE 556

BLAST of Tan0022110 vs. TAIR 10
Match: AT5G48340.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 520.0 bits (1338), Expect = 2.2e-147
Identity = 273/523 (52.20%), Positives = 373/523 (71.32%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEE-EDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTS 60
           M++LFL EP +N++ + + +    + LL++L S ++ L+  G RSE RLWL + ++++ S
Sbjct: 1   MVNLFLSEPKWNDDAQKSSNINVILPLLNKLGSQIQSLVTHGARSEARLWLCSALSTI-S 60

Query: 61  ISPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRR 120
           ISP  Q ++FM  LR KP K  F SQ+L M+FEKR R+ G L+AKRSYI+EKFFEGN +R
Sbjct: 61  ISPSKQLNIFMKLLRSKPRKMQFLSQVLTMMFEKRPRKLGFLLAKRSYILEKFFEGNQKR 120

Query: 121 ISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLD 180
           I +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LD
Sbjct: 121 ILEWFSEFAYDGGSDHKRGAKALAQFAFANRDICWEELEWRGKHGQSPAVVATKPHYLLD 180

Query: 181 LDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVI 240
           LDV +T++NF+ NVPEFWSSNEFAESLKDG+ILFLDTKFF+   I  M ++D  DV D +
Sbjct: 181 LDVERTIQNFLDNVPEFWSSNEFAESLKDGQILFLDTKFFIDLFIRFMYEEDMYDVWDAV 240

Query: 241 NEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYG 300
            EFL +ESFSSL QHLLITLEE D C FL++L     P IE+ D  +SS    V+LS+Y 
Sbjct: 241 EEFLREESFSSLTQHLLITLEERDLCRFLELLGNYFEPGIESWDSGDSSRWLGVLLSRYV 300

Query: 301 DSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLL 360
           D+ESIDE+LLL++++NQGRQLLR ++DE+  +E   +K  + EI     +  S S+  +L
Sbjct: 301 DTESIDELLLLSSIINQGRQLLRLVRDENGNDEGELLKETMAEICRGLENESSFSV--IL 360

Query: 361 KECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHS 420
           +E  + K  I+VIK LGL SW +H+R+S EC TP+ WE LF +NGI FR+S++++LL ++
Sbjct: 361 RELSKMKH-IQVIKLLGLLSWTIHFRLSEECQTPDSWELLFRENGIEFRRSSDHSLLSYN 420

Query: 421 CLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNF--DEEDSCDDELLDFDIKNDRTDLK 480
             SEE   +  + +    K+ KR  K RK++K+    D++D  DDELL         DL 
Sbjct: 421 GFSEESESDSDSRSRVSKKRHKREKKKRKKKKKRAFDDDDDRGDDELL---------DLH 480

Query: 481 LKSGSWLLSIDNYTVPWNAMDLPEHLSKHCMASWMKWLFAKRE 521
             S SWLLS D ++  W ++DLPE+++K+C+++WMK L A+++
Sbjct: 481 SISRSWLLSTDGFSATWTSVDLPEYIAKYCLSTWMKGLLARQK 510

BLAST of Tan0022110 vs. TAIR 10
Match: AT5G48340.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages.)

HSP 1 Score: 489.2 bits (1258), Expect = 4.1e-138
Identity = 262/499 (52.51%), Positives = 351/499 (70.34%), Query Frame = 0

Query: 1   MIDLFLVEPIFNEE-EDAGSAKSRISLLSRLESVLRKLMASGGRSEVRLWLSNTIASMTS 60
           M++LFL EP +N++ + + +    + LL++L S ++ L+  G RSE RLWL + ++++ S
Sbjct: 1   MVNLFLSEPKWNDDAQKSSNINVILPLLNKLGSQIQSLVTHGARSEARLWLCSALSTI-S 60

Query: 61  ISPQHQRDLFMTFLRLKPLKWAFASQLLQMLFEKRQREAGILIAKRSYIMEKFFEGNPRR 120
           ISP  Q ++FM  LR KP K  F SQ+L M+FEKR R+ G L+AKRSYI+EKFFEGN +R
Sbjct: 61  ISPSKQLNIFMKLLRSKPRKMQFLSQVLTMMFEKRPRKLGFLLAKRSYILEKFFEGNQKR 120

Query: 121 ISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLD 180
           I +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LD
Sbjct: 121 ILEWFSEFAYDGGSDHKRGAKALAQFAFANRDICWEELEWRGKHGQSPAVVATKPHYLLD 180

Query: 181 LDVHQTVKNFIKNVPEFWSSNEFAESLKDGEILFLDTKFFVTYLIDLMLKDDSRDVLDVI 240
           LDV +T++NF+ NVPEFWSSNEFAESLKDG+ILFLDTKFF+   I  M ++D  DV D +
Sbjct: 181 LDVERTIQNFLDNVPEFWSSNEFAESLKDGQILFLDTKFFIDLFIRFMYEEDMYDVWDAV 240

Query: 241 NEFLMQESFSSLCQHLLITLEEADFCYFLKILCKLLSPRIETKDFDNSSLLFEVILSKYG 300
            EFL +ESFSSL QHLLITLEE D C FL++L     P IE+ D  +SS    V+LS+Y 
Sbjct: 241 EEFLREESFSSLTQHLLITLEERDLCRFLELLGNYFEPGIESWDSGDSSRWLGVLLSRYV 300

Query: 301 DSESIDEILLLNAVMNQGRQLLRFLQDEDAEEELYEIKAIVYEISAISSDTHSLSISPLL 360
           D+ESIDE+LLL++++NQGRQLLR ++DE+  +E   +K  + EI     +  S S+  +L
Sbjct: 301 DTESIDELLLLSSIINQGRQLLRLVRDENGNDEGELLKETMAEICRGLENESSFSV--IL 360

Query: 361 KECDRRKKTIEVIKWLGLQSWVLHYRMSVECLTPELWESLFVDNGIGFRKSNEYALLDHS 420
           +E  + K  I+VIK LGL SW +H+R+S EC TP+ WE LF +NGI FR+S++++LL ++
Sbjct: 361 RELSKMKH-IQVIKLLGLLSWTIHFRLSEECQTPDSWELLFRENGIEFRRSSDHSLLSYN 420

Query: 421 CLSEEDGFELCNTASAKSKKRKRGGKGRKRRKRNF--DEEDSCDDELLDFDIKNDRTDLK 480
             SEE   +  + +    K+ KR  K RK++K+    D++D  DDELL         DL 
Sbjct: 421 GFSEESESDSDSRSRVSKKRHKREKKKRKKKKKRAFDDDDDRGDDELL---------DLH 480

Query: 481 LKSGSWLLSIDNYTVPWNA 497
             S SWLLS D ++  W +
Sbjct: 481 SISRSWLLSTDGFSATWTS 486
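
The percentages in each HSP summary follow directly from the residue counts (matches divided by alignment length). The sketch below is not part of the database output; it is a minimal illustration, assuming the two-line "Score / Identity" summary layout shown above. The function name `parse_hsp` and the returned keys are hypothetical.

```python
import re

# Pattern for the two-line HSP summary printed above each alignment.
# "Posi?tives" also accepts the "Postives" spelling seen in some reports.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+\s*\([\d.]+%\)"
)


def parse_hsp(text):
    """Extract the score, E-value and recomputed percentages from one HSP summary."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])  # alignment length, gaps included
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        # Recompute the percentages from the counts instead of trusting the text.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# The two TAIR 10 summaries, copied from the report above.
hsp_1 = parse_hsp(
    "HSP 1 Score: 520.0 bits (1338), Expect = 2.2e-147\n"
    "Identity = 273/523 (52.20%), Postives = 373/523 (71.32%), Query Frame = 0"
)
hsp_2 = parse_hsp(
    "HSP 1 Score: 489.2 bits (1258), Expect = 4.1e-138\n"
    "Identity = 262/499 (52.51%), Positives = 351/499 (70.34%), Query Frame = 0"
)
print(hsp_1["identity_pct"], hsp_1["positives_pct"])
print(hsp_2["identity_pct"], hsp_2["positives_pct"])
```

Recomputing from the counts reproduces the reported values for both TAIR 10 hits (273/523 and 373/523 for AT5G48340.1; 262/499 and 351/499 for AT5G48340.2), which is a quick consistency check when these summaries are copied between tools.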

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_023540456.1  1.1e-262  88.65         uncharacterized protein LOC111800821 [Cucurbita pepo subsp. pepo]
XP_022948246.1  7.3e-262  87.88         uncharacterized protein LOC111451855 [Cucurbita moschata]
XP_023005966.1  6.2e-261  88.61         uncharacterized protein LOC111498825 [Cucurbita maxima]
KAG6596711.1    1.7e-258  82.73         hypothetical protein SDJN03_09891, partial [Cucurbita argyrosperma subsp. sorori...
KAG7028247.1    4.9e-258  83.15         hypothetical protein SDJN02_09428, partial [Cucurbita argyrosperma subsp. argyro...

Match Name  E-value   Identity (%)  Description
A0A6J1G8R1  3.5e-262  87.88         uncharacterized protein LOC111451855 OS=Cucurbita moschata OX=3662 GN=LOC1114518...
A0A6J1KWG9  3.0e-261  88.61         uncharacterized protein LOC111498825 OS=Cucurbita maxima OX=3661 GN=LOC111498825...
A0A1S3BKS3  6.4e-256  85.19         uncharacterized protein LOC103490747 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0L6D1  8.4e-256  85.77         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006760 PE=4 SV=1
A0A1S3BK58  1.5e-252  82.34         uncharacterized protein LOC103490747 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name   E-value   Identity (%)  Description
AT5G48340.1  2.2e-147  52.20         unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0...
AT5G48340.2  4.1e-138  52.51         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term  Source Description    Alignment
None      No IPR available  PANTHER  PTHR37766    OS01G0897100 PROTEIN  coord: 9..411

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0022110.2  Tan0022110.2  mRNA
Tan0022110.3  Tan0022110.3  mRNA
Tan0022110.5  Tan0022110.5  mRNA
Tan0022110.6  Tan0022110.6  mRNA
Tan0022110.1  Tan0022110.1  mRNA
Tan0022110.4  Tan0022110.4  mRNA