Tan0004718 (gene) Snake gourd v1

Overview
NameTan0004718
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG05: 74094880 .. 74096345 (+)
RNA-Seq ExpressionTan0004718
SyntenyTan0004718
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCACTTCTGCAACAATTCCTTTAATCTTTCTCTCTTTAATGGCGGTTTAAACCCCTATTTCTTCTTCTTCTTCTTCTTCTTCTCTCTATTTCTATTCTTTCTTCTTCCATGGAAATTTACACCGACAACAGAAGGCGAGTTCGCGACGAGTTCGACGACTCGCTTGTCGATTCGGCCGAGTCGAAGCTCAGACGACTCAACTCAGCGGAATCGAGATTCGTGAAGCCATGCACCAAGGGGAACTGGAATGTTGTTAACTCGTCGAAGTCGGAGCAGGTAGGATCCGACGGAGATGATTTGAGAATCGATTTGGCGGAGTCGGAGGAGATTCAAGATGAGTTGCTTAACATCCTCGAAGATGGCGACGCCGTAACGGAGCGAGATGAGAGTATTCAAGGTCTCGAACTCGATTCGTTCATCCGAAGCTTCGAGGAGGAGATTCAAGCTCTACCGCCGGCGAAAACGACGTCGGATCGGAATGAGACTCCTCAGGCGGAACTTGGATATCTTTTCGAGGCGTCGGATGATGAACTCGGGCTGCCGCCGACGGGAGGTTCGAGTACTGAAGGGAAGATGGAGGCGATTGATTTTATGCCGGCTTGTTCTTCGGCTGGTGTGTTCGAGTTGGATGGGAACTTAGGGTTTGAGGATGAGATACCGTGTTATGACTCGTTCGAAATCGGAATCGGCATTGGTTCCGGCGCGGCGGAGGAGAATTGTTTGGGCGGAGAGTTTGTTGCATTGGGCGGTTTGTTTGATTATTCAGACGTGCCGTTTCGGCCGGAGTCGTTATCGGCTCTGTAGAAACGGGTTTCCAATGGGGGAAGGTTGGTTTTTGGTTTTTAGGGTTGGTGTGGAACCGACGTCGTTTTAAGTCTGTAGGTTGTAAATCTGAAAGGACAAAAACACAACTCTTTGAATAGCAATGCAGAATTTCCAAAAAAAAAAAAGGAAATGTAGAGGAGTTTTTTTTTAATTTTGTTTTTTTTAATAGAACGATTCTTAATTTTTACCGTTGTAAAAACCCTGTGTATCCGTCGGTTTCTCTCTTTTGTCGTTTCTCTGATTTTGTTATTTTATTATTGACTGGAGAGGGGTAGTTTTGGAAAATTAGGAGATAAAAAAAAGGTTGAATCAGTTTCGAGTTGAAAATAGGTGGGGATTGATGGAGTGTCGCTGTGTGTGGGCACCTTTGAGTAAGAGATGATATTTTTTCTTTTCTTTTTCTTTTTTGAACCACGCAGGTTTTAATGACTACTTTTTAGTTTTTAAGAGACGTTTAATTCATGAATTTGAAAAGTTTTAGGTTTGTCGTAATTAATTTTTTTAAAAGAAAATATTGGGTAGAAAATTTATTTTAAATTTAGATTTACGTGACTCGTGGAACTTTTTAAATGGTGCGTCAAGTTTTTAATTCATATAAAATTCCAACTTACAATTAAAAAAAAATGTAAAATTGTTGAGC

mRNA sequence

CTCACTTCTGCAACAATTCCTTTAATCTTTCTCTCTTTAATGGCGGTTTAAACCCCTATTTCTTCTTCTTCTTCTTCTTCTTCTCTCTATTTCTATTCTTTCTTCTTCCATGGAAATTTACACCGACAACAGAAGGCGAGTTCGCGACGAGTTCGACGACTCGCTTGTCGATTCGGCCGAGTCGAAGCTCAGACGACTCAACTCAGCGGAATCGAGATTCGTGAAGCCATGCACCAAGGGGAACTGGAATGTTGTTAACTCGTCGAAGTCGGAGCAGGTAGGATCCGACGGAGATGATTTGAGAATCGATTTGGCGGAGTCGGAGGAGATTCAAGATGAGTTGCTTAACATCCTCGAAGATGGCGACGCCGTAACGGAGCGAGATGAGAGTATTCAAGGTCTCGAACTCGATTCGTTCATCCGAAGCTTCGAGGAGGAGATTCAAGCTCTACCGCCGGCGAAAACGACGTCGGATCGGAATGAGACTCCTCAGGCGGAACTTGGATATCTTTTCGAGGCGTCGGATGATGAACTCGGGCTGCCGCCGACGGGAGGTTCGAGTACTGAAGGGAAGATGGAGGCGATTGATTTTATGCCGGCTTGTTCTTCGGCTGGTGTGTTCGAGTTGGATGGGAACTTAGGGTTTGAGGATGAGATACCGTGTTATGACTCGTTCGAAATCGGAATCGGCATTGGTTCCGGCGCGGCGGAGGAGAATTGTTTGGGCGGAGAGTTTGTTGCATTGGGCGGTTTGTTTGATTATTCAGACGTGCCGTTTCGGCCGGAGTCGTTATCGGCTCTGTAGAAACGGGTTTCCAATGGGGGAAGGTTGGTTTTTGGTTTTTAGGGTTGGTGTGGAACCGACGTCGTTTTAAGTCTGTAGGTTGTAAATCTGAAAGGACAAAAACACAACTCTTTGAATAGCAATGCAGAATTTCCAAAAAAAAAAAAGGAAATGTAGAGGAGTTTTTTTTTAATTTTGTTTTTTTTAATAGAACGATTCTTAATTTTTACCGTTGTAAAAACCCTGTGTATCCGTCGGTTTCTCTCTTTTGTCGTTTCTCTGATTTTGTTATTTTATTATTGACTGGAGAGGGGTAGTTTTGGAAAATTAGGAGATAAAAAAAAGGTTGAATCAGTTTCGAGTTGAAAATAGGTGGGGATTGATGGAGTGTCGCTGTGTGTGGGCACCTTTGAGTAAGAGATGATATTTTTTCTTTTCTTTTTCTTTTTTGAACCACGCAGGTTTTAATGACTACTTTTTAGTTTTTAAGAGACGTTTAATTCATGAATTTGAAAAGTTTTAGGTTTGTCGTAATTAATTTTTTTAAAAGAAAATATTGGGTAGAAAATTTATTTTAAATTTAGATTTACGTGACTCGTGGAACTTTTTAAATGGTGCGTCAAGTTTTTAATTCATATAAAATTCCAACTTACAATTAAAAAAAAATGTAAAATTGTTGAGC

Coding sequence (CDS)

ATGGAAATTTACACCGACAACAGAAGGCGAGTTCGCGACGAGTTCGACGACTCGCTTGTCGATTCGGCCGAGTCGAAGCTCAGACGACTCAACTCAGCGGAATCGAGATTCGTGAAGCCATGCACCAAGGGGAACTGGAATGTTGTTAACTCGTCGAAGTCGGAGCAGGTAGGATCCGACGGAGATGATTTGAGAATCGATTTGGCGGAGTCGGAGGAGATTCAAGATGAGTTGCTTAACATCCTCGAAGATGGCGACGCCGTAACGGAGCGAGATGAGAGTATTCAAGGTCTCGAACTCGATTCGTTCATCCGAAGCTTCGAGGAGGAGATTCAAGCTCTACCGCCGGCGAAAACGACGTCGGATCGGAATGAGACTCCTCAGGCGGAACTTGGATATCTTTTCGAGGCGTCGGATGATGAACTCGGGCTGCCGCCGACGGGAGGTTCGAGTACTGAAGGGAAGATGGAGGCGATTGATTTTATGCCGGCTTGTTCTTCGGCTGGTGTGTTCGAGTTGGATGGGAACTTAGGGTTTGAGGATGAGATACCGTGTTATGACTCGTTCGAAATCGGAATCGGCATTGGTTCCGGCGCGGCGGAGGAGAATTGTTTGGGCGGAGAGTTTGTTGCATTGGGCGGTTTGTTTGATTATTCAGACGTGCCGTTTCGGCCGGAGTCGTTATCGGCTCTGTAG

Protein sequence

MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL
Homology
BLAST of Tan0004718 vs. NCBI nr
Match: XP_022954576.1 (uncharacterized protein LOC111456805 [Cucurbita moschata])

HSP 1 Score: 363.2 bits (931), Expect = 1.6e-96
Identity = 191/231 (82.68%), Postives = 202/231 (87.45%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDNR+R RDE D SL DSAESKLRRLNS ESRFVKPC          SKSEQVGSD
Sbjct: 1   MENCTDNRKRDRDELDVSLADSAESKLRRLNSMESRFVKPC----------SKSEQVGSD 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
           G DLRIDLA+S+EIQD+LLNILED D V ERDESIQG ELDSFIRSFEEEI ALPPA+T+
Sbjct: 61  GGDLRIDLADSDEIQDKLLNILEDSDVVAERDESIQGFELDSFIRSFEEEIHALPPAETS 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE 180
           S++NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFE
Sbjct: 121 SNQNETPQAELGYLFEASDDELGLPPTVGSSNEGKIEAIDFTPACSWPGVFEMDGNVGFE 180

Query: 181 DEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           DEIPCYDSFEIGIGIGSGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 DEIPCYDSFEIGIGIGSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 221

BLAST of Tan0004718 vs. NCBI nr
Match: XP_023541956.1 (uncharacterized protein LOC111801944 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 361.3 bits (926), Expect = 6.2e-96
Identity = 190/231 (82.25%), Postives = 200/231 (86.58%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDNR+R RDE D SL DSAESKLRRL+S ESRFVKPC          SKSE VGSD
Sbjct: 1   MENCTDNRKRDRDELDVSLADSAESKLRRLDSTESRFVKPC----------SKSEPVGSD 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
           GDDLRIDLA+S+EIQD+LLNILED D V ERDESIQG ELDSFIRSFEEEI ALPPA+T 
Sbjct: 61  GDDLRIDLADSDEIQDKLLNILEDSDVVAERDESIQGFELDSFIRSFEEEIHALPPAETL 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE 180
           SD+NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFE
Sbjct: 121 SDQNETPQAELGYLFEASDDELGLPPTVGSSNEGKIEAIDFTPACSWPGVFEMDGNVGFE 180

Query: 181 DEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           DEIPCYDSFEIGIGIGSG AEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 DEIPCYDSFEIGIGIGSGLAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 221

BLAST of Tan0004718 vs. NCBI nr
Match: KAG6572935.1 (hypothetical protein SDJN03_26822, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 359.4 bits (921), Expect = 2.4e-95
Identity = 190/231 (82.25%), Postives = 201/231 (87.01%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDNR+R RDE D SL DSA SKLRRLNS ESRFVKPC          SKSEQVGSD
Sbjct: 1   MENCTDNRKRDRDELDVSLADSAGSKLRRLNSMESRFVKPC----------SKSEQVGSD 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
           G DLRIDLA+S+EIQD+LLNILED D V ERDESIQG ELDSFIRSFEEEI ALPPA+T+
Sbjct: 61  GGDLRIDLADSDEIQDKLLNILEDCDVVAERDESIQGFELDSFIRSFEEEIHALPPAETS 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE 180
           S++NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFE
Sbjct: 121 SNQNETPQAELGYLFEASDDELGLPPTVGSSNEGKIEAIDFTPACSWPGVFEMDGNVGFE 180

Query: 181 DEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           DEIPCYDSFEIGIGIGSGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 DEIPCYDSFEIGIGIGSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 221

BLAST of Tan0004718 vs. NCBI nr
Match: XP_022994085.1 (uncharacterized protein LOC111489916 [Cucurbita maxima])

HSP 1 Score: 356.7 bits (914), Expect = 1.5e-94
Identity = 187/231 (80.95%), Postives = 199/231 (86.15%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDNR+R RD+ D SL DSAESKLRRLN  ESRFVKPC          SKSEQVGSD
Sbjct: 1   MENCTDNRKRDRDKLDVSLADSAESKLRRLNPTESRFVKPC----------SKSEQVGSD 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
           GDDLRIDLA+S+EI D+LLNILED D V ERDE IQG ELDSFIRSFEEEI ALPPA+T+
Sbjct: 61  GDDLRIDLADSDEIHDKLLNILEDSDVVAERDECIQGFELDSFIRSFEEEIHALPPAETS 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE 180
           S++NETPQAELGYLFEASDDELGLPPT GSS EG +EAIDF PACS  GVFE+DGN+GFE
Sbjct: 121 SNQNETPQAELGYLFEASDDELGLPPTVGSSNEGTIEAIDFTPACSWPGVFEMDGNVGFE 180

Query: 181 DEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           DEIPCYDSFEIGIGIGSGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 DEIPCYDSFEIGIGIGSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 221

BLAST of Tan0004718 vs. NCBI nr
Match: KAG6584364.1 (hypothetical protein SDJN03_20296, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 324.7 bits (831), Expect = 6.4e-85
Identity = 174/233 (74.68%), Postives = 191/233 (81.97%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDN++R+  EFDDSL DSAESKLRRL+S++   + PCTKGNWNVV   +S      
Sbjct: 1   MENCTDNKKRLLSEFDDSLADSAESKLRRLDSSDFSSLNPCTKGNWNVVQDLESVA---- 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
             D  I+L+ES +IQD+LLNILED D V ERDESI+GLELDSFIRSFEEEIQALP  KT 
Sbjct: 61  --DFTINLSESHDIQDDLLNILEDSDVVLERDESIEGLELDSFIRSFEEEIQALPSVKTP 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGK-MEAIDFMPACSS-AGVFELDGNLG 180
           S +NETPQ ELGYL+ ASDDELGLPPTGG ST+GK MEAIDFMPA SS  GVFELDGN G
Sbjct: 121 SHQNETPQVELGYLYGASDDELGLPPTGGLSTDGKRMEAIDFMPASSSPPGVFELDGNAG 180

Query: 181 FEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           FEDEIPCYD FEIG+G  SGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 FEDEIPCYDLFEIGMGYSSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 227

BLAST of Tan0004718 vs. ExPASy TrEMBL
Match: A0A6J1GR95 (uncharacterized protein LOC111456805 OS=Cucurbita moschata OX=3662 GN=LOC111456805 PE=4 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 7.9e-97
Identity = 191/231 (82.68%), Postives = 202/231 (87.45%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDNR+R RDE D SL DSAESKLRRLNS ESRFVKPC          SKSEQVGSD
Sbjct: 1   MENCTDNRKRDRDELDVSLADSAESKLRRLNSMESRFVKPC----------SKSEQVGSD 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
           G DLRIDLA+S+EIQD+LLNILED D V ERDESIQG ELDSFIRSFEEEI ALPPA+T+
Sbjct: 61  GGDLRIDLADSDEIQDKLLNILEDSDVVAERDESIQGFELDSFIRSFEEEIHALPPAETS 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE 180
           S++NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFE
Sbjct: 121 SNQNETPQAELGYLFEASDDELGLPPTVGSSNEGKIEAIDFTPACSWPGVFEMDGNVGFE 180

Query: 181 DEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           DEIPCYDSFEIGIGIGSGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 DEIPCYDSFEIGIGIGSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 221

BLAST of Tan0004718 vs. ExPASy TrEMBL
Match: A0A6J1K097 (uncharacterized protein LOC111489916 OS=Cucurbita maxima OX=3661 GN=LOC111489916 PE=4 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 7.4e-95
Identity = 187/231 (80.95%), Postives = 199/231 (86.15%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDNR+R RD+ D SL DSAESKLRRLN  ESRFVKPC          SKSEQVGSD
Sbjct: 1   MENCTDNRKRDRDKLDVSLADSAESKLRRLNPTESRFVKPC----------SKSEQVGSD 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
           GDDLRIDLA+S+EI D+LLNILED D V ERDE IQG ELDSFIRSFEEEI ALPPA+T+
Sbjct: 61  GDDLRIDLADSDEIHDKLLNILEDSDVVAERDECIQGFELDSFIRSFEEEIHALPPAETS 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE 180
           S++NETPQAELGYLFEASDDELGLPPT GSS EG +EAIDF PACS  GVFE+DGN+GFE
Sbjct: 121 SNQNETPQAELGYLFEASDDELGLPPTVGSSNEGTIEAIDFTPACSWPGVFEMDGNVGFE 180

Query: 181 DEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           DEIPCYDSFEIGIGIGSGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 DEIPCYDSFEIGIGIGSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 221

BLAST of Tan0004718 vs. ExPASy TrEMBL
Match: A0A6J1E811 (uncharacterized protein LOC111431577 OS=Cucurbita moschata OX=3662 GN=LOC111431577 PE=4 SV=1)

HSP 1 Score: 321.2 bits (822), Expect = 3.4e-84
Identity = 172/233 (73.82%), Postives = 190/233 (81.55%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSD 60
           ME  TDN++R+  EFDDSL DSAESKLRRL+S++   + PCT+GNWNVV   +S      
Sbjct: 1   MENCTDNKKRLLSEFDDSLADSAESKLRRLDSSDFSSLNPCTEGNWNVVQDLESVA---- 60

Query: 61  GDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQALPPAKTT 120
             D  I+L+ES +IQD+LLNILED D V ERDESI+GLELDSFIRSFEEEIQALP  KT 
Sbjct: 61  --DFTINLSESHDIQDDLLNILEDSDVVLERDESIEGLELDSFIRSFEEEIQALPSVKTP 120

Query: 121 SDRNETPQAELGYLFEASDDELGLPPTGGSSTEGK-MEAIDFMPACSS-AGVFELDGNLG 180
           S +NETPQ ELGYL+ ASDDELGLPPTGG ST+GK  EAIDFMPA SS  GVFELDGN G
Sbjct: 121 SHQNETPQVELGYLYGASDDELGLPPTGGLSTDGKRTEAIDFMPASSSPPGVFELDGNAG 180

Query: 181 FEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPFRPESLSAL 232
           FEDEIPCYD FEIG+G  SGAAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Sbjct: 181 FEDEIPCYDLFEIGMGFSSGAAEENGLGGEFVALGGLFDYSDVPFRPESLSAL 227

BLAST of Tan0004718 vs. ExPASy TrEMBL
Match: A0A0A0LQD5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G042270 PE=4 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 1.3e-70
Identity = 167/244 (68.44%), Postives = 192/244 (78.69%), Query Frame = 0

Query: 1   MEIYTDNRRRVRDEF-DDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGS 60
           M+ ++++R+RV DE  DDSL DSAESKLRRLNS++ R  KPCTK ++NVV  S      +
Sbjct: 1   MDNFSEDRKRVHDELDDDSLADSAESKLRRLNSSDLRIGKPCTKEDFNVVQGS-----SA 60

Query: 61  DGDDLRI-DLAESEEIQDE-LLNILEDGDAVTERDES-IQGLELDSFIRSFEEEIQALPP 120
              DL I DL ESEEIQDE LLNILED D V ERDES I+GLELDSFI+SFEEEIQ +P 
Sbjct: 61  IAGDLNIDDLVESEEIQDELLLNILEDSDVVAERDESAIEGLELDSFIKSFEEEIQGVPS 120

Query: 121 A-KTTSDRNETPQAELGYLFEASDDELGLPPTGG--SSTEG-KMEAIDFMPACSSAG--V 180
           +    ++ NETPQAELGYLF ASDDELGLPP+GG  S+TEG KMEAIDFMP  SS    V
Sbjct: 121 SVDDQNNNNETPQAELGYLFGASDDELGLPPSGGLSSTTEGKKMEAIDFMPPASSCSPDV 180

Query: 181 FELDGNLGFEDEIPCYDSFEIGIGIGSGA--AEENCL-GGEFVALGGLFDYSDVPFRPES 232
           FEL+G LGF+D+IPCYDSFE+G+GIGSGA  AE+N L GGEFVALGGLFDYSDV FRPES
Sbjct: 181 FELEGKLGFDDDIPCYDSFELGMGIGSGAAVAEDNGLGGGEFVALGGLFDYSDVLFRPES 239

BLAST of Tan0004718 vs. ExPASy TrEMBL
Match: A0A5E4F0J5 (PREDICTED: AT1G13360 OS=Prunus dulcis OX=3755 GN=ALMOND_2B007874 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 8.9e-40
Identity = 124/246 (50.41%), Postives = 149/246 (60.57%), Query Frame = 0

Query: 7   NRRRVRDEFDDSLVDS----AESKLRRLNSAESRFVKP--------CTKGNWNVVNSSKS 66
           N  R R  +D   +++     ESKL R NS+ S    P         T    N   S   
Sbjct: 6   NHNRKRPRYDSDHLETNHTQPESKLVRANSSHSVACSPESGSTVFDSTNSEVNSDESLTP 65

Query: 67  EQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQ-- 126
           + VG D D+L ++  E + IQD+LLNIL+D D VT+RD +IQ  +LDS I+SFEEEIQ  
Sbjct: 66  QPVGVDSDELVMESEEVKLIQDDLLNILDDSDIVTDRDPAIQ--DLDSVIKSFEEEIQVP 125

Query: 127 ALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFE 186
           A P ++TTS    + Q ELGYL EASDDELGLPPT G S +GK+EA DF  + S A    
Sbjct: 126 AFPVSETTSSPGGSSQPELGYLLEASDDELGLPPTNGGSEDGKLEAADFTSSGSEA--VG 185

Query: 187 LDGNLGFE-DEIPCYDSFEIGIGIGSGAAEENCLGG-EFVALGGLFDY-----SDVPFRP 232
           LDG LGFE D IP YDSFE+GIG G      N  GG E+VALGGLFDY     SDV +R 
Sbjct: 186 LDGMLGFENDIIPNYDSFELGIG-GDCNLNNNYNGGAEYVALGGLFDYSDGGVSDVSWRN 245

BLAST of Tan0004718 vs. TAIR 10
Match: AT1G13360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G25870.1); Has 69 Blast hits to 69 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 80.1 bits (196), Expect = 2.6e-15
Identity = 70/200 (35.00%), Postives = 108/200 (54.00%), Query Frame = 0

Query: 50  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFE 109
           +S++ ++V  +  D+  +D  E + ++D+L ++L+D D      E +   +LDS ++SFE
Sbjct: 12  DSAEKKRVRDESFDEAVLDSPEVKRLRDDLFDVLDDSD-----PEPV-SQDLDSVMKSFE 71

Query: 110 EEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSS------TEGKMEAI- 169
           +E+  +    A+ +S   ET Q +LGYL EASDDELGLPP    S       E   E + 
Sbjct: 72  DELSTVTTTTAQGSSTAGET-QPDLGYLLEASDDELGLPPPPSISPVPVAKEEVTTETVT 131

Query: 170 DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDY 229
           D + A S S+G+ E+    GFED +  Y   + G G+G         GG++VA+ GLF++
Sbjct: 132 DLVRASSDSSGIDEI---WGFEDHVSNYGGLDFGSGVGD--------GGDYVAVEGLFEF 191

Query: 230 SDVPF--------RPESLSA 231
           SD  F        R ESL A
Sbjct: 192 SDDCFDSGDLFSWRSESLPA 193

BLAST of Tan0004718 vs. TAIR 10
Match: AT1G13360.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G25870.1). )

HSP 1 Score: 72.8 bits (177), Expect = 4.1e-13
Identity = 62/181 (34.25%), Postives = 99/181 (54.70%), Query Frame = 0

Query: 50  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFE 109
           +S++ ++V  +  D+  +D  E + ++D+L ++L+D D      E +   +LDS ++SFE
Sbjct: 12  DSAEKKRVRDESFDEAVLDSPEVKRLRDDLFDVLDDSD-----PEPV-SQDLDSVMKSFE 71

Query: 110 EEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSS------TEGKMEAI- 169
           +E+  +    A+ +S   ET Q +LGYL EASDDELGLPP    S       E   E + 
Sbjct: 72  DELSTVTTTTAQGSSTAGET-QPDLGYLLEASDDELGLPPPPSISPVPVAKEEVTTETVT 131

Query: 170 DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDY 220
           D + A S S+G+ E+    GFED +  Y   + G G+G         GG++VA+ G F Y
Sbjct: 132 DLVRASSDSSGIDEI---WGFEDHVSNYGGLDFGSGVGD--------GGDYVAVEGFFYY 174

BLAST of Tan0004718 vs. TAIR 10
Match: AT1G13360.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G25870.1). )

HSP 1 Score: 70.1 bits (170), Expect = 2.7e-12
Identity = 61/177 (34.46%), Postives = 97/177 (54.80%), Query Frame = 0

Query: 50  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFE 109
           +S++ ++V  +  D+  +D  E + ++D+L ++L+D D      E +   +LDS ++SFE
Sbjct: 12  DSAEKKRVRDESFDEAVLDSPEVKRLRDDLFDVLDDSD-----PEPV-SQDLDSVMKSFE 71

Query: 110 EEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSS------TEGKMEAI- 169
           +E+  +    A+ +S   ET Q +LGYL EASDDELGLPP    S       E   E + 
Sbjct: 72  DELSTVTTTTAQGSSTAGET-QPDLGYLLEASDDELGLPPPPSISPVPVAKEEVTTETVT 131

Query: 170 DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGL 216
           D + A S S+G+ E+    GFED +  Y   + G G+G         GG++VA+ GL
Sbjct: 132 DLVRASSDSSGIDEI---WGFEDHVSNYGGLDFGSGVGD--------GGDYVAVEGL 170

BLAST of Tan0004718 vs. TAIR 10
Match: AT3G25870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G13360.1); Has 50 Blast hits to 50 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 48.5 bits (114), Expect = 8.3e-06
Identity = 57/191 (29.84%), Postives = 89/191 (46.60%), Query Frame = 0

Query: 52  SKSEQVGSD--GDDLRIDLAESEEIQDELLNILEDG-DAVTERDESIQGLELDSFIRSFE 111
           ++++ VG+    D L +D  + + ++D+L +  + G D V++        +LDS ++SFE
Sbjct: 10  TRTDSVGNKRVRDGLDLDSPDVKRLRDDLFD--DSGLDPVSQ--------DLDSVMKSFE 69

Query: 112 EEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPP--------TGGSSTEGKMEAID 171
            E+     A ++ +     Q +LGYLFEASDDELGLPP           S  E   E + 
Sbjct: 70  NELSTTTAALSSGE----TQPDLGYLFEASDDELGLPPPLTPPQTLLPPSCEETVTELV- 129

Query: 172 FMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSD 231
              +  S+ V EL    GFED +  +   ++G              G F    G  D  D
Sbjct: 130 -RASSDSSEVGEL---CGFEDHVTEFGPCDLGD------------DGLFEYFDGCLDSGD 169

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022954576.11.6e-9682.68uncharacterized protein LOC111456805 [Cucurbita moschata][more]
XP_023541956.16.2e-9682.25uncharacterized protein LOC111801944 [Cucurbita pepo subsp. pepo][more]
KAG6572935.12.4e-9582.25hypothetical protein SDJN03_26822, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022994085.11.5e-9480.95uncharacterized protein LOC111489916 [Cucurbita maxima][more]
KAG6584364.16.4e-8574.68hypothetical protein SDJN03_20296, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
A0A6J1GR957.9e-9782.68uncharacterized protein LOC111456805 OS=Cucurbita moschata OX=3662 GN=LOC1114568... [more]
A0A6J1K0977.4e-9580.95uncharacterized protein LOC111489916 OS=Cucurbita maxima OX=3661 GN=LOC111489916... [more]
A0A6J1E8113.4e-8473.82uncharacterized protein LOC111431577 OS=Cucurbita moschata OX=3662 GN=LOC1114315... [more]
A0A0A0LQD51.3e-7068.44Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G042270 PE=4 SV=1[more]
A0A5E4F0J58.9e-4050.41PREDICTED: AT1G13360 OS=Prunus dulcis OX=3755 GN=ALMOND_2B007874 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G13360.12.6e-1535.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G13360.24.1e-1334.25unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G13360.32.7e-1234.46unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT3G25870.18.3e-0629.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR34539:SF15SUBFAMILY NOT NAMEDcoord: 5..231
NoneNo IPR availablePANTHERPTHR34539T6J4.11 PROTEINcoord: 5..231

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004718.1Tan0004718.1mRNA