Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTACATCCGGCCTTCCAATTTAAACGAGAGTTCTTCAACCGTCATCTCAGTTTCTCCTGGCAATAACAAGCAACCCCTCCGGATTCTCTCATCTCCAATCTCCAGCCAAATTAGGGTTTCAATTCTATGAGGTTAGGATTTCTGATCTTTTTCCTTTTATCTCTCTTTCTCTATCCGCACATTACTGTCATGATTGATCCATTTACAGCCATCTATCTCGTTCACGCCATCACATGATTATTTGTTGAGTTGATAGTTTCTTGTATTAATTTCACGCGAATTCATTGCGTTTATAAAGTTTCAAGAAGTTGGAAGCATAATTGTTTGATTTGCAGCGTTGATTTCAGTTCACAGAAACATTTATGATTTTTTCTTTGTTTGATGCTTATAGTATTTGATTTATTAATATAGAGGAGAAGACAAGGGTGCCGAGGTTGGTAGATGAAATTGAAAATTACAGACTCGGAGACTACAGGGCATATATTATACATTTGGAGTGACACAAGAGTGATGTAGTTTAGTTTTAAATGGGATTTTGGGTTCTTGGATTAATCTTGCATAATCATTGGAAATTTTGTGTGGAATTGGTCTTTCTGTTCTTGTTTTGATCATTTTTTTTTCTCTTCACTCGAAAAAATACGATCATTCATGAAACTGTCCTTAATGCTTTCTTGCATTGTTTTTATGCTTCTTCTTGTAGTTTAGGCTCTATCAACTTCCTCCCCCTTGCCCTTATTATGGTTTTATTTTCTTTGTTATGGATTGACTGTGAAAGGAATATTTTCCGATGCCCACTAAGAAAGTTAGGATAAACCATAGTCTATACCATCCCCATTAGCAACAGAAACTTTCAATATTGACCTAAAAGACATTGTCATTTACAAAATACTAGGATCCGGGTCATTTTTGTGAACTGAAACTAGTAGAGTTACTTGTAGTTGGAAAGGCTCGAGTAATTTTTTTGACAGTCGTTAAAGATTGAAGTATTTCTGATAAATGTACCATGGGAATATGGAACAACATTGGTTGGAGGATCATGGGAAATTTTGATGCTTGGGCTTGTGGAATTGGTCCGTCTGTTTTGTTTTGAACCTTTTTCCCCCTCTTCACCTAGAGAAGGTGATCATTAATGAACCTGTCCATGTTTTCCCTACTGTTTTTATGCTGCATCTTGCAATTTAGGCTCTACCAACTTCCTCCCCCTAGCCCTTATCATGGCTCTACGTGCTTGATGGACTGTGAATGTCTGTGAATGGAATATTTTCACACCCATTTATGAACCATACTTACCTACTGACTGAGCAACAGGACCATTGTTAAATGTTTCACATGGTTTTTTTATTTTATTTTTGAAAAAAGGAAAAACATCTCTTCCCTTTGTTCCTTTCTGCATTTCTCTCTCCTCTTTTATCTTTAGTAATTCCTGCTGCTCAGTCAATAGGTAAGTATTATCCATAACAAAATTGGGATAACTCTTAGTAACCAATAAGCAGACAGACACTCTATATTGGTCATATTGTCATTAACAAAATTCTAGGATCCCAGTCGTTCTTTGTGAACTGAGTTTTTTTTCCCATTCTTAAAAATCAGTCTTTCTGATAAATTTATACATATGGTTGTATGGAACAGCATGGTCGAAGGATCTATTTCTTCTCATCTTGACAATGGACCAGAATCTAAAGATCAAGTCGATGCTCTTACTCCTGAAGACATTGCTTGGGTTGATTCTTGTCTGATTAAAGAGGTACCAGATACTTCAGATGGCAATTGGAACCAGATAAAGGATGCCTTGTTAGAAGTCCTTGATCTGTATCCTCAAGGTTTTGAATCTTCTCTTGCTGTAAGTGGTAATGTTCCAGGAGGTACTAACGATGATATCGACGTTGACATGCTTCTCTTTAATAATGTGAAGGAGCCTACATTTCCCTCAAGAGATACCGATGATCCTATGAATGAAACAGGAACAGCTTCAGAAGATCCTCAAAACGATAATGATGTTGATTTGTCTCTGCCGTTATCTTTCAGCAAGAATCCACTTTTACCCACTTACAAAGAGGAGGATAATGTTCTGGGAGGTACTAATGATGATATCGACATCGACATGCTTCTCTCTAATAATGTGAAGGAGCCTACATTTCTCTCGAGAGATAGCGATGACCCTATGAATGAAACAGGAATAGCTTCGGAAGATCAACAAAACCACGATGAGATCAATACTTCTCTGTCGCTATCTAACAAGAATCCATTTTTACCTACTTACAAAGAGGAGGCAGAAGGGAAGGAGACCATTCAAACTGGATCTAGCCATGATTTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCGGGTCTGGGATTTGAACCTTCCTCCAGTCGAAGACGAGCTTGTCAAGCAGCTGAACAAAGCCCTTTCTGAAAATTCTGCTGAATCAGTCCCTTCAATGGATAGTAATCTCAGTTCGTTGGAAGACTTACAGGAATATTTACTTGATGACTTGATCAGTAGCATTTCTGGCCTGTCTTTGGAACAGAATAAATAATAGGGACATAGTTGTTCTGGAGCCACTGATTTCTGAAAGTTCATGCTGCTGACACGTGGCACTCTGTTATTCGATAACTTCAAATGTCTCACATTGGAATAGTGGTGATTTTAAAGACGCTACTGAGAGGCTTAGATGATATAGTTCAACGGTATGTTTATTTTTTCTGGTGCAAACTAGTGGCATATGTGAATACCTTTATCTCAAAATCTTATTTGACAGGTTTGGTTAATGATGGCGTCTTAGTAATTTTCTTTTTCTTTTGGGAACATGTCTTAGTAATTAGTATATCTAAGAGGTTGTGATGTTATGGAGGGTGGGATGTTGTTTAAAGTTTAATTGTATTTTTG
mRNA sequence
GTACATCCGGCCTTCCAATTTAAACGAGAGTTCTTCAACCGTCATCTCAGTTTCTCCTGGCAATAACAAGCAACCCCTCCGGATTCTCTCATCTCCAATCTCCAGCCAAATTAGGGTTTCAATTCTATGAGCATGGTCGAAGGATCTATTTCTTCTCATCTTGACAATGGACCAGAATCTAAAGATCAAGTCGATGCTCTTACTCCTGAAGACATTGCTTGGGTTGATTCTTGTCTGATTAAAGAGGTACCAGATACTTCAGATGGCAATTGGAACCAGATAAAGGATGCCTTGTTAGAAGTCCTTGATCTGTATCCTCAAGGTTTTGAATCTTCTCTTGCTGTAAGTGGTAATGTTCCAGGAGGTACTAACGATGATATCGACGTTGACATGCTTCTCTTTAATAATGTGAAGGAGCCTACATTTCCCTCAAGAGATACCGATGATCCTATGAATGAAACAGGAACAGCTTCAGAAGATCCTCAAAACGATAATGATGTTGATTTGTCTCTGCCGTTATCTTTCAGCAAGAATCCACTTTTACCCACTTACAAAGAGGAGGATAATGTTCTGGGAGGTACTAATGATGATATCGACATCGACATGCTTCTCTCTAATAATGTGAAGGAGCCTACATTTCTCTCGAGAGATAGCGATGACCCTATGAATGAAACAGGAATAGCTTCGGAAGATCAACAAAACCACGATGAGATCAATACTTCTCTGTCGCTATCTAACAAGAATCCATTTTTACCTACTTACAAAGAGGAGGCAGAAGGGAAGGAGACCATTCAAACTGGATCTAGCCATGATTTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCGGGTCTGGGATTTGAACCTTCCTCCAGTCGAAGACGAGCTTGTCAAGCAGCTGAACAAAGCCCTTTCTGAAAATTCTGCTGAATCAGTCCCTTCAATGGATAGTAATCTCAGTTCGTTGGAAGACTTACAGGAATATTTACTTGATGACTTGATCAGTAGCATTTCTGGCCTGTCTTTGGAACAGAATAAATAATAGGGACATAGTTGTTCTGGAGCCACTGATTTCTGAAAGTTCATGCTGCTGACACGTGGCACTCTGTTATTCGATAACTTCAAATGTCTCACATTGGAATAGTGGTGATTTTAAAGACGCTACTGAGAGGCTTAGATGATATAGTTCAACGGTATGTTTATTTTTTCTGGTGCAAACTAGTGGCATATGTGAATACCTTTATCTCAAAATCTTATTTGACAGGTTTGGTTAATGATGGCGTCTTAGTAATTTTCTTTTTCTTTTGGGAACATGTCTTAGTAATTAGTATATCTAAGAGGTTGTGATGTTATGGAGGGTGGGATGTTGTTTAAAGTTTAATTGTATTTTTG
Coding sequence (CDS)
ATGAGCATGGTCGAAGGATCTATTTCTTCTCATCTTGACAATGGACCAGAATCTAAAGATCAAGTCGATGCTCTTACTCCTGAAGACATTGCTTGGGTTGATTCTTGTCTGATTAAAGAGGTACCAGATACTTCAGATGGCAATTGGAACCAGATAAAGGATGCCTTGTTAGAAGTCCTTGATCTGTATCCTCAAGGTTTTGAATCTTCTCTTGCTGTAAGTGGTAATGTTCCAGGAGGTACTAACGATGATATCGACGTTGACATGCTTCTCTTTAATAATGTGAAGGAGCCTACATTTCCCTCAAGAGATACCGATGATCCTATGAATGAAACAGGAACAGCTTCAGAAGATCCTCAAAACGATAATGATGTTGATTTGTCTCTGCCGTTATCTTTCAGCAAGAATCCACTTTTACCCACTTACAAAGAGGAGGATAATGTTCTGGGAGGTACTAATGATGATATCGACATCGACATGCTTCTCTCTAATAATGTGAAGGAGCCTACATTTCTCTCGAGAGATAGCGATGACCCTATGAATGAAACAGGAATAGCTTCGGAAGATCAACAAAACCACGATGAGATCAATACTTCTCTGTCGCTATCTAACAAGAATCCATTTTTACCTACTTACAAAGAGGAGGCAGAAGGGAAGGAGACCATTCAAACTGGATCTAGCCATGATTTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCGGGTCTGGGATTTGAACCTTCCTCCAGTCGAAGACGAGCTTGTCAAGCAGCTGAACAAAGCCCTTTCTGAAAATTCTGCTGAATCAGTCCCTTCAATGGATAGTAATCTCAGTTCGTTGGAAGACTTACAGGAATATTTACTTGATGACTTGATCAGTAGCATTTCTGGCCTGTCTTTGGAACAGAATAAATAA
Protein sequence
MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNETGIASEDQQNHDEINTSLSLSNKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGLSLEQNK
Homology
BLAST of Tan0013059 vs. NCBI nr
Match:
XP_022971265.1 (uncharacterized protein LOC111470037 [Cucurbita maxima] >XP_022971272.1 uncharacterized protein LOC111470037 [Cucurbita maxima])
HSP 1 Score: 423.3 bits (1087), Expect = 1.8e-114
Identity = 224/306 (73.20%), Postives = 250/306 (81.70%), Query Frame = 0
Query: 3 MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDL 62
M+EGS SS L++G ESK+Q+D LTPEDIAW DSCLIKE+PD DGNWN IKDALLEV DL
Sbjct: 1 MLEGSDSSDLESGQESKEQIDDLTPEDIAWADSCLIKEIPDILDGNWNHIKDALLEVFDL 60
Query: 63 YPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQND 122
YPQGFES LAVS NVPGGTNDD+DVDML F NVK+ T RD+DDPM
Sbjct: 61 YPQGFESPLAVSDNVPGGTNDDVDVDMLRFKNVKKST--ERDSDDPM------------- 120
Query: 123 NDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNE 182
NDVD SL LSFS NPLLPTYKEEDNV GGT+DDI+I+MLLSNNVK+PTF SRDSDD MNE
Sbjct: 121 NDVDSSLSLSFSTNPLLPTYKEEDNVPGGTSDDININMLLSNNVKKPTFFSRDSDDRMNE 180
Query: 183 TGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPIND 242
TG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPIND
Sbjct: 181 TGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEPPIND 240
Query: 243 IFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGL 302
IF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN S L+D + LLD LI SIS L
Sbjct: 241 IFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFSVLKDFNDDLLDSLIDSISDL 291
Query: 303 SLEQNK 308
SL K
Sbjct: 301 SLGPKK 291
BLAST of Tan0013059 vs. NCBI nr
Match:
XP_022925089.1 (uncharacterized protein LOC111432435 isoform X1 [Cucurbita moschata])
HSP 1 Score: 422.5 bits (1085), Expect = 3.0e-114
Identity = 225/310 (72.58%), Postives = 252/310 (81.29%), Query Frame = 0
Query: 1 MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVL 60
MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD DGNWN IKDALLEVL
Sbjct: 1 MSMLEGSDSSHLESGQESKEQIDDLTPEDIAWADSCLIKDIPDILDGNWNHIKDALLEVL 60
Query: 61 DLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQ 120
DLYPQGFES LAVS VPGG NDDIDVD+L F NVK+PT RD+DDPM
Sbjct: 61 DLYPQGFESPLAVSDTVPGGINDDIDVDVLRFKNVKKPT--ERDSDDPM----------- 120
Query: 121 NDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDD 180
NDVD SL LSFS NPLLPTYKEEDNV G T+D DIDIDMLLSNNVK+PTF S+DSDD
Sbjct: 121 --NDVDSSLSLSFSTNPLLPTYKEEDNVPGSTSDDIDIDIDMLLSNNVKKPTFFSKDSDD 180
Query: 181 PMNETGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEP 240
MNETG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EP
Sbjct: 181 RMNETGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEP 240
Query: 241 PINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS 300
PINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN ++D + LLD LI S
Sbjct: 241 PINDIFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFRVVKDFNDDLLDSLIDS 295
Query: 301 ISGLSLEQNK 308
IS LSLE K
Sbjct: 301 ISDLSLEPKK 295
BLAST of Tan0013059 vs. NCBI nr
Match:
KAG7022261.1 (hypothetical protein SDJN02_15992 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 422.5 bits (1085), Expect = 3.0e-114
Identity = 225/310 (72.58%), Postives = 252/310 (81.29%), Query Frame = 0
Query: 1 MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVL 60
MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD DGNWN IKDALLEVL
Sbjct: 1 MSMLEGSDSSHLESGQESKEQIDDLTPEDIAWADSCLIKDIPDILDGNWNHIKDALLEVL 60
Query: 61 DLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQ 120
DLYPQGFES LAVS VPGG NDDIDVD+L F NVK+PT RD+DDPM
Sbjct: 61 DLYPQGFESPLAVSDTVPGGINDDIDVDVLRFKNVKKPT--ERDSDDPM----------- 120
Query: 121 NDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDD 180
NDVD SL LSFS NPLLPTYKEEDNV G T+D DIDIDMLLSNNVK+PTF S+DSDD
Sbjct: 121 --NDVDSSLSLSFSMNPLLPTYKEEDNVPGSTSDDIDIDIDMLLSNNVKKPTFFSKDSDD 180
Query: 181 PMNETGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEP 240
MNETG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EP
Sbjct: 181 RMNETGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEP 240
Query: 241 PINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS 300
PINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN ++D + LLD LI S
Sbjct: 241 PINDIFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFRVVKDFNDDLLDSLIDS 295
Query: 301 ISGLSLEQNK 308
IS LSLE K
Sbjct: 301 ISDLSLEPKK 295
BLAST of Tan0013059 vs. NCBI nr
Match:
XP_022925097.1 (uncharacterized protein LOC111432435 isoform X2 [Cucurbita moschata] >KAG6588418.1 hypothetical protein SDJN03_16983, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 419.5 bits (1077), Expect = 2.5e-113
Identity = 223/308 (72.40%), Postives = 250/308 (81.17%), Query Frame = 0
Query: 3 MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDL 62
M+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD DGNWN IKDALLEVLDL
Sbjct: 1 MLEGSDSSHLESGQESKEQIDDLTPEDIAWADSCLIKDIPDILDGNWNHIKDALLEVLDL 60
Query: 63 YPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQND 122
YPQGFES LAVS VPGG NDDIDVD+L F NVK+PT RD+DDPM
Sbjct: 61 YPQGFESPLAVSDTVPGGINDDIDVDVLRFKNVKKPT--ERDSDDPM------------- 120
Query: 123 NDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPM 182
NDVD SL LSFS NPLLPTYKEEDNV G T+D DIDIDMLLSNNVK+PTF S+DSDD M
Sbjct: 121 NDVDSSLSLSFSTNPLLPTYKEEDNVPGSTSDDIDIDIDMLLSNNVKKPTFFSKDSDDRM 180
Query: 183 NETGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPI 242
NETG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPI
Sbjct: 181 NETGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEPPI 240
Query: 243 NDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSIS 302
NDIF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN ++D + LLD LI SIS
Sbjct: 241 NDIFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFRVVKDFNDDLLDSLIDSIS 293
Query: 303 GLSLEQNK 308
LSLE K
Sbjct: 301 DLSLEPKK 293
BLAST of Tan0013059 vs. NCBI nr
Match:
XP_023529376.1 (uncharacterized protein LOC111792251 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 417.5 bits (1072), Expect = 9.7e-113
Identity = 224/310 (72.26%), Postives = 250/310 (80.65%), Query Frame = 0
Query: 1 MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVL 60
MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIKE+PD DGNWN IKDALLEVL
Sbjct: 1 MSMLEGSDSSHLESGQESKEQIDDLTPEDIAWADSCLIKEIPDILDGNWNHIKDALLEVL 60
Query: 61 DLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQ 120
DLYPQGFES LAVS VPGG ND IDVDML F NVK+PT RD+DDPM
Sbjct: 61 DLYPQGFESPLAVSDTVPGGINDYIDVDMLRFKNVKKPT--ERDSDDPM----------- 120
Query: 121 NDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDD 180
NDVD SL LSFS NPLLP YKEEDNV GGT+D DIDIDMLLSNNVK+PTF ++DSDD
Sbjct: 121 --NDVDSSLSLSFSTNPLLPIYKEEDNVPGGTSDDIDIDIDMLLSNNVKKPTFFAKDSDD 180
Query: 181 PMNETGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEP 240
MNETG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE + KE+IQT SSHDL EIG EP
Sbjct: 181 RMNETGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDEKESIQTESSHDLPEIGFEP 240
Query: 241 PINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS 300
PINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN ++D + LLD LI S
Sbjct: 241 PINDIFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFRVVKDFNDDLLDSLIDS 295
Query: 301 ISGLSLEQNK 308
IS LSLE K
Sbjct: 301 ISDLSLEPKK 295
BLAST of Tan0013059 vs. ExPASy TrEMBL
Match:
A0A6J1I1I4 (uncharacterized protein LOC111470037 OS=Cucurbita maxima OX=3661 GN=LOC111470037 PE=4 SV=1)
HSP 1 Score: 423.3 bits (1087), Expect = 8.5e-115
Identity = 224/306 (73.20%), Postives = 250/306 (81.70%), Query Frame = 0
Query: 3 MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDL 62
M+EGS SS L++G ESK+Q+D LTPEDIAW DSCLIKE+PD DGNWN IKDALLEV DL
Sbjct: 1 MLEGSDSSDLESGQESKEQIDDLTPEDIAWADSCLIKEIPDILDGNWNHIKDALLEVFDL 60
Query: 63 YPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQND 122
YPQGFES LAVS NVPGGTNDD+DVDML F NVK+ T RD+DDPM
Sbjct: 61 YPQGFESPLAVSDNVPGGTNDDVDVDMLRFKNVKKST--ERDSDDPM------------- 120
Query: 123 NDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNE 182
NDVD SL LSFS NPLLPTYKEEDNV GGT+DDI+I+MLLSNNVK+PTF SRDSDD MNE
Sbjct: 121 NDVDSSLSLSFSTNPLLPTYKEEDNVPGGTSDDININMLLSNNVKKPTFFSRDSDDRMNE 180
Query: 183 TGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPIND 242
TG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPIND
Sbjct: 181 TGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEPPIND 240
Query: 243 IFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISGL 302
IF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN S L+D + LLD LI SIS L
Sbjct: 241 IFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFSVLKDFNDDLLDSLIDSISDL 291
Query: 303 SLEQNK 308
SL K
Sbjct: 301 SLGPKK 291
BLAST of Tan0013059 vs. ExPASy TrEMBL
Match:
A0A6J1EB39 (uncharacterized protein LOC111432435 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432435 PE=4 SV=1)
HSP 1 Score: 422.5 bits (1085), Expect = 1.5e-114
Identity = 225/310 (72.58%), Postives = 252/310 (81.29%), Query Frame = 0
Query: 1 MSMVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVL 60
MSM+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD DGNWN IKDALLEVL
Sbjct: 1 MSMLEGSDSSHLESGQESKEQIDDLTPEDIAWADSCLIKDIPDILDGNWNHIKDALLEVL 60
Query: 61 DLYPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQ 120
DLYPQGFES LAVS VPGG NDDIDVD+L F NVK+PT RD+DDPM
Sbjct: 61 DLYPQGFESPLAVSDTVPGGINDDIDVDVLRFKNVKKPT--ERDSDDPM----------- 120
Query: 121 NDNDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDD 180
NDVD SL LSFS NPLLPTYKEEDNV G T+D DIDIDMLLSNNVK+PTF S+DSDD
Sbjct: 121 --NDVDSSLSLSFSTNPLLPTYKEEDNVPGSTSDDIDIDIDMLLSNNVKKPTFFSKDSDD 180
Query: 181 PMNETGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEP 240
MNETG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EP
Sbjct: 181 RMNETGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEP 240
Query: 241 PINDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISS 300
PINDIF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN ++D + LLD LI S
Sbjct: 241 PINDIFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFRVVKDFNDDLLDSLIDS 295
Query: 301 ISGLSLEQNK 308
IS LSLE K
Sbjct: 301 ISDLSLEPKK 295
BLAST of Tan0013059 vs. ExPASy TrEMBL
Match:
A0A6J1EAW3 (uncharacterized protein LOC111432435 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111432435 PE=4 SV=1)
HSP 1 Score: 419.5 bits (1077), Expect = 1.2e-113
Identity = 223/308 (72.40%), Postives = 250/308 (81.17%), Query Frame = 0
Query: 3 MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDL 62
M+EGS SSHL++G ESK+Q+D LTPEDIAW DSCLIK++PD DGNWN IKDALLEVLDL
Sbjct: 1 MLEGSDSSHLESGQESKEQIDDLTPEDIAWADSCLIKDIPDILDGNWNHIKDALLEVLDL 60
Query: 63 YPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQND 122
YPQGFES LAVS VPGG NDDIDVD+L F NVK+PT RD+DDPM
Sbjct: 61 YPQGFESPLAVSDTVPGGINDDIDVDVLRFKNVKKPT--ERDSDDPM------------- 120
Query: 123 NDVDLSLPLSFSKNPLLPTYKEEDNVLGGTND--DIDIDMLLSNNVKEPTFLSRDSDDPM 182
NDVD SL LSFS NPLLPTYKEEDNV G T+D DIDIDMLLSNNVK+PTF S+DSDD M
Sbjct: 121 NDVDSSLSLSFSTNPLLPTYKEEDNVPGSTSDDIDIDIDMLLSNNVKKPTFFSKDSDDRM 180
Query: 183 NETGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPI 242
NETG+ASEDQQ+HD+I+TSLSLS NKNPFLPTYKEE +GKE+IQT SSHDL EIG EPPI
Sbjct: 181 NETGMASEDQQHHDDIDTSLSLSFNKNPFLPTYKEEVDGKESIQTESSHDLPEIGFEPPI 240
Query: 243 NDIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSIS 302
NDIF+VWDLNLPP+E++LV+QLNKALSENS ESV DSN ++D + LLD LI SIS
Sbjct: 241 NDIFQVWDLNLPPIENDLVEQLNKALSENSTESVRLQDSNFRVVKDFNDDLLDSLIDSIS 293
Query: 303 GLSLEQNK 308
LSLE K
Sbjct: 301 DLSLEPKK 293
BLAST of Tan0013059 vs. ExPASy TrEMBL
Match:
A0A5D3BCZ2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold10G00320 PE=4 SV=1)
HSP 1 Score: 341.3 bits (874), Expect = 4.3e-90
Identity = 193/307 (62.87%), Postives = 211/307 (68.73%), Query Frame = 0
Query: 3 MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDL 62
MVEGSISSHLD GPESK+QVD LT EDIAWVDSCLIKE+PD SDGNWN +KDALLE+LDL
Sbjct: 1 MVEGSISSHLDIGPESKEQVDGLTREDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDL 60
Query: 63 YPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQND 122
YPQGFESSLA+S NVPG +N DIDVDML NNVKEPTF SRD+DD MNET TA E
Sbjct: 61 YPQGFESSLALSDNVPGASNGDIDVDMLPSNNVKEPTFSSRDSDDLMNETRTALE----- 120
Query: 123 NDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNE 182
D PMN+
Sbjct: 121 ------------------------------------------------------DHPMND 180
Query: 183 TGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGK-ETIQTGSSHDLSEIGSEPPIN 242
TGIASED Q HD+I+TSL L+ KNPFLPTYKEE EG E Q G H+LSEIGSE PIN
Sbjct: 181 TGIASEDPQKHDDIDTSLPLTLIKNPFLPTYKEEVEGNDENDQAGIGHELSEIGSESPIN 240
Query: 243 DIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISG 302
DIF VWDLN PPVEDEL++QLNKAL+ENS ESVPSMDSNL L+DL+E LLDDLI+SIS
Sbjct: 241 DIFHVWDLNFPPVEDELMEQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISD 248
Query: 303 LSLEQNK 308
LSLEQ K
Sbjct: 301 LSLEQTK 248
BLAST of Tan0013059 vs. ExPASy TrEMBL
Match:
A0A1S3BBI4 (uncharacterized protein LOC103488123 OS=Cucumis melo OX=3656 GN=LOC103488123 PE=4 SV=1)
HSP 1 Score: 341.3 bits (874), Expect = 4.3e-90
Identity = 193/307 (62.87%), Postives = 211/307 (68.73%), Query Frame = 0
Query: 3 MVEGSISSHLDNGPESKDQVDALTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDL 62
MVEGSISSHLD GPESK+QVD LT EDIAWVDSCLIKE+PD SDGNWN +KDALLE+LDL
Sbjct: 1 MVEGSISSHLDIGPESKEQVDGLTREDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDL 60
Query: 63 YPQGFESSLAVSGNVPGGTNDDIDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQND 122
YPQGFESSLA+S NVPG +N DIDVDML NNVKEPTF SRD+DD MNET TA E
Sbjct: 61 YPQGFESSLALSDNVPGASNGDIDVDMLPSNNVKEPTFSSRDSDDLMNETRTALE----- 120
Query: 123 NDVDLSLPLSFSKNPLLPTYKEEDNVLGGTNDDIDIDMLLSNNVKEPTFLSRDSDDPMNE 182
D PMN+
Sbjct: 121 ------------------------------------------------------DHPMND 180
Query: 183 TGIASEDQQNHDEINTSLSLS-NKNPFLPTYKEEAEGK-ETIQTGSSHDLSEIGSEPPIN 242
TGIASED Q HD+I+TSL L+ KNPFLPTYKEE EG E Q G H+LSEIGSE PIN
Sbjct: 181 TGIASEDPQKHDDIDTSLPLTLIKNPFLPTYKEEVEGNDENDQAGIGHELSEIGSESPIN 240
Query: 243 DIFRVWDLNLPPVEDELVKQLNKALSENSAESVPSMDSNLSSLEDLQEYLLDDLISSISG 302
DIF VWDLN PPVEDEL++QLNKAL+ENS ESVPSMDSNL L+DL+E LLDDLI+SIS
Sbjct: 241 DIFHVWDLNFPPVEDELMEQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISD 248
Query: 303 LSLEQNK 308
LSLEQ K
Sbjct: 301 LSLEQTK 248
BLAST of Tan0013059 vs. TAIR 10
Match:
AT4G38980.1 (unknown protein; Has 44 Blast hits to 44 proteins in 19 species: Archae - 0; Bacteria - 2; Metazoa - 2; Fungi - 8; Plants - 24; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )
HSP 1 Score: 70.1 bits (170), Expect = 3.5e-12
Identity = 77/288 (26.74%), Postives = 130/288 (45.14%), Query Frame = 0
Query: 25 LTPEDIAWVDSCLIKEVPDTSDGNWNQIKDALLEVLDLYPQGFESSLAVSGNVPGGTNDD 84
L+PE +AW DSC+I + D+ + NW +DAL E++D++P+ F S GT
Sbjct: 20 LSPECVAWADSCIISFLDDSDNNNWGTFRDALTEIIDIHPEMFVFSSTT------GTRGV 79
Query: 85 IDVDMLLFNNVKEPTFPSRDTDDPMNETGTASEDPQNDNDVDLSLPLSFSKNPLLPTYKE 144
+ D ++ + T R +P + + + N+ ++ L+F +P
Sbjct: 80 LSPDEVM---TESETIDLR-RSEPAAGSANSRTNSSNEEVSEIISMLTFESDP------- 139
Query: 145 EDNVLGGTNDDIDIDMLLSNNVKEPTFLSR---DSDDPMNETGIASEDQQNHDEINTSLS 204
N L D + + N +EP S+ + + E G S + ++ + S
Sbjct: 140 SKNCL---EDYYFPESIAENGNREPADGSKTDLGGVESIEEDGSVSNGEAKEEKPASVSS 199
Query: 205 LSNKNPFLPTYKEEAEGKETIQTGSSHDLSEIGSEPPINDIFRVWDLNL---PPVEDELV 264
K+ F+ TY E+ +++E + +IF+VWDL + ED LV
Sbjct: 200 QVFKDDFMSTYVED--------NAEDCNVTEDPVKVTSQEIFKVWDLEVVGDNDEEDGLV 259
Query: 265 KQLNKALSENSAESVPSMDSNLSSLEDLQEYL-LDDLISSISGLSLEQ 306
QL KAL E+S +V + L+ + + E +DDLIS IS LSL +
Sbjct: 260 LQLKKALDESS--TVQPLPQPLNDDQVVSEKSNIDDLISGISDLSLAE 277
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022971265.1 | 1.8e-114 | 73.20 | uncharacterized protein LOC111470037 [Cucurbita maxima] >XP_022971272.1 uncharac... | [more] |
XP_022925089.1 | 3.0e-114 | 72.58 | uncharacterized protein LOC111432435 isoform X1 [Cucurbita moschata] | [more] |
KAG7022261.1 | 3.0e-114 | 72.58 | hypothetical protein SDJN02_15992 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022925097.1 | 2.5e-113 | 72.40 | uncharacterized protein LOC111432435 isoform X2 [Cucurbita moschata] >KAG6588418... | [more] |
XP_023529376.1 | 9.7e-113 | 72.26 | uncharacterized protein LOC111792251 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1I1I4 | 8.5e-115 | 73.20 | uncharacterized protein LOC111470037 OS=Cucurbita maxima OX=3661 GN=LOC111470037... | [more] |
A0A6J1EB39 | 1.5e-114 | 72.58 | uncharacterized protein LOC111432435 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1EAW3 | 1.2e-113 | 72.40 | uncharacterized protein LOC111432435 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A5D3BCZ2 | 4.3e-90 | 62.87 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3BBI4 | 4.3e-90 | 62.87 | uncharacterized protein LOC103488123 OS=Cucumis melo OX=3656 GN=LOC103488123 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT4G38980.1 | 3.5e-12 | 26.74 | unknown protein; Has 44 Blast hits to 44 proteins in 19 species: Archae - 0; Bac... | [more] |