Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTATTGCGAGCACAAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCACCAACAACATTTTCTCTCAAATCTCAAAATCCCGATTCCCAAGCTCCACAGCAACCTCATCAATGGCGTCTTCAGCTTCCAGCCCTTTGACCAAACCCCATTTTCCTCATTCTCCACTTCCACCAACTCCTTCCGCTCACCAACACAAGTCCTGTGCACAATTTCTATGTAAATCCCTCTTCTTCTGCATCTTCCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAACCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAGCATCCAGGTGAGTGTAGACGAACCTCGCTTCTCCAATTTTGATCATCCACAGTCGTATTTCTCTAAGATGTTTCACGTCGCTTCGATTTTTGAAGATGCTGACGATTTGAGTGATTCTGATGAGAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCGTGGATCCGTGAGTGATTTTTGGGATTTTAATGCTCAATCTCGCGAACAGGAAAAACTCCATTGCTCTATACCCAAAAAAAGGTATGAAAATTCTTATGAATCTTCTGATACTGACAATGTCGGTCATGCTTGTAAATCGAGATATACTCGGGGTGGATCTTTGTTGGTTGTTGCTGAAACAAATCGTAGTTCTGGTGAATGGATGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTAAGATCGAATCTTACTGAACCTGACGATCTCGAATTCGATTGTGGTGATGAATCTTGCTTGAGTTCTAAAACTTCATCCAATAGCTCTGAGAATAATTGTGAAAGAATAAGTGAATTTGGTGATAATTGTTGTACAAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTTCATCATTGTCCCCATTTCAATGGCGCGAGAAATTTGGAAAGAAGTTGGTGAGAGAGAGAGGAGTTGGGAATGCTATTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAACTCTAGCTCTCTTCATTCTTCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTCCGGCGTTGCCATCAACGACGAGAAAGCACCGTAAAATGTCGTCGCTCAGTAACATTTCCTCTAAGTCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACGGTAGAGGGAGCTCTGAGGACCCTCTGATTGAACCAGAAAATTCATCCGAGTGCAATGGATCCGTGGTAAGTTCCCCAAATTTAGATAGGAATTTCGCAAGAATGCCGAAAGCTTTATCCCGGGGAAAATCCGTTAGAACAGTTAGAGCAAATGTAGTTACCATGGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAATGGAGCATGATAACAATACAGGGAAGTTTGAAGAAGGTGGAGAGTCACCATATATGAGAGAAGATGGAATGGGACATGGATGGGATGCTGTTGTTAACCCGAATGCTGGTAATTCGAATCGTTTGTCGAAGACGACATTCTTGGGAATTGAGGAGCAGAAGGAAGACACTGAGAGTTTGCTGACAGATGATGGTAAAGATAACTCTGATAAGGAGGATGAAACTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCGAGTATGGCGGGAGATTCAGAATCGGGGGCTTACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTTAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGCAGAAAAAAGATTGAGAGGAGGATGGGGGTCATTCAGCAGCACAGGCAGCAGCTATTTCAGTTGATTGGCTAAGGCTTCAATCTTAAAAAGGCCTTCTTCTAAGCCCTTTACACATTTAAGGTTAAAT
mRNA sequence
GTTTATTGCGAGCACAAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCACCAACAACATTTTCTCTCAAATCTCAAAATCCCGATTCCCAAGCTCCACAGCAACCTCATCAATGGCGTCTTCAGCTTCCAGCCCTTTGACCAAACCCCATTTTCCTCATTCTCCACTTCCACCAACTCCTTCCGCTCACCAACACAAGTCCTGTGCACAATTTCTATGTAAATCCCTCTTCTTCTGCATCTTCCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAACCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAGCATCCAGGTGAGTGTAGACGAACCTCGCTTCTCCAATTTTGATCATCCACAGTCGTATTTCTCTAAGATGTTTCACGTCGCTTCGATTTTTGAAGATGCTGACGATTTGAGTGATTCTGATGAGAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCGTGGATCCGTGAGTGATTTTTGGGATTTTAATGCTCAATCTCGCGAACAGGAAAAACTCCATTGCTCTATACCCAAAAAAAGGTATGAAAATTCTTATGAATCTTCTGATACTGACAATGTCGGTCATGCTTGTAAATCGAGATATACTCGGGGTGGATCTTTGTTGGTTGTTGCTGAAACAAATCGTAGTTCTGGTGAATGGATGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTAAGATCGAATCTTACTGAACCTGACGATCTCGAATTCGATTGTGGTGATGAATCTTGCTTGAGTTCTAAAACTTCATCCAATAGCTCTGAGAATAATTGTGAAAGAATAAGTGAATTTGGTGATAATTGTTGTACAAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTTCATCATTGTCCCCATTTCAATGGCGCGAGAAATTTGGAAAGAAGTTGGTGAGAGAGAGAGGAGTTGGGAATGCTATTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAACTCTAGCTCTCTTCATTCTTCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTCCGGCGTTGCCATCAACGACGAGAAAGCACCGTAAAATGTCGTCGCTCAGTAACATTTCCTCTAAGTCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACGGTAGAGGGAGCTCTGAGGACCCTCTGATTGAACCAGAAAATTCATCCGAGTGCAATGGATCCGTGGTAAGTTCCCCAAATTTAGATAGGAATTTCGCAAGAATGCCGAAAGCTTTATCCCGGGGAAAATCCGTTAGAACAGTTAGAGCAAATGTAGTTACCATGGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAATGGAGCATGATAACAATACAGGGAAGTTTGAAGAAGGTGGAGAGTCACCATATATGAGAGAAGATGGAATGGGACATGGATGGGATGCTGTTGTTAACCCGAATGCTGGTAATTCGAATCGTTTGTCGAAGACGACATTCTTGGGAATTGAGGAGCAGAAGGAAGACACTGAGAGTTTGCTGACAGATGATGGTAAAGATAACTCTGATAAGGAGGATGAAACTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCGAGTATGGCGGGAGATTCAGAATCGGGGGCTTACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTTAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGCAGAAAAAAGATTGAGAGGAGGATGGGGGTCATTCAGCAGCACAGGCAGCAGCTATTTCAGTTGATTGGCTAAGGCTTCAATCTTAAAAAGGCCTTCTTCTAAGCCCTTTACACATTTAAGGTTAAAT
Coding sequence (CDS)
ATGGCGTCTTCAGCTTCCAGCCCTTTGACCAAACCCCATTTTCCTCATTCTCCACTTCCACCAACTCCTTCCGCTCACCAACACAAGTCCTGTGCACAATTTCTATGTAAATCCCTCTTCTTCTGCATCTTCCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAACCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAGCATCCAGGTGAGTGTAGACGAACCTCGCTTCTCCAATTTTGATCATCCACAGTCGTATTTCTCTAAGATGTTTCACGTCGCTTCGATTTTTGAAGATGCTGACGATTTGAGTGATTCTGATGAGAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCGTGGATCCGTGAGTGATTTTTGGGATTTTAATGCTCAATCTCGCGAACAGGAAAAACTCCATTGCTCTATACCCAAAAAAAGGTATGAAAATTCTTATGAATCTTCTGATACTGACAATGTCGGTCATGCTTGTAAATCGAGATATACTCGGGGTGGATCTTTGTTGGTTGTTGCTGAAACAAATCGTAGTTCTGGTGAATGGATGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTAAGATCGAATCTTACTGAACCTGACGATCTCGAATTCGATTGTGGTGATGAATCTTGCTTGAGTTCTAAAACTTCATCCAATAGCTCTGAGAATAATTGTGAAAGAATAAGTGAATTTGGTGATAATTGTTGTACAAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTTCATCATTGTCCCCATTTCAATGGCGCGAGAAATTTGGAAAGAAGTTGGTGAGAGAGAGAGGAGTTGGGAATGCTATTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAACTCTAGCTCTCTTCATTCTTCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTCCGGCGTTGCCATCAACGACGAGAAAGCACCGTAAAATGTCGTCGCTCAGTAACATTTCCTCTAAGTCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACGGTAGAGGGAGCTCTGAGGACCCTCTGATTGAACCAGAAAATTCATCCGAGTGCAATGGATCCGTGGTAAGTTCCCCAAATTTAGATAGGAATTTCGCAAGAATGCCGAAAGCTTTATCCCGGGGAAAATCCGTTAGAACAGTTAGAGCAAATGTAGTTACCATGGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAATGGAGCATGATAACAATACAGGGAAGTTTGAAGAAGGTGGAGAGTCACCATATATGAGAGAAGATGGAATGGGACATGGATGGGATGCTGTTGTTAACCCGAATGCTGGTAATTCGAATCGTTTGTCGAAGACGACATTCTTGGGAATTGAGGAGCAGAAGGAAGACACTGAGAGTTTGCTGACAGATGATGGTAAAGATAACTCTGATAAGGAGGATGAAACTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCGAGTATGGCGGGAGATTCAGAATCGGGGGCTTACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTTAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGCAGAAAAAAGATTGAGAGGAGGATGGGGGTCATTCAGCAGCACAGGCAGCAGCTATTTCAGTTGA
Protein sequence
MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDTDNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDGMGHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSYFS
Homology
BLAST of Tan0002794 vs. NCBI nr
Match:
ADN34231.1 (hypothetical protein [Cucumis melo subsp. melo] >TYK24724.1 DUF761 domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 907.5 bits (2344), Expect = 6.0e-260
Identity = 483/602 (80.23%), Postives = 523/602 (86.88%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASS S+P TKPHFPHSPLPPT + SC FLCKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1 MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
QTLLTKFWELFHLMFVGIAVSYGLFSRR++QVSV DEPRFSNF++PQSY SKM HVASI
Sbjct: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120
Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
FED DD S SDERKLSEVLYIQPN GSV FNA SR+QE H SIPKKRYENS E
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVR---GFNAISRQQENFHYSIPKKRYENSLEFD 180
Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
DT++VGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSLRSNLTEPD
Sbjct: 181 DTNSVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPD 240
Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+ +SPFQ RE F
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENF 300
Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
GK ++RERGV NA+LRPSHFRP SIDETQFESLK S SLHS+LSQSSQTSS SP+L STT
Sbjct: 301 GKNMMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTT 360
Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLIEPENSSECN S++SSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDR 420
Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
NFA +PKALSRGKSVRT+RAN +EEMKAQEMYRNQ+EHD+N G EGG SPYMREDG
Sbjct: 421 NFAHIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKFEGGMSPYMREDG 480
Query: 481 MGHGWDAVVNPNAGNSNRLSK-TTFLGIEEQKEDTESLLTDDG--KDNSDKEDETIFASS 540
GHGW + +PNAG SNR K TTF GIEEQKED ES LTDD +DNS++ED + F SS
Sbjct: 481 TGHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESS 540
Query: 541 DEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSY 597
DEEAASSMAG+SESGAYEVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SSY
Sbjct: 541 DEEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSY 599
BLAST of Tan0002794 vs. NCBI nr
Match:
XP_004140631.1 (uncharacterized protein LOC101220435 [Cucumis sativus] >KGN46495.1 hypothetical protein Csa_004883 [Cucumis sativus])
HSP 1 Score: 893.6 bits (2308), Expect = 8.9e-256
Identity = 477/603 (79.10%), Postives = 519/603 (86.07%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MA S S+P TKPHFPHSPLPPT + SC QF+CKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1 MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
QT LTKFWELFHLMF+GIAVSYGLFSRR++QVSV DEPRFSNF++PQSY SKMFHVASI
Sbjct: 61 QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120
Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
FED DD S SDERKLSEVLYIQPN GSVS NA SR+QE H SIPKKRYENS E +
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVS---GLNAISRQQENFHYSIPKKRYENSLEFA 180
Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
+TDNVGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSL+S+LTEPD
Sbjct: 181 ETDNVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPD 240
Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+S+SPFQ REKF
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKF 300
Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
K ++RER V NA+LRPSHFRP SIDETQFESLK S+SLHS+LSQSSQTSS S L S T
Sbjct: 301 EKNMMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRT 360
Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLI+PENSSECN SVVSSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDR 420
Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
NFA PKALSRGKSVRTVRA+ +EEMKAQEMYRNQ+EHD+N EGG SPYMRED
Sbjct: 421 NFANTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKFEGGMSPYMREDE 480
Query: 481 MGHGWDAVVNPNAGNSNRLSK----TTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFAS 540
GHGW + N NA SNR SK TTF GIEEQKEDTES +TDDGKDNS++ED++ F S
Sbjct: 481 TGHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFES 540
Query: 541 SDEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSS 597
SDEEAA SM GDSESGA+EVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SS
Sbjct: 541 SDEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTTSS 600
BLAST of Tan0002794 vs. NCBI nr
Match:
XP_023006022.1 (uncharacterized protein LOC111498900 [Cucurbita maxima])
HSP 1 Score: 871.3 bits (2250), Expect = 4.8e-249
Identity = 465/599 (77.63%), Postives = 512/599 (85.48%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASSASSP TK HFPHSPLP P+ H SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVD 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
QTL TKFWELFHLM VGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61 QTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120
Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
D DD S SDERKLSEVLYIQPN GS S D NAQSR+QEKL SIPKKRYENSYE +DT
Sbjct: 121 DVDDFSVSDERKLSEVLYIQPNLGSAS---DLNAQSRQQEKLRYSIPKKRYENSYEFADT 180
Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
DNV HACKSRYTRGGS++VV ETNRSS SG IVNYKPLGLPVRSL+S+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLKSSLTESDDVE 240
Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
FDCGDESCLSSK+S SSENNCE SEFGDNCC NLEEKFDE I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300
Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
++RERG GNA+LRPSHFRPPSIDETQFESLK S SLHS+LSQSSQTSS S +L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKH 360
Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP D NF
Sbjct: 361 HKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDMNFR 420
Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGESPYMREDGMG 480
+PKALS+GKS+R ++AN +E++KAQEM+R Q++HD+ G KFEEGG SPY+REDG G
Sbjct: 421 SIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTG 480
Query: 481 HGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEAA 540
HGW V NPNA N +R TTFLGI+EQKE+TESL+ DD KD+S+ EDE+ FASSDEEAA
Sbjct: 481 HGWPDVANPNASNMSRFPTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAA 540
Query: 541 SSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR--GGWGSFSSTGSSYFS 597
SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR GGWGSFSST SSYFS
Sbjct: 541 SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS 591
BLAST of Tan0002794 vs. NCBI nr
Match:
KAG6575261.1 (hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 866.3 bits (2237), Expect = 1.5e-247
Identity = 469/601 (78.04%), Postives = 511/601 (85.02%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASSASSP TK HFPHSPLP P+ SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
QTL TKFWELFHLMFVGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120
Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
D DD SDERK+SEVLYIQP GS S D NAQSR QEKL S+PKKRYENSYE +DT
Sbjct: 121 DVDDFGVSDERKVSEVLYIQPKLGSAS---DLNAQSRHQEKLRYSMPKKRYENSYEFADT 180
Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
DNV HACKSRYTRGGS++VV ETNRSS SG IVNYKPLGLPVRSLRS+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLRSSLTESDDVE 240
Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
FDCGDESCLSSK+S SSENNCE SEFGDNCC NLEEKFDE I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300
Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
++RERG GNA+LRPSHFRPPSIDETQFESL+ S SLHS LSQSSQTSS S L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLRKSGSLHSDLSQSSQTSSLSSQLSSTTRKH 360
Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP DRNFA
Sbjct: 361 SKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGES-PYMREDGM 480
+PKALS+GKSVR +RAN +E+MKAQEM+R Q++HD+ G KFEEGG S PYMREDG
Sbjct: 421 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 480
Query: 481 GHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEA 540
GHGW VVNPNAGN NR KTTFLGI+EQKE+TESL+ DD KD S+ EDE++FASSDEEA
Sbjct: 481 GHGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDGSEGEDESLFASSDEEA 540
Query: 541 ASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR---GGWGSFSSTGSSYF 597
SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR GGWGSFSST SSYF
Sbjct: 541 GSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGWGSFSSTSSSYF 589
BLAST of Tan0002794 vs. NCBI nr
Match:
KAG7013816.1 (hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 864.8 bits (2233), Expect = 4.5e-247
Identity = 469/601 (78.04%), Postives = 510/601 (84.86%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASSASSP TK HFPHSPLP P+ SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
QTL TKFWELFHLMFVGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120
Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
D DD SDERK+SEVLYIQP GS S D NAQSR QEKL S+PKKRYENSYE +DT
Sbjct: 121 DVDDFGVSDERKVSEVLYIQPKLGSAS---DLNAQSRHQEKLRYSMPKKRYENSYEFADT 180
Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
DNV HACKSRYTRGGS++VV ETNRSS SG IVNYKPLGLPVRSLRS+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLRSSLTESDDVE 240
Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
FDCGDESCLSSK+S SSENNCE SEFGDNCC NLEEKFDE I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300
Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
++RERG GNA+LRPSHFRPPSIDETQFESL+ S SLHS LSQSSQTSS S L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLRKSGSLHSDLSQSSQTSSLSSPLSSTTRKH 360
Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP DRNFA
Sbjct: 361 SKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGES-PYMREDGM 480
+PKALS+GKSVR +RAN +E+MKAQEM+R Q++HD+ G KFEEGG S PYMREDG
Sbjct: 421 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 480
Query: 481 GHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEA 540
GHGW VVNPNAGN NR KTTFLGI+EQKE+TESL+ DD KD S+ EDE+ FASSDEEA
Sbjct: 481 GHGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDGSEGEDESSFASSDEEA 540
Query: 541 ASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR---GGWGSFSSTGSSYF 597
SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR GGWGSFSST SSYF
Sbjct: 541 GSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGWGSFSSTSSSYF 589
BLAST of Tan0002794 vs. ExPASy TrEMBL
Match:
A0A5D3DMA5 (DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G002840 PE=4 SV=1)
HSP 1 Score: 907.5 bits (2344), Expect = 2.9e-260
Identity = 483/602 (80.23%), Postives = 523/602 (86.88%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASS S+P TKPHFPHSPLPPT + SC FLCKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1 MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
QTLLTKFWELFHLMFVGIAVSYGLFSRR++QVSV DEPRFSNF++PQSY SKM HVASI
Sbjct: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120
Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
FED DD S SDERKLSEVLYIQPN GSV FNA SR+QE H SIPKKRYENS E
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVR---GFNAISRQQENFHYSIPKKRYENSLEFD 180
Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
DT++VGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSLRSNLTEPD
Sbjct: 181 DTNSVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPD 240
Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+ +SPFQ RE F
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENF 300
Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
GK ++RERGV NA+LRPSHFRP SIDETQFESLK S SLHS+LSQSSQTSS SP+L STT
Sbjct: 301 GKNMMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTT 360
Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLIEPENSSECN S++SSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDR 420
Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
NFA +PKALSRGKSVRT+RAN +EEMKAQEMYRNQ+EHD+N G EGG SPYMREDG
Sbjct: 421 NFAHIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKFEGGMSPYMREDG 480
Query: 481 MGHGWDAVVNPNAGNSNRLSK-TTFLGIEEQKEDTESLLTDDG--KDNSDKEDETIFASS 540
GHGW + +PNAG SNR K TTF GIEEQKED ES LTDD +DNS++ED + F SS
Sbjct: 481 TGHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESS 540
Query: 541 DEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSY 597
DEEAASSMAG+SESGAYEVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SSY
Sbjct: 541 DEEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSY 599
BLAST of Tan0002794 vs. ExPASy TrEMBL
Match:
E5GCN2 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)
HSP 1 Score: 907.5 bits (2344), Expect = 2.9e-260
Identity = 483/602 (80.23%), Postives = 523/602 (86.88%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASS S+P TKPHFPHSPLPPT + SC FLCKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1 MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
QTLLTKFWELFHLMFVGIAVSYGLFSRR++QVSV DEPRFSNF++PQSY SKM HVASI
Sbjct: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120
Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
FED DD S SDERKLSEVLYIQPN GSV FNA SR+QE H SIPKKRYENS E
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVR---GFNAISRQQENFHYSIPKKRYENSLEFD 180
Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
DT++VGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSLRSNLTEPD
Sbjct: 181 DTNSVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPD 240
Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+ +SPFQ RE F
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENF 300
Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
GK ++RERGV NA+LRPSHFRP SIDETQFESLK S SLHS+LSQSSQTSS SP+L STT
Sbjct: 301 GKNMMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTT 360
Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLIEPENSSECN S++SSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDR 420
Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
NFA +PKALSRGKSVRT+RAN +EEMKAQEMYRNQ+EHD+N G EGG SPYMREDG
Sbjct: 421 NFAHIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKFEGGMSPYMREDG 480
Query: 481 MGHGWDAVVNPNAGNSNRLSK-TTFLGIEEQKEDTESLLTDDG--KDNSDKEDETIFASS 540
GHGW + +PNAG SNR K TTF GIEEQKED ES LTDD +DNS++ED + F SS
Sbjct: 481 TGHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESS 540
Query: 541 DEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSY 597
DEEAASSMAG+SESGAYEVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SSY
Sbjct: 541 DEEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSY 599
BLAST of Tan0002794 vs. ExPASy TrEMBL
Match:
A0A0A0K9X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G103540 PE=4 SV=1)
HSP 1 Score: 893.6 bits (2308), Expect = 4.3e-256
Identity = 477/603 (79.10%), Postives = 519/603 (86.07%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MA S S+P TKPHFPHSPLPPT + SC QF+CKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1 MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
QT LTKFWELFHLMF+GIAVSYGLFSRR++QVSV DEPRFSNF++PQSY SKMFHVASI
Sbjct: 61 QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120
Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
FED DD S SDERKLSEVLYIQPN GSVS NA SR+QE H SIPKKRYENS E +
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVS---GLNAISRQQENFHYSIPKKRYENSLEFA 180
Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
+TDNVGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSL+S+LTEPD
Sbjct: 181 ETDNVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPD 240
Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+S+SPFQ REKF
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKF 300
Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
K ++RER V NA+LRPSHFRP SIDETQFESLK S+SLHS+LSQSSQTSS S L S T
Sbjct: 301 EKNMMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRT 360
Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLI+PENSSECN SVVSSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDR 420
Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
NFA PKALSRGKSVRTVRA+ +EEMKAQEMYRNQ+EHD+N EGG SPYMRED
Sbjct: 421 NFANTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKFEGGMSPYMREDE 480
Query: 481 MGHGWDAVVNPNAGNSNRLSK----TTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFAS 540
GHGW + N NA SNR SK TTF GIEEQKEDTES +TDDGKDNS++ED++ F S
Sbjct: 481 TGHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFES 540
Query: 541 SDEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSS 597
SDEEAA SM GDSESGA+EVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SS
Sbjct: 541 SDEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTTSS 600
BLAST of Tan0002794 vs. ExPASy TrEMBL
Match:
A0A6J1KUS4 (uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900 PE=4 SV=1)
HSP 1 Score: 871.3 bits (2250), Expect = 2.3e-249
Identity = 465/599 (77.63%), Postives = 512/599 (85.48%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASSASSP TK HFPHSPLP P+ H SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVD 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
QTL TKFWELFHLM VGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61 QTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120
Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
D DD S SDERKLSEVLYIQPN GS S D NAQSR+QEKL SIPKKRYENSYE +DT
Sbjct: 121 DVDDFSVSDERKLSEVLYIQPNLGSAS---DLNAQSRQQEKLRYSIPKKRYENSYEFADT 180
Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
DNV HACKSRYTRGGS++VV ETNRSS SG IVNYKPLGLPVRSL+S+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLKSSLTESDDVE 240
Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
FDCGDESCLSSK+S SSENNCE SEFGDNCC NLEEKFDE I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300
Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
++RERG GNA+LRPSHFRPPSIDETQFESLK S SLHS+LSQSSQTSS S +L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKH 360
Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP D NF
Sbjct: 361 HKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDMNFR 420
Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGESPYMREDGMG 480
+PKALS+GKS+R ++AN +E++KAQEM+R Q++HD+ G KFEEGG SPY+REDG G
Sbjct: 421 SIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTG 480
Query: 481 HGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEAA 540
HGW V NPNA N +R TTFLGI+EQKE+TESL+ DD KD+S+ EDE+ FASSDEEAA
Sbjct: 481 HGWPDVANPNASNMSRFPTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAA 540
Query: 541 SSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR--GGWGSFSSTGSSYFS 597
SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR GGWGSFSST SSYFS
Sbjct: 541 SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS 591
BLAST of Tan0002794 vs. ExPASy TrEMBL
Match:
A0A6J1H4M0 (uncharacterized protein LOC111459998 OS=Cucurbita moschata OX=3662 GN=LOC111459998 PE=4 SV=1)
HSP 1 Score: 859.0 bits (2218), Expect = 1.2e-245
Identity = 468/604 (77.48%), Postives = 510/604 (84.44%), Query Frame = 0
Query: 1 MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
MASSASSP TK HFPHSPLP P+ SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+
Sbjct: 1 MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 60
Query: 61 QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
QTL TKFWELFHLMFVGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61 QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120
Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
D DD SDERK+SEVLYIQP GS S D NAQSR QEKL S+PKKRYENSYE +DT
Sbjct: 121 DVDDFGVSDERKVSEVLYIQPKLGSAS---DLNAQSRHQEKLRYSMPKKRYENSYEFADT 180
Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
DNV HACKSRYTRGGS++VV ETNRSS SG IVNYKPLGLPVRSLRS+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLRSSLTESDDVE 240
Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
FDCGDESCLSSK+S SSENNCE SEFGDNCC NLEEKFDE I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300
Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
++RERG GNA+LRPSHFRPPSIDETQFESLK S SLHS LSQSSQTSS S L STTRK
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKP 360
Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
RKMSSLSNIS KSLHSRQYS SSLSEN RGSSEDPLIE ENSSECN SVVSSP DRNFA
Sbjct: 361 RKMSSLSNISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420
Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGES-PYMREDGM 480
+PKALS+GKSVR +RAN +E+MKAQEM+R Q++HD+ G KFEEGG S PYMREDG
Sbjct: 421 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 480
Query: 481 GHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEA 540
G GW VVNPNAGN NR KTTFLGI+EQKE+TESL+ DD KD+S+ EDE++FASSDEEA
Sbjct: 481 GQGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEA 540
Query: 541 ASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR------GGWGSFSSTGS 597
SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR GGWGSFSST S
Sbjct: 541 GSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSS 592
BLAST of Tan0002794 vs. TAIR 10
Match:
AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink). )
HSP 1 Score: 142.9 bits (359), Expect = 8.3e-34
Identity = 133/402 (33.08%), Postives = 200/402 (49.75%), Query Frame = 0
Query: 4 SASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTL 63
++ +P TK P + + P ++ F CKS+ F +FLL LPLFPS+APDFV +T+
Sbjct: 2 ASPNPYTKRRSPPNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETV 61
Query: 64 LTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFEDAD 123
LTKFWEL HL+FVGIAV+YGLFSRR+++ +VD + SY S++F V+S+F++
Sbjct: 62 LTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRIFQVSSVFDEEF 121
Query: 124 DLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDTDNV 183
D + + + + V F +S E E S E +T+ V
Sbjct: 122 DDNSCEFVDVRSDESVSARASVVGKSESFVVESGE------------LEESSEFGETNEV 181
Query: 184 GHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLEFDC 243
A S+Y +G S +VVA + G +V ++PLGLP+R LRS+L
Sbjct: 182 -RAWNSQYFQGKSKVVVARP-----AYGLDGHVV-HQPLGLPIRRLRSSLR--------- 241
Query: 244 GDESCLSSKTSSNSSEN--NCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKKL 303
D + L K+ ++S + N E S DN FDE + + SP W+ +
Sbjct: 242 -DNAALQDKSFADSCDGAVNAEAESLLADNF-------FDEVLAAPASPVPWQAR----- 301
Query: 304 VRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKHR 363
G+G+ PS+F+P S+DET LK+ SS S+ S SSQTS S +
Sbjct: 302 PEMMGIGDNY--PSNFQPISVDET----LKSISS-RSTGSSSSQTSYAS-------QNQN 348
Query: 364 KMSSLSNISSKSLHSR-QYSMSSLSENGRGSSEDPLIEPENS 403
+ S ++S++SL+S + + S S P + P S
Sbjct: 362 RFSPSRSVSAESLNSNVEELVKEKSRQSSSRSSSPSLPPSPS 348
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
ADN34231.1 | 6.0e-260 | 80.23 | hypothetical protein [Cucumis melo subsp. melo] >TYK24724.1 DUF761 domain-contai... | [more] |
XP_004140631.1 | 8.9e-256 | 79.10 | uncharacterized protein LOC101220435 [Cucumis sativus] >KGN46495.1 hypothetical ... | [more] |
XP_023006022.1 | 4.8e-249 | 77.63 | uncharacterized protein LOC111498900 [Cucurbita maxima] | [more] |
KAG6575261.1 | 1.5e-247 | 78.04 | hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7013816.1 | 4.5e-247 | 78.04 | hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
A0A5D3DMA5 | 2.9e-260 | 80.23 | DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676... | [more] |
E5GCN2 | 2.9e-260 | 80.23 | Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1 | [more] |
A0A0A0K9X1 | 4.3e-256 | 79.10 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G103540 PE=4 SV=1 | [more] |
A0A6J1KUS4 | 2.3e-249 | 77.63 | uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900... | [more] |
A0A6J1H4M0 | 1.2e-245 | 77.48 | uncharacterized protein LOC111459998 OS=Cucurbita moschata OX=3662 GN=LOC1114599... | [more] |
Match Name | E-value | Identity | Description | |
AT3G60380.1 | 8.3e-34 | 33.08 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |