Tan0002794 (gene) Snake gourd v1

Overview
NameTan0002794
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF761 domain-containing protein
LocationLG06: 40382301 .. 40384266 (-)
RNA-Seq ExpressionTan0002794
SyntenyTan0002794
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTATTGCGAGCACAAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCACCAACAACATTTTCTCTCAAATCTCAAAATCCCGATTCCCAAGCTCCACAGCAACCTCATCAATGGCGTCTTCAGCTTCCAGCCCTTTGACCAAACCCCATTTTCCTCATTCTCCACTTCCACCAACTCCTTCCGCTCACCAACACAAGTCCTGTGCACAATTTCTATGTAAATCCCTCTTCTTCTGCATCTTCCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAACCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAGCATCCAGGTGAGTGTAGACGAACCTCGCTTCTCCAATTTTGATCATCCACAGTCGTATTTCTCTAAGATGTTTCACGTCGCTTCGATTTTTGAAGATGCTGACGATTTGAGTGATTCTGATGAGAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCGTGGATCCGTGAGTGATTTTTGGGATTTTAATGCTCAATCTCGCGAACAGGAAAAACTCCATTGCTCTATACCCAAAAAAAGGTATGAAAATTCTTATGAATCTTCTGATACTGACAATGTCGGTCATGCTTGTAAATCGAGATATACTCGGGGTGGATCTTTGTTGGTTGTTGCTGAAACAAATCGTAGTTCTGGTGAATGGATGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTAAGATCGAATCTTACTGAACCTGACGATCTCGAATTCGATTGTGGTGATGAATCTTGCTTGAGTTCTAAAACTTCATCCAATAGCTCTGAGAATAATTGTGAAAGAATAAGTGAATTTGGTGATAATTGTTGTACAAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTTCATCATTGTCCCCATTTCAATGGCGCGAGAAATTTGGAAAGAAGTTGGTGAGAGAGAGAGGAGTTGGGAATGCTATTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAACTCTAGCTCTCTTCATTCTTCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTCCGGCGTTGCCATCAACGACGAGAAAGCACCGTAAAATGTCGTCGCTCAGTAACATTTCCTCTAAGTCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACGGTAGAGGGAGCTCTGAGGACCCTCTGATTGAACCAGAAAATTCATCCGAGTGCAATGGATCCGTGGTAAGTTCCCCAAATTTAGATAGGAATTTCGCAAGAATGCCGAAAGCTTTATCCCGGGGAAAATCCGTTAGAACAGTTAGAGCAAATGTAGTTACCATGGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAATGGAGCATGATAACAATACAGGGAAGTTTGAAGAAGGTGGAGAGTCACCATATATGAGAGAAGATGGAATGGGACATGGATGGGATGCTGTTGTTAACCCGAATGCTGGTAATTCGAATCGTTTGTCGAAGACGACATTCTTGGGAATTGAGGAGCAGAAGGAAGACACTGAGAGTTTGCTGACAGATGATGGTAAAGATAACTCTGATAAGGAGGATGAAACTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCGAGTATGGCGGGAGATTCAGAATCGGGGGCTTACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTTAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGCAGAAAAAAGATTGAGAGGAGGATGGGGGTCATTCAGCAGCACAGGCAGCAGCTATTTCAGTTGATTGGCTAAGGCTTCAATCTTAAAAAGGCCTTCTTCTAAGCCCTTTACACATTTAAGGTTAAAT

mRNA sequence

GTTTATTGCGAGCACAAACTTTACCTTGTCTTCAAGTCTTCCTTTCTTCACCAACAACATTTTCTCTCAAATCTCAAAATCCCGATTCCCAAGCTCCACAGCAACCTCATCAATGGCGTCTTCAGCTTCCAGCCCTTTGACCAAACCCCATTTTCCTCATTCTCCACTTCCACCAACTCCTTCCGCTCACCAACACAAGTCCTGTGCACAATTTCTATGTAAATCCCTCTTCTTCTGCATCTTCCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAACCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAGCATCCAGGTGAGTGTAGACGAACCTCGCTTCTCCAATTTTGATCATCCACAGTCGTATTTCTCTAAGATGTTTCACGTCGCTTCGATTTTTGAAGATGCTGACGATTTGAGTGATTCTGATGAGAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCGTGGATCCGTGAGTGATTTTTGGGATTTTAATGCTCAATCTCGCGAACAGGAAAAACTCCATTGCTCTATACCCAAAAAAAGGTATGAAAATTCTTATGAATCTTCTGATACTGACAATGTCGGTCATGCTTGTAAATCGAGATATACTCGGGGTGGATCTTTGTTGGTTGTTGCTGAAACAAATCGTAGTTCTGGTGAATGGATGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTAAGATCGAATCTTACTGAACCTGACGATCTCGAATTCGATTGTGGTGATGAATCTTGCTTGAGTTCTAAAACTTCATCCAATAGCTCTGAGAATAATTGTGAAAGAATAAGTGAATTTGGTGATAATTGTTGTACAAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTTCATCATTGTCCCCATTTCAATGGCGCGAGAAATTTGGAAAGAAGTTGGTGAGAGAGAGAGGAGTTGGGAATGCTATTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAACTCTAGCTCTCTTCATTCTTCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTCCGGCGTTGCCATCAACGACGAGAAAGCACCGTAAAATGTCGTCGCTCAGTAACATTTCCTCTAAGTCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACGGTAGAGGGAGCTCTGAGGACCCTCTGATTGAACCAGAAAATTCATCCGAGTGCAATGGATCCGTGGTAAGTTCCCCAAATTTAGATAGGAATTTCGCAAGAATGCCGAAAGCTTTATCCCGGGGAAAATCCGTTAGAACAGTTAGAGCAAATGTAGTTACCATGGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAATGGAGCATGATAACAATACAGGGAAGTTTGAAGAAGGTGGAGAGTCACCATATATGAGAGAAGATGGAATGGGACATGGATGGGATGCTGTTGTTAACCCGAATGCTGGTAATTCGAATCGTTTGTCGAAGACGACATTCTTGGGAATTGAGGAGCAGAAGGAAGACACTGAGAGTTTGCTGACAGATGATGGTAAAGATAACTCTGATAAGGAGGATGAAACTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCGAGTATGGCGGGAGATTCAGAATCGGGGGCTTACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTTAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGCAGAAAAAAGATTGAGAGGAGGATGGGGGTCATTCAGCAGCACAGGCAGCAGCTATTTCAGTTGATTGGCTAAGGCTTCAATCTTAAAAAGGCCTTCTTCTAAGCCCTTTACACATTTAAGGTTAAAT

Coding sequence (CDS)

ATGGCGTCTTCAGCTTCCAGCCCTTTGACCAAACCCCATTTTCCTCATTCTCCACTTCCACCAACTCCTTCCGCTCACCAACACAAGTCCTGTGCACAATTTCTATGTAAATCCCTCTTCTTCTGCATCTTCCTCCTCCTCCTCCCTCTCTTCCCTTCCGAAGCTCCAGATTTCGTCAACCAGACTTTGCTCACCAAATTCTGGGAGCTTTTTCACCTCATGTTCGTCGGCATTGCTGTTTCCTATGGTCTCTTTAGCAGAAGGAGCATCCAGGTGAGTGTAGACGAACCTCGCTTCTCCAATTTTGATCATCCACAGTCGTATTTCTCTAAGATGTTTCACGTCGCTTCGATTTTTGAAGATGCTGACGATTTGAGTGATTCTGATGAGAGGAAATTGAGTGAAGTTCTGTACATTCAGCCGAATCGTGGATCCGTGAGTGATTTTTGGGATTTTAATGCTCAATCTCGCGAACAGGAAAAACTCCATTGCTCTATACCCAAAAAAAGGTATGAAAATTCTTATGAATCTTCTGATACTGACAATGTCGGTCATGCTTGTAAATCGAGATATACTCGGGGTGGATCTTTGTTGGTTGTTGCTGAAACAAATCGTAGTTCTGGTGAATGGATGGAATCAGGAGCCATTGTAAATTATAAACCTCTAGGTTTGCCTGTTAGGAGTCTAAGATCGAATCTTACTGAACCTGACGATCTCGAATTCGATTGTGGTGATGAATCTTGCTTGAGTTCTAAAACTTCATCCAATAGCTCTGAGAATAATTGTGAAAGAATAAGTGAATTTGGTGATAATTGTTGTACAAATTTGGAGGAGAAGTTTGATGAAGCTGTTATTTCATCATTGTCCCCATTTCAATGGCGCGAGAAATTTGGAAAGAAGTTGGTGAGAGAGAGAGGAGTTGGGAATGCTATTCTTCGCCCTTCCCATTTTAGACCTCCCTCCATTGATGAAACTCAATTTGAATCACTGAAAAACTCTAGCTCTCTTCATTCTTCTCTATCTCAGTCATCACAAACTAGTTCCTTCTCTCCGGCGTTGCCATCAACGACGAGAAAGCACCGTAAAATGTCGTCGCTCAGTAACATTTCCTCTAAGTCATTGCATTCTCGACAATACAGTATGAGTTCTCTGTCTGAAAACGGTAGAGGGAGCTCTGAGGACCCTCTGATTGAACCAGAAAATTCATCCGAGTGCAATGGATCCGTGGTAAGTTCCCCAAATTTAGATAGGAATTTCGCAAGAATGCCGAAAGCTTTATCCCGGGGAAAATCCGTTAGAACAGTTAGAGCAAATGTAGTTACCATGGAGGAAATGAAAGCTCAAGAGATGTATAGAAACCAAATGGAGCATGATAACAATACAGGGAAGTTTGAAGAAGGTGGAGAGTCACCATATATGAGAGAAGATGGAATGGGACATGGATGGGATGCTGTTGTTAACCCGAATGCTGGTAATTCGAATCGTTTGTCGAAGACGACATTCTTGGGAATTGAGGAGCAGAAGGAAGACACTGAGAGTTTGCTGACAGATGATGGTAAAGATAACTCTGATAAGGAGGATGAAACTATTTTTGCAAGTTCAGATGAAGAAGCTGCTTCGAGTATGGCGGGAGATTCAGAATCGGGGGCTTACGAGGTCGACAAGAAAGCTGGCGAGTTCATAGCCAAGTTTAGGGAGCAAATACAGCTTCAGAGGATGGCTTCAGCAGAAAAAAGATTGAGAGGAGGATGGGGGTCATTCAGCAGCACAGGCAGCAGCTATTTCAGTTGA

Protein sequence

MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDTDNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDGMGHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSYFS
Homology
BLAST of Tan0002794 vs. NCBI nr
Match: ADN34231.1 (hypothetical protein [Cucumis melo subsp. melo] >TYK24724.1 DUF761 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 907.5 bits (2344), Expect = 6.0e-260
Identity = 483/602 (80.23%), Postives = 523/602 (86.88%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASS S+P TKPHFPHSPLPPT +     SC  FLCKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1   MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
           QTLLTKFWELFHLMFVGIAVSYGLFSRR++QVSV  DEPRFSNF++PQSY SKM HVASI
Sbjct: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120

Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
           FED DD S SDERKLSEVLYIQPN GSV     FNA SR+QE  H SIPKKRYENS E  
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVR---GFNAISRQQENFHYSIPKKRYENSLEFD 180

Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
           DT++VGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSLRSNLTEPD
Sbjct: 181 DTNSVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPD 240

Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
           D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+ +SPFQ RE F
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENF 300

Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
           GK ++RERGV NA+LRPSHFRP SIDETQFESLK S SLHS+LSQSSQTSS SP+L STT
Sbjct: 301 GKNMMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTT 360

Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
           RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLIEPENSSECN S++SSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDR 420

Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
           NFA +PKALSRGKSVRT+RAN   +EEMKAQEMYRNQ+EHD+N G   EGG SPYMREDG
Sbjct: 421 NFAHIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKFEGGMSPYMREDG 480

Query: 481 MGHGWDAVVNPNAGNSNRLSK-TTFLGIEEQKEDTESLLTDDG--KDNSDKEDETIFASS 540
            GHGW  + +PNAG SNR  K TTF GIEEQKED ES LTDD   +DNS++ED + F SS
Sbjct: 481 TGHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESS 540

Query: 541 DEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSY 597
           DEEAASSMAG+SESGAYEVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SSY
Sbjct: 541 DEEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSY 599

BLAST of Tan0002794 vs. NCBI nr
Match: XP_004140631.1 (uncharacterized protein LOC101220435 [Cucumis sativus] >KGN46495.1 hypothetical protein Csa_004883 [Cucumis sativus])

HSP 1 Score: 893.6 bits (2308), Expect = 8.9e-256
Identity = 477/603 (79.10%), Postives = 519/603 (86.07%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MA S S+P TKPHFPHSPLPPT +     SC QF+CKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1   MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
           QT LTKFWELFHLMF+GIAVSYGLFSRR++QVSV  DEPRFSNF++PQSY SKMFHVASI
Sbjct: 61  QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120

Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
           FED DD S SDERKLSEVLYIQPN GSVS     NA SR+QE  H SIPKKRYENS E +
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVS---GLNAISRQQENFHYSIPKKRYENSLEFA 180

Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
           +TDNVGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSL+S+LTEPD
Sbjct: 181 ETDNVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPD 240

Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
           D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+S+SPFQ REKF
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKF 300

Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
            K ++RER V NA+LRPSHFRP SIDETQFESLK S+SLHS+LSQSSQTSS S  L S T
Sbjct: 301 EKNMMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRT 360

Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
           RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLI+PENSSECN SVVSSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDR 420

Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
           NFA  PKALSRGKSVRTVRA+   +EEMKAQEMYRNQ+EHD+N     EGG SPYMRED 
Sbjct: 421 NFANTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKFEGGMSPYMREDE 480

Query: 481 MGHGWDAVVNPNAGNSNRLSK----TTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFAS 540
            GHGW  + N NA  SNR SK    TTF GIEEQKEDTES +TDDGKDNS++ED++ F S
Sbjct: 481 TGHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFES 540

Query: 541 SDEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSS 597
           SDEEAA SM GDSESGA+EVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SS
Sbjct: 541 SDEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTTSS 600

BLAST of Tan0002794 vs. NCBI nr
Match: XP_023006022.1 (uncharacterized protein LOC111498900 [Cucurbita maxima])

HSP 1 Score: 871.3 bits (2250), Expect = 4.8e-249
Identity = 465/599 (77.63%), Postives = 512/599 (85.48%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASSASSP TK HFPHSPLP  P+ H   SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+
Sbjct: 1   MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVD 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
           QTL TKFWELFHLM VGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61  QTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120

Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
           D DD S SDERKLSEVLYIQPN GS S   D NAQSR+QEKL  SIPKKRYENSYE +DT
Sbjct: 121 DVDDFSVSDERKLSEVLYIQPNLGSAS---DLNAQSRQQEKLRYSIPKKRYENSYEFADT 180

Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
           DNV HACKSRYTRGGS++VV ETNRSS     SG IVNYKPLGLPVRSL+S+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLKSSLTESDDVE 240

Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
           FDCGDESCLSSK+S  SSENNCE  SEFGDNCC NLEEKFDE  I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300

Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
           ++RERG GNA+LRPSHFRPPSIDETQFESLK S SLHS+LSQSSQTSS S +L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKH 360

Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
            KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP  D NF 
Sbjct: 361 HKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDMNFR 420

Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGESPYMREDGMG 480
            +PKALS+GKS+R ++AN   +E++KAQEM+R Q++HD+  G KFEEGG SPY+REDG G
Sbjct: 421 SIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTG 480

Query: 481 HGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEAA 540
           HGW  V NPNA N +R   TTFLGI+EQKE+TESL+ DD KD+S+ EDE+ FASSDEEAA
Sbjct: 481 HGWPDVANPNASNMSRFPTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAA 540

Query: 541 SSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR--GGWGSFSSTGSSYFS 597
           SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR  GGWGSFSST SSYFS
Sbjct: 541 SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS 591

BLAST of Tan0002794 vs. NCBI nr
Match: KAG6575261.1 (hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 866.3 bits (2237), Expect = 1.5e-247
Identity = 469/601 (78.04%), Postives = 511/601 (85.02%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASSASSP TK HFPHSPLP  P+     SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+
Sbjct: 1   MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
           QTL TKFWELFHLMFVGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61  QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120

Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
           D DD   SDERK+SEVLYIQP  GS S   D NAQSR QEKL  S+PKKRYENSYE +DT
Sbjct: 121 DVDDFGVSDERKVSEVLYIQPKLGSAS---DLNAQSRHQEKLRYSMPKKRYENSYEFADT 180

Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
           DNV HACKSRYTRGGS++VV ETNRSS     SG IVNYKPLGLPVRSLRS+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLRSSLTESDDVE 240

Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
           FDCGDESCLSSK+S  SSENNCE  SEFGDNCC NLEEKFDE  I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300

Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
           ++RERG GNA+LRPSHFRPPSIDETQFESL+ S SLHS LSQSSQTSS S  L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLRKSGSLHSDLSQSSQTSSLSSQLSSTTRKH 360

Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
            KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP  DRNFA
Sbjct: 361 SKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420

Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGES-PYMREDGM 480
            +PKALS+GKSVR +RAN   +E+MKAQEM+R Q++HD+  G KFEEGG S PYMREDG 
Sbjct: 421 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 480

Query: 481 GHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEA 540
           GHGW  VVNPNAGN NR  KTTFLGI+EQKE+TESL+ DD KD S+ EDE++FASSDEEA
Sbjct: 481 GHGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDGSEGEDESLFASSDEEA 540

Query: 541 ASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR---GGWGSFSSTGSSYF 597
            SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR   GGWGSFSST SSYF
Sbjct: 541 GSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGWGSFSSTSSSYF 589

BLAST of Tan0002794 vs. NCBI nr
Match: KAG7013816.1 (hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 864.8 bits (2233), Expect = 4.5e-247
Identity = 469/601 (78.04%), Postives = 510/601 (84.86%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASSASSP TK HFPHSPLP  P+     SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+
Sbjct: 1   MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
           QTL TKFWELFHLMFVGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61  QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120

Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
           D DD   SDERK+SEVLYIQP  GS S   D NAQSR QEKL  S+PKKRYENSYE +DT
Sbjct: 121 DVDDFGVSDERKVSEVLYIQPKLGSAS---DLNAQSRHQEKLRYSMPKKRYENSYEFADT 180

Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
           DNV HACKSRYTRGGS++VV ETNRSS     SG IVNYKPLGLPVRSLRS+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLRSSLTESDDVE 240

Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
           FDCGDESCLSSK+S  SSENNCE  SEFGDNCC NLEEKFDE  I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300

Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
           ++RERG GNA+LRPSHFRPPSIDETQFESL+ S SLHS LSQSSQTSS S  L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLRKSGSLHSDLSQSSQTSSLSSPLSSTTRKH 360

Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
            KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP  DRNFA
Sbjct: 361 SKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420

Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGES-PYMREDGM 480
            +PKALS+GKSVR +RAN   +E+MKAQEM+R Q++HD+  G KFEEGG S PYMREDG 
Sbjct: 421 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 480

Query: 481 GHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEA 540
           GHGW  VVNPNAGN NR  KTTFLGI+EQKE+TESL+ DD KD S+ EDE+ FASSDEEA
Sbjct: 481 GHGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDGSEGEDESSFASSDEEA 540

Query: 541 ASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR---GGWGSFSSTGSSYF 597
            SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR   GGWGSFSST SSYF
Sbjct: 541 GSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGWGSFSSTSSSYF 589

BLAST of Tan0002794 vs. ExPASy TrEMBL
Match: A0A5D3DMA5 (DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G002840 PE=4 SV=1)

HSP 1 Score: 907.5 bits (2344), Expect = 2.9e-260
Identity = 483/602 (80.23%), Postives = 523/602 (86.88%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASS S+P TKPHFPHSPLPPT +     SC  FLCKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1   MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
           QTLLTKFWELFHLMFVGIAVSYGLFSRR++QVSV  DEPRFSNF++PQSY SKM HVASI
Sbjct: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120

Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
           FED DD S SDERKLSEVLYIQPN GSV     FNA SR+QE  H SIPKKRYENS E  
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVR---GFNAISRQQENFHYSIPKKRYENSLEFD 180

Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
           DT++VGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSLRSNLTEPD
Sbjct: 181 DTNSVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPD 240

Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
           D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+ +SPFQ RE F
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENF 300

Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
           GK ++RERGV NA+LRPSHFRP SIDETQFESLK S SLHS+LSQSSQTSS SP+L STT
Sbjct: 301 GKNMMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTT 360

Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
           RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLIEPENSSECN S++SSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDR 420

Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
           NFA +PKALSRGKSVRT+RAN   +EEMKAQEMYRNQ+EHD+N G   EGG SPYMREDG
Sbjct: 421 NFAHIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKFEGGMSPYMREDG 480

Query: 481 MGHGWDAVVNPNAGNSNRLSK-TTFLGIEEQKEDTESLLTDDG--KDNSDKEDETIFASS 540
            GHGW  + +PNAG SNR  K TTF GIEEQKED ES LTDD   +DNS++ED + F SS
Sbjct: 481 TGHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESS 540

Query: 541 DEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSY 597
           DEEAASSMAG+SESGAYEVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SSY
Sbjct: 541 DEEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSY 599

BLAST of Tan0002794 vs. ExPASy TrEMBL
Match: E5GCN2 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 907.5 bits (2344), Expect = 2.9e-260
Identity = 483/602 (80.23%), Postives = 523/602 (86.88%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASS S+P TKPHFPHSPLPPT +     SC  FLCKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1   MASSPSTPFTKPHFPHSPLPPTSTTRHSNSCTHFLCKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
           QTLLTKFWELFHLMFVGIAVSYGLFSRR++QVSV  DEPRFSNF++PQSY SKM HVASI
Sbjct: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMLHVASI 120

Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
           FED DD S SDERKLSEVLYIQPN GSV     FNA SR+QE  H SIPKKRYENS E  
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVR---GFNAISRQQENFHYSIPKKRYENSLEFD 180

Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
           DT++VGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSLRSNLTEPD
Sbjct: 181 DTNSVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLRSNLTEPD 240

Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
           D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+ +SPFQ RE F
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIAKMSPFQLRENF 300

Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
           GK ++RERGV NA+LRPSHFRP SIDETQFESLK S SLHS+LSQSSQTSS SP+L STT
Sbjct: 301 GKNMMRERGVKNAVLRPSHFRPSSIDETQFESLKKSRSLHSNLSQSSQTSSLSPSLSSTT 360

Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
           RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLIEPENSSECN S++SSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIEPENSSECNESIISSPRLDR 420

Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
           NFA +PKALSRGKSVRT+RAN   +EEMKAQEMYRNQ+EHD+N G   EGG SPYMREDG
Sbjct: 421 NFAHIPKALSRGKSVRTIRANTSAIEEMKAQEMYRNQVEHDDNVGNKFEGGMSPYMREDG 480

Query: 481 MGHGWDAVVNPNAGNSNRLSK-TTFLGIEEQKEDTESLLTDDG--KDNSDKEDETIFASS 540
            GHGW  + +PNAG SNR  K TTF GIEEQKED ES LTDD   +DNS++ED + F SS
Sbjct: 481 TGHGWPGINSPNAGYSNRHPKTTTFSGIEEQKEDIESQLTDDDGKEDNSEREDVSFFESS 540

Query: 541 DEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSSY 597
           DEEAASSMAG+SESGAYEVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SSY
Sbjct: 541 DEEAASSMAGESESGAYEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTSSSY 599

BLAST of Tan0002794 vs. ExPASy TrEMBL
Match: A0A0A0K9X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G103540 PE=4 SV=1)

HSP 1 Score: 893.6 bits (2308), Expect = 4.3e-256
Identity = 477/603 (79.10%), Postives = 519/603 (86.07%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MA S S+P TKPHFPHSPLPPT +     SC QF+CKSLFFCIFLLLLPLFPSEAP+FVN
Sbjct: 1   MAPSPSTPFTKPHFPHSPLPPTSTTRHSNSCTQFICKSLFFCIFLLLLPLFPSEAPEFVN 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSV--DEPRFSNFDHPQSYFSKMFHVASI 120
           QT LTKFWELFHLMF+GIAVSYGLFSRR++QVSV  DEPRFSNF++PQSY SKMFHVASI
Sbjct: 61  QTFLTKFWELFHLMFIGIAVSYGLFSRRNVQVSVDSDEPRFSNFENPQSYLSKMFHVASI 120

Query: 121 FEDADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESS 180
           FED DD S SDERKLSEVLYIQPN GSVS     NA SR+QE  H SIPKKRYENS E +
Sbjct: 121 FEDVDDFSVSDERKLSEVLYIQPNLGSVS---GLNAISRQQENFHYSIPKKRYENSLEFA 180

Query: 181 DTDNVGHACKSRYTRGGSLLVVAETNRS-SGEWMESGAIVNYKPLGLPVRSLRSNLTEPD 240
           +TDNVGHACKSRYTRGGS++VVAETNRS SGEW+ESGAIVNYKPLGLPVRSL+S+LTEPD
Sbjct: 181 ETDNVGHACKSRYTRGGSVVVVAETNRSNSGEWLESGAIVNYKPLGLPVRSLKSSLTEPD 240

Query: 241 DLEFDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKF 300
           D+EFDCGDESCLSSK+SS +SE+NCER SEFGDNCC NLEEKFDE VI+S+SPFQ REKF
Sbjct: 241 DVEFDCGDESCLSSKSSSKNSESNCERTSEFGDNCCVNLEEKFDETVIASMSPFQLREKF 300

Query: 301 GKKLVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTT 360
            K ++RER V NA+LRPSHFRP SIDETQFESLK S+SLHS+LSQSSQTSS S  L S T
Sbjct: 301 EKNMMRERRVKNAVLRPSHFRPSSIDETQFESLKKSTSLHSNLSQSSQTSSLSSPLSSRT 360

Query: 361 RKHRKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDR 420
           RKHRKMSSL NIS KS HSRQYS+SSLSEN RGSSEDPLI+PENSSECN SVVSSP LDR
Sbjct: 361 RKHRKMSSLGNISYKSSHSRQYSLSSLSENSRGSSEDPLIDPENSSECNESVVSSPRLDR 420

Query: 421 NFARMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTGKFEEGGESPYMREDG 480
           NFA  PKALSRGKSVRTVRA+   +EEMKAQEMYRNQ+EHD+N     EGG SPYMRED 
Sbjct: 421 NFANTPKALSRGKSVRTVRASTSAIEEMKAQEMYRNQVEHDDNVENKFEGGMSPYMREDE 480

Query: 481 MGHGWDAVVNPNAGNSNRLSK----TTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFAS 540
            GHGW  + N NA  SNR SK    TTF GIEEQKEDTES +TDDGKDNS++ED++ F S
Sbjct: 481 TGHGWPGINNLNAAYSNRYSKTTATTTFSGIEEQKEDTESQVTDDGKDNSEREDDSFFES 540

Query: 541 SDEEAASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLRGGWGSFSSTGSS 597
           SDEEAA SM GDSESGA+EVDKKAGEFIAKFREQIQLQRMAS +KRLRGGWGSFSST SS
Sbjct: 541 SDEEAALSMTGDSESGAHEVDKKAGEFIAKFREQIQLQRMASVDKRLRGGWGSFSSTTSS 600

BLAST of Tan0002794 vs. ExPASy TrEMBL
Match: A0A6J1KUS4 (uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900 PE=4 SV=1)

HSP 1 Score: 871.3 bits (2250), Expect = 2.3e-249
Identity = 465/599 (77.63%), Postives = 512/599 (85.48%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASSASSP TK HFPHSPLP  P+ H   SCAQFLCKSLFFC FLLLLPLFPSEAPDFV+
Sbjct: 1   MASSASSPFTKLHFPHSPLPQPPATHHSNSCAQFLCKSLFFCFFLLLLPLFPSEAPDFVD 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
           QTL TKFWELFHLM VGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61  QTLFTKFWELFHLMLVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120

Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
           D DD S SDERKLSEVLYIQPN GS S   D NAQSR+QEKL  SIPKKRYENSYE +DT
Sbjct: 121 DVDDFSVSDERKLSEVLYIQPNLGSAS---DLNAQSRQQEKLRYSIPKKRYENSYEFADT 180

Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
           DNV HACKSRYTRGGS++VV ETNRSS     SG IVNYKPLGLPVRSL+S+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLKSSLTESDDVE 240

Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
           FDCGDESCLSSK+S  SSENNCE  SEFGDNCC NLEEKFDE  I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300

Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
           ++RERG GNA+LRPSHFRPPSIDETQFESLK S SLHS+LSQSSQTSS S +L STTRKH
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSNLSQSSQTSSLSSSLSSTTRKH 360

Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
            KMSSLSNIS KSLHSRQYSMSSLSEN RGSSEDPLIE ENSSECN SVVSSP  D NF 
Sbjct: 361 HKMSSLSNISYKSLHSRQYSMSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDMNFR 420

Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGESPYMREDGMG 480
            +PKALS+GKS+R ++AN   +E++KAQEM+R Q++HD+  G KFEEGG SPY+REDG G
Sbjct: 421 SIPKALSQGKSIRRIQANAAAIEDIKAQEMHRKQVKHDDIIGNKFEEGGTSPYIREDGTG 480

Query: 481 HGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEAA 540
           HGW  V NPNA N +R   TTFLGI+EQKE+TESL+ DD KD+S+ EDE+ FASSDEEAA
Sbjct: 481 HGWPDVANPNASNMSRFPTTTFLGIKEQKEETESLVADDSKDDSEGEDESFFASSDEEAA 540

Query: 541 SSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR--GGWGSFSSTGSSYFS 597
           SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR  GGWGSFSST SSYFS
Sbjct: 541 SSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGWGSFSSTSSSYFS 591

BLAST of Tan0002794 vs. ExPASy TrEMBL
Match: A0A6J1H4M0 (uncharacterized protein LOC111459998 OS=Cucurbita moschata OX=3662 GN=LOC111459998 PE=4 SV=1)

HSP 1 Score: 859.0 bits (2218), Expect = 1.2e-245
Identity = 468/604 (77.48%), Postives = 510/604 (84.44%), Query Frame = 0

Query: 1   MASSASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVN 60
           MASSASSP TK HFPHSPLP  P+     SCAQFLCKS+FFC FLLLLPLFPSEAPDFV+
Sbjct: 1   MASSASSPFTKLHFPHSPLPQPPA----NSCAQFLCKSIFFCFFLLLLPLFPSEAPDFVD 60

Query: 61  QTLLTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFE 120
           QTL TKFWELFHLMFVGIAVSYGLFS R+ Q++VDEPR+S+F++PQSY SKM +VASIF+
Sbjct: 61  QTLFTKFWELFHLMFVGIAVSYGLFSTRNNQMNVDEPRYSSFENPQSYLSKMLYVASIFD 120

Query: 121 DADDLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDT 180
           D DD   SDERK+SEVLYIQP  GS S   D NAQSR QEKL  S+PKKRYENSYE +DT
Sbjct: 121 DVDDFGVSDERKVSEVLYIQPKLGSAS---DLNAQSRHQEKLRYSMPKKRYENSYEFADT 180

Query: 181 DNVGHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLE 240
           DNV HACKSRYTRGGS++VV ETNRSS     SG IVNYKPLGLPVRSLRS+LTE DD+E
Sbjct: 181 DNVAHACKSRYTRGGSVVVVPETNRSS-----SGGIVNYKPLGLPVRSLRSSLTESDDVE 240

Query: 241 FDCGDESCLSSKTSSNSSENNCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKK 300
           FDCGDESCLSSK+S  SSENNCE  SEFGDNCC NLEEKFDE  I+S+S FQ REKFGKK
Sbjct: 241 FDCGDESCLSSKSSPKSSENNCEGNSEFGDNCCVNLEEKFDETAIASMSSFQLREKFGKK 300

Query: 301 LVRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKH 360
           ++RERG GNA+LRPSHFRPPSIDETQFESLK S SLHS LSQSSQTSS S  L STTRK 
Sbjct: 301 VIRERGFGNAVLRPSHFRPPSIDETQFESLKKSGSLHSYLSQSSQTSSLSSPLSSTTRKP 360

Query: 361 RKMSSLSNISSKSLHSRQYSMSSLSENGRGSSEDPLIEPENSSECNGSVVSSPNLDRNFA 420
           RKMSSLSNIS KSLHSRQYS SSLSEN RGSSEDPLIE ENSSECN SVVSSP  DRNFA
Sbjct: 361 RKMSSLSNISYKSLHSRQYSTSSLSENSRGSSEDPLIEQENSSECNESVVSSPRSDRNFA 420

Query: 421 RMPKALSRGKSVRTVRANVVTMEEMKAQEMYRNQMEHDNNTG-KFEEGGES-PYMREDGM 480
            +PKALS+GKSVR +RAN   +E+MKAQEM+R Q++HD+  G KFEEGG S PYMREDG 
Sbjct: 421 SIPKALSQGKSVRRIRANAAAIEDMKAQEMHRKQVKHDDIIGNKFEEGGMSPPYMREDGT 480

Query: 481 GHGWDAVVNPNAGNSNRLSKTTFLGIEEQKEDTESLLTDDGKDNSDKEDETIFASSDEEA 540
           G GW  VVNPNAGN NR  KTTFLGI+EQKE+TESL+ DD KD+S+ EDE++FASSDEEA
Sbjct: 481 GQGWPDVVNPNAGNMNRFPKTTFLGIKEQKEETESLVADDSKDDSEGEDESLFASSDEEA 540

Query: 541 ASSMAGDSESGAYEVDKKAGEFIAKFREQIQLQRMASAEKRLR------GGWGSFSSTGS 597
            SSMAGDSESGA+EVDKKAGEFIAKFREQIQLQRMAS EKRLR      GGWGSFSST S
Sbjct: 541 GSSMAGDSESGAFEVDKKAGEFIAKFREQIQLQRMASVEKRLRGGGGGGGGWGSFSSTSS 592

BLAST of Tan0002794 vs. TAIR 10
Match: AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink). )

HSP 1 Score: 142.9 bits (359), Expect = 8.3e-34
Identity = 133/402 (33.08%), Postives = 200/402 (49.75%), Query Frame = 0

Query: 4   SASSPLTKPHFPHSPLPPTPSAHQHKSCAQFLCKSLFFCIFLLLLPLFPSEAPDFVNQTL 63
           ++ +P TK   P + + P    ++      F CKS+ F +FLL LPLFPS+APDFV +T+
Sbjct: 2   ASPNPYTKRRSPPNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETV 61

Query: 64  LTKFWELFHLMFVGIAVSYGLFSRRSIQVSVDEPRFSNFDHPQSYFSKMFHVASIFEDAD 123
           LTKFWEL HL+FVGIAV+YGLFSRR+++ +VD       +   SY S++F V+S+F++  
Sbjct: 62  LTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDESSLSYVSRIFQVSSVFDEEF 121

Query: 124 DLSDSDERKLSEVLYIQPNRGSVSDFWDFNAQSREQEKLHCSIPKKRYENSYESSDTDNV 183
           D +  +   +     +      V     F  +S E             E S E  +T+ V
Sbjct: 122 DDNSCEFVDVRSDESVSARASVVGKSESFVVESGE------------LEESSEFGETNEV 181

Query: 184 GHACKSRYTRGGSLLVVAETNRSSGEWMESGAIVNYKPLGLPVRSLRSNLTEPDDLEFDC 243
             A  S+Y +G S +VVA        +   G +V ++PLGLP+R LRS+L          
Sbjct: 182 -RAWNSQYFQGKSKVVVARP-----AYGLDGHVV-HQPLGLPIRRLRSSLR--------- 241

Query: 244 GDESCLSSKTSSNSSEN--NCERISEFGDNCCTNLEEKFDEAVISSLSPFQWREKFGKKL 303
            D + L  K+ ++S +   N E  S   DN        FDE + +  SP  W+ +     
Sbjct: 242 -DNAALQDKSFADSCDGAVNAEAESLLADNF-------FDEVLAAPASPVPWQAR----- 301

Query: 304 VRERGVGNAILRPSHFRPPSIDETQFESLKNSSSLHSSLSQSSQTSSFSPALPSTTRKHR 363
               G+G+    PS+F+P S+DET    LK+ SS  S+ S SSQTS  S       +   
Sbjct: 302 PEMMGIGDNY--PSNFQPISVDET----LKSISS-RSTGSSSSQTSYAS-------QNQN 348

Query: 364 KMSSLSNISSKSLHSR-QYSMSSLSENGRGSSEDPLIEPENS 403
           + S   ++S++SL+S  +  +   S      S  P + P  S
Sbjct: 362 RFSPSRSVSAESLNSNVEELVKEKSRQSSSRSSSPSLPPSPS 348

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
ADN34231.16.0e-26080.23hypothetical protein [Cucumis melo subsp. melo] >TYK24724.1 DUF761 domain-contai... [more]
XP_004140631.18.9e-25679.10uncharacterized protein LOC101220435 [Cucumis sativus] >KGN46495.1 hypothetical ... [more]
XP_023006022.14.8e-24977.63uncharacterized protein LOC111498900 [Cucurbita maxima][more]
KAG6575261.11.5e-24778.04hypothetical protein SDJN03_25900, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7013816.14.5e-24778.04hypothetical protein SDJN02_23985, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
A0A5D3DMA52.9e-26080.23DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676... [more]
E5GCN22.9e-26080.23Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1[more]
A0A0A0K9X14.3e-25679.10Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G103540 PE=4 SV=1[more]
A0A6J1KUS42.3e-24977.63uncharacterized protein LOC111498900 OS=Cucurbita maxima OX=3661 GN=LOC111498900... [more]
A0A6J1H4M01.2e-24577.48uncharacterized protein LOC111459998 OS=Cucurbita moschata OX=3662 GN=LOC1114599... [more]
Match NameE-valueIdentityDescription
AT3G60380.18.3e-3433.08FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008480Protein of unknown function DUF761, plantPFAMPF05553DUF761coord: 552..578
e-value: 5.4E-10
score: 38.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 504..529
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 328..416
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 328..415
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 504..551
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..21
NoneNo IPR availablePANTHERPTHR34059EXPRESSED PROTEINcoord: 4..366
NoneNo IPR availablePANTHERPTHR34059:SF6COTTON FIBER PROTEINcoord: 367..584
NoneNo IPR availablePANTHERPTHR34059:SF6COTTON FIBER PROTEINcoord: 4..366
NoneNo IPR availablePANTHERPTHR34059EXPRESSED PROTEINcoord: 367..584

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002794.1Tan0002794.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane