Tan0003802 (gene) Snake gourd v1

Overview
Name: Tan0003802
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG01: 26763171 .. 26769062 (-)
RNA-Seq Expression: Tan0003802
Synteny: Tan0003802
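
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. A minimal sketch of pulling those parts out and computing the genomic span follows. The regular expression and variable names are illustrative, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Location string as shown in the Overview above.
location = "LG01: 26763171 .. 26769062 (-)"

# Hypothetical pattern for "<seqid>: <start> .. <end> (<strand>)".
pattern = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)
match = pattern.fullmatch(location)

seqid = match["seqid"]
start = int(match["start"])
end = int(match["end"])
strand = match["strand"]

# ASSUMPTION: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1

print(seqid, start, end, strand, span)
```

Under that assumption the gene spans 5892 bases on the minus strand of LG01.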
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGAACAGAGCCCTAGACTCGAGCCGGCATCGGAGATCCAAAACGTGCCGCCGTTCAGGTCCGCCAGTGGCGGGCGGAGCATGTCGGCGAGCGGCGGCGGAGGCAGCCGGAGGGAAACGCCGGACGTTCACAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAAGTGTATAGGGCGTACGAAGGGCATGGAGAAAGGCCGACGATTGTGGAGATTGTGGGATGGTGCTTCTATGAACTTTGCTCGTTGTCAGTTGTGACGGTGCTGATTCCGGTAGTTTTTCCGTTGATTATCAGCCAGATTAGTGGGGCTGCGACGGAGCCGCCTCAGGGATGGTTTAAGAGCTTAATGGGCTTCCATTGTCCTCCCAGAGAGATTCGACTGTAAGTCGGTTTGCTTTGTGTCTCGGTTGAGTTTGAGGAATATTTTGCACCTCAAACATATAATAAAAATAAATAATTTTTTATTCTAAACACAAATTTAGTAATAATTTATACGGTATCTCACTCGTCTAGTCATACTCAAAAAAATAAAATAAAGATTGATTGTAACTCACTCAACTCCTACGACCACGACATTGGTTTTCTATCCTCGGAGAAATTTCACTTCTTAAATTACTAATTTCTTTTTGATTGGGTAAAAATTTTAAATGAGTAAAATTACAATGTTCTATAAATTTTTTGTGTGAAAAGTTTAATTTTGTGTCCAAAAAGTTTTTAACTTATTCGACTTTTCTAAATTCATCAATTTATTAGAAACAAAATTAAAAATTTTAGTTTCTATTAGACATAAATTTTGATTTTATATTTAATAGATTGGTTAATTTTTTAAATTTTTAAATATGTCAATGACAATTCAAAATTGAAAGTTCATGACCTATTAAATATTCTTAAGATTTAAAACTTGTTAGACACAAAATCTAAAGTCTAAACCGACATAAACCTAAAAGTTTAAGAACTAAATTTGTAATTTAACCTAAATTTAATATGCATTTTAAAAGTGTCACAAATTTTTATTTTTCTCTAACATAATATGAAATTTTGTTCTAAGAGTTTGAAAACTAATGGACCTAATTGCAACCAAACTCAAACCTAAGTCACATTTTTCTTGAAAAAAAAAGGTTGGGTTTACATTATATCTAGGTTTAACTTTTATTTCAATTATTTCTTTAAAAAATTGAATTATGTTATTTTTAAGAGTTGAAGTTTCTATTATATTTAAAAGTACATTTTTTTTTTCTAAAACTCATTTCAAAAATATTTAAATTAATAGAAAATCATTTAAAAGGAAACATGATAATTTTTTTATCATTAATATTATCTTTACGCCAATCTCTTCTATATTATTTTTCTAAGGATTGGAATGAACTTCAATTTTTTTGTTTAAATATTAGATCACACACGAAGTTAGCATTCCGAGGATTTGAGTCTACGATCTCTTAGAATAACTAAAGGCTAAGTGAAGCGATGAATCACTAATTGATTTATATAGTTATAAAAGTTTTGATTTGATGCTTATGTTTTGAAAAGTTTATTTTTACATACTACAATTTGGATTTAATTTTCAATTTTTTTTTTATACAGAAACCCTAGATAATTTCAAAAAGATTTTTGTAGTTGGTATCTATAATTTGACAAATTCTCCAATATAGTTCCCGAAACATAGATTATCCCTTAAAAAATTCACCCATTTTTTCTCTACTTTTTAGTCCATCCTCTTTATAATTGACACAATTTGTAAGTTTTACGAACCATAAAGACATCTAAAAAAACAAAATCATTAAACTATAAAGGACAAATTAAAAATTTTAAAAGTCTAAATGAAAACGAAGCTTAGATTACGCGAACTAAATTGAAATTTTTCAAAGTGAGAATCGTATAAACCATAGATAACTATTTGTTTATTATTTTTAACCACAGAAAATTAAGCTTATAAATATAAACACTACTTTCATCTTATTGAGGTTTTAAAAACTACAAAAGTTAATTTTTAATTTG
TTTTTTTAGAATGTGCTTAAGAATCTAAATATTTTTAAAAAAAAAAAAATTGAAAATGATTGTAAAAATATTATTTAAAAAATCATCATTTTTAAAAACAAAATTAAATTTATAATCCAAATGAAAAAATAGTAACGTACACATGGCTTCCTTTGATTGGCTGGATGCAGGTACCAGAGCCTAACGGAGCACACAATAAAAGTAGGCAGCTACCAATTCTCGCCATTAATATGGACCTCAATCTCATGGGCTTTGGGCCTGGTTCTGACCGGCCCAATCCTCGCCTTTGCCTCCTTCCATCTCGATTACGGCTTCAACCAACACCTCATCACTCTTGCCGCCGTCGCCGCTGGCGCCCTTTCCTGTCTTCCGATCGGCATCTTCAAAACCGTCAAGATTTTTCCTCTTTACATCATTTTAATCGTTATTGCTCAGTCCGTCGCCTTCACCTCTCACACGCGCCACCTCGGCCTCATTCTCCGCGGCCTCGCCGGACCGACCGTTCGTAAGGCCAAATTTTCTCAAAGAAGAATCGGATCTGGTTTGATTTCGTCGTGCTCCGCCGCCGTCGGCGGCTTGGGCTCCGCCGCTATCTCCGCCTTCACTTACCACATGCTTCGACGGTTGGTATTCTAATCAGTACTCCGTTCCTACTGTTTAGTTCGAGTCGGAATCTTTTTATGGGAAGGACTAAGGATAAGATTTTACATTTTAATTTATAATTATGATTTTTTTTTAAGATAAATGAAAGAAAAAGACTTTAAATTTTTAATTTTTTTAATGGTTAAATTATTATTATATAATTTTTAGATTCATGTCTAATAGGTTTTAGACTAAAAAAACTGTAAAAAAAAAAGTACATTAAGTTTCAATTTTGTGTAGGATAGGTTTTCAAACTTCCAAAAAAAAATTGTAATTCACAAATTTTATGGAAATAAAGTTAAAACATCACAGTTTTGTTAGACACAAAATTAAATTTTGTTGCTAGTAGTAGTACGCATTGATTTTGAAAATTTTTTAATAGTGAAAGACAGACATGAAATTGAAATGTTAGAGATATTTATTAGCCACGTCTAGAAATTTATTAGATTCAAAATTAAAATGGTAGGACCTATACAAGTTTTTTTTTTTTCTTTTTGCTTCCTTTGAGAATATGAACCTATTAAGTATTATATACCTTTTAAACACCTTGAACATATTAGATACAACTATTAGATTAGATACAAAATTAATAGCTAATCTTAACTTAATAAGACATCAATTATCATTTTAAAAGTGGATAGTTTACTCCTTCCACCCCACAATTGAACTCATAAAAGAAGGCTAAATTATAGAAATACCCTTGAACTTTATACTTTGTTCAAAAATACTCTTGAACTTTTAAAAGTTTCATTAATACTATTGAACTTTAAAAAATGTTAAAAAATATCATTACCGATAGTACTTTGTTCAAAAATACTCTTGAACTTTTAAAAGTTTCATTAATACTATTGAACTTTAAAAAATGTTCAAAAATACCATTACTATTAGTACTTTGTTTCAAAAATACCCCTAAACTTTGTACTTTATTTCAAAAATAAAATTTTTTCCTTACTCTTAGTACATGGATAGAAATTGTTAGTACTTCGTTTTAAAATATCCCTGAACTTTCAAAAGTTGTACGTTAAAAAATTCAAAAAAATAAAACAACACCCTTACCGCTATCGAAAAGTATTTATCAATACCTCATTACCTCCCTCTCCTATTTCTTGTTTCTTGATTTGCTTCGATTTATAGCCTAACACTTCTCTAATTTTGCTCTTTTTTTTCATTATTTTTCAATACCTTCTTTACTCTCTTATTCCATTTTCCAATCTCTAAAATTACCCAATTCAATCAAGGTATATAGGCTATAGCAAGAAGAAAACAGAATAATTGAACCACCAAAGGATCTTAGGATTTAAATGCTTTAACATGGTTATGTTTTAGTAATTTGAATTAATACGTTTGAGTTTAATACA
CAAAAAGAGAAACATGTGAAGAATTTATATCATAATTTTGAGGACATTTAGTGAGAGGAAAAATAAAGAAAACACATGCATTACCAAATTTGTAGCGCATGGCGAACAAGAGCATAGCTCAATTGATATCAAATATGACTCATGACCAAGAGGTCATAAGTTTGGGTTTGAATCTCCCACCCTCAATTTTTTTTTGCAGTTCAAATTACTAATTAGTATATTTATTAGGGCCAGTTAAGAAGGTTATCATTATTGAAATTTGGACAAATGGAGGAAAAATGTGAATATCTATATGAAAAAATAAAGTTATCTATATGATTTGTTTTCAAATTGATTATTTTCCAACTCAAAATAATTTCAAATACATTGACTATTTAGGTCTTGACATTTTTTATATTTATTTTTGGGCATGATTAGAAGTTAGAAATGAGTGAAAGTTTTTTCCCGCCTATTTTAAGTTCTAATATTTAAGAAGTAGGAGAAATGAGGGAAGTCTTAGACATTGAATGGAAGAAAAGAGTAAGAAGATATTGAAAAAAAAGAAAAAACAAGAAAAAATAGAGAAGTGTTAGATCAAATCAAGAGAAAGAGGAAATAGACTAGGGAGGTAATTGATAAAAGAGTGTATGTTATATATTAATAGTAAGTGCATTTTTAAAATTACTTTAAAGATTAAGAGTATTAATGTGAAAGTTCAAATGTATTTTTAAAACGAGATACTAACAAATTTTGTTCATATATTAACGATAAATGTATTTTTTAATTTTTTTTTTTTTTGAAGTTTAAGAATATTAATGAAACTTTTGAAAGTTTAGAGGTATTTTTTAAAAAAAGTACAAAATATTAACGGTTTCCCTTCAAAATTAATGGTAGAAGTATTTTTGAAATTTTCTTTGTTTAGATGTATTTTTTAACCTTGTTTTTATTTTTAATTACATATTTCGAGTTTCATAAGCTATGCTAATTTCTAAATACACTTTATTATTATTTTTTGGTGATTTTGTTCAAAAAAAAAAATCACTTCCTGTTCACTTGCGAGTGTTTTCGATGTTAGTGACAGGCAATTGAAAGAAGACGACGAAAATCACTTCCTCAGCCTATGGATCGTCACGATCTTCGGCGGCCTAAAATGGCTTCTCGGAATCTTCCACATCTTCGTCACAAATCGATCACTCTCCGTAACAATCCCTTCCGATTCAGAGCTTCACACTCTTTCAATTTTCAAATTTCCTCACGCAATCGGCAGCGTAATCTCCGGCGGATTCCTCTCTTCCTTCGCCACAATCTGCATCTTCACCGCCGTTATACTCTTCCTAATCGGTCAAATCTGTTTCAAACCAGTCTTGATTTTCTATCTATGGCTGATCTACTTCCTCGTTCCTCTAATTTCCCTTCCATTACTCCATCAATTCCAGATCCGAATCAAATCCGACGCCTCCAAAATGCAAATCCTAGGGTTCATCCTCTCCGCCGCCACGTCCGCCACTTGCTTCTACTTCCACGACGGCGCCTGGCGGCGGAGCGTGGTCTTCGTCTTCGCCGCTCTCCAAGGCACGGCGGCGGCGGTTCTACAGGCGTACGGAAGAGTTTTGGTGCTCGATTGCTCGCCGGCGGGGAAGGAAGGTGCGATTTCGATGTGGTTTTCGTGGATGAGAGTGATCGGCGGTTGCGTTGGATTTACGGTCGCCGCGGTGGTTCCGGCGAAGTTGCAGGTTTCTTCCGGTGTGGCGTTTTGTAGCGCTGTCGTCGGAGGAGTGGTGCTGATCTATGGTAATGTTACTGATTACGGCGGCGCTGTGGCGGCGGGGCATGTGAGAGATGATAGTGAAAAGGGATCGCCGGTGATTGGATTGGAATCTCGGAGTGAGAGTAAAGAGCTTGAATCGCCTTGA

mRNA sequence

ATGGCCGAACAGAGCCCTAGACTCGAGCCGGCATCGGAGATCCAAAACGTGCCGCCGTTCAGGTCCGCCAGTGGCGGGCGGAGCATGTCGGCGAGCGGCGGCGGAGGCAGCCGGAGGGAAACGCCGGACGTTCACAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAAGTGTATAGGGCGTACGAAGGGCATGGAGAAAGGCCGACGATTGTGGAGATTGTGGGATGGTGCTTCTATGAACTTTGCTCGTTGTCAGTTGTGACGGTGCTGATTCCGGTAGTTTTTCCGTTGATTATCAGCCAGATTAGTGGGGCTGCGACGGAGCCGCCTCAGGGATGGTTTAAGAGCTTAATGGGCTTCCATTGTCCTCCCAGAGAGATTCGACTGTACCAGAGCCTAACGGAGCACACAATAAAAGTAGGCAGCTACCAATTCTCGCCATTAATATGGACCTCAATCTCATGGGCTTTGGGCCTGGTTCTGACCGGCCCAATCCTCGCCTTTGCCTCCTTCCATCTCGATTACGGCTTCAACCAACACCTCATCACTCTTGCCGCCGTCGCCGCTGGCGCCCTTTCCTGTCTTCCGATCGGCATCTTCAAAACCGTCAAGATTTTTCCTCTTTACATCATTTTAATCGTTATTGCTCAGTCCGTCGCCTTCACCTCTCACACGCGCCACCTCGGCCTCATTCTCCGCGGCCTCGCCGGACCGACCGTTCGTAAGGCCAAATTTTCTCAAAGAAGAATCGGATCTGGTTTGATTTCGTCGTGCTCCGCCGCCGTCGGCGGCTTGGGCTCCGCCGCTATCTCCGCCTTCACTTACCACATGCTTCGACGTGACAGGCAATTGAAAGAAGACGACGAAAATCACTTCCTCAGCCTATGGATCGTCACGATCTTCGGCGGCCTAAAATGGCTTCTCGGAATCTTCCACATCTTCGTCACAAATCGATCACTCTCCGTAACAATCCCTTCCGATTCAGAGCTTCACACTCTTTCAATTTTCAAATTTCCTCACGCAATCGGCAGCGTAATCTCCGGCGGATTCCTCTCTTCCTTCGCCACAATCTGCATCTTCACCGCCGTTATACTCTTCCTAATCGGTCAAATCTGTTTCAAACCAGTCTTGATTTTCTATCTATGGCTGATCTACTTCCTCGTTCCTCTAATTTCCCTTCCATTACTCCATCAATTCCAGATCCGAATCAAATCCGACGCCTCCAAAATGCAAATCCTAGGGTTCATCCTCTCCGCCGCCACGTCCGCCACTTGCTTCTACTTCCACGACGGCGCCTGGCGGCGGAGCGTGGTCTTCGTCTTCGCCGCTCTCCAAGGCACGGCGGCGGCGGTTCTACAGGCGTACGGAAGAGTTTTGGTGCTCGATTGCTCGCCGGCGGGGAAGGAAGGTGCGATTTCGATGTGGTTTTCGTGGATGAGAGTGATCGGCGGTTGCGTTGGATTTACGGTCGCCGCGGTGGTTCCGGCGAAGTTGCAGGTTTCTTCCGGTGTGGCGTTTTGTAGCGCTGTCGTCGGAGGAGTGGTGCTGATCTATGGTAATGTTACTGATTACGGCGGCGCTGTGGCGGCGGGGCATGTGAGAGATGATAGTGAAAAGGGATCGCCGGTGATTGGATTGGAATCTCGGAGTGAGAGTAAAGAGCTTGAATCGCCTTGA

Coding sequence (CDS)

ATGGCCGAACAGAGCCCTAGACTCGAGCCGGCATCGGAGATCCAAAACGTGCCGCCGTTCAGGTCCGCCAGTGGCGGGCGGAGCATGTCGGCGAGCGGCGGCGGAGGCAGCCGGAGGGAAACGCCGGACGTTCACAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAAGTGTATAGGGCGTACGAAGGGCATGGAGAAAGGCCGACGATTGTGGAGATTGTGGGATGGTGCTTCTATGAACTTTGCTCGTTGTCAGTTGTGACGGTGCTGATTCCGGTAGTTTTTCCGTTGATTATCAGCCAGATTAGTGGGGCTGCGACGGAGCCGCCTCAGGGATGGTTTAAGAGCTTAATGGGCTTCCATTGTCCTCCCAGAGAGATTCGACTGTACCAGAGCCTAACGGAGCACACAATAAAAGTAGGCAGCTACCAATTCTCGCCATTAATATGGACCTCAATCTCATGGGCTTTGGGCCTGGTTCTGACCGGCCCAATCCTCGCCTTTGCCTCCTTCCATCTCGATTACGGCTTCAACCAACACCTCATCACTCTTGCCGCCGTCGCCGCTGGCGCCCTTTCCTGTCTTCCGATCGGCATCTTCAAAACCGTCAAGATTTTTCCTCTTTACATCATTTTAATCGTTATTGCTCAGTCCGTCGCCTTCACCTCTCACACGCGCCACCTCGGCCTCATTCTCCGCGGCCTCGCCGGACCGACCGTTCGTAAGGCCAAATTTTCTCAAAGAAGAATCGGATCTGGTTTGATTTCGTCGTGCTCCGCCGCCGTCGGCGGCTTGGGCTCCGCCGCTATCTCCGCCTTCACTTACCACATGCTTCGACGTGACAGGCAATTGAAAGAAGACGACGAAAATCACTTCCTCAGCCTATGGATCGTCACGATCTTCGGCGGCCTAAAATGGCTTCTCGGAATCTTCCACATCTTCGTCACAAATCGATCACTCTCCGTAACAATCCCTTCCGATTCAGAGCTTCACACTCTTTCAATTTTCAAATTTCCTCACGCAATCGGCAGCGTAATCTCCGGCGGATTCCTCTCTTCCTTCGCCACAATCTGCATCTTCACCGCCGTTATACTCTTCCTAATCGGTCAAATCTGTTTCAAACCAGTCTTGATTTTCTATCTATGGCTGATCTACTTCCTCGTTCCTCTAATTTCCCTTCCATTACTCCATCAATTCCAGATCCGAATCAAATCCGACGCCTCCAAAATGCAAATCCTAGGGTTCATCCTCTCCGCCGCCACGTCCGCCACTTGCTTCTACTTCCACGACGGCGCCTGGCGGCGGAGCGTGGTCTTCGTCTTCGCCGCTCTCCAAGGCACGGCGGCGGCGGTTCTACAGGCGTACGGAAGAGTTTTGGTGCTCGATTGCTCGCCGGCGGGGAAGGAAGGTGCGATTTCGATGTGGTTTTCGTGGATGAGAGTGATCGGCGGTTGCGTTGGATTTACGGTCGCCGCGGTGGTTCCGGCGAAGTTGCAGGTTTCTTCCGGTGTGGCGTTTTGTAGCGCTGTCGTCGGAGGAGTGGTGCTGATCTATGGTAATGTTACTGATTACGGCGGCGCTGTGGCGGCGGGGCATGTGAGAGATGATAGTGAAAAGGGATCGCCGGTGATTGGATTGGAATCTCGGAGTGAGAGTAAAGAGCTTGAATCGCCTTGA

Protein sequence

MAEQSPRLEPASEIQNVPPFRSASGGRSMSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMGFHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPTVRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVTIFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICIFTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSAATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
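
The protein sequence above is the conceptual translation of the CDS. As a rough check, the sketch below translates the first and last codons of the CDS with the standard genetic code and compares them with the ends of the protein. The helper `translate` and the table construction are illustrative and are not part of the database's tooling; only two short fragments copied from the CDS are used, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGGCCGAACAGAGCCCTAGA"
cds_end = "GAGCTTGAATCGCCTTGA"

print(translate(cds_start))  # MAEQSPR
print(translate(cds_end))    # ELESP
```

Both fragments agree with the N-terminus (MAEQSPR) and C-terminus (ELESP) of the protein sequence shown above.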
Homology
BLAST of Tan0003802 vs. NCBI nr
Match: XP_022965332.1 (uncharacterized protein LOC111465229 isoform X1 [Cucurbita maxima])

HSP 1 Score: 907.1 bits (2343), Expect = 7.3e-260
Identity = 466/559 (83.36%), Positives = 507/559 (90.70%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPPFRSASGG-RSMSASGGGGSRRETPDVHSTAAKLERAKEVYR 60
           MAEQSPR   +SEIQN PP RS SG   S + SG GGSR++TPD HS AAKLERAKEVYR
Sbjct: 1   MAEQSPR-PKSSEIQNAPPPRSGSGRITSTTRSGSGGSRKDTPDFHSMAAKLERAKEVYR 60

Query: 61  AYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMG 120
           AYEGHGE+P+I+E+ GWCFYELCSLSV+TVLIPVVFPLIISQISGAA EPPQGWF+S MG
Sbjct: 61  AYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFMG 120

Query: 121 FHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQ 180
           F CPP E++LYQ LT+HTIK+   +FSPLIWTSISWALGL++ GPILAFASFHLDYGFNQ
Sbjct: 121 FDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFNQ 180

Query: 181 HLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPT 240
           HLI + AVAAGALSCLP G+F+TVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL GPT
Sbjct: 181 HLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGPT 240

Query: 241 VRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVT 300
           V KAKF+QRR GSGLISSCS AVGGLG+AAISAFTYHMLRR+RQ KE D+NHFLSLWIVT
Sbjct: 241 VLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDDNHFLSLWIVT 300

Query: 301 IFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICI 360
           IFGGLKWLLGIFH+F+TNRS+SVTIPSDSELH L+IFK+PHAIG+VIS GFLSSF TI I
Sbjct: 301 IFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIAI 360

Query: 361 FTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSA 420
           F AV LFLIGQICFKPVLI YLWLIYFL+PLISLPLLHQFQIRIK+DASKMQILGFILSA
Sbjct: 361 FIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILSA 420

Query: 421 ATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWM 480
            TSA CFYFH+ AWR  VVFVFAALQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWM
Sbjct: 421 VTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSWM 480

Query: 481 RVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEK 540
           R IGGCVGFTVAAVVPA+LQVSSGVAFC AVVGGVVLIYGN+TDYGGAV+AGHV++DSEK
Sbjct: 481 RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSEK 540

Query: 541 GSPVIGLESRSESKELESP 559
           GSPVIGLESRS SKELESP
Sbjct: 541 GSPVIGLESRSVSKELESP 558
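
The identity and positive percentages reported for this hit are the raw counts divided by the alignment length. The sketch below recomputes them from the counts in the HSP summary. The parsing pattern is an illustrative assumption, and the summary text is typed in here rather than read from a BLAST output file.

```python
import re

# Counts copied from the HSP summary of the XP_022965332.1 hit.
stats = "Identity = 466/559 (83.36%), Positives = 507/559 (90.70%)"

percentages = {}
# Each field looks like "<label> = <matched>/<aligned>".
for label, matched, aligned in re.findall(r"(\w+) = (\d+)/(\d+)", stats):
    percentages[label] = round(100 * int(matched) / int(aligned), 2)

print(percentages)
```

The recomputed values, 83.36 and 90.7, agree with the percentages printed in the summary.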

BLAST of Tan0003802 vs. NCBI nr
Match: KAG6577745.1 (hypothetical protein SDJN03_25319, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 906.0 bits (2340), Expect = 1.6e-259
Identity = 466/559 (83.36%), Positives = 506/559 (90.52%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPPFRSASGG-RSMSASGGGGSRRETPDVHSTAAKLERAKEVYR 60
           MAEQSPR   +SEIQ+ PP RS SG   S + SG GGSR++TPD HS AAKLERAKEVYR
Sbjct: 1   MAEQSPR-PKSSEIQSAPPPRSGSGRITSTTRSGSGGSRKDTPDFHSMAAKLERAKEVYR 60

Query: 61  AYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMG 120
           AYEGHGE+P+I+E+ GWCFYELCSLSV+TVLIPVVFPLIISQISGAATEPPQGWFKS+MG
Sbjct: 61  AYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAATEPPQGWFKSVMG 120

Query: 121 FHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQ 180
           F C P E++LYQ LTEHTIKV   +FSPLIWTSISWALGL+L GPIL FASFHLDYGFNQ
Sbjct: 121 FDCAPGEMQLYQILTEHTIKVSGTRFSPLIWTSISWALGLILAGPILVFASFHLDYGFNQ 180

Query: 181 HLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPT 240
           HLI + AVAAGALSCLP G+F+TVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL GPT
Sbjct: 181 HLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGPT 240

Query: 241 VRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVT 300
           V KAKF+QRR GSGLISSCS AVGGLG+AAISAFTYHMLRR+RQ KE DENHFLSLWIVT
Sbjct: 241 VLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDENHFLSLWIVT 300

Query: 301 IFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICI 360
           IFGGLKWLLG+ H+F+TNRS+SVTIPSDSELH L+IFK+PHAIG+VIS GFLSSF TI +
Sbjct: 301 IFGGLKWLLGVVHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIAV 360

Query: 361 FTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSA 420
           F AV LFLIGQICFKP LI YLWLIYFL+PLISLPLLHQFQIRIK+DASKMQILGFILSA
Sbjct: 361 FIAVSLFLIGQICFKPALILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILSA 420

Query: 421 ATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWM 480
            TSA CFYFH+ AWRR VVFVFAALQGTAAA+L +YGRVLVLDCSPAGKE AISMWFSWM
Sbjct: 421 VTSAICFYFHNDAWRRPVVFVFAALQGTAAALLHSYGRVLVLDCSPAGKEAAISMWFSWM 480

Query: 481 RVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEK 540
           R IGGCVGFTVAAVVP +LQVSSGVAFC AVVGGVVLIYGNVTDYGGAVAAGHV++DSEK
Sbjct: 481 RAIGGCVGFTVAAVVPTRLQVSSGVAFCCAVVGGVVLIYGNVTDYGGAVAAGHVKNDSEK 540

Query: 541 GSPVIGLESRSESKELESP 559
           GSPV+GLESRS SKELESP
Sbjct: 541 GSPVVGLESRSVSKELESP 558
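
The middle line of each alignment block marks identical residues with the residue letter, conservative substitutions with `+`, and other mismatches with a space. A minimal sketch of counting identities and positives for the final block of this hit follows. The three strings are copied from the block above with the coordinates removed, and the variable names are illustrative.

```python
# Last alignment block of the KAG6577745.1 hit (query positions 541-559).
query   = "GSPVIGLESRSESKELESP"
midline = "GSPV+GLESRS SKELESP"
sbjct   = "GSPVVGLESRSVSKELESP"

# Identities: columns where query and subject carry the same residue.
identical = sum(q == s for q, s in zip(query, sbjct))

# Positives: every non-blank midline column (identities plus '+' substitutions).
positives = sum(m != " " for m in midline)

print(identical, positives, len(query))
```

For this block the counts are 17 identities and 18 positives over 19 columns. Summing the same counts over all blocks of a hit would give the totals reported in its HSP summary.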

BLAST of Tan0003802 vs. NCBI nr
Match: KAG7015784.1 (hypothetical protein SDJN02_23422, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 904.8 bits (2337), Expect = 3.6e-259
Identity = 466/559 (83.36%), Positives = 506/559 (90.52%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPPFRSASGG-RSMSASGGGGSRRETPDVHSTAAKLERAKEVYR 60
           MAEQSPR   +SEIQ+ PP RS SG   S + SG GGSR++TPD HS AAKLERAKEVYR
Sbjct: 1   MAEQSPR-PKSSEIQSAPPPRSGSGRITSTTRSGSGGSRKDTPDFHSMAAKLERAKEVYR 60

Query: 61  AYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMG 120
           AYEGHGE+P+I+E+ GWCFYELCSLSV+TVLIPVVFPLIISQISGAATEPPQGWFKS+MG
Sbjct: 61  AYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAATEPPQGWFKSVMG 120

Query: 121 FHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQ 180
           F C P E++LYQ LTEHTIKV   +FSPLIWTSISWALGL+L GPIL FASFHLDYGFNQ
Sbjct: 121 FDCVPGEMQLYQILTEHTIKVSGTRFSPLIWTSISWALGLILAGPILVFASFHLDYGFNQ 180

Query: 181 HLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPT 240
           HLI + AVAAGALSCLP G+F+TVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL GPT
Sbjct: 181 HLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGPT 240

Query: 241 VRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVT 300
           V KAKF+QRR GSGLISSCS AVGGLG+AAISAFTYHMLRR+RQ KE DENHFLSLWIVT
Sbjct: 241 VLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDENHFLSLWIVT 300

Query: 301 IFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICI 360
           IFGGLKWLLGI H+F+TNRS+SVTIPSDSELH L+IFK+PHAIG+VIS GFLSSF TI +
Sbjct: 301 IFGGLKWLLGIVHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIAV 360

Query: 361 FTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSA 420
           F AV LFLIGQICFKP LI YLWLIYFL+PLISLPLLHQFQIRIK+DASKMQILGFILSA
Sbjct: 361 FIAVSLFLIGQICFKPALILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILSA 420

Query: 421 ATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWM 480
            TSA CFYFH+ AWRR VVFVFA+LQGTAAA+L +YGRVLVLDCSPAGKE AISMWFSWM
Sbjct: 421 VTSAICFYFHNDAWRRPVVFVFASLQGTAAALLHSYGRVLVLDCSPAGKEAAISMWFSWM 480

Query: 481 RVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEK 540
           R IGGCVGFTVAAVVP +LQVSSGVAFC AVVGGVVLIYGNVTDYGGAVAAGHV++DSEK
Sbjct: 481 RAIGGCVGFTVAAVVPTRLQVSSGVAFCCAVVGGVVLIYGNVTDYGGAVAAGHVKNDSEK 540

Query: 541 GSPVIGLESRSESKELESP 559
           GSPV+GLESRS SKELESP
Sbjct: 541 GSPVVGLESRSVSKELESP 558

BLAST of Tan0003802 vs. NCBI nr
Match: XP_022965333.1 (uncharacterized protein LOC111465229 isoform X2 [Cucurbita maxima])

HSP 1 Score: 899.8 bits (2324), Expect = 1.2e-257
Identity = 465/559 (83.18%), Positives = 505/559 (90.34%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPPFRSASGG-RSMSASGGGGSRRETPDVHSTAAKLERAKEVYR 60
           MAEQSPR   +SEIQN PP RS SG   S + SG GGSR++TPD HS AAKLERAKEVYR
Sbjct: 1   MAEQSPR-PKSSEIQNAPPPRSGSGRITSTTRSGSGGSRKDTPDFHSMAAKLERAKEVYR 60

Query: 61  AYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMG 120
           AYEGHGE+P+I+E+ GWCFYELCSLSV+TVLIPVVFPLIISQISGAA EPPQGWF+S MG
Sbjct: 61  AYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFMG 120

Query: 121 FHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQ 180
           F CPP E++LYQ LT+HTIK+   +FSPLIWTSISWALGL++ GPILAFASFHLDYGFNQ
Sbjct: 121 FDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFNQ 180

Query: 181 HLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPT 240
           HLI + AVAAGALSCLP G+F+TVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL GPT
Sbjct: 181 HLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGPT 240

Query: 241 VRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVT 300
           V KAKF+QRR GSGLISSCS AVGGLG+AAISAFTYHMLR  RQ KE D+NHFLSLWIVT
Sbjct: 241 VLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLR--RQEKEGDDNHFLSLWIVT 300

Query: 301 IFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICI 360
           IFGGLKWLLGIFH+F+TNRS+SVTIPSDSELH L+IFK+PHAIG+VIS GFLSSF TI I
Sbjct: 301 IFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIAI 360

Query: 361 FTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSA 420
           F AV LFLIGQICFKPVLI YLWLIYFL+PLISLPLLHQFQIRIK+DASKMQILGFILSA
Sbjct: 361 FIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILSA 420

Query: 421 ATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWM 480
            TSA CFYFH+ AWR  VVFVFAALQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWM
Sbjct: 421 VTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSWM 480

Query: 481 RVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEK 540
           R IGGCVGFTVAAVVPA+LQVSSGVAFC AVVGGVVLIYGN+TDYGGAV+AGHV++DSEK
Sbjct: 481 RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSEK 540

Query: 541 GSPVIGLESRSESKELESP 559
           GSPVIGLESRS SKELESP
Sbjct: 541 GSPVIGLESRSVSKELESP 556

BLAST of Tan0003802 vs. NCBI nr
Match: XP_008448612.1 (PREDICTED: uncharacterized protein LOC103490734 [Cucumis melo] >KAA0053002.1 uncharacterized protein E6C27_scaffold344G001140 [Cucumis melo var. makuwa] >TYK11458.1 uncharacterized protein E5676_scaffold139G001150 [Cucumis melo var. makuwa])

HSP 1 Score: 887.1 bits (2291), Expect = 7.8e-254
Identity = 465/562 (82.74%), Positives = 504/562 (89.68%), Query Frame = 0

Query: 2   AEQSPRLEPASEIQNVPPFRSASGGRSMSA-----SGGGGSRRETPDVHSTAAKLERAKE 61
           AEQSPR    SEIQN+PP +S S GRS+S       GGGGSRRETPD HSTAAKLERAKE
Sbjct: 3   AEQSPR-PKQSEIQNLPPSKSTS-GRSVSTPRSANGGGGGSRRETPDFHSTAAKLERAKE 62

Query: 62  VYRAYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKS 121
           VY+AYEGHGERPTIVEIVGWCFYELCS  V+T+LIPVVFPLIISQISG  T PPQGWFKS
Sbjct: 63  VYKAYEGHGERPTIVEIVGWCFYELCSFFVLTLLIPVVFPLIISQISGTPTAPPQGWFKS 122

Query: 122 LMGFHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYG 181
            MGF CP RE++LYQSLTE TIKV + +FSPLIWTSISWA+GLVL GPILA ASFHLDYG
Sbjct: 123 FMGFDCPLREMQLYQSLTEQTIKVSNAEFSPLIWTSISWAMGLVLAGPILAAASFHLDYG 182

Query: 182 FNQHLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLA 241
           FNQHLITLAAVAAGAL+CLP G+FKTVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL 
Sbjct: 183 FNQHLITLAAVAAGALTCLPTGLFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLT 242

Query: 242 GPTVRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLW 301
           GP V KAKFS RRIGSG ISS SAAVGG+G++ ISAFTYHMLRRD+Q++E  +NHFL+LW
Sbjct: 243 GPIVHKAKFSLRRIGSGQISSWSAAVGGVGASVISAFTYHMLRRDKQVQEGVDNHFLNLW 302

Query: 302 IVTIFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFAT 361
           IVTIF GLKWL+GIFH+F+TNRS+S++IPS+SELH LSIFK+P+AI +VISGGFLSSFAT
Sbjct: 303 IVTIFAGLKWLIGIFHVFLTNRSISISIPSNSELHILSIFKYPYAIATVISGGFLSSFAT 362

Query: 362 ICIFTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFI 421
           I IFTAV+LFLIGQICFKPVLI YL LIYFLVPLISLPLLHQFQIRIK+DASKM ILGFI
Sbjct: 363 ISIFTAVLLFLIGQICFKPVLILYLLLIYFLVPLISLPLLHQFQIRIKADASKMLILGFI 422

Query: 422 LSAATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWF 481
           LSAATSATCFYFH   WRR +VFVFA LQGTAAAVL AYGR LVLDCSPAGKE AISMWF
Sbjct: 423 LSAATSATCFYFHAYTWRRHLVFVFAVLQGTAAAVLHAYGRALVLDCSPAGKESAISMWF 482

Query: 482 SWMRVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDD 541
           SWMR IGGCVGFTVAAVVPA+LQVSSGV FC AVVGGVVLI+GNVTDY GAVAAGHVRDD
Sbjct: 483 SWMRSIGGCVGFTVAAVVPARLQVSSGVVFCCAVVGGVVLIFGNVTDYDGAVAAGHVRDD 542

Query: 542 SEKGSPVIGLESRSESKELESP 559
           SEKGSPVIGL+SRSESKELESP
Sbjct: 543 SEKGSPVIGLDSRSESKELESP 562

BLAST of Tan0003802 vs. ExPASy TrEMBL
Match: A0A6J1HK19 (uncharacterized protein LOC111465229 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465229 PE=4 SV=1)

HSP 1 Score: 907.1 bits (2343), Expect = 3.5e-260
Identity = 466/559 (83.36%), Positives = 507/559 (90.70%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPPFRSASGG-RSMSASGGGGSRRETPDVHSTAAKLERAKEVYR 60
           MAEQSPR   +SEIQN PP RS SG   S + SG GGSR++TPD HS AAKLERAKEVYR
Sbjct: 1   MAEQSPR-PKSSEIQNAPPPRSGSGRITSTTRSGSGGSRKDTPDFHSMAAKLERAKEVYR 60

Query: 61  AYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMG 120
           AYEGHGE+P+I+E+ GWCFYELCSLSV+TVLIPVVFPLIISQISGAA EPPQGWF+S MG
Sbjct: 61  AYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFMG 120

Query: 121 FHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQ 180
           F CPP E++LYQ LT+HTIK+   +FSPLIWTSISWALGL++ GPILAFASFHLDYGFNQ
Sbjct: 121 FDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFNQ 180

Query: 181 HLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPT 240
           HLI + AVAAGALSCLP G+F+TVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL GPT
Sbjct: 181 HLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGPT 240

Query: 241 VRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVT 300
           V KAKF+QRR GSGLISSCS AVGGLG+AAISAFTYHMLRR+RQ KE D+NHFLSLWIVT
Sbjct: 241 VLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDDNHFLSLWIVT 300

Query: 301 IFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICI 360
           IFGGLKWLLGIFH+F+TNRS+SVTIPSDSELH L+IFK+PHAIG+VIS GFLSSF TI I
Sbjct: 301 IFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIAI 360

Query: 361 FTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSA 420
           F AV LFLIGQICFKPVLI YLWLIYFL+PLISLPLLHQFQIRIK+DASKMQILGFILSA
Sbjct: 361 FIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILSA 420

Query: 421 ATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWM 480
            TSA CFYFH+ AWR  VVFVFAALQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWM
Sbjct: 421 VTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSWM 480

Query: 481 RVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEK 540
           R IGGCVGFTVAAVVPA+LQVSSGVAFC AVVGGVVLIYGN+TDYGGAV+AGHV++DSEK
Sbjct: 481 RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSEK 540

Query: 541 GSPVIGLESRSESKELESP 559
           GSPVIGLESRS SKELESP
Sbjct: 541 GSPVIGLESRSVSKELESP 558

BLAST of Tan0003802 vs. ExPASy TrEMBL
Match: A0A6J1HNK9 (uncharacterized protein LOC111465229 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465229 PE=4 SV=1)

HSP 1 Score: 899.8 bits (2324), Expect = 5.7e-258
Identity = 465/559 (83.18%), Positives = 505/559 (90.34%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPPFRSASGG-RSMSASGGGGSRRETPDVHSTAAKLERAKEVYR 60
           MAEQSPR   +SEIQN PP RS SG   S + SG GGSR++TPD HS AAKLERAKEVYR
Sbjct: 1   MAEQSPR-PKSSEIQNAPPPRSGSGRITSTTRSGSGGSRKDTPDFHSMAAKLERAKEVYR 60

Query: 61  AYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKSLMG 120
           AYEGHGE+P+I+E+ GWCFYELCSLSV+TVLIPVVFPLIISQISGAA EPPQGWF+S MG
Sbjct: 61  AYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFMG 120

Query: 121 FHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYGFNQ 180
           F CPP E++LYQ LT+HTIK+   +FSPLIWTSISWALGL++ GPILAFASFHLDYGFNQ
Sbjct: 121 FDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFNQ 180

Query: 181 HLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLAGPT 240
           HLI + AVAAGALSCLP G+F+TVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL GPT
Sbjct: 181 HLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGPT 240

Query: 241 VRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLWIVT 300
           V KAKF+QRR GSGLISSCS AVGGLG+AAISAFTYHMLR  RQ KE D+NHFLSLWIVT
Sbjct: 241 VLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLR--RQEKEGDDNHFLSLWIVT 300

Query: 301 IFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFATICI 360
           IFGGLKWLLGIFH+F+TNRS+SVTIPSDSELH L+IFK+PHAIG+VIS GFLSSF TI I
Sbjct: 301 IFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIAI 360

Query: 361 FTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFILSA 420
           F AV LFLIGQICFKPVLI YLWLIYFL+PLISLPLLHQFQIRIK+DASKMQILGFILSA
Sbjct: 361 FIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILSA 420

Query: 421 ATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWM 480
            TSA CFYFH+ AWR  VVFVFAALQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWM
Sbjct: 421 VTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSWM 480

Query: 481 RVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDDSEK 540
           R IGGCVGFTVAAVVPA+LQVSSGVAFC AVVGGVVLIYGN+TDYGGAV+AGHV++DSEK
Sbjct: 481 RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSEK 540

Query: 541 GSPVIGLESRSESKELESP 559
           GSPVIGLESRS SKELESP
Sbjct: 541 GSPVIGLESRSVSKELESP 556

BLAST of Tan0003802 vs. ExPASy TrEMBL
Match: A0A5D3CJT7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001150 PE=4 SV=1)

HSP 1 Score: 887.1 bits (2291), Expect = 3.8e-254
Identity = 465/562 (82.74%), Positives = 504/562 (89.68%), Query Frame = 0

Query: 2   AEQSPRLEPASEIQNVPPFRSASGGRSMSA-----SGGGGSRRETPDVHSTAAKLERAKE 61
           AEQSPR    SEIQN+PP +S S GRS+S       GGGGSRRETPD HSTAAKLERAKE
Sbjct: 3   AEQSPR-PKQSEIQNLPPSKSTS-GRSVSTPRSANGGGGGSRRETPDFHSTAAKLERAKE 62

Query: 62  VYRAYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKS 121
           VY+AYEGHGERPTIVEIVGWCFYELCS  V+T+LIPVVFPLIISQISG  T PPQGWFKS
Sbjct: 63  VYKAYEGHGERPTIVEIVGWCFYELCSFFVLTLLIPVVFPLIISQISGTPTAPPQGWFKS 122

Query: 122 LMGFHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYG 181
            MGF CP RE++LYQSLTE TIKV + +FSPLIWTSISWA+GLVL GPILA ASFHLDYG
Sbjct: 123 FMGFDCPLREMQLYQSLTEQTIKVSNAEFSPLIWTSISWAMGLVLAGPILAAASFHLDYG 182

Query: 182 FNQHLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLA 241
           FNQHLITLAAVAAGAL+CLP G+FKTVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL 
Sbjct: 183 FNQHLITLAAVAAGALTCLPTGLFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLT 242

Query: 242 GPTVRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLW 301
           GP V KAKFS RRIGSG ISS SAAVGG+G++ ISAFTYHMLRRD+Q++E  +NHFL+LW
Sbjct: 243 GPIVHKAKFSLRRIGSGQISSWSAAVGGVGASVISAFTYHMLRRDKQVQEGVDNHFLNLW 302

Query: 302 IVTIFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFAT 361
           IVTIF GLKWL+GIFH+F+TNRS+S++IPS+SELH LSIFK+P+AI +VISGGFLSSFAT
Sbjct: 303 IVTIFAGLKWLIGIFHVFLTNRSISISIPSNSELHILSIFKYPYAIATVISGGFLSSFAT 362

Query: 362 ICIFTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFI 421
           I IFTAV+LFLIGQICFKPVLI YL LIYFLVPLISLPLLHQFQIRIK+DASKM ILGFI
Sbjct: 363 ISIFTAVLLFLIGQICFKPVLILYLLLIYFLVPLISLPLLHQFQIRIKADASKMLILGFI 422

Query: 422 LSAATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWF 481
           LSAATSATCFYFH   WRR +VFVFA LQGTAAAVL AYGR LVLDCSPAGKE AISMWF
Sbjct: 423 LSAATSATCFYFHAYTWRRHLVFVFAVLQGTAAAVLHAYGRALVLDCSPAGKESAISMWF 482

Query: 482 SWMRVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDD 541
           SWMR IGGCVGFTVAAVVPA+LQVSSGV FC AVVGGVVLI+GNVTDY GAVAAGHVRDD
Sbjct: 483 SWMRSIGGCVGFTVAAVVPARLQVSSGVVFCCAVVGGVVLIFGNVTDYDGAVAAGHVRDD 542

Query: 542 SEKGSPVIGLESRSESKELESP 559
           SEKGSPVIGL+SRSESKELESP
Sbjct: 543 SEKGSPVIGLDSRSESKELESP 562

BLAST of Tan0003802 vs. ExPASy TrEMBL
Match: A0A1S3BK45 (uncharacterized protein LOC103490734 OS=Cucumis melo OX=3656 GN=LOC103490734 PE=4 SV=1)

HSP 1 Score: 887.1 bits (2291), Expect = 3.8e-254
Identity = 465/562 (82.74%), Positives = 504/562 (89.68%), Query Frame = 0

Query: 2   AEQSPRLEPASEIQNVPPFRSASGGRSMSA-----SGGGGSRRETPDVHSTAAKLERAKE 61
           AEQSPR    SEIQN+PP +S S GRS+S       GGGGSRRETPD HSTAAKLERAKE
Sbjct: 3   AEQSPR-PKQSEIQNLPPSKSTS-GRSVSTPRSANGGGGGSRRETPDFHSTAAKLERAKE 62

Query: 62  VYRAYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFKS 121
           VY+AYEGHGERPTIVEIVGWCFYELCS  V+T+LIPVVFPLIISQISG  T PPQGWFKS
Sbjct: 63  VYKAYEGHGERPTIVEIVGWCFYELCSFFVLTLLIPVVFPLIISQISGTPTAPPQGWFKS 122

Query: 122 LMGFHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDYG 181
            MGF CP RE++LYQSLTE TIKV + +FSPLIWTSISWA+GLVL GPILA ASFHLDYG
Sbjct: 123 FMGFDCPLREMQLYQSLTEQTIKVSNAEFSPLIWTSISWAMGLVLAGPILAAASFHLDYG 182

Query: 182 FNQHLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGLA 241
           FNQHLITLAAVAAGAL+CLP G+FKTVKIFPLYI+LIVIA SVAFTSHTRHLGL+LRGL 
Sbjct: 183 FNQHLITLAAVAAGALTCLPTGLFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLT 242

Query: 242 GPTVRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSLW 301
           GP V KAKFS RRIGSG ISS SAAVGG+G++ ISAFTYHMLRRD+Q++E  +NHFL+LW
Sbjct: 243 GPIVHKAKFSLRRIGSGQISSWSAAVGGVGASVISAFTYHMLRRDKQVQEGVDNHFLNLW 302

Query: 302 IVTIFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFAT 361
           IVTIF GLKWL+GIFH+F+TNRS+S++IPS+SELH LSIFK+P+AI +VISGGFLSSFAT
Sbjct: 303 IVTIFAGLKWLIGIFHVFLTNRSISISIPSNSELHILSIFKYPYAIATVISGGFLSSFAT 362

Query: 362 ICIFTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGFI 421
           I IFTAV+LFLIGQICFKPVLI YL LIYFLVPLISLPLLHQFQIRIK+DASKM ILGFI
Sbjct: 363 ISIFTAVLLFLIGQICFKPVLILYLLLIYFLVPLISLPLLHQFQIRIKADASKMLILGFI 422

Query: 422 LSAATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWF 481
           LSAATSATCFYFH   WRR +VFVFA LQGTAAAVL AYGR LVLDCSPAGKE AISMWF
Sbjct: 423 LSAATSATCFYFHAYTWRRHLVFVFAVLQGTAAAVLHAYGRALVLDCSPAGKESAISMWF 482

Query: 482 SWMRVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRDD 541
           SWMR IGGCVGFTVAAVVPA+LQVSSGV FC AVVGGVVLI+GNVTDY GAVAAGHVRDD
Sbjct: 483 SWMRSIGGCVGFTVAAVVPARLQVSSGVVFCCAVVGGVVLIFGNVTDYDGAVAAGHVRDD 542

Query: 542 SEKGSPVIGLESRSESKELESP 559
           SEKGSPVIGL+SRSESKELESP
Sbjct: 543 SEKGSPVIGLDSRSESKELESP 562

BLAST of Tan0003802 vs. ExPASy TrEMBL
Match: A0A0A0L1Q8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006650 PE=4 SV=1)

HSP 1 Score: 847.4 bits (2188), Expect = 3.3e-242
Identity = 448/563 (79.57%), Positives = 488/563 (86.68%), Query Frame = 0

Query: 1   MAEQSPRLEPASEIQNVPP-----FRSASGGRSMSASGGGGSRRETPDVHSTAAKLERAK 60
           M EQSPR    SEI N+PP      RS S  RS ++ GGGGSRRETPD HSTAAKLERAK
Sbjct: 1   MTEQSPR-PKQSEIHNLPPPKSTSARSVSTPRSATSGGGGGSRRETPDFHSTAAKLERAK 60

Query: 61  EVYRAYEGHGERPTIVEIVGWCFYELCSLSVVTVLIPVVFPLIISQISGAATEPPQGWFK 120
           EVYRAYEGHGERPTI EI+GWCFYELCS  V+ +LIPVVFPLIISQISG  T PPQGWFK
Sbjct: 61  EVYRAYEGHGERPTIAEILGWCFYELCSFFVLALLIPVVFPLIISQISGPPTAPPQGWFK 120

Query: 121 SLMGFHCPPREIRLYQSLTEHTIKVGSYQFSPLIWTSISWALGLVLTGPILAFASFHLDY 180
           S  GF C  RE++LYQSLTE TI V + QFSPLIWTSISWA+GLVL GPILA ASFHLDY
Sbjct: 121 SFRGFDCSSREMQLYQSLTEQTINVSNAQFSPLIWTSISWAVGLVLAGPILAVASFHLDY 180

Query: 181 GFNQHLITLAAVAAGALSCLPIGIFKTVKIFPLYIILIVIAQSVAFTSHTRHLGLILRGL 240
           GF+Q+LITLAAVAAGAL+CLP G FKTVKIFPLYIILIVIA SVA TSHTRHLGL+LRGL
Sbjct: 181 GFHQYLITLAAVAAGALTCLPTGFFKTVKIFPLYIILIVIAHSVASTSHTRHLGLMLRGL 240

Query: 241 AGPTVRKAKFSQRRIGSGLISSCSAAVGGLGSAAISAFTYHMLRRDRQLKEDDENHFLSL 300
            GP + KAKFS R IGSG ISS SA VGG+G+AAISAFTYHMLR D+Q++  D +HFL+L
Sbjct: 241 TGPIIHKAKFSLRIIGSGQISSWSAGVGGVGAAAISAFTYHMLRSDKQVQGID-SHFLNL 300

Query: 301 WIVTIFGGLKWLLGIFHIFVTNRSLSVTIPSDSELHTLSIFKFPHAIGSVISGGFLSSFA 360
           WIVTIF GLKWL+GIFH+F+TNRS+SV+IPSDSE+H LSIFK+PHAI +VISGGFLSSFA
Sbjct: 301 WIVTIFAGLKWLIGIFHVFLTNRSISVSIPSDSEIHILSIFKYPHAIATVISGGFLSSFA 360

Query: 361 TICIFTAVILFLIGQICFKPVLIFYLWLIYFLVPLISLPLLHQFQIRIKSDASKMQILGF 420
           TI IFT+V+LFLI QICFKPVLIFYL LIYFLVPLISLPLLHQ QIRIK+DASKM ILGF
Sbjct: 361 TISIFTSVLLFLISQICFKPVLIFYLLLIYFLVPLISLPLLHQLQIRIKADASKMLILGF 420

Query: 421 ILSAATSATCFYFHDGAWRRSVVFVFAALQGTAAAVLQAYGRVLVLDCSPAGKEGAISMW 480
           ILSAATSATCFYFH  AW+R +VFVFA LQGTAAAVL AYGR LV+ CSPAGKE AISMW
Sbjct: 421 ILSAATSATCFYFHAYAWQRHLVFVFAVLQGTAAAVLHAYGRALVVHCSPAGKESAISMW 480

Query: 481 FSWMRVIGGCVGFTVAAVVPAKLQVSSGVAFCSAVVGGVVLIYGNVTDYGGAVAAGHVRD 540
           FSWMR IGGCVGFTVAAVVP  LQVSSGV FC AVVGG++LI+GNVTDY GAVAAGHVRD
Sbjct: 481 FSWMRAIGGCVGFTVAAVVPTMLQVSSGVVFCCAVVGGMLLIFGNVTDYDGAVAAGHVRD 540

Query: 541 DSEKGSPVIGLESRSESKELESP 559
           DSEKGSPV GL+SRSESKELESP
Sbjct: 541 DSEKGSPVFGLDSRSESKELESP 561
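
The percentages in the HSP statistics line follow directly from the count/length pairs. As a minimal sketch (the function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool), the values reported for the A0A0A0L1Q8 match can be re-derived like this:

```python
import re

def parse_hsp_stats(line):
    """Return {label: (count, alignment_length, percent)} for each
    'Label = count/length' pair found in a BLAST HSP statistics line."""
    stats = {}
    for label, count, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(count), int(length)
        # Percent is simply count over alignment length, rounded to 2 places.
        stats[label] = (count, length, round(100 * count / length, 2))
    return stats

# Statistics line for the Cucumis sativus match shown above.
line = "Identity = 448/563 (79.57%), Positives = 488/563 (86.68%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (count, length, pct) in stats.items():
    print(f"{label}: {count}/{length} = {pct}%")
```

The "Query Frame = 0" field has no count/length pair, so the pattern skips it.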

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_022965332.1	7.3e-260	83.36	uncharacterized protein LOC111465229 isoform X1 [Cucurbita maxima]
KAG6577745.1	1.6e-259	83.36	hypothetical protein SDJN03_25319, partial [Cucurbita argyrosperma subsp. sorori...
KAG7015784.1	3.6e-259	83.36	hypothetical protein SDJN02_23422, partial [Cucurbita argyrosperma subsp. argyro...
XP_022965333.1	1.2e-257	83.18	uncharacterized protein LOC111465229 isoform X2 [Cucurbita maxima]
XP_008448612.1	7.8e-254	82.74	PREDICTED: uncharacterized protein LOC103490734 [Cucumis melo] >KAA0053002.1 unc...

Match Name	E-value	Identity	Description
A0A6J1HK19	3.5e-260	83.36	uncharacterized protein LOC111465229 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HNK9	5.7e-258	83.18	uncharacterized protein LOC111465229 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A5D3CJT7	3.8e-254	82.74	Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BK45	3.8e-254	82.74	uncharacterized protein LOC103490734 OS=Cucumis melo OX=3656 GN=LOC103490734 PE=...
A0A0A0L1Q8	3.3e-242	79.57	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006650 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 17..35
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 536..558
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..47
None	No IPR available	PANTHER	PTHR37891	OS06G0113900 PROTEIN	coord: 29..553
IPR036259	MFS transporter superfamily	SUPERFAMILY	103473	MFS general substrate transporter	coord: 151..516
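
The alignment coordinates can be turned into span lengths with simple arithmetic. The sketch below assumes the `coord: start..end` values are 1-based and inclusive on the protein sequence (an assumption, since the page does not state the convention); `span_length` is a hypothetical helper name.

```python
import re

def span_length(coord):
    """Length in residues of a 'coord: start..end' range,
    assuming 1-based, inclusive coordinates."""
    match = re.fullmatch(r"coord: (\d+)\.\.(\d+)", coord.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

# Coordinates copied from the InterPro rows above.
mfs_length = span_length("coord: 151..516")      # SUPERFAMILY MFS match
panther_length = span_length("coord: 29..553")   # PANTHER PTHR37891 match
print(mfs_length, panther_length)
```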

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0003802.1	Tan0003802.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
cellular_component	GO:0016021	integral component of membrane