Tan0012789 (gene) Snake gourd v1

Overview
Name: Tan0012789
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF506)
Location: LG03: 76849808 .. 76850788 (+)
RNA-Seq Expression: Tan0012789
Synteny: Tan0012789
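
The location string in the overview can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a string like 'LG03: 76849808 .. 76850788 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG03: 76849808 .. 76850788 (+)")
print(seqid, strand, span_length(start, end))  # LG03 + 981
```

Under that assumption the gene spans 981 bp on the plus strand of LG03.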
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACGGATTTCACAATTATAGCCAACAAAATGGATAGATTAGCCGAGGTCTTCCGCCACGGCGCTGATGCCGCCCTCAGTGGCAGCGATAATTCGCCACAAAACTCCGCCACCGACCTCTTCGACCTTGTCAAGTCCTTCATTGAAATGGATGATATGGAAATTAACGATGGGGAGAAGGAAGATGGCAGCAGAGAAGAATCAGACGGGTTCTCTTGTGATTCGGATGCAGGGGTAATCAAATTACGTAATCTGTTTGGTTCCCGTGAAAGTAAGAACGACGAAGAAATCAGAATTGAAGCAGAACAAGCACTGAAGAAGCTCGTCGGAGGAAGATCGTTTCAGGGGATTAAACGACAATTGATGGCGCATTTGCGCAGAAAAGGCTTCGATGCCGGTGAGTTTTGAGTTCCATTTTCACTTTAAATTAAACCCCATTTTAGAAAAAGTTGAACAGGAGTTTAATTCAATTGATTATGTTTGGCTCGAGCAGGACTCTGCAAATCCAAGGTGGAGAAACTCCAGTCATTTCCGCCAGGGAACCATGAGTACATCGACGTCAATTTTGGTGGAAATCGGTACATCGTAGAAATTTTTCTAGCCAGAGAATTTGAAATTGCCCGTCCGACCAGTAAATACATTTCATTTCTCAACACATTTCCAGAGATATTCGTCGGAACTTTGGAGGATTTGAAGCAGGTGGTGAAACTAATGTGCTCTGCCATGAAAGAGTCCATGAAAATGAGGAACATGCATGTACCTCCATGGAGAAGAAACGGGTACATGCAGGCAAAATGGTTCAGTTCTTACAAGCGGACCACAAACCATAAAGTCTCAGGATCGGCAGAAGCAGAAACTCTACTGCCGGAAATGGGTTCGGTCAGTTTCAAACCCAACCATTGCAGGGGAGATTTTGGTCGAAACGCAGGTATCAAAGTTGGAAATTTAACTGCTGTTTTTGGTGGCAATGGGTTGCTACTGTAA

mRNA sequence

ACGGATTTCACAATTATAGCCAACAAAATGGATAGATTAGCCGAGGTCTTCCGCCACGGCGCTGATGCCGCCCTCAGTGGCAGCGATAATTCGCCACAAAACTCCGCCACCGACCTCTTCGACCTTGTCAAGTCCTTCATTGAAATGGATGATATGGAAATTAACGATGGGGAGAAGGAAGATGGCAGCAGAGAAGAATCAGACGGGTTCTCTTGTGATTCGGATGCAGGGGTAATCAAATTACGTAATCTGTTTGGTTCCCGTGAAAGTAAGAACGACGAAGAAATCAGAATTGAAGCAGAACAAGCACTGAAGAAGCTCGTCGGAGGAAGATCGTTTCAGGGGATTAAACGACAATTGATGGCGCATTTGCGCAGAAAAGGCTTCGATGCCGGACTCTGCAAATCCAAGGTGGAGAAACTCCAGTCATTTCCGCCAGGGAACCATGAGTACATCGACGTCAATTTTGGTGGAAATCGGTACATCGTAGAAATTTTTCTAGCCAGAGAATTTGAAATTGCCCGTCCGACCAGTAAATACATTTCATTTCTCAACACATTTCCAGAGATATTCGTCGGAACTTTGGAGGATTTGAAGCAGGTGGTGAAACTAATGTGCTCTGCCATGAAAGAGTCCATGAAAATGAGGAACATGCATGTACCTCCATGGAGAAGAAACGGGTACATGCAGGCAAAATGGTTCAGTTCTTACAAGCGGACCACAAACCATAAAGTCTCAGGATCGGCAGAAGCAGAAACTCTACTGCCGGAAATGGGTTCGGTCAGTTTCAAACCCAACCATTGCAGGGGAGATTTTGGTCGAAACGCAGGTATCAAAGTTGGAAATTTAACTGCTGTTTTTGGTGGCAATGGGTTGCTACTGTAA

Coding sequence (CDS)

ATGGATAGATTAGCCGAGGTCTTCCGCCACGGCGCTGATGCCGCCCTCAGTGGCAGCGATAATTCGCCACAAAACTCCGCCACCGACCTCTTCGACCTTGTCAAGTCCTTCATTGAAATGGATGATATGGAAATTAACGATGGGGAGAAGGAAGATGGCAGCAGAGAAGAATCAGACGGGTTCTCTTGTGATTCGGATGCAGGGGTAATCAAATTACGTAATCTGTTTGGTTCCCGTGAAAGTAAGAACGACGAAGAAATCAGAATTGAAGCAGAACAAGCACTGAAGAAGCTCGTCGGAGGAAGATCGTTTCAGGGGATTAAACGACAATTGATGGCGCATTTGCGCAGAAAAGGCTTCGATGCCGGACTCTGCAAATCCAAGGTGGAGAAACTCCAGTCATTTCCGCCAGGGAACCATGAGTACATCGACGTCAATTTTGGTGGAAATCGGTACATCGTAGAAATTTTTCTAGCCAGAGAATTTGAAATTGCCCGTCCGACCAGTAAATACATTTCATTTCTCAACACATTTCCAGAGATATTCGTCGGAACTTTGGAGGATTTGAAGCAGGTGGTGAAACTAATGTGCTCTGCCATGAAAGAGTCCATGAAAATGAGGAACATGCATGTACCTCCATGGAGAAGAAACGGGTACATGCAGGCAAAATGGTTCAGTTCTTACAAGCGGACCACAAACCATAAAGTCTCAGGATCGGCAGAAGCAGAAACTCTACTGCCGGAAATGGGTTCGGTCAGTTTCAAACCCAACCATTGCAGGGGAGATTTTGGTCGAAACGCAGGTATCAAAGTTGGAAATTTAACTGCTGTTTTTGGTGGCAATGGGTTGCTACTGTAA

Protein sequence

MDRLAEVFRHGADAALSGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREESDGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL
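
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. The sketch below is illustrative only: `translate` is a hypothetical helper, it assumes the standard nuclear genetic code, and it is applied to the first eight codons and the last four codons of the CDS shown above rather than to the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt and last 12 nt of the CDS listed above.
print(translate("ATGGATAGATTAGCCGAGGTCTTC"))  # MDRLAEVF
print(translate("TTGCTACTGTAA"))              # LLL
```

The two fragments reproduce the first eight residues (MDRLAEVF) and the final three residues (LLL) of the protein sequence, with TAA as the stop codon.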
Homology
BLAST of Tan0012789 vs. NCBI nr
Match: XP_022926904.1 (uncharacterized protein LOC111433882 [Cucurbita moschata])

HSP 1 Score: 464.5 bits (1194), Expect = 6.4e-127
Identity = 234/289 (80.97%), Positives = 257/289 (88.93%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSRE 60
           MDR AE+FRHGA+AA+    SGSD+SP+NSA DLFDLVKSF+E DD+EIN+GE+EDG +E
Sbjct: 5   MDRFAEIFRHGAEAAVWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGGKE 64

Query: 61  ESDGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLR 120
           ESD FSCDSDAGVIKL+NLFGSR++++D EIRIEAEQALKKLVGGRSFQGIKR+LMAHLR
Sbjct: 65  ESDSFSCDSDAGVIKLKNLFGSRDNESD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHLR 124

Query: 121 RKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLN 180
           RKGFDAGLCKSK EKLQSFP G+HEYIDVNFGGNRYIVE+FLAREFEIARPT KY S LN
Sbjct: 125 RKGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLN 184

Query: 181 TFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKV 240
           TFPEIFVG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTNHK 
Sbjct: 185 TFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHKG 244

Query: 241 SGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           SGSAEAET  P M S  FK +HCRGDFGRN GI VGNLTA FG +GLLL
Sbjct: 245 SGSAEAET-SPGMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of Tan0012789 vs. NCBI nr
Match: XP_023517353.1 (uncharacterized protein LOC111781137 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 461.5 bits (1186), Expect = 5.4e-126
Identity = 233/289 (80.62%), Positives = 255/289 (88.24%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSRE 60
           MDR AE+FRHGA+AA+    SGS++SP+NSA DLFDLVKSF+E DD+EIN+GE+EDG +E
Sbjct: 5   MDRFAEIFRHGAEAAVRDTSSGSEHSPENSAADLFDLVKSFMERDDVEINEGEEEDGGKE 64

Query: 61  ESDGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLR 120
           ESD FSCDSDAGVIKLRNLFGSR++K+D EIRIEAEQALKKLVGGRSFQGIKR+LMAHLR
Sbjct: 65  ESDSFSCDSDAGVIKLRNLFGSRDNKSD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHLR 124

Query: 121 RKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLN 180
           RKGFDAGLCKSK EKL SFP G+HEYIDVNF GNRYIVE+FLAREFEIARPT KY S LN
Sbjct: 125 RKGFDAGLCKSKGEKLHSFPAGDHEYIDVNFSGNRYIVEVFLAREFEIARPTRKYTSLLN 184

Query: 181 TFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKV 240
           TFPEIFVG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTNHK 
Sbjct: 185 TFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHKG 244

Query: 241 SGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           SGSAEAET  P M S  FK +HCRGDFGRN GI VGNLTA FG +GLLL
Sbjct: 245 SGSAEAET-SPGMSSACFKASHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of Tan0012789 vs. NCBI nr
Match: XP_023003814.1 (uncharacterized protein LOC111497287 [Cucurbita maxima])

HSP 1 Score: 461.1 bits (1185), Expect = 7.1e-126
Identity = 239/291 (82.13%), Positives = 258/291 (88.66%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSRE 60
           MDR AE+FRHGA+AAL    SGSD+SP+NSA DLFDLVKSF+E DD+EIN+GE+EDGS E
Sbjct: 5   MDRFAEIFRHGAEAALWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGSTE 64

Query: 61  ESD-GFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHL 120
           ESD GFSCDSDAGVIKL+NLFGSR++K+D EIRIEAEQALKKLVGGRSFQGIKR+LMAHL
Sbjct: 65  ESDGGFSCDSDAGVIKLKNLFGSRDNKSD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHL 124

Query: 121 RRKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFL 180
           RRKGFDAGLCKSK EKLQSFP G+HEYIDVNFGGNRYIVEIFLAREFEIARPT KY S L
Sbjct: 125 RRKGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTRKYTSLL 184

Query: 181 NTFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHK 240
           NTFPEIFVG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTNHK
Sbjct: 185 NTFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHK 244

Query: 241 VSGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFG-GNGLLL 286
            SGSAEAET  P M S  FK +HCRGDFGRN GI VGNLTA FG  +GLLL
Sbjct: 245 GSGSAEAET-SPGMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGAADGLLL 293

BLAST of Tan0012789 vs. NCBI nr
Match: KAG6594690.1 (hypothetical protein SDJN03_11243, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 460.3 bits (1183), Expect = 1.2e-125
Identity = 235/289 (81.31%), Positives = 255/289 (88.24%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSRE 60
           MDR AE+FRHGA+AAL    SGSD+SP+NSA DLFDLVKSF+E DD+EIN+GE+ED   E
Sbjct: 5   MDRFAEMFRHGAEAALWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDRGTE 64

Query: 61  ESDGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLR 120
           ESDGFSCDSDAGVIKL+NLFGSR++K+D EIRIEAEQALKKLVGGRSFQGIKR+LMAHLR
Sbjct: 65  ESDGFSCDSDAGVIKLKNLFGSRDNKSD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHLR 124

Query: 121 RKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLN 180
           RKGFDAGLCKSK EKLQSFP G+HEYIDVNFGGNRYIVE+FLAREFEIARPT KY S LN
Sbjct: 125 RKGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLN 184

Query: 181 TFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKV 240
           TFPEIFVG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTN K 
Sbjct: 185 TFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNLKG 244

Query: 241 SGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           SGSAEAET  P M S  FK +HCRGDFGRN GI VGNLTA FG +GLLL
Sbjct: 245 SGSAEAET-SPGMSSACFKASHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of Tan0012789 vs. NCBI nr
Match: KAG7026658.1 (hypothetical protein SDJN02_10661, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 450.3 bits (1157), Expect = 1.3e-122
Identity = 230/283 (81.27%), Positives = 250/283 (88.34%), Query Frame = 0

Query: 7   VFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREESDGFS 66
           +FRHGA+AAL    SGSD+SP+NSA DLFDLVKSF+E DD+EIN+GE+ED   EESDGFS
Sbjct: 1   MFRHGAEAALWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDRGTEESDGFS 60

Query: 67  CDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRKGFDA 126
           CDSDAGVIKL+NLFGSR++K+D EIRIEAEQALKKLVGGRSFQGIKR+LMAHLRRKGFDA
Sbjct: 61  CDSDAGVIKLKNLFGSRDNKSD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHLRRKGFDA 120

Query: 127 GLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTFPEIF 186
           GLCKSK EKLQSFP G+HEYIDVNFGGNRYIVE+FLAREFEIARPT KY S LNTFPEIF
Sbjct: 121 GLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLNTFPEIF 180

Query: 187 VGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSGSAEA 246
           VG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTN K SGSAEA
Sbjct: 181 VGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNLKGSGSAEA 240

Query: 247 ETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           ET  P M S  FK +HCRGDFGRN GI VGNLTA FG +GLLL
Sbjct: 241 ET-SPGMSSACFKASHCRGDFGRNRGIMVGNLTAAFGADGLLL 281

BLAST of Tan0012789 vs. ExPASy TrEMBL
Match: A0A6J1EGH2 (uncharacterized protein LOC111433882 OS=Cucurbita moschata OX=3662 GN=LOC111433882 PE=4 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 3.1e-127
Identity = 234/289 (80.97%), Positives = 257/289 (88.93%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSRE 60
           MDR AE+FRHGA+AA+    SGSD+SP+NSA DLFDLVKSF+E DD+EIN+GE+EDG +E
Sbjct: 5   MDRFAEIFRHGAEAAVWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGGKE 64

Query: 61  ESDGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLR 120
           ESD FSCDSDAGVIKL+NLFGSR++++D EIRIEAEQALKKLVGGRSFQGIKR+LMAHLR
Sbjct: 65  ESDSFSCDSDAGVIKLKNLFGSRDNESD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHLR 124

Query: 121 RKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLN 180
           RKGFDAGLCKSK EKLQSFP G+HEYIDVNFGGNRYIVE+FLAREFEIARPT KY S LN
Sbjct: 125 RKGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEVFLAREFEIARPTRKYTSLLN 184

Query: 181 TFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKV 240
           TFPEIFVG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTNHK 
Sbjct: 185 TFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHKG 244

Query: 241 SGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           SGSAEAET  P M S  FK +HCRGDFGRN GI VGNLTA FG +GLLL
Sbjct: 245 SGSAEAET-SPGMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGADGLLL 291

BLAST of Tan0012789 vs. ExPASy TrEMBL
Match: A0A6J1KUE8 (uncharacterized protein LOC111497287 OS=Cucurbita maxima OX=3661 GN=LOC111497287 PE=4 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 3.4e-126
Identity = 239/291 (82.13%), Positives = 258/291 (88.66%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL----SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSRE 60
           MDR AE+FRHGA+AAL    SGSD+SP+NSA DLFDLVKSF+E DD+EIN+GE+EDGS E
Sbjct: 5   MDRFAEIFRHGAEAALWDTSSGSDHSPENSAADLFDLVKSFMERDDVEINEGEEEDGSTE 64

Query: 61  ESD-GFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHL 120
           ESD GFSCDSDAGVIKL+NLFGSR++K+D EIRIEAEQALKKLVGGRSFQGIKR+LMAHL
Sbjct: 65  ESDGGFSCDSDAGVIKLKNLFGSRDNKSD-EIRIEAEQALKKLVGGRSFQGIKRKLMAHL 124

Query: 121 RRKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFL 180
           RRKGFDAGLCKSK EKLQSFP G+HEYIDVNFGGNRYIVEIFLAREFEIARPT KY S L
Sbjct: 125 RRKGFDAGLCKSKGEKLQSFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTRKYTSLL 184

Query: 181 NTFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHK 240
           NTFPEIFVG LE+LKQVVKLMCSAMK+SM +RNMHVPPWRR GYMQ KWF SYKRTTNHK
Sbjct: 185 NTFPEIFVGNLEELKQVVKLMCSAMKQSMNIRNMHVPPWRRKGYMQEKWFGSYKRTTNHK 244

Query: 241 VSGSAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFG-GNGLLL 286
            SGSAEAET  P M S  FK +HCRGDFGRN GI VGNLTA FG  +GLLL
Sbjct: 245 GSGSAEAET-SPGMSSACFKTSHCRGDFGRNRGIMVGNLTAAFGAADGLLL 293

BLAST of Tan0012789 vs. ExPASy TrEMBL
Match: A0A5D3CPA4 (DUF506 family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G006060 PE=4 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 3.9e-122
Identity = 230/287 (80.14%), Positives = 249/287 (86.76%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL--SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREES 60
           MDRLA VFR+GAD+++  SGSD+SP+    DLFDLVKSFIE  D E  +GE ED   EES
Sbjct: 1   MDRLAAVFRYGADSSVWESGSDHSPEKPTADLFDLVKSFIEKGDFEFKEGETEDSCTEES 60

Query: 61  DGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRK 120
           DGFS DSDAGV+KLRNLFGS E+KN EEIRIE EQAL KLVGGRS  GI RQLMAHLRRK
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSLENKN-EEIRIETEQAL-KLVGGRSVPGINRQLMAHLRRK 120

Query: 121 GFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTF 180
           GFDAGLCKSK+EKL++FP G+HEYIDVNFGGNRYIVEIFLAREFEIARPTSKY+S LNTF
Sbjct: 121 GFDAGLCKSKMEKLRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTF 180

Query: 181 PEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSG 240
           PEIFVGTL++LKQVVKLMCSAMKESMK RNMH+PPWRRNGYMQAKWF SYKRTTNHKVSG
Sbjct: 181 PEIFVGTLDELKQVVKLMCSAMKESMKKRNMHIPPWRRNGYMQAKWFGSYKRTTNHKVSG 240

Query: 241 SAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           SAEAET   EM    FK  +CRGDFGRNAGI+VGNLTAVFGGN LLL
Sbjct: 241 SAEAETSPSEMSLPCFKSYYCRGDFGRNAGIRVGNLTAVFGGNELLL 285

BLAST of Tan0012789 vs. ExPASy TrEMBL
Match: A0A1S3B074 (uncharacterized protein LOC103484651 OS=Cucumis melo OX=3656 GN=LOC103484651 PE=4 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 3.9e-122
Identity = 230/287 (80.14%), Positives = 249/287 (86.76%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL--SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREES 60
           MDRLA VFR+GAD+++  SGSD+SP+    DLFDLVKSFIE  D E  +GE ED   EES
Sbjct: 1   MDRLAAVFRYGADSSVWESGSDHSPEKPTADLFDLVKSFIEKGDFEFKEGETEDSCTEES 60

Query: 61  DGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRK 120
           DGFS DSDAGV+KLRNLFGS E+KN EEIRIE EQAL KLVGGRS  GI RQLMAHLRRK
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSLENKN-EEIRIETEQAL-KLVGGRSVPGINRQLMAHLRRK 120

Query: 121 GFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTF 180
           GFDAGLCKSK+EKL++FP G+HEYIDVNFGGNRYIVEIFLAREFEIARPTSKY+S LNTF
Sbjct: 121 GFDAGLCKSKMEKLRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTF 180

Query: 181 PEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSG 240
           PEIFVGTL++LKQVVKLMCSAMKESMK RNMH+PPWRRNGYMQAKWF SYKRTTNHKVSG
Sbjct: 181 PEIFVGTLDELKQVVKLMCSAMKESMKKRNMHIPPWRRNGYMQAKWFGSYKRTTNHKVSG 240

Query: 241 SAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           SAEAET   EM    FK  +CRGDFGRNAGI+VGNLTAVFGGN LLL
Sbjct: 241 SAEAETSPSEMSLPCFKSYYCRGDFGRNAGIRVGNLTAVFGGNELLL 285

BLAST of Tan0012789 vs. ExPASy TrEMBL
Match: A0A0A0KL43 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511780 PE=4 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 8.2e-104
Identity = 203/287 (70.73%), Positives = 225/287 (78.40%), Query Frame = 0

Query: 1   MDRLAEVFRHGADAAL--SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREES 60
           MDRLA +FRH AD++   SGSD+SP+    DLFDLVKSFIE  D+E  +GE+ED   EES
Sbjct: 1   MDRLAALFRHRADSSFSESGSDHSPEKPTADLFDLVKSFIEKGDLEFKEGEREDCCTEES 60

Query: 61  DGFSCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRK 120
           DGFS DSDAGV+KLRNLFGS E+KN EEIRIE EQALK +                    
Sbjct: 61  DGFSFDSDAGVVKLRNLFGSVENKN-EEIRIETEQALKLV-------------------- 120

Query: 121 GFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTF 180
               GLCKSK+EK ++FP G+HEYIDVNFGGNRYIVEIFLAREFEIARPTSKY+S LNTF
Sbjct: 121 ----GLCKSKMEKPRAFPAGDHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYVSLLNTF 180

Query: 181 PEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSG 240
           PEIFVGTL++LK VVKLMCSAMKESMK  NMHVPPWRRNGYMQAKWF SYKRTTNHKVSG
Sbjct: 181 PEIFVGTLDELKHVVKLMCSAMKESMKKMNMHVPPWRRNGYMQAKWFGSYKRTTNHKVSG 240

Query: 241 SAEAETLLPEMGSVSFKPNHCRGDFGRNAGIKVGNLTAVFGGNGLLL 286
           S+EAET   E+    FK  HCRGDFGRNAGI+VGNLTAVFGGN LL+
Sbjct: 241 SSEAETSPAEISLPCFKSYHCRGDFGRNAGIRVGNLTAVFGGNELLM 262

BLAST of Tan0012789 vs. TAIR 10
Match: AT1G12030.1 (Protein of unknown function (DUF506))

HSP 1 Score: 195.3 bits (495), Expect = 6.8e-50
Identity = 124/273 (45.42%), Positives = 163/273 (59.71%), Query Frame = 0

Query: 14  AALSGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREESDGFSCDSDAGVIKLR 73
           A+ SGSD+SP ++  DL+DLV+SFI   D E+    ++    EE D    D +    +LR
Sbjct: 27  ASSSGSDHSPDDT-EDLWDLVESFI---DREVETLPEDAFQEEEDDKSDEDYEDVKERLR 86

Query: 74  NLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRKGFDAGLCKSKVEKLQ 133
            +  +   +  + I  EA  A       R F G KR  MA+LR KGFDAGLCKS+ EK  
Sbjct: 87  EILENHGGEERQRIMDEAVNA------SRVFAGEKRHFMAYLRNKGFDAGLCKSRWEKFG 146

Query: 134 SFPPGNHEYIDVNFGG-NRYIVEIFLAREFEIARPTSKYISFLNTFPEIFVGTLEDLKQV 193
               G +EY+DV  G  NRYIVE  LA EFEIARPT++Y+S L   P +FVGT E+LKQ+
Sbjct: 147 KNTAGKYEYVDVKAGDKNRYIVETNLAGEFEIARPTTRYLSVLAQVPRVFVGTPEELKQL 206

Query: 194 VKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSGSAEAETLLPEMG-- 253
           V++MC  ++ SMK  ++ VPPWRRNGYMQAKWF  YKRT+N  VS   ++    P +G  
Sbjct: 207 VRIMCFEIRRSMKRADIFVPPWRRNGYMQAKWFGHYKRTSNEVVS-RVKSCGCGPRVGFE 266

Query: 254 -SVSFKP-NHCRGDFGRNAGIKVGNLTAVFGGN 282
            SV     N  +    R +G+KVG LT  F G+
Sbjct: 267 ESVKMTTFNGFKDGEMRRSGLKVGQLTVAFNGS 288

BLAST of Tan0012789 vs. TAIR 10
Match: AT1G62420.1 (Protein of unknown function (DUF506))

HSP 1 Score: 174.5 bits (441), Expect = 1.2e-43
Identity = 116/271 (42.80%), Positives = 151/271 (55.72%), Query Frame = 0

Query: 17  SGSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREESDGFSCDSDAGVI--KLRN 76
           SGSD+SP     DL DLV SFIE +   +         REE +  S D++   +  +LR 
Sbjct: 30  SGSDHSP-----DLSDLVASFIEKEGQIV--------LREEEETSSDDNNLEDVNERLRK 89

Query: 77  LFGSRESKNDEEIRIEAEQALKKLVGG--RSFQGIKRQLMAHLRRKGFDAGLCKSKVEKL 136
           L    E  +  E R+    A  ++ G         KR LMA LR KGFDAGLCKS  E+ 
Sbjct: 90  LL---EGLSCGEERMRILSATMEVAGTFVGDISSSKRHLMAFLRNKGFDAGLCKSSWERF 149

Query: 137 QSFPPGNHEYIDVNFGG---NRYIVEIFLAREFEIARPTSKYISFLNTFPEIFVGTLEDL 196
                G +EY+DV  GG   NRY VE  LA EFEIARPT +Y+S L+  P +FVGT E+L
Sbjct: 150 GKNTGGKYEYVDVRCGGDYNNRYFVETNLAGEFEIARPTKRYLSILSQVPRVFVGTSEEL 209

Query: 197 KQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSGSAEAETLLPEM 256
           K +V++MC  M+ SMK   +HVPPWRRNGYMQAKWF  YKRT+      +     ++   
Sbjct: 210 KLLVRIMCHEMRRSMKHVGIHVPPWRRNGYMQAKWFGFYKRTS------TTNNYEMVNTY 269

Query: 257 GSVSFKPNHCRGDFGRNAGIK--VGNLTAVF 279
            + +FK   C+ +F    G+K  VG L+  F
Sbjct: 270 DTTAFK--GCKEEFWEAKGLKVMVGQLSIAF 276

BLAST of Tan0012789 vs. TAIR 10
Match: AT4G14620.1 (Protein of unknown function (DUF506))

HSP 1 Score: 125.9 bits (315), Expect = 5.0e-29
Identity = 77/247 (31.17%), Positives = 130/247 (52.63%), Query Frame = 0

Query: 13  DAALSGSDNSPQNSATDLFDLVKSFI-EMDDMEINDGE----------KEDGSREESDGF 72
           D  ++G++  P      L  +V++++ E +D +  +G             D S +E D F
Sbjct: 50  DGVVAGTEFEP-----SLAKMVQNYMEENNDKQTKNGRNTHRCNCFNGNNDISDDELDFF 109

Query: 73  SCDSDAGVIKLRNLFGSRESKNDEEIRIEAEQALKKLVGGRSFQGIKRQLMAHLRRKGFD 132
             D+   +I+  +         ++ + +EA + ++K    +    +++ ++  L   G+D
Sbjct: 110 DYDNFKSLIQCGSFV-------EKSLLVEATKIIEKNKSVKRKDELRKIVVDELSSLGYD 169

Query: 133 AGLCKSKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTFPEI 192
           + +CKSK +K +S P G +EYIDV   G R I++I    EFEIAR TS Y   L + P I
Sbjct: 170 SSICKSKWDKTRSIPAGEYEYIDVIVNGERLIIDIDFRSEFEIARQTSGYKELLQSLPLI 229

Query: 193 FVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSGSAE 249
           FVG  + ++Q+V ++  A K+S+K + MH PPWR+  YM+AKW SSY R +  K      
Sbjct: 230 FVGKSDRIRQIVSIVSEASKQSLKKKGMHFPPWRKADYMRAKWLSSYTRNSGEKKPTVTS 284

BLAST of Tan0012789 vs. TAIR 10
Match: AT3G07350.1 (Protein of unknown function (DUF506))

HSP 1 Score: 124.8 bits (312), Expect = 1.1e-28
Identity = 83/245 (33.88%), Positives = 137/245 (55.92%), Query Frame = 0

Query: 8   FRHGADAALSGSDNSPQNSATDLFDLVKSFIEMDDMEINDGE-----KEDGSREESD--- 67
           F  G++    G ++   + +  L DLV+ F+E D+++  D E     ++ GS  +SD   
Sbjct: 26  FSSGSEHTGDGIEDYEDDDSPCLSDLVQGFLE-DEVDTVDDESCWCDQDSGSDSDSDSEL 85

Query: 68  GFSCDSDAGVIK-LRNLFGSRESKNDEEIRIEAEQALKKL--VGGRSFQG--IKRQLMAH 127
           G   D    + K LRN    RE      + +   +A++ L  +G +  Q    +R++M+ 
Sbjct: 86  GELPDFADDIAKLLRN--SLREDSYGRTVLVHVARAMEMLSSLGSQPEQRAVFQRKVMSL 145

Query: 128 LRRKGFDAGLCKSKVEKLQSFPPGNHEYIDVNFGGN------RYIVEIFLAREFEIARPT 187
           LR  G +A +CK+K +       GNHE+IDV +  +      R+IV++  +  F+IARPT
Sbjct: 146 LRELGHNAAICKTKWKSSGGLTAGNHEFIDVVYTPSASSQSVRFIVDLDFSSRFQIARPT 205

Query: 188 SKYISFLNTFPEIFVGTLEDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSY 234
           S+Y   L + P +FVG  +DLK++++L+C A + S++ R + +PPWR+N YMQ +W   Y
Sbjct: 206 SQYARVLQSLPAVFVGKGDDLKRILRLVCDAARISLRNRGLTLPPWRKNRYMQTRWLGPY 265

BLAST of Tan0012789 vs. TAIR 10
Match: AT2G38820.1 (Protein of unknown function (DUF506))

HSP 1 Score: 121.7 bits (304), Expect = 9.5e-28
Identity = 86/247 (34.82%), Positives = 132/247 (53.44%), Query Frame = 0

Query: 13  DAALS-GSDNSPQNSATDLFDLVKSFIEMDDMEINDGEKEDGSREESDGFSCDSDAGVIK 72
           +A LS G+    + S+  L  +V +F+E    + N GEK+   R   + FS         
Sbjct: 56  EAPLSRGNSGDFEPSSVCLAKMVLNFME----DNNGGEKQRCGRSRCNCFS--------- 115

Query: 73  LRNLFGSRESKNDEEIRI---EAEQALKKLVGGRSFQGIKRQL--MAHLRRKGFDAGLCK 132
                GS    +D+E      EA + LK LV  +S + ++  L  +  +    +DA LCK
Sbjct: 116 -----GSGTESSDDETECSSGEACEILKSLVLCKSIR-VRNLLTDVTKIAETSYDAALCK 175

Query: 133 SKVEKLQSFPPGNHEYIDVNFGGNRYIVEIFLAREFEIARPTSKYISFLNTFPEIFVGTL 192
           S+ EK  S P G +EY+DV   G R +++I    +FEIAR T  Y S L T P IFVG  
Sbjct: 176 SRWEKSPSCPAGEYEYVDVIMKGERLLIDIDFKSKFEIARATKTYKSMLQTLPYIFVGKA 235

Query: 193 EDLKQVVKLMCSAMKESMKMRNMHVPPWRRNGYMQAKWFSSYKRTTNHKVSGSAEAETLL 252
           + L++++ L+C A K+S+K + +HVPPWRR  Y+++KW SS+ R   +  +G  + E++ 
Sbjct: 236 DRLQKIIVLICKAAKQSLKKKGLHVPPWRRAEYVKSKWLSSHVRVDQNS-NGEVKQESVE 282

Query: 253 PEMGSVS 254
               SVS
Sbjct: 296 VIAESVS 282
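
The percentages on each HSP statistics line follow directly from the residue counts (matches divided by alignment length). The following is a rough sketch for re-deriving them from a statistics line; `hsp_percentages` is a hypothetical helper, and the pattern deliberately accepts both the spelling "Positives" and the variant "Postives" that some BLAST report renderers emit.

```python
import re

# Matches e.g. "Identity = 234/289 (80.97%), Positives = 257/289 (88.93%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Recompute identity and positive percentages from the raw counts on an HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct

# Statistics line of the top NCBI nr hit (XP_022926904.1).
line = "Identity = 234/289 (80.97%), Positives = 257/289 (88.93%), Query Frame = 0"
print(hsp_percentages(line))  # (80.97, 88.93)
```

For the top hit, 234 identical positions over an alignment length of 289 give 80.97% identity, and 257 conservative-or-identical positions give 88.93% positives, in agreement with the reported values.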

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity  Description
XP_022926904.1  6.4e-127  80.97     uncharacterized protein LOC111433882 [Cucurbita moschata]
XP_023517353.1  5.4e-126  80.62     uncharacterized protein LOC111781137 [Cucurbita pepo subsp. pepo]
XP_023003814.1  7.1e-126  82.13     uncharacterized protein LOC111497287 [Cucurbita maxima]
KAG6594690.1    1.2e-125  81.31     hypothetical protein SDJN03_11243, partial [Cucurbita argyrosperma subsp. sorori...
KAG7026658.1    1.3e-122  81.27     hypothetical protein SDJN02_10661, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1EGH2      3.1e-127  80.97     uncharacterized protein LOC111433882 OS=Cucurbita moschata OX=3662 GN=LOC1114338...
A0A6J1KUE8      3.4e-126  82.13     uncharacterized protein LOC111497287 OS=Cucurbita maxima OX=3661 GN=LOC111497287...
A0A5D3CPA4      3.9e-122  80.14     DUF506 family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25...
A0A1S3B074      3.9e-122  80.14     uncharacterized protein LOC103484651 OS=Cucumis melo OX=3656 GN=LOC103484651 PE=...
A0A0A0KL43      8.2e-104  70.73     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511780 PE=4 SV=1

TAIR 10
Match Name      E-value   Identity  Description
AT1G12030.1     6.8e-50   45.42     Protein of unknown function (DUF506)
AT1G62420.1     1.2e-43   42.80     Protein of unknown function (DUF506)
AT4G14620.1     5.0e-29   31.17     Protein of unknown function (DUF506)
AT3G07350.1     1.1e-28   33.88     Protein of unknown function (DUF506)
AT2G38820.1     9.5e-28   34.82     Protein of unknown function (DUF506)
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                          Source   Source Term     Source Description              Alignment
IPR006502  Protein of unknown function PDDEXK-like  TIGRFAM  TIGR01615       TIGR01615                       coord: 108..231; e-value: 2.5E-46; score: 154.7
IPR006502  Protein of unknown function PDDEXK-like  PFAM     PF04720         PDDEXK_6                        coord: 32..230; e-value: 2.3E-57; score: 194.5
IPR006502  Protein of unknown function PDDEXK-like  PANTHER  PTHR31579       OS03G0796600 PROTEIN            coord: 5..280
None       No IPR available                         PANTHER  PTHR31579:SF42  DUF506 FAMILY PROTEIN (DUF506)  coord: 5..280

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0012789.1  Tan0012789.1  mRNA