Tan0021539 (gene) Snake gourd v1

Overview
NameTan0021539
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF2301 domain-containing protein
LocationLG06: 19031439 .. 19035082 (-)
RNA-Seq ExpressionTan0021539
SyntenyTan0021539
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTGTGGATGTTTGTGTGCCTCCATTCTTTCTCCTTCGAAGCTTCCTTCTCTTAATTATTCAGCTGATACTAAAACTAAGCTGCTGCTTCGTTCCCCAGTTGCTTTCCCTTCGCCATCTAAGTTATCAGCTCTCAAGTGCAAGGCTGCTGGCCAAACCTCGCCCAGTTCCACCGTCTATCAGGGAATTTACGGTCCTTGGACCGTTGATCCTTCCGACGTTCGAGAGGTTCCGATTTCTTTCTCCTTCTCTTTCAATTCTTTCTAATTGTTGTCTCTTACAATTTGGGATTTGCGGTTGCCCTGTAACTTGCTACTGATTATGTGTATTAACTTCAATATTCGTGTGTATTATAGTATGTTACTATTCATCTGTCCTCCTTCATTTGTTGGGTTTTCTCTGTTGTTGGGCCAACTTGCATCTGACGGTTTATATGTTTCATTCTATTAGCGCTTGCGAAGCTATCTAGTTTACATGGATTCTTAGTTACTTTTAAGGATGAGATTCTTTAAAAATTTTACAACCCTAATGAACGAGTAATTATGATATTTACTCCAATGAATAATTGGCCTGAATTGTACAAGAATTTGTTTTTCTATATGGCTATCTATTAGCATGACCCAAGTTATCCAGGTTTATATGAATTCTTAGTTCGAAGGATGAAATTGGTTAAAGACTTGGCGACCGAATGGGTTCATTATTGCGTTAAGAAAATGAGTCTATGCCATTTACTTCGATAAAATAGAGCCAAAATAATAATGTACTCGGCGAGTTACTTGGCAAATAAATGTAATAGGGTCAAATAGTTGTTTCGCGAGAATAATTGAGGTGTGCAAGCTGGCTCAGTCACTCAAGGGTATAAAAAAAGAAAAAAAAACCTTGAATGATACAAGAATTTGTTTGAGTACTATAAAAAAGTAATGTAGCCGCTTTTTCAAGCATCATATAGAAATAAATCTCTCTTGTCTTCTTCTAGGTCTCTTATTCCCCTTTCTAATCTTTTGGTTGTGGGTGGGACTCAAGTGGCGCATTCTCAATGACAACGGTGCTAGGCTTCTAAGAAACATGCAAACGAGTAATGACTAATTCAACCTTGATAGTACTTGTTAATTAAGTGATTTTAGGTTCATAGTTTCTCGATTGCATATCGTGTTTCTCACAATTTATTTTTCACTCAAATTTCAAAATTATTAAACAGGAAATTCAAATTTCCTTTTTTTTTAATCCACTTAAATGTTGGTGTGTTGTCTTCATCTGGAATTGTCATGCACCCCTTGGTCATATTATTGTAGCCTTCCTAATGTTATGTCATCCAAGCATCCTTACTGACAAGATCATGGTGCCTGGCTCAATAATTATATATGATTTGGTCATAATGCTTAAACCTGATAGCAGTCTTCATTTACTAGTAAATTAGGATTTGATGTGTACTCACTCAATACTCATTGGTGAAGACAGTCTCTTGCGGTTGAACCTGCTTAGACAGTCTATGACAAGATGAAGACAGTCTCAAATATTTCAGTGATAATTTGCTTACTGTGGCAGGTAATTTTATATAGAGCGGGCTTAGTGACAGCTGCTACCTCTTTTGTGATAGCTTCATCAGTTGCTTTTTTACCCGATAACTCTTCATTGAGTGACACACTTAAGCAAAATCTTGATCTATTCTATGCCTTAGGTGGAGGAGGATTAGGCCTATCCCTAGTTTTGATTCACATATATGTAACTGCAATTAAGCGTACTCTTCAAGCTTTTTGGGTGCTTGGTGGTGCTGGATCTTTGGTAACTTACATAAATCTTGCACAACCAGCTGGGGACAGCTTAGTGCAGTATGTTGTTGATAATCCATCGGCAGTTTGGCTTATTGGTCCTCTCTATGCAGCACTGACTGGACTTGTTTTCAAAGAAGGTATTGAGTATTTTTAATATCTCCCAGGCCTTTGACTTACTATTTGCGTCTTTAATGTTTTATTGATGAAACAAGCAAATAATAAGAAAAAGATATTGAAGTTTCGTAGGCTCATATATTGGTGAAATCTCATAGATCTCGTAGTCTTGATGAAGCAGACAAATAATAAGAAGAAATAAGAGTCTACTTTGCTTGTTCAGCATGTAGAAATTAGACTATTAATTATTAGTAATATTGCAGAGGTTTTTGTTATGATATTGGGTCATTTAATTTAACGACTCTGTCTCCCTGTGGTTTTTTCGTTCCCTCTTGGTCATTATTTAAAATACAAAAACAGCTTGTGGTGTATGATGTCTGAAGTGCTGTGAGACATATATTTCGGACTGTTAATATCAATTGGTCAGACGTACTTTATGATTAGTGTTGGTTTCAATTTTAGTCTTGTTTGTCCTCTATATGCAGGGCTTTGCTATGGAAAGCTTGAGGCTGGAGTTCTCACCTTTGTCATACCTACGCTACTTCTGGGCCATCTGGTAGGGTTCACGTTCCAACTTTATAGTTTTTAACTGGAAACTTCACCAATGAATGAATAAGAATTTGTATTGCTCAACTTTGCCTATGAGTTAGACCTTTTCAATTAGAAGCTTGGCAAGATATAACATGTATGAACAAGTTTTCAATCGTGGTACCATCATGCCTTGTAGTATTGTTTGCATGCATCATTAATATCTGAAGCAACGTGCAAGGTTTTCACGGACCCGTGTAATTTTTAATTTGAAACTGCAGTTACTTTCTTGTGTTAAAAGTTGCTCTAAAAAAATGCTTGTCTATGAATATGGAGAGCTAATTTTGTATGGATGAGCATTTTTGTAAAAGCTTGTCAAACTGATAAATTTCTATTAAACCCAGACTAAAAAGGATGGTGGAGCCTCTGAATAATCGCAGTAAAGATTTTTTATTAGGCTAAGTTATATTTTTCAATATGTTATGGCAACTTCTAAGCAAGAACAGTTTCATTTTTATGTTGTAGCTGTTACACACTCAAAAGCTTAGGCTAAATTTAAGCATTTAATATATTTTCATTGTTATCCCTCACATATGTAAGCTCGCATATTTGTTTTATTCATGAGGCTCAACATTGTGAATAACTTTAATGCTATATGAAATCACCATTAAAATCAAAATCTATAATTTTGAGCTGTAGACCTAAAATCTCTTTCTAGCATTTTAACCCTATATAACCTGTTGGACGGACTTTTTTTTTGAGATGCAGACTGGTTTGATGGACGATGGGGTTAAACTTGCTTTATTGGGTTCATGGATGGCCCTCTTTGTAATATTTGCTGGAAGAAAGTTCACTCAGCCCATCAAGGTAATATCTTTCCACCGTAAAAGAAAATCTTGTAGTTGTAGTTTTATTCTTTTTTATCTGTCAAGGAGTCTGTATCTTGAACATAGTTGCAATTTTCATTCATCCATGTGTTAAATTTTTTTTTTTAAAAAAAAAAAAGTAGAAATCAGAACAATGTAGTTGTTTTGAAAGTCAACTTGAAAGATAAACACTGAATACGTAGTGCATTTTATTGTTGCACTGTGTGACTGGTTTCTTATTTATTTTCAGGATGATATTGGTGACAAATCTGTTTTCATGTTCAATACACTTGGAGAGGACGAAAAGAAGGCCTTGATTGCAAAGCTTGAGCAGCAGGAGGTTAATCAGAATGCTGGTTAG

mRNA sequence

ATGGCGTGTGGATGTTTGTGTGCCTCCATTCTTTCTCCTTCGAAGCTTCCTTCTCTTAATTATTCAGCTGATACTAAAACTAAGCTGCTGCTTCGTTCCCCAGTTGCTTTCCCTTCGCCATCTAAGTTATCAGCTCTCAAGTGCAAGGCTGCTGGCCAAACCTCGCCCAGTTCCACCGTCTATCAGGGAATTTACGGTCCTTGGACCGTTGATCCTTCCGACGTTCGAGAGGTAATTTTATATAGAGCGGGCTTAGTGACAGCTGCTACCTCTTTTGTGATAGCTTCATCAGTTGCTTTTTTACCCGATAACTCTTCATTGAGTGACACACTTAAGCAAAATCTTGATCTATTCTATGCCTTAGGTGGAGGAGGATTAGGCCTATCCCTAGTTTTGATTCACATATATGTAACTGCAATTAAGCGTACTCTTCAAGCTTTTTGGGTGCTTGGTGGTGCTGGATCTTTGGTAACTTACATAAATCTTGCACAACCAGCTGGGGACAGCTTAGTGCAGTATGTTGTTGATAATCCATCGGCAGTTTGGCTTATTGGTCCTCTCTATGCAGCACTGACTGGACTTGTTTTCAAAGAAGGGCTTTGCTATGGAAAGCTTGAGGCTGGAGTTCTCACCTTTGTCATACCTACGCTACTTCTGGGCCATCTGACTGGTTTGATGGACGATGGGGTTAAACTTGCTTTATTGGGTTCATGGATGGCCCTCTTTGTAATATTTGCTGGAAGAAAGTTCACTCAGCCCATCAAGGATGATATTGGTGACAAATCTGTTTTCATGTTCAATACACTTGGAGAGGACGAAAAGAAGGCCTTGATTGCAAAGCTTGAGCAGCAGGAGGTTAATCAGAATGCTGGTTAG

Coding sequence (CDS)

ATGGCGTGTGGATGTTTGTGTGCCTCCATTCTTTCTCCTTCGAAGCTTCCTTCTCTTAATTATTCAGCTGATACTAAAACTAAGCTGCTGCTTCGTTCCCCAGTTGCTTTCCCTTCGCCATCTAAGTTATCAGCTCTCAAGTGCAAGGCTGCTGGCCAAACCTCGCCCAGTTCCACCGTCTATCAGGGAATTTACGGTCCTTGGACCGTTGATCCTTCCGACGTTCGAGAGGTAATTTTATATAGAGCGGGCTTAGTGACAGCTGCTACCTCTTTTGTGATAGCTTCATCAGTTGCTTTTTTACCCGATAACTCTTCATTGAGTGACACACTTAAGCAAAATCTTGATCTATTCTATGCCTTAGGTGGAGGAGGATTAGGCCTATCCCTAGTTTTGATTCACATATATGTAACTGCAATTAAGCGTACTCTTCAAGCTTTTTGGGTGCTTGGTGGTGCTGGATCTTTGGTAACTTACATAAATCTTGCACAACCAGCTGGGGACAGCTTAGTGCAGTATGTTGTTGATAATCCATCGGCAGTTTGGCTTATTGGTCCTCTCTATGCAGCACTGACTGGACTTGTTTTCAAAGAAGGGCTTTGCTATGGAAAGCTTGAGGCTGGAGTTCTCACCTTTGTCATACCTACGCTACTTCTGGGCCATCTGACTGGTTTGATGGACGATGGGGTTAAACTTGCTTTATTGGGTTCATGGATGGCCCTCTTTGTAATATTTGCTGGAAGAAAGTTCACTCAGCCCATCAAGGATGATATTGGTGACAAATCTGTTTTCATGTTCAATACACTTGGAGAGGACGAAAAGAAGGCCTTGATTGCAAAGCTTGAGCAGCAGGAGGTTAATCAGAATGCTGGTTAG

Protein sequence

MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTVYQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYALGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSAVWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMALFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQNAG
Homology
BLAST of Tan0021539 vs. NCBI nr
Match: XP_022959722.1 (uncharacterized protein LOC111460706 [Cucurbita moschata])

HSP 1 Score: 524.6 bits (1350), Expect = 5.3e-145
Identity = 268/289 (92.73%), Postives = 277/289 (95.85%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILSPS L SL+YSAD KTKLLL SPVAFPSPSKLSALKCKAAGQ+SP+STV
Sbjct: 1   MACGCLCASILSPSNLLSLDYSADIKTKLLLPSPVAFPSPSKLSALKCKAAGQSSPTSTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDL YA
Sbjct: 61  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAG+LTFVIPTLLLGHL+GLMDDG KL LLGSWMA
Sbjct: 181 VWFIGPLFAALTGLVFKEGLCYGKLEAGILTFVIPTLLLGHLSGLMDDGAKLGLLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE+ QN
Sbjct: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELGQN 289

BLAST of Tan0021539 vs. NCBI nr
Match: XP_023004744.1 (uncharacterized protein LOC111497955 [Cucurbita maxima])

HSP 1 Score: 524.2 bits (1349), Expect = 7.0e-145
Identity = 269/289 (93.08%), Postives = 276/289 (95.50%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILSPS L SLNYSA  KTKLLL SPVAFPSPSKLSALKCKAAGQ+SP+STV
Sbjct: 29  MACGCLCASILSPSNLLSLNYSAHIKTKLLLPSPVAFPSPSKLSALKCKAAGQSSPTSTV 88

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDL YA
Sbjct: 89  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA 148

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSLVQYVVDNPSA
Sbjct: 149 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLVQYVVDNPSA 208

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHL+GLMDDG KL LLGSWMA
Sbjct: 209 VWFIGPLFAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLSGLMDDGAKLGLLGSWMA 268

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE+ QN
Sbjct: 269 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELGQN 317

BLAST of Tan0021539 vs. NCBI nr
Match: KAG7025235.1 (hypothetical protein SDJN02_11730 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 521.5 bits (1342), Expect = 4.5e-144
Identity = 266/289 (92.04%), Postives = 277/289 (95.85%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MAC CLCASILSPS LPSL+YSAD KTKLLL SPVAFPSPSKLSALKCKAAGQ+SP+STV
Sbjct: 1   MACVCLCASILSPSNLPSLDYSADIKTKLLLPSPVAFPSPSKLSALKCKAAGQSSPTSTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLP+NSSLSDTLKQNLDL YA
Sbjct: 61  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPENSSLSDTLKQNLDLLYA 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAG+LTFVIPTLLLGHL+GLMD+G KL LLGSWMA
Sbjct: 181 VWFIGPLFAALTGLVFKEGLCYGKLEAGILTFVIPTLLLGHLSGLMDNGAKLGLLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE+ QN
Sbjct: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELGQN 289

BLAST of Tan0021539 vs. NCBI nr
Match: XP_023515274.1 (uncharacterized protein LOC111779356 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 518.8 bits (1335), Expect = 2.9e-143
Identity = 266/289 (92.04%), Postives = 274/289 (94.81%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILSPS L SLNYSA  KTKLLL SPVAFPSPSKLSALKCKAA Q+SP+STV
Sbjct: 1   MACGCLCASILSPSNLLSLNYSAHIKTKLLLPSPVAFPSPSKLSALKCKAAAQSSPTSTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDL YA
Sbjct: 61  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSL QYVV+NPSA
Sbjct: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLAQYVVENPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHL+GLMDDG KL LLGSWMA
Sbjct: 181 VWFIGPLFAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLSGLMDDGAKLGLLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE+ QN
Sbjct: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELGQN 289

BLAST of Tan0021539 vs. NCBI nr
Match: KAG6592828.1 (40S ribosomal protein S27-2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 518.1 bits (1333), Expect = 5.0e-143
Identity = 264/289 (91.35%), Postives = 277/289 (95.85%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILSPS L SL+YSAD KTKLLL SPVAFPSPSKLSALKCKAAGQ+SP+STV
Sbjct: 1   MACGCLCASILSPSNLRSLDYSADIKTKLLLPSPVAFPSPSKLSALKCKAAGQSSPTSTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLP+NSSLSDTLKQNLDL YA
Sbjct: 61  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPENSSLSDTLKQNLDLLYA 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAG+LTFVIPTLLLGHL+GLMD+G KL LLGSWMA
Sbjct: 181 VWFIGPLFAALTGLVFKEGLCYGKLEAGILTFVIPTLLLGHLSGLMDNGAKLGLLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE++ +
Sbjct: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELDSS 289

BLAST of Tan0021539 vs. ExPASy TrEMBL
Match: A0A6J1H5C2 (uncharacterized protein LOC111460706 OS=Cucurbita moschata OX=3662 GN=LOC111460706 PE=4 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 2.6e-145
Identity = 268/289 (92.73%), Postives = 277/289 (95.85%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILSPS L SL+YSAD KTKLLL SPVAFPSPSKLSALKCKAAGQ+SP+STV
Sbjct: 1   MACGCLCASILSPSNLLSLDYSADIKTKLLLPSPVAFPSPSKLSALKCKAAGQSSPTSTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDL YA
Sbjct: 61  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAG+LTFVIPTLLLGHL+GLMDDG KL LLGSWMA
Sbjct: 181 VWFIGPLFAALTGLVFKEGLCYGKLEAGILTFVIPTLLLGHLSGLMDDGAKLGLLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE+ QN
Sbjct: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELGQN 289

BLAST of Tan0021539 vs. ExPASy TrEMBL
Match: A0A6J1KX72 (uncharacterized protein LOC111497955 OS=Cucurbita maxima OX=3661 GN=LOC111497955 PE=4 SV=1)

HSP 1 Score: 524.2 bits (1349), Expect = 3.4e-145
Identity = 269/289 (93.08%), Postives = 276/289 (95.50%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILSPS L SLNYSA  KTKLLL SPVAFPSPSKLSALKCKAAGQ+SP+STV
Sbjct: 29  MACGCLCASILSPSNLLSLNYSAHIKTKLLLPSPVAFPSPSKLSALKCKAAGQSSPTSTV 88

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           Y+GIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDL YA
Sbjct: 89  YRGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA 148

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLG AGSLV Y+NLAQPAGDSLVQYVVDNPSA
Sbjct: 149 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGVAGSLVAYVNLAQPAGDSLVQYVVDNPSA 208

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW IGPL+AALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHL+GLMDDG KL LLGSWMA
Sbjct: 209 VWFIGPLFAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLSGLMDDGAKLGLLGSWMA 268

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQN 290
           LFVIFAGRKFTQPIKDDIGDKSVFMFN LGEDEKKALIAKLEQQE+ QN
Sbjct: 269 LFVIFAGRKFTQPIKDDIGDKSVFMFNDLGEDEKKALIAKLEQQELGQN 317

BLAST of Tan0021539 vs. ExPASy TrEMBL
Match: A0A0A0KC45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G091330 PE=4 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 5.4e-143
Identity = 264/290 (91.03%), Postives = 274/290 (94.48%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILS  KLPSLNYSA TKTKLL RSPV+FP PSKLSA KCKAAGQTSPS TV
Sbjct: 1   MACGCLCASILSSPKLPSLNYSALTKTKLLRRSPVSFPPPSKLSAFKCKAAGQTSPSPTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           YQGIYGPWTVD SDVREVILYRAGLVTAATSFVIASSVAFLPD+SSL DTLKQNLDL Y 
Sbjct: 61  YQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAFLPDSSSLGDTLKQNLDLLYV 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSL LIHIYVTAIKRTLQA WVLG AGSLVTY+NL+QPAG+SLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLFLIHIYVTAIKRTLQALWVLGVAGSLVTYLNLSQPAGESLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW +GPLYAALTGLVFKEGLCYGKLEAG+LTFVIPTLLLGHLTGLMDDGVKLALLGSWMA
Sbjct: 181 VWFVGPLYAALTGLVFKEGLCYGKLEAGILTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQNA 291
           LFVIFAGRKF+QPIKDDIGDKSVF+FN LGEDEKKALIAKLEQQEV+QNA
Sbjct: 241 LFVIFAGRKFSQPIKDDIGDKSVFLFNALGEDEKKALIAKLEQQEVSQNA 290

BLAST of Tan0021539 vs. ExPASy TrEMBL
Match: A0A5D3DMA3 (DUF2301 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G001900 PE=4 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 9.2e-143
Identity = 265/290 (91.38%), Postives = 274/290 (94.48%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILS  KLPSLNYSA TKTKLL RSPV+FPSPSKLSALKCKAAGQTSPS TV
Sbjct: 1   MACGCLCASILSYPKLPSLNYSALTKTKLLRRSPVSFPSPSKLSALKCKAAGQTSPSPTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           YQGIYGPWTVD SDVREVILYRAGLVTAATSFVIASSVAFLPDNSSL DTLKQNLDL Y 
Sbjct: 61  YQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLGDTLKQNLDLLYV 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSL LIHIYVTAIKRTLQA WVLG AGSLVTY NLAQPAG+SLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLFLIHIYVTAIKRTLQALWVLGVAGSLVTYSNLAQPAGESLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW +GPLYAALTGLVFKEGLCYGKLEAG+LTF+IPTLLLGHLTGLMDDGVKLALLGSWMA
Sbjct: 181 VWFVGPLYAALTGLVFKEGLCYGKLEAGILTFIIPTLLLGHLTGLMDDGVKLALLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQNA 291
           LFVIFAGRKF+QPIKDDIGDKSVF+FN LGE+EKKALIAKLEQQ V+QNA
Sbjct: 241 LFVIFAGRKFSQPIKDDIGDKSVFIFNALGEEEKKALIAKLEQQGVSQNA 290

BLAST of Tan0021539 vs. ExPASy TrEMBL
Match: E5GB55 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 515.0 bits (1325), Expect = 2.0e-142
Identity = 263/290 (90.69%), Postives = 274/290 (94.48%), Query Frame = 0

Query: 1   MACGCLCASILSPSKLPSLNYSADTKTKLLLRSPVAFPSPSKLSALKCKAAGQTSPSSTV 60
           MACGCLCASILS  KLPSLNYSA TKT+LL RSPV+FPSPSKLSALKCKAAGQTSPS TV
Sbjct: 1   MACGCLCASILSYPKLPSLNYSALTKTRLLRRSPVSFPSPSKLSALKCKAAGQTSPSPTV 60

Query: 61  YQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLFYA 120
           YQGIYGPWTVD SDVREVILYR GLVTAATSFVIASSVAFLPDNSSL DTLKQNLDL Y 
Sbjct: 61  YQGIYGPWTVDSSDVREVILYRGGLVTAATSFVIASSVAFLPDNSSLGDTLKQNLDLLYV 120

Query: 121 LGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSLVQYVVDNPSA 180
           LGGGGLGLSL LIHIYVTAIKRTLQA WVLG AGSLVTY+NLAQPAG+SLVQYVVDNPSA
Sbjct: 121 LGGGGLGLSLFLIHIYVTAIKRTLQALWVLGVAGSLVTYLNLAQPAGESLVQYVVDNPSA 180

Query: 181 VWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGVKLALLGSWMA 240
           VW +GPLYAALTGLVFKEGLCYGKLEAG+LTF+IPTLLLGHLTGLMDDGVKLALLGSWMA
Sbjct: 181 VWFVGPLYAALTGLVFKEGLCYGKLEAGILTFIIPTLLLGHLTGLMDDGVKLALLGSWMA 240

Query: 241 LFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEVNQNA 291
           LFVIFAGRKF+QPIKDDIGDKSVF+FN LGE+EKKALIAKLEQQ V+QNA
Sbjct: 241 LFVIFAGRKFSQPIKDDIGDKSVFIFNALGEEEKKALIAKLEQQGVSQNA 290

BLAST of Tan0021539 vs. TAIR 10
Match: AT1G28140.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2301, transmembrane (InterPro:IPR019275); Has 140 Blast hits to 140 proteins in 72 species: Archae - 0; Bacteria - 86; Metazoa - 10; Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 349.7 bits (896), Expect = 2.2e-96
Identity = 169/236 (71.61%), Postives = 203/236 (86.02%), Query Frame = 0

Query: 51  AGQTSPSSTVYQGIYGPWTVDPSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDT 110
           + Q +   TVY+G+YGPWT+D +DV+EVILYR+GLVTAA SFV ASS AFLP +S LS+T
Sbjct: 44  SSQGAVDGTVYKGVYGPWTIDQADVKEVILYRSGLVTAAASFVAASSAAFLPGDSWLSET 103

Query: 111 LKQNLDLFYALGGGGLGLSLVLIHIYVTAIKRTLQAFWVLGGAGSLVTYINLAQPAGDSL 170
           +KQN DLFY +G  GLGLSL LIHIYVT IKRTLQA W LG  GS  TY  LA+PAGD+L
Sbjct: 104 IKQNHDLFYFVGASGLGLSLFLIHIYVTEIKRTLQALWALGFVGSFATYAALARPAGDNL 163

Query: 171 VQYVVDNPSAVWLIGPLYAALTGLVFKEGLCYGKLEAGVLTFVIPTLLLGHLTGLMDDGV 230
           V YVVD+PSAVW +GPL+A+LTGLVFKEGLCYGKLEAG+LTF+IP++LLGHL+GLM+D V
Sbjct: 164 VHYVVDHPSAVWFVGPLFASLTGLVFKEGLCYGKLEAGLLTFIIPSVLLGHLSGLMNDEV 223

Query: 231 KLALLGSWMALFVIFAGRKFTQPIKDDIGDKSVFMFNTLGEDEKKALIAKLEQQEV 287
           KL LLG+WMALF++FAGRKFTQPIKDDIGDKSVF F +L +DEKKA++ KLEQ+++
Sbjct: 224 KLVLLGTWMALFLVFAGRKFTQPIKDDIGDKSVFTFMSLSDDEKKAIVEKLEQEKL 279

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022959722.15.3e-14592.73uncharacterized protein LOC111460706 [Cucurbita moschata][more]
XP_023004744.17.0e-14593.08uncharacterized protein LOC111497955 [Cucurbita maxima][more]
KAG7025235.14.5e-14492.04hypothetical protein SDJN02_11730 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023515274.12.9e-14392.04uncharacterized protein LOC111779356 [Cucurbita pepo subsp. pepo][more]
KAG6592828.15.0e-14391.3540S ribosomal protein S27-2, partial [Cucurbita argyrosperma subsp. sororia][more]
Match NameE-valueIdentityDescription
A0A6J1H5C22.6e-14592.73uncharacterized protein LOC111460706 OS=Cucurbita moschata OX=3662 GN=LOC1114607... [more]
A0A6J1KX723.4e-14593.08uncharacterized protein LOC111497955 OS=Cucurbita maxima OX=3661 GN=LOC111497955... [more]
A0A0A0KC455.4e-14391.03Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G091330 PE=4 SV=1[more]
A0A5D3DMA39.2e-14391.38DUF2301 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
E5GB552.0e-14290.69Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G28140.12.2e-9671.61unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019275Protein of unknown function DUF2301PFAMPF10063DUF2301coord: 128..266
e-value: 4.1E-54
score: 182.7
IPR019275Protein of unknown function DUF2301PANTHERPTHR36716F3H9.20 PROTEINcoord: 28..286

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021539.1Tan0021539.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane