Tan0010129 (gene) Snake gourd v1

Overview
Name: Tan0010129
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Arabidopsis protein of unknown function (DUF241)
Location: LG01: 116337490 .. 116338846 (-)
RNA-Seq Expression: Tan0010129
Synteny: Tan0010129
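The Location value above can be split into linkage group, coordinates, and strand. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention); `parse_location` is a hypothetical helper, not part of any database tool.

```python
import re

def parse_location(text):
    """Parse a location such as 'LG01: 116337490 .. 116338846 (-)'.

    Assumes 1-based, inclusive coordinates (not stated on the page).
    """
    m = re.search(r"(\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # inclusive span in bp
    }

loc = parse_location("LG01: 116337490 .. 116338846 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1357 bp on the minus strand of LG01.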
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCCAAACATCCCAAAATCCTCTCCCCGCTCTCACCAAAATCACAGCCATGGTCGGCGTTTTCCGGCGATCTTTCTCTTTTCCGAACAAGTCTCCAGCCAAGCCTTCTCTCTCTCATCACGTCCGTTCCATCAGTCTTCCCTGCAGATCTCACCCCTTGATTTTCCAACTCAAGGACGAGATCGCCAATCTCAATTCCTGGTCGTCTAATTCCGATTCTCGATCCGCCGCCTGGATCTGCGACGGCCTCAACCGCCTCAAAACCGTCCACAACCATCTCGACGACGTTCTCAACCTCCCTCAGACTCAAGAATCTCTCCGCCACCAGCCGCACTGGATCAATAAGCTTCTCGAACATTTCTTACGCTTCGTCGATGTTTACGGAATCTTCCAGACTTTGATTCTGTCGCTCAAAGAAGAGCACTCCGCCGCGCAGGTCGCGATGAGGAGAAAAGACGAAGAGAAGATCGCGTTATATGTTAAATCTAGGAAGAGATTAGCTAGGCAAATGGCGAAACTGGTTTCGACCGTACAGAAGAAAATCAAGACGGCGGAGCGAGGCGTCGCCGCGGCAGCCGATCTTGCCGCCGTGATCGAAGAAGTTGTCGGAGTGACGACGGCGGTTTCTCTCGCACTGTTCAACGGAATCGCAGAATCGTTCTCAACGAAGAAGCCATGGCGATGGACAGGAGTGGATCGCCTTTCGAAGAAATCGGTGGAGGATAAAGGAATTCGAGAGTTCAGAGAGATTGGATCGGAGAATTTGAGAGAATTGAAGAAGAAAGGGAAAGAAGAAACGAAAATGGCGATGAAGAAGATGAGAGATTTGGAGGATTGGATTAGCGACATTGAAACTCGAAGCCAGAAGGTTTTCAGAAGTTTGATCAGTGCCAGAGTTTCGTTGCTGAACGCTCTGTCACAGCAACAAACATCGGAAAAATAAGGGGGATTTTTTCTTTTAACACGAAATTATTATACTAATTAAATATGTACATGATTTAGAGATTTTGATCCCTTTTGTTAAGAAAAAAAGGGGTTAAAAAATGAAAAGAAAATTTGTGGTGTTTAAAGTTTAAACTATAGGGTGGTGAAAAATCAAAGAGATCTAATTTAGAAAAAAAAAATACAATTATAAGGTGTGTATGGTAGGGATGAGCTGGTTTTGATGATATATTGTAGATAGAATCTTTGGATGTGTGTAGAAGATTTTTTTAGTTCAATAACATATAGAAGTGGGAGATTTGAACTATGACTTTTTGGTCATAAGTCTACACTTATGTCACTTATGTCAGTTGAGCTATGTTATGTGTGTAGAAGATTGAGAATCTATGACCATTATCCCATTTGGGATGTCTAAAG

mRNA sequence

CCCAAACATCCCAAAATCCTCTCCCCGCTCTCACCAAAATCACAGCCATGGTCGGCGTTTTCCGGCGATCTTTCTCTTTTCCGAACAAGTCTCCAGCCAAGCCTTCTCTCTCTCATCACGTCCGTTCCATCAGTCTTCCCTGCAGATCTCACCCCTTGATTTTCCAACTCAAGGACGAGATCGCCAATCTCAATTCCTGGTCGTCTAATTCCGATTCTCGATCCGCCGCCTGGATCTGCGACGGCCTCAACCGCCTCAAAACCGTCCACAACCATCTCGACGACGTTCTCAACCTCCCTCAGACTCAAGAATCTCTCCGCCACCAGCCGCACTGGATCAATAAGCTTCTCGAACATTTCTTACGCTTCGTCGATGTTTACGGAATCTTCCAGACTTTGATTCTGTCGCTCAAAGAAGAGCACTCCGCCGCGCAGGTCGCGATGAGGAGAAAAGACGAAGAGAAGATCGCGTTATATGTTAAATCTAGGAAGAGATTAGCTAGGCAAATGGCGAAACTGGTTTCGACCGTACAGAAGAAAATCAAGACGGCGGAGCGAGGCGTCGCCGCGGCAGCCGATCTTGCCGCCGTGATCGAAGAAGTTGTCGGAGTGACGACGGCGGTTTCTCTCGCACTGTTCAACGGAATCGCAGAATCGTTCTCAACGAAGAAGCCATGGCGATGGACAGGAGTGGATCGCCTTTCGAAGAAATCGGTGGAGGATAAAGGAATTCGAGAGTTCAGAGAGATTGGATCGGAGAATTTGAGAGAATTGAAGAAGAAAGGGAAAGAAGAAACGAAAATGGCGATGAAGAAGATGAGAGATTTGGAGGATTGGATTAGCGACATTGAAACTCGAAGCCAGAAGGTTTTCAGAAGTTTGATCAGTGCCAGAGTTTCGTTGCTGAACGCTCTGTCACAGCAACAAACATCGGAAAAATAAGGGGGATTTTTTCTTTTAACACGAAATTATTATACTAATTAAATATGTACATGATTTAGAGATTTTGATCCCTTTTGTTAAGAAAAAAAGGGGTTAAAAAATGAAAAGAAAATTTGTGGTGTTTAAAGTTTAAACTATAGGGTGGTGAAAAATCAAAGAGATCTAATTTAGAAAAAAAAAATACAATTATAAGGTGTGTATGGTAGGGATGAGCTGGTTTTGATGATATATTGTAGATAGAATCTTTGGATGTGTGTAGAAGATTTTTTTAGTTCAATAACATATAGAAGTGGGAGATTTGAACTATGACTTTTTGGTCATAAGTCTACACTTATGTCACTTATGTCAGTTGAGCTATGTTATGTGTGTAGAAGATTGAGAATCTATGACCATTATCCCATTTGGGATGTCTAAAG

Coding sequence (CDS)

ATGGTCGGCGTTTTCCGGCGATCTTTCTCTTTTCCGAACAAGTCTCCAGCCAAGCCTTCTCTCTCTCATCACGTCCGTTCCATCAGTCTTCCCTGCAGATCTCACCCCTTGATTTTCCAACTCAAGGACGAGATCGCCAATCTCAATTCCTGGTCGTCTAATTCCGATTCTCGATCCGCCGCCTGGATCTGCGACGGCCTCAACCGCCTCAAAACCGTCCACAACCATCTCGACGACGTTCTCAACCTCCCTCAGACTCAAGAATCTCTCCGCCACCAGCCGCACTGGATCAATAAGCTTCTCGAACATTTCTTACGCTTCGTCGATGTTTACGGAATCTTCCAGACTTTGATTCTGTCGCTCAAAGAAGAGCACTCCGCCGCGCAGGTCGCGATGAGGAGAAAAGACGAAGAGAAGATCGCGTTATATGTTAAATCTAGGAAGAGATTAGCTAGGCAAATGGCGAAACTGGTTTCGACCGTACAGAAGAAAATCAAGACGGCGGAGCGAGGCGTCGCCGCGGCAGCCGATCTTGCCGCCGTGATCGAAGAAGTTGTCGGAGTGACGACGGCGGTTTCTCTCGCACTGTTCAACGGAATCGCAGAATCGTTCTCAACGAAGAAGCCATGGCGATGGACAGGAGTGGATCGCCTTTCGAAGAAATCGGTGGAGGATAAAGGAATTCGAGAGTTCAGAGAGATTGGATCGGAGAATTTGAGAGAATTGAAGAAGAAAGGGAAAGAAGAAACGAAAATGGCGATGAAGAAGATGAGAGATTTGGAGGATTGGATTAGCGACATTGAAACTCGAAGCCAGAAGGTTTTCAGAAGTTTGATCAGTGCCAGAGTTTCGTTGCTGAACGCTCTGTCACAGCAACAAACATCGGAAAAATAA

Protein sequence

MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSAAWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILSLKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAAVIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKKSVEDKGIREFREIGSENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSEK
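The protein sequence follows from the CDS by translation with the standard genetic code. The sketch below is a minimal, dependency-free illustration that checks the first and last codons of the CDS listed above against the ends of the protein; the `translate` helper and the variable names are illustrative, and only short excerpts of the CDS are used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTCGGCGTTTTCCGGCGATCTTTCTCT"  # first 30 nt of the CDS
cds_end = "TCGGAAAAATAA"                     # last 12 nt of the CDS

print(translate(cds_start))  # expected to match the protein's first 10 residues
print(translate(cds_end))    # expected to match the protein's last 3 residues
```

The same function can be applied to the full CDS string to reproduce the whole protein sequence.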
Homology
BLAST of Tan0010129 vs. NCBI nr
Match: XP_022941094.1 (uncharacterized protein LOC111446494 [Cucurbita moschata])

HSP 1 Score: 506.5 bits (1303), Expect = 1.5e-139
Identity = 268/298 (89.93%), Positives = 281/298 (94.30%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNKSP KP+LSHHVRSISLPCRSHPLIFQLKDEIANL SWS + DSR+A
Sbjct: 1   MVGVFRRSFSFPNKSPPKPALSHHVRSISLPCRSHPLIFQLKDEIANLKSWSLSLDSRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFL FVDVYGIFQTLIL+
Sbjct: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLTFVDVYGIFQTLILT 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQ AMRRKDEEKIALYVK+RKRLARQMAKLVST+QKKIKTAE+G  AAADLA+
Sbjct: 121 LKEEHSAAQAAMRRKDEEKIALYVKARKRLARQMAKLVSTLQKKIKTAEQG-TAAADLAS 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKKSV-EDKGIREFREIGSENL 240
           VIEEVVGVTTAVSLAL NGIAESFST+KPW WTG+DRLSKKS  E+KGIREFR+IGSE L
Sbjct: 181 VIEEVVGVTTAVSLALLNGIAESFSTRKPWTWTGLDRLSKKSAEEEKGIREFRDIGSEKL 240

Query: 241 RELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSEK 298
           RELKKKGKEET+ AMKKMRD EDW SDIETRSQKVFRSLISARVSLLNALSQQQ SEK
Sbjct: 241 RELKKKGKEETEKAMKKMRDSEDWFSDIETRSQKVFRSLISARVSLLNALSQQQASEK 297
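The percentages in each HSP header are the matched-residue counts divided by the alignment length. The sketch below is a minimal illustration that parses such a header line and recomputes them; `parse_hsp_stats` is a hypothetical helper and the regular expression assumes the `label = n/m` layout used in this report.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matched, aligned_length, percent)} for each 'label = n/m' pair."""
    stats = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matched, total = int(matched), int(total)
        stats[label] = (matched, total, round(100 * matched / total, 2))
    return stats

header = "Identity = 268/298 (89.93%), Positives = 281/298 (94.30%), Query Frame = 0"
stats = parse_hsp_stats(header)
print(stats["Identity"])
print(stats["Positives"])
```

Note that the denominator is the alignment length (including gap columns), which is why it differs between hits against the same query.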

BLAST of Tan0010129 vs. NCBI nr
Match: KAG7037942.1 (hypothetical protein SDJN02_01575, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 505.0 bits (1299), Expect = 4.5e-139
Identity = 267/294 (90.82%), Positives = 280/294 (95.24%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNKSPAKP+LSHHVRSISLPCRSHPLIFQLKDEIANL SWS + DSR+A
Sbjct: 1   MVGVFRRSFSFPNKSPAKPALSHHVRSISLPCRSHPLIFQLKDEIANLKSWSLSLDSRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFL FVDVYGIFQTLIL+
Sbjct: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLTFVDVYGIFQTLILT 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQVAMRRKDEEKIALYVK+RKRLARQMAKLV+T+QKKIKTAE+G  AAADLA+
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKARKRLARQMAKLVTTLQKKIKTAEQG-TAAADLAS 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKKSV-EDKGIREFREIGSENL 240
           VIEEVVGVTTAVSLAL NGIAESFST+KPW WTG+DRLSKKS  E+KGIREFREIGSE L
Sbjct: 181 VIEEVVGVTTAVSLALLNGIAESFSTRKPWTWTGLDRLSKKSAEEEKGIREFREIGSEKL 240

Query: 241 RELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQ 294
           RELKKKGKEET+ AMKKMRD EDW SDIETRSQKVFRSLISARVSLLNALSQQQ
Sbjct: 241 RELKKKGKEETEKAMKKMRDSEDWFSDIETRSQKVFRSLISARVSLLNALSQQQ 293

BLAST of Tan0010129 vs. NCBI nr
Match: XP_022982526.1 (uncharacterized protein LOC111481317 [Cucurbita maxima])

HSP 1 Score: 504.6 bits (1298), Expect = 5.8e-139
Identity = 266/298 (89.26%), Positives = 280/298 (93.96%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNKSP KP+LSHHVRSISLPCRSHPLIFQLK+EIANLNSWS + DSR+A
Sbjct: 1   MVGVFRRSFSFPNKSPPKPALSHHVRSISLPCRSHPLIFQLKEEIANLNSWSLSLDSRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFL FVDVYGIFQTLIL+
Sbjct: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLTFVDVYGIFQTLILT 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQVAMRRKDEEKIALYVK+RKRLARQM KLVST+QKKIKTAE+G   AADLA+
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKARKRLARQMTKLVSTLQKKIKTAEQG-TTAADLAS 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKKSV-EDKGIREFREIGSENL 240
           VIEEVVGVTTAVSLAL NGI ESFST+KPW WTG+DR+SKKS  E+KGIREFREIGSE L
Sbjct: 181 VIEEVVGVTTAVSLALLNGIVESFSTRKPWAWTGLDRISKKSAEEEKGIREFREIGSEKL 240

Query: 241 RELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSEK 298
           RELKKKGKEET+ AMKKMRD EDW SDIETRSQKVFRSLISARVSLLNALSQQQ SEK
Sbjct: 241 RELKKKGKEETEKAMKKMRDSEDWFSDIETRSQKVFRSLISARVSLLNALSQQQASEK 297

BLAST of Tan0010129 vs. NCBI nr
Match: KAG6591230.1 (hypothetical protein SDJN03_13576, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 483.0 bits (1242), Expect = 1.8e-132
Identity = 256/301 (85.05%), Positives = 274/301 (91.03%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNK+  KP+LSHHVRSISLP R HPLIFQLKDEIANL SWS +S+SR+A
Sbjct: 1   MVGVFRRSFSFPNKAHPKPALSHHVRSISLPSRPHPLIFQLKDEIANLRSWSLSSESRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDD+LNLPQTQESLRHQPHW++KLLEHFLRFVDVYGIFQTLILS
Sbjct: 61  AWICDGLNRLKTVHNHLDDILNLPQTQESLRHQPHWMDKLLEHFLRFVDVYGIFQTLILS 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKK KTAE+G     DL+A
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKTKTAEQG-NITTDLSA 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKK----SVEDKGIREFREIGS 240
            IEEV+GVT AVSLALFNGIAESF T+KPW WTG D++SKK    + E+KGIREFREIGS
Sbjct: 181 AIEEVIGVTMAVSLALFNGIAESFRTRKPWAWTGFDQVSKKGKKSAEEEKGIREFREIGS 240

Query: 241 ENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSE 298
           ENLRELKKKGKEETK  MKKMRDLEDWI DIET+SQKVFRSLISARVSLLNALSQQQ  +
Sbjct: 241 ENLRELKKKGKEETKRTMKKMRDLEDWIGDIETQSQKVFRSLISARVSLLNALSQQQILQ 300

BLAST of Tan0010129 vs. NCBI nr
Match: KAG7024116.1 (hypothetical protein SDJN02_12929, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 481.5 bits (1238), Expect = 5.3e-132
Identity = 255/301 (84.72%), Positives = 274/301 (91.03%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNK+  KP+LSHHVRSISLP R HPLIFQLKDEIANL SWS +S+SR+A
Sbjct: 1   MVGVFRRSFSFPNKAHPKPALSHHVRSISLPSRPHPLIFQLKDEIANLRSWSLSSESRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDD+LNLPQTQESLRHQPHW++KLLEHFLRFVDVYGIFQTLILS
Sbjct: 61  AWICDGLNRLKTVHNHLDDILNLPQTQESLRHQPHWMDKLLEHFLRFVDVYGIFQTLILS 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQVA+RRKDEEKIALYVKSRKRLARQMAKLVSTVQKK KTAE+G     DLAA
Sbjct: 121 LKEEHSAAQVAVRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKTKTAEQG-NITTDLAA 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKK----SVEDKGIREFREIGS 240
            IEEV+GVT AVSLALFNGIAESF T+KPW WTG D++SKK    + E+KGIREFREIGS
Sbjct: 181 AIEEVIGVTMAVSLALFNGIAESFRTRKPWAWTGFDQVSKKGKKSAEEEKGIREFREIGS 240

Query: 241 ENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSE 298
           ENLRELKKKGKEETK  M+KMRDLEDWI DIET+SQKVFRSLISARVSLLNALSQQQ  +
Sbjct: 241 ENLRELKKKGKEETKRTMRKMRDLEDWIGDIETQSQKVFRSLISARVSLLNALSQQQILQ 300

BLAST of Tan0010129 vs. ExPASy TrEMBL
Match: A0A6J1FM80 (uncharacterized protein LOC111446494 OS=Cucurbita moschata OX=3662 GN=LOC111446494 PE=4 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 7.4e-140
Identity = 268/298 (89.93%), Positives = 281/298 (94.30%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNKSP KP+LSHHVRSISLPCRSHPLIFQLKDEIANL SWS + DSR+A
Sbjct: 1   MVGVFRRSFSFPNKSPPKPALSHHVRSISLPCRSHPLIFQLKDEIANLKSWSLSLDSRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFL FVDVYGIFQTLIL+
Sbjct: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLTFVDVYGIFQTLILT 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQ AMRRKDEEKIALYVK+RKRLARQMAKLVST+QKKIKTAE+G  AAADLA+
Sbjct: 121 LKEEHSAAQAAMRRKDEEKIALYVKARKRLARQMAKLVSTLQKKIKTAEQG-TAAADLAS 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKKSV-EDKGIREFREIGSENL 240
           VIEEVVGVTTAVSLAL NGIAESFST+KPW WTG+DRLSKKS  E+KGIREFR+IGSE L
Sbjct: 181 VIEEVVGVTTAVSLALLNGIAESFSTRKPWTWTGLDRLSKKSAEEEKGIREFRDIGSEKL 240

Query: 241 RELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSEK 298
           RELKKKGKEET+ AMKKMRD EDW SDIETRSQKVFRSLISARVSLLNALSQQQ SEK
Sbjct: 241 RELKKKGKEETEKAMKKMRDSEDWFSDIETRSQKVFRSLISARVSLLNALSQQQASEK 297

BLAST of Tan0010129 vs. ExPASy TrEMBL
Match: A0A6J1IZK2 (uncharacterized protein LOC111481317 OS=Cucurbita maxima OX=3661 GN=LOC111481317 PE=4 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 2.8e-139
Identity = 266/298 (89.26%), Positives = 280/298 (93.96%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNKSP KP+LSHHVRSISLPCRSHPLIFQLK+EIANLNSWS + DSR+A
Sbjct: 1   MVGVFRRSFSFPNKSPPKPALSHHVRSISLPCRSHPLIFQLKEEIANLNSWSLSLDSRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFL FVDVYGIFQTLIL+
Sbjct: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLTFVDVYGIFQTLILT 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQVAMRRKDEEKIALYVK+RKRLARQM KLVST+QKKIKTAE+G   AADLA+
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKARKRLARQMTKLVSTLQKKIKTAEQG-TTAADLAS 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKKSV-EDKGIREFREIGSENL 240
           VIEEVVGVTTAVSLAL NGI ESFST+KPW WTG+DR+SKKS  E+KGIREFREIGSE L
Sbjct: 181 VIEEVVGVTTAVSLALLNGIVESFSTRKPWAWTGLDRISKKSAEEEKGIREFREIGSEKL 240

Query: 241 RELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSEK 298
           RELKKKGKEET+ AMKKMRD EDW SDIETRSQKVFRSLISARVSLLNALSQQQ SEK
Sbjct: 241 RELKKKGKEETEKAMKKMRDSEDWFSDIETRSQKVFRSLISARVSLLNALSQQQASEK 297

BLAST of Tan0010129 vs. ExPASy TrEMBL
Match: A0A6J1F8P4 (uncharacterized protein LOC111443334 OS=Cucurbita moschata OX=3662 GN=LOC111443334 PE=4 SV=1)

HSP 1 Score: 479.6 bits (1233), Expect = 9.7e-132
Identity = 255/301 (84.72%), Positives = 272/301 (90.37%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRSFSFPNK+  KP+LSHHVRSISLP R HPLIF LKDEIANL SWS +S+SR+A
Sbjct: 1   MVGVFRRSFSFPNKAHPKPALSHHVRSISLPSRPHPLIFHLKDEIANLRSWSLSSESRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLNRLKTVHNHLDD+LNLPQTQESLRHQPHW++KLLEHFLRFVDVYGIFQTLILS
Sbjct: 61  AWICDGLNRLKTVHNHLDDILNLPQTQESLRHQPHWMDKLLEHFLRFVDVYGIFQTLILS 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAA 180
           LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKK KTAE+G     DLAA
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKTKTAEQG-NITTDLAA 180

Query: 181 VIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLSKK----SVEDKGIREFREIGS 240
            I EV+GVT AVSLALFNGIAESF T+KPW WTG D++SKK    + E+KGIREFREIGS
Sbjct: 181 AIGEVIGVTMAVSLALFNGIAESFRTRKPWAWTGFDQVSKKGKKSAEEEKGIREFREIGS 240

Query: 241 ENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQTSE 298
           ENLRELKKKGKEETK  MKKMRDLEDWI DIET+SQKVFRSLISARVSLLNALSQQQ  +
Sbjct: 241 ENLRELKKKGKEETKRTMKKMRDLEDWIGDIETQSQKVFRSLISARVSLLNALSQQQILQ 300

BLAST of Tan0010129 vs. ExPASy TrEMBL
Match: A0A0A0L1A0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G658480 PE=4 SV=1)

HSP 1 Score: 475.7 bits (1223), Expect = 1.4e-130
Identity = 254/300 (84.67%), Positives = 272/300 (90.67%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRS SFPNK+P KPSLSHHVRSISLPCRSHPLIFQLKD+IANL+SWS NSDS +A
Sbjct: 1   MVGVFRRSISFPNKTPVKPSLSHHVRSISLPCRSHPLIFQLKDQIANLHSWSLNSDSHTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWICDGLN LKTVHNHLDD+LNLPQT++SLRH PHWI+KLLEHFLRFVDVYGIFQTLILS
Sbjct: 61  AWICDGLNHLKTVHNHLDDILNLPQTRDSLRHHPHWIDKLLEHFLRFVDVYGIFQTLILS 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVA-AAADLA 180
           LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKK K AE+G A   ADLA
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKTKIAEQGQAGVTADLA 180

Query: 181 AVIEEVVGVTTAVSLALFNGIAESFSTKK-PWRWTGVDRLSKK-----SVEDKGIREFRE 240
           AVIEEV+GVTT VSLALFNGI+ESF TKK  W+WT +D ++KK       E KGI+EFRE
Sbjct: 181 AVIEEVIGVTTTVSLALFNGISESFGTKKITWKWTRLDSVTKKVKKSAEEEKKGIQEFRE 240

Query: 241 IGSENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQQQ 294
           IGSENLRELKKKGKEETK+AMKKMRDLEDWISDIE  SQ+VFRSLISARVSLLNALSQQQ
Sbjct: 241 IGSENLRELKKKGKEETKIAMKKMRDLEDWISDIENGSQRVFRSLISARVSLLNALSQQQ 300

BLAST of Tan0010129 vs. ExPASy TrEMBL
Match: A0A5A7VG10 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G00760 PE=4 SV=1)

HSP 1 Score: 470.7 bits (1210), Expect = 4.5e-129
Identity = 255/302 (84.44%), Positives = 273/302 (90.40%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNKSPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSA 60
           MVGVFRRS SFPNK+  KPSLSHHVRSISLPCRSHPLIFQLKD+IANL+SWS NSDSR+A
Sbjct: 1   MVGVFRRSISFPNKTLVKPSLSHHVRSISLPCRSHPLIFQLKDQIANLHSWSLNSDSRTA 60

Query: 61  AWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILS 120
           AWIC+GL+ LKTVHNHLDD+LNLPQT+ESLRH PHWI+KLLEHFLRFVDVYGIFQTLILS
Sbjct: 61  AWICEGLSHLKTVHNHLDDILNLPQTRESLRHNPHWIDKLLEHFLRFVDVYGIFQTLILS 120

Query: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLV---STVQKKIKTAERGVA-AAA 180
           LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLV   STVQKK K AE+G A   A
Sbjct: 121 LKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSSLSTVQKKTKIAEQGQAGVTA 180

Query: 181 DLAAVIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTGVDRLS---KKSVED--KGIREF 240
           DLAAVIEEV+GVT  VSLALFNGI+ESF TK  WRWT +DR++   KKS ED  KGI+EF
Sbjct: 181 DLAAVIEEVIGVTMTVSLALFNGISESFGTKNTWRWTRLDRVTKKVKKSAEDQEKGIQEF 240

Query: 241 REIGSENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNALSQ 294
           REIGSENLRELKKKGKEETK+AMKKMRDLEDWISDIE  SQ+VFRSLISARVSLLNALSQ
Sbjct: 241 REIGSENLRELKKKGKEETKIAMKKMRDLEDWISDIENGSQRVFRSLISARVSLLNALSQ 300

BLAST of Tan0010129 vs. TAIR 10
Match: AT1G76240.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 258.1 bits (658), Expect = 8.8e-69
Identity = 161/308 (52.27%), Positives = 207/308 (67.21%), Query Frame = 0

Query: 1   MVGVFRRSFSFPNK-----SP-AKPSLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSS- 60
           MVGVFRRS SFPNK     SP +KP +SHH RSISLPCRSHPLI  +  EI+ L SW S 
Sbjct: 1   MVGVFRRSLSFPNKPCGRSSPSSKPRVSHHTRSISLPCRSHPLISHVNHEISQLKSWFSF 60

Query: 61  --NSDSRSAAWICDGLNRLKTVHNHLDDVLNLPQTQESLRHQPHWINKLLEHFLRFVDVY 120
              + SR+ +WI DGL+ LK V   L D+L LPQ+QESLR++P +   LLE  LRFVD Y
Sbjct: 61  AGETHSRTTSWITDGLSLLKDVQETLADILQLPQSQESLRNRPVFFENLLEDLLRFVDAY 120

Query: 121 GIFQTLILSLKEEHSAAQVAMRRKDEEKIALYVKSRKRLARQMAKLVSTVQKKIKTAER- 180
           GIF+T IL L+E  SAAQVA+R+KD+EKIA Y+KSR+ LAR +AKL S++++  KT  + 
Sbjct: 121 GIFRTSILCLREHQSAAQVALRKKDDEKIASYLKSRRSLARDIAKLTSSIREP-KTKHQH 180

Query: 181 -------GVAAAADLAAVIEEVVGVTTAVSLALFNGIAESFSTKKPWRWTG-VDRLSKKS 240
                  G    A+LA+VI +V+ VT  VS+ALFNG+  S    K   + G + R  KK 
Sbjct: 181 CHVDNVNGTYGDAELASVIGDVIEVTVLVSVALFNGVYLSLRATKTTPFIGFLKRSEKKE 240

Query: 241 VEDKGIREFREIGSENLRELKKKGKEETKMAMKKMRDLEDWISDIETRSQKVFRSLISAR 291
             D+GI E +++  ++L  L KK  EE K  MK+M +LE+ I +IE  S+KVFR LIS R
Sbjct: 241 KLDEGIVELKQVEEKSLIGLSKKKNEEVKSLMKRMMELENSIREIECESEKVFRGLISTR 300

BLAST of Tan0010129 vs. TAIR 10
Match: AT2G17080.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 95.9 bits (237), Expect = 5.8e-20
Identity = 88/278 (31.65%), Positives = 139/278 (50.00%), Query Frame = 0

Query: 20  SLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSAAWICDGLNRLKTVHNHLDD 79
           ++S HVRS S P RSHP    + +++A L S S  + S S++ IC  L+ L+ +H  LD 
Sbjct: 2   AVSFHVRSNSFPSRSHPQAAHVDEQLARLRS-SEQASSSSSSSICQRLDNLQELHESLDK 61

Query: 80  VLNLPQTQESL--RHQPHWINKLLEHFLRFVDVYGIFQTLILSLKEEHSAAQVAMRRKD- 139
           +++ P TQ++L   H    + +LL+  LR +D+  I +  +  +KE     Q  +RRK  
Sbjct: 62  LISRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRG 121

Query: 140 --EEKIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAAVIEEVVGVTTAVSL 199
              E++  Y+ SRK L +   K    VQK +K     V  A D       V G   A++L
Sbjct: 122 DLSEEVKKYLTSRKSLKKSFQK----VQKSLK-----VTQAEDNNDDTLAVFGEAEAITL 181

Query: 200 ALFNGIAESFSTKKPW-RWTGVDRL--SKKSVEDKGIREFREIGSENLRELKKKGKEETK 259
           +LF+ +    S  K   +W+ V +L   KK   +    EF ++ SE         + E  
Sbjct: 182 SLFDSLLSYMSGSKTCSKWSVVSKLMNKKKVTCEAQENEFTKVDSE--------FQSEKT 241

Query: 260 MAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNAL 290
           + M  +++LE  I D+E   + + +SLI  RVS LN L
Sbjct: 242 LKMDDVQNLESCIQDLEDGLESLSKSLIKYRVSFLNIL 261

BLAST of Tan0010129 vs. TAIR 10
Match: AT2G17070.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 86.3 bits (212), Expect = 4.6e-17
Identity = 83/277 (29.96%), Positives = 137/277 (49.46%), Query Frame = 0

Query: 20  SLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSAAWICDGLNRLKTVHNHLDD 79
           ++S HVRS S P   HP    + +++A L S    S S S++ IC  L+ L+ +H  LD 
Sbjct: 2   AVSFHVRSHSYPSIPHPQAAHVDEQLARLRSSEETSTSSSSS-ICQRLDNLQELHESLDK 61

Query: 80  VLNLPQTQESLRHQPHW--INKLLEHFLRFVDVYGIFQTLILSLKEEHSAAQVAMRRKDE 139
           ++ LP TQ++L  + +   + +LL+  L+ +DV  I +  +  +KE     Q  +RRK  
Sbjct: 62  LIRLPVTQQALGQEKNKKDVEQLLDGSLKILDVCNISKDALSQMKEGLMEIQSILRRKRG 121

Query: 140 E---KIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAAVIEEVVGVTTAVSL 199
           +   ++  Y+ SRK   +   K    VQK +K A+        LA     V G   AV++
Sbjct: 122 DLSGEVKKYLASRKSFKKTFQK----VQKSLKAAQAEDNKDKSLA-----VFGEAEAVTI 181

Query: 200 ALFNGIAESFSTKKPW-RWTGVDRL--SKKSVEDKGIREFREIGSENLRELKKKGKEETK 259
           A+F+ +    S  K   +W+ V +L   KK   +    EF ++ SE         + E  
Sbjct: 182 AMFDSLFSYMSGSKTCSKWSVVSKLMNKKKITCEAQENEFTKVDSE--------FQSEKT 241

Query: 260 MAMKKMRDLEDWISDIETRSQKVFRSLISARVSLLNA 289
           + M+ ++ LE  I D E   + + +SLI  RVS+LN+
Sbjct: 242 LKMEDVQILESCIQDFEDGLESLSKSLIKYRVSILNS 260

BLAST of Tan0010129 vs. TAIR 10
Match: AT4G35200.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 85.5 bits (210), Expect = 7.9e-17
Identity = 82/274 (29.93%), Positives = 135/274 (49.27%), Query Frame = 0

Query: 20  SLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSAAWICDGLNRLKTVHNHLDD 79
           ++S HVRS S P R HP    + +++  L S    SDS S++ IC  L+ L+ +H+ L+ 
Sbjct: 2   AVSFHVRSNSYPSRQHPQAAHVDEQLTRLRS----SDSASSSSICQRLSNLQDLHDSLEK 61

Query: 80  VLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILSLKEEHSAAQVAMRRKDEE- 139
           ++ L  T  +L      I KLL+  LR +D+  I +  I  +KE     Q  +RRK  + 
Sbjct: 62  MIRLSVTNLALSQDQ--IEKLLDGSLRILDLCNIAKDAISQMKEGLMEIQSILRRKPGDL 121

Query: 140 --KIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAAVIEEVVGVTTAVSLAL 199
             ++  Y+ SRK L + + K++    K +K  +   +  A L      V G   AV++AL
Sbjct: 122 SGEVKKYLVSRKFLKKSLQKVI----KSLKVCQSKDSTNASLV-----VFGRAEAVTMAL 181

Query: 200 FNGIAESFS-TKKPWRWTGVDRL--SKKSVEDKGIREFREIGSENLRELKKKGKEETKMA 259
           F  +    S +K   +W+ V ++    K   +    EF  I SE         + E  + 
Sbjct: 182 FESLFSFMSGSKACGKWSLVSKMMSQNKVTCEAEANEFTRIDSE--------FQSEKSLQ 241

Query: 260 MKKMRDLEDWISDIETRSQKVFRSLISARVSLLN 288
           M+ +++LE  I D+E   + + +SLI  RVS+LN
Sbjct: 242 MEDVQNLESCIQDLEDGIESLSKSLIKYRVSILN 252

BLAST of Tan0010129 vs. TAIR 10
Match: AT4G35210.1 (Arabidopsis protein of unknown function (DUF241) )

HSP 1 Score: 75.1 bits (183), Expect = 1.1e-13
Identity = 77/271 (28.41%), Positives = 135/271 (49.82%), Query Frame = 0

Query: 20  SLSHHVRSISLPCRSHPLIFQLKDEIANLNSWSSNSDSRSAAWICDGLNRLKTVHNHLDD 79
           ++S HVRS S P R HP    + +++  L S    S + S++ IC  L+ L+ +H+ L+ 
Sbjct: 2   AVSFHVRSSSYPSRQHPQAAHVDEQLTRLRS----SGTASSSSICQRLSNLQDLHDSLEK 61

Query: 80  VLNLPQTQESLRHQPHWINKLLEHFLRFVDVYGIFQTLILSLKEEHSAAQVAMRRKDEE- 139
           ++ L  T ++L      I KLL+  ++ +D+  I +  +  +KE     Q  +RRK  + 
Sbjct: 62  MIRLSVTNQALSQDQ--IEKLLDGSIKILDLCSISKDGLSQMKESLKEIQSIVRRKRGDL 121

Query: 140 --KIALYVKSRKRLARQMAKLVSTVQKKIKTAERGVAAAADLAAVIEEVVGVTTAVSLAL 199
             ++  Y+ SRK L +   K    V K +KT++       D  AV  E   VT A+  +L
Sbjct: 122 SAEVKKYLASRKFLKKSFEK----VLKSLKTSQN----KNDALAVFGEAETVTIALFESL 181

Query: 200 FNGIAESFSTKKPWRWTGVDRLSKKSVEDKGIREFREIGSENLRELKKKGKEETKMAMKK 259
           F+ ++ S   K   +W+ V ++  +S   KG  E     +     +  + + E  + M+ 
Sbjct: 182 FSFMSGS---KACGKWSLVSKMMSQS---KGTCEAE---ANEFTRVDMEFQSEKSLQMED 241

Query: 260 MRDLEDWISDIETRSQKVFRSLISARVSLLN 288
           +++LE  I D+E     + +SLI  RVS+LN
Sbjct: 242 VQNLEICIQDLEDGIGSLSKSLIKYRVSILN 249

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022941094.1 | 1.5e-139 | 89.93 | uncharacterized protein LOC111446494 [Cucurbita moschata]
KAG7037942.1 | 4.5e-139 | 90.82 | hypothetical protein SDJN02_01575, partial [Cucurbita argyrosperma subsp. argyro...
XP_022982526.1 | 5.8e-139 | 89.26 | uncharacterized protein LOC111481317 [Cucurbita maxima]
KAG6591230.1 | 1.8e-132 | 85.05 | hypothetical protein SDJN03_13576, partial [Cucurbita argyrosperma subsp. sorori...
KAG7024116.1 | 5.3e-132 | 84.72 | hypothetical protein SDJN02_12929, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FM80 | 7.4e-140 | 89.93 | uncharacterized protein LOC111446494 OS=Cucurbita moschata OX=3662 GN=LOC1114464...
A0A6J1IZK2 | 2.8e-139 | 89.26 | uncharacterized protein LOC111481317 OS=Cucurbita maxima OX=3661 GN=LOC111481317...
A0A6J1F8P4 | 9.7e-132 | 84.72 | uncharacterized protein LOC111443334 OS=Cucurbita moschata OX=3662 GN=LOC1114433...
A0A0A0L1A0 | 1.4e-130 | 84.67 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G658480 PE=4 SV=1
A0A5A7VG10 | 4.5e-129 | 84.44 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G76240.1 | 8.8e-69 | 52.27 | Arabidopsis protein of unknown function (DUF241)
AT2G17080.1 | 5.8e-20 | 31.65 | Arabidopsis protein of unknown function (DUF241)
AT2G17070.1 | 4.6e-17 | 29.96 | Arabidopsis protein of unknown function (DUF241)
AT4G35200.1 | 7.9e-17 | 29.93 | Arabidopsis protein of unknown function (DUF241)
AT4G35210.1 | 1.1e-13 | 28.41 | Arabidopsis protein of unknown function (DUF241)

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004320 | Protein of unknown function DUF241, plant | PFAM | PF03087 | DUF241 | coord: 66..287; e-value: 1.8E-56; score: 191.6
None | No IPR available | PANTHER | PTHR33070:SF49 | OS06G0725500 PROTEIN | coord: 12..289
None | No IPR available | PANTHER | PTHR33070 | OS06G0725500 PROTEIN | coord: 12..289
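The alignment coordinates give a rough sense of how much of the protein each annotation covers. The sketch below is a minimal illustration, assuming the `coord` values are 1-based inclusive residue positions and taking the protein length as 298 aa (the last query position in the BLAST alignments above); `coverage` is a hypothetical helper.

```python
PROTEIN_LENGTH = 298  # assumption: last query residue number in the BLAST alignments

def coverage(coord, protein_length=PROTEIN_LENGTH):
    """Return (span in residues, fraction of protein) for a 'start..end' coordinate."""
    start, end = (int(x) for x in coord.split(".."))
    span = end - start + 1  # assumes inclusive coordinates
    return span, span / protein_length

pfam_span, pfam_frac = coverage("66..287")        # PF03087 (DUF241)
panther_span, panther_frac = coverage("12..289")  # PTHR33070
print(pfam_span, f"{pfam_frac:.1%}")
print(panther_span, f"{panther_frac:.1%}")
```

Under these assumptions the DUF241 domain spans 222 residues, roughly three quarters of the protein.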

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0010129.1 | Tan0010129.1 | mRNA