Tan0007571 (gene) Snake gourd v1

Overview
Name: Tan0007571
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Filamentous hemagglutinin transporter
Location: LG05: 4354127 .. 4355577 (+)
RNA-Seq Expression: Tan0007571
Synteny: Tan0007571
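The length of the gene span can be derived from the Location field as a quick consistency check. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3); the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive genomic interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (LG05, + strand).
gene_length = span_length(4354127, 4355577)
print(gene_length)
```

Under that assumption the span covers 1451 bp, which should match the length of the gene sequence listed under Sequences.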
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTAATAGTTTATATAACGGCCTGTTAACTTTCTTCCATATATAGAGACCTCTCTGCTGCCATTGTCGAAGAAAACCGACACTCCCATCTCTCTCTCTCCCTACCTTCTCTCTCTTCGTCTCTCAAAACCAGAAAAAAAAAAATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCTGTCCCCGAGAAATTAACGCCTCTGATCACCCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGCCCCAAGAATTGGACCTCGATCTTCAAGTTCCTTCCGGCTGGGAAAAAAGACTCGACTTGAAGGTACCATTTTTTTTTCTTCTTCCTCTGTTTTTCTTTTATTTTTTTTAATGAAAATTTGGGCAATTTTTTTTTTGAGCCTATTATTGTTACGTGGCTCTAGATATTTGATAAGTTTTAATTTTAACCAAGACCCTTTTTGGATTTTTCGCTAAATTATGAAAAAATGTTTTGGATTTTTGTGTGTAATTTAAGACGGCAATGAAACAGAGTCCTAGTCAATTAAGCAGTATTCAATTATTTGATGGGGATTAATAATTAATTGATTGAATTTTGAATTATTTCAGTCGGGGAAAATGTTCATTCAAAGATGCAATGTTCAAGATTTCAACAACCATCAAACGAATCAAACAGTGTCAAAGCTTCAAGATTTGAACTTTCCGCCGTCCCCCAATTACTCCAAATTCCAATTGTCCAATCATTTCGTCGACGAAACGAATTTGGATTTGAAATTGGTTTCTTCGTCGCCGTCGCCGTCGCCGTCGCCGAGGAGTAATTATCAGAGTGTTTGTACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAAAAATCCCATCAGAAAACGCTCGTCGCTGTGGAAATCGTCGCCGTCGCCGTCGTATTCGTCGTCGTCGTCGTCAGCCGCGGCGGCGAGAGAGTTTCAAGAAGAAGACAACTTTAAATCTCTGTCGACGTCGTCATCAGCAGCGGCGGCGGCTCCGATTGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCCGCGAAGAAACCTCGGATTGATCTGAACATTTCAATTTGATTTAAAGAAGTGGAAAAGTATTGTTATTGTAGATGGAGGGACAGAAATTATCAGCTTCAAGACATAGATTTCTTTGTTTTTTTTTTTTGTTTTTTTTTTTGGGTAGCTGTAGAAACTATATGTAAAAAAAAATGCAAAAGGGAAGGTGGGGATTTCAATTCTTAGTATGATTTCTCATATGATAATTTTATTTCATCTCTCAATTATCAATTAGATAATTAGTTATTTTGATGTTAAATTGATATTCATGGCGA

mRNA sequence

TTTAATAGTTTATATAACGGCCTGTTAACTTTCTTCCATATATAGAGACCTCTCTGCTGCCATTGTCGAAGAAAACCGACACTCCCATCTCTCTCTCTCCCTACCTTCTCTCTCTTCGTCTCTCAAAACCAGAAAAAAAAAAATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCTGTCCCCGAGAAATTAACGCCTCTGATCACCCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGCCCCAAGAATTGGACCTCGATCTTCAAGTTCCTTCCGGCTGGGAAAAAAGACTCGACTTGAAGTCGGGGAAAATGTTCATTCAAAGATGCAATGTTCAAGATTTCAACAACCATCAAACGAATCAAACAGTGTCAAAGCTTCAAGATTTGAACTTTCCGCCGTCCCCCAATTACTCCAAATTCCAATTGTCCAATCATTTCGTCGACGAAACGAATTTGGATTTGAAATTGGTTTCTTCGTCGCCGTCGCCGTCGCCGTCGCCGAGGAGTAATTATCAGAGTGTTTGTACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAAAAATCCCATCAGAAAACGCTCGTCGCTGTGGAAATCGTCGCCGTCGCCGTCGTATTCGTCGTCGTCGTCGTCAGCCGCGGCGGCGAGAGAGTTTCAAGAAGAAGACAACTTTAAATCTCTGTCGACGTCGTCATCAGCAGCGGCGGCGGCTCCGATTGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCCGCGAAGAAACCTCGGATTGATCTGAACATTTCAATTTGATTTAAAGAAGTGGAAAAGTATTGTTATTGTAGATGGAGGGACAGAAATTATCAGCTTCAAGACATAGATTTCTTTGTTTTTTTTTTTTGTTTTTTTTTTTGGGTAGCTGTAGAAACTATATGTAAAAAAAAATGCAAAAGGGAAGGTGGGGATTTCAATTCTTAGTATGATTTCTCATATGATAATTTTATTTCATCTCTCAATTATCAATTAGATAATTAGTTATTTTGATGTTAAATTGATATTCATGGCGA

Coding sequence (CDS)

ATGGCCGCCGAAGTGAGCTCGCTCGTGCGAGTTCTTACAGGCTACAACAAAGACGACCGTCATCGGACGGTCGGAAACGAATCTGTCCCCGAGAAATTAACGCCTCTGATCACCCGAGACTTACTCAGCGGCGGCTATTCCAAATTTACAGAGCCCCAAGAATTGGACCTCGATCTTCAAGTTCCTTCCGGCTGGGAAAAAAGACTCGACTTGAAGTCGGGGAAAATGTTCATTCAAAGATGCAATGTTCAAGATTTCAACAACCATCAAACGAATCAAACAGTGTCAAAGCTTCAAGATTTGAACTTTCCGCCGTCCCCCAATTACTCCAAATTCCAATTGTCCAATCATTTCGTCGACGAAACGAATTTGGATTTGAAATTGGTTTCTTCGTCGCCGTCGCCGTCGCCGTCGCCGAGGAGTAATTATCAGAGTGTTTGTACTTTGGATAAGGTCAAATCGGCGCTCGAACGGGCTGAGAAAAATCCCATCAGAAAACGCTCGTCGCTGTGGAAATCGTCGCCGTCGCCGTCGTATTCGTCGTCGTCGTCGTCAGCCGCGGCGGCGAGAGAGTTTCAAGAAGAAGACAACTTTAAATCTCTGTCGACGTCGTCATCAGCAGCGGCGGCGGCTCCGATTGCCGCCGGCTGCCCTGGGTGTTTGTCGTATGTGTTGGTGATGAAGAACAATCCGACGTGTCCACGGTGTAGCTCCGTCGTGCCGTTGCCGGCCGCGAAGAAACCTCGGATTGATCTGAACATTTCAATTTGA

Protein sequence

MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQVPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVDETNLDLKLVSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI
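
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and uses only the first and last few codons of the CDS listed above; the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last five codons of the CDS shown above.
print(translate("ATGGCCGCCGAAGTGAGCTCGCTCGTG"))
print(translate("AACATTTCAATTTGA"))
```

The two fragments translate to the start (MAAEVSSLV) and end (NISI) of the protein sequence above; passing the full CDS string to `translate` would be the natural way to check the whole record.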
Homology
BLAST of Tan0007571 vs. NCBI nr
Match: XP_038901828.1 (uncharacterized protein LOC120088523 [Benincasa hispida])

HSP 1 Score: 419.1 bits (1076), Expect = 2.8e-113
Identity = 222/258 (86.05%), Positives = 233/258 (90.31%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEVSSLVRVLTGYNKDDRHRTVGN+S  EKLTPLITRDLLSGGYSK+TE QELDLDL 
Sbjct: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNDSAAEKLTPLITRDLLSGGYSKYTESQELDLDLH 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           VPSGWE+RLDLKSGK FIQRCNVQDFNN   NQTV KLQDLNFPPSPN+SKFQ SNH VD
Sbjct: 61  VPSGWERRLDLKSGKTFIQRCNVQDFNN---NQTVPKLQDLNFPPSPNFSKFQSSNHLVD 120

Query: 121 ETNLDLKLVSS-SPSPSP-SPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+LDLKLVSS SPSPSP SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLDLKLVSSLSPSPSPSSPRSNYQSVCTLDKVKSALERAERNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSSA A +EF++EDN KSLS        +PIAAGCPGCLSYVLVMKNNPTCPRC+S
Sbjct: 181 YSSSSSSAMAEKEFRDEDNLKSLS--------SPIAAGCPGCLSYVLVMKNNPTCPRCNS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           VVPLPA KKPRIDLNISI
Sbjct: 241 VVPLPAPKKPRIDLNISI 247
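
The Identity and Positives percentages in each HSP header are the ratio of matching (or conservatively substituted) positions to the alignment length. A minimal sketch of that arithmetic, using the counts reported for this first hit; the helper name `percent` is an illustrative assumption, not part of BLAST.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100 * count / alignment_length, 2)


# Counts from the HSP header of the Benincasa hispida hit.
identity = percent(222, 258)
positives = percent(233, 258)
print(identity, positives)
```

The same function applies to the other hits, for example 136 identical positions over 264 aligned columns for AT1G79160.1.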

BLAST of Tan0007571 vs. NCBI nr
Match: XP_022970865.1 (uncharacterized protein LOC111469711 [Cucurbita maxima])

HSP 1 Score: 409.5 bits (1051), Expect = 2.2e-110
Identity = 218/258 (84.50%), Positives = 231/258 (89.53%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEV+S VRVLTGYNKDD H TV NES P+ LTPLITRDLL+GG SKFT+PQELDLDLQ
Sbjct: 1   MAAEVTSHVRVLTGYNKDDPHPTVANESGPDTLTPLITRDLLTGGCSKFTDPQELDLDLQ 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           +PSGWEKRLDLKSGKMFIQR NVQDFNNHQTNQTV+KLQDLNFPPS NYSKF+LSNH V 
Sbjct: 61  LPSGWEKRLDLKSGKMFIQRSNVQDFNNHQTNQTVAKLQDLNFPPSLNYSKFKLSNHLVH 120

Query: 121 ETNLDLKL--VSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+L+LKL   SSSP PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLELKLDSSSSSPPPSPSPRSNYQSVCTLDKVKSALERADKNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSS   AREFQEEDNF   S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC S
Sbjct: 181 YSSSSSS---AREFQEEDNFNK-SLSSSSSAAAQIAVGCPGCLSYVLVMKNNPTCPRCRS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           VV LPA KKPR+DLNISI
Sbjct: 241 VVALPAEKKPRLDLNISI 254

BLAST of Tan0007571 vs. NCBI nr
Match: XP_004140015.1 (uncharacterized protein LOC101202760 [Cucumis sativus] >KGN46805.1 hypothetical protein Csa_021089 [Cucumis sativus])

HSP 1 Score: 407.9 bits (1047), Expect = 6.4e-110
Identity = 218/258 (84.50%), Positives = 231/258 (89.53%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEVSSLVRVLT YNK+DRHRT G+ES  EKLTPLITRDLL+GGYSKFTE QELDLDL 
Sbjct: 1   MAAEVSSLVRVLTTYNKEDRHRTGGDESTAEKLTPLITRDLLNGGYSKFTESQELDLDLH 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQDLNFPPSPN SKFQL+NH VD
Sbjct: 61  VPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQDLNFPPSPNCSKFQLTNHLVD 120

Query: 121 ETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLDLKLVSSLSSSPSSSSPRSNYQSVCTLDKVKSALERAERNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSSAAA +EF+EE+N K LS        +PIAAGCPGCLSYVLVMKNNPTCPRCSS
Sbjct: 181 YSSSSSSAAAEKEFREEENLKCLS--------SPIAAGCPGCLSYVLVMKNNPTCPRCSS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           +VPLPA KKPRIDLNISI
Sbjct: 241 IVPLPAVKKPRIDLNISI 248

BLAST of Tan0007571 vs. NCBI nr
Match: XP_008456306.1 (PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo] >KAA0054671.1 putative YUP8H12R.23 protein [Cucumis melo var. makuwa] >TYK08600.1 putative YUP8H12R.23 protein [Cucumis melo var. makuwa])

HSP 1 Score: 406.8 bits (1044), Expect = 1.4e-109
Identity = 216/258 (83.72%), Positives = 230/258 (89.15%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEVSSLVRVLT YNK+DRH T GNES  EKL PLITRDLL+GGYSKFTE QELDLDL 
Sbjct: 1   MAAEVSSLVRVLTTYNKEDRHLTAGNESTAEKLAPLITRDLLNGGYSKFTESQELDLDLH 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQDLNFPPSPNYSKFQL+NH VD
Sbjct: 61  VPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQDLNFPPSPNYSKFQLTNHLVD 120

Query: 121 ETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLDLKLVSSLSSSPSSSSPRSNYQSVCTLDKVKSALERAERNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSSAAA +EF+EE+  K         +++PIAAGCPGCLSYVLVMKNNPTCPRCSS
Sbjct: 181 YSSSSSSAAADKEFREEEKLK--------CSSSPIAAGCPGCLSYVLVMKNNPTCPRCSS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           +VPLPAAKKPRIDLNISI
Sbjct: 241 IVPLPAAKKPRIDLNISI 248

BLAST of Tan0007571 vs. NCBI nr
Match: XP_022947770.1 (uncharacterized protein LOC111451530 isoform X1 [Cucurbita moschata] >KAG6604706.1 hypothetical protein SDJN03_02023, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 399.4 bits (1025), Expect = 2.3e-107
Identity = 214/259 (82.63%), Positives = 228/259 (88.03%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAA+V+S VRVLT YNKDD HRTV N+S P+ LTPLITRDLL+GG SKFT+PQELDLDLQ
Sbjct: 1   MAAQVTSHVRVLTVYNKDDPHRTVANQSGPDTLTPLITRDLLTGGSSKFTDPQELDLDLQ 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           +PSGWEK LDLKSGKMFIQR NVQDFNNHQTNQTV+KLQDLNFPPS NYSKF+LSNH V 
Sbjct: 61  LPSGWEKTLDLKSGKMFIQRSNVQDFNNHQTNQTVAKLQDLNFPPSLNYSKFKLSNHLVH 120

Query: 121 ETNLDLKLVSS---SPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSP 180
           ET+LDLKL SS   S  PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWK SPSP
Sbjct: 121 ETSLDLKLDSSSSLSSPPSPSPRSNYQSVCTLDKVKSALERADKNPIRKRSSLWKLSPSP 180

Query: 181 SYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCS 240
           SYSSSSSS   AREFQEEDNF   S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC 
Sbjct: 181 SYSSSSSS---AREFQEEDNFNK-SLSSSSSAAAQIAVGCPGCLSYVLVMKNNPTCPRCR 240

Query: 241 SVVPLPAAKKPRIDLNISI 257
           SVV LPA KKPR+DLNISI
Sbjct: 241 SVVALPAEKKPRLDLNISI 255

BLAST of Tan0007571 vs. ExPASy TrEMBL
Match: A0A6J1I433 (uncharacterized protein LOC111469711 OS=Cucurbita maxima OX=3661 GN=LOC111469711 PE=4 SV=1)

HSP 1 Score: 409.5 bits (1051), Expect = 1.1e-110
Identity = 218/258 (84.50%), Positives = 231/258 (89.53%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEV+S VRVLTGYNKDD H TV NES P+ LTPLITRDLL+GG SKFT+PQELDLDLQ
Sbjct: 1   MAAEVTSHVRVLTGYNKDDPHPTVANESGPDTLTPLITRDLLTGGCSKFTDPQELDLDLQ 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           +PSGWEKRLDLKSGKMFIQR NVQDFNNHQTNQTV+KLQDLNFPPS NYSKF+LSNH V 
Sbjct: 61  LPSGWEKRLDLKSGKMFIQRSNVQDFNNHQTNQTVAKLQDLNFPPSLNYSKFKLSNHLVH 120

Query: 121 ETNLDLKL--VSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+L+LKL   SSSP PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLELKLDSSSSSPPPSPSPRSNYQSVCTLDKVKSALERADKNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSS   AREFQEEDNF   S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC S
Sbjct: 181 YSSSSSS---AREFQEEDNFNK-SLSSSSSAAAQIAVGCPGCLSYVLVMKNNPTCPRCRS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           VV LPA KKPR+DLNISI
Sbjct: 241 VVALPAEKKPRLDLNISI 254

BLAST of Tan0007571 vs. ExPASy TrEMBL
Match: A0A0A0KAN5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G137580 PE=4 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 3.1e-110
Identity = 218/258 (84.50%), Positives = 231/258 (89.53%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEVSSLVRVLT YNK+DRHRT G+ES  EKLTPLITRDLL+GGYSKFTE QELDLDL 
Sbjct: 1   MAAEVSSLVRVLTTYNKEDRHRTGGDESTAEKLTPLITRDLLNGGYSKFTESQELDLDLH 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQDLNFPPSPN SKFQL+NH VD
Sbjct: 61  VPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQDLNFPPSPNCSKFQLTNHLVD 120

Query: 121 ETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLDLKLVSSLSSSPSSSSPRSNYQSVCTLDKVKSALERAERNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSSAAA +EF+EE+N K LS        +PIAAGCPGCLSYVLVMKNNPTCPRCSS
Sbjct: 181 YSSSSSSAAAEKEFREEENLKCLS--------SPIAAGCPGCLSYVLVMKNNPTCPRCSS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           +VPLPA KKPRIDLNISI
Sbjct: 241 IVPLPAVKKPRIDLNISI 248

BLAST of Tan0007571 vs. ExPASy TrEMBL
Match: A0A5D3CB81 (Putative YUP8H12R.23 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold3734G00220 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 6.9e-110
Identity = 216/258 (83.72%), Positives = 230/258 (89.15%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEVSSLVRVLT YNK+DRH T GNES  EKL PLITRDLL+GGYSKFTE QELDLDL 
Sbjct: 1   MAAEVSSLVRVLTTYNKEDRHLTAGNESTAEKLAPLITRDLLNGGYSKFTESQELDLDLH 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQDLNFPPSPNYSKFQL+NH VD
Sbjct: 61  VPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQDLNFPPSPNYSKFQLTNHLVD 120

Query: 121 ETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLDLKLVSSLSSSPSSSSPRSNYQSVCTLDKVKSALERAERNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSSAAA +EF+EE+  K         +++PIAAGCPGCLSYVLVMKNNPTCPRCSS
Sbjct: 181 YSSSSSSAAADKEFREEEKLK--------CSSSPIAAGCPGCLSYVLVMKNNPTCPRCSS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           +VPLPAAKKPRIDLNISI
Sbjct: 241 IVPLPAAKKPRIDLNISI 248

BLAST of Tan0007571 vs. ExPASy TrEMBL
Match: A0A1S3C2I3 (uncharacterized protein LOC103496294 OS=Cucumis melo OX=3656 GN=LOC103496294 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 6.9e-110
Identity = 216/258 (83.72%), Positives = 230/258 (89.15%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAAEVSSLVRVLT YNK+DRH T GNES  EKL PLITRDLL+GGYSKFTE QELDLDL 
Sbjct: 1   MAAEVSSLVRVLTTYNKEDRHLTAGNESTAEKLAPLITRDLLNGGYSKFTESQELDLDLH 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           VPSGWE+RLDLKSGKMFIQRCNVQDFNN+  NQTV KLQDLNFPPSPNYSKFQL+NH VD
Sbjct: 61  VPSGWERRLDLKSGKMFIQRCNVQDFNNN--NQTVPKLQDLNFPPSPNYSKFQLTNHLVD 120

Query: 121 ETNLDLKLVSS-SPSP-SPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSPS 180
           ET+LDLKLVSS S SP S SPRSNYQSVCTLDKVKSALERAE+NPIRKRSSLWKSSPSPS
Sbjct: 121 ETSLDLKLVSSLSSSPSSSSPRSNYQSVCTLDKVKSALERAERNPIRKRSSLWKSSPSPS 180

Query: 181 YSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCSS 240
           YSSSSSSAAA +EF+EE+  K         +++PIAAGCPGCLSYVLVMKNNPTCPRCSS
Sbjct: 181 YSSSSSSAAADKEFREEEKLK--------CSSSPIAAGCPGCLSYVLVMKNNPTCPRCSS 240

Query: 241 VVPLPAAKKPRIDLNISI 257
           +VPLPAAKKPRIDLNISI
Sbjct: 241 IVPLPAAKKPRIDLNISI 248

BLAST of Tan0007571 vs. ExPASy TrEMBL
Match: A0A6J1G7U1 (uncharacterized protein LOC111451530 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451530 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 1.1e-107
Identity = 214/259 (82.63%), Positives = 228/259 (88.03%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQELDLDLQ 60
           MAA+V+S VRVLT YNKDD HRTV N+S P+ LTPLITRDLL+GG SKFT+PQELDLDLQ
Sbjct: 1   MAAQVTSHVRVLTVYNKDDPHRTVANQSGPDTLTPLITRDLLTGGSSKFTDPQELDLDLQ 60

Query: 61  VPSGWEKRLDLKSGKMFIQRCNVQDFNNHQTNQTVSKLQDLNFPPSPNYSKFQLSNHFVD 120
           +PSGWEK LDLKSGKMFIQR NVQDFNNHQTNQTV+KLQDLNFPPS NYSKF+LSNH V 
Sbjct: 61  LPSGWEKTLDLKSGKMFIQRSNVQDFNNHQTNQTVAKLQDLNFPPSLNYSKFKLSNHLVH 120

Query: 121 ETNLDLKLVSS---SPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSPSP 180
           ET+LDLKL SS   S  PSPSPRSNYQSVCTLDKVKSALERA+KNPIRKRSSLWK SPSP
Sbjct: 121 ETSLDLKLDSSSSLSSPPSPSPRSNYQSVCTLDKVKSALERADKNPIRKRSSLWKLSPSP 180

Query: 181 SYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPRCS 240
           SYSSSSSS   AREFQEEDNF   S SSS++AAA IA GCPGCLSYVLVMKNNPTCPRC 
Sbjct: 181 SYSSSSSS---AREFQEEDNFNK-SLSSSSSAAAQIAVGCPGCLSYVLVMKNNPTCPRCR 240

Query: 241 SVVPLPAAKKPRIDLNISI 257
           SVV LPA KKPR+DLNISI
Sbjct: 241 SVVALPAEKKPRLDLNISI 255

BLAST of Tan0007571 vs. TAIR 10
Match: AT1G79160.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16500.1); Has 104 Blast hits to 102 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 206.5 bits (524), Expect = 2.6e-53
Identity = 136/264 (51.52%), Positives = 169/264 (64.02%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLLSGGYSKFTEPQ-ELDLDL 60
           MAA+VSSLVR+L+GY KDDR   V + +  +    L+TRDLL  G     +   ELDLDL
Sbjct: 5   MAADVSSLVRLLSGY-KDDR-AVVKDSAGAKSSAALMTRDLLGNGRGGGGDRSLELDLDL 64

Query: 61  QVPSGWEKRLDLKSGKMFIQRCNVQD----FNNHQTNQTVSKLQDLNFPPSPNYSKFQLS 120
           QVP+G+EKRLDLKSGK+++QRCN        N  QTNQTV   QDLNFPP P  +   L 
Sbjct: 65  QVPTGYEKRLDLKSGKVYLQRCNSTSSSSITNADQTNQTVPTFQDLNFPP-PTLNNSPLL 124

Query: 121 NHFVDETNLDLKLVSSSPSPSPSPRSNYQSVCTLDKVKSALERAEKNPIRKRSSLWKSSP 180
           N F D+T  +LKL+ SS S  P+  SN QSVCTLDKVKSALERAE++P     +++K   
Sbjct: 125 NLF-DDTTPELKLLPSSRSSRPN-TSNLQSVCTLDKVKSALERAERDP-----AMFKKRQ 184

Query: 181 SPSYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAAAPIAAGCPGCLSYVLVMKNNPTCPR 240
           SP             +    D+++      + A A+P+ AGCPGCLSYVLVM NNP CPR
Sbjct: 185 SP-------------DDTVYDHYR------TEAVASPVVAGCPGCLSYVLVMMNNPKCPR 239

Query: 241 CSSVVPLPA---AKKPRIDLNISI 257
           C ++VPLP     KKP+IDLNISI
Sbjct: 245 CDTIVPLPTNPMKKKPKIDLNISI 239

BLAST of Tan0007571 vs. TAIR 10
Match: AT1G16500.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G79160.1); Has 136 Blast hits to 134 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 4; Plants - 131; Viruses - 1; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 193.4 bits (490), Expect = 2.3e-49
Identity = 134/286 (46.85%), Positives = 168/286 (58.74%), Query Frame = 0

Query: 1   MAAEVSSLVRVLTGYNKDDRHRTVGNESVPEKLTPLITRDLL------SGGYSKFTEPQE 60
           MAA+VSSLVR+L+ + KDDR   V + + P     L+TRDLL       GG     +  E
Sbjct: 1   MAADVSSLVRILSRF-KDDR-TVVKDSTGPRSTVALMTRDLLGIGGCVGGGGGGDEQSLE 60

Query: 61  LDLDLQVPSGWEKRLDLKSGKMFI-QRCNV----------QDFNNHQTNQTVSKLQDLNF 120
           LDLD+QVP+GWEKRLDLKSGK+++ Q+CN              +  QTNQTV + QDLN 
Sbjct: 61  LDLDVQVPNGWEKRLDLKSGKVYLQQQCNSTSSSSSSHHHHHHHEDQTNQTVPRFQDLNV 120

Query: 121 PP-SPNYSKFQLSNHF--VDETNLDLKLVSSSPS-PSPSPRSNY---------QSVCTLD 180
           PP S  +    L + F   D+T+L+LKLV SS S P P P S++          SVCTLD
Sbjct: 121 PPISDKFPAKPLLSLFDDDDDTSLELKLVPSSISRPLPPPLSSFSPNQSLSYLSSVCTLD 180

Query: 181 KVKSALERAEKNPIRKRSSLWKSSPSPSYSSSSSSAAAAREFQEEDNFKSLSTSSSAAAA 240
           KVK ALERAEK+  +++S                          ED+     T+S+  AA
Sbjct: 181 KVKLALERAEKDTKKRQS-------------------------PEDDGVYDGTASATVAA 240

Query: 241 APIAAGCPGCLSYVLVMKNNPTCPRCSSVVPLPAAKKPRIDLNISI 257
           + +AAGCPGCLSYV V KNNP CPRC S VPLPA KKP+IDLNIS+
Sbjct: 241 SQVAAGCPGCLSYVFVAKNNPKCPRCHSFVPLPAMKKPKIDLNISM 259

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_038901828.1  2.8e-113  86.05     uncharacterized protein LOC120088523 [Benincasa hispida]
XP_022970865.1  2.2e-110  84.50     uncharacterized protein LOC111469711 [Cucurbita maxima]
XP_004140015.1  6.4e-110  84.50     uncharacterized protein LOC101202760 [Cucumis sativus] >KGN46805.1 hypothetical ...
XP_008456306.1  1.4e-109  83.72     PREDICTED: uncharacterized protein LOC103496294 [Cucumis melo] >KAA0054671.1 put...
XP_022947770.1  2.3e-107  82.63     uncharacterized protein LOC111451530 isoform X1 [Cucurbita moschata] >KAG6604706...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1I433      1.1e-110  84.50     uncharacterized protein LOC111469711 OS=Cucurbita maxima OX=3661 GN=LOC111469711...
A0A0A0KAN5      3.1e-110  84.50     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G137580 PE=4 SV=1
A0A5D3CB81      6.9e-110  83.72     Putative YUP8H12R.23 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3C2I3      6.9e-110  83.72     uncharacterized protein LOC103496294 OS=Cucumis melo OX=3656 GN=LOC103496294 PE=...
A0A6J1G7U1      1.1e-107  82.63     uncharacterized protein LOC111451530 isoform X1 OS=Cucurbita moschata OX=3662 GN...

TAIR 10
Match Name      E-value   Identity  Description
AT1G79160.1     2.6e-53   51.52     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G16500.1     2.3e-49   46.85     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term     Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 158..186
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 168..186
None      No IPR available  PANTHER      PTHR33177:SF44  F3O9.30              coord: 1..255
None      No IPR available  PANTHER      PTHR33177       PUTATIVE-RELATED     coord: 1..255

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0007571.1  Tan0007571.1  mRNA