Tan0019795 (gene) Snake gourd v1

Overview
Name: Tan0019795
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: PHY domain-containing protein
Location: LG08: 65461962 .. 65464161 (+)
RNA-Seq Expression: Tan0019795
Synteny: Tan0019795
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCGCGATCTCTGTTGAAAACTTTGGCGCCGGGTTTTCTTCGGCGGAAGTTTCCCTTTCCAAGCTCCAATTTCCAATTCAACCATGGCCATTTCCGATCTCTCACCGGCATACGCTGTGCTGCTTCTGTTGGTGACGGCGGCGGCGATTGTCGAAGCTGGCGACAACAACCGAGTTTTCTCACCTTGCACCGACACGACTGTTGAGAAGTCTGATGGTTTCACCTTAGGGCTTGCTTTTGCGACGGAGCAGAAGTTTACCTTCAATAAAACCTTGAAGTTGTCTCCTTGCGACAGCAGGCTTGCTCTTACGAATGGAAATTCTCTGATCTCTGTGTTTAGACCTAAGGTTGATGAGATCTCCCTCCTCACCGTCAATACTACTCCCTCCGTGTCCAACTTCAATCCGGTTAGTTCCTCCTCTAACTCTTTGTTCTTTGTTCATGTCGAATTTATCGGTTGTTTCTGTTTGATCTTCGTTTGAAGTTGATACTAGTACTAGAATTAGGTTCTGTGCGTGGGCTTGTCGAAGCGGATGGTTATTAGGTTTTGTGAGATTGGAGTATTAGGTTTGTATGACTCTCAAATCAAACATGGGGTTGGTTTCTAAAATCCTAATCCTTAGCTCCAAACAGGGAGGCAAACAGAACAGATCCTTTTGGGATCACTTCGCTTACTTTAATTGCGTGATTCAAGCTCGATTCGATTTGGGAATTTTTGGTATGTTGATTGTTTCTTTGAGGGAAATTAACAATCAATAGCTATAGGTTGACATTTTGCACGACTCCATTTTAACAAGAAACTCCAATGTTGAAGTTTAGTTGGAAGGAACTTCACTGAAGATGGTATTAAGGCGTTCTACTCTTAGTTCGTGTTTGAGAGAGTATACAAATCACACCTTCTTCTTCCGCTTTAAGATGATGCCTTCTTAATGCTTGGGCTGCTTTGTCTCCATAGAAGGAAGAGGTTAATGTATCTGCTAGTTTGATTTCTACTTTGCTCACATAGATCTAAAATCTTTTCTACTGGTTTATTTTTACAATTATATGCTAGTTTTCTGTTCTTACCAGTAGCGGACTCACGATTTCTTTTTCTTTTCTTTCATTACCAGTCTTCAAATGGCTATATGGTTGCATTTGGTGGTCGGAAATATGCTGCAAGGTCCCCTCCAATTTTTGTCGCAGATGAACAACATACCGTAACCAGCTTTACTTTGGTAAGTTTATGACATAGGAAGCAGGAAGTAAAGACATCAGATATGAACTCAATACAACACTGACTCTGTGATATGTTGTTTTTAAGAATATAACATGGATATATAGCAGGGACATATGTAACAAATTACCTATTTTGTGAAATGTAAGAGGCAATGGGGAAAAATTATTGAATGAGGCCATCACATTCGAAATATTTCGGTAGGTTCTCTCTCGTGCATCTAGAGTGTAAAACTACTTGCTACATTGTGGTAGTCTCCTCCGGGGAGAGGCTTAGACTAGAGAAGTTCATTGTTGGAGATAGTGGGTAAATCATGATGTATTAAACATTTTCAGGGCAGGCTGAATTTTAATCATCGAAAAGTGTTTGTGCTTTCCAACTTATGACAAATTTTCTCAAATACAGGTGCTTGAGTTTGAGAAGGGTAGGCTGCAAAACTTGTTCTGGAAAAGGGATGGTTGTGGTCGATGTTCAAACAACAATACCTTTGTTTGCATCAACAATCAGGATTGTGCAATCAAAACAAACAGCTGCAAATATCGTGGCGGTCCTGTCGATTGCAGTCTAGCGATACAACTAGCGTTCTCCGGCACCGATAAGCACCTTTCGGTCTTCAACTCTTGGTATGAAGTGTCAAAACTTCGCCAGTACTCGCTCTTCAATCTTTATTCGAACCTCAAAGATTCTCTCACAAGTCAGTACAACAAGATCTTCCAATAGAGCATATTTTGGTTAATGTATATTTCCATCCTTGTATGCTGTAATTCAGTTTGTGCAACAAATTCCTT
TCAAATCTTCTACATTTCTCCATCAATTAAATGAGTCAGTATAGGCTGGACTAGTGTTTGTTTGGTAGATTTATCATCAGTGTATACTAAACTCCTGGTTGAAAGTATTTTGTATACAAAATCAAAGTGTTGAATTACTTGAACCACATTGTTGCCTTTCTCTTCTTCTGATGTATTGTAATGAATGACTTGGATTGGGC

mRNA sequence

TCGCGATCTCTGTTGAAAACTTTGGCGCCGGGTTTTCTTCGGCGGAAGTTTCCCTTTCCAAGCTCCAATTTCCAATTCAACCATGGCCATTTCCGATCTCTCACCGGCATACGCTGTGCTGCTTCTGTTGGTGACGGCGGCGGCGATTGTCGAAGCTGGCGACAACAACCGAGTTTTCTCACCTTGCACCGACACGACTGTTGAGAAGTCTGATGGTTTCACCTTAGGGCTTGCTTTTGCGACGGAGCAGAAGTTTACCTTCAATAAAACCTTGAAGTTGTCTCCTTGCGACAGCAGGCTTGCTCTTACGAATGGAAATTCTCTGATCTCTGTGTTTAGACCTAAGGTTGATGAGATCTCCCTCCTCACCGTCAATACTACTCCCTCCGTGTCCAACTTCAATCCGTCTTCAAATGGCTATATGGTTGCATTTGGTGGTCGGAAATATGCTGCAAGGTCCCCTCCAATTTTTGTCGCAGATGAACAACATACCGTAACCAGCTTTACTTTGGTGCTTGAGTTTGAGAAGGGTAGGCTGCAAAACTTGTTCTGGAAAAGGGATGGTTGTGGTCGATGTTCAAACAACAATACCTTTGTTTGCATCAACAATCAGGATTGTGCAATCAAAACAAACAGCTGCAAATATCGTGGCGGTCCTGTCGATTGCAGTCTAGCGATACAACTAGCGTTCTCCGGCACCGATAAGCACCTTTCGGTCTTCAACTCTTGGTATGAAGTGTCAAAACTTCGCCAGTACTCGCTCTTCAATCTTTATTCGAACCTCAAAGATTCTCTCACAAGTCAGTACAACAAGATCTTCCAATAGAGCATATTTTGGTTAATGTATATTTCCATCCTTGTATGCTGTAATTCAGTTTGTGCAACAAATTCCTTTCAAATCTTCTACATTTCTCCATCAATTAAATGAGTCAGTATAGGCTGGACTAGTGTTTGTTTGGTAGATTTATCATCAGTGTATACTAAACTCCTGGTTGAAAGTATTTTGTATACAAAATCAAAGTGTTGAATTACTTGAACCACATTGTTGCCTTTCTCTTCTTCTGATGTATTGTAATGAATGACTTGGATTGGGC

Coding sequence (CDS)

ATGGCCATTTCCGATCTCTCACCGGCATACGCTGTGCTGCTTCTGTTGGTGACGGCGGCGGCGATTGTCGAAGCTGGCGACAACAACCGAGTTTTCTCACCTTGCACCGACACGACTGTTGAGAAGTCTGATGGTTTCACCTTAGGGCTTGCTTTTGCGACGGAGCAGAAGTTTACCTTCAATAAAACCTTGAAGTTGTCTCCTTGCGACAGCAGGCTTGCTCTTACGAATGGAAATTCTCTGATCTCTGTGTTTAGACCTAAGGTTGATGAGATCTCCCTCCTCACCGTCAATACTACTCCCTCCGTGTCCAACTTCAATCCGTCTTCAAATGGCTATATGGTTGCATTTGGTGGTCGGAAATATGCTGCAAGGTCCCCTCCAATTTTTGTCGCAGATGAACAACATACCGTAACCAGCTTTACTTTGGTGCTTGAGTTTGAGAAGGGTAGGCTGCAAAACTTGTTCTGGAAAAGGGATGGTTGTGGTCGATGTTCAAACAACAATACCTTTGTTTGCATCAACAATCAGGATTGTGCAATCAAAACAAACAGCTGCAAATATCGTGGCGGTCCTGTCGATTGCAGTCTAGCGATACAACTAGCGTTCTCCGGCACCGATAAGCACCTTTCGGTCTTCAACTCTTGGTATGAAGTGTCAAAACTTCGCCAGTACTCGCTCTTCAATCTTTATTCGAACCTCAAAGATTCTCTCACAAGTCAGTACAACAAGATCTTCCAATAG

Protein sequence

MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTFNKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGRKYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCAIKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTSQYNKIFQ
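The protein sequence above can be checked against the CDS by translating it in frame 0. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database or of any library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGCCATTTCCGATCTCTCA"))  # prints MAISDLS
```

The printed peptide matches the first seven residues of the protein sequence. Passing the full CDS string to `translate` would allow the same comparison over the whole protein.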
Homology
BLAST of Tan0019795 vs. NCBI nr
Match: XP_023538924.1 (uncharacterized protein LOC111799707 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 452.6 bits (1163), Expect = 2.2e-123
Identity = 228/247 (92.31%), Positives = 234/247 (94.74%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI DLS A AVLLLL+TAAA VEA D+NRVFSPC DTTVEKSDGFTLGLAFAT QKF F
Sbjct: 1   MAIYDLSRASAVLLLLMTAAA-VEARDSNRVFSPCADTTVEKSDGFTLGLAFATHQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLISVFRP VDEISLLTVNTTPSVSNFNPSSNGYMVAF GR
Sbjct: 61  NKTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVNTTPSVSNFNPSSNGYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCINNQDCA
Sbjct: 121 KYAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+T++CKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS
Sbjct: 181 IRTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 246
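The percentages in each HSP summary line follow from the two counts reported beside them. A minimal sketch of that arithmetic, assuming the denominator is the alignment length (gap columns included) and that values are rounded to two decimals; the helper name `percent` is illustrative.

```python
def percent(count: int, aligned_length: int) -> float:
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


# Counts taken from the HSP line of the XP_023538924.1 hit above.
identity = percent(228, 247)
positives = percent(234, 247)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
# prints Identity = 92.31%, Positives = 94.74%
```

The same calculation reproduces the values of the other hits, for example 226/259 = 87.26% for KAG7028165.1, whose alignment is longer because of a 12-column gap in the query.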

BLAST of Tan0019795 vs. NCBI nr
Match: KAG6596627.1 (hypothetical protein SDJN03_09807, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 449.5 bits (1155), Expect = 1.9e-122
Identity = 225/247 (91.09%), Positives = 233/247 (94.33%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI D S A AVLLLL+TAAA VEA D+NR+FSPC DTTVEKSDGFTLGLAFAT+QKF F
Sbjct: 31  MAIYDFSTASAVLLLLMTAAA-VEARDSNRIFSPCADTTVEKSDGFTLGLAFATQQKFVF 90

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLISVFRP VDEISLLTVNTTPSVSNFNPSSN YMVAF GR
Sbjct: 91  NKTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVNTTPSVSNFNPSSNSYMVAFAGR 150

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCINNQDCA
Sbjct: 151 KYAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCA 210

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+T++CKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS
Sbjct: 211 IRTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 270

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 271 QYNKIFQ 276

BLAST of Tan0019795 vs. NCBI nr
Match: XP_022939140.1 (uncharacterized protein LOC111445105 [Cucurbita moschata])

HSP 1 Score: 449.1 bits (1154), Expect = 2.4e-122
Identity = 225/247 (91.09%), Positives = 234/247 (94.74%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI DLS A AVLLLL+TAAA VEA D+NRVFSPC DTTVEKSDGFTLGLAFAT+QKF F
Sbjct: 1   MAIYDLSTASAVLLLLMTAAA-VEARDSNRVFSPCADTTVEKSDGFTLGLAFATQQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLIS+FRP VDEISLLTVNTTPSVSNFNPSSN YMVAF GR
Sbjct: 61  NKTLNLSPCDSRLALTNGNSLISMFRPMVDEISLLTVNTTPSVSNFNPSSNSYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCINNQDCA
Sbjct: 121 KYAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+T++CKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSLFNLYSNLKDSLTS
Sbjct: 181 IRTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 246

BLAST of Tan0019795 vs. NCBI nr
Match: XP_023005908.1 (uncharacterized protein LOC111498771 [Cucurbita maxima])

HSP 1 Score: 444.1 bits (1141), Expect = 7.8e-121
Identity = 224/247 (90.69%), Positives = 232/247 (93.93%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI DLS A AVLLLL+TAAA VEA D+NRVFSPC DTTVEKSDGFTLGLAFAT+QKF F
Sbjct: 1   MAIYDLSTASAVLLLLMTAAA-VEARDSNRVFSPCADTTVEKSDGFTLGLAFATQQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLISVFRP VDEISLLTV+TTPSVSNFNPSSN YMVAF GR
Sbjct: 61  NKTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVSTTPSVSNFNPSSNSYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCINNQDCA
Sbjct: 121 KYAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+T++CKYRGGPVDCSLAIQLAFSGTDKHL VFNSWYEVSKL QYSLFNLYSNLKDSLTS
Sbjct: 181 IRTSNCKYRGGPVDCSLAIQLAFSGTDKHLLVFNSWYEVSKLCQYSLFNLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 246

BLAST of Tan0019795 vs. NCBI nr
Match: KAG7028165.1 (hypothetical protein SDJN02_09345 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 442.2 bits (1136), Expect = 3.0e-120
Identity = 226/259 (87.26%), Positives = 234/259 (90.35%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI DLS A AVLLLL+TAAA VEA D+NR+FSPC DTTVEKSDGFTLGLAFAT+QKF F
Sbjct: 31  MAIYDLSTASAVLLLLMTAAA-VEARDSNRIFSPCADTTVEKSDGFTLGLAFATQQKFVF 90

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLISVFRP VDEISLLTVNTTPSVSNFNPSSN YMVAF GR
Sbjct: 91  NKTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVNTTPSVSNFNPSSNSYMVAFAGR 150

Query: 121 KYAARSPPIFVADEQHTVTSFTL------------VLEFEKGRLQNLFWKRDGCGRCSNN 180
           KYAARSPPIFVADEQH VTSFTL            VLEFEKGRLQNLFWKRDGC +CSNN
Sbjct: 151 KYAARSPPIFVADEQHAVTSFTLNMTSQDSALLAKVLEFEKGRLQNLFWKRDGCAQCSNN 210

Query: 181 NTFVCINNQDCAIKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLF 240
           NTFVCINNQDCAI+T++CKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLF
Sbjct: 211 NTFVCINNQDCAIRTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLF 270

Query: 241 NLYSNLKDSLTSQYNKIFQ 248
           NLYSNLKDSLTSQYNKIFQ
Sbjct: 271 NLYSNLKDSLTSQYNKIFQ 288

BLAST of Tan0019795 vs. ExPASy TrEMBL
Match: A0A6J1FLU2 (uncharacterized protein LOC111445105 OS=Cucurbita moschata OX=3662 GN=LOC111445105 PE=4 SV=1)

HSP 1 Score: 449.1 bits (1154), Expect = 1.2e-122
Identity = 225/247 (91.09%), Positives = 234/247 (94.74%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI DLS A AVLLLL+TAAA VEA D+NRVFSPC DTTVEKSDGFTLGLAFAT+QKF F
Sbjct: 1   MAIYDLSTASAVLLLLMTAAA-VEARDSNRVFSPCADTTVEKSDGFTLGLAFATQQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLIS+FRP VDEISLLTVNTTPSVSNFNPSSN YMVAF GR
Sbjct: 61  NKTLNLSPCDSRLALTNGNSLISMFRPMVDEISLLTVNTTPSVSNFNPSSNSYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCINNQDCA
Sbjct: 121 KYAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+T++CKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSLFNLYSNLKDSLTS
Sbjct: 181 IRTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 246

BLAST of Tan0019795 vs. ExPASy TrEMBL
Match: A0A6J1L3G6 (uncharacterized protein LOC111498771 OS=Cucurbita maxima OX=3661 GN=LOC111498771 PE=4 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 3.8e-121
Identity = 224/247 (90.69%), Positives = 232/247 (93.93%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAI DLS A AVLLLL+TAAA VEA D+NRVFSPC DTTVEKSDGFTLGLAFAT+QKF F
Sbjct: 1   MAIYDLSTASAVLLLLMTAAA-VEARDSNRVFSPCADTTVEKSDGFTLGLAFATQQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL LSPCDSRLALTNGNSLISVFRP VDEISLLTV+TTPSVSNFNPSSN YMVAF GR
Sbjct: 61  NKTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVSTTPSVSNFNPSSNSYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCINNQDCA
Sbjct: 121 KYAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+T++CKYRGGPVDCSLAIQLAFSGTDKHL VFNSWYEVSKL QYSLFNLYSNLKDSLTS
Sbjct: 181 IRTSNCKYRGGPVDCSLAIQLAFSGTDKHLLVFNSWYEVSKLCQYSLFNLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 246

BLAST of Tan0019795 vs. ExPASy TrEMBL
Match: A0A6J1J6R6 (uncharacterized protein LOC111481736 OS=Cucurbita maxima OX=3661 GN=LOC111481736 PE=4 SV=1)

HSP 1 Score: 438.3 bits (1126), Expect = 2.1e-119
Identity = 223/247 (90.28%), Positives = 230/247 (93.12%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAISDLSP   VLLLLVT A +VEA DNNRVFSPC DTTVEKSDGFTLGLAFATEQKF F
Sbjct: 1   MAISDLSPLSFVLLLLVT-AVVVEARDNNRVFSPCIDTTVEKSDGFTLGLAFATEQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTLKLSPCDSRLALTNGN+LISVFRPKVDEISLLTVNTTPSVS FNPSSNGYMVAF GR
Sbjct: 61  NKTLKLSPCDSRLALTNGNALISVFRPKVDEISLLTVNTTPSVSTFNPSSNGYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFV+D QH VTSFTLVLEFEKGRLQNLFWKRDGC RCSNN+TFVCINNQDCA
Sbjct: 121 KYAARSPPIFVSDGQHIVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNHTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+TN+CK   GPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSL +LYSNLKDSLTS
Sbjct: 181 IRTNNCK-NSGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLVDLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 245

BLAST of Tan0019795 vs. ExPASy TrEMBL
Match: A0A6J1F5Z3 (uncharacterized protein LOC111442414 OS=Cucurbita moschata OX=3662 GN=LOC111442414 PE=4 SV=1)

HSP 1 Score: 435.6 bits (1119), Expect = 1.3e-118
Identity = 221/247 (89.47%), Positives = 231/247 (93.52%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAISDLSP   VLLLLVT A +VEA DNNRVFSPC DTTVEKSDGFTLGLAFATEQKF F
Sbjct: 1   MAISDLSPLSFVLLLLVT-AVVVEARDNNRVFSPCIDTTVEKSDGFTLGLAFATEQKFVF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTLKLSPCDSRLALTNGN+LISVFRPKVDEISLLTVNTTPSVS+FNPSS+GYMVAF GR
Sbjct: 61  NKTLKLSPCDSRLALTNGNALISVFRPKVDEISLLTVNTTPSVSSFNPSSHGYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFV+D QH VTSFTLVLEFEKGRLQNLFWKRDGC RCSNN+TFVCINNQDCA
Sbjct: 121 KYAARSPPIFVSDGQHIVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNHTFVCINNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+TN+CK   GPVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL +LYSNLKDSLTS
Sbjct: 181 IRTNNCK-NSGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLVDLYSNLKDSLTS 240

Query: 241 QYNKIFQ 248
           QYNKIFQ
Sbjct: 241 QYNKIFQ 245

BLAST of Tan0019795 vs. ExPASy TrEMBL
Match: A0A5A7TKL4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004720 PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 6.6e-118
Identity = 218/246 (88.62%), Positives = 230/246 (93.50%), Query Frame = 0

Query: 1   MAISDLSPAYAVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTF 60
           MAIS  SPA AVL+LLVT A +VEAGDNNRVFSPCTDTTVE SDGFTLG AFAT+QKF F
Sbjct: 1   MAISYPSPASAVLILLVT-AVLVEAGDNNRVFSPCTDTTVETSDGFTLGFAFATQQKFFF 60

Query: 61  NKTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGR 120
           NKTL+LSPCDSRL LTNGNSLISVFRPKVDEISLLTVNT+ SVS+F+PSSNGYMVAF GR
Sbjct: 61  NKTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSRSVSSFDPSSNGYMVAFAGR 120

Query: 121 KYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCA 180
           KYAARSPPIFVAD+QH VTSFTLVLEFEKGRLQNLFWKRDGC +CSNNNTFVCI+NQDCA
Sbjct: 121 KYAARSPPIFVADQQHIVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCIHNQDCA 180

Query: 181 IKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTS 240
           I+TNSCK  GG VDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSLFNLYSNLKDSLTS
Sbjct: 181 IRTNSCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTS 240

Query: 241 QYNKIF 247
           QYNKIF
Sbjct: 241 QYNKIF 245

BLAST of Tan0019795 vs. TAIR 10
Match: AT3G11800.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G44150.1); Has 74 Blast hits to 73 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 72; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 300.1 bits (767), Expect = 1.7e-81
Identity = 146/238 (61.34%), Positives = 184/238 (77.31%), Query Frame = 0

Query: 13  LLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKF---TFNKTLKLSPC 72
           L   V  +++ EAGDNN+V+SPC+D+TV   DGFT G+AFA +  F     +K+++ SPC
Sbjct: 10  LFAAVLTSSLTEAGDNNQVYSPCSDSTVAIGDGFTFGIAFAAKDSFFSTNRSKSVQYSPC 69

Query: 73  DSRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNP-SSNGYMVAFGGRKYAARSPP 132
           D R    NGNS ++VFRPKVDEI+LLT+NT+ S S+F P +S GYMVAF G KYAARS P
Sbjct: 70  DHRHLSLNGNSEVAVFRPKVDEITLLTINTSSS-SSFRPDASKGYMVAFAGAKYAARSLP 129

Query: 133 IFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCAIKTNSCKY 192
           I VAD  H VTSFTLVLEF+KGRL+N+FWK+DGC +CS ++ FVC+N ++CAIK  +CK 
Sbjct: 130 IMVADSNHIVTSFTLVLEFQKGRLENMFWKKDGCSKCSGDSKFVCLNKEECAIKPQNCKN 189

Query: 193 RGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTSQYNKIF 247
           +GG VDCSL IQLAFSGTDKH +  NSWYEV+ L+QYSL+ LYSNLKDSLT+ +  IF
Sbjct: 190 QGGQVDCSLGIQLAFSGTDKHYTALNSWYEVANLKQYSLYGLYSNLKDSLTNPFKNIF 246

BLAST of Tan0019795 vs. TAIR 10
Match: AT3G44150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cultured cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11800.1); Has 76 Blast hits to 75 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 298.9 bits (764), Expect = 3.8e-81
Identity = 144/236 (61.02%), Positives = 178/236 (75.42%), Query Frame = 0

Query: 11  AVLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTFNKTLKLSPCD 70
           AV+L +        +G+ N ++SPC+DT +++SDGFT G+AF++   F  N+T+ LSPCD
Sbjct: 14  AVILTVALGGDSGGSGNTNTIYSPCSDTRIQRSDGFTFGIAFSSRPSFFINQTVLLSPCD 73

Query: 71  SRLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPSSNGYMVAFGGRKYAARSPPIF 130
            RL+L   NS  SVFRPK+DEISLL++NT+   + F  +  GYMVAF GRKYAARS P F
Sbjct: 74  RRLSLAAMNSQFSVFRPKIDEISLLSINTS---AFFPDNYGGYMVAFAGRKYAARSIPAF 133

Query: 131 VADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGRCSNNNTFVCINNQDCAIKTNSCKYRG 190
           +A+    VTSFTLV+EF+KGRLQNL+WKRDGC  C  N  FVC+N QDCAI+T SCK RG
Sbjct: 134 IANSTFIVTSFTLVMEFQKGRLQNLYWKRDGCASCKGNQNFVCLNKQDCAIRTPSCKGRG 193

Query: 191 GPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTSQYNKIF 247
           G VDCSL IQLAFSGTDKHL+V NSWYEV  L+QYSL+ LYSNLK SLT+Q+N  F
Sbjct: 194 GAVDCSLGIQLAFSGTDKHLAVLNSWYEVENLKQYSLYGLYSNLKSSLTNQFNNFF 246

BLAST of Tan0019795 vs. TAIR 10
Match: AT2G15910.1 (CSL zinc finger domain-containing protein )

HSP 1 Score: 232.6 bits (592), Expect = 3.3e-61
Identity = 119/240 (49.58%), Positives = 169/240 (70.42%), Query Frame = 0

Query: 12  VLLLLVTAAAIVEAGDNNRVFSPCTDTTVEKSDGFTLGLAFATEQKFTFNKTLKLSPCDS 71
           ++++++     V A DNN V+SPC+DT + K DGFT+G+A ++++ F F   ++LSPCD+
Sbjct: 129 IMMIVMMVDDWVGAADNNPVYSPCSDTQISKGDGFTIGIAISSKEAF-FLDQVQLSPCDT 188

Query: 72  RLALTNGNSLISVFRPKVDEISLLTVNTTPSVSNFNPS-SNGYMVAFGGRKYAARSPPIF 131
           RL L    + +++FRPKVDEISLL+++T    S FNPS + G+MV F G KYAARS P+ 
Sbjct: 189 RLGLAAKMAQLALFRPKVDEISLLSIDT----SKFNPSEAGGFMVGFAGSKYAARSYPVK 248

Query: 132 VADEQHTVTSFT---------LVLEFEKGRLQNLFWKRDGCGRC--SNNNTFVCINNQDC 191
           VAD  +T+T+FT         LVLEF+KG LQNLFWK  GC  C  + +++ VC+N  DC
Sbjct: 249 VADGSNTITAFTLVMKLTLSPLVLEFQKGVLQNLFWKSFGCDLCKGTGSSSSVCLNGTDC 308

Query: 192 AIKTNSCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLT 240
           A+ T+ CK  GG  +C++ IQ+AFSGTD++L   N+WYEV+ LRQYSL +LY+N  DSL+
Sbjct: 309 AVPTSKCKANGGQANCNIGIQVAFSGTDRNLESLNTWYEVNNLRQYSLTDLYANAVDSLS 363

BLAST of Tan0019795 vs. TAIR 10
Match: AT3G48630.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G44150.1); Has 64 Blast hits to 64 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 64; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 59.7 bits (143), Expect = 3.8e-09
Identity = 26/52 (50.00%), Positives = 34/52 (65.38%), Query Frame = 0

Query: 113 YMVAFGGRKYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCGR 165
           Y V   G +  +   P F+A+    VTSFT V+EF+KGRLQNL+WKRD C +
Sbjct: 2   YNVGTRGSEIRSEVDPAFIANSTFIVTSFTWVMEFQKGRLQNLYWKRDVCAK 53

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value     Identity (%)   Description
XP_023538924.1    2.2e-123    92.31          uncharacterized protein LOC111799707 [Cucurbita pepo subsp. pepo]
KAG6596627.1      1.9e-122    91.09          hypothetical protein SDJN03_09807, partial [Cucurbita argyrosperma subsp. sorori...
XP_022939140.1    2.4e-122    91.09          uncharacterized protein LOC111445105 [Cucurbita moschata]
XP_023005908.1    7.8e-121    90.69          uncharacterized protein LOC111498771 [Cucurbita maxima]
KAG7028165.1      3.0e-120    87.26          hypothetical protein SDJN02_09345 [Cucurbita argyrosperma subsp. argyrosperma]

ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A6J1FLU2        1.2e-122    91.09          uncharacterized protein LOC111445105 OS=Cucurbita moschata OX=3662 GN=LOC1114451...
A0A6J1L3G6        3.8e-121    90.69          uncharacterized protein LOC111498771 OS=Cucurbita maxima OX=3661 GN=LOC111498771...
A0A6J1J6R6        2.1e-119    90.28          uncharacterized protein LOC111481736 OS=Cucurbita maxima OX=3661 GN=LOC111481736...
A0A6J1F5Z3        1.3e-118    89.47          uncharacterized protein LOC111442414 OS=Cucurbita moschata OX=3662 GN=LOC1114424...
A0A5A7TKL4        6.6e-118    88.62          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

TAIR 10
Match Name        E-value     Identity (%)   Description
AT3G11800.1       1.7e-81     61.34          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G44150.1       3.8e-81     61.02          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G15910.1       3.3e-61     49.58          CSL zinc finger domain-containing protein
AT3G48630.1       3.8e-09     50.00          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                             Source    Source Term      Source Description     Alignment
IPR044248   Diphthamide biosynthesis protein 3/4-like   PANTHER   PTHR21454        DPH3 HOMOLOG-RELATED   coord: 11..246
None        No IPR available                            PANTHER   PTHR21454:SF33   SUBFAMILY NOT NAMED    coord: 11..246

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0019795.1    Tan0019795.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0017183 peptidyl-diphthamide biosynthetic process from peptidyl-histidine
biological_process GO:0002098 tRNA wobble uridine modification
cellular_component GO:0005829 cytosol
cellular_component GO:0005634 nucleus
molecular_function GO:0046872 metal ion binding