Tan0009302 (gene) Snake gourd v1

Overview
Name: Tan0009302
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein.
Location: LG04: 80686754 .. 80690719 (+)
RNA-Seq Expression: Tan0009302
Synteny: Tan0009302
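
As a worked reading of the Location field, the sketch below splits the string into linkage group, start, end, and strand, then computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Location string copied from the Overview above.
LOCATION = "LG04: 80686754 .. 80690719 (+)"

def parse_location(text):
    """Split 'SEQ: START .. END (STRAND)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(LOCATION)
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 3966 bp on the plus strand of LG04.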
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATCCACAACAAATTATGAAAACAAAACAGAGCCCCACAAATCACCTCTTTTCCCCTGCCCAGAAAGGAAAGGAAAAACCAGAGGAAACTACTTTTGTTGTTCTTCTTCTACAATCCAAAACTGGTTACAGAGATTGATGATTCCAATTTCCAAAAATTGCCCACATTACCCTCTTTTCCAAACCAAACGATCGTCGAATTTGAACTTTTGTTCTTGAATTTCTTGCTCGGAAAACTGTTGAAGCTGCCCTGATGGGCGTCGAATCGAACTCAGCGCCGCCGCCGCCATCATCATCGTCTTCTACGCCATCTCCGAGCGGGAAGCGGGCCAGAGATCCCGAGGATGAAGTTTATCTCGACAATTTCCACTCTCACAAGCGCTACCTCAGTGAGGTCCCCATTTTGTGCCCTTCACTTTACTGTTAGGTCATCTGAAACTTTTTCTAGACATCTTTTTTTGTTTGATCGAGAAATGCTGCCGGTTTGTAAGTTTCACTAAAAATTTCAGCGTTTAGGTGGCGTTTCTTCATTTGGGTCAATTTTGCTCTTTAAGTTTGTGTCATTCGTTTGTGTATTGCCTTTCTTGGATGCAAAATTTCCTCCTAACGTACATCCTTTCTTTGTTAATGATCTTTCTCAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGTCGGAGTCCATGCTTTATCTAAGGTAAATGGAATTTGTTGCAATAAAAGTTTAACTGATTTTCTTTTAGATTCTTTTGTATATGAATGGGAATAATCACATTAAAGTTTACAAGCTCTTTCCAGTAAAAAACATCAGGATGTTGCTGGAAGAAATTGAAATTCTCCATTGTGTCTGGAAGGCCTTAGTGGATGGGAACTGTGAAACTAAATTCTTGTTTGATTCCTTAGTTTAGTTTTCTCGAGATGGAGTTTTCCTGAGAAACTATGATTCTCTTTTGCATGTAAATAAAAAGTTACATAACCATTATGGGCTAGCCTAGTGGTCTATAAGGACCAATGAAAGTAATAGAGGGCTTAGAGGGAATATCACACACTTAGGATTTAGTATGATACATGTTTTCTTGATAACTAAATGTCGTAGAGTTAGACAATTGTCTCGTGAGAGTAGTTGAAGTGTGCTCCAGTTAGCCTAGAAACTCTTGGGGAAAAAAGTTACGAAGCTTATTCAAAATGTTTGATACTAACATTTCATTTTAAGTTGTAATTGGCTCAAAAAAATAAAATAAAATCCAAAGTTTCAAACTTCAAGGTAGCTTTCCCAGATTCTTTTAGCTCGAGGGTATCCTATCTGCTTCATATGCCAATTGGAGAAGCTTATAAGTAGTTACTTATTTATGATTGTGCTGTTATAACCAGGCAATATTGATGCTCCTGATGATGGCATATTCCAGGAAAGGTAGTTTTCATGGTTTTAATATGTAATGTGAATGTGTTTTATGTAAGTTTTATTTGTTTAGTGAACCATTCCCCTTGATTTGAACGAAGGAGCATTTAGCATTCTTTGTTATAATCTGGTGGCTGAAAAGATGAGTAATTTCCAGGAGGACAATTTTCCCTTTTCCTATCTTCTGCCCCATCTGAAAATTTAAGAATTCTGCCTTATTATTATTGTTTTTTTCCTTTAATTGATCAATCATCCTTCTCTTTCTTCAGTTCGACTTTGTGTAGATTATATGTTGTGGTGAAGTACTTTCAAATCCTTGGTATGTGTGTCGACATTTTCTTCCATTAGCTACATTTGAAGAAGAGTCTTTCTAGTTCATAGATGCCCCAACTTGCTTTCATAAGACTTCACAACTTATTCTCTCTTTTTATGTGCTTTGAAGATTTACCAAGGATATAGCCGTAATCTTTTAAATTTTCATTCTGATTTTTCCGCTCCATGGAGTTAACTGTCTTTCTCCCCCTTTTTTTAATCAACATTGTAGGGGTGGAGATCCAA
ACAACTCACCTTTAGGACAAATGCCTTAACCAGTTGAAATATGTTGTACATTGCTAAAAATATGTTAGTATATTTTTATACTTCTATAAGGTATTAACTATTATATGAAACTTTGGTTGGAAAAGAGGTCATATGTCCCAATTTCTCTATCATGAAAAAAAAAACATAAAATTTCAGCTTGATTGATATTCGAGTTTCAGGAAACTTCTGTATTTCTACATTCACAATGTGTTTGTTTTTTTTAAAGCTTTTGAGAATTTATTGCTATTGTTATTGGTTTTTATTAAAAAAAAAACCATCTTTAATAGTTTAAGATTCCTTGAACTACATTCTTTTGTTCTTTGTAAGCTGATAAATGGTATATTTTTTAACCAAAGAAAAACAGTAGTTGGGATGCAACGGAAGAAACTATGATGGACTGGAGTAAAACGTTGTTATCTGAAATCGTGATCATCTTTAAAGTAAGCTAAGTTTTCTACATCACAAAAGACCTGCATATCACTTGTTCTCATCGGTGTTCCGCTGTCTAAACTAGTAGCCGGGTCACTGTAGGCAGAGCAAAGAGATGAAGTATATAGCTATTCGAACTCTCCTTCAATTTCGAATCCTTTTACTGCAACCCTTATTCTTTGGTTTTGAAAATTTTCCCTCATTTCAAATTAATTCCTCCTAGTTTGGCCAAATCACATGCATCTTTACAAATTCTATTATGTAGAAAATCAATTCCCCCTGTTCACCAAGAGTCATGAAGTACATTTGGGGTTTAGTACTTTGTTTAATATTTTTATCATTCCTAAAAGATGGTTTGGGTTTTCCCTATTTTGATTCTTTTTTTTTTTCTCTCTAACTCTACATCCCAGGGATGAAATGTCCTGGCAATATTCTCCGATGTCGGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCGACAAACTTATTTCCCTCGCAATCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATATCGATATCAGAGGCCATTGAGCGGGGTGACTCCTTCAACAGGTACTAATACTTCACTTGGATGTTCTATTAGTCCCATCACTAGCTTGCAGCCCCATCAGCGCGGATCAGATTCCGAAGGTCGTTTCCCATCATCTCCCAGCGATATATGCCACTCAGCAGACTTGAGAAGAGCTGCGCTCCTGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCCGAAGAACGACCTTATCCTTGCATAAAATCGTTGGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAGCCTGAATATAATGAACAAAAAACATGCAAGGACTTGAACAGAGATATGAAAGACAGTGAGTCTGGAGGTTAGTAAATACTAAAAAAATGTACAGAGGACAATGTATTCTGACATCACACACGGGACACATTGCTGCTGTGTGACCGAAGTGTTTGCCTTTTGGTTCAATCACCTTGGAAAATTCTCCTTTTGCATATAGCTTCCAAAATTTACTCGATTTCGCTTGTTCGAGGATGGTTTGGTCTTCTGCTAGGTTCTTTCTCGAATTCGTGGTGGCTTGTGCTATCTACATTGTTGAGCATGTGTGGAAGTTCTGCCTATCTATTTGATTCTTCCGAAGCACGAGTTGTGAAGATGTTCATTGTTGAGCATGTGTGGAAGCTCTGTTTATCTATCCGATTCATGCCGAAGCCCGAGTTGAGAGAAGATCTTCGTTGTGGAGCATGTGTGGAAGCTCTGTCTATCTATTTGATTCTACACGAGCTCGAGTCGAGACAAAATGCTGGGACCTTAATGGGTCATAAAGTTGAAGTTGCAAGTGTTCCATTTGTTTCATATTTGTCAGGCTGTTATTATTGAATTCTTTGTTCTAATTTGCCTTGGGAATTTGCAG

mRNA sequence

AAAATCCACAACAAATTATGAAAACAAAACAGAGCCCCACAAATCACCTCTTTTCCCCTGCCCAGAAAGGAAAGGAAAAACCAGAGGAAACTACTTTTGTTGTTCTTCTTCTACAATCCAAAACTGGTTACAGAGATTGATGATTCCAATTTCCAAAAATTGCCCACATTACCCTCTTTTCCAAACCAAACGATCGTCGAATTTGAACTTTTGTTCTTGAATTTCTTGCTCGGAAAACTGTTGAAGCTGCCCTGATGGGCGTCGAATCGAACTCAGCGCCGCCGCCGCCATCATCATCGTCTTCTACGCCATCTCCGAGCGGGAAGCGGGCCAGAGATCCCGAGGATGAAGTTTATCTCGACAATTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGTCGGAGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCGATGTCGGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCGACAAACTTATTTCCCTCGCAATCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATATCGATATCAGAGGCCATTGAGCGGGGTGACTCCTTCAACAGGTACTAATACTTCACTTGGATGTTCTATTAGTCCCATCACTAGCTTGCAGCCCCATCAGCGCGGATCAGATTCCGAAGGTCGTTTCCCATCATCTCCCAGCGATATATGCCACTCAGCAGACTTGAGAAGAGCTGCGCTCCTGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCCGAAGAACGACCTTATCCTTGCATAAAATCGTTGGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAGCCTGAATATAATGAACAAAAAACATGCAAGGACTTGAACAGAGATATGAAAGACAGTGAGTCTGGAGGTTAGTAAATACTAAAAAAATGTACAGAGGACAATGTATTCTGACATCACACACGGGACACATTGCTGCTGTGTGACCGAAGTGTTTGCCTTTTGGTTCAATCACCTTGGAAAATTCTCCTTTTGCATATAGCTTCCAAAATTTACTCGATTTCGCTTGTTCGAGGATGGTTTGGTCTTCTGCTAGGTTCTTTCTCGAATTCGTGGTGGCTTGTGCTATCTACATTGTTGAGCATGTGTGGAAGTTCTGCCTATCTATTTGATTCTTCCGAAGCACGAGTTGTGAAGATGTTCATTGTTGAGCATGTGTGGAAGCTCTGTTTATCTATCCGATTCATGCCGAAGCCCGAGTTGAGAGAAGATCTTCGTTGTGGAGCATGTGTGGAAGCTCTGTCTATCTATTTGATTCTACACGAGCTCGAGTCGAGACAAAATGCTGGGACCTTAATGGGTCATAAAGTTGAAGTTGCAAGTGTTCCATTTGTTTCATATTTGTCAGGCTGTTATTATTGAATTCTTTGTTCTAATTTGCCTTGGGAATTTGCAG

Coding sequence (CDS)

ATGGGCGTCGAATCGAACTCAGCGCCGCCGCCGCCATCATCATCGTCTTCTACGCCATCTCCGAGCGGGAAGCGGGCCAGAGATCCCGAGGATGAAGTTTATCTCGACAATTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGTCGGAGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCGATGTCGGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCGACAAACTTATTTCCCTCGCAATCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATATCGATATCAGAGGCCATTGAGCGGGGTGACTCCTTCAACAGGTACTAATACTTCACTTGGATGTTCTATTAGTCCCATCACTAGCTTGCAGCCCCATCAGCGCGGATCAGATTCCGAAGGTCGTTTCCCATCATCTCCCAGCGATATATGCCACTCAGCAGACTTGAGAAGAGCTGCGCTCCTGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCCGAAGAACGACCTTATCCTTGCATAAAATCGTTGGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAGCCTGAATATAATGAACAAAAAACATGCAAGGACTTGAACAGAGATATGAAAGACAGTGAGTCTGGAGGTTAG

Protein sequence

MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
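
The relationship between the CDS and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code (NCBI translation table 1) and uses only the first 60 nucleotides and the last 15 nucleotides of the CDS shown above; `translate` is an illustrative helper, not the annotation pipeline's own method.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt and last 15 nt of the CDS listed above.
CDS_START = "ATGGGCGTCGAATCGAACTCAGCGCCGCCGCCGCCATCATCATCGTCTTCTACGCCATCT"
CDS_END = "GAGTCTGGAGGTTAG"

print(translate(CDS_START))
print(translate(CDS_END))
```

The two fragments translate to `MGVESNSAPPPPSSSSSTPS` and `ESGG`, which agree with the start and end of the protein sequence listed above.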
Homology
BLAST of Tan0009302 vs. NCBI nr
Match: XP_004152357.1 (uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus] >KGN52842.1 hypothetical protein Csa_014618 [Cucumis sativus])

HSP 1 Score: 495.7 bits (1275), Expect = 2.4e-136
Identity = 254/270 (94.07%), Positives = 259/270 (95.93%), Query Frame = 0

Query: 1   MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60
           MGVESNSAPPPP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD
Sbjct: 1   MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60

Query: 61  PLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV 120
           PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV
Sbjct: 61  PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV 120

Query: 121 SPYRYQRPLSGVTPSTGTNTSLGCS-ISPITSLQPHQRGSDSEGRFPSSPSDICHSADLR 180
           SPYRYQRP SGV PSTGTNTSLGCS  SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLR
Sbjct: 121 SPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLR 180

Query: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSS 240
           RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSS
Sbjct: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSS 240

Query: 241 M--GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           M  GVSE EYNEQK+CKDLNRDMKDS SGG
Sbjct: 241 MGLGVSESEYNEQKSCKDLNRDMKDSRSGG 270
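
The percentages in an HSP summary line are simply the match counts divided by the alignment length. The sketch below parses the summary line of the first HSP and recomputes both values; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the `label = hits/length (pct%)` layout used on this page.

```python
import re

# Summary line of the first HSP (XP_004152357.1).
HSP_LINE = "Identity = 254/270 (94.07%), Positives = 259/270 (95.93%), Query Frame = 0"

def parse_hsp_stats(line):
    """Return {label: (hits, alignment_length, reported_percent)}."""
    stats = {}
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    for label, hits, length, pct in re.findall(pattern, line):
        stats[label] = (int(hits), int(length), float(pct))
    return stats

stats = parse_hsp_stats(HSP_LINE)
for label, (hits, length, reported) in stats.items():
    # Recompute the percentage from the raw counts and compare.
    recomputed = round(100 * hits / length, 2)
    print(label, recomputed, reported)
```

The alignment length (270) exceeds the query length (268) because gap columns in the alignment are counted as well.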

BLAST of Tan0009302 vs. NCBI nr
Match: XP_038904570.1 (uncharacterized protein LOC120090942 [Benincasa hispida])

HSP 1 Score: 494.2 bits (1271), Expect = 7.1e-136
Identity = 251/267 (94.01%), Positives = 258/267 (96.63%), Query Frame = 0

Query: 1   MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60
           MGVESNSAPPP  SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD
Sbjct: 1   MGVESNSAPPP--SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60

Query: 61  PLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV 120
           PLSENLMDSPARSESMLY R+EMSWQYSPMSEDSDDCRFCETSTNLFP+QSDSSVPTSPV
Sbjct: 61  PLSENLMDSPARSESMLYQREEMSWQYSPMSEDSDDCRFCETSTNLFPTQSDSSVPTSPV 120

Query: 121 SPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRR 180
           SPYRYQRP SGVTPSTGTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRR
Sbjct: 121 SPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRR 180

Query: 181 AALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM 240
           AALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAEERP  CIKSLVDER +QLEECSSM
Sbjct: 181 AALLRSVQMRAQPLGPSSMELPYCSMPEPGPNIEAEERPCSCIKSLVDERDFQLEECSSM 240

Query: 241 GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           GVSEPEYNE+K+CKDLNRDMKDSESGG
Sbjct: 241 GVSEPEYNEEKSCKDLNRDMKDSESGG 265

BLAST of Tan0009302 vs. NCBI nr
Match: XP_008454305.1 (PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo] >KAA0044407.1 uncharacterized protein E6C27_scaffold46G001360 [Cucumis melo var. makuwa] >TYK29534.1 uncharacterized protein E5676_scaffold655G001370 [Cucumis melo var. makuwa])

HSP 1 Score: 490.0 bits (1260), Expect = 1.3e-134
Identity = 251/270 (92.96%), Positives = 258/270 (95.56%), Query Frame = 0

Query: 1   MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG 60
           MGVESNSA PPPP+SSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG
Sbjct: 1   MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG 60

Query: 61  DPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP 120
           +PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP
Sbjct: 61  EPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP 120

Query: 121 VSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLR 180
           VSPYRYQRP SG+ PS GTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLR
Sbjct: 121 VSPYRYQRPFSGMAPSNGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLR 180

Query: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSS 240
           RAALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSS
Sbjct: 181 RAALLRSVQMRAQPAGPSSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSS 240

Query: 241 M--GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           M  GVSE EYNEQK+CKDLNRDMKDS+SGG
Sbjct: 241 MGLGVSESEYNEQKSCKDLNRDMKDSQSGG 270

BLAST of Tan0009302 vs. NCBI nr
Match: XP_022934215.1 (uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata])

HSP 1 Score: 486.1 bits (1250), Expect = 1.9e-133
Identity = 250/270 (92.59%), Positives = 256/270 (94.81%), Query Frame = 0

Query: 1   MGVESNSAPPPP---SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLT 60
           MGVESNS PPPP   SSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLT
Sbjct: 1   MGVESNSGPPPPPPSSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLT 60

Query: 61  VGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPT 120
           VGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFCETSTNLFPSQSDSSVPT
Sbjct: 61  VGDPLSENLMDSPARSESMLYLRDEMSCQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPT 120

Query: 121 SPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSAD 180
           SPVSPYRYQRP SG+TPSTGTNTSLGC+  P+TSLQPHQRGSDSEGRFPSSPSDICHSAD
Sbjct: 121 SPVSPYRYQRPFSGMTPSTGTNTSLGCNTGPVTSLQPHQRGSDSEGRFPSSPSDICHSAD 180

Query: 181 LRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEEC 240
           LRRAALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL EC
Sbjct: 181 LRRAALLRSVQMRAQPHGPSSMELPYCSMPEPGPNIEAEERSCSFIKSLVDERVYQLGEC 240

Query: 241 SSMGVSEPEYNEQKTCKDLNRDMKDSESGG 268
           SSMGVSEPEYNEQK+CKDLNR+MKDSESGG
Sbjct: 241 SSMGVSEPEYNEQKSCKDLNREMKDSESGG 270

BLAST of Tan0009302 vs. NCBI nr
Match: KAG6581290.1 (hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia] >KAG7018013.1 hypothetical protein SDJN02_19879 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 485.7 bits (1249), Expect = 2.5e-133
Identity = 250/271 (92.25%), Positives = 256/271 (94.46%), Query Frame = 0

Query: 1   MGVESNSAPPPP----SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGL 60
           MGVESNS PPPP    SSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGL
Sbjct: 1   MGVESNSGPPPPPPPSSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGL 60

Query: 61  TVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVP 120
           TVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFCETSTNLFPSQSDSSVP
Sbjct: 61  TVGDPLSENLMDSPARSESMLYLRDEMSCQYSPMSEDSDDCRFCETSTNLFPSQSDSSVP 120

Query: 121 TSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSA 180
           TSPVSPYRYQRP SG+TPSTGTNTSLGC+  P+TSLQPHQRGSDSEGRFPSSPSDICHSA
Sbjct: 121 TSPVSPYRYQRPFSGMTPSTGTNTSLGCNTGPVTSLQPHQRGSDSEGRFPSSPSDICHSA 180

Query: 181 DLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEE 240
           DLRRAALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL E
Sbjct: 181 DLRRAALLRSVQMRAQPHGPSSMELPYCSMPEPGPNIEAEERSCSFIKSLVDERVYQLGE 240

Query: 241 CSSMGVSEPEYNEQKTCKDLNRDMKDSESGG 268
           CSSMGVSEPEYNEQK+CKDLNR+MKDSESGG
Sbjct: 241 CSSMGVSEPEYNEQKSCKDLNREMKDSESGG 271

BLAST of Tan0009302 vs. ExPASy TrEMBL
Match: A0A0A0KWN0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G003080 PE=4 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 1.2e-136
Identity = 254/270 (94.07%), Positives = 259/270 (95.93%), Query Frame = 0

Query: 1   MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60
           MGVESNSAPPPP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD
Sbjct: 1   MGVESNSAPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60

Query: 61  PLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV 120
           PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV
Sbjct: 61  PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV 120

Query: 121 SPYRYQRPLSGVTPSTGTNTSLGCS-ISPITSLQPHQRGSDSEGRFPSSPSDICHSADLR 180
           SPYRYQRP SGV PSTGTNTSLGCS  SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLR
Sbjct: 121 SPYRYQRPFSGVAPSTGTNTSLGCSTTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLR 180

Query: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSS 240
           RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSS
Sbjct: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSS 240

Query: 241 M--GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           M  GVSE EYNEQK+CKDLNRDMKDS SGG
Sbjct: 241 MGLGVSESEYNEQKSCKDLNRDMKDSRSGG 270

BLAST of Tan0009302 vs. ExPASy TrEMBL
Match: A0A5A7TRC2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001370 PE=4 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 6.5e-135
Identity = 251/270 (92.96%), Positives = 258/270 (95.56%), Query Frame = 0

Query: 1   MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG 60
           MGVESNSA PPPP+SSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG
Sbjct: 1   MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG 60

Query: 61  DPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP 120
           +PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP
Sbjct: 61  EPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP 120

Query: 121 VSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLR 180
           VSPYRYQRP SG+ PS GTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLR
Sbjct: 121 VSPYRYQRPFSGMAPSNGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLR 180

Query: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSS 240
           RAALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSS
Sbjct: 181 RAALLRSVQMRAQPAGPSSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSS 240

Query: 241 M--GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           M  GVSE EYNEQK+CKDLNRDMKDS+SGG
Sbjct: 241 MGLGVSESEYNEQKSCKDLNRDMKDSQSGG 270

BLAST of Tan0009302 vs. ExPASy TrEMBL
Match: A0A1S3BXT2 (uncharacterized protein LOC103494744 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494744 PE=4 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 6.5e-135
Identity = 251/270 (92.96%), Positives = 258/270 (95.56%), Query Frame = 0

Query: 1   MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG 60
           MGVESNSA PPPP+SSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG
Sbjct: 1   MGVESNSAPPPPPTSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG 60

Query: 61  DPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP 120
           +PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP
Sbjct: 61  EPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSP 120

Query: 121 VSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLR 180
           VSPYRYQRP SG+ PS GTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLR
Sbjct: 121 VSPYRYQRPFSGMAPSNGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLR 180

Query: 181 RAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSS 240
           RAALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSS
Sbjct: 181 RAALLRSVQMRAQPAGPSSMELPYCSMPEPGPNIEAEDRPCSCIKSLVDERVYQLEECSS 240

Query: 241 M--GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           M  GVSE EYNEQK+CKDLNRDMKDS+SGG
Sbjct: 241 MGLGVSESEYNEQKSCKDLNRDMKDSQSGG 270

BLAST of Tan0009302 vs. ExPASy TrEMBL
Match: A0A6J1F722 (uncharacterized protein LOC111441454 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441454 PE=4 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 9.3e-134
Identity = 250/270 (92.59%), Positives = 256/270 (94.81%), Query Frame = 0

Query: 1   MGVESNSAPPPP---SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLT 60
           MGVESNS PPPP   SSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLT
Sbjct: 1   MGVESNSGPPPPPPSSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLT 60

Query: 61  VGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPT 120
           VGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFCETSTNLFPSQSDSSVPT
Sbjct: 61  VGDPLSENLMDSPARSESMLYLRDEMSCQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPT 120

Query: 121 SPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSAD 180
           SPVSPYRYQRP SG+TPSTGTNTSLGC+  P+TSLQPHQRGSDSEGRFPSSPSDICHSAD
Sbjct: 121 SPVSPYRYQRPFSGMTPSTGTNTSLGCNTGPVTSLQPHQRGSDSEGRFPSSPSDICHSAD 180

Query: 181 LRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEEC 240
           LRRAALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL EC
Sbjct: 181 LRRAALLRSVQMRAQPHGPSSMELPYCSMPEPGPNIEAEERSCSFIKSLVDERVYQLGEC 240

Query: 241 SSMGVSEPEYNEQKTCKDLNRDMKDSESGG 268
           SSMGVSEPEYNEQK+CKDLNR+MKDSESGG
Sbjct: 241 SSMGVSEPEYNEQKSCKDLNREMKDSESGG 270

BLAST of Tan0009302 vs. ExPASy TrEMBL
Match: A0A6J1J464 (uncharacterized protein LOC111482488 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482488 PE=4 SV=1)

HSP 1 Score: 483.4 bits (1243), Expect = 6.1e-133
Identity = 246/267 (92.13%), Positives = 254/267 (95.13%), Query Frame = 0

Query: 1   MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60
           MGVESNSAPPPP SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD
Sbjct: 1   MGVESNSAPPPPPSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD 60

Query: 61  PLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPV 120
            LSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFCETSTNLFPSQSD+SVPTSPV
Sbjct: 61  SLSENLMDSPARSESMLYLRDEMSCQYSPMSEDSDDCRFCETSTNLFPSQSDNSVPTSPV 120

Query: 121 SPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRR 180
           SPYRYQRP SG+TPST TNTSLGC+ SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRR
Sbjct: 121 SPYRYQRPFSGMTPSTATNTSLGCNTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRR 180

Query: 181 AALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM 240
           AALLRSVQMRAQP GPSSMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL ECS+M
Sbjct: 181 AALLRSVQMRAQPHGPSSMELPYCSMPEPGPNIEAEERSCSFIKSLVDERVYQLGECSAM 240

Query: 241 GVSEPEYNEQKTCKDLNRDMKDSESGG 268
           GVSEPEYNEQK+CKDLNR+MKD ESGG
Sbjct: 241 GVSEPEYNEQKSCKDLNREMKDGESGG 267

BLAST of Tan0009302 vs. TAIR 10
Match: AT2G25920.1 (BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2); Has 131 Blast hits to 125 proteins in 54 species: Archae - 0; Bacteria - 50; Metazoa - 12; Fungi - 12; Plants - 41; Viruses - 0; Other Eukaryotes - 16 (source: NCBI BLink). )

HSP 1 Score: 224.6 bits (571), Expect = 9.7e-59
Identity = 149/267 (55.81%), Positives = 170/267 (63.67%), Query Frame = 0

Query: 4   ESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLS 63
           E+N  P P S S    SP GKR RDPEDEVYLDN  S KRYLSEIMA SLNGLTVGD L 
Sbjct: 26  EANEPPVPISVS----SPCGKRTRDPEDEVYLDNLRSQKRYLSEIMACSLNGLTVGDSLP 85

Query: 64  ENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE--TSTNLFPSQSDSSVPTSPVS 123
            N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  T+T    S    S PTSPVS
Sbjct: 86  VNMLESPARSESFLYHRDDLSLQYSPMSEDSDEARFCEDPTATASTSSSQPESRPTSPVS 145

Query: 124 PYRYQRPLSGVTPSTGTNTSL----GCSISPI------TSLQPHQRGSDSEGRFPSSPSD 183
           PYRYQRPL+       + T L     C  S I      T+ Q  QRGSD+EGRFPSSPSD
Sbjct: 146 PYRYQRPLTSTNSPQPSPTILHHSHTCPASMISNAATTTTPQSRQRGSDTEGRFPSSPSD 205

Query: 184 ICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERV 243
           ICHS DLRR ALLRSVQMR QP G SS   P         NI+ EER   C KS+ ++R 
Sbjct: 206 ICHSGDLRRTALLRSVQMRTQPCGYSSSSGP--------SNIDGEER--MCSKSMEEDRG 265

Query: 244 YQL-EECSSMGVSEPEYNEQKTCKDLN 258
           Y   E+     VS    ++ K+CK L+
Sbjct: 266 YNKGEDIPYAEVS----SKSKSCKALD 274

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value    Identity (%)  Description
XP_004152357.1   2.4e-136   94.07         uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus] >KGN52842.1 hy...
XP_038904570.1   7.1e-136   94.01         uncharacterized protein LOC120090942 [Benincasa hispida]
XP_008454305.1   1.3e-134   92.96         PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo] >KAA00...
XP_022934215.1   1.9e-133   92.59         uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]
KAG6581290.1     2.5e-133   92.25         hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL
Match Name       E-value    Identity (%)  Description
A0A0A0KWN0       1.2e-136   94.07         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G003080 PE=4 SV=1
A0A5A7TRC2       6.5e-135   92.96         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BXT2       6.5e-135   92.96         uncharacterized protein LOC103494744 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1F722       9.3e-134   92.59         uncharacterized protein LOC111441454 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J464       6.1e-133   92.13         uncharacterized protein LOC111482488 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

TAIR 10
Match Name       E-value    Identity (%)  Description
AT2G25920.1      9.7e-59    55.81         BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing ...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description    Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 191..216
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 103..172
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 103..168
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 239..267
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 1..31
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 248..267
None      No IPR available  PANTHER      PTHR35717    OS05G0156200 PROTEIN  coord: 3..265
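
Several of the mobidb-lite disorder rows overlap (for example 103..172 and 103..168). One way to summarise them is to merge overlapping intervals and count the residues covered. This is a sketch only: it assumes 1-based inclusive coordinates and takes the protein length (268) from the BLAST query coordinates above; `merge_intervals` is a hypothetical helper.

```python
# Disorder predictions copied from the InterPro rows above.
DISORDER = [(191, 216), (103, 172), (103, 168), (239, 267), (1, 31), (248, 267)]
PROTEIN_LENGTH = 268  # taken from the BLAST query coordinates (1..268)

def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals; coordinates are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

merged = merge_intervals(DISORDER)
covered = sum(end - start + 1 for start, end in merged)
print(merged)
print(covered, round(covered / PROTEIN_LENGTH, 3))
```

Under these assumptions the six rows collapse to four regions covering 156 of the 268 residues.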

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0009302.1  Tan0009302.1  mRNA