Tan0000804 (gene) Snake gourd v1

Overview
NameTan0000804
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHTH myb-type domain-containing protein
LocationLG05: 3828793 .. 3832142 (+)
RNA-Seq ExpressionTan0000804
SyntenyTan0000804
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTTATTTTTTCCTTCCATTCCAAACGACATAACGTTTGAATCGTCCATGAAGCATCTCTCGAATTCAAATTTCGCTCTCTCTGCAATCTCCGGCGCCCGGCGGTTTGTCCTTTTTACCTTTATCATCATCGTCTTCCTTTCTTCGCCAGCTCCCTTCGTCTAACATTGAAGAAAATCTCCAATTTATTGAAATCTCAGTTCCCGTTGTATAGGGATTCGGAGCAACATTTCTTTTTTGTTGCGATTCTTTTCGTCAGTTAGGGTTTTTTTCTTTCTCCCGTCTCTTCCATCTCCGATTTCATTTTGACCATGGACGACGACATTTGCCGTTGGATTATCGAATTCATACTCCGGAGTTCAATGGACGACCACTTGCTGAAGAGAATTCTTGCGGTTATTCCCTTGCCGGACATGGATTTCCGCCTGAAGAAAACAGCGCTTTTACGGGCCATTGGGAGCGAAATCTCAGAAGCTGTCGTAACGGAGAAATTACTCGAAATTTTCGAGATGATTGAGCAGTTGGACAAGACTGAAGGCCTTGCAATCATGGAGTCAATGAAAGCTGCATATTGTGCTGTGGCAGTGGAGTGCACGGTGAAGTACTTGGCGGTGGGAGGCACTGACACGAACGGAAAGTATTTTGATGCAATGAGGAGGATTTGGAGAGGGAGAGTGACGGAATTGAAGAGATCAGGTAATAGCGAGCTGGTTTCACGTGAATTGAAAGGATGGAAGGATGAAGTAGAAGCGGCACTCCAGGATAAGAGTGTTTGGAAGAAGCTGGTGAATATGAATACTAGATATGAAGCTTTGAAACTGATAGGAGATTATTTGGGTGAGGCATGGGGCATTCTTGGCCCGCCATTCCTTCAATTGTCTGCTTCTTTGATGGATAAAAGGATGACGAATGAATTGCAGTCTGTTCAATTGGTGCGGGCAATAGACAAAACTGCCATTGCGAGTGAAGATGTTGGTGGTGGGATTGAATTGCCCTCCCAAACTGAAAACTTGTCGAGACTGGAGCGTCAGGGGAGCGTGCCAGTCGTTAGCCAAGCTGAAACAGAGAGAAATGATTTGCTAAATATGAATCGGGATTCGGGTGTCAACTATGGTTCTAAACAATCGGCTATTGTTGGAAAGAACACTGAGACAGAAACAGGTGAAGGACAAGAATCAGTTGAGAAGGAGGTTGCAGTTTTGCCCGATCCAAGTCCAAGCCGACATGGTACTGATATGCTAGCTCACATCTTTATCATTCTATTATGTTCTTAATCAAATTAAGATTTTTTTGTAACCTTCAATTTTGAAATTTTAGGTGCAAGTATCTTCAATCTAGTTATTTCATTTTGTGTACTCAGAAAATTTGAAGACTTCTGTATTGCCAAGATGCAAATCCCTTGCATCCCATAGACGTGTTAGAGGAGGGGCTAAAATTAGTCATCTTGAGGACTTGGAGAATGATAGTTCTTCTGGAAAATATGCTTGCCTACAGACTCCTGAAGTTGACCGGGTGCGGGAAGCGCTTAAAACCAGCTCTTCAGAGTTGCAAGCATTGGTGAAAGATCCACTTCCTGATGCGTTACGCATAGCAGAATCAGTAGCTCATGATCTCGCTGAAAAGAATAAAACTCGCGAGCATTCCTTGGAGGACCAAAATGATGCAGGTGCTGCTAATTCAGCCATGAACAAGGATGCTGTGCCTCTTCGATCTGTGAATGCAGTTCTTAAAAATCCATGTCATGGCCATCAGACAATTGTTCCTCGACCTAGCATAATGGAACGCAACAGTACAGCTTGTACGTATGAGGTAATGTAATCCATGTTTGATTCCATACTAACATGCCTTTTCCGCTTTCTATTTTATAGTTGCATGAGATAGTTTCTACATTGCTTATTAAAGGCTTCGTTCTCGATTGTGCGATACAAATATTTTATGTTTTTGGGATTATCATTGCTAGGATGAATGCTGCTGACTTTGAATGAAATGTTGGAGCATCCTCTTTTGGGTTAATATATGGTGTTGTCAACTATTTTCATTTTAATCAAGAATTATAGTGTTATTGCAGTGAATTCTTTTTGTACTGCAAGAAAAGAAAATATGCGAACATATACAATTTTATTATGGTTGCATCATTAAATGTTATATTGTATTTTGTACTAGAAGGGATCTTTGACTGATTTCTTTGGTAGTGGGTTGTTATGATAGTAGGACTACAGTAAGTACTGGTTTTATGCTTTTTGTATACGTACGTTTGAAACTCCTTCTCTAGGAGATGGAATGTTGATCTTCCTGACTGGATATTTCCTCCCTCCTAACGGAGAGGATTATTCTATGTTCTTAATTTTTTTATTTACTTTTAAATTTACTGCTTCTATGTGAGACTTTTTTTCCTTTTTGTTGCAGTGGAATGATTCAATAGATGGTTCGCCAGAAGGAAATCACGCCAGTCGACTTCACCTTCCTAGTCCCAAGAGAAAGGTCATTTCTCCCTTGAAGAAGTATGAAGAAGCCAAAATGGTTCGGAGAAGGAAGTGTAAGAGGTGGAGCTTGCTTGAAGAAGACACCTTAAGAACTGCTGTGCAGAGGTATGTGCTTTCAATTGTTATTATTTTCTTCTGTTTTTAGAGGGATGGATAGAAAGGATTTGATTGTTGCCCCTACTCAAACACACACATTATCTAGTTAAAAACTTGAAATTGTACTATGATTGGTGTTTGCGGTCCTTTGACATTTAGTATGATGTGTGTGTGTATTTTTTTTTTTTTTTGATAACTGGGCGTCAGAGCTTTGCTCTCCTATATCTGGGCACCCGCGCCTACCTCACGGCCCGAACCCAGGAAACATCAAGGATTTTTATTTATTAAGCTCATCTGAGAGCTTGAACTTGAGACCTCTAAGTCAGTATAACCAAGAGATCCCAAGCCCTTACCAACGGGGCTGCCGCTTGGGGGCCATTTAGTATGATGTTCTATCAGCTTAAAATTGTTGCTCTTTCAGTTAATAATTTTGTTTGTGTTGGTAATGTAGATTTGGGAAAGGAAATTGGAAGCTCATCTTAAACAGCTATCGTGATATATTTGATGAGAGAACAGAGGTAACCAATTCCACATTGATCTTACATTATCTTTTGTGTTTTACCTGTTTGTAAGTGGTGGTTGGATTGCTTGTCTCAGGTTGATCTAAAGGATAAGTGGAGAAACATGACCAGATACTAATGTCTTAGCAAATCCTGATCCAAACTTGTGTATTGTAGATTAACCCTAAATATAGGCTATTTGCCCAAATTTAAATAGTCGCATCATGGATTTTGATTGTAATATAGAAAAATTGAAAATATCAATGCAAAA

mRNA sequence

TTTTTATTTTTTCCTTCCATTCCAAACGACATAACGTTTGAATCGTCCATGAAGCATCTCTCGAATTCAAATTTCGCTCTCTCTGCAATCTCCGGCGCCCGGCGGTTTGTCCTTTTTACCTTTATCATCATCGTCTTCCTTTCTTCGCCAGCTCCCTTCGTCTAACATTGAAGAAAATCTCCAATTTATTGAAATCTCAGTTCCCGTTGTATAGGGATTCGGAGCAACATTTCTTTTTTGTTGCGATTCTTTTCGTCAGTTAGGGTTTTTTTCTTTCTCCCGTCTCTTCCATCTCCGATTTCATTTTGACCATGGACGACGACATTTGCCGTTGGATTATCGAATTCATACTCCGGAGTTCAATGGACGACCACTTGCTGAAGAGAATTCTTGCGGTTATTCCCTTGCCGGACATGGATTTCCGCCTGAAGAAAACAGCGCTTTTACGGGCCATTGGGAGCGAAATCTCAGAAGCTGTCGTAACGGAGAAATTACTCGAAATTTTCGAGATGATTGAGCAGTTGGACAAGACTGAAGGCCTTGCAATCATGGAGTCAATGAAAGCTGCATATTGTGCTGTGGCAGTGGAGTGCACGGTGAAGTACTTGGCGGTGGGAGGCACTGACACGAACGGAAAGTATTTTGATGCAATGAGGAGGATTTGGAGAGGGAGAGTGACGGAATTGAAGAGATCAGGTAATAGCGAGCTGGTTTCACGTGAATTGAAAGGATGGAAGGATGAAGTAGAAGCGGCACTCCAGGATAAGAGTGTTTGGAAGAAGCTGGTGAATATGAATACTAGATATGAAGCTTTGAAACTGATAGGAGATTATTTGGGTGAGGCATGGGGCATTCTTGGCCCGCCATTCCTTCAATTGTCTGCTTCTTTGATGGATAAAAGGATGACGAATGAATTGCAGTCTGTTCAATTGGTGCGGGCAATAGACAAAACTGCCATTGCGAGTGAAGATGTTGGTGGTGGGATTGAATTGCCCTCCCAAACTGAAAACTTGTCGAGACTGGAGCGTCAGGGGAGCGTGCCAGTCGTTAGCCAAGCTGAAACAGAGAGAAATGATTTGCTAAATATGAATCGGGATTCGGGTGTCAACTATGGTTCTAAACAATCGGCTATTGTTGGAAAGAACACTGAGACAGAAACAGGTGAAGGACAAGAATCAGTTGAGAAGGAGGTTGCAGTTTTGCCCGATCCAAGTCCAAGCCGACATGAAAATTTGAAGACTTCTGTATTGCCAAGATGCAAATCCCTTGCATCCCATAGACGTGTTAGAGGAGGGGCTAAAATTAGTCATCTTGAGGACTTGGAGAATGATAGTTCTTCTGGAAAATATGCTTGCCTACAGACTCCTGAAGTTGACCGGGTGCGGGAAGCGCTTAAAACCAGCTCTTCAGAGTTGCAAGCATTGGTGAAAGATCCACTTCCTGATGCGTTACGCATAGCAGAATCAGTAGCTCATGATCTCGCTGAAAAGAATAAAACTCGCGAGCATTCCTTGGAGGACCAAAATGATGCAGGTGCTGCTAATTCAGCCATGAACAAGGATGCTGTGCCTCTTCGATCTGTGAATGCAGTTCTTAAAAATCCATGTCATGGCCATCAGACAATTGTTCCTCGACCTAGCATAATGGAACGCAACAGTACAGCTTGTACGTATGAGTGGAATGATTCAATAGATGGTTCGCCAGAAGGAAATCACGCCAGTCGACTTCACCTTCCTAGTCCCAAGAGAAAGGTCATTTCTCCCTTGAAGAAGTATGAAGAAGCCAAAATGGTTCGGAGAAGGAAGTGTAAGAGGTGGAGCTTGCTTGAAGAAGACACCTTAAGAACTGCTGTGCAGAGATTTGGGAAAGGAAATTGGAAGCTCATCTTAAACAGCTATCGTGATATATTTGATGAGAGAACAGAGGTTGATCTAAAGGATAAGTGGAGAAACATGACCAGATACTAATGTCTTAGCAAATCCTGATCCAAACTTGTGTATTGTAGATTAACCCTAAATATAGGCTATTTGCCCAAATTTAAATAGTCGCATCATGGATTTTGATTGTAATATAGAAAAATTGAAAATATCAATGCAAAA

Coding sequence (CDS)

ATGGACGACGACATTTGCCGTTGGATTATCGAATTCATACTCCGGAGTTCAATGGACGACCACTTGCTGAAGAGAATTCTTGCGGTTATTCCCTTGCCGGACATGGATTTCCGCCTGAAGAAAACAGCGCTTTTACGGGCCATTGGGAGCGAAATCTCAGAAGCTGTCGTAACGGAGAAATTACTCGAAATTTTCGAGATGATTGAGCAGTTGGACAAGACTGAAGGCCTTGCAATCATGGAGTCAATGAAAGCTGCATATTGTGCTGTGGCAGTGGAGTGCACGGTGAAGTACTTGGCGGTGGGAGGCACTGACACGAACGGAAAGTATTTTGATGCAATGAGGAGGATTTGGAGAGGGAGAGTGACGGAATTGAAGAGATCAGGTAATAGCGAGCTGGTTTCACGTGAATTGAAAGGATGGAAGGATGAAGTAGAAGCGGCACTCCAGGATAAGAGTGTTTGGAAGAAGCTGGTGAATATGAATACTAGATATGAAGCTTTGAAACTGATAGGAGATTATTTGGGTGAGGCATGGGGCATTCTTGGCCCGCCATTCCTTCAATTGTCTGCTTCTTTGATGGATAAAAGGATGACGAATGAATTGCAGTCTGTTCAATTGGTGCGGGCAATAGACAAAACTGCCATTGCGAGTGAAGATGTTGGTGGTGGGATTGAATTGCCCTCCCAAACTGAAAACTTGTCGAGACTGGAGCGTCAGGGGAGCGTGCCAGTCGTTAGCCAAGCTGAAACAGAGAGAAATGATTTGCTAAATATGAATCGGGATTCGGGTGTCAACTATGGTTCTAAACAATCGGCTATTGTTGGAAAGAACACTGAGACAGAAACAGGTGAAGGACAAGAATCAGTTGAGAAGGAGGTTGCAGTTTTGCCCGATCCAAGTCCAAGCCGACATGAAAATTTGAAGACTTCTGTATTGCCAAGATGCAAATCCCTTGCATCCCATAGACGTGTTAGAGGAGGGGCTAAAATTAGTCATCTTGAGGACTTGGAGAATGATAGTTCTTCTGGAAAATATGCTTGCCTACAGACTCCTGAAGTTGACCGGGTGCGGGAAGCGCTTAAAACCAGCTCTTCAGAGTTGCAAGCATTGGTGAAAGATCCACTTCCTGATGCGTTACGCATAGCAGAATCAGTAGCTCATGATCTCGCTGAAAAGAATAAAACTCGCGAGCATTCCTTGGAGGACCAAAATGATGCAGGTGCTGCTAATTCAGCCATGAACAAGGATGCTGTGCCTCTTCGATCTGTGAATGCAGTTCTTAAAAATCCATGTCATGGCCATCAGACAATTGTTCCTCGACCTAGCATAATGGAACGCAACAGTACAGCTTGTACGTATGAGTGGAATGATTCAATAGATGGTTCGCCAGAAGGAAATCACGCCAGTCGACTTCACCTTCCTAGTCCCAAGAGAAAGGTCATTTCTCCCTTGAAGAAGTATGAAGAAGCCAAAATGGTTCGGAGAAGGAAGTGTAAGAGGTGGAGCTTGCTTGAAGAAGACACCTTAAGAACTGCTGTGCAGAGATTTGGGAAAGGAAATTGGAAGCTCATCTTAAACAGCTATCGTGATATATTTGATGAGAGAACAGAGGTTGATCTAAAGGATAAGTGGAGAAACATGACCAGATACTAA

Protein sequence

MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEKLLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRGRVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVGGGIELPSQTENLSRLERQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTETETGEGQESVEKEVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRLHLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTEVDLKDKWRNMTRY
Homology
BLAST of Tan0000804 vs. ExPASy Swiss-Prot
Match: O55036 (Telomeric repeat-binding factor 1 (Fragment) OS=Cricetulus griseus OX=10029 GN=TERF1 PE=2 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 3.0e-06
Identity = 31/86 (36.05%), Postives = 47/86 (54.65%), Query Frame = 0

Query: 465 EGNHASRLHLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKL 524
           E   A+   +P  K + ++P K        R RK + W   E+  LR+ V+++G+GNW  
Sbjct: 353 ESRRATESRIPVSKSQPVTPEKH-------RARKRQAWLWEEDKNLRSGVRKYGEGNWSK 412

Query: 525 ILNSYRDIFDERTEVDLKDKWRNMTR 551
           IL  Y+  F+ RT V LKD+WR M +
Sbjct: 413 ILLHYK--FNNRTSVMLKDRWRTMKK 429

BLAST of Tan0000804 vs. ExPASy Swiss-Prot
Match: P54274 (Telomeric repeat-binding factor 1 OS=Homo sapiens OX=9606 GN=TERF1 PE=1 SV=3)

HSP 1 Score: 55.1 bits (131), Expect = 3.0e-06
Identity = 31/86 (36.05%), Postives = 47/86 (54.65%), Query Frame = 0

Query: 465 EGNHASRLHLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKL 524
           E   A+   +P  K + ++P K        R RK + W   E+  LR+ V+++G+GNW  
Sbjct: 353 ESRRATESRIPVSKSQPVTPEKH-------RARKRQAWLWEEDKNLRSGVRKYGEGNWSK 412

Query: 525 ILNSYRDIFDERTEVDLKDKWRNMTR 551
           IL  Y+  F+ RT V LKD+WR M +
Sbjct: 413 ILLHYK--FNNRTSVMLKDRWRTMKK 429

BLAST of Tan0000804 vs. ExPASy Swiss-Prot
Match: Q6WLH3 (Single myb histone 5 OS=Zea mays OX=4577 GN=SMH5 PE=2 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 1.5e-05
Identity = 26/51 (50.98%), Postives = 31/51 (60.78%), Query Frame = 0

Query: 500 KRWSLLEEDTLRTAVQRFGKGNWKLILN--SYRDIFDERTEVDLKDKWRNM 549
           +RW+  EE  LR  V R G GNW++ILN          R+ VDLKDKWRNM
Sbjct: 6   QRWTSEEEAALRAGVARHGVGNWRMILNDPELSSTLRYRSNVDLKDKWRNM 56

BLAST of Tan0000804 vs. ExPASy Swiss-Prot
Match: P70371 (Telomeric repeat-binding factor 1 OS=Mus musculus OX=10090 GN=Terf1 PE=1 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 1.9e-05
Identity = 25/55 (45.45%), Postives = 36/55 (65.45%), Query Frame = 0

Query: 496 RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTEVDLKDKWRNMTR 551
           RRK + W   E+  L+  V+++G+GNW  IL+ Y+  F+ RT V LKD+WR M R
Sbjct: 364 RRKRQTWLWEEDRILKCGVKKYGEGNWAKILSHYK--FNNRTSVMLKDRWRTMKR 416

BLAST of Tan0000804 vs. ExPASy Swiss-Prot
Match: Q9C7B1 (Telomere repeat-binding protein 3 OS=Arabidopsis thaliana OX=3702 GN=TRP3 PE=1 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 9.6e-05
Identity = 34/115 (29.57%), Postives = 63/115 (54.78%), Query Frame = 0

Query: 438 IVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRLHLPSPKRKVIS--PL-KKYEEAKMV 497
           ++   +I++ N     Y+ + S+D  P  +    + LP  + K ++  PL +K +  ++ 
Sbjct: 446 VIDSRNIVDSNLELVPYQGDISVD-EPSSDSKELVPLPELEVKALAIVPLNQKPKRTELA 505

Query: 498 RRRKCKRWSLLEEDTLRTAVQRFGKGNWKLI-LNSYRDIFDERTEVDLKDKWRNM 549
           +RR  + +S+ E + L  AV+  G G W+ + L ++ D  D RT VDLKDKW+ +
Sbjct: 506 QRRTRRPFSVTEVEALVQAVEELGTGRWRDVKLRAFEDA-DHRTYVDLKDKWKTL 558

BLAST of Tan0000804 vs. NCBI nr
Match: KAG6605382.1 (Telomeric repeat-binding factor 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 872.8 bits (2254), Expect = 1.5e-249
Identity = 453/559 (81.04%), Postives = 494/559 (88.37%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           MD D+CRWIIEFILRSSM+DHLLKR LAV+P PD DFRLKKT LLRAI SE SEAVVTEK
Sbjct: 1   MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           +L IFEMIEQLDKTEGLA+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRG
Sbjct: 61  VLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
           RV ELKRSG SELVSREL+ WKDEVEAAL DK+VWKKLVNMNTRYEALKLIGDYLGEAWG
Sbjct: 121 RVMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWG 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
           +LGP FL+LSASLMD R  NE+QSVQL +AIDKTA+ASEDVG  GGIELPSQTEN +R E
Sbjct: 181 VLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPE 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGSVPV+++AET+R D+L+MN+DSGVN  SK+S+ V  NTE      TET EGQES+EK
Sbjct: 241 HQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTP 360
           EV VL D SP+  +NLKTS+LPRCKSLASH+RVRGGAKI HLEDLENDSSSGK  CLQTP
Sbjct: 301 EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTP 360

Query: 361 EVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANS 420
           E +RVREALKTSS ELQALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN 
Sbjct: 361 EFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANP 420

Query: 421 AMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRL 480
           A+NKD +PL+S+N    NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRL
Sbjct: 421 AINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRL 480

Query: 481 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 540
           HLPSPKRKVISPLKKYEE ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+I
Sbjct: 481 HLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREI 540

Query: 541 FDERTEVDLKDKWRNMTRY 552
           FD+RTEVDLKDKWRNMTRY
Sbjct: 541 FDDRTEVDLKDKWRNMTRY 559

BLAST of Tan0000804 vs. NCBI nr
Match: XP_023532585.1 (uncharacterized protein LOC111794704 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 869.8 bits (2246), Expect = 1.3e-248
Identity = 449/559 (80.32%), Postives = 493/559 (88.19%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           MD D+CRWIIEFILRSSM+DHLLKR LAV+P PD DFRLKKT LLRAI SE SEAVVTEK
Sbjct: 1   MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           +L IFEMIEQLDKTEGLA+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRG
Sbjct: 61  VLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
           RV ELK+SG SELVSREL+ WKD+VE AL DK+VWKKLVNMNTRYEALKLIGDYLGEAWG
Sbjct: 121 RVMELKKSGRSELVSRELEEWKDKVEVALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWG 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
           +LGP FL+LSASLMD R  NE+ SVQL +AIDKTA+ SEDVG  GGIELPS+TEN +R +
Sbjct: 181 VLGPSFLELSASLMDNRTRNEMPSVQLEQAIDKTAVVSEDVGGSGGIELPSRTENHARPD 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGSVPV+++AET+RND+L+MN+DSGVN  SK+S+ V  NTE      TET EGQES+EK
Sbjct: 241 HQGSVPVLTRAETKRNDVLDMNQDSGVNDDSKRSSTVEMNTERVQELPTETTEGQESIEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTP 360
           EV VL DPSP+  +NLKTSVLPRCKSLASH+RVRGGAKI HLEDLENDSSSGK  CLQTP
Sbjct: 301 EVTVLQDPSPNCRKNLKTSVLPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTP 360

Query: 361 EVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANS 420
           E DRVREALKTSS ELQALVKDPLPDALRIA+SVA DLA+KNKT EHSLEDQNDA AAN 
Sbjct: 361 EFDRVREALKTSSLELQALVKDPLPDALRIAQSVAQDLAKKNKTPEHSLEDQNDAVAANP 420

Query: 421 AMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRL 480
           A+NK  +PL+S+N    NPCHGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRL
Sbjct: 421 AINKGNMPLQSMNTAFNNPCHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRL 480

Query: 481 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 540
           HLPSPKRKVISPLKKYEE ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+I
Sbjct: 481 HLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREI 540

Query: 541 FDERTEVDLKDKWRNMTRY 552
           FD+RTEVDLKDKWRNMTRY
Sbjct: 541 FDDRTEVDLKDKWRNMTRY 559

BLAST of Tan0000804 vs. NCBI nr
Match: XP_023007149.1 (uncharacterized protein LOC111499729 isoform X1 [Cucurbita maxima])

HSP 1 Score: 865.9 bits (2236), Expect = 1.8e-247
Identity = 448/559 (80.14%), Postives = 492/559 (88.01%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           MD D+CRWIIEFILRSSM+D LLKR LA++P PD DFRLKKT LLRAI SE SEA+VTEK
Sbjct: 1   MDKDVCRWIIEFILRSSMNDQLLKRTLAIMPFPDNDFRLKKTVLLRAIESERSEAIVTEK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           +L IFEMIEQLDKTEGLA+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRG
Sbjct: 61  VLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
           RV ELKRSG SELVSREL+ WKD+VEAAL DK+VWKKLVNMN+RYEALKLIGDYLGEAWG
Sbjct: 121 RVMELKRSGRSELVSRELEEWKDKVEAALWDKTVWKKLVNMNSRYEALKLIGDYLGEAWG 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
           +LGP FL+LSASLMD R  NE+QSVQL +AI KTA+ SEDVG  GGIELPSQTEN ++ E
Sbjct: 181 VLGPSFLELSASLMDNRTRNEMQSVQLEQAIHKTAVVSEDVGGSGGIELPSQTENHAKRE 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGSVPV+++AET+RND+L++N+DSGVN  SK+S+ V  NTE      TET EG+ESVEK
Sbjct: 241 HQGSVPVLTRAETKRNDVLDINQDSGVNDNSKRSSTVEMNTERVQELPTETTEGKESVEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTP 360
           EV VL DPSP+  +NLKTSVLPRCKSLASH+RVRGGAKI HLEDLENDSSSGK  CLQTP
Sbjct: 301 EVTVLQDPSPNCRKNLKTSVLPRCKSLASHKRVRGGAKIGHLEDLENDSSSGKSTCLQTP 360

Query: 361 EVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANS 420
           E DRVREA KTSS ELQALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN 
Sbjct: 361 EFDRVREAFKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANP 420

Query: 421 AMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRL 480
           A+NKD +PL+S+N    NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRL
Sbjct: 421 AINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRL 480

Query: 481 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 540
           HLPSPKRKVISPLKKYEE ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+I
Sbjct: 481 HLPSPKRKVISPLKKYEETRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREI 540

Query: 541 FDERTEVDLKDKWRNMTRY 552
           FDERTEVDLKDKWRNMTRY
Sbjct: 541 FDERTEVDLKDKWRNMTRY 559

BLAST of Tan0000804 vs. NCBI nr
Match: KAG7035336.1 (hypothetical protein SDJN02_02131, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 843.2 bits (2177), Expect = 1.3e-240
Identity = 439/546 (80.40%), Postives = 481/546 (88.10%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           MD D+CRWIIEFILRSSM+DHLLKR LAV+P PD DFRLKKT LLRAI SE SEAVVTEK
Sbjct: 1   MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           +L IFEMIEQLDKTEGLA+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRG
Sbjct: 61  VLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
           R+ ELKRSG SELVSREL+ WKDEVEAAL DK+VWKKLVNMNTRYEALKLIGDYLGEAWG
Sbjct: 121 RLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWG 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
           +LGP FL+LSASLMD R  NE+QSVQL +AIDKTA+ASEDVG  GGIELPSQTEN +R E
Sbjct: 181 VLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPE 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGSVPV+++AET+R D+L+MN+DSGVN  SK+S+ V  NTE      TET EGQES+EK
Sbjct: 241 HQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTP 360
           EV VL D SP+  +NLKTS+LPRCKSLASH+RVRGGAKI HLEDLENDSSSGK  CLQTP
Sbjct: 301 EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTP 360

Query: 361 EVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANS 420
           E +RVREALKTSS ELQALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN 
Sbjct: 361 EFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANP 420

Query: 421 AMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRL 480
           A+NKD +PL+S+N    NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRL
Sbjct: 421 AINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRL 480

Query: 481 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 539
           HLPSPKRKVISPLKKYEE ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+I
Sbjct: 481 HLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREI 540

BLAST of Tan0000804 vs. NCBI nr
Match: XP_022947187.1 (uncharacterized protein LOC111451133 [Cucurbita moschata])

HSP 1 Score: 834.3 bits (2154), Expect = 6.0e-238
Identity = 435/542 (80.26%), Postives = 474/542 (87.45%), Query Frame = 0

Query: 18  MDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEKLLEIFEMIEQLDKTEGL 77
           M+DHLLKR LAV+P PD DFRLKKT LLRAI SE SEAVVTEK+L IFEMIEQLDKTEGL
Sbjct: 1   MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGL 60

Query: 78  AIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRGRVTELKRSGNSELVSRE 137
           A+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRGRV ELKRSG SELVSRE
Sbjct: 61  AMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRGRVMELKRSGRSELVSRE 120

Query: 138 LKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGILGPPFLQLSASLMDKR 197
            + WKDEVEAAL DK VWKKLVNMNTRYEALKLIGDYLGEAWG+LGP FL+LSASLMD R
Sbjct: 121 FEEWKDEVEAALWDKIVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNR 180

Query: 198 MTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLERQGSVPVVSQAETERND 257
             NE+QSVQL +AIDKTA+ SEDVG  GGIE PS+TEN +R E QGSVPV+++AET+RND
Sbjct: 181 TRNEMQSVQLEQAIDKTAVVSEDVGGSGGIEFPSRTENHARPEHQGSVPVLTRAETKRND 240

Query: 258 LLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEKEVAVLPDPSPSRHENLK 317
           +L+MN+DSGVN  SK+S+ V  NTE      TET EGQES+EKEV VL D SP+  +NLK
Sbjct: 241 VLDMNQDSGVNDNSKRSSTVEMNTERVQELPTETTEGQESIEKEVTVLQDRSPNCRKNLK 300

Query: 318 TSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREALKTSSSELQ 377
           TS+LPRCKSLASH+RVRGGAKI HLE LENDSSSGK  CLQTPE DRVREALKTSS ELQ
Sbjct: 301 TSILPRCKSLASHKRVRGGAKIDHLEGLENDSSSGKSTCLQTPEFDRVREALKTSSLELQ 360

Query: 378 ALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVPLRSVNAVLK 437
           ALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN A+NKD +PL+S+N    
Sbjct: 361 ALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFN 420

Query: 438 NPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRLHLPSPKRKVISPLKKYE 497
           NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRLHLPSPKRKVISPLKKYE
Sbjct: 421 NPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYE 480

Query: 498 EAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTEVDLKDKWRNMT 552
           E ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTEVDLKDKWRNMT
Sbjct: 481 ENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTEVDLKDKWRNMT 540

BLAST of Tan0000804 vs. ExPASy TrEMBL
Match: A0A6J1L461 (uncharacterized protein LOC111499729 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111499729 PE=4 SV=1)

HSP 1 Score: 865.9 bits (2236), Expect = 8.9e-248
Identity = 448/559 (80.14%), Postives = 492/559 (88.01%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           MD D+CRWIIEFILRSSM+D LLKR LA++P PD DFRLKKT LLRAI SE SEA+VTEK
Sbjct: 1   MDKDVCRWIIEFILRSSMNDQLLKRTLAIMPFPDNDFRLKKTVLLRAIESERSEAIVTEK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           +L IFEMIEQLDKTEGLA+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRG
Sbjct: 61  VLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
           RV ELKRSG SELVSREL+ WKD+VEAAL DK+VWKKLVNMN+RYEALKLIGDYLGEAWG
Sbjct: 121 RVMELKRSGRSELVSRELEEWKDKVEAALWDKTVWKKLVNMNSRYEALKLIGDYLGEAWG 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
           +LGP FL+LSASLMD R  NE+QSVQL +AI KTA+ SEDVG  GGIELPSQTEN ++ E
Sbjct: 181 VLGPSFLELSASLMDNRTRNEMQSVQLEQAIHKTAVVSEDVGGSGGIELPSQTENHAKRE 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGSVPV+++AET+RND+L++N+DSGVN  SK+S+ V  NTE      TET EG+ESVEK
Sbjct: 241 HQGSVPVLTRAETKRNDVLDINQDSGVNDNSKRSSTVEMNTERVQELPTETTEGKESVEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTP 360
           EV VL DPSP+  +NLKTSVLPRCKSLASH+RVRGGAKI HLEDLENDSSSGK  CLQTP
Sbjct: 301 EVTVLQDPSPNCRKNLKTSVLPRCKSLASHKRVRGGAKIGHLEDLENDSSSGKSTCLQTP 360

Query: 361 EVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANS 420
           E DRVREA KTSS ELQALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN 
Sbjct: 361 EFDRVREAFKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANP 420

Query: 421 AMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRL 480
           A+NKD +PL+S+N    NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRL
Sbjct: 421 AINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRL 480

Query: 481 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 540
           HLPSPKRKVISPLKKYEE ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+I
Sbjct: 481 HLPSPKRKVISPLKKYEETRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREI 540

Query: 541 FDERTEVDLKDKWRNMTRY 552
           FDERTEVDLKDKWRNMTRY
Sbjct: 541 FDERTEVDLKDKWRNMTRY 559

BLAST of Tan0000804 vs. ExPASy TrEMBL
Match: A0A6J1G5R7 (uncharacterized protein LOC111451133 OS=Cucurbita moschata OX=3662 GN=LOC111451133 PE=4 SV=1)

HSP 1 Score: 834.3 bits (2154), Expect = 2.9e-238
Identity = 435/542 (80.26%), Postives = 474/542 (87.45%), Query Frame = 0

Query: 18  MDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEKLLEIFEMIEQLDKTEGL 77
           M+DHLLKR LAV+P PD DFRLKKT LLRAI SE SEAVVTEK+L IFEMIEQLDKTEGL
Sbjct: 1   MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGL 60

Query: 78  AIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRGRVTELKRSGNSELVSRE 137
           A+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRGRV ELKRSG SELVSRE
Sbjct: 61  AMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRGRVMELKRSGRSELVSRE 120

Query: 138 LKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGILGPPFLQLSASLMDKR 197
            + WKDEVEAAL DK VWKKLVNMNTRYEALKLIGDYLGEAWG+LGP FL+LSASLMD R
Sbjct: 121 FEEWKDEVEAALWDKIVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNR 180

Query: 198 MTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLERQGSVPVVSQAETERND 257
             NE+QSVQL +AIDKTA+ SEDVG  GGIE PS+TEN +R E QGSVPV+++AET+RND
Sbjct: 181 TRNEMQSVQLEQAIDKTAVVSEDVGGSGGIEFPSRTENHARPEHQGSVPVLTRAETKRND 240

Query: 258 LLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEKEVAVLPDPSPSRHENLK 317
           +L+MN+DSGVN  SK+S+ V  NTE      TET EGQES+EKEV VL D SP+  +NLK
Sbjct: 241 VLDMNQDSGVNDNSKRSSTVEMNTERVQELPTETTEGQESIEKEVTVLQDRSPNCRKNLK 300

Query: 318 TSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREALKTSSSELQ 377
           TS+LPRCKSLASH+RVRGGAKI HLE LENDSSSGK  CLQTPE DRVREALKTSS ELQ
Sbjct: 301 TSILPRCKSLASHKRVRGGAKIDHLEGLENDSSSGKSTCLQTPEFDRVREALKTSSLELQ 360

Query: 378 ALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVPLRSVNAVLK 437
           ALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN A+NKD +PL+S+N    
Sbjct: 361 ALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFN 420

Query: 438 NPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRLHLPSPKRKVISPLKKYE 497
           NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRLHLPSPKRKVISPLKKYE
Sbjct: 421 NPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYE 480

Query: 498 EAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTEVDLKDKWRNMT 552
           E ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTEVDLKDKWRNMT
Sbjct: 481 ENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTEVDLKDKWRNMT 540

BLAST of Tan0000804 vs. ExPASy TrEMBL
Match: A0A6J1L270 (uncharacterized protein LOC111499729 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111499729 PE=4 SV=1)

HSP 1 Score: 833.9 bits (2153), Expect = 3.8e-238
Identity = 433/542 (79.89%), Postives = 476/542 (87.82%), Query Frame = 0

Query: 18  MDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEKLLEIFEMIEQLDKTEGL 77
           M+D LLKR LA++P PD DFRLKKT LLRAI SE SEA+VTEK+L IFEMIEQLDKTEGL
Sbjct: 1   MNDQLLKRTLAIMPFPDNDFRLKKTVLLRAIESERSEAIVTEKVLAIFEMIEQLDKTEGL 60

Query: 78  AIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRGRVTELKRSGNSELVSRE 137
           A+M+SMK+AYCAVAVECTVKYLAV G   NGKYFD + RIWRGRV ELKRSG SELVSRE
Sbjct: 61  AMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRGRVMELKRSGRSELVSRE 120

Query: 138 LKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGILGPPFLQLSASLMDKR 197
           L+ WKD+VEAAL DK+VWKKLVNMN+RYEALKLIGDYLGEAWG+LGP FL+LSASLMD R
Sbjct: 121 LEEWKDKVEAALWDKTVWKKLVNMNSRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNR 180

Query: 198 MTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLERQGSVPVVSQAETERND 257
             NE+QSVQL +AI KTA+ SEDVG  GGIELPSQTEN ++ E QGSVPV+++AET+RND
Sbjct: 181 TRNEMQSVQLEQAIHKTAVVSEDVGGSGGIELPSQTENHAKREHQGSVPVLTRAETKRND 240

Query: 258 LLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEKEVAVLPDPSPSRHENLK 317
           +L++N+DSGVN  SK+S+ V  NTE      TET EG+ESVEKEV VL DPSP+  +NLK
Sbjct: 241 VLDINQDSGVNDNSKRSSTVEMNTERVQELPTETTEGKESVEKEVTVLQDPSPNCRKNLK 300

Query: 318 TSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREALKTSSSELQ 377
           TSVLPRCKSLASH+RVRGGAKI HLEDLENDSSSGK  CLQTPE DRVREA KTSS ELQ
Sbjct: 301 TSVLPRCKSLASHKRVRGGAKIGHLEDLENDSSSGKSTCLQTPEFDRVREAFKTSSLELQ 360

Query: 378 ALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVPLRSVNAVLK 437
           ALVKDPLPDALRIAESVA DLA+KNKT EHSLEDQNDA AAN A+NKD +PL+S+N    
Sbjct: 361 ALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFN 420

Query: 438 NPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRLHLPSPKRKVISPLKKYE 497
           NP HGHQTIVPRPSIMERNS+ACTYEWNDSIDGSPE N ASRLHLPSPKRKVISPLKKYE
Sbjct: 421 NPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYE 480

Query: 498 EAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTEVDLKDKWRNMT 552
           E ++V RRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFDERTEVDLKDKWRNMT
Sbjct: 481 ETRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDERTEVDLKDKWRNMT 540

BLAST of Tan0000804 vs. ExPASy TrEMBL
Match: A0A6J1FVK5 (uncharacterized protein LOC111448860 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448860 PE=4 SV=1)

HSP 1 Score: 773.5 bits (1996), Expect = 6.0e-220
Identity = 417/563 (74.07%), Postives = 466/563 (82.77%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           M++DICRWI EFILRSSMDDHLLKR+LAVIPL D DFRLKKT LLRAI SEISEAV+TEK
Sbjct: 1   MNEDICRWITEFILRSSMDDHLLKRVLAVIPLSDKDFRLKKTVLLRAIESEISEAVITEK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           LLEIFEMIEQL+K EGL IMESMKAAYCAVAVECTVKYL V G   +G+YFDA+RRIWRG
Sbjct: 61  LLEIFEMIEQLEKAEGLQIMESMKAAYCAVAVECTVKYLLVEGVYKHGRYFDAVRRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
           RVT+      +ELVS E K WKDEVEA+L D ++ KKLV+MNTRY+ALKLIGDYLGE+W 
Sbjct: 121 RVTK------TELVSNEFKAWKDEVEASLCDTNIRKKLVHMNTRYDALKLIGDYLGESWA 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
            +GP FLQLSASL+DK+M NE+ S+QL +  +  AI S DVG  GGIELPSQ EN  R E
Sbjct: 181 AMGPSFLQLSASLVDKKMRNEMHSIQLEQETNNLAIESVDVGGSGGIELPSQRENCVRTE 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGS  V+SQ E+ R DLL+ ++D G N GSKQSA+   NTE      TET EGQES EK
Sbjct: 241 WQGSGRVLSQPESRRTDLLHSHQDLGTNDGSKQSAVDAMNTERVQELATETAEGQESAEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDL----ENDSSSGKYAC 360
           EVAVL + S  R E LKTSVLPR KSLA HRRVRGG KISHLEDL    E++SSS +Y C
Sbjct: 301 EVAVLQNASSCR-EILKTSVLPRGKSLAFHRRVRGGVKISHLEDLEKENEDESSSERYNC 360

Query: 361 LQTPEVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAG 420
           L+TPEV+RVREALKTSS ELQA+VKDPLPDALRIAESVA DLAEKNKT E+SLED+NDAG
Sbjct: 361 LETPEVNRVREALKTSSLELQAVVKDPLPDALRIAESVAQDLAEKNKTCENSLEDRNDAG 420

Query: 421 AANSAMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNH 480
             N  +NK+AVPL+ ++A LK+P HGH+T+ PRPSIMERNSTACTYEWNDSID  PEG+ 
Sbjct: 421 VDNPTINKEAVPLQPMSANLKDPGHGHKTVFPRPSIMERNSTACTYEWNDSIDDLPEGSP 480

Query: 481 ASRLHLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNS 540
           ASRLHL SPKRK ISPLKKYEE K V RR+CK+WSLLEEDTLRTAVQRFGKGNWKLILNS
Sbjct: 481 ASRLHLHSPKRKAISPLKKYEETKAVGRRRCKKWSLLEEDTLRTAVQRFGKGNWKLILNS 540

Query: 541 YRDIFDERTEVDLKDKWRNMTRY 552
           YRDIFDERTEVDLKDKWRNMTRY
Sbjct: 541 YRDIFDERTEVDLKDKWRNMTRY 556

BLAST of Tan0000804 vs. ExPASy TrEMBL
Match: A0A6J1JF51 (uncharacterized protein LOC111484457 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484457 PE=4 SV=1)

HSP 1 Score: 770.0 bits (1987), Expect = 6.7e-219
Identity = 414/563 (73.53%), Postives = 465/563 (82.59%), Query Frame = 0

Query: 1   MDDDICRWIIEFILRSSMDDHLLKRILAVIPLPDMDFRLKKTALLRAIGSEISEAVVTEK 60
           M++DICRWI EFILRSSMDDHLLKR+LAVIPL + DFRLKKT LLRAI SEISEAV+T+K
Sbjct: 1   MNEDICRWITEFILRSSMDDHLLKRVLAVIPLSNKDFRLKKTVLLRAIESEISEAVITDK 60

Query: 61  LLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRG 120
           LLEIFEMIEQL+K EGL IMESMKAAYCAVAVECTVKYL V G   +G+YFDA+RRIWRG
Sbjct: 61  LLEIFEMIEQLEKAEGLQIMESMKAAYCAVAVECTVKYLLVEGVYKHGRYFDAVRRIWRG 120

Query: 121 RVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWG 180
            VT+      +ELVS E K WKDEVEA+L D ++ KKLV+MNTRY+ALKLIGDYLGE+W 
Sbjct: 121 GVTK------TELVSDEFKAWKDEVEASLCDANIRKKLVHMNTRYDALKLIGDYLGESWA 180

Query: 181 ILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVG--GGIELPSQTENLSRLE 240
            +GP FLQLSASL+DK+M NE+ S+QL R  +  AI S DVG  GGIELPSQ EN  + E
Sbjct: 181 AIGPSFLQLSASLVDKKMRNEMHSIQLERETNILAIESVDVGGSGGIELPSQRENCVKTE 240

Query: 241 RQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE------TETGEGQESVEK 300
            QGS  V+SQ E+ R DLL+ ++D G N GSKQSAI   NTE      TET EGQES EK
Sbjct: 241 WQGSGRVLSQPESRRTDLLHSHQDLGTNDGSKQSAIDAMNTERVQELATETAEGQESAEK 300

Query: 301 EVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDL----ENDSSSGKYAC 360
           EVAVL + S  R E LKTSVLPRCKSLA HRRV GG KISHLEDL    E++SSS +Y C
Sbjct: 301 EVAVLQNASSCR-EILKTSVLPRCKSLAFHRRVSGGVKISHLEDLEKENEDESSSERYNC 360

Query: 361 LQTPEVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAG 420
           L+TPEV+RVREALKTSS ELQA+ KDPLPDALRIAESVAHDLAEKNKT E+SLED+NDAG
Sbjct: 361 LETPEVNRVREALKTSSLELQAVAKDPLPDALRIAESVAHDLAEKNKTCENSLEDRNDAG 420

Query: 421 AANSAMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNH 480
             N  +NK+AVPL+ ++A LK+P HGH+T+ PRPSIMERNSTACTYEWNDSID  PEG+ 
Sbjct: 421 VDNPTINKEAVPLQPMSANLKDPGHGHKTVFPRPSIMERNSTACTYEWNDSIDDLPEGSP 480

Query: 481 ASRLHLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNS 540
           ASRLHL SPKRK +SPLKKYEE K V RR+CK+WSLLEEDTLRTAVQRFGKGNWKLILNS
Sbjct: 481 ASRLHLHSPKRKAVSPLKKYEETKAVGRRRCKKWSLLEEDTLRTAVQRFGKGNWKLILNS 540

Query: 541 YRDIFDERTEVDLKDKWRNMTRY 552
           YRDIFDERTEVDLKDKWRNMTRY
Sbjct: 541 YRDIFDERTEVDLKDKWRNMTRY 556

BLAST of Tan0000804 vs. TAIR 10
Match: AT1G15720.1 (TRF-like 5 )

HSP 1 Score: 131.3 bits (329), Expect = 2.3e-30
Identity = 139/549 (25.32%), Postives = 214/549 (38.98%), Query Frame = 0

Query: 7   RWIIEFILRSSMDDHLLK-RILAVIPLPDMD--FRLKKTALLRAIGSEISEAVVTEKLLE 66
           +W+ EF LR  ++  +    +L+ +   D D   +LK TA+LR I + + +  V E +L+
Sbjct: 6   KWVAEFFLRRQLNPRVYAFPLLSALKPVDSDDCVKLKLTAVLRDISNSMIQGTVDEGMLD 65

Query: 67  IFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYL-AVGGTDTNGKYFDAMRRIWRGRV 126
           + E++E+L   E   IM S+K+AYC  AVECT++++  V  +D  G + DA+ RIWR R+
Sbjct: 66  LLEILEKLLLQEHSVIMGSLKSAYCWTAVECTLRFMWPVNASD--GFFGDALERIWRNRI 125

Query: 127 TELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGIL 186
             LK    S+LV+REL  W+ ++  A ++  +++K+   N RY A+  +   L E W +L
Sbjct: 126 GTLKEK-ESDLVTRELLKWESDLNKAFEEPEIYQKIRETNIRYNAISHLNQLLKEQWALL 185

Query: 187 GPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASEDVGGGIELPSQTENLSRLERQGS 246
           G   L+  A    KR      S    R            GG  E  +  E +  +E    
Sbjct: 186 GCSSLESEAR---KRFLKRKDSPYASRR-----------GGNREKANDVEEVGGVENPDG 245

Query: 247 VPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTETETGEGQESVEKEVAVLPDPSP 306
           V  V++ E E    LN                          +G+  V +E         
Sbjct: 246 VGKVNEHEQEHEPSLN--------------------------KGEMLVARE--------- 305

Query: 307 SRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREALK 366
                                                                       
Sbjct: 306 ------------------------------------------------------------ 365

Query: 367 TSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVPLR 426
                    +KD L +  R+ + +     E N   EHS++                    
Sbjct: 366 ---------LKDFLLEIQRLIDPITRQDQEPNNAMEHSVD-------------------- 386

Query: 427 SVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRL---HLPSPKR 486
                          + P+P    R          D+ D   EG  +SR    HLP+P+ 
Sbjct: 426 ---------------VTPQPDGANR---------TDAEDS--EGTSSSRRVRPHLPTPEP 386

Query: 487 KVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTEV 546
             +SPLKK    +   RR  K W+  E   LR  V+ +GK +WK I NSY  +F +R+EV
Sbjct: 486 LNVSPLKKGRLERPRPRRPMKFWTSEEVAALREGVKEYGK-SWKDIKNSYPVVFADRSEV 386

Query: 547 DLKDKWRNM 549
           DLKDKWRN+
Sbjct: 546 DLKDKWRNL 386

BLAST of Tan0000804 vs. TAIR 10
Match: AT5G58340.1 (myb-like HTH transcriptional regulator family protein )

HSP 1 Score: 126.7 bits (317), Expect = 5.7e-29
Identity = 137/556 (24.64%), Postives = 230/556 (41.37%), Query Frame = 0

Query: 5   ICRWIIE-FILRSSMDDHLLKRILAVIPLPDMD--FRLKKTALLRAIGSEISEAVVTEKL 64
           I +W+ E F+LR          +++ + L D     +LK +++LR I + +    + E +
Sbjct: 4   IDQWVAEFFLLRQHNPRASPINLISALKLGDSSDCIKLKISSVLRDISNSLIRGTIDEGM 63

Query: 65  LEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRGR 124
           L++ E++E+L   +   +M+S K+AYC  A ECT++++      ++G + DA+ RIW  R
Sbjct: 64  LDLLEILEKLLLQQHSLLMDSHKSAYCWTATECTLRFMWPMFA-SDGLFTDALERIWTKR 123

Query: 125 VTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGI 184
           +  LK SG S+LV+ +L  W+ +++ AL D  +++++   N RY A+  +   L E W +
Sbjct: 124 IGILKESG-SDLVTCDLLKWESDLKKALGDPELYQRIRETNIRYTAISFLTQLLKEQWAL 183

Query: 185 LGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASE-DVGGGIELPSQTENLSRLERQ 244
           LG               ++ L+SV   R + + A+  E DV       S  +  +R    
Sbjct: 184 LG---------------SSSLESVAQRRFLKRKAVNVEGDVVDNRGDQSDVDESTRRFGS 243

Query: 245 GSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTETETGEGQESVEKEVAVLPDP 304
            ++ + ++A  ER D   + RD+                    GEG E +E +     + 
Sbjct: 244 DTIDIANEARGEREDGNGIGRDN-----------------ANDGEGMECLENDGIDNVNA 303

Query: 305 SPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREA 364
           +   H                             +D E++           P +D+  E 
Sbjct: 304 ADEEH-------------------------TVSAQDQEHE-----------PSLDKGDEM 363

Query: 365 LKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVP 424
                 E    ++  +  + R  E       E N   +HS++                  
Sbjct: 364 AARELKEYLVEIQGHIDPSTRQGE-------EPNSAIDHSVD------------------ 423

Query: 425 LRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHAS--------RL 484
                            + P P+ + R  T C  + N++ D   E    S        R 
Sbjct: 424 -----------------VTPPPTRVNRTGTGC-QDHNEASDNVNEKGSDSQETWSSRVRP 445

Query: 485 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 544
             P+P    +SPLKK   AK   RR  K W   E + LR  V+ +GK +WK I N    +
Sbjct: 484 RRPTPVTLSVSPLKKGGLAKPHVRRPKKFWKPEEVEALREGVKEYGK-SWKDIKNGNPTV 445

Query: 545 FDERTEVDLKDKWRNM 549
           F ERTEVDLKDKWRN+
Sbjct: 544 FAERTEVDLKDKWRNL 445

BLAST of Tan0000804 vs. TAIR 10
Match: AT5G58340.2 (myb-like HTH transcriptional regulator family protein )

HSP 1 Score: 126.7 bits (317), Expect = 5.7e-29
Identity = 137/556 (24.64%), Postives = 230/556 (41.37%), Query Frame = 0

Query: 5   ICRWIIE-FILRSSMDDHLLKRILAVIPLPDMD--FRLKKTALLRAIGSEISEAVVTEKL 64
           I +W+ E F+LR          +++ + L D     +LK +++LR I + +    + E +
Sbjct: 4   IDQWVAEFFLLRQHNPRASPINLISALKLGDSSDCIKLKISSVLRDISNSLIRGTIDEGM 63

Query: 65  LEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLAVGGTDTNGKYFDAMRRIWRGR 124
           L++ E++E+L   +   +M+S K+AYC  A ECT++++      ++G + DA+ RIW  R
Sbjct: 64  LDLLEILEKLLLQQHSLLMDSHKSAYCWTATECTLRFMWPMFA-SDGLFTDALERIWTKR 123

Query: 125 VTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVNMNTRYEALKLIGDYLGEAWGI 184
           +  LK SG S+LV+ +L  W+ +++ AL D  +++++   N RY A+  +   L E W +
Sbjct: 124 IGILKESG-SDLVTCDLLKWESDLKKALGDPELYQRIRETNIRYTAISFLTQLLKEQWAL 183

Query: 185 LGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASE-DVGGGIELPSQTENLSRLERQ 244
           LG               ++ L+SV   R + + A+  E DV       S  +  +R    
Sbjct: 184 LG---------------SSSLESVAQRRFLKRKAVNVEGDVVDNRGDQSDVDESTRRFGS 243

Query: 245 GSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTETETGEGQESVEKEVAVLPDP 304
            ++ + ++A  ER D   + RD+                    GEG E +E +     + 
Sbjct: 244 DTIDIANEARGEREDGNGIGRDN-----------------ANDGEGMECLENDGIDNVNA 303

Query: 305 SPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLENDSSSGKYACLQTPEVDRVREA 364
           +   H                             +D E++           P +D+  E 
Sbjct: 304 ADEEH-------------------------TVSAQDQEHE-----------PSLDKGDEM 363

Query: 365 LKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHSLEDQNDAGAANSAMNKDAVP 424
                 E    ++  +  + R  E       E N   +HS++                  
Sbjct: 364 AARELKEYLVEIQGHIDPSTRQGE-------EPNSAIDHSVD------------------ 423

Query: 425 LRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSIDGSPEGNHAS--------RL 484
                            + P P+ + R  T C  + N++ D   E    S        R 
Sbjct: 424 -----------------VTPPPTRVNRTGTGC-QDHNEASDNVNEKGSDSQETWSSRVRP 445

Query: 485 HLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDI 544
             P+P    +SPLKK   AK   RR  K W   E + LR  V+ +GK +WK I N    +
Sbjct: 484 RRPTPVTLSVSPLKKGGLAKPHVRRPKKFWKPEEVEALREGVKEYGK-SWKDIKNGNPTV 445

Query: 545 FDERTEVDLKDKWRNM 549
           F ERTEVDLKDKWRN+
Sbjct: 544 FAERTEVDLKDKWRNL 445

BLAST of Tan0000804 vs. TAIR 10
Match: AT1G06910.1 (TRF-like 7 )

HSP 1 Score: 91.7 bits (226), Expect = 2.0e-18
Identity = 135/508 (26.57%), Postives = 194/508 (38.19%), Query Frame = 0

Query: 41  KTALLRAIGSEISEAVVTEKLLEIFEMIEQLDKTEGLAIMESMKAAYCAVAVECTVKYLA 100
           K A+L  I  EI + +V EK LE  E + ++   EG  + +S+  AYC VAVECTVK LA
Sbjct: 7   KEAILEGIFMEIEDGIVEEKNLERLENLVEILHKEGSKVPKSVTEAYCKVAVECTVKCLA 66

Query: 101 VGGTDTNGKYFDAMRRIWRGRVTELKRSGNSELVSRELKGWKDEVEAALQDKSVWKKLVN 160
               D    Y +A++ IW GR+  L     S LV+ +L      +  A  D    K L++
Sbjct: 67  Y-EKDAKKAYTEAIKTIWLGRIMPL-CDKVSCLVTLDLLKCCRRLWKAHTDDKACKTLMD 126

Query: 161 MNTRYEALKLIGDYLGEAWGILGPPFLQLSASLMDKRMTNELQSVQLVRAIDKTAIASED 220
            +TR +AL  +   +           L L+ +L+                          
Sbjct: 127 EDTRDKALVCLRKVV-----------LDLNPNLV-------------------------- 186

Query: 221 VGGGIELPSQTENLSRLERQGSVPVVSQAETERNDLLNMNRDSGVNYGSKQSAIVGKNTE 280
                        L  L    S  + S  E+E  + +   R+   N  S+ S       E
Sbjct: 187 -------------LENLNMDESDEIESSEESEETESMVEAREGVGNQNSQAS-------E 246

Query: 281 TETGEGQESVEKEVAVLPDPSPSRHENLKTSVLPRCKSLASHRRVRGGAKISHLEDLEND 340
               + QES+                 L T +          R   GG+K  ++    N 
Sbjct: 247 AMEEDDQESL-----------------LDTEL---------ERPTSGGSKAVYVPSQFNP 306

Query: 341 SSSGKYACLQTPEVDRVREALKTSSSELQALVKDPLPDALRIAESVAHDLAEKNKTREHS 400
                   + +  VDR    L+ S  EL   ++   P                N   E  
Sbjct: 307 --------IPSAVVDRALRKLRASKIELMKALEKGRP---------------SNLNNETI 366

Query: 401 LEDQNDAGAANSAMNKDAVPLRSVNAVLKNPCHGHQTIVPRPSIMERNSTACTYEWNDSI 460
            E +ND  A  SA N                        PRPS+ME  STA TYEWNDSI
Sbjct: 367 TEQENDV-ANPSATN----------------------AAPRPSLMEPRSTAHTYEWNDSI 380

Query: 461 DGS--PEGNHASRLHLPSPKRKVISPLKKYEEAKMVRRRKCKRWSLLEEDTLRTAVQRFG 520
           D S    G+   R++    KR V+SPLK+   ++  RR K   WS  E   +    +++G
Sbjct: 427 DDSDGEMGDDIERINKSKRKRIVVSPLKRNRCSEGARRPKLP-WSTAETLAVLKGYEKYG 380

Query: 521 KGNWKLILNSYRDIFDERTEVDLKDKWR 547
             NWK I +    +   RT  D+KDK+R
Sbjct: 487 -ANWKRIKDE-NPVLVRRTNGDIKDKFR 380

BLAST of Tan0000804 vs. TAIR 10
Match: AT3G12560.1 (TRF-like 9 )

HSP 1 Score: 50.1 bits (118), Expect = 6.8e-06
Identity = 34/115 (29.57%), Postives = 63/115 (54.78%), Query Frame = 0

Query: 438 IVPRPSIMERNSTACTYEWNDSIDGSPEGNHASRLHLPSPKRKVIS--PL-KKYEEAKMV 497
           ++   +I++ N     Y+ + S+D  P  +    + LP  + K ++  PL +K +  ++ 
Sbjct: 446 VIDSRNIVDSNLELVPYQGDISVD-EPSSDSKELVPLPELEVKALAIVPLNQKPKRTELA 505

Query: 498 RRRKCKRWSLLEEDTLRTAVQRFGKGNWKLI-LNSYRDIFDERTEVDLKDKWRNM 549
           +RR  + +S+ E + L  AV+  G G W+ + L ++ D  D RT VDLKDKW+ +
Sbjct: 506 QRRTRRPFSVTEVEALVQAVEELGTGRWRDVKLRAFEDA-DHRTYVDLKDKWKTL 558

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O550363.0e-0636.05Telomeric repeat-binding factor 1 (Fragment) OS=Cricetulus griseus OX=10029 GN=T... [more]
P542743.0e-0636.05Telomeric repeat-binding factor 1 OS=Homo sapiens OX=9606 GN=TERF1 PE=1 SV=3[more]
Q6WLH31.5e-0550.98Single myb histone 5 OS=Zea mays OX=4577 GN=SMH5 PE=2 SV=1[more]
P703711.9e-0545.45Telomeric repeat-binding factor 1 OS=Mus musculus OX=10090 GN=Terf1 PE=1 SV=1[more]
Q9C7B19.6e-0529.57Telomere repeat-binding protein 3 OS=Arabidopsis thaliana OX=3702 GN=TRP3 PE=1 S... [more]
Match NameE-valueIdentityDescription
KAG6605382.11.5e-24981.04Telomeric repeat-binding factor 1, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023532585.11.3e-24880.32uncharacterized protein LOC111794704 [Cucurbita pepo subsp. pepo][more]
XP_023007149.11.8e-24780.14uncharacterized protein LOC111499729 isoform X1 [Cucurbita maxima][more]
KAG7035336.11.3e-24080.40hypothetical protein SDJN02_02131, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022947187.16.0e-23880.26uncharacterized protein LOC111451133 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1L4618.9e-24880.14uncharacterized protein LOC111499729 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1G5R72.9e-23880.26uncharacterized protein LOC111451133 OS=Cucurbita moschata OX=3662 GN=LOC1114511... [more]
A0A6J1L2703.8e-23879.89uncharacterized protein LOC111499729 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1FVK56.0e-22074.07uncharacterized protein LOC111448860 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JF516.7e-21973.53uncharacterized protein LOC111484457 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT1G15720.12.3e-3025.32TRF-like 5 [more]
AT5G58340.15.7e-2924.64myb-like HTH transcriptional regulator family protein [more]
AT5G58340.25.7e-2924.64myb-like HTH transcriptional regulator family protein [more]
AT1G06910.12.0e-1826.57TRF-like 7 [more]
AT3G12560.16.8e-0629.57TRF-like 9 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001005SANT/Myb domainSMARTSM00717santcoord: 498..551
e-value: 3.7E-7
score: 39.8
IPR001005SANT/Myb domainPROSITEPS50090MYB_LIKEcoord: 501..549
score: 7.56108
IPR017930Myb domainPFAMPF00249Myb_DNA-bindingcoord: 500..548
e-value: 1.8E-7
score: 31.2
IPR017930Myb domainPROSITEPS51294HTH_MYBcoord: 493..551
score: 10.993067
NoneNo IPR availableGENE3D1.10.246.220coord: 502..551
e-value: 1.1E-16
score: 62.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 391..414
NoneNo IPR availablePANTHERPTHR46993:SF6MYB TRANSCRIPTION FACTORcoord: 1..551
NoneNo IPR availablePANTHERPTHR46993MYB TRANSCRIPTION FACTORcoord: 1..551
NoneNo IPR availableCDDcd11660SANT_TRFcoord: 500..550
e-value: 1.52228E-17
score: 74.5259
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 496..550

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0000804.1Tan0000804.1mRNA