Cp4.1LG10g04330 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG10g04330
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionSulfite exporter TauE/SafE family protein
LocationCp4.1LG10 : 1540837 .. 1543678 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTATTCGCCACGTGTTTTCTAAATTTGCAATGGAATTTAATTGCTATTTTCGAATTTGAATATTTTTCTATTTATTTGAACCACAGCCACCACCACAATCCGCAGAATCCTGAAACCACACCCGCCGGAGACCAACATCGGCGGGATATATATTTTTCTTCCGGCTTCTCTTTTTCTGCCATGGCTACTTCTGGGTTTCTTCTTTACCTACTTTCTGCCTCTTCCATCGCCATTCTCTCTGTTCTGTACCTCTCTAGCCCTTCTTCTTCTCCTTCTTCCTCCTTCGTCCTATCCGATTCTCTCTCCACCGACAAAATATGGCCGGTTCGTTCTTCCTTCTTTCCTTTTCCGTTTCCGTCAATGTCCTCGTTCTTAATTTTTCTTTTTTGGTGAAGGATTTGGAGCTGAGTTGGAGGCTTGTGGTGGCGACGGTGATTGGATTTCTCGGCTCCGCCTGCGGAACCGTCGGAGGAGTTGGTGGCGGCGGCATTTTTGTCCCTATGCTTACTTTGATTATTGGGTTCGACACCAAATCTGCTGCTGCCATTTCCAAATGTATTGCGATTATTCTATTCAACCCAAAACCACCCCCAACACATATATATCTAAATTGAAGAAAAAATTTCAAATGGGGTGTGTTTATTTTGTTGCAGGTATGATAATGGGGGCTTCCTCCTCGTCGGTTTGGTACAATTTGAGAGTGGCGCATCCAACGAAGGATGTGCCGATTATAGACCATGATTTGGCGTTGCTGTTTCAGCCCATGTTGATGCTTGGAATCACCGTTGGTGTTTCCCTTAGCGTTGTTTTCCCCTATTGGCTCATCACTGTTCTCATCATTATCCTCTTTATTGGTCTGCCCTTTTCCCCTCCTCTTCGTTCCTTTTCCAATTCTCCAAATTCTGCACAATTCTGAGTTTATTTTGTGGCGGTTTGTGATTCAGGGACTTCCTCCAGGTCTTTCTTCAAGGGAATTGAGATGTGGAAAGAAGAAACCATTCTAAAGGTCGTTTTCTAATTCATTCTCTTAACCAAATTCTTAGTTTTGTTCAGAGATTCATAATTTGGTTTTCCTTTCCCCAGAAAGAATTCGCCAAACAAAGTGAGACTTTTGTGAATTCTCGTGGCGAATGTAAGTAGAACATGTTTGGATTTGTTATTAACTATGTTGAAATAGGATAAAGGGTTGATATTTTGTGTTCTTGGTGGGTTTGCAGTATTGATTGATGTAGAGTATGACCCACTGATACCCAAAGAGCAGAAGACAGCACTGGTGGGGCTCAAACACCACCAATAATTTCTTTGATTCATTGTTTCTGCAAGTTGAAGATTTGAATAAGCTACCATAATTTTGTGTCTGTTTCAGGAATTAATGTGCTTCAATCTTAGGTGGAAAAGGACCTCCATTTTGTTGGCTGTCTGGATTTCTTTCCTCATACTTCAAGTTGTTAAGGCTAGTTCCTTCTCTTTCTTTCTATATCTCTCTCTCAAAGTTCATCTTTTGATGTATATTCTTGACTGAACAATGATGCAGAATGATGTAGCAGTTTGTAGCCCTTGGTATTGGATCCTCTTCTCCTTGCAGGTTTGTGGATTTCATTTTCAAATTCATGATTATGAACTGACTAATAAAAGTTCATGAATTTTACCAAAAGCTTGATGATTATGAACTGAACTGGATTGTTTGTGGGAGCTGTATCAGTTTCCTGTAGCATTTGCAGTGTTTGGATATGAAGCAAGAAAGCTATACAATGAACATAAGAAGAGGATGGAGGCAGGAAACTTGGAACACATTTGTGAGGCTTCAATAGGATGGACTGGCACACACCTTGCTTTCTGTGCACTTTGTGGCATTGTAGGAGGCACTGTTGGAGGCCTTTTGGGTTCTGGTGGTGGCTTTGTTTTAGGCCCTTTGCTCTTGGAGATTGGCGTTGTCCCTCAGGTTGCTAGTGCTACTGCCACTTTTGTTATGATGTTCTCCTCATCCTTGTCTGTTGTTGAATTCTACTTGCTCCACAGATTCCCCATTCCATTTGGTGGGTTCCCTTTTTCATTCTCTCATTATTCATCTCCTATCATCAAATCATCCCTAACCTCTTCTTTCCGACTTCGTTTGCAGCTCTGTATCTTACGTCGGTGTCGGTTTTAGCCGGGTTCTGGGGGCAGTTCTTTGTTAGAAAAATGATCGCCATTCTGAGACGAGCCTCACTTATTGTGTTTGTACTCTCTGGTGTCATCTTTGCCAGTGCCATCACCATGGGTACGTAACCAAACGTTTTGAAAGTTCAATGACTAAACTTTGAGGAGCCAACTCGTGGGTTCGCTTTGTTTTGCAGGCATTGTAGGAGTATCGAAAAGTATAACGATGATACATAATCATGAGTTCATGGGGTTCTTGGATTTCTGCAGCAGCCAGTGAGAGATTCATCACACAAACAGAAAAGTGTAAGAATAGAAAGAAGATTGGTGGAGGAAAAGAGGGTTTGAATTTGTTTAAGCATAGAATTGGCCAACGCCTTGTTGACCAATTCTCAATCCCATGTTGCCTGTGAAGAAAAATAATGAGGTGATTGATGACTGTTAGCTCCTCGCTGACAGCATCATCGTACCTTCCAAAGTTATTTGAATGTCTCAAAATTGTATTGTAGTGTTGTGGTTGGCTGTTATGTTCTGTTAGCACCTCGCTGACAGCATCATCTGTTAATTGAACTTTTTATCAGTTAATGTTGTGTTCGAATATTGTTCTCCTTTTTTATTATTGTTACTATCATCATCATCCTTCCTCTATCGCCCCTGCATTCAGATGTACACAAACTTTATAAGAGAGATTTTATAGTTGGAA

mRNA sequence

GTATTCGCCACGTGTTTTCTAAATTTGCAATGGAATTTAATTGCTATTTTCGAATTTGAATATTTTTCTATTTATTTGAACCACAGCCACCACCACAATCCGCAGAATCCTGAAACCACACCCGCCGGAGACCAACATCGGCGGGATATATATTTTTCTTCCGGCTTCTCTTTTTCTGCCATGGCTACTTCTGGGTTTCTTCTTTACCTACTTTCTGCCTCTTCCATCGCCATTCTCTCTGTTCTGTACCTCTCTAGCCCTTCTTCTTCTCCTTCTTCCTCCTTCGTCCTATCCGATTCTCTCTCCACCGACAAAATATGGCCGGATTTGGAGCTGAGTTGGAGGCTTGTGGTGGCGACGGTGATTGGATTTCTCGGCTCCGCCTGCGGAACCGTCGGAGGAGTTGGTGGCGGCGGCATTTTTGTCCCTATGCTTACTTTGATTATTGGGTTCGACACCAAATCTGCTGCTGCCATTTCCAAATGTATGATAATGGGGGCTTCCTCCTCGTCGGTTTGGTACAATTTGAGAGTGGCGCATCCAACGAAGGATGTGCCGATTATAGACCATGATTTGGCGTTGCTGTTTCAGCCCATGTTGATGCTTGGAATCACCGTTGGTGTTTCCCTTAGCGTTGTTTTCCCCTATTGGCTCATCACTGTTCTCATCATTATCCTCTTTATTGGGACTTCCTCCAGGTCTTTCTTCAAGGGAATTGAGATGTGGAAAGAAGAAACCATTCTAAAGAAAGAATTCGCCAAACAAAGTGAGACTTTTGTGAATTCTCGTGGCGAATTATTGATTGATGTAGAGTATGACCCACTGATACCCAAAGAGCAGAAGACAGCACTGGAATTAATGTGCTTCAATCTTAGGTGGAAAAGGACCTCCATTTTGTTGGCTGTCTGGATTTCTTTCCTCATACTTCAAAATGATGTAGCAGTTTGTAGCCCTTGGTATTGGATCCTCTTCTCCTTGCAGTTTCCTGTAGCATTTGCAGTGTTTGGATATGAAGCAAGAAAGCTATACAATGAACATAAGAAGAGGATGGAGGCAGGAAACTTGGAACACATTTGTGAGGCTTCAATAGGATGGACTGGCACACACCTTGCTTTCTGTGCACTTTGTGGCATTGTAGGAGGCACTGTTGGAGGCCTTTTGGGTTCTGGTGGTGGCTTTGTTTTAGGCCCTTTGCTCTTGGAGATTGGCGTTGTCCCTCAGGTTGCTAGTGCTACTGCCACTTTTGTTATGATGTTCTCCTCATCCTTGTCTGTTGTTGAATTCTACTTGCTCCACAGATTCCCCATTCCATTTGCTCTGTATCTTACGTCGGTGTCGGTTTTAGCCGGGTTCTGGGGGCAGTTCTTTGTTAGAAAAATGATCGCCATTCTGAGACGAGCCTCACTTATTGTGTTTGTACTCTCTGGTGTCATCTTTGCCAGTGCCATCACCATGGGCATTGTAGGAGTATCGAAAAGTATAACGATGATACATAATCATGAGTTCATGGGGTTCTTGGATTTCTGCAGCAGCCAGTGAGAGATTCATCACACAAACAGAAAAGTGTAAGAATAGAAAGAAGATTGGTGGAGGAAAAGAGGGTTTGAATTTGTTTAAGCATAGAATTGGCCAACGCCTTGTTGACCAATTCTCAATCCCATGTTGCCTGTGAAGAAAAATAATGAGGTGATTGATGACTGTTAGCTCCTCGCTGACAGCATCATCGTACCTTCCAAAGTTATTTGAATGTCTCAAAATTGTATTGTAGTGTTGTGGTTGGCTGTTATGTTCTGTTAGCACCTCGCTGACAGCATCATCTGTTAATTGAACTTTTTATCAGTTAATGTTGTGTTCGAATATTGTTCTCCTTTTTTATTATTGTTACTATCATCATCATCCTTCCTCTATCGCCCCTGCATTCAGATGTACACAAACTTTATAAGAGAGATTTTATAGTTGGAA

Coding sequence (CDS)

GTATTCGCCACGTGTTTTCTAAATTTGCAATGGAATTTAATTGCTATTTTCGAATTTGAATATTTTTCTATTTATTTGAACCACAGCCACCACCACAATCCGCAGAATCCTGAAACCACACCCGCCGGAGACCAACATCGGCGGGATATATATTTTTCTTCCGGCTTCTCTTTTTCTGCCATGGCTACTTCTGGGTTTCTTCTTTACCTACTTTCTGCCTCTTCCATCGCCATTCTCTCTGTTCTGTACCTCTCTAGCCCTTCTTCTTCTCCTTCTTCCTCCTTCGTCCTATCCGATTCTCTCTCCACCGACAAAATATGGCCGGATTTGGAGCTGAGTTGGAGGCTTGTGGTGGCGACGGTGATTGGATTTCTCGGCTCCGCCTGCGGAACCGTCGGAGGAGTTGGTGGCGGCGGCATTTTTGTCCCTATGCTTACTTTGATTATTGGGTTCGACACCAAATCTGCTGCTGCCATTTCCAAATGTATGATAATGGGGGCTTCCTCCTCGTCGGTTTGGTACAATTTGAGAGTGGCGCATCCAACGAAGGATGTGCCGATTATAGACCATGATTTGGCGTTGCTGTTTCAGCCCATGTTGATGCTTGGAATCACCGTTGGTGTTTCCCTTAGCGTTGTTTTCCCCTATTGGCTCATCACTGTTCTCATCATTATCCTCTTTATTGGGACTTCCTCCAGGTCTTTCTTCAAGGGAATTGAGATGTGGAAAGAAGAAACCATTCTAAAGAAAGAATTCGCCAAACAAAGTGAGACTTTTGTGAATTCTCGTGGCGAATTATTGATTGATGTAGAGTATGACCCACTGATACCCAAAGAGCAGAAGACAGCACTGGAATTAATGTGCTTCAATCTTAGGTGGAAAAGGACCTCCATTTTGTTGGCTGTCTGGATTTCTTTCCTCATACTTCAAAATGATGTAGCAGTTTGTAGCCCTTGGTATTGGATCCTCTTCTCCTTGCAGTTTCCTGTAGCATTTGCAGTGTTTGGATATGAAGCAAGAAAGCTATACAATGAACATAAGAAGAGGATGGAGGCAGGAAACTTGGAACACATTTGTGAGGCTTCAATAGGATGGACTGGCACACACCTTGCTTTCTGTGCACTTTGTGGCATTGTAGGAGGCACTGTTGGAGGCCTTTTGGGTTCTGGTGGTGGCTTTGTTTTAGGCCCTTTGCTCTTGGAGATTGGCGTTGTCCCTCAGGTTGCTAGTGCTACTGCCACTTTTGTTATGATGTTCTCCTCATCCTTGTCTGTTGTTGAATTCTACTTGCTCCACAGATTCCCCATTCCATTTGCTCTGTATCTTACGTCGGTGTCGGTTTTAGCCGGGTTCTGGGGGCAGTTCTTTGTTAGAAAAATGATCGCCATTCTGAGACGAGCCTCACTTATTGTGTTTGTACTCTCTGGTGTCATCTTTGCCAGTGCCATCACCATGGGCATTGTAGGAGTATCGAAAAGTATAACGATGATACATAATCATGAGTTCATGGGGTTCTTGGATTTCTGCAGCAGCCAGTGA

Protein sequence

VFATCFLNLQWNLIAIFEFEYFSIYLNHSHHHNPQNPETTPAGDQHRRDIYFSSGFSFSAMATSGFLLYLLSASSIAILSVLYLSSPSSSPSSSFVLSDSLSTDKIWPDLELSWRLVVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSILLAVWISFLILQNDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLSGVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ
BLAST of Cp4.1LG10g04330 vs. TrEMBL
Match: A0A0A0KII9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G449290 PE=4 SV=1)

HSP 1 Score: 780.4 bits (2014), Expect = 1.3e-222
Identity = 414/455 (90.99%), Postives = 429/455 (94.29%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLS-SPSSSPSSSFVLSDSLSTDKIWPDLELSWRLVVA 120
           MATSGFLLYLLSASSIA+LS+LYLS S SSS SS+  LS SLSTDK WPDLE SWRLV A
Sbjct: 1   MATSGFLLYLLSASSIAVLSLLYLSDSSSSSSSSTTALSASLSTDKTWPDLEPSWRLVAA 60

Query: 121 TVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRVA 180
           TVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGAS+SSVWYNLRVA
Sbjct: 61  TVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASTSSVWYNLRVA 120

Query: 181 HPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGI 240
           HPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLIT+LIIILFIGTSSRSFFKGI
Sbjct: 121 HPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITILIIILFIGTSSRSFFKGI 180

Query: 241 EMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSIL 300
           EMWKEETILKKEFAK+ ET VNS GELLIDVEYDPLIPKEQKT LELMCFNLRWKRTSIL
Sbjct: 181 EMWKEETILKKEFAKRCETVVNSHGELLIDVEYDPLIPKEQKTELELMCFNLRWKRTSIL 240

Query: 301 LAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNLE 360
            AVWISFLILQ   NDVA CS WYW++F LQFP+A  VFGYEARKLY EHKKRMEAGNLE
Sbjct: 241 FAVWISFLILQVVKNDVAACSIWYWVVFFLQFPIAIVVFGYEARKLYKEHKKRMEAGNLE 300

Query: 361 HICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFV 420
            ICEASIGWTG+HLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFV
Sbjct: 301 QICEASIGWTGSHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFV 360

Query: 421 MMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLSG 480
           MMFSSSLSVVEFYLL+RFPIP+ALYLTSVSVLAGFWGQFFVRK+I ILRRASLIVF+LSG
Sbjct: 361 MMFSSSLSVVEFYLLNRFPIPYALYLTSVSVLAGFWGQFFVRKLITILRRASLIVFILSG 420

Query: 481 VIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSS 512
           VIFASAITMGIVGV+KSITMI NHEFMGFLDFCSS
Sbjct: 421 VIFASAITMGIVGVTKSITMIQNHEFMGFLDFCSS 455

BLAST of Cp4.1LG10g04330 vs. TrEMBL
Match: A0A061F0N0_THECC (Sulfite exporter TauE/SafE family protein isoform 1 OS=Theobroma cacao GN=TCM_022388 PE=4 SV=1)

HSP 1 Score: 682.6 bits (1760), Expect = 3.8e-193
Identity = 350/457 (76.59%), Postives = 399/457 (87.31%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSPSSSPSSSFVLSDS--LSTDKIWPDLELSWRLVV 120
           MAT GFLLYLLS  S+AILSVL+++  ++   +S +L      S DK WP+LEL+WRLV+
Sbjct: 1   MATRGFLLYLLSGFSVAILSVLFINKNNNMYHNSTLLHSPNVSSVDKDWPELELNWRLVL 60

Query: 121 ATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRV 180
           ATVIGFLGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAAISKCMIMGAS+SSVWYNLRV
Sbjct: 61  ATVIGFLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAAISKCMIMGASASSVWYNLRV 120

Query: 181 AHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKG 240
            HPTK+VPIID+DLALLFQPMLMLGITVGV+LSVVFPYWLITVLIIILF+ TSSRSF+K 
Sbjct: 121 PHPTKEVPIIDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIIILFLSTSSRSFYKA 180

Query: 241 IEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSI 300
            EMWKEETILKKE  +Q ET VNSRGELLID EY+PL+P+E+K+ L+++CFNLRWKR  I
Sbjct: 181 TEMWKEETILKKELTRQQETLVNSRGELLIDAEYEPLVPREEKSELQILCFNLRWKRLLI 240

Query: 301 LLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNL 360
           L  VW+ F ++Q   ND+ VCS WYW+LF LQ P+A  VFGYEA KLY EHKKRM  GN 
Sbjct: 241 LATVWVLFTLIQVIKNDLVVCSTWYWVLFCLQLPIAVLVFGYEATKLYKEHKKRMSTGNR 300

Query: 361 EHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATF 420
           E ICEASI W+  ++AFCALCGI+GGTVGGLLGSGGGF+LGPLLLEIGV+PQVASATATF
Sbjct: 301 EAICEASIQWSPLNIAFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATATF 360

Query: 421 VMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLS 480
           VMMFSSSLSVVEFYLL RFPIP+ALYL  VS+LAGFWGQ+FVRK+I IL+RASLIVF+LS
Sbjct: 361 VMMFSSSLSVVEFYLLKRFPIPYALYLMGVSILAGFWGQYFVRKLITILKRASLIVFILS 420

Query: 481 GVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           GVIFASA+TMG++G+  SI MIHNHEFMGFLDFCSSQ
Sbjct: 421 GVIFASALTMGVIGIDTSIQMIHNHEFMGFLDFCSSQ 457

BLAST of Cp4.1LG10g04330 vs. TrEMBL
Match: B9MTM3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s11920g PE=4 SV=1)

HSP 1 Score: 681.8 bits (1758), Expect = 6.5e-193
Identity = 358/460 (77.83%), Postives = 396/460 (86.09%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSP----SSSPSSSFVLSDSLST-DKIWPDLELSWR 120
           MAT G +LYLLS  S+AILSV +LS P    S +P+S    S  LST DK+WP LE SWR
Sbjct: 1   MATRGLVLYLLSGFSVAILSVFFLSHPNEKASPNPNSDIFASPYLSTTDKVWPKLEFSWR 60

Query: 121 LVVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYN 180
            V+ATVIG LGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAA+SKCMIM AS+SSVWYN
Sbjct: 61  TVLATVIGLLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAALSKCMIMAASASSVWYN 120

Query: 181 LRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSF 240
           LRV HPT++VPIID+DLALLFQPML+LGIT+GVSLSVVFPYWLITVLIIILFIGTSSRSF
Sbjct: 121 LRVPHPTREVPIIDYDLALLFQPMLLLGITLGVSLSVVFPYWLITVLIIILFIGTSSRSF 180

Query: 241 FKGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKR 300
           FKGIEMWKEETILKKE   Q ET VNSRGELLID EY+PLIP+E+K+ ++++CFNL+WKR
Sbjct: 181 FKGIEMWKEETILKKEMVIQQETIVNSRGELLIDTEYEPLIPREEKSKMQILCFNLKWKR 240

Query: 301 TSILLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEA 360
             IL  VW SFL+LQ   NDVAVCS WYW+LF LQFP+AF VFGYEA KLY E+KKR+  
Sbjct: 241 LLILFLVWTSFLLLQVIKNDVAVCSTWYWVLFCLQFPIAFGVFGYEAVKLYRENKKRIST 300

Query: 361 GNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASAT 420
           GN E ICEASI WT  H+ FCALCGI+GGTVGGLLGSGGGFVLGPLLLEIGV P VASAT
Sbjct: 301 GNTETICEASIEWTPMHILFCALCGIIGGTVGGLLGSGGGFVLGPLLLEIGVSPHVASAT 360

Query: 421 ATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVF 480
           +TFVMMFSSSLSVVEFYLL RFPIPFALYL  VSVLAGFWGQFFVRK++ IL RASLIVF
Sbjct: 361 STFVMMFSSSLSVVEFYLLKRFPIPFALYLMGVSVLAGFWGQFFVRKLVKILGRASLIVF 420

Query: 481 VLSGVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           +LSGVIF SA+TMG VG+  SITMI NHEFMGFL+FCSSQ
Sbjct: 421 ILSGVIFVSALTMGGVGIDTSITMIRNHEFMGFLEFCSSQ 460

BLAST of Cp4.1LG10g04330 vs. TrEMBL
Match: A0A0B0NNJ1_GOSAR (Putative tripeptidyl-peptidase SED4 OS=Gossypium arboreum GN=F383_18561 PE=4 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 1.6e-191
Identity = 347/460 (75.43%), Postives = 404/460 (87.83%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSPSS---SPSSSFVLS--DSLSTDKIWPDLELSWR 120
           MAT GF+LYLLS  SIA+LSVL++   ++   + SS+ + S  +  +T+K+WP LEL+WR
Sbjct: 1   MATKGFVLYLLSGFSIAVLSVLFIQKSNNDDMNQSSNLLESPYNLSTTEKVWPALELNWR 60

Query: 121 LVVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYN 180
           LV+ATVIGFLGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAAISKCMIMGAS+SSVWYN
Sbjct: 61  LVMATVIGFLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAAISKCMIMGASASSVWYN 120

Query: 181 LRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSF 240
           LRV HPTK+VPIID+DLALLFQPMLMLGITVGV+LSVVFPYWLITVLIIILF+GTSSRSF
Sbjct: 121 LRVPHPTKEVPIIDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIIILFLGTSSRSF 180

Query: 241 FKGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKR 300
           ++GIEMWKEETIL KE  K  E+FVNSRGELLID EY+PL+PKE+K+ L+++CFNLRWKR
Sbjct: 181 YRGIEMWKEETILNKELTKPQESFVNSRGELLIDTEYEPLVPKEEKSKLQILCFNLRWKR 240

Query: 301 TSILLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEA 360
             +L  VW+ F ++Q   NDV  C+  YW+LFSLQFP+A  VFGYEA KLY EHKKRM  
Sbjct: 241 LLVLATVWVLFTVIQVIKNDVVPCTTLYWVLFSLQFPIATLVFGYEATKLYKEHKKRMST 300

Query: 361 GNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASAT 420
           GN E +C ASI W+  ++AFCALCGI+GGTVGGLLGSGGGF+LGPLLLEIGV+PQVASAT
Sbjct: 301 GNAETVCGASIQWSPLNIAFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASAT 360

Query: 421 ATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVF 480
           ATFVMMFSSSLSVVEFYLL RFP+P+ALYL  VS+LAGFWGQ+FVRK+I ILRRASLIVF
Sbjct: 361 ATFVMMFSSSLSVVEFYLLKRFPMPYALYLMGVSILAGFWGQYFVRKLITILRRASLIVF 420

Query: 481 VLSGVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           +LSGVIFASA+TMG++G+ +SI MIHNHEFMGFL+FCSSQ
Sbjct: 421 ILSGVIFASALTMGVIGIERSIRMIHNHEFMGFLNFCSSQ 460

BLAST of Cp4.1LG10g04330 vs. TrEMBL
Match: A0A059DFV3_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A01580 PE=4 SV=1)

HSP 1 Score: 675.6 bits (1742), Expect = 4.6e-191
Identity = 345/459 (75.16%), Postives = 398/459 (86.71%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSPSSSPSSSFVL----SDSLSTDKIWPDLELSWRL 120
           MAT G ++YL+S  ++AI+S  +L+S +   +    L    SD   T+K+WPDLE  WR+
Sbjct: 1   MATKGLVVYLISGFALAIVSAFFLASGTREDAGRGPLVSLRSDLRDTEKVWPDLEFGWRV 60

Query: 121 VVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNL 180
           VVATVIGFLGSACGTVGGVGGGGIFVPMLTL++GFDTKSAAAISKCMIMGAS++SVWYNL
Sbjct: 61  VVATVIGFLGSACGTVGGVGGGGIFVPMLTLVVGFDTKSAAAISKCMIMGASTASVWYNL 120

Query: 181 RVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFF 240
           RV HP ++VPIID+DLALLFQPMLMLGITVGV+LSVVFPYWLITVLIIILF+GTSSRSF 
Sbjct: 121 RVPHPMREVPIIDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIIILFLGTSSRSFC 180

Query: 241 KGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRT 300
           KG+EMWKEET+  KE AKQ ET VNSRGELLID +YD LIP E+KT L+++C NLRWKRT
Sbjct: 181 KGVEMWKEETVFNKELAKQQETLVNSRGELLIDTQYDLLIPNEEKTELQILCLNLRWKRT 240

Query: 301 SILLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAG 360
            IL+ VW++FL+LQ   ND+AVCS WYW+LF LQFP+A +VFGYEA KLY EHKKRM +G
Sbjct: 241 LILVLVWVAFLLLQVIKNDLAVCSTWYWVLFCLQFPIAMSVFGYEAVKLYREHKKRMCSG 300

Query: 361 NLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATA 420
           N E ICEA+I WT  H+AFCALCGI+GG VGGLLGSGGGF+LGPLLLEIGV+PQVASATA
Sbjct: 301 NTEAICEATIQWTAVHIAFCALCGILGGCVGGLLGSGGGFILGPLLLEIGVIPQVASATA 360

Query: 421 TFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFV 480
           TFVMMFSSSLSVVEFYLL RFPIP+ALYL +VSVLAGFWGQ+FVRK+I  LRRASLIVF+
Sbjct: 361 TFVMMFSSSLSVVEFYLLKRFPIPYALYLMAVSVLAGFWGQYFVRKLITFLRRASLIVFL 420

Query: 481 LSGVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           LSGVIFASA+TMG++G+ +SI MI NHEFMGFL FCSSQ
Sbjct: 421 LSGVIFASALTMGVIGIEESIVMIKNHEFMGFLSFCSSQ 459

BLAST of Cp4.1LG10g04330 vs. TAIR10
Match: AT2G36630.1 (AT2G36630.1 Sulfite exporter TauE/SafE family protein)

HSP 1 Score: 644.8 bits (1662), Expect = 4.4e-185
Identity = 334/458 (72.93%), Postives = 391/458 (85.37%), Query Frame = 1

Query: 58  FSAMATSGFLLYLLSASSIAILSVLYLSSPSSSPSSSFVLSDSLSTDKIWPDLELSWRLV 117
           ++   T GF+LYLL A S+A+ SV Y+   ++       LS   +T+KIWPDL+ SW+LV
Sbjct: 4   WNGKGTGGFILYLLVAFSVAVFSVSYVGDTTNPIHHH--LSSLSATEKIWPDLKFSWKLV 63

Query: 118 VATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLR 177
           +ATVI FLGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAAISKCMIMGAS+SSVWYN+R
Sbjct: 64  LATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAAAISKCMIMGASASSVWYNVR 123

Query: 178 VAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFK 237
           V HPTK+VPI+D+DLALLFQPML+LGITVGVSLSVVFPYWLITVLIIILF+GTSSRSFFK
Sbjct: 124 VRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYWLITVLIIILFVGTSSRSFFK 183

Query: 238 GIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTS 297
           GIEMWKEET+LK E A+Q    VNSRGELLID EY+PL P+E+K+ LE++  NL+WK   
Sbjct: 184 GIEMWKEETLLKNEMAQQRANMVNSRGELLIDTEYEPLYPREEKSELEIIRSNLKWKGLL 243

Query: 298 ILLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGN 357
           IL+ VW++FL++Q   N++ VCS  YWILF +QFPVA AVFG+EA KLY  +KKR+ +GN
Sbjct: 244 ILVTVWLTFLLIQIVKNEIKVCSTIYWILFIVQFPVALAVFGFEASKLYTANKKRLNSGN 303

Query: 358 LEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATAT 417
            E ICEA+I WT   L FC LCG++GG VGGLLGSGGGFVLGPLLLEIGV+PQVASATAT
Sbjct: 304 TECICEATIEWTPLSLIFCGLCGLIGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATAT 363

Query: 418 FVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVL 477
           FVMMFSSSLSVVEFYLL RFPIP+A+YL SVS+LAGFWGQ F+RK++AILRRAS+IVFVL
Sbjct: 364 FVMMFSSSLSVVEFYLLKRFPIPYAMYLISVSILAGFWGQSFIRKLVAILRRASIIVFVL 423

Query: 478 SGVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           SGVI ASA+TMG++G+ KSI MIHNHEFMGFL FCSSQ
Sbjct: 424 SGVICASALTMGVIGIEKSIKMIHNHEFMGFLGFCSSQ 459

BLAST of Cp4.1LG10g04330 vs. TAIR10
Match: AT2G25737.1 (AT2G25737.1 Sulfite exporter TauE/SafE family protein)

HSP 1 Score: 402.5 bits (1033), Expect = 3.8e-112
Identity = 224/421 (53.21%), Postives = 292/421 (69.36%), Query Frame = 1

Query: 98  SDSLSTDKIWPDLELSWRLVVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAA 157
           SD +    +WP+ E +W++V+ T++GF G+A G+VGGVGGGGIFVPML+LIIGFD KSA 
Sbjct: 62  SDQIGYRHVWPEFEFNWQIVLGTLVGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSAT 121

Query: 158 AISKCMIMGASSSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYW 217
           AISKCMIMGAS S+V+YNLR+ HPT D+PIID+DLALL QPMLMLGI++GV+ +V+FP W
Sbjct: 122 AISKCMIMGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDW 181

Query: 218 LITVLIIILFIGTSSRSFFKGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPL-- 277
           L+TVL+I+LF+GTS+++F KG E W +ETI KKE AK+ E    S G    +VEY PL  
Sbjct: 182 LVTVLLIVLFLGTSTKAFLKGSETWNKETIEKKEAAKRLE----SNGVSGTEVEYVPLPA 241

Query: 278 ----IPKEQKTALELMCFNLRWKRTSILLAVWISFLILQ---NDVAVCSPWYWILFSLQF 337
                P  +K     +  N+ WK   +L+ VWI FL LQ    ++A CS  YW++  LQ 
Sbjct: 242 APSTNPGNKKKEEVSIIENVYWKELGLLVFVWIVFLALQISKQNLANCSVAYWVINLLQI 301

Query: 338 PVAFAVFGYEARKLYNEHKKRMEAGNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLG 397
           PVA  V GYEA  LY   +     G      +    +T   L      GI+ G VGGLLG
Sbjct: 302 PVAVGVSGYEAVALYQGRRIIASKG------QGDSNFTVGQLVMYCTFGIIAGIVGGLLG 361

Query: 398 SGGGFVLGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVL 457
            GGGF++GPL LE+GV PQV+SATATF M FSSS+SVVE+YLL RFP+P+ALYL  V+ +
Sbjct: 362 LGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATI 421

Query: 458 AGFWGQFFVRKMIAILRRASLIVFVLSGVIFASAITMGIVGVSKSITMIHNHEFMGFLDF 510
           A + GQ  VR++IA + RASLI+F+L+ +IF SAI++G VG+   I  I  HE+MGF + 
Sbjct: 422 AAWVGQHVVRRLIAAIGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENL 472

BLAST of Cp4.1LG10g04330 vs. TAIR10
Match: AT4G21250.1 (AT4G21250.1 Sulfite exporter TauE/SafE family protein)

HSP 1 Score: 171.4 bits (433), Expect = 1.4e-42
Identity = 126/408 (30.88%), Postives = 211/408 (51.72%), Query Frame = 1

Query: 109 DLELSWRLVVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGAS 168
           +L+LS  +++A V+ FL +   + GG+GGGG+F+P++T++ G D K+A++ S  M+ G S
Sbjct: 51  ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110

Query: 169 SSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFI 228
            ++V  NL          ++D+DLALL +P ++LG+++GV  + V P WLITVL  +   
Sbjct: 111 IANVISNLFGGKA-----LLDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 170

Query: 229 GTSSRSFFKGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMC 288
            +S ++   G++ WK    L+ E A++S      RG+  I+ E   L     +       
Sbjct: 171 WSSLKTCRSGVKFWK----LESEIARESGHGRPERGQGQIEEETKNLKAPLLEAQATKNK 230

Query: 289 FNLRWKRTSILLAVWISFLILQN-----------DVAVCSPWYWILFSLQFPVAFAVFGY 348
             + W +  +L+ VW SF ++              +  C   YWIL SLQ P+A     +
Sbjct: 231 SKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPLALI---F 290

Query: 349 EARKLYNEHKKRMEAGNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGP 408
               L     ++ ++ N +   E +     T L F A+   + G +GG+ G GGG ++ P
Sbjct: 291 TKLALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAM-SFLAGLLGGIFGIGGGMLISP 350

Query: 409 LLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFV 468
           LLL+ G+ PQ+ +AT +F++ FS+++S V++ LL       A   + +  LA   G   V
Sbjct: 351 LLLQSGIPPQITAATTSFMVFFSATMSAVQYLLLGMQNTDTAYVFSFICFLASLLGLVLV 410

Query: 469 RKMIAILRRASLIVFVLSGVIFASAITMGIVGVSKSITMIHNHEFMGF 506
           +K +A   RAS+IVF +  V+  S + M   G     T     + MGF
Sbjct: 411 QKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGF 445

BLAST of Cp4.1LG10g04330 vs. TAIR10
Match: AT1G61740.1 (AT1G61740.1 Sulfite exporter TauE/SafE family protein)

HSP 1 Score: 158.7 bits (400), Expect = 9.7e-39
Identity = 124/454 (27.31%), Postives = 227/454 (50.00%), Query Frame = 1

Query: 78  ILSVLYLSSPSSSPSSSFVLSD------------SLSTDKIWPDLELSWRLVVATVIGFL 137
           ILS +   +PS +     +LS               ST    P +EL+   ++A ++ FL
Sbjct: 9   ILSFIIFLTPSIAEQEPSILSPVDQLLNKTSSYLDFSTKFNQPRIELTTSTIIAGLLSFL 68

Query: 138 GSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRVAHPTKD- 197
            S+  + GG+GGGG++VP++T++ G D K+A++ S  M+ G S ++V  NL V +P    
Sbjct: 69  ASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGG 128

Query: 198 VPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKE 257
             +ID DLALL +P ++LG+++GV  ++VFP WLIT L  +    ++ ++F  G+  W+ 
Sbjct: 129 KTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAVFLAWSTLKTFGNGLYYWRL 188

Query: 258 ETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSILLAVWI 317
           E+ + K   ++S        E  I+    PL+   Q+           W +  +L+ +W+
Sbjct: 189 ESEMVK--IRESNRIEEDDEEDKIESLKLPLLEDYQRPK------RFPWIKLGVLVIIWL 248

Query: 318 SFL---ILQND--------VAVCSPWYWILFSLQFPVA--FAVFGYEARKLYNEHKKRME 377
           S+    +L+ +        +  C   YW++ S Q P+   F ++   +  + ++ +    
Sbjct: 249 SYFAVYLLRGNKYGEGIISIEPCGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYH 308

Query: 378 AGNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASA 437
               +     S     ++     +  ++ G +GG+ G GGG ++ PLLL++G+ P+V +A
Sbjct: 309 VSVKDVEDLRSNDGARSNKCMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAA 368

Query: 438 TATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIV 497
           T +F+++FSS++S +++ LL       A     +  +A   G   V+K+I    RAS+IV
Sbjct: 369 TCSFMVLFSSTMSAIQYLLLGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIV 428

Query: 498 FVLSGVIFASAITMGIVGVSKSITMIHNHEFMGF 506
           F +  V+  S + M   G         +  +MGF
Sbjct: 429 FSVGIVMALSIVLMTSYGALDVWNDYVSGRYMGF 454

BLAST of Cp4.1LG10g04330 vs. TAIR10
Match: AT1G11540.1 (AT1G11540.1 Sulfite exporter TauE/SafE family protein)

HSP 1 Score: 148.7 bits (374), Expect = 1.0e-35
Identity = 112/376 (29.79%), Postives = 192/376 (51.06%), Query Frame = 1

Query: 145 LTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRVAHP-TKDVPIIDHDLALLFQPMLMLG 204
           +T+I G + K+A++ S  M+ G S ++V  NL + +P ++D  +ID DLAL  QP L+LG
Sbjct: 1   MTIIAGLEMKTASSFSAFMVTGVSFANVGCNLFLRNPKSRDKTLIDFDLALTIQPCLLLG 60

Query: 205 ITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEFAK-QSETFVNS 264
           +++GV  + +FP WL+  L  +    ++ ++  KG+  W     L+ E AK +S   V+ 
Sbjct: 61  VSIGVICNRMFPNWLVLFLFAVFLAWSTMKTCKKGVSYWN----LESERAKIKSPRDVDG 120

Query: 265 RGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSILLAVWISFLIL---------QNDV 324
                I+V   PL+ +E++   +       W +  +L+ +W+ F  +         Q  +
Sbjct: 121 -----IEVARSPLLSEEREDVRQRGMIRFPWMKLGVLVIIWLLFFSINLFRGNKYGQGII 180

Query: 325 AV--CSPWYWILFSLQFPVA--FAVFGYEARKLYNEHKKRMEAGNLEHICEASIGWTGTH 384
           ++  C   YW L SLQ P+   F +  Y +  + + H       + +   E  +G     
Sbjct: 181 SIKPCGALYWFLSSLQIPLTIFFTLCIYFSDNVQSNHTSHSNQNSEQ---ETGVGGRQNK 240

Query: 385 LAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFY 444
           L    +  ++ G +GGL G GGG ++ PLLL+IG+ P+V +AT +F+++FSSS+S +++ 
Sbjct: 241 LMLPVMA-LLAGVLGGLFGIGGGMLISPLLLQIGIAPEVTAATCSFMVLFSSSMSAIQYL 300

Query: 445 LLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLSGVIFASAITMGIVG 504
           LL       A     V  +A   G   V+K+IA   RAS+IVF +  V+  S + M   G
Sbjct: 301 LLGMEHAGTAAIFALVCFVASLVGLMVVKKVIAKYGRASIIVFAVGIVMALSTVLMTTHG 360

Query: 505 VSKSITMIHNHEFMGF 506
                    +  +MGF
Sbjct: 361 AFNVWNDFVSGRYMGF 363

BLAST of Cp4.1LG10g04330 vs. NCBI nr
Match: gi|449451245|ref|XP_004143372.1| (PREDICTED: uncharacterized protein LOC101206149 [Cucumis sativus])

HSP 1 Score: 780.4 bits (2014), Expect = 1.9e-222
Identity = 414/455 (90.99%), Postives = 429/455 (94.29%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLS-SPSSSPSSSFVLSDSLSTDKIWPDLELSWRLVVA 120
           MATSGFLLYLLSASSIA+LS+LYLS S SSS SS+  LS SLSTDK WPDLE SWRLV A
Sbjct: 1   MATSGFLLYLLSASSIAVLSLLYLSDSSSSSSSSTTALSASLSTDKTWPDLEPSWRLVAA 60

Query: 121 TVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRVA 180
           TVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGAS+SSVWYNLRVA
Sbjct: 61  TVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASTSSVWYNLRVA 120

Query: 181 HPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGI 240
           HPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLIT+LIIILFIGTSSRSFFKGI
Sbjct: 121 HPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITILIIILFIGTSSRSFFKGI 180

Query: 241 EMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSIL 300
           EMWKEETILKKEFAK+ ET VNS GELLIDVEYDPLIPKEQKT LELMCFNLRWKRTSIL
Sbjct: 181 EMWKEETILKKEFAKRCETVVNSHGELLIDVEYDPLIPKEQKTELELMCFNLRWKRTSIL 240

Query: 301 LAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNLE 360
            AVWISFLILQ   NDVA CS WYW++F LQFP+A  VFGYEARKLY EHKKRMEAGNLE
Sbjct: 241 FAVWISFLILQVVKNDVAACSIWYWVVFFLQFPIAIVVFGYEARKLYKEHKKRMEAGNLE 300

Query: 361 HICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFV 420
            ICEASIGWTG+HLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFV
Sbjct: 301 QICEASIGWTGSHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFV 360

Query: 421 MMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLSG 480
           MMFSSSLSVVEFYLL+RFPIP+ALYLTSVSVLAGFWGQFFVRK+I ILRRASLIVF+LSG
Sbjct: 361 MMFSSSLSVVEFYLLNRFPIPYALYLTSVSVLAGFWGQFFVRKLITILRRASLIVFILSG 420

Query: 481 VIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSS 512
           VIFASAITMGIVGV+KSITMI NHEFMGFLDFCSS
Sbjct: 421 VIFASAITMGIVGVTKSITMIQNHEFMGFLDFCSS 455

BLAST of Cp4.1LG10g04330 vs. NCBI nr
Match: gi|659125400|ref|XP_008462667.1| (PREDICTED: uncharacterized protein LOC103500970, partial [Cucumis melo])

HSP 1 Score: 749.6 bits (1934), Expect = 3.6e-213
Identity = 392/430 (91.16%), Postives = 404/430 (93.95%), Query Frame = 1

Query: 85  SSPSSSPSSSFVLSDSLSTDKIWPDLELSWRLVVATVIGFLGSACGTVGGVGGGGIFVPM 144
           SS SSS SSS VL+ SL TDK WPDLELSWRLV ATVIGFLGSACGTVGGVGGGGIFVPM
Sbjct: 5   SSSSSSSSSSTVLNASLYTDKTWPDLELSWRLVAATVIGFLGSACGTVGGVGGGGIFVPM 64

Query: 145 LTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGI 204
           LTLIIGFDTKSAAAISKCMIMGAS+SSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGI
Sbjct: 65  LTLIIGFDTKSAAAISKCMIMGASTSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGI 124

Query: 205 TVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEFAKQSETFVNSRG 264
           TVGV+LSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEFAKQ ET  NS G
Sbjct: 125 TVGVALSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEFAKQCETVANSHG 184

Query: 265 ELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSILLAVWISFLILQ---NDVAVCSPWYW 324
           ELLIDVEYDPLIPKEQKT LELMCFNLRWKRTSIL  VWISFLILQ   NDVA CS WYW
Sbjct: 185 ELLIDVEYDPLIPKEQKTKLELMCFNLRWKRTSILFVVWISFLILQVVKNDVAACSNWYW 244

Query: 325 ILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNLEHICEASIGWTGTHLAFCALCGIVGG 384
           ++F LQFP+A AVFGYEARKLY EHKKRME GNLE ICEASIGWTG+HLAFCALCGIVGG
Sbjct: 245 VVFFLQFPIAIAVFGYEARKLYKEHKKRMETGNLEQICEASIGWTGSHLAFCALCGIVGG 304

Query: 385 TVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLHRFPIPFALY 444
           TVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLL+RFPIP+ALY
Sbjct: 305 TVGGLLGSGGGFVLGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLNRFPIPYALY 364

Query: 445 LTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLSGVIFASAITMGIVGVSKSITMIHNHE 504
           LTSVSVLAGFWGQFFVRK+I ILRRASLIVF+LSGVIFASAITMGIVGV+KSITMI NHE
Sbjct: 365 LTSVSVLAGFWGQFFVRKLITILRRASLIVFILSGVIFASAITMGIVGVTKSITMIQNHE 424

Query: 505 FMGFLDFCSS 512
           FMGFLDFCSS
Sbjct: 425 FMGFLDFCSS 434

BLAST of Cp4.1LG10g04330 vs. NCBI nr
Match: gi|590631448|ref|XP_007027567.1| (Sulfite exporter TauE/SafE family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 682.6 bits (1760), Expect = 5.5e-193
Identity = 350/457 (76.59%), Postives = 399/457 (87.31%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSPSSSPSSSFVLSDS--LSTDKIWPDLELSWRLVV 120
           MAT GFLLYLLS  S+AILSVL+++  ++   +S +L      S DK WP+LEL+WRLV+
Sbjct: 1   MATRGFLLYLLSGFSVAILSVLFINKNNNMYHNSTLLHSPNVSSVDKDWPELELNWRLVL 60

Query: 121 ATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRV 180
           ATVIGFLGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAAISKCMIMGAS+SSVWYNLRV
Sbjct: 61  ATVIGFLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAAISKCMIMGASASSVWYNLRV 120

Query: 181 AHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKG 240
            HPTK+VPIID+DLALLFQPMLMLGITVGV+LSVVFPYWLITVLIIILF+ TSSRSF+K 
Sbjct: 121 PHPTKEVPIIDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIIILFLSTSSRSFYKA 180

Query: 241 IEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSI 300
            EMWKEETILKKE  +Q ET VNSRGELLID EY+PL+P+E+K+ L+++CFNLRWKR  I
Sbjct: 181 TEMWKEETILKKELTRQQETLVNSRGELLIDAEYEPLVPREEKSELQILCFNLRWKRLLI 240

Query: 301 LLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNL 360
           L  VW+ F ++Q   ND+ VCS WYW+LF LQ P+A  VFGYEA KLY EHKKRM  GN 
Sbjct: 241 LATVWVLFTLIQVIKNDLVVCSTWYWVLFCLQLPIAVLVFGYEATKLYKEHKKRMSTGNR 300

Query: 361 EHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATF 420
           E ICEASI W+  ++AFCALCGI+GGTVGGLLGSGGGF+LGPLLLEIGV+PQVASATATF
Sbjct: 301 EAICEASIQWSPLNIAFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATATF 360

Query: 421 VMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLS 480
           VMMFSSSLSVVEFYLL RFPIP+ALYL  VS+LAGFWGQ+FVRK+I IL+RASLIVF+LS
Sbjct: 361 VMMFSSSLSVVEFYLLKRFPIPYALYLMGVSILAGFWGQYFVRKLITILKRASLIVFILS 420

Query: 481 GVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           GVIFASA+TMG++G+  SI MIHNHEFMGFLDFCSSQ
Sbjct: 421 GVIFASALTMGVIGIDTSIQMIHNHEFMGFLDFCSSQ 457

BLAST of Cp4.1LG10g04330 vs. NCBI nr
Match: gi|566175809|ref|XP_006381336.1| (hypothetical protein POPTR_0006s11920g [Populus trichocarpa])

HSP 1 Score: 681.8 bits (1758), Expect = 9.3e-193
Identity = 358/460 (77.83%), Postives = 396/460 (86.09%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSP----SSSPSSSFVLSDSLST-DKIWPDLELSWR 120
           MAT G +LYLLS  S+AILSV +LS P    S +P+S    S  LST DK+WP LE SWR
Sbjct: 1   MATRGLVLYLLSGFSVAILSVFFLSHPNEKASPNPNSDIFASPYLSTTDKVWPKLEFSWR 60

Query: 121 LVVATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYN 180
            V+ATVIG LGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAA+SKCMIM AS+SSVWYN
Sbjct: 61  TVLATVIGLLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAALSKCMIMAASASSVWYN 120

Query: 181 LRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSF 240
           LRV HPT++VPIID+DLALLFQPML+LGIT+GVSLSVVFPYWLITVLIIILFIGTSSRSF
Sbjct: 121 LRVPHPTREVPIIDYDLALLFQPMLLLGITLGVSLSVVFPYWLITVLIIILFIGTSSRSF 180

Query: 241 FKGIEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKR 300
           FKGIEMWKEETILKKE   Q ET VNSRGELLID EY+PLIP+E+K+ ++++CFNL+WKR
Sbjct: 181 FKGIEMWKEETILKKEMVIQQETIVNSRGELLIDTEYEPLIPREEKSKMQILCFNLKWKR 240

Query: 301 TSILLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEA 360
             IL  VW SFL+LQ   NDVAVCS WYW+LF LQFP+AF VFGYEA KLY E+KKR+  
Sbjct: 241 LLILFLVWTSFLLLQVIKNDVAVCSTWYWVLFCLQFPIAFGVFGYEAVKLYRENKKRIST 300

Query: 361 GNLEHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASAT 420
           GN E ICEASI WT  H+ FCALCGI+GGTVGGLLGSGGGFVLGPLLLEIGV P VASAT
Sbjct: 301 GNTETICEASIEWTPMHILFCALCGIIGGTVGGLLGSGGGFVLGPLLLEIGVSPHVASAT 360

Query: 421 ATFVMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVF 480
           +TFVMMFSSSLSVVEFYLL RFPIPFALYL  VSVLAGFWGQFFVRK++ IL RASLIVF
Sbjct: 361 STFVMMFSSSLSVVEFYLLKRFPIPFALYLMGVSVLAGFWGQFFVRKLVKILGRASLIVF 420

Query: 481 VLSGVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           +LSGVIF SA+TMG VG+  SITMI NHEFMGFL+FCSSQ
Sbjct: 421 ILSGVIFVSALTMGGVGIDTSITMIRNHEFMGFLEFCSSQ 460

BLAST of Cp4.1LG10g04330 vs. NCBI nr
Match: gi|1000947655|ref|XP_015580525.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC8274876 [Ricinus communis])

HSP 1 Score: 681.8 bits (1758), Expect = 9.3e-193
Identity = 353/457 (77.24%), Postives = 398/457 (87.09%), Query Frame = 1

Query: 61  MATSGFLLYLLSASSIAILSVLYLSSPSSSPSSSFVLSDSL--STDKIWPDLELSWRLVV 120
           MAT G +LYL  A S A+LS ++L        +S +LS     +T+++WP+LE SWR+V+
Sbjct: 1   MATRGLVLYLSLAFSAAVLSAVFLFDHHPYVKNSTLLSSHYISTTERVWPELEFSWRIVL 60

Query: 121 ATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMGASSSSVWYNLRV 180
           ATVIGFLGSACGTVGGVGGGGIFVPMLTLI+GFDTKSAAAISKCMIMGAS+SSVWYNLRV
Sbjct: 61  ATVIGFLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAAISKCMIMGASASSVWYNLRV 120

Query: 181 AHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITVLIIILFIGTSSRSFFKG 240
            HPTK+VPI+D+DLALLFQPMLMLGITVGV+ SVVFPYWLITVLIIILFIGTSSRSFFKG
Sbjct: 121 PHPTKEVPILDYDLALLFQPMLMLGITVGVASSVVFPYWLITVLIIILFIGTSSRSFFKG 180

Query: 241 IEMWKEETILKKEFAKQSETFVNSRGELLIDVEYDPLIPKEQKTALELMCFNLRWKRTSI 300
           +EMWKEETILKKE AKQ E  VNSRGELLID EY+PL+PKE+K+ ++++CFNLRWKR  +
Sbjct: 181 VEMWKEETILKKELAKQQEAVVNSRGELLIDTEYEPLVPKEEKSEMQIVCFNLRWKRLFV 240

Query: 301 LLAVWISFLILQ---NDVAVCSPWYWILFSLQFPVAFAVFGYEARKLYNEHKKRMEAGNL 360
           LL V + FL+LQ   NDVA CS WYW+LF LQFPVA AVFGYEA KLY EHKKR+  GN 
Sbjct: 241 LLFVCLXFLLLQVIKNDVATCSKWYWVLFCLQFPVALAVFGYEAVKLYKEHKKRISTGNT 300

Query: 361 EHICEASIGWTGTHLAFCALCGIVGGTVGGLLGSGGGFVLGPLLLEIGVVPQVASATATF 420
           E ICEASI WT  H++FCALCGI+GGTVGGLLGSGGGF+LGPLLLEIGV+PQVASATATF
Sbjct: 301 ESICEASIAWTPMHISFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATATF 360

Query: 421 VMMFSSSLSVVEFYLLHRFPIPFALYLTSVSVLAGFWGQFFVRKMIAILRRASLIVFVLS 480
           VMMFSSSLSVVEFYLL RFP+P+ALYLT VSVLAGFWGQFFVRK+I IL+R SLIVF+LS
Sbjct: 361 VMMFSSSLSVVEFYLLKRFPMPYALYLTGVSVLAGFWGQFFVRKLITILKRGSLIVFILS 420

Query: 481 GVIFASAITMGIVGVSKSITMIHNHEFMGFLDFCSSQ 513
           GVIFASAITMG+VG  KSI MI+NHEFMGFL FCSSQ
Sbjct: 421 GVIFASAITMGVVGTEKSIRMINNHEFMGFLGFCSSQ 457

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KII9_CUCSA1.3e-22290.99Uncharacterized protein OS=Cucumis sativus GN=Csa_6G449290 PE=4 SV=1[more]
A0A061F0N0_THECC3.8e-19376.59Sulfite exporter TauE/SafE family protein isoform 1 OS=Theobroma cacao GN=TCM_02... [more]
B9MTM3_POPTR6.5e-19377.83Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s11920g PE=4 SV=1[more]
A0A0B0NNJ1_GOSAR1.6e-19175.43Putative tripeptidyl-peptidase SED4 OS=Gossypium arboreum GN=F383_18561 PE=4 SV=... [more]
A0A059DFV3_EUCGR4.6e-19175.16Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A01580 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G36630.14.4e-18572.93 Sulfite exporter TauE/SafE family protein[more]
AT2G25737.13.8e-11253.21 Sulfite exporter TauE/SafE family protein[more]
AT4G21250.11.4e-4230.88 Sulfite exporter TauE/SafE family protein[more]
AT1G61740.19.7e-3927.31 Sulfite exporter TauE/SafE family protein[more]
AT1G11540.11.0e-3529.79 Sulfite exporter TauE/SafE family protein[more]
Match NameE-valueIdentityDescription
gi|449451245|ref|XP_004143372.1|1.9e-22290.99PREDICTED: uncharacterized protein LOC101206149 [Cucumis sativus][more]
gi|659125400|ref|XP_008462667.1|3.6e-21391.16PREDICTED: uncharacterized protein LOC103500970, partial [Cucumis melo][more]
gi|590631448|ref|XP_007027567.1|5.5e-19376.59Sulfite exporter TauE/SafE family protein isoform 1 [Theobroma cacao][more]
gi|566175809|ref|XP_006381336.1|9.3e-19377.83hypothetical protein POPTR_0006s11920g [Populus trichocarpa][more]
gi|1000947655|ref|XP_015580525.1|9.3e-19377.24PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC8274876 [Ricinus comm... [more]
The following terms have been associated with this gene:
Vocabulary: Cellular Component
TermDefinition
GO:0016021integral component of membrane
Vocabulary: INTERPRO
TermDefinition
IPR002781TM_pro_TauE-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG10g04330.1Cp4.1LG10g04330.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002781Transmembrane protein TauE-likePFAMPF01925TauEcoord: 299..477
score: 2.8E-11coord: 121..238
score: 2.7
NoneNo IPR availablePANTHERPTHR14255ATP-DEPENDENT PROTEASE CEREBLONcoord: 35..512
score: 1.4E
NoneNo IPR availablePANTHERPTHR14255:SF5SUBFAMILY NOT NAMEDcoord: 35..512
score: 1.4E