Cp4.1LG17g10040 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG17g10040
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionUBX domain-containing family protein
LocationCp4.1LG17 : 7547362 .. 7556246 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGGATTAACCTATAATTTTACACCCGCGAAAAAGAGAAAAAATCTTCGTCGAATGTCCGTTGCTGTGCTCGTGACCGCGCGTCGGGGTTGTCAACAGAGAATCAGCTCATCCTTCTGAAATTTCTTAGTCCAATCTCCACTGAATTTGTTTCAGGTGATCGGAATTTCAGTTTCTTGTAATAGGTCATCAAATTTTGCTTTTGGAATTGACGGCCAGTGGCAAGTGGATAGACGAATGGAGATTGAAGGGGCTGTGTGAAGACGGGTCCTGATTTTATCACGGGGGATATAATTGAATTGTTTATGTTTTGATTCTTCTCATTATTACTTAGTTTTTCTTGAGTGGTTTGGTTTTAAAATCTGACTCTCTTTGTATTTCTTACTGTCTGCGATTTTGCCTCATCGCCATTGTTGTTTATTCTTTCTTTGTGCAGATGGAGCAGTCTATATCATCACTGGCATTCAAGGGATCCGTGCCTGAAGCAATTATCGAATCCAAAAATCAAAAAAAACTATTTGTGGTTTATATTTCCGGTAAATTTTTAGTCGTTCTACCTTACTCAAGGGTTTTTCTGATTTGGACGTCTTTTAGATGGAAGGAGGAAAGTCAACTTTTGTTATCCTTAAGTAGAGGGGGAGTAACTGGTCGAAGTGATGATGTATATCTATACTTTCTCCCTTTTTCCCTGTATGTTTCTACTTAGATATAGCCAACCTAGAGTTTCTACTTGGAAAAGCACATTCAAGAAGATCAGTTTATTTGTTCGTTCCTTTTTTATGGTTATCAGGATAGGTGAATTATTAAGGACCCGTATCAAAATTGATTTCTTACCCAAATCCAAGGATATATGAACGATTAGTTCTAGGTGTGCAAAAATTTAGGAGAATTCTGTTTTTCTTAATCACATTATTGCAAGGAATTCAGTTCTTATACTCAAGTAGGCCTGCAAATTTGTTCATGTTCAGGTGCTCTCTCGGTTGTATTACCATTGGGTTTTAGCCCCCCAAAGATGTTGACACATGGAGTGCCTTACACACATCTTTGTTAATATTCCATTTTTCCTCATTAATCTATATCGGAGGCCTTTTGTTTGACCTCTCTTTTCAGGATTAGGGATCCCTTGTCCATAAGCCTTCACGTTATCTTCCACTCTTGTTTTGTCTAATGAAGTTTCCTTGTTTCTTATTTTAACAAATTCAGTCAACTATGGGAAAAAGTATTTTGTGATGTGGGATTCTTTTCTAGCACCTACAAATTTCCTTATGCTTTTAGCACAGGTGACACTTGCCGTTAATATTGTATATTCCACTAGTTCGTACCCCATTTTCTTTGAGATTAACATGAATATCCATTCCCCCGGGGCTGGCAGATGCAGAGAGAGAGAGAGAGAGAGAGAGGGAAAAAAGTAAGAAAAAAGGAAAAGGAAAAATGAAGGAAAAGAGGAAAAAAGAGAGCCGAAAGTTTTCTCTAAGATATGAGAATTCTGTGTAGTTAAACAACTAATGTGTGATGGATTGGATGTTCTATTTATGTGAGGTCCTGCAGCATAGAAGTAAGGAGAATGGGCCATGCCAGGGTGTTGATTAGGAAATCATTGGGAGAGAGGAAGGTTGTGTTGCACCTAATTGTCATTTTTCTGTTGGGTCCCATGGATGGTAATTGGAAGGGGAGCTTCAACTAGATTCTGTGAGGAAAGGTCTAAGACTCCTGAATTTCCTGGGGTTTCTTTAGCTTGAGATTTTACAACCGCACTTACCTATAGAAGAACATTTTAATCTCTATTTTCTTTTCAGTTTTCTATTCCAAAACCCATCAACAATCTCAAGATATTATAACAACTGACTACTATGAGAATTTTGAATTTTGTGCTGTTGTAATTTATCCTTCTCCTTCCCCTTTTATTTTATTTAGGTGATGATGCTGAATCAAGCAGTTTGGAAAGTTCAACTTGGACAAGTTCAAGGGTAAACTTCTATCTGGAGTCTCAAATACTTGAATTGACGATTAATGTTCCTTCGTTTGATGGTTTACCAGGTGGCTGAAGCAGTGTCAAAATACTGCGTTTTATTGCATATTCCTGCTGGAAGCTCTGATGCTGCTCAGTTTTCATCGATATGTATCCTTATATTGAGTAGCATTTTTCAGTTGTTATCTGTATTGCATGTTACTGGGCATGGCTTGACTTACTGTTATGAAATCATAACCTGAGCAATAAAAATTGGTTTGTACATCCTTTTTTTCTTGTTAAAAAAAAAAAAATAGAATTGGTTAAAATTGATTTCACATTGGCCTTGACGTTGGGTCTCTGTTAACTACAATAATAAAAAGTTGATTTGGAATATTTTTTAGTTAATTTTCTAACATTTTTTTATGCTGACATGGCAATAGTTGACTAGTTTAATAATTTGTCTTTTTTCATAAATTATATAAATCAACAATTAAACCAATAGAGTAATCTTGTATTTTGTTGTGGAGACACCAAGACAGACTGAATCCTTGAATAATTGGTTCCAACGATTTTAGTTGTACTCATAATCAAATGAACAGTTATTGTCTACATTTCCACAACGGCCGTAGATGCTAGGGTGAAGAAAAACATGGGAGTTTCTGTACTACTGCTGTACTTACAGCTCCAAGAGCCAAATTCCAACAAAAAAAAAAACAAAAAAAAAACAGACATTTTTAGGAATTTTGAACCCCAAGTTACTTCATTCAATGTCTTGGGGAGGATGGGTTGCTAGTTCTTTATCTCCACAACGATGTTGAAGAAACATATTGTTGAAAAAATTGCCTCAAATGAACTAGTCATCTTGGGAGATATGATGGAAATCCGTGAAACAAATGTAATATGTAGCTACAACCACTCATCTAATTCTCTATTGTGTATGGATATTCTTAAGTGGAGGCATCGAATTTTCTTCTCTTCCTTCTACCTCTTACGTTAAACTTAAAATCCTTCTCCATGCATTGTTTGCATTTGATTATAAAGTTTTGAGGACATATTCTGCTCATACCTTCATTTTTCTTTATGAACAAACTGCTACTGTCAACAGCATACCTACTGAAACCTTGGGTATCTCCTACTACATTTATTCTCTTCAGATTTTCAAGACTGAGACCACCTTCCATTGTGTAGTTTGGACTTTTATATTTCGTTTTTTTTTTTTGAACTCTTAGATCTTTTCAAAATATATTTAATCATAAGGCGTCTATAAAGCTGGTATAATTGTATTTGATCTGTAGATCAAAGTTATCAGTGCCTACTACTGTCATGTCTAAAAGCAGTAACACTCAAGGAGTGGTGATAACTGTATTATTTATTTATTATTATTATTTTTAGTTAGTGCTTGATGAGAATTTGTACGATTTCTAAATAGGATTGTTATTCTTTTTCCTTGGGCTAGCTTCTCAGCTTGATATATCTTCTTTCCCCCAGTCTCTAAATGACATCATGAACTTGATCTCTAATACTTTTCAAGGTCGAAGTTTTTTCCTTGCATCAGTCTACTGCAGACCCGCAGAAATCTGTACCATGTATAACAGCTGTTGGATACAATGGTATACAACTTTGGCAAAATGGTAAGGATTGTTGATGAGAAGTTTCAATTGCCACAAGCTAAACATATTATATGTTTCTGTTCAAGTTTTGATTTCTGTCTTGAAATATTCTAGAGGGCTTTGTTGGTGCTGAGGTTTTGGCTTCCAGTTTAGAGAAGGCATGGTTGGGTCTTCATATCCAGGTATATTACTGTTTCCTATCAGTAGTTTTTTTAATAATTTTTTTTACCTACTTTGTGTTATAATGTGGAGCTCTACAGATTCTTGGTGTGTGATATACCATGTAAAAGTTGTCAGATTTTGTGTTTTAATGTGTGATGTTGGAAGGATTTGGAGTTGCAAGAGCTGAAAAAAGGAATGCATGTTTGGTGGTTCAATTGAAACTTCAATGTACAGCCAATCTTCTATTCCTTCTTTGTTGAAAGTCGCATTTAGATCATACTTTTCTTTTGATAATCCGGGAGCATTATTGGGATTCAAAATAGAGAAAGAGGCTTCATGGAGAAAAGTCAGAGTTGGTATATCAGAAATTTACAGTTGACGCATAAAAGTAATCCAAGATGGGCTGGAATGGTCCCTGGCTCTTGATTTAAAATACTGGAATTTTTGGGATGACGTGTTTAGTTATGAATTGAAAATTGAATCCAAGTCGGGAGGGTGAGTGGACTGACGGCTTCCTAATCTTTTCCATGTCCAAGTTTTTAGAAAACTTGCTGGTTTGGGGATACAATTATTTGAACCTATCCCTTAGGATGAATCTTTTTGATCGGAAGCTTAATGGGTGGGTTGGGCTCATCAACAATGTGGAAGGGATTGCTCTCTTAATCAGGATTGATGCCTTTGAAACATTGGCTATACTAACTTGCAAAGCTGCCCTGACCTTCTCCAACTCCTATCCACACTCAAGTTCGATAAACATACTACAAATGTGATTTGAAATGGAGAAACCACTTCTTATTAATTTATCTATCTTTAAAAAACTGAGCTTTTATTGGGAAAAATAAAAGAATATTCAAGAGCATACAAAAGAAAAACCTATCAATGAAGGAATCCAAAAAAACGACCTATAGAATGAGACTTCAATCCAACAAAATCAACTAAGTTCATAAATACTAAAATGTCTAGTCATCCACGAAAAGTCCCTCATTAGAAGACAAAGATAAAAGGGTCCTGAAAAAAGTTTGGGTCCTTTAACTTAGACAAGACTTGACACAATTGATAGAAGGCAGAACCAGAGTTGTGCTATTTGTTCTGTGTGAACTGTAAAAATCATGAAGGTGCTTGTAACATCTTTTTATCCCCTGTATTCTTGTACTCGGTTTAGATTTTTGTGGTTAAGCTGGTTTTGTGGGAGAACGCCACGAAAGACATTTTGTAGCTTAGTCGGATGGAAGAAATTAGAAGAATCGTGCACAAAATGCACTTTTCATCCCTGAAATTTGAGTTTGGCATCTATTTGGTCTGTAAGGTTTCAAAAGGGCACTTTAGTTTTTGAGGTTTGGAAAATACTATTAAGTGGTTCCTAACTTTGACAATTAGTTGACTAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAGATGTGACAAACTATAAATGAGGTGGCAATTTTTACTAATTTTATACTCTGATCTTTTGGATTACATGGCCTTTTCATGTTTTGCCCTCTTATTCCTTTTCTCTCTCTTCCCTTCTCATTTTTCATAGCTTTTCAAAAATGAAATATTATGAGGTATTATGATGTGATTGAGGTAAGTTAGGCCTCTCATTTTGTTCCTCCTATCTTTTGTGAAATATTTTTCATCTTGAACCATCCTTAGAAGTCTGAGGCACCAGAATATTTTAGTTATATCTGAAGATGAACTGCAGGAAACAACCGCATCTGTTTTGACGGCAGCCCTTGCTTCAAAGAAGTCTGAGGCATCTACTTCGAGGCCTTCTGATTCAGGGAGTTCCTCTTTGGCAGCTGTTTCTCATGCAGATCATCATATTGATTCCTCAGAGACTAATCTGGGTGTCAATAGTTGTGTAGCAGAGGAAGAGAAAGGGACTGAAAATTCCTCTAAAAAAAGGAAGGAGCCTGAAAAACGAGTCAAGGTATGGTATCAACATTGTTTACCCTAGGCTAAATGGTTATATCATTTATAATTGGTTCCATTTTATTTTGTAATTTTTAGGAGGAAGACATCAAATCAGATGTTAAGGAGTCCACTGTACATCATTCTGTGAGTGTTGGGAATAATAATGAATCCCCTGATCCCTCTGAGAATAATAAAGGCTCATTGGCTGATCCTGGAGGCCAGAAAAACTGTTCTTCTGAAAATACTTCCACAATTGTGCACGACTCTCCTATCATTCCAAACCATATTGAGTCATGTCAATCAGGAGCTTCAAGACCAATTCCTCCAGAAACCAAGGAAGTGGCCCAACAAGAAAAGAACAAAATTGTGGATGAGAATAATGCTATTGAAAATGGCAGTGCCCCCAAGAACTACACTTCAAGTGATATTCATTTAAACATTCGGCTGCTGAATGGTGTTAACGTCCAGGAGAAGTTCTCCAAGACAAGCACCTTGAGAATGGTCAAAGACTACGTGGATAACAGCCAGGAAAGTACCTTTGGATCTTATGATTTAGCTGTTCCATATCCACGCAAGGTTTTCACAGATCAAGGTATGATCTGAATATTTATTACTTGGTTAAATTCAATAGATCCTCATATATTATTGATTTTTTTTTCATTTGGTCCCTAATAATATTAAAATTTTAATTTATATTTCCAATAAATTACTTTTGTTTGTCTTTGAAGTTTGAAGTTTGGGTTAGGATTCTATTTAGTGTTAGAACTAAAGCCTAATTAAAATTTTAACCTGAGAGGTTTGTGAAAAGGTAAAATCATTAGTGATATTAAACAAACGTTGCTAGAAGGCTAGAACGTGTTAGAGGATGATATGAAAATATGTTACGTGGTTAATCTAACAAACCATGTGACAAGTTTTGTTTTTTGTTTTTGTTTTTTTTTTTTTATAATTTTATTGATGAATAAAATAAAAGAGTAAAAACTCCCACTGCCTAAAGGTTACATGAAAACTCAACAATTAGACATCAAAGAAGAAAGGTTATAATCAGTAAAAGGGTGCTTACACTTTCAAAAATACATAGCATGAAGAATTACTAGGTCTGAAAATTGATCAAAAGTAGATAAAGAGTTTCTGAAATTTCTTCCATTTCTTTGCTGCAAAGAAATCGGAGGAAAATTCTATGATAGCAAGCCACAAAATCCTCTTCGCTCTCTGGAAAGGGTGTCCCACCAAAATAGTAGCAAGATCATACTTGTTAGGAAAATTCAAAGACCAGTCAAAAGCTTCTAAAGTCACCATCCGATATCTTGAGGCTAAGGAGCAATTAGAAAAAAGTGTCCAAGATGCTCTGAGTTAGAAGCACACCTAATACACCATGGAGGAGAGAAATAGAAATGTAGCATCCTCCTTTGAAAGTTGATCTGCTGTACTTTTTATTTTTAACAAGGAATTACTTCTCATTGATATAATGAAAAAAAGTTAAAAGTTCAAGTGATACAAACTCCCAAAGAAAGTGAAAAATAAGAATTAAAAACAGCAAAGACATCTATGTAAAGAACAACCATGGACAAGAACTTTTCATAAACCTAACAATACAAACCCCTACAAAAGGTGACTCTATTATTGAATTGACCATTCACGTTTCGAGTTTCATGTAAATAATCTTTAAACGGCATAATACTCATAGTCCGGCAATTGTTTTTGTTGCCATTCTAGCACCTCCTTGAAATTAGAACATCCAACCTATATGGCACTTCGAATACATGTTGGTACCCCTGTTTCTTTGAAATGAGATACTAGAGGTATGGTGATCACATCTATGAGTTCTTATCATGAAACCCTTGCTGTGTGCCTTACTAAAATATCAACTGGACTGGCATCCACTGATTAGGACTAAGGATCAGCTGGTATTGATTGAAACTGATATTCAGATCTTTCAAACGATTCATCTATGATTCTATAACTATAGCTAATAGTCCGGCTCAAATTGGTGTTTTTCTGGTCTGTGGACTTCATTAAACTCTTTAACTTCCTTATTTGAAGTTTTGTTGGAATCTGTCATTGTAATACTTAATGACATGAAGATTCTTACTGGGAAGCTCGGAGTTGACCATTTTCTTAATTATTTGAATTGCTGCAGATTTGGAGAAATCATTGTCTTACTTGGGCCTTTCTAATAGACAATCATTGATAATGGTTCGACATCTGGGAGTTACTCGTGATTTCAGGGGAGCATCCTCCTCTGCTGATCAAAGAAATTCTGCAGCCAATGGTGGATCTTCAGATGAAAACGGTGATGGATACTTTGCATTCGTTAGAAGGATTTTATCTTATGTGAATCCATTCTCCTATCTTGGGGGCACCAGACATGAAACTCAAGGAGATGTGCGACAGTATAGTGAGTTCCTTTTCCTAAAATAGAAATTTCACATGAGTGTTAATGTTCTTTTAAGGTTAAAATCTTGGTTTAACCCTTGAACTTTCAGGATTATTTTGTTATGGTCTCTTAACTTTCAAAGGGTCTACTTTAGTTTCAGGGGTTTTGTGGTTGGCTTTGAGTATTTTTATGGTTGGGTTTGGGTGATTTTGTGGCTTAGTTTTTTACTCTTTTAAGTATGTGAAAATTCAGCCATTGTTGTAAGTATAGAGTTAGAAATGAACTTTTTTTCTAATATGTTTATATATATATGTATCAGGTAGTGAATTTTCAGAAGTAGAGAGGCGCCATGTCCGCCAACCAAACCAAGGAACTGCAACAACGAGCGGAAACAATACCCGTGGGAAGCAACCATTGTCAACAGCTAGATTTGGAGCAAATATTCATAGCATTCATACGTTGAAGAAGGATGAAGACGACGAACGGTTCAAAGGTAGGAATTCCTTTTGGAATGGTAATTCTACTGAGTATGGTGGTGATGATGGTGATAGCAAATGAGGCCCAATGTCTAGACAACACTACATGTGATGCTTATATTATATAAACAACCTGCTCTTAAACCGTTTATTAACAACACAGACACTGTAAATACCTAAAATTAAATAATTTTATACATTCGGATGTCATCTCTGCTCTCAAGGTTGTTTTTCAGTTTGACCATACACTCTTTTAAAAGGACGAGGGAAGCATAGTGTTCTTTGTGGTTCTGCATTTTGAATAAAGCTTCGTTTGGATTGAGGTGTGGATTAGTATTGTAACGGCCCAGATCTACCGCTGAAACCTTCTCCTAGCAGACGCGTTTTAAAGCCTT

mRNA sequence

AAGGATTAACCTATAATTTTACACCCGCGAAAAAGAGAAAAAATCTTCGTCGAATGTCCGTTGCTGTGCTCGTGACCGCGCGTCGGGGTTGTCAACAGAGAATCAGCTCATCCTTCTGAAATTTCTTAGTCCAATCTCCACTGAATTTGTTTCAGATGGAGCAGTCTATATCATCACTGGCATTCAAGGGATCCGTGCCTGAAGCAATTATCGAATCCAAAAATCAAAAAAAACTATTTGTGGTTTATATTTCCGGTGATGATGCTGAATCAAGCAGTTTGGAAAGTTCAACTTGGACAAGTTCAAGGGTGGCTGAAGCAGTGTCAAAATACTGCGTTTTATTGCATATTCCTGCTGGAAGCTCTGATGCTGCTCAGTTTTCATCGATATACCCGCAGAAATCTGTACCATGTATAACAGCTGTTGGATACAATGGTATACAACTTTGGCAAAATGAGGGCTTTGTTGGTGCTGAGGTTTTGGCTTCCAGTTTAGAGAAGGCATGGTTGGGTCTTCATATCCAGGAAACAACCGCATCTGTTTTGACGGCAGCCCTTGCTTCAAAGAAGTCTGAGGCATCTACTTCGAGGCCTTCTGATTCAGGGAGTTCCTCTTTGGCAGCTGTTTCTCATGCAGATCATCATATTGATTCCTCAGAGACTAATCTGGGTGTCAATAGTTGTGTAGCAGAGGAAGAGAAAGGGACTGAAAATTCCTCTAAAAAAAGGAAGGAGCCTGAAAAACGAGTCAAGGAGGAAGACATCAAATCAGATGTTAAGGAGTCCACTGTACATCATTCTGTGAGTGTTGGGAATAATAATGAATCCCCTGATCCCTCTGAGAATAATAAAGGCTCATTGGCTGATCCTGGAGGCCAGAAAAACTGTTCTTCTGAAAATACTTCCACAATTGTGCACGACTCTCCTATCATTCCAAACCATATTGAGTCATGTCAATCAGGAGCTTCAAGACCAATTCCTCCAGAAACCAAGGAAGTGGCCCAACAAGAAAAGAACAAAATTGTGGATGAGAATAATGCTATTGAAAATGGCAGTGCCCCCAAGAACTACACTTCAAGTGATATTCATTTAAACATTCGGCTGCTGAATGGTGTTAACGTCCAGGAGAAGTTCTCCAAGACAAGCACCTTGAGAATGGTCAAAGACTACGTGGATAACAGCCAGGAAAGTACCTTTGGATCTTATGATTTAGCTGTTCCATATCCACGCAAGGTTTTCACAGATCAAGATTTGGAGAAATCATTGTCTTACTTGGGCCTTTCTAATAGACAATCATTGATAATGGTTCGACATCTGGGAGTTACTCGTGATTTCAGGGGAGCATCCTCCTCTGCTGATCAAAGAAATTCTGCAGCCAATGGTGGATCTTCAGATGAAAACGGTAGTGAATTTTCAGAAGTAGAGAGGCGCCATGTCCGCCAACCAAACCAAGGAACTGCAACAACGAGCGGAAACAATACCCGTGGGAAGCAACCATTGTCAACAGCTAGATTTGGAGCAAATATTCATAGCATTCATACGTTGAAGAAGGATGAAGACGACGAACGGTTCAAAGGTAGGAATTCCTTTTGGAATGGTAATTCTACTGAGTATGGTGGTGATGATGGTGATAGCAAATGAGGCCCAATGTCTAGACAACACTACATGTGATGCTTATATTATATAAACAACCTGCTCTTAAACCGTTTATTAACAACACAGACACTGTAAATACCTAAAATTAAATAATTTTATACATTCGGATGTCATCTCTGCTCTCAAGGTTGTTTTTCAGTTTGACCATACACTCTTTTAAAAGGACGAGGGAAGCATAGTGTTCTTTGTGGTTCTGCATTTTGAATAAAGCTTCGTTTGGATTGAGGTGTGGATTAGTATTGTAACGGCCCAGATCTACCGCTGAAACCTTCTCCTAGCAGACGCGTTTTAAAGCCTT

Coding sequence (CDS)

ATGGAGCAGTCTATATCATCACTGGCATTCAAGGGATCCGTGCCTGAAGCAATTATCGAATCCAAAAATCAAAAAAAACTATTTGTGGTTTATATTTCCGGTGATGATGCTGAATCAAGCAGTTTGGAAAGTTCAACTTGGACAAGTTCAAGGGTGGCTGAAGCAGTGTCAAAATACTGCGTTTTATTGCATATTCCTGCTGGAAGCTCTGATGCTGCTCAGTTTTCATCGATATACCCGCAGAAATCTGTACCATGTATAACAGCTGTTGGATACAATGGTATACAACTTTGGCAAAATGAGGGCTTTGTTGGTGCTGAGGTTTTGGCTTCCAGTTTAGAGAAGGCATGGTTGGGTCTTCATATCCAGGAAACAACCGCATCTGTTTTGACGGCAGCCCTTGCTTCAAAGAAGTCTGAGGCATCTACTTCGAGGCCTTCTGATTCAGGGAGTTCCTCTTTGGCAGCTGTTTCTCATGCAGATCATCATATTGATTCCTCAGAGACTAATCTGGGTGTCAATAGTTGTGTAGCAGAGGAAGAGAAAGGGACTGAAAATTCCTCTAAAAAAAGGAAGGAGCCTGAAAAACGAGTCAAGGAGGAAGACATCAAATCAGATGTTAAGGAGTCCACTGTACATCATTCTGTGAGTGTTGGGAATAATAATGAATCCCCTGATCCCTCTGAGAATAATAAAGGCTCATTGGCTGATCCTGGAGGCCAGAAAAACTGTTCTTCTGAAAATACTTCCACAATTGTGCACGACTCTCCTATCATTCCAAACCATATTGAGTCATGTCAATCAGGAGCTTCAAGACCAATTCCTCCAGAAACCAAGGAAGTGGCCCAACAAGAAAAGAACAAAATTGTGGATGAGAATAATGCTATTGAAAATGGCAGTGCCCCCAAGAACTACACTTCAAGTGATATTCATTTAAACATTCGGCTGCTGAATGGTGTTAACGTCCAGGAGAAGTTCTCCAAGACAAGCACCTTGAGAATGGTCAAAGACTACGTGGATAACAGCCAGGAAAGTACCTTTGGATCTTATGATTTAGCTGTTCCATATCCACGCAAGGTTTTCACAGATCAAGATTTGGAGAAATCATTGTCTTACTTGGGCCTTTCTAATAGACAATCATTGATAATGGTTCGACATCTGGGAGTTACTCGTGATTTCAGGGGAGCATCCTCCTCTGCTGATCAAAGAAATTCTGCAGCCAATGGTGGATCTTCAGATGAAAACGGTAGTGAATTTTCAGAAGTAGAGAGGCGCCATGTCCGCCAACCAAACCAAGGAACTGCAACAACGAGCGGAAACAATACCCGTGGGAAGCAACCATTGTCAACAGCTAGATTTGGAGCAAATATTCATAGCATTCATACGTTGAAGAAGGATGAAGACGACGAACGGTTCAAAGGTAGGAATTCCTTTTGGAATGGTAATTCTACTGAGTATGGTGGTGATGATGGTGATAGCAAATGA

Protein sequence

MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSETNLGVNSCVAEEEKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSVGNNNESPDPSENNKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQQEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANGGSSDENGSEFSEVERRHVRQPNQGTATTSGNNTRGKQPLSTARFGANIHSIHTLKKDEDDERFKGRNSFWNGNSTEYGGDDGDSK
BLAST of Cp4.1LG17g10040 vs. Swiss-Prot
Match: PUX11_ARATH (Plant UBX domain-containing protein 11 OS=Arabidopsis thaliana GN=PUX11 PE=1 SV=2)

HSP 1 Score: 277.3 bits (708), Expect = 3.2e-73
Identity = 220/535 (41.12%), Postives = 282/535 (52.71%), Query Frame = 1

Query: 3   QSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVL 62
           +++SSL FKGS+PEAI E+K +KKLFVVYISG+D ES  L   TWT + VA+++SKYC+L
Sbjct: 2   EALSSLTFKGSLPEAIFEAKGKKKLFVVYISGEDEESDKLNRLTWTDASVADSLSKYCIL 61

Query: 63  LHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHI 122
           +HI AGS DA  FS+IYP  SVPCI A+G++G Q+W+ EGF+ AE LASSLEKAWLGLHI
Sbjct: 62  VHIQAGSVDATNFSAIYPYSSVPCIAAIGFSGTQVWRTEGFITAEDLASSLEKAWLGLHI 121

Query: 123 QETTASVLTAA-----LASKKSEAST------SRPSDSGSSSLAAVSHADHHIDSSETNL 182
           QETTAS+ +AA       +  S AS+      S P D+  +S +  S     +  SET  
Sbjct: 122 QETTASIFSAALASQNSETPVSSASSVVLPPGSVPLDAAVASPSTASS----VQPSETKS 181

Query: 183 GVNSC-VAEEEKGTEN-SSKKRKEPEK---RVKEEDIKS-DVKESTVHHSVSVGNNNESP 242
            V S    E   GT     K+  EP       K +   S D  ++ V H  +     E+P
Sbjct: 182 TVTSASTTENNDGTVAVKGKESAEPSNLCDTTKNQPAPSVDGTKANVEHEAT-----ETP 241

Query: 243 DPSENNKGSLAD--PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQ 302
              +  K  +    PG   N S   +S           + E    G S      TK V  
Sbjct: 242 LRVQAEKEPIRPTAPGTNDNTSRVRSSVDRKRKQGTVINEEDSGVGVSGRDINLTKSVDT 301

Query: 303 QEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQ 362
           +E  K  DE    E+G   K   +SD+HLNIRL +G ++QEKFS TS LRMVKDYV+++Q
Sbjct: 302 KETMKPKDEGGEEEDGEKSKK--ASDVHLNIRLPDGSSLQEKFSVTSILRMVKDYVNSNQ 361

Query: 363 ESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQR 422
               G+YDLAVPYPRKV+TDQDL+KSLS L L +RQ+L++V     T   RG S S    
Sbjct: 362 TIGLGAYDLAVPYPRKVYTDQDLDKSLSELRLFDRQALVVVPRKRATVYQRGTSYSESNN 421

Query: 423 NSAANGGS------------------SDENGSEFSEVERRHVRQPNQGTATTSGNNTRGK 482
           N+  N G                        +  S V  R  R PN       G      
Sbjct: 422 NTDPNSGGYFAYVRRVLSYANPFSYFGGGTANASSSVPERQTR-PNTEVRNNLGQVGTSF 481

Query: 483 QPLSTARFGANIH---------SIHTLKKDEDDERFKGRNSFWNGNSTEYGGDDG 492
           Q  S  R               +IHTL  +ED+  F   N+FWNGNST+YGG  G
Sbjct: 482 QDPSEGRSNVRNRRPTTSRIGSNIHTLNHNEDEAPFGDGNAFWNGNSTQYGGGSG 524

BLAST of Cp4.1LG17g10040 vs. Swiss-Prot
Match: UBXN4_HUMAN (UBX domain-containing protein 4 OS=Homo sapiens GN=UBXN4 PE=1 SV=2)

HSP 1 Score: 73.9 bits (180), Expect = 5.4e-12
Identity = 92/413 (22.28%), Postives = 176/413 (42.62%), Query Frame = 1

Query: 8   LAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVLLHIPA 67
           L F+G++P AI  +K    +FVV+++GDD +S+ + +S W   +V EA S   V + I  
Sbjct: 2   LWFQGAIPAAIATAKRSGAVFVVFVAGDDEQSTQMAAS-WEDDKVTEASSNSFVAIKIDT 61

Query: 68  GSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHIQETTA 127
            S    QFS IYP   VP    +G +GI L    G V A+ L + + K    +H+ ++  
Sbjct: 62  KSEACLQFSQIYPVVCVPSSFFIGDSGIPLEVIAGSVSADELVTRIHKV-RQMHLLKSET 121

Query: 128 SVLTAALASKKSEASTSRPS--------------------------DSGSSSLAAVSHAD 187
           SV   +    +SE+S S PS                          D+ S +      A 
Sbjct: 122 SVANGS----QSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAG 181

Query: 188 HHIDSSETNLGVNSCVAEE-----EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSV 247
           H   S E +   +   AE+     E+ T+   ++R+E  K  ++ +IK +++       +
Sbjct: 182 HATSSQEPSGCSDQRPAEDLNIRVERLTKKLEERREEKRKEEEQREIKKEIERRKTGKEM 241

Query: 248 SVGNNNESPDPS-----ENNKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGAS 307
                 +  + +     E N+    D   ++    +        +       E  ++  +
Sbjct: 242 LDYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKTKEEVEAAKA 301

Query: 308 RPIPPETKEVAQQEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTST 367
             +      +A+Q + ++  E+ A E  +  +        +  RL +G +   +F   + 
Sbjct: 302 AAL------LAKQAEMEVKRESYARERSTVAR--------IQFRLPDGSSFTNQFPSDAP 361

Query: 368 LRMVKDYVDNSQESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMV 385
           L   + +   +  +T+G++ LA  +PR+ FT +D +K L  L L+   S++++
Sbjct: 362 LEEARQFAAQTVGNTYGNFSLATMFPRREFTKEDYKKKLLDLELAPSASVVLL 394

BLAST of Cp4.1LG17g10040 vs. Swiss-Prot
Match: UBXN4_PONAB (UBX domain-containing protein 4 OS=Pongo abelii GN=UBXN4 PE=2 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 7.0e-12
Identity = 92/413 (22.28%), Postives = 176/413 (42.62%), Query Frame = 1

Query: 8   LAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVLLHIPA 67
           L F+G++P AI  +K    +FVV+++GDD +S+ + +S W   +V EA S   V + I  
Sbjct: 2   LWFQGAIPAAIATAKRSGAVFVVFVAGDDEQSTQMAAS-WEDDKVTEASSNSFVAIKIDT 61

Query: 68  GSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHIQETTA 127
            S    QFS IYP   VP    +G +GI L    G V A+ L + + K    +H+ ++  
Sbjct: 62  KSEACLQFSQIYPVVCVPSSFFIGDSGIPLEVIAGSVSADELVTRIHKV-RQMHLLKSET 121

Query: 128 SVLTAALASKKSEASTSRPS--------------------------DSGSSSLAAVSHAD 187
           SV   +    +SE+S S PS                          D+ S +      A 
Sbjct: 122 SVANGS----QSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAG 181

Query: 188 HHIDSSETNLGVNSCVAEE-----EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSV 247
           H   S E +   +   AE+     E+ T+   ++R+E  K  ++ +IK +++       +
Sbjct: 182 HATSSQEPSGCSDQRPAEDLNIRVERLTKKLEERREEKRKEEEQREIKKEIERRKTGKEM 241

Query: 248 SVGNNNESPDPS-----ENNKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGAS 307
                 +  + +     E N+    D   ++    +        +       E  ++  +
Sbjct: 242 LDYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKTKEEVEAAKA 301

Query: 308 RPIPPETKEVAQQEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTST 367
             +      +A+Q + ++  E+ A E  +  +        +  RL +G +   +F   + 
Sbjct: 302 AAL------LAKQAEMEVKRESYARERSTVAR--------IQFRLPDGSSFTNQFPSDAP 361

Query: 368 LRMVKDYVDNSQESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMV 385
           L   + +   +  +T+G++ LA  +PR+ FT +D +K L  L L+   S++++
Sbjct: 362 LEEARQFAAQTVGNTYGNFSLATMFPRREFTKEDYKKKLLDLELAPSASVVVL 394

BLAST of Cp4.1LG17g10040 vs. Swiss-Prot
Match: UBXN4_RAT (UBX domain-containing protein 4 OS=Rattus norvegicus GN=Ubxn4 PE=1 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.9e-09
Identity = 97/407 (23.83%), Postives = 170/407 (41.77%), Query Frame = 1

Query: 8   LAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVLLHIPA 67
           L F+G++P AI  +K    +FVV+++GDD +S+ + +S W   +V EA S   V + I  
Sbjct: 2   LWFQGAIPAAIASAKRSGAVFVVFVAGDDEQSTQMAAS-WEDEKVREASSDNFVAIKIDT 61

Query: 68  GSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHIQETTA 127
            S    QFS IYP   VP    +G +GI L    G V A+ L + + K    +H  +   
Sbjct: 62  KSEACLQFSQIYPVVCVPSSFFIGDSGIPLEVIAGSVSADELVTRIHKVQQ-MHSLKGET 121

Query: 128 SVLTAALASKKSEASTSRPS-------------------------DSGSSSLAAVSHADH 187
           SV       K+SE+S S PS                         D  S + A    A H
Sbjct: 122 SVTN----DKQSESSVSTPSASFEPDICESAESRNTELCETPTTSDPKSDTAAGGECAGH 181

Query: 188 HIDSSETNLGVNSCVAEE-----EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVS 247
              S E     N   AE+     E+ T+   ++R+E  K   + +IK +++       + 
Sbjct: 182 DSLSQEPPGCSNQRPAEDLTVRVERLTKKLEERREEKRKEEAQREIKKEIERRKTGKEML 241

Query: 248 VGNNNESPDPSENNKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPE 307
              + +     E  K  L +   +K  + +  +       I  +  E     A       
Sbjct: 242 ---DYKRKQEEELTKRMLEERSREK--AEDRAARERIKQQIALDRAERAARFAKTKEAEA 301

Query: 308 TKEVAQQEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKD 367
            K  A   K    +    ++  S+ ++  S+   +  RL +G +   +F   + L   + 
Sbjct: 302 AKAAALLAKQAEAE----VKRESSTRD-RSTIARIQFRLPDGSSFTNQFPSDAPLEEARQ 361

Query: 368 YVDNSQESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMV 385
           +   +  +T+G++ LA  +PR+ FT +D ++ L  L L+   S++++
Sbjct: 362 FAAQTVGNTYGNFSLATMFPRREFTREDYKRKLLDLELAPSASVVLL 392

BLAST of Cp4.1LG17g10040 vs. Swiss-Prot
Match: UBXN4_MOUSE (UBX domain-containing protein 4 OS=Mus musculus GN=Ubxn4 PE=1 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 1.6e-08
Identity = 94/407 (23.10%), Postives = 169/407 (41.52%), Query Frame = 1

Query: 8   LAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVLLHIPA 67
           L F+G++P AI  +K    +FVV+++GDD +S  + +S W   +V +A S   V + I  
Sbjct: 2   LWFQGAIPAAIASAKRSGAVFVVFVAGDDEQSIQMAAS-WEDEKVTQASSNNFVAIKIDT 61

Query: 68  GSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHIQETTA 127
            S    QFS IYP   VP    +G +GI L    G V A+ L + + K    +H  +  A
Sbjct: 62  KSEACLQFSQIYPVVCVPSSFFIGDSGIPLEVIAGSVSADELVTRIHKVQQ-MHSSKGEA 121

Query: 128 SVLTAALASKKSEASTSRPS-------------------------DSGSSSLAAVSHADH 187
           SV        +SE+S S PS                         D  S +        H
Sbjct: 122 SVTN----DNQSESSVSTPSASFEPDVCENPESKNTELCETPATSDIKSDTATGGECTGH 181

Query: 188 HIDSSETNLGVNSCVAEE-----EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVS 247
              S E +   N   AE+     E+ T+   ++R+E  K   + +IK +++       + 
Sbjct: 182 DSHSQEPHGCSNQRPAEDLTVRVERLTKKLEERREEKRKEEAQREIKKEIERRKTGKEML 241

Query: 248 VGNNNESPDPSENNKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPE 307
              + +     E  K  L +   +K  + +  +       I  +  E     A+R    +
Sbjct: 242 ---DYKRKQEEELTKRMLEERSREK--AEDRAARERIKQQIALDRAER----AARFAKTK 301

Query: 308 TKEVAQQEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKD 367
             E A+            ++  S  ++  S+   +  RL +G +   +F   + L   + 
Sbjct: 302 EAEAAKAAALLTKQAGTEVKRESTARD-RSTIARIQFRLPDGSSFTNQFPSDAPLEEARQ 361

Query: 368 YVDNSQESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMV 385
           +   +  +T+G++ LA  +PR+ FT +D ++ L  L L+   S++++
Sbjct: 362 FAAQTVGNTYGNFSLATMFPRREFTREDYKRRLLDLELAPSASVVLL 392

BLAST of Cp4.1LG17g10040 vs. TrEMBL
Match: A0A0A0KC97_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G041180 PE=4 SV=1)

HSP 1 Score: 592.0 bits (1525), Expect = 6.5e-166
Identity = 352/538 (65.43%), Postives = 396/538 (73.61%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           MEQSISSLAFKGS+ EAI+ESKNQ+KLF+VYISGDDAESS LESSTWTSS+VAE+VSKYC
Sbjct: 1   MEQSISSLAFKGSIAEAIVESKNQRKLFLVYISGDDAESSRLESSTWTSSKVAESVSKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           VLLHIPAGS DAAQFSSIYPQKSVPCITAVGYNGIQLW NEGF+GAEVLAS+LEKAWLGL
Sbjct: 61  VLLHIPAGSMDAAQFSSIYPQKSVPCITAVGYNGIQLWLNEGFIGAEVLASNLEKAWLGL 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSETNLGVNSCVAEE 180
           HIQETTASVLTAALASKKSEASTSRPSD  SSSLA+VS +DHHI S ETNLGVNS + EE
Sbjct: 121 HIQETTASVLTAALASKKSEASTSRPSDLRSSSLASVSPSDHHIGSLETNLGVNSGIVEE 180

Query: 181 EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSV---GNNNESPDPSENNKGSLAD 240
           EKG          PEK VK+ED K+D+KES VHHS+SV    N+  SP+PS  +K SLA 
Sbjct: 181 EKG----------PEKLVKQEDSKADIKESNVHHSLSVEIQNNDESSPEPSGKDKSSLAH 240

Query: 241 PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQQEKNKIVDENNAIE 300
           P  Q++CS ENTS IV+DS   PN IES QSGA +PI  E KE  ++ K +IVD+NNAIE
Sbjct: 241 PQDQQSCSPENTSKIVNDSYTTPNLIESSQSGAPQPISLEAKEDVRENK-EIVDDNNAIE 300

Query: 301 NGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGSYDLAVPYP 360
           N SA K+Y S+D+HLNIRLLNG+N+QEKFSKTSTLRM+KDYVDNSQ STFG YDLA+PYP
Sbjct: 301 NDSARKDYASNDVHLNIRLLNGINLQEKFSKTSTLRMIKDYVDNSQPSTFGPYDLAIPYP 360

Query: 361 RKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANGGSSDENGS 420
           RKVFTDQDL KSLS LGL NRQ+LIMVRH GV  D RGASSS+D+R  +ANG SSDEN  
Sbjct: 361 RKVFTDQDLGKSLSDLGLHNRQALIMVRHQGVRSDLRGASSSSDERKFSANGVSSDENSD 420

Query: 421 EFSEVERRHV------------------RQPNQGTATTSGNNTR-------GKQPLSTAR 480
            +    +R +                  R   QG A    NN+         K    TA 
Sbjct: 421 GYFAFVKRILSYVNPFSYLGVGASTASSRHETQGDARQYSNNSLEAEDHYVRKPNQGTAM 480

Query: 481 FGANIHSIHTL----------------KKDEDDERFKGRNSFWNGNSTEYGGDDGDSK 495
            G N                       K D+D+ERFK RNSFWNGNSTEYGGD+ DSK
Sbjct: 481 VGGNNTRGKQPSSSSRFGANIHSIHTLKHDDDEERFKSRNSFWNGNSTEYGGDN-DSK 526

BLAST of Cp4.1LG17g10040 vs. TrEMBL
Match: B9HHD9_POPTR (UBX domain-containing family protein OS=Populus trichocarpa GN=POPTR_0007s02410g PE=4 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 2.4e-96
Identity = 245/530 (46.23%), Postives = 314/530 (59.25%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           M+ SISS  +KGS+ EAI+ESK QKKLFVVYISG++  S+ LE STWT S+VAE++SKYC
Sbjct: 1   MKGSISSFTYKGSIAEAILESKKQKKLFVVYISGENVASAELEKSTWTDSKVAESLSKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           +LLHIP GS+DA  FS+IY QKS PCITA+GYNG+QLWQ+EGFV AEVLAS LEK WL L
Sbjct: 61  ILLHIPEGSTDALNFSAIYQQKSAPCITAIGYNGVQLWQSEGFVTAEVLASGLEKVWLTL 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSS-------SLAAVSHADHHIDSSETNL-G 180
           HIQETTA+VLT ALASKK E   S  SD GSS       ++  V   D HI  SE     
Sbjct: 121 HIQETTATVLTTALASKKPEP-LSGSSDIGSSGQGSSSGTVVPVPLKDRHIQPSEVGTQA 180

Query: 181 VNSCVAEEEKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSVGNNNESPDPSENNK 240
             S V EE K  E ++      EK +     K+  K   V  S +VG+   +    E+ K
Sbjct: 181 AASEVIEENKSHEPTA------EKTITNLGDKTSSKSFNVQKSQTVGDERSTCPTEEDKK 240

Query: 241 GSLADPGGQKNCSSENTSTIVHDSPI-----IPNHIESCQSGASRPIPPETKEVAQQEKN 300
              +      N  +++TS+   D  +     + NH     +G S     E KEV  ++  
Sbjct: 241 SPSSSVTSTDNIIADHTSSAAEDGLLAQEKSVSNHT-GVPTGGSELSTTEIKEVGDKKAE 300

Query: 301 KIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTF 360
            + D      N +   N  SSD+HLNIRL +GV++QEKFS TSTLR VKDYVD +Q S  
Sbjct: 301 SMDDMVPGTLNNNKKVN-VSSDVHLNIRLPDGVSLQEKFSVTSTLRTVKDYVDRNQASGI 360

Query: 361 GSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAA 420
           G+YDLA+PYPRK F+DQDL KSLS L L NRQ+LI+V     T   +  S S     + +
Sbjct: 361 GAYDLAIPYPRKTFSDQDLNKSLSELSLLNRQALIVVPRQRATSYHQRGSLSDRATTTTS 420

Query: 421 NGGSSDENGSEFSEVER---------------------------RHVRQPNQGTATTSGN 480
           +G  +  +G  F+ V+R                                 +Q +++T  N
Sbjct: 421 SGSVNANDGGYFAYVKRILSYVNPLSYFGGSANPSSSGQAQSAIGEYEWTDQNSSSTGRN 480

Query: 481 NTRGKQPLSTARFGANIHSIHTLKKDEDDERFKGRNSFWNGNSTEYGGDD 491
           +++GKQP +T+R G+NIH   TLK DEDD RF  RNSFWNGNST+YGGD+
Sbjct: 481 DSKGKQP-TTSRVGSNIH---TLKHDEDDGRFSERNSFWNGNSTQYGGDN 517

BLAST of Cp4.1LG17g10040 vs. TrEMBL
Match: A0A061EJB9_THECC (Ubiquitin-like superfamily protein, putative isoform 3 OS=Theobroma cacao GN=TCM_020101 PE=4 SV=1)

HSP 1 Score: 345.9 bits (886), Expect = 8.1e-92
Identity = 246/542 (45.39%), Postives = 316/542 (58.30%), Query Frame = 1

Query: 1   MEQS--ISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSK 60
           ME+S  +SSL +KGS+PEAI+E+KNQKKLFVVYISGDDAES +LE STWT  +V E++SK
Sbjct: 1   MERSECLSSLTYKGSIPEAILEAKNQKKLFVVYISGDDAESKNLEDSTWTDLKVKESLSK 60

Query: 61  YCVLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWL 120
           YC+LLHI  GS+DAA FS+IYPQKSVPCITA+GYNG+Q WQ+EG V AEVLASSLEKAWL
Sbjct: 61  YCILLHIQGGSADAANFSAIYPQKSVPCITAIGYNGVQAWQSEGSVSAEVLASSLEKAWL 120

Query: 121 GLHIQETTASVLTAALASKKSEASTS-----RPSDSGSSSLAAVSHADHHIDSSETNLGV 180
            LHIQETT +VLTAALASKK E STS     R S+ GSSS  +V  +  +  S  +   V
Sbjct: 121 SLHIQETTVTVLTAALASKKYETSTSGASTVRQSEHGSSSSNSVPSSTMNERSLGSKSAV 180

Query: 181 NSCVAEEEKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSVGNNNESPDPSENNKG 240
           +S V EE   +EN+ K     EK  +  D  S    ST + +  V    ++ + +     
Sbjct: 181 SSGVIEENFVSENTVK-----EKNAESVDKGSSESFSTDNLANVVDEQGDASNEATRTMA 240

Query: 241 SLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPI-PPETKEVAQQEKNKIVDE 300
           S    G   +  SENTS+   D  +IP    + Q+  S P+   E +E  Q EK+K +++
Sbjct: 241 SSITVGPAVSL-SENTSSPPEDGCLIPVKGINNQASVSSPVSAAEAEEAVQHEKDKGIND 300

Query: 301 NNAIENGSAPKNYTS---SDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGS 360
               E+G + K  T+   +D+HLNIRL +G +++EKF    TLRMVKDYVD +Q S  GS
Sbjct: 301 K---EDGGSDKPSTANIPTDVHLNIRLPDGSSLREKFPVADTLRMVKDYVDRNQSSGMGS 360

Query: 361 YDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANG 420
           YDLA+PYPRK+F DQDL KSL  LGL NRQ+L++V  L  T  F+G  +SADQ NS    
Sbjct: 361 YDLAIPYPRKLFGDQDLSKSLLDLGLLNRQALVVV-PLQRTSGFQGQRTSADQINSTPTE 420

Query: 421 GSSDENGSEFSEVER--RHVR-------------------------QPNQGTATTSGNNT 480
            S+  NG  F+ ++    +V                           PN           
Sbjct: 421 ASTGSNGGYFAYIKSILSYVNPFSYLGGGASSSTTEQESQSGIWEYSPNPTMQNNLAGTI 480

Query: 481 RGKQPLSTARFGANIHSIHTL---------------KKDEDDERFKGRNSFWNGNSTEYG 490
           R   P S     + +    +                K DEDD RF  RN FWNGNST+YG
Sbjct: 481 RSYSPYSPNGSTSTVRDGSSNRRPTTSRYGSNIHTLKHDEDDGRFNDRNPFWNGNSTQYG 532

BLAST of Cp4.1LG17g10040 vs. TrEMBL
Match: M5XDI9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003931mg PE=4 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 3.1e-91
Identity = 236/539 (43.78%), Postives = 319/539 (59.18%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           ME S++SL +KGS+PEAI E+K Q+KLFVVYISG + ESS LE+STWT   VA++V+KYC
Sbjct: 1   MEPSLTSLTYKGSIPEAITEAKKQRKLFVVYISGKNDESSRLENSTWTDVNVADSVAKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           +LLHIP  S+DAA FS+IYPQKS+PCITA+GYNG+Q+WQNEGFV AEVLASSLEKAWLG+
Sbjct: 61  LLLHIPEESTDAANFSAIYPQKSIPCITAIGYNGVQIWQNEGFVSAEVLASSLEKAWLGI 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSET----NLGVNSC 180
           HIQETTA+VL+AALAS  SE STS   ++ S+   + S  +    SS      ++  N  
Sbjct: 121 HIQETTATVLSAALASNNSEPSTSGVPNTVSTDEGSSSSTNQGRSSSAAVPLPSIDTNDQ 180

Query: 181 VAEEEKGTENSSKKRKEPEKRVKEEDIKSDVKESTVH---HSVSVGNNNESPDPSENNKG 240
             +      ++ KK K  + RV+E++     K S+     + +    + +S  P ++ +G
Sbjct: 181 SPDAIDAVSDTVKKNKGRDCRVEEKNNDLGYKTSSKSFDANGLECVGDEQSSSPMKSAQG 240

Query: 241 -SLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSG----ASRPIPPETKEVAQQEKNK 300
               D     N  ++N S+I       P       SG    AS+    E  E  Q E+ +
Sbjct: 241 VQDIDMEDLNNSGADNVSSIAEVGYSGPEKTTLNHSGVSGEASQAYSSEKNEALQVERGE 300

Query: 301 IVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFG 360
           + D+          +   SSD+HLNIRL NGV++++KFS TST+RMVKDYVD +Q S  G
Sbjct: 301 VKDDKKVDAFEKCTEVSKSSDVHLNIRLPNGVSLKQKFSVTSTVRMVKDYVDENQGSGIG 360

Query: 361 SYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAAN 420
           +YDLA+PYPRKVF++Q+L +SLS LGL +RQ+LI+V H   T   RG S+ ++Q +S   
Sbjct: 361 TYDLAIPYPRKVFSNQELNESLSDLGLFDRQALIVVPHQQGTSYQRGRSAFSEQIDSRNT 420

Query: 421 GGSSD-ENGSEFSEVE---------RRHVRQPNQGTATTSGNNTRGKQPLSTARFGANIH 480
           G SS+  NG  FS V+                N  ++     N   +   ++AR   +  
Sbjct: 421 GSSSNGSNGGYFSYVKGFLSYFNPLSYFGGGANSSSSGQQSQNGMWEYSPNSARQNNSTE 480

Query: 481 S-----------------------IHTLKKDEDDERFKGRNSFWNGNSTEYGGDDGDSK 495
           S                       IHTLK DEDDERF  RN+FWNGNST+YGG + D K
Sbjct: 481 SISTSSRNEGKKNRQPPASGFGSNIHTLKHDEDDERFSDRNAFWNGNSTQYGGGNDDGK 539

BLAST of Cp4.1LG17g10040 vs. TrEMBL
Match: A0A061ERL5_THECC (Ubiquitin-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_020101 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 3.8e-89
Identity = 245/545 (44.95%), Postives = 315/545 (57.80%), Query Frame = 1

Query: 1   MEQS--ISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSK 60
           ME+S  +SSL +KGS+PEAI+E+KNQKKLFVVYISGDDAES +LE STWT  +V E++SK
Sbjct: 13  MERSECLSSLTYKGSIPEAILEAKNQKKLFVVYISGDDAESKNLEDSTWTDLKVKESLSK 72

Query: 61  YCVLLHIPAGSSDAAQFSSI---YPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEK 120
           YC+LLHI  GS+DAA FS+I    PQKSVPCITA+GYNG+Q WQ+EG V AEVLASSLEK
Sbjct: 73  YCILLHIQGGSADAANFSAICILNPQKSVPCITAIGYNGVQAWQSEGSVSAEVLASSLEK 132

Query: 121 AWLGLHIQETTASVLTAALASKKSEASTS-----RPSDSGSSSLAAVSHADHHIDSSETN 180
           AWL LHIQETT +VLTAALASKK E STS     R S+ GSSS  +V  +  +  S  + 
Sbjct: 133 AWLSLHIQETTVTVLTAALASKKYETSTSGASTVRQSEHGSSSSNSVPSSTMNERSLGSK 192

Query: 181 LGVNSCVAEEEKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSVGNNNESPDPSEN 240
             V+S V EE   +EN+ K     EK  +  D  S    ST + +  V    ++ + +  
Sbjct: 193 SAVSSGVIEENFVSENTVK-----EKNAESVDKGSSESFSTDNLANVVDEQGDASNEATR 252

Query: 241 NKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPI-PPETKEVAQQEKNKI 300
              S    G   +  SENTS+   D  +IP    + Q+  S P+   E +E  Q EK+K 
Sbjct: 253 TMASSITVGPAVSL-SENTSSPPEDGCLIPVKGINNQASVSSPVSAAEAEEAVQHEKDKG 312

Query: 301 VDENNAIENGSAPKNYTS---SDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQEST 360
           +++    E+G + K  T+   +D+HLNIRL +G +++EKF    TLRMVKDYVD +Q S 
Sbjct: 313 INDK---EDGGSDKPSTANIPTDVHLNIRLPDGSSLREKFPVADTLRMVKDYVDRNQSSG 372

Query: 361 FGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSA 420
            GSYDLA+PYPRK+F DQDL KSL  LGL NRQ+L++V  L  T  F+G  +SADQ NS 
Sbjct: 373 MGSYDLAIPYPRKLFGDQDLSKSLLDLGLLNRQALVVV-PLQRTSGFQGQRTSADQINST 432

Query: 421 ANGGSSDENGSEFSEVER--RHVR-------------------------QPNQGTATTSG 480
               S+  NG  F+ ++    +V                           PN        
Sbjct: 433 PTEASTGSNGGYFAYIKSILSYVNPFSYLGGGASSSTTEQESQSGIWEYSPNPTMQNNLA 492

Query: 481 NNTRGKQPLSTARFGANIHSIHTL---------------KKDEDDERFKGRNSFWNGNST 490
              R   P S     + +    +                K DEDD RF  RN FWNGNST
Sbjct: 493 GTIRSYSPYSPNGSTSTVRDGSSNRRPTTSRYGSNIHTLKHDEDDGRFNDRNPFWNGNST 547

BLAST of Cp4.1LG17g10040 vs. TAIR10
Match: AT2G43210.1 (AT2G43210.1 Ubiquitin-like superfamily protein)

HSP 1 Score: 277.3 bits (708), Expect = 1.8e-74
Identity = 220/535 (41.12%), Postives = 282/535 (52.71%), Query Frame = 1

Query: 3   QSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYCVL 62
           +++SSL FKGS+PEAI E+K +KKLFVVYISG+D ES  L   TWT + VA+++SKYC+L
Sbjct: 2   EALSSLTFKGSLPEAIFEAKGKKKLFVVYISGEDEESDKLNRLTWTDASVADSLSKYCIL 61

Query: 63  LHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGLHI 122
           +HI AGS DA  FS+IYP  SVPCI A+G++G Q+W+ EGF+ AE LASSLEKAWLGLHI
Sbjct: 62  VHIQAGSVDATNFSAIYPYSSVPCIAAIGFSGTQVWRTEGFITAEDLASSLEKAWLGLHI 121

Query: 123 QETTASVLTAA-----LASKKSEAST------SRPSDSGSSSLAAVSHADHHIDSSETNL 182
           QETTAS+ +AA       +  S AS+      S P D+  +S +  S     +  SET  
Sbjct: 122 QETTASIFSAALASQNSETPVSSASSVVLPPGSVPLDAAVASPSTASS----VQPSETKS 181

Query: 183 GVNSC-VAEEEKGTEN-SSKKRKEPEK---RVKEEDIKS-DVKESTVHHSVSVGNNNESP 242
            V S    E   GT     K+  EP       K +   S D  ++ V H  +     E+P
Sbjct: 182 TVTSASTTENNDGTVAVKGKESAEPSNLCDTTKNQPAPSVDGTKANVEHEAT-----ETP 241

Query: 243 DPSENNKGSLAD--PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQ 302
              +  K  +    PG   N S   +S           + E    G S      TK V  
Sbjct: 242 LRVQAEKEPIRPTAPGTNDNTSRVRSSVDRKRKQGTVINEEDSGVGVSGRDINLTKSVDT 301

Query: 303 QEKNKIVDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQ 362
           +E  K  DE    E+G   K   +SD+HLNIRL +G ++QEKFS TS LRMVKDYV+++Q
Sbjct: 302 KETMKPKDEGGEEEDGEKSKK--ASDVHLNIRLPDGSSLQEKFSVTSILRMVKDYVNSNQ 361

Query: 363 ESTFGSYDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQR 422
               G+YDLAVPYPRKV+TDQDL+KSLS L L +RQ+L++V     T   RG S S    
Sbjct: 362 TIGLGAYDLAVPYPRKVYTDQDLDKSLSELRLFDRQALVVVPRKRATVYQRGTSYSESNN 421

Query: 423 NSAANGGS------------------SDENGSEFSEVERRHVRQPNQGTATTSGNNTRGK 482
           N+  N G                        +  S V  R  R PN       G      
Sbjct: 422 NTDPNSGGYFAYVRRVLSYANPFSYFGGGTANASSSVPERQTR-PNTEVRNNLGQVGTSF 481

Query: 483 QPLSTARFGANIH---------SIHTLKKDEDDERFKGRNSFWNGNSTEYGGDDG 492
           Q  S  R               +IHTL  +ED+  F   N+FWNGNST+YGG  G
Sbjct: 482 QDPSEGRSNVRNRRPTTSRIGSNIHTLNHNEDEAPFGDGNAFWNGNSTQYGGGSG 524

BLAST of Cp4.1LG17g10040 vs. NCBI nr
Match: gi|778710172|ref|XP_011656529.1| (PREDICTED: UBX domain-containing protein 4 isoform X1 [Cucumis sativus])

HSP 1 Score: 592.0 bits (1525), Expect = 9.4e-166
Identity = 352/538 (65.43%), Postives = 396/538 (73.61%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           MEQSISSLAFKGS+ EAI+ESKNQ+KLF+VYISGDDAESS LESSTWTSS+VAE+VSKYC
Sbjct: 1   MEQSISSLAFKGSIAEAIVESKNQRKLFLVYISGDDAESSRLESSTWTSSKVAESVSKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           VLLHIPAGS DAAQFSSIYPQKSVPCITAVGYNGIQLW NEGF+GAEVLAS+LEKAWLGL
Sbjct: 61  VLLHIPAGSMDAAQFSSIYPQKSVPCITAVGYNGIQLWLNEGFIGAEVLASNLEKAWLGL 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSETNLGVNSCVAEE 180
           HIQETTASVLTAALASKKSEASTSRPSD  SSSLA+VS +DHHI S ETNLGVNS + EE
Sbjct: 121 HIQETTASVLTAALASKKSEASTSRPSDLRSSSLASVSPSDHHIGSLETNLGVNSGIVEE 180

Query: 181 EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSV---GNNNESPDPSENNKGSLAD 240
           EKG          PEK VK+ED K+D+KES VHHS+SV    N+  SP+PS  +K SLA 
Sbjct: 181 EKG----------PEKLVKQEDSKADIKESNVHHSLSVEIQNNDESSPEPSGKDKSSLAH 240

Query: 241 PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQQEKNKIVDENNAIE 300
           P  Q++CS ENTS IV+DS   PN IES QSGA +PI  E KE  ++ K +IVD+NNAIE
Sbjct: 241 PQDQQSCSPENTSKIVNDSYTTPNLIESSQSGAPQPISLEAKEDVRENK-EIVDDNNAIE 300

Query: 301 NGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGSYDLAVPYP 360
           N SA K+Y S+D+HLNIRLLNG+N+QEKFSKTSTLRM+KDYVDNSQ STFG YDLA+PYP
Sbjct: 301 NDSARKDYASNDVHLNIRLLNGINLQEKFSKTSTLRMIKDYVDNSQPSTFGPYDLAIPYP 360

Query: 361 RKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANGGSSDENGS 420
           RKVFTDQDL KSLS LGL NRQ+LIMVRH GV  D RGASSS+D+R  +ANG SSDEN  
Sbjct: 361 RKVFTDQDLGKSLSDLGLHNRQALIMVRHQGVRSDLRGASSSSDERKFSANGVSSDENSD 420

Query: 421 EFSEVERRHV------------------RQPNQGTATTSGNNTR-------GKQPLSTAR 480
            +    +R +                  R   QG A    NN+         K    TA 
Sbjct: 421 GYFAFVKRILSYVNPFSYLGVGASTASSRHETQGDARQYSNNSLEAEDHYVRKPNQGTAM 480

Query: 481 FGANIHSIHTL----------------KKDEDDERFKGRNSFWNGNSTEYGGDDGDSK 495
            G N                       K D+D+ERFK RNSFWNGNSTEYGGD+ DSK
Sbjct: 481 VGGNNTRGKQPSSSSRFGANIHSIHTLKHDDDEERFKSRNSFWNGNSTEYGGDN-DSK 526

BLAST of Cp4.1LG17g10040 vs. NCBI nr
Match: gi|449446221|ref|XP_004140870.1| (PREDICTED: UBX domain-containing protein 4 isoform X2 [Cucumis sativus])

HSP 1 Score: 587.0 bits (1512), Expect = 3.0e-164
Identity = 352/538 (65.43%), Postives = 395/538 (73.42%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           MEQSISSLAFKGS+ EAI+ESKNQ+KLF+VYISGDDAESS LESSTWTSS+VAE+VSKYC
Sbjct: 1   MEQSISSLAFKGSIAEAIVESKNQRKLFLVYISGDDAESSRLESSTWTSSKVAESVSKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           VLLHIPAGS DAAQFSSIYPQKSVPCITAVGYNGIQLW NEGF+GAEVLAS+LEKAWLGL
Sbjct: 61  VLLHIPAGSMDAAQFSSIYPQKSVPCITAVGYNGIQLWLNEGFIGAEVLASNLEKAWLGL 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSETNLGVNSCVAEE 180
           HIQETTASVLTAALASKKSEASTSRPSD  SSSLA+VS +DHHI S ETNLGVNS + EE
Sbjct: 121 HIQETTASVLTAALASKKSEASTSRPSDLRSSSLASVSPSDHHIGSLETNLGVNSGIVEE 180

Query: 181 EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSV---GNNNESPDPSENNKGSLAD 240
           EKG          PEK VK ED K+D+KES VHHS+SV    N+  SP+PS  +K SLA 
Sbjct: 181 EKG----------PEKLVK-EDSKADIKESNVHHSLSVEIQNNDESSPEPSGKDKSSLAH 240

Query: 241 PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQQEKNKIVDENNAIE 300
           P  Q++CS ENTS IV+DS   PN IES QSGA +PI  E KE  ++ K +IVD+NNAIE
Sbjct: 241 PQDQQSCSPENTSKIVNDSYTTPNLIESSQSGAPQPISLEAKEDVRENK-EIVDDNNAIE 300

Query: 301 NGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGSYDLAVPYP 360
           N SA K+Y S+D+HLNIRLLNG+N+QEKFSKTSTLRM+KDYVDNSQ STFG YDLA+PYP
Sbjct: 301 NDSARKDYASNDVHLNIRLLNGINLQEKFSKTSTLRMIKDYVDNSQPSTFGPYDLAIPYP 360

Query: 361 RKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANGGSSDENGS 420
           RKVFTDQDL KSLS LGL NRQ+LIMVRH GV  D RGASSS+D+R  +ANG SSDEN  
Sbjct: 361 RKVFTDQDLGKSLSDLGLHNRQALIMVRHQGVRSDLRGASSSSDERKFSANGVSSDENSD 420

Query: 421 EFSEVERRHV------------------RQPNQGTATTSGNNTR-------GKQPLSTAR 480
            +    +R +                  R   QG A    NN+         K    TA 
Sbjct: 421 GYFAFVKRILSYVNPFSYLGVGASTASSRHETQGDARQYSNNSLEAEDHYVRKPNQGTAM 480

Query: 481 FGANIHSIHTL----------------KKDEDDERFKGRNSFWNGNSTEYGGDDGDSK 495
            G N                       K D+D+ERFK RNSFWNGNSTEYGGD+ DSK
Sbjct: 481 VGGNNTRGKQPSSSSRFGANIHSIHTLKHDDDEERFKSRNSFWNGNSTEYGGDN-DSK 525

BLAST of Cp4.1LG17g10040 vs. NCBI nr
Match: gi|659089807|ref|XP_008445691.1| (PREDICTED: UBX domain-containing protein 4 [Cucumis melo])

HSP 1 Score: 566.2 bits (1458), Expect = 5.5e-158
Identity = 347/538 (64.50%), Postives = 398/538 (73.98%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           MEQSISSLAFKGS+ EAI+ESKNQ+KLF+VYISGDDAESS LESSTWTSS+VAE+VSKYC
Sbjct: 1   MEQSISSLAFKGSIAEAIVESKNQRKLFLVYISGDDAESSRLESSTWTSSKVAESVSKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           VLLHIPAGS DAAQFSSIYPQKSVPCITAVGYNGIQLW NEGF+ AEVLAS+LEKAWLGL
Sbjct: 61  VLLHIPAGSMDAAQFSSIYPQKSVPCITAVGYNGIQLWLNEGFISAEVLASNLEKAWLGL 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSETNLGVNSCVAEE 180
           HIQETTASVLTAALASKKSEASTSR SD  SSSLA+VS +DHHI SSETNLGVNS + EE
Sbjct: 121 HIQETTASVLTAALASKKSEASTSRTSDLCSSSLASVSPSDHHIGSSETNLGVNSGIVEE 180

Query: 181 EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSV--GNNNE-SPDPSENNKGSLAD 240
           EKG          PEK VK EDIK+D+KES VHHS+SV   NN+E S +PSE +K SLA 
Sbjct: 181 EKG----------PEKLVK-EDIKADIKESNVHHSLSVEIQNNDELSLEPSEKDKSSLAH 240

Query: 241 PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQQEKNKIVDENNAIE 300
           P  Q++CS +N S IV+DS I P  IES QS A +P+  E KE  ++ K +IVD+NNAIE
Sbjct: 241 PRDQESCSPKNASKIVNDSYITPKLIESSQSRAPQPMSLEAKEEVRENK-EIVDDNNAIE 300

Query: 301 NGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGSYDLAVPYP 360
           N SA K+YTS+D+HLNIRLLNG+N+QEKF KTSTLRM+KDYVDNSQ STFGSYDLA+PYP
Sbjct: 301 NDSAHKDYTSNDVHLNIRLLNGINLQEKFPKTSTLRMIKDYVDNSQPSTFGSYDLAIPYP 360

Query: 361 RKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANGGSSDENGS 420
           RKVFTDQDL KSLS LGL NRQ+LI VRH GV+ + RG SSS D+R  +A+G SSDEN  
Sbjct: 361 RKVFTDQDLGKSLSDLGLHNRQALITVRHRGVSTNLRGGSSS-DERKFSADGVSSDENSD 420

Query: 421 EFSEVERRHVRQPNQ-------GTATTSGNNTRG-----------------KQP-LSTAR 480
            +    +R +   N         +A +S + T+G                 +QP   TA 
Sbjct: 421 GYFAFVKRILSYVNPFSYLGVGASAASSRHETQGDARQYSNSALEAENYYVRQPNQGTAM 480

Query: 481 FGANIHSIHTL----------------KKDEDDERFKGRNSFWNGNSTEYGGDDGDSK 495
            G N                       K D+DDERF+ RNSFWNGNSTEYGGD+ DSK
Sbjct: 481 AGENNTRGKQPSSSSRFGANIHSIHTLKHDDDDERFRSRNSFWNGNSTEYGGDN-DSK 524

BLAST of Cp4.1LG17g10040 vs. NCBI nr
Match: gi|778710182|ref|XP_011656531.1| (PREDICTED: UBX domain-containing protein 4 isoform X3 [Cucumis sativus])

HSP 1 Score: 565.8 bits (1457), Expect = 7.2e-158
Identity = 316/428 (73.83%), Postives = 355/428 (82.94%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           MEQSISSLAFKGS+ EAI+ESKNQ+KLF+VYISGDDAESS LESSTWTSS+VAE+VSKYC
Sbjct: 1   MEQSISSLAFKGSIAEAIVESKNQRKLFLVYISGDDAESSRLESSTWTSSKVAESVSKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           VLLHIPAGS DAAQFSSIYPQKSVPCITAVGYNGIQLW NEGF+GAEVLAS+LEKAWLGL
Sbjct: 61  VLLHIPAGSMDAAQFSSIYPQKSVPCITAVGYNGIQLWLNEGFIGAEVLASNLEKAWLGL 120

Query: 121 HIQETTASVLTAALASKKSEASTSRPSDSGSSSLAAVSHADHHIDSSETNLGVNSCVAEE 180
           HIQETTASVLTAALASKKSEASTSRPSD  SSSLA+VS +DHHI S ETNLGVNS + EE
Sbjct: 121 HIQETTASVLTAALASKKSEASTSRPSDLRSSSLASVSPSDHHIGSLETNLGVNSGIVEE 180

Query: 181 EKGTENSSKKRKEPEKRVKEEDIKSDVKESTVHHSVSV---GNNNESPDPSENNKGSLAD 240
           EKG          PEK VK+ED K+D+KES VHHS+SV    N+  SP+PS  +K SLA 
Sbjct: 181 EKG----------PEKLVKQEDSKADIKESNVHHSLSVEIQNNDESSPEPSGKDKSSLAH 240

Query: 241 PGGQKNCSSENTSTIVHDSPIIPNHIESCQSGASRPIPPETKEVAQQEKNKIVDENNAIE 300
           P  Q++CS ENTS IV+DS   PN IES QSGA +PI  E KE  ++ K +IVD+NNAIE
Sbjct: 241 PQDQQSCSPENTSKIVNDSYTTPNLIESSQSGAPQPISLEAKEDVRENK-EIVDDNNAIE 300

Query: 301 NGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGSYDLAVPYP 360
           N SA K+Y S+D+HLNIRLLNG+N+QEKFSKTSTLRM+KDYVDNSQ STFG YDLA+PYP
Sbjct: 301 NDSARKDYASNDVHLNIRLLNGINLQEKFSKTSTLRMIKDYVDNSQPSTFGPYDLAIPYP 360

Query: 361 RKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANGGSSDENGS 420
           RKVFTDQDL KSLS LGL NRQ+LIMVRH GV  D RGASSS+D+R  +ANG SSDEN  
Sbjct: 361 RKVFTDQDLGKSLSDLGLHNRQALIMVRHQGVRSDLRGASSSSDERKFSANGVSSDENSD 417

Query: 421 EFSEVERR 426
            +    +R
Sbjct: 421 GYFAFVKR 417

BLAST of Cp4.1LG17g10040 vs. NCBI nr
Match: gi|694418995|ref|XP_009337470.1| (PREDICTED: UBX domain-containing protein 4-like [Pyrus x bretschneideri])

HSP 1 Score: 368.6 bits (945), Expect = 1.7e-98
Identity = 247/537 (46.00%), Postives = 320/537 (59.59%), Query Frame = 1

Query: 1   MEQSISSLAFKGSVPEAIIESKNQKKLFVVYISGDDAESSSLESSTWTSSRVAEAVSKYC 60
           MEQS++SL +KGS+PEAI E+K Q+KLFVVYISG + ESS LE+STWT   VA++V KYC
Sbjct: 1   MEQSLTSLTYKGSIPEAITEAKKQRKLFVVYISGKNDESSRLENSTWTDLNVADSVEKYC 60

Query: 61  VLLHIPAGSSDAAQFSSIYPQKSVPCITAVGYNGIQLWQNEGFVGAEVLASSLEKAWLGL 120
           +LLHIP  S+DAA FS+IYPQKSVPCITA+GYNG+QLWQNEGF+ AE LASSLEKAWLGL
Sbjct: 61  ILLHIPEESTDAANFSAIYPQKSVPCITAIGYNGVQLWQNEGFLSAEALASSLEKAWLGL 120

Query: 121 HIQETTASVLTAALASKKSEASTS------RPSDSGSSSLAAVSHADHHIDSSETNLGVN 180
           HIQETTA+VL+AALAS KSE STS         +  SSS A +S      + S ++  V 
Sbjct: 121 HIQETTATVLSAALASNKSEPSTSGVPNAVSTGEESSSSKAVISD-----EGSSSSTAVE 180

Query: 181 SCVAEEEKGTENSSKKRKEPEKRVKEEDIKSDVKESTVH----HSVSVGNNNESPDPSEN 240
           S + +    + ++     +   + K  D K +   ST      H V++ + N+S      
Sbjct: 181 SPLIDTNVQSPDAIDAASDTVMKNKGRDCKIESSSSTKSAQGVHDVAMEDLNKSG----- 240

Query: 241 NKGSLADPGGQKNCSSENTSTIVHDSPIIPNHIESCQSG-ASRPIPPETKEVAQQEKNKI 300
             G + +P  +   S    +T+ H          SC SG  S+ +     EV Q +K ++
Sbjct: 241 --GDIVNPVAEGGYSGPEKTTVNH----------SCVSGEGSQAVSNVKSEVVQVDKGEV 300

Query: 301 VDENNAIENGSAPKNYTSSDIHLNIRLLNGVNVQEKFSKTSTLRMVKDYVDNSQESTFGS 360
            D+       +  +   SSD+HLNIRL NGV+++EKFS TST+RMVK+YVD +Q S  G+
Sbjct: 301 EDDKKVETFENCTRVSRSSDVHLNIRLPNGVSLKEKFSITSTMRMVKNYVDENQGSGIGN 360

Query: 361 YDLAVPYPRKVFTDQDLEKSLSYLGLSNRQSLIMVRHLGVTRDFRGASSSADQRNSAANG 420
           YDLA+PYPRK+F DQDL KSLS LGL +RQ+LI+V     T    G S+ +DQ +S    
Sbjct: 361 YDLAIPYPRKIFCDQDLSKSLSDLGLFDRQALIVVPRQKGTGYLSGTSAFSDQTDSRNTV 420

Query: 421 GSSD-ENGSEFSEVER-----------------------------RHVRQPNQGTATTSG 480
            SSD  NG  FS V+R                              +   P +  + T  
Sbjct: 421 SSSDGSNGGYFSYVKRLLSYFNPLSYLGSGASSSSSGQQSENGTWEYSPNPTRRNSVTQN 480

Query: 481 NNTRGKQPLSTAR------FGANIHSIHTLKKDEDDERFKGRNSFWNGNSTEYGGDD 491
            +T GK     +R      FG+N   IHTLK+DEDDERF  RNSFWNGNSTEYGG+D
Sbjct: 481 ASTSGKSEEKKSRKPPASGFGSN---IHTLKRDEDDERFXDRNSFWNGNSTEYGGND 512

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PUX11_ARATH3.2e-7341.12Plant UBX domain-containing protein 11 OS=Arabidopsis thaliana GN=PUX11 PE=1 SV=... [more]
UBXN4_HUMAN5.4e-1222.28UBX domain-containing protein 4 OS=Homo sapiens GN=UBXN4 PE=1 SV=2[more]
UBXN4_PONAB7.0e-1222.28UBX domain-containing protein 4 OS=Pongo abelii GN=UBXN4 PE=2 SV=1[more]
UBXN4_RAT1.9e-0923.83UBX domain-containing protein 4 OS=Rattus norvegicus GN=Ubxn4 PE=1 SV=1[more]
UBXN4_MOUSE1.6e-0823.10UBX domain-containing protein 4 OS=Mus musculus GN=Ubxn4 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KC97_CUCSA6.5e-16665.43Uncharacterized protein OS=Cucumis sativus GN=Csa_6G041180 PE=4 SV=1[more]
B9HHD9_POPTR2.4e-9646.23UBX domain-containing family protein OS=Populus trichocarpa GN=POPTR_0007s02410g... [more]
A0A061EJB9_THECC8.1e-9245.39Ubiquitin-like superfamily protein, putative isoform 3 OS=Theobroma cacao GN=TCM... [more]
M5XDI9_PRUPE3.1e-9143.78Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003931mg PE=4 SV=1[more]
A0A061ERL5_THECC3.8e-8944.95Ubiquitin-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM... [more]
Match NameE-valueIdentityDescription
AT2G43210.11.8e-7441.12 Ubiquitin-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778710172|ref|XP_011656529.1|9.4e-16665.43PREDICTED: UBX domain-containing protein 4 isoform X1 [Cucumis sativus][more]
gi|449446221|ref|XP_004140870.1|3.0e-16465.43PREDICTED: UBX domain-containing protein 4 isoform X2 [Cucumis sativus][more]
gi|659089807|ref|XP_008445691.1|5.5e-15864.50PREDICTED: UBX domain-containing protein 4 [Cucumis melo][more]
gi|778710182|ref|XP_011656531.1|7.2e-15873.83PREDICTED: UBX domain-containing protein 4 isoform X3 [Cucumis sativus][more]
gi|694418995|ref|XP_009337470.1|1.7e-9846.00PREDICTED: UBX domain-containing protein 4-like [Pyrus x bretschneideri][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR012336Thioredoxin-like_fold
IPR001012UBX_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG17g10040.1Cp4.1LG17g10040.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001012UBX domainPFAMPF00789UBXcoord: 307..384
score: 1.4
IPR001012UBX domainSMARTSM00166ubx_3coord: 302..385
score: 5.
IPR001012UBX domainPROFILEPS50033UBXcoord: 305..383
score: 16
IPR012336Thioredoxin-like foldunknownSSF52833Thioredoxin-likecoord: 8..116
score: 4.03
NoneNo IPR availableGENE3DG3DSA:3.10.20.90coord: 306..383
score: 3.7
NoneNo IPR availablePANTHERPTHR13020UBIQUITIN-ASSOCIATED UBA/UBX DOMAIN-CONTAININGcoord: 1..153
score: 1.6
NoneNo IPR availablePANTHERPTHR13020:SF36EXPRESSED PROTEINcoord: 1..153
score: 1.6