Cp4.1LG14g04750 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g04750
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Transmembrane protein, putative
Location: Cp4.1LG14 : 1449787 .. 1453963 (+)
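
The location field can be read as sequence ID, start, end, and strand. The sketch below parses that string and derives the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which the record itself does not state), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Location string as printed in the record above.
location = "Cp4.1LG14 : 1449787 .. 1453963 (+)"

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its four parts."""
    m = re.fullmatch(
        r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(location)

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, strand, span)
```

Under the inclusive-coordinate assumption the gene spans 4177 bp on the plus strand of Cp4.1LG14.
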
The following sequences are available for this feature:

Gene sequence (with intron)

CAATCCGTAGATATCGATCCAACGGCTATAAACGCTTCTATTCCTTCCCAAATTCGCACACTTGCTTACCTATTCTTCGCAACTCCAACGGTCCAATACAAAACCCTCTATTTTTAAAATTCCAGAAGATCCCTTCATTAAATTTGAAAACTAGCCGTTACTAACCAAGAAGCTAAGGAATCATCTTCACCAAAACCCCCCCTCTCCCCCCTTAAATATTTTCGCCGACATTTTCATTTCTGATCCAATTTGAATCTCTCAAATTTCTCTTTTCCTTTATATCTGAAATCTTCACCCAGATCCATTTTAATGGCGCTGCCGTCTAACAGATCGTCTTCACCGTCGATGGTGACCGGAAGAACAGGCCCTATTTCTAGAAATTCCGAAATCGGCAACCCTGTCTACCGGAGCTTCTCCAGTAATCCGTTTTCGAAGCCGTCGATCGCCACCAGTTTGAAAAGCTTGAATCCAATCACTCCGGCGAATAGTCCCTCCGGTTTGTTTGAGTTTTCTTCATGCTTAGTTTCTTTCCTGTGATCTTAGAGTGGTTTAATTCTGTTTGGTTGCTGAGAAAGTGGCTTAGAATATCTGCTTTTCCTTTATGTTTTGCTCTGATTTTGCTCTCTCCTTGCTTGGTTTCTTCCTTTAGAACTGGAAATTTGTTTGGTTGCCGAGAAATTGGATGAGAAAATTTGTTAGGGTTAGGGTTACAGAATCAAAGAATCTCTGATGCTCACTAGCATAGCCATTTCGATTCTTCCATCGATATAATCCGTTCCTTTCTTTGTTCTTCTAAACATGATGATTTCCATCTCCAACGTTTCTGCGATCTTTGTTCTTATTTGTTTCTTTGAATCAAAGTTGCAGATTATCCGCCACAAAGGAATTCTGTGAGCAGAGAAATTTTATTTACTTCTCGAGACAATGAGGATAAAGAAAATGGGAAAGATCAGAGTCCGAAACTCACCCGAGTCCGTTCGCCGACTGTCGGAAAATCGATGAAGAATTTCATGTCGTCTACAATTTCCGCCGCCTCCAAGATTGCCGTCTCTCCGAAGAAGAAGATTCTGGGCGATCGGAACGAGCCAGTTCGGTCGTCTCTTTCATTTTCCGGCATGAAAAGCTCTTCACTCAACTCGGTGAATCCAACTCCTGAGGCATCAATGGCATTTGAATCCGATACGAACCCTCCAATGCCTCTGATTTCATATCCTAAATCAACAAAAACAGTGAGATTCGGTGGTGTTGAGGTAATTTCTGGTTCGTACGAGGATTCAGAATCGACTTACCGATATGATTTGAACCCGGAGTTGGTGGCAGCTGTAACCGATTCGAAGTCTGGAATTGTTCCGATTACTAAATCTGCCATTGCAGAAGGATCTTCCAAATCTTCTAAAACTGTGACATTCGGTGGTTTTGAGGTTATTTCTGATTTCTGTGACGATTCGGAATCCACATACCGACATGGACACCATCCGAACCCAGAAGCTGTAACAGTGGCTGTTGAGGCCAACGCAGAGCCTGAAATCGGTCCGATTTCAGATTCCGACATTGCAGCTGTAACTCCCGAAGCTTCAAAGATTATGAGATTTTCTGATTTCGAAGCTGTCTCGAACAACGCTTTAGAGTCTTCGGTTAACAGTAATTTAACTGAAGAAGTGGATTCTGTTAATCTTGACCCAAGTTTTAACATCAGTCCAGTTTCTTCTCCAATGATAGCACCTATAATCACTCCTTATGATCCTAAAACTAATTATCTGTCGCCAAGGCCACAGTTCCTCCATTACAACCCAAACCGAAGAATCAATCGACCAGATGGTAGATTTGAGGAACTCTTTTCCTCTTCGGAGGAGACTGACTGTGAAGATCCACAGAAGGAATCTGATGAAGTTTCTTCCAATGAATCGCAGATGAAAGAAGAAGAAAAAGAAGAAGAAGAGGTTAATGTTTCTATACAAGGCCCCACGGAAGTTAAAAAGTCGTCGAAGCCTCTGTTGTCT
AGGATATTCAAGATCAGTTCTCTGCTTTTGATTTTGTTTACTGCTTGCTTGTCGATTTGTGTTGTGAATGTCCATGATCCAACTATCTTCGAAAGATCAACCTTGTTAACAATGGGGGATCAGTCTGAGATTTTCGAGTCTGCTAAAACGAATTTCAATGTGTTGGTTGGGAAACTTGAGATTTGGCATGCGAATTCCATCTCTTTTATTTCTGATGTGGTTTTCAACTTCAGAGGAGGGCCGCCATTGATACATCTGAACCAAACTGAGTTTTTCTACGGGGATGTCAATAAGGATGAACAGTGTCTTGTATTATCTCATCAGAACGTGTGGGAAGAAGAAAACAATTTGATTAATGCAATGGAAGCCATGAAGGAGAGAGTAATTGTTGATACTTTTGAAGAGCCTATTGACAGAGAAGGTCAGAATAAAGAGGGACAAGAACAAGAAGAAGATGCACAAGAGGTTGAAGCCATAAAGGTGAGAGAAATTGGCATCGAAACCTTTGAAAGAGAATCTCACAATGAAGAAGTAGAAGAAGAATCGTTTCAAGAGATTGAGGCCAGAACCAATGATTCAGCAGACATTGAAGAAGAGAATAACGAGGCTTCTGAAGAATCATTACAAGAAATCATTGAACATATTGAGGGAGAAGGTCAGAATATAGAGGGACAAGAACAGCAAGAAGAAGCACAAGATACCGAAGCCATGAAGGTGAGAGAAATTGGAATCGAAACTGTTGAAAGAGAATCTCAGAACGAAGAAGTAGAAGAAGAACCGTTTCAAAAAGACAGCGAAGAAGAGAATGACGAGGCTTCTGAAGAATCATTACTAGAAATCGTTGAAGAAGAATACGTTCAAGAGAAAACCGTAGAGAATTTCAAAGCTTCTTCATCGTCTGATTTTAAATTACATGATGAAATTGAAAAAGCAGCAGCAACAGGGGAAACACAGGAAGAAACGAACACAGAATTTCAATACCAGTCACCTCCAGTCTCTTCTCCTCCATCTGAACATCAATCTGATGTTGAAGAAGAAAATGGCGGCAAAATCGTCGATCTCATCAGAACAGCCACCGGAATATCTCGTGATTTCACACAGAACACGGCTGCAATAATATCTGCAATACTGCTAGGTTTATTTCTTATTATACCTGCAGGACTGATTTATGCAAGGAAATCAAGCTCAAGACGAACAACATCCACGGCGGCCATTGCTGAAGAGCAGCAAGAGGAGCCATTGCTGAAAGATAAGAAGACGAACCAGAGTCTGGTGGAAGAGGAAGAAGAAGAAGCACTGGATGATGATGATGAAGATGATATGGCTGGAGAATTTTGCTCTTCTGAAACGAGTAGTTTTTTCCAGTACAGCAGCGTGAGAGAAGGAGAAACAGAAGCAGCGAAAAGATCGAGTGAGTTTCAAAGCCATAGCCATGTGAGGAGGGAGAATTCAAGAAGAGAATCCATCGCTGAAGCAGCGACGAGATCGAGTGAAGTTCAGAGCCATAGCCATGGGAGGAGGAAGATTAGGATGGAGAATTCACGAAGAGAATCCATGGCTTCTTCTTCTTTGGATGATTACTCAGTGTCCTCTTCGGCTTCTCCATCTTATGGGCGTTTCACAACTTATGAGAAGATCCCAATCAAGCATGTGAGTATCTCAAAACTTTTTTTTTCCTTTTTTCCCCTTAAATTAGTGATTTAAATTATATTATTATTTTAATGTTTTTACTTTATTTTATTTTTTAAATAGAAACTATATTTATAACGGTACAATAATAACTAATAAAATTTTTGTTTCTAATTAGAGTGACATTTTTTTTTTTTTTTTCAGAAAAATGGAGACGAAGAGATTGTGACTCCCGTCAGGCGCTCTAGTAGGATTAGAAATCGACACAATAATAGTTGATTCGTGTATTAAATGAAAAAAAAAACGTGCTTATATGTTAAACTTTTGGAGTGTTTTTGTGCATACATTTAACCTCGAGTCAAATGTTATGA
GTAAAACTCCGAGGAAGCAAGACAATAGTAGTTAGAATCAGAAAGCAATGCACAACAAATTGCTGGAATTTGAGAGTATTTTGTGCCCGTGCGAGTTTCATATTAGAGCAACTTTTTATTGCATTTTTTATACGAAATAATCATGATAATACTATATATATTTGGATAGAGGGAAAT

mRNA sequence

CAATCCGTAGATATCGATCCAACGGCTATAAACGCTTCTATTCCTTCCCAAATTCGCACACTTGCTTACCTATTCTTCGCAACTCCAACGGTCCAATACAAAACCCTCTATTTTTAAAATTCCAGAAGATCCCTTCATTAAATTTGAAAACTAGCCGTTACTAACCAAGAAGCTAAGGAATCATCTTCACCAAAACCCCCCCTCTCCCCCCTTAAATATTTTCGCCGACATTTTCATTTCTGATCCAATTTGAATCTCTCAAATTTCTCTTTTCCTTTATATCTGAAATCTTCACCCAGATCCATTTTAATGGCGCTGCCGTCTAACAGATCGTCTTCACCGTCGATGGTGACCGGAAGAACAGGCCCTATTTCTAGAAATTCCGAAATCGGCAACCCTGTCTACCGGAGCTTCTCCAGTAATCCGTTTTCGAAGCCGTCGATCGCCACCAGTTTGAAAAGCTTGAATCCAATCACTCCGGCGAATAGTCCCTCCGATTATCCGCCACAAAGGAATTCTGTGAGCAGAGAAATTTTATTTACTTCTCGAGACAATGAGGATAAAGAAAATGGGAAAGATCAGAGTCCGAAACTCACCCGAGTCCGTTCGCCGACTGTCGGAAAATCGATGAAGAATTTCATGTCGTCTACAATTTCCGCCGCCTCCAAGATTGCCGTCTCTCCGAAGAAGAAGATTCTGGGCGATCGGAACGAGCCAGTTCGGTCGTCTCTTTCATTTTCCGGCATGAAAAGCTCTTCACTCAACTCGGTGAATCCAACTCCTGAGGCATCAATGGCATTTGAATCCGATACGAACCCTCCAATGCCTCTGATTTCATATCCTAAATCAACAAAAACAGTGAGATTCGGTGGTGTTGAGGTAATTTCTGGTTCGTACGAGGATTCAGAATCGACTTACCGATATGATTTGAACCCGGAGTTGGTGGCAGCTGTAACCGATTCGAAGTCTGGAATTGTTCCGATTACTAAATCTGCCATTGCAGAAGGATCTTCCAAATCTTCTAAAACTGTGACATTCGGTGGTTTTGAGGTTATTTCTGATTTCTGTGACGATTCGGAATCCACATACCGACATGGACACCATCCGAACCCAGAAGCTGTAACAGTGGCTGTTGAGGCCAACGCAGAGCCTGAAATCGGTCCGATTTCAGATTCCGACATTGCAGCTGTAACTCCCGAAGCTTCAAAGATTATGAGATTTTCTGATTTCGAAGCTGTCTCGAACAACGCTTTAGAGTCTTCGGTTAACAGTAATTTAACTGAAGAAGTGGATTCTGTTAATCTTGACCCAAGTTTTAACATCAGTCCAGTTTCTTCTCCAATGATAGCACCTATAATCACTCCTTATGATCCTAAAACTAATTATCTGTCGCCAAGGCCACAGTTCCTCCATTACAACCCAAACCGAAGAATCAATCGACCAGATGGTAGATTTGAGGAACTCTTTTCCTCTTCGGAGGAGACTGACTGTGAAGATCCACAGAAGGAATCTGATGAAGTTTCTTCCAATGAATCGCAGATGAAAGAAGAAGAAAAAGAAGAAGAAGAGGTTAATGTTTCTATACAAGGCCCCACGGAAGTTAAAAAGTCGTCGAAGCCTCTGTTGTCTAGGATATTCAAGATCAGTTCTCTGCTTTTGATTTTGTTTACTGCTTGCTTGTCGATTTGTGTTGTGAATGTCCATGATCCAACTATCTTCGAAAGATCAACCTTGTTAACAATGGGGGATCAGTCTGAGATTTTCGAGTCTGCTAAAACGAATTTCAATGTGTTGGTTGGGAAACTTGAGATTTGGCATGCGAATTCCATCTCTTTTATTTCTGATGTGGTTTTCAACTTCAGAGGAGGGCCGCCATTGATACATCTGAACCAAACTGAGTTTTTCTACGGGGATGTCAATAAGGATGAACAGTGTCTTGTATTATCTCATCAGAACGTGTGGGAAGAAGAAAACAATTTGATTAATGCAATGGAAGCCAT
GAAGGAGAGAGTAATTGTTGATACTTTTGAAGAGCCTATTGACAGAGAAGGTCAGAATAAAGAGGGACAAGAACAAGAAGAAGATGCACAAGAGGTTGAAGCCATAAAGGTGAGAGAAATTGGCATCGAAACCTTTGAAAGAGAATCTCACAATGAAGAAGTAGAAGAAGAATCGTTTCAAGAGATTGAGGCCAGAACCAATGATTCAGCAGACATTGAAGAAGAGAATAACGAGGCTTCTGAAGAATCATTACAAGAAATCATTGAACATATTGAGGGAGAAGGTCAGAATATAGAGGGACAAGAACAGCAAGAAGAAGCACAAGATACCGAAGCCATGAAGGTGAGAGAAATTGGAATCGAAACTGTTGAAAGAGAATCTCAGAACGAAGAAGTAGAAGAAGAACCGTTTCAAAAAGACAGCGAAGAAGAGAATGACGAGGCTTCTGAAGAATCATTACTAGAAATCGTTGAAGAAGAATACGTTCAAGAGAAAACCGTAGAGAATTTCAAAGCTTCTTCATCGTCTGATTTTAAATTACATGATGAAATTGAAAAAGCAGCAGCAACAGGGGAAACACAGGAAGAAACGAACACAGAATTTCAATACCAGTCACCTCCAGTCTCTTCTCCTCCATCTGAACATCAATCTGATGTTGAAGAAGAAAATGGCGGCAAAATCGTCGATCTCATCAGAACAGCCACCGGAATATCTCGTGATTTCACACAGAACACGGCTGCAATAATATCTGCAATACTGCTAGGTTTATTTCTTATTATACCTGCAGGACTGATTTATGCAAGGAAATCAAGCTCAAGACGAACAACATCCACGGCGGCCATTGCTGAAGAGCAGCAAGAGGAGCCATTGCTGAAAGATAAGAAGACGAACCAGAGTCTGGTGGAAGAGGAAGAAGAAGAAGCACTGGATGATGATGATGAAGATGATATGGCTGGAGAATTTTGCTCTTCTGAAACGAGTAGTTTTTTCCAGTACAGCAGCGTGAGAGAAGGAGAAACAGAAGCAGCGAAAAGATCGAGTGAGTTTCAAAGCCATAGCCATGTGAGGAGGGAGAATTCAAGAAGAGAATCCATCGCTGAAGCAGCGACGAGATCGAGTGAAGTTCAGAGCCATAGCCATGGGAGGAGGAAGATTAGGATGGAGAATTCACGAAGAGAATCCATGGCTTCTTCTTCTTTGGATGATTACTCAGTGTCCTCTTCGGCTTCTCCATCTTATGGGCGTTTCACAACTTATGAGAAGATCCCAATCAAGCATAAAAATGGAGACGAAGAGATTGTGACTCCCGTCAGGCGCTCTAGTAGGATTAGAAATCGACACAATAATAGTTGATTCGTGTATTAAATGAAAAAAAAAACGTGCTTATATGTTAAACTTTTGGAGTGTTTTTGTGCATACATTTAACCTCGAGTCAAATGTTATGAGTAAAACTCCGAGGAAGCAAGACAATAGTAGTTAGAATCAGAAAGCAATGCACAACAAATTGCTGGAATTTGAGAGTATTTTGTGCCCGTGCGAGTTTCATATTAGAGCAACTTTTTATTGCATTTTTTATACGAAATAATCATGATAATACTATATATATTTGGATAGAGGGAAAT

Coding sequence (CDS)

ATGGCGCTGCCGTCTAACAGATCGTCTTCACCGTCGATGGTGACCGGAAGAACAGGCCCTATTTCTAGAAATTCCGAAATCGGCAACCCTGTCTACCGGAGCTTCTCCAGTAATCCGTTTTCGAAGCCGTCGATCGCCACCAGTTTGAAAAGCTTGAATCCAATCACTCCGGCGAATAGTCCCTCCGATTATCCGCCACAAAGGAATTCTGTGAGCAGAGAAATTTTATTTACTTCTCGAGACAATGAGGATAAAGAAAATGGGAAAGATCAGAGTCCGAAACTCACCCGAGTCCGTTCGCCGACTGTCGGAAAATCGATGAAGAATTTCATGTCGTCTACAATTTCCGCCGCCTCCAAGATTGCCGTCTCTCCGAAGAAGAAGATTCTGGGCGATCGGAACGAGCCAGTTCGGTCGTCTCTTTCATTTTCCGGCATGAAAAGCTCTTCACTCAACTCGGTGAATCCAACTCCTGAGGCATCAATGGCATTTGAATCCGATACGAACCCTCCAATGCCTCTGATTTCATATCCTAAATCAACAAAAACAGTGAGATTCGGTGGTGTTGAGGTAATTTCTGGTTCGTACGAGGATTCAGAATCGACTTACCGATATGATTTGAACCCGGAGTTGGTGGCAGCTGTAACCGATTCGAAGTCTGGAATTGTTCCGATTACTAAATCTGCCATTGCAGAAGGATCTTCCAAATCTTCTAAAACTGTGACATTCGGTGGTTTTGAGGTTATTTCTGATTTCTGTGACGATTCGGAATCCACATACCGACATGGACACCATCCGAACCCAGAAGCTGTAACAGTGGCTGTTGAGGCCAACGCAGAGCCTGAAATCGGTCCGATTTCAGATTCCGACATTGCAGCTGTAACTCCCGAAGCTTCAAAGATTATGAGATTTTCTGATTTCGAAGCTGTCTCGAACAACGCTTTAGAGTCTTCGGTTAACAGTAATTTAACTGAAGAAGTGGATTCTGTTAATCTTGACCCAAGTTTTAACATCAGTCCAGTTTCTTCTCCAATGATAGCACCTATAATCACTCCTTATGATCCTAAAACTAATTATCTGTCGCCAAGGCCACAGTTCCTCCATTACAACCCAAACCGAAGAATCAATCGACCAGATGGTAGATTTGAGGAACTCTTTTCCTCTTCGGAGGAGACTGACTGTGAAGATCCACAGAAGGAATCTGATGAAGTTTCTTCCAATGAATCGCAGATGAAAGAAGAAGAAAAAGAAGAAGAAGAGGTTAATGTTTCTATACAAGGCCCCACGGAAGTTAAAAAGTCGTCGAAGCCTCTGTTGTCTAGGATATTCAAGATCAGTTCTCTGCTTTTGATTTTGTTTACTGCTTGCTTGTCGATTTGTGTTGTGAATGTCCATGATCCAACTATCTTCGAAAGATCAACCTTGTTAACAATGGGGGATCAGTCTGAGATTTTCGAGTCTGCTAAAACGAATTTCAATGTGTTGGTTGGGAAACTTGAGATTTGGCATGCGAATTCCATCTCTTTTATTTCTGATGTGGTTTTCAACTTCAGAGGAGGGCCGCCATTGATACATCTGAACCAAACTGAGTTTTTCTACGGGGATGTCAATAAGGATGAACAGTGTCTTGTATTATCTCATCAGAACGTGTGGGAAGAAGAAAACAATTTGATTAATGCAATGGAAGCCATGAAGGAGAGAGTAATTGTTGATACTTTTGAAGAGCCTATTGACAGAGAAGGTCAGAATAAAGAGGGACAAGAACAAGAAGAAGATGCACAAGAGGTTGAAGCCATAAAGGTGAGAGAAATTGGCATCGAAACCTTTGAAAGAGAATCTCACAATGAAGAAGTAGAAGAAGAATCGTTTCAAGAGATTGAGGCCAGAACCAATGATTCAGCAGACATTGAAGAAGAGAATAACGAGGCTTCTGAAGAATCATTACAAGAAATCATTGAACATATTGAGGGAGAAGGTCAGAATATAGAGGGACAAGAACA
GCAAGAAGAAGCACAAGATACCGAAGCCATGAAGGTGAGAGAAATTGGAATCGAAACTGTTGAAAGAGAATCTCAGAACGAAGAAGTAGAAGAAGAACCGTTTCAAAAAGACAGCGAAGAAGAGAATGACGAGGCTTCTGAAGAATCATTACTAGAAATCGTTGAAGAAGAATACGTTCAAGAGAAAACCGTAGAGAATTTCAAAGCTTCTTCATCGTCTGATTTTAAATTACATGATGAAATTGAAAAAGCAGCAGCAACAGGGGAAACACAGGAAGAAACGAACACAGAATTTCAATACCAGTCACCTCCAGTCTCTTCTCCTCCATCTGAACATCAATCTGATGTTGAAGAAGAAAATGGCGGCAAAATCGTCGATCTCATCAGAACAGCCACCGGAATATCTCGTGATTTCACACAGAACACGGCTGCAATAATATCTGCAATACTGCTAGGTTTATTTCTTATTATACCTGCAGGACTGATTTATGCAAGGAAATCAAGCTCAAGACGAACAACATCCACGGCGGCCATTGCTGAAGAGCAGCAAGAGGAGCCATTGCTGAAAGATAAGAAGACGAACCAGAGTCTGGTGGAAGAGGAAGAAGAAGAAGCACTGGATGATGATGATGAAGATGATATGGCTGGAGAATTTTGCTCTTCTGAAACGAGTAGTTTTTTCCAGTACAGCAGCGTGAGAGAAGGAGAAACAGAAGCAGCGAAAAGATCGAGTGAGTTTCAAAGCCATAGCCATGTGAGGAGGGAGAATTCAAGAAGAGAATCCATCGCTGAAGCAGCGACGAGATCGAGTGAAGTTCAGAGCCATAGCCATGGGAGGAGGAAGATTAGGATGGAGAATTCACGAAGAGAATCCATGGCTTCTTCTTCTTTGGATGATTACTCAGTGTCCTCTTCGGCTTCTCCATCTTATGGGCGTTTCACAACTTATGAGAAGATCCCAATCAAGCATAAAAATGGAGACGAAGAGATTGTGACTCCCGTCAGGCGCTCTAGTAGGATTAGAAATCGACACAATAATAGTTGA

Protein sequence

MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANSPSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKSTKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKTVTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASKIMRFSDFEAVSNNALESSVNSNLTEEVDSVNLDPSFNISPVSSPMIAPIITPYDPKTNYLSPRPQFLHYNPNRRINRPDGRFEELFSSSEETDCEDPQKESDEVSSNESQMKEEEKEEEEVNVSIQGPTEVKKSSKPLLSRIFKISSLLLILFTACLSICVVNVHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHANSISFISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMKERVIVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHNEEVEEESFQEIEARTNDSADIEEENNEASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIGIETVERESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEKTVENFKASSSSDFKLHDEIEKAAATGETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSSSRRTTSTAAIAEEQQEEPLLKDKKTNQSLVEEEEEEALDDDDEDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRENSRRESIAEAATRSSEVQSHSHGRRKIRMENSRRESMASSSLDDYSVSSSASPSYGRFTTYEKIPIKHKNGDEEIVTPVRRSSRIRNRHNNS
BLAST of Cp4.1LG14g04750 vs. TrEMBL
Match: E5GBH8_CUCME (Putative uncharacterized protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 717.2 bits (1850), Expect = 2.8e-203
Identity = 550/1053 (52.23%), Positives = 652/1053 (61.92%), Query Frame = 1

Query: 1    MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
            MALPSNRSSSPS+ +GRT P SR+SEI NP+ RSFS NPFSK SI  + + LNPITPANS
Sbjct: 1    MALPSNRSSSPSLPSGRTSPTSRSSEISNPIRRSFSGNPFSKQSIVANPRGLNPITPANS 60

Query: 61   PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
            PSDYP +RNS++RE  FTSRD  +KENGKDQSPK  RVRS              +  +SK
Sbjct: 61   PSDYP-RRNSMNRENSFTSRDIPEKENGKDQSPKPVRVRS------------PIVGKSSK 120

Query: 121  IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
              +SP                    + ++S  +V+P  +       D N P        +
Sbjct: 121  HFMSPT-------------------ISAASKIAVSPRKKVL----GDRNEP--------A 180

Query: 181  TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
              +V F G++  S       S  R    PE  A  +DS S I P++ S +A       KT
Sbjct: 181  RSSVSFSGMKSSS-----LNSVNRSLEAPE--ALESDSNSQIPPVSNSKVA-------KT 240

Query: 241  VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
            V FGGFEVISD  DDSESTYR+  +P    VT+AVE   + E   +S S  A    E+S 
Sbjct: 241  VRFGGFEVISDSFDDSESTYRYDLNPET-VVTMAVETGMKSENVQVSKSTNAVAPSESSN 300

Query: 301  IMRFSDFE--AVSNNALESS-VNSNLTEEVDSVNLDPSFNISPVSSPMIAPI-----ITP 360
                S+FE  +VSNN L+S    SNLTEEVD VNLD    ISPVSSP IAP+     + P
Sbjct: 301  ----SEFEVISVSNNDLDSPPAKSNLTEEVDCVNLDQ--RISPVSSPTIAPLDADPSLPP 360

Query: 361  YDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELF---------SSSEETDCEDPQKES 420
            YDPKTNYLSPRPQFLHY PNRRINR  PDGR E+            S EETD ED  KE 
Sbjct: 361  YDPKTNYLSPRPQFLHYRPNRRINRYEPDGRLEDKLLSFANVSESESVEETDSEDSSKEL 420

Query: 421  DEVSSNESQMKEEEKEEEE--------VNVSIQGPTEVKKSSKPLLSRIFKISSLLLILF 480
            DE SSN SQM+EEE EEEE        +NVS Q PTEV+KS K  LSRIFKISSLLLILF
Sbjct: 421  DEASSNGSQMEEEEAEEEEEEEEEEDGINVSEQCPTEVQKSWKVSLSRIFKISSLLLILF 480

Query: 481  TACLSICVVNVHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHANSISFISDV 540
            TAC SI VVNVHDP+IF+R + LTM D SEI+E AKTNFNV V KLE+W+ NSISFISD+
Sbjct: 481  TACFSIYVVNVHDPSIFKRPSSLTMEDASEIYELAKTNFNVFVQKLEVWNVNSISFISDM 540

Query: 541  VFNFRGGPPLIHL-NQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMKERVIVDT 600
            VFNFRG  PLIH  NQTEFF    N +EQCLVLSHQ VW EEN L N MEAMK+R   D 
Sbjct: 541  VFNFRGALPLIHYENQTEFF----NMNEQCLVLSHQTVWGEENTL-NVMEAMKDRE-TDI 600

Query: 601  FEEPIDREGQNKEGQEQ--EEDAQEVEAIKVREIGI--ETFERESHNEEVEEES----FQ 660
            FEEPI+ E + +EG+    EE     +  +  EIGI  E  ERES  EE E+E      Q
Sbjct: 601  FEEPIEIEERQEEGEIDIFEELINIEKRQEEEEIGIFEEPVERESEKEEQEQEQQVDLSQ 660

Query: 661  EIEARTNDSADIEEENNEA-SEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIG 720
            EIEA       IE    E+ +EE L E+    +G G N                      
Sbjct: 661  EIEAMKMREIGIENSEKESQNEEELGEV--SFQGSGVN---------------------- 720

Query: 721  IETVERESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEKTVENFKASSSSDFK 780
                  E +N EV EEP ++ +EE    ++ + L E  EEEY+QEK+ +NF+ SSS DFK
Sbjct: 721  ---ANEEEKNGEVFEEPLEEINEEALKNSASDELCE--EEEYIQEKSEDNFRFSSSDDFK 780

Query: 781  LHDEI--EKAAATGETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGI 840
             HD+I  E AAATGET+   NTEFQYQSPPVSS P+E Q D E E GG+ +D+IRT TGI
Sbjct: 781  FHDQIKQEAAAATGETEVAKNTEFQYQSPPVSS-PAERQPDFEHEIGGRTIDVIRTETGI 840

Query: 841  SRDFTQNTAAIISAILLGLFLIIPAGLIYARKSSSRRTTSTAAIAEEQQEEPLLKDKKTN 900
            S DFTQ  A IISAILLGL L + AGLIY RKS S+    ++   E+++E+PL+     N
Sbjct: 841  SPDFTQTKAIIISAILLGLSL-VTAGLIYGRKSCSKPPPPSSIAEEQEKEQPLM-----N 900

Query: 901  QSLVEEEEEEALDDDDEDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRR 960
             S VEE+      DD+EDDM GEF  SETSS FQYSS+REGET+  K             
Sbjct: 901  TSRVEEK------DDEEDDMGGEFSISETSS-FQYSSMREGETKEDK------------- 910

Query: 961  ENSRRESIAEAATRSSEVQSHSHGRRKIRMENSRRESMASSSLDDYSVSSSASPSYGRFT 1015
                         + +EV+SHSHGRRK++ +NSRRESMASSSLD+YS+S+SASPSYG FT
Sbjct: 961  -------------KMNEVESHSHGRRKMK-KNSRRESMASSSLDEYSLSTSASPSYGSFT 910
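
The percentages on the HSP summary line are the match counts divided by the alignment length. The sketch below parses the summary line of the alignment above and recomputes both values. `parse_hsp_stats` is a hypothetical helper written for this illustration, and the pattern assumes the exact line layout shown in this record rather than any general BLAST report format.

```python
import re

# Summary line of the first TrEMBL hit (E5GBH8_CUCME), as shown above.
stats_line = (
    "Identity = 550/1053 (52.23%), Positives = 652/1053 (61.92%), "
    "Query Frame = 1"
)

# 'Posi?tives' also accepts the misspelling 'Postives' seen in some exports.
PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return match counts, alignment length, and reported percentages."""
    m = PATTERN.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "align_len": int(length),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(stats_line)

# Recompute the percentages from the raw counts.
identity_pct = round(100 * stats["identical"] / stats["align_len"], 2)
positives_pct = round(100 * stats["positives"] / stats["align_len"], 2)
print(identity_pct, positives_pct)
```

For this hit the recomputed values agree with the reported ones: 550 of 1053 aligned positions are identical (52.23%) and 652 of 1053 are scored as positive (61.92%).
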

BLAST of Cp4.1LG14g04750 vs. TrEMBL
Match: A0A0A0KUZ2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000550 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 7.3e-196
Identity = 534/1052 (50.76%), Positives = 646/1052 (61.41%), Query Frame = 1

Query: 1    MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
            MALPSNRSSSPS ++GRT P SR+SEI NP+ RSFS NPFSK SI  + + LNPITPANS
Sbjct: 1    MALPSNRSSSPSFLSGRTSPNSRSSEISNPIRRSFSGNPFSKQSIVANPRGLNPITPANS 60

Query: 61   PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
            PSDYP +RNSV+RE  FTSRD  +KENGKDQSPK  RVRS              +  +SK
Sbjct: 61   PSDYP-RRNSVNRENSFTSRDISEKENGKDQSPKPVRVRS------------PIVGKSSK 120

Query: 121  IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
              +SP                    + ++S  +V+P  +       D N P        +
Sbjct: 121  HFMSPT-------------------ISAASKIAVSPRKKVL----GDRNEP--------A 180

Query: 181  TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
              ++ F G++  S       S  R    PE  A  +D+ S I P++       +SK++K 
Sbjct: 181  RSSISFSGMKSSS-----LNSVNRSLEAPE--ALESDTNSQIPPVS-------NSKTAKI 240

Query: 241  VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
            V FGGFEVISD  DDS+STYR+  +P    VT+AVE +       +S S  A    E S 
Sbjct: 241  VRFGGFEVISDSFDDSKSTYRYDLNPEM-VVTMAVETDMTSGNAQVSKSTNAVAPSEPSN 300

Query: 301  IMRFSDFE--AVSNNALESS-VNSNLTEEVDSVNLD--PSFNISPVSSPMIAPI-----I 360
                S+F   +VSNN L+S    SNLTEEVD VNLD   SF ISPVSSP IAP+     +
Sbjct: 301  ----SEFAVISVSNNDLDSPPAKSNLTEEVDCVNLDLDQSFKISPVSSPTIAPLDADPSL 360

Query: 361  TPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELF---------SSSEETDCEDPQK 420
             PYDPKTNYLSPRPQFLHY PNRRINR  PDGR EE            S EETD ED  K
Sbjct: 361  PPYDPKTNYLSPRPQFLHYRPNRRINRFEPDGRLEEKLLSFANVSESESVEETDSEDSSK 420

Query: 421  ESDEVSSNESQMKEEE----KEEEEVNVSIQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
            E DE SSNESQM+EEE    +EEE +NVS Q PT+V+KS K  +SRIFKISSLLLILFTA
Sbjct: 421  ELDEASSNESQMEEEEDEVEEEEEGINVSEQSPTKVQKSWKVSVSRIFKISSLLLILFTA 480

Query: 481  CLSICVVNVHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHANSISFISDVVF 540
            C S+ VVNVHDP+IF+R + LTM D SEI+E AKTNFNV V KLE+W+ NSISFISD+VF
Sbjct: 481  CFSLYVVNVHDPSIFKRPSSLTMEDASEIYELAKTNFNVFVQKLEVWNVNSISFISDMVF 540

Query: 541  NFRGGPPLIHL-NQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMKERVIVDTFE 600
            NFRGG PL+H  NQTEFF    N +EQCLVLSHQ VWEEE N++N MEAMK+    D FE
Sbjct: 541  NFRGGLPLVHYENQTEFF----NMNEQCLVLSHQTVWEEE-NILNVMEAMKDG-DTDIFE 600

Query: 601  EPIDREGQNKEGQEQEEDAQEVEAIKVR----EIGI--ETFERESHNEEVEEES----FQ 660
            EPI+ E + +E  E+ +  +E+  I+ R    EIGI  E  ERES NEE E+E      Q
Sbjct: 601  EPIEIEERQEE--EETDIFEELVGIEKRPEEEEIGIFEEPVERESENEEQEQEQQVDLLQ 660

Query: 661  EIEARTNDSADIEEENNEASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIGI 720
            EIEA       IE    E+  E   E +E +  +G +     ++E+  +     + EI  
Sbjct: 661  EIEAMKMREIGIENFERESQNE---EELEEVSFQGSDEVNANEEEKNGEVFEEPLEEINE 720

Query: 721  ETVERESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEKTVENFKASSSSDFKL 780
            ET E  + +E  E                        EEEY+QEK+ +NFK SS+ DFK 
Sbjct: 721  ETSENSASDELCE------------------------EEEYIQEKSEDNFKFSSTDDFKF 780

Query: 781  HDEI--EKAAATGETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGIS 840
            HD+I  E AAATGET+   NTE QYQSPPV     E Q+D + E GG+ +D+IRT  GIS
Sbjct: 781  HDQIRQEAAAATGETEGAKNTELQYQSPPV-----ERQTDFDHEIGGRTIDVIRTEIGIS 840

Query: 841  RDFTQNTAAIISAILLGLFLIIPAGLIYARKSSSRRTTSTAAIAEEQQEEPLLKDKKTNQ 900
            RDFTQ  A IISAILLGL L + AGLIY RKS S+    + A  E+++E+PL+     N 
Sbjct: 841  RDFTQTKAIIISAILLGLSL-VTAGLIYGRKSGSKPPPLSIA-DEQKKEQPLM-----NM 900

Query: 901  SLVEEEEEEALDDDDEDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRE 960
            S VEE+      DD+EDDM GEF  SETSS FQYSS+REGET+A K  +E +SHSHVRR+
Sbjct: 901  SRVEEK------DDEEDDMGGEFSISETSS-FQYSSMREGETKADKTLNEVESHSHVRRK 905

Query: 961  NSRRESIAEAATRSSEVQSHSHGRRKIRMENSRRESMASSSLDDYSVSSSASPSYGRFTT 1015
              +                           NSRRESMA SSLD+YS+S+SASPSYG FTT
Sbjct: 961  MKK---------------------------NSRRESMA-SSLDEYSLSTSASPSYGSFTT 905

BLAST of Cp4.1LG14g04750 vs. TrEMBL
Match: A0A061E0N4_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_007273 PE=4 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 1.1e-50
Identity = 270/755 (35.76%), Positives = 372/755 (49.27%), Query Frame = 1

Query: 1   MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
           MA P+  SSS   +  RT P  + SEI +P+ RSFS NPF+KPSI T+ ++ NP TPANS
Sbjct: 1   MASPAKTSSSS--LPCRTNPNMKKSEISDPMRRSFSGNPFAKPSIVTNPRTFNPSTPANS 60

Query: 61  PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
           PSD+P +R+S  RE + + RD+ DKEN KDQ+PK TRVRSP   K  KNFMS TISAASK
Sbjct: 61  PSDFP-RRHSAGRESVASLRDS-DKENSKDQNPKPTRVRSPAPSKGSKNFMSPTISAASK 120

Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
           I  SP+KKIL +RNE VRSS+SFS +KS        TPE ++                  
Sbjct: 121 INASPRKKILVERNESVRSSVSFSDVKSLIKEDNESTPEIAL------------------ 180

Query: 181 TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
            K  R    +V S   ED E+T    LN + V + +D KS I+   +S      S + K 
Sbjct: 181 -KQKRVSSSDVKSVIMED-EATPEIGLNQKKV-SFSDVKSIIMADNQSTPV--ISVNQKK 240

Query: 241 VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
           VTF   +V S   DD EST + G            + N E    P   S    V  E  K
Sbjct: 241 VTFA--DVKSVVMDDDESTPQIG----------LKQKNVEV---PHDSSSSNHVYEEPLK 300

Query: 301 IMRFSDFEAVSNNALESSVNSNLTEEVDSVNLDPSFNISP-----VSSPMIAPI-----I 360
               +DF+   +      +   +TEE DSVN+DPSF ISP      S P++AP+     +
Sbjct: 301 --SNADFDYKESKHDSDLLPETVTEENDSVNVDPSFKISPRVSITPSCPILAPLDADPSM 360

Query: 361 TPYDPKTNYLSPRPQFLHYNPNRRIN----RPDGRFEELFSSSE--------ETDCEDPQ 420
            PYDPKTNYLSPRPQFLHY PN RI+    R   + EE F+S          ET C+  Q
Sbjct: 361 PPYDPKTNYLSPRPQFLHYRPNPRIDLYREREGKQLEEHFASESYSDTEVTGETQCDASQ 420

Query: 421 KESDEVSSNESQMKEEEKEEEEVNVSIQGP------TEVKKSSKPLLSRIFKISSLLLIL 480
           +ES+++SS E+   + E EEEE+  S + P       E  + SKP  S   K  + LL+L
Sbjct: 421 RESEDISSEETM--KGEGEEEELYASERNPIAHDMVEESLRMSKPRFSTRSKFIAFLLVL 480

Query: 481 FTACLSICVVNVHDPTIFERSTL----LTMGDQSEIFESAKTNFNVLVGKLEIWHANSIS 540
             A  SI V N   PT F  S L    L++    E+ E AK NF+     L+   A  +S
Sbjct: 481 AFAYFSILVAN--SPT-FAPSGLGDLSLSIQVPPEVSEFAKANFDRFTQYLQHLSARFLS 540

Query: 541 FISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLI---NAMEAMK 600
            +S+++ + R     +H     F Y +++     L+  H +    E +L+   + ++ ++
Sbjct: 541 CVSNIISSSRE----VH-RTVSFQYANLSH----LLEDHIS----EGHLLFDCSVVDPVR 600

Query: 601 ERVIVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHNEEVEEESFQE 660
           ER    T+ + I+ +    E  EQE   QE +  +  E        E  + E  +E+ Q 
Sbjct: 601 ER---GTYHQEIEADEAVDEDDEQEIKEQEDQESQAYE------NLELVSGEEPDEAQQG 660

Query: 661 IEARTNDSADIEEENNEASEESLQEIIEHIEGEGQN-----IEGQEQQEEAQDTEAM--- 713
           IEA   +   +E E NE  E + Q   EH      N     I    +  ++ +TE +   
Sbjct: 661 IEAEMIELDHLEAEENEGVEFAAQIDAEHQSNVNLNHLPSIIPQAAEVSKSGNTEGVDLK 684

BLAST of Cp4.1LG14g04750 vs. TrEMBL
Match: A0A061E2G4_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_007273 PE=4 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 1.1e-50
Identity = 270/755 (35.76%), Positives = 372/755 (49.27%), Query Frame = 1

Query: 1   MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
           MA P+  SSS   +  RT P  + SEI +P+ RSFS NPF+KPSI T+ ++ NP TPANS
Sbjct: 1   MASPAKTSSSS--LPCRTNPNMKKSEISDPMRRSFSGNPFAKPSIVTNPRTFNPSTPANS 60

Query: 61  PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
           PSD+P +R+S  RE + + RD+ DKEN KDQ+PK TRVRSP   K  KNFMS TISAASK
Sbjct: 61  PSDFP-RRHSAGRESVASLRDS-DKENSKDQNPKPTRVRSPAPSKGSKNFMSPTISAASK 120

Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
           I  SP+KKIL +RNE VRSS+SFS +KS        TPE ++                  
Sbjct: 121 INASPRKKILVERNESVRSSVSFSDVKSLIKEDNESTPEIAL------------------ 180

Query: 181 TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
            K  R    +V S   ED E+T    LN + V + +D KS I+   +S      S + K 
Sbjct: 181 -KQKRVSSSDVKSVIMED-EATPEIGLNQKKV-SFSDVKSIIMADNQSTPV--ISVNQKK 240

Query: 241 VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
           VTF   +V S   DD EST + G            + N E    P   S    V  E  K
Sbjct: 241 VTFA--DVKSVVMDDDESTPQIG----------LKQKNVEV---PHDSSSSNHVYEEPLK 300

Query: 301 IMRFSDFEAVSNNALESSVNSNLTEEVDSVNLDPSFNISP-----VSSPMIAPI-----I 360
               +DF+   +      +   +TEE DSVN+DPSF ISP      S P++AP+     +
Sbjct: 301 --SNADFDYKESKHDSDLLPETVTEENDSVNVDPSFKISPRVSITPSCPILAPLDADPSM 360

Query: 361 TPYDPKTNYLSPRPQFLHYNPNRRIN----RPDGRFEELFSSSE--------ETDCEDPQ 420
            PYDPKTNYLSPRPQFLHY PN RI+    R   + EE F+S          ET C+  Q
Sbjct: 361 PPYDPKTNYLSPRPQFLHYRPNPRIDLYREREGKQLEEHFASESYSDTEVTGETQCDASQ 420

Query: 421 KESDEVSSNESQMKEEEKEEEEVNVSIQGP------TEVKKSSKPLLSRIFKISSLLLIL 480
           +ES+++SS E+   + E EEEE+  S + P       E  + SKP  S   K  + LL+L
Sbjct: 421 RESEDISSEETM--KGEGEEEELYASERNPIAHDMVEESLRMSKPRFSTRSKFIAFLLVL 480

Query: 481 FTACLSICVVNVHDPTIFERSTL----LTMGDQSEIFESAKTNFNVLVGKLEIWHANSIS 540
             A  SI V N   PT F  S L    L++    E+ E AK NF+     L+   A  +S
Sbjct: 481 AFAYFSILVAN--SPT-FAPSGLGDLSLSIQVPPEVSEFAKANFDRFTQYLQHLSARFLS 540

Query: 541 FISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLI---NAMEAMK 600
            +S+++ + R     +H     F Y +++     L+  H +    E +L+   + ++ ++
Sbjct: 541 CVSNIISSSRE----VH-RTVSFQYANLSH----LLEDHIS----EGHLLFDCSVVDPVR 600

Query: 601 ERVIVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHNEEVEEESFQE 660
           ER    T+ + I+ +    E  EQE   QE +  +  E        E  + E  +E+ Q 
Sbjct: 601 ER---GTYHQEIEADEAVDEDDEQEIKEQEDQESQAYE------NLELVSGEEPDEAQQG 660

Query: 661 IEARTNDSADIEEENNEASEESLQEIIEHIEGEGQN-----IEGQEQQEEAQDTEAM--- 713
           IEA   +   +E E NE  E + Q   EH      N     I    +  ++ +TE +   
Sbjct: 661 IEAEMIELDHLEAEENEGVEFAAQIDAEHQSNVNLNHLPSIIPQAAEVSKSGNTEGVDLK 684

BLAST of Cp4.1LG14g04750 vs. TrEMBL
Match: W9QZR6_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003739 PE=4 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 3.8e-35
Identity = 100/183 (54.64%), Positives = 122/183 (66.67%), Query Frame = 1

Query: 1   MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
           MA P N+SSS S + GR  P +R+SEIGNP+ RSF+ NPFSKPSI  + K LNP TP NS
Sbjct: 1   MAAPPNKSSS-SPIPGRANPNARSSEIGNPMRRSFAGNPFSKPSIVPNPKGLNPNTPVNS 60

Query: 61  PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
           PS+Y P+RNSVSRE + T RDNEDKENG     K+ ++RSP   K  KNFMS TISAASK
Sbjct: 61  PSEY-PRRNSVSRESVVTLRDNEDKENG-----KMVKLRSPMGSKGAKNFMSPTISAASK 120

Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
           I  SP+KKIL +RNEPV  S+SFS  K +S +   P  + S   + D  P  P +S    
Sbjct: 121 INASPRKKILEERNEPVSDSISFSEFKIASFSP--PIADLSDHKQEDKAPAAPPVSEDSK 174

Query: 181 TKT 184
            K+
Sbjct: 181 AKS 174

BLAST of Cp4.1LG14g04750 vs. TAIR10
Match: AT1G16630.1 (AT1G16630.1 unknown protein)

HSP 1 Score: 67.4 bits (163), Expect = 5.8e-11
Identity = 62/153 (40.52%), Positives = 76/153 (49.67%), Query Frame = 1

Query: 8   SSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANSPSDYPPQ 67
           SSSPSM + R  P  RNSE G+ + RSF  NPFS                       P +
Sbjct: 20  SSSPSMPS-RPNPKQRNSETGDLMRRSFRGNPFSAD---------------------PSR 79

Query: 68  RNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASKIAVSPKK 127
           RNS+ RE      +  DKEN  D+      V+ PT G   K+FMS TISA SKI  SP+K
Sbjct: 80  RNSIGRECS-NRVEIGDKENQNDKDQIANVVKGPTKGS--KHFMSPTISAVSKINPSPRK 139

Query: 128 KILGDRNE------------PVRSSLSFSGMKS 149
           KIL D+NE             V+SS+SFS + S
Sbjct: 140 KILSDKNEVSRSFDKSHHQVQVKSSVSFSDVIS 147

BLAST of Cp4.1LG14g04750 vs. TAIR10
Match: AT2G16270.1 (AT2G16270.1 unknown protein)

HSP 1 Score: 60.8 bits (146), Expect = 5.4e-09
Identity = 60/156 (38.46%), Positives = 78/156 (50.00%), Query Frame = 1

Query: 1   MALPSNRSSSPSM-VTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPAN 60
           MA P+N++ S S  +  R  P  RNSE G+P+ RSF  NPF                PAN
Sbjct: 1   MASPTNKNPSFSPPIPNRPNPKPRNSEAGDPLRRSFGGNPF----------------PAN 60

Query: 61  SPSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAAS 120
           S  + P   + ++R   F      DKEN + +  +LT        K  KNFMS TISA S
Sbjct: 61  SKVNIP---SDLTRRNSFGG----DKEN-ETKPVQLTP-------KGSKNFMSPTISAVS 120

Query: 121 KIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVN 156
           KI  SP+K++L D+NE    S SFS +K   L   N
Sbjct: 121 KINASPRKRVLSDKNE---MSRSFSDVKGLILEDDN 122

BLAST of Cp4.1LG14g04750 vs. NCBI nr
Match: gi|659108861|ref|XP_008454425.1| (PREDICTED: gelsolin-related protein of 125 kDa-like [Cucumis melo])

HSP 1 Score: 717.2 bits (1850), Expect = 4.0e-203
Identity = 550/1053 (52.23%), Positives = 652/1053 (61.92%), Query Frame = 1

Query: 1    MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
            MALPSNRSSSPS+ +GRT P SR+SEI NP+ RSFS NPFSK SI  + + LNPITPANS
Sbjct: 1    MALPSNRSSSPSLPSGRTSPTSRSSEISNPIRRSFSGNPFSKQSIVANPRGLNPITPANS 60

Query: 61   PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
            PSDYP +RNS++RE  FTSRD  +KENGKDQSPK  RVRS              +  +SK
Sbjct: 61   PSDYP-RRNSMNRENSFTSRDIPEKENGKDQSPKPVRVRS------------PIVGKSSK 120

Query: 121  IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
              +SP                    + ++S  +V+P  +       D N P        +
Sbjct: 121  HFMSPT-------------------ISAASKIAVSPRKKVL----GDRNEP--------A 180

Query: 181  TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
              +V F G++  S       S  R    PE  A  +DS S I P++ S +A       KT
Sbjct: 181  RSSVSFSGMKSSS-----LNSVNRSLEAPE--ALESDSNSQIPPVSNSKVA-------KT 240

Query: 241  VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
            V FGGFEVISD  DDSESTYR+  +P    VT+AVE   + E   +S S  A    E+S 
Sbjct: 241  VRFGGFEVISDSFDDSESTYRYDLNPET-VVTMAVETGMKSENVQVSKSTNAVAPSESSN 300

Query: 301  IMRFSDFE--AVSNNALESS-VNSNLTEEVDSVNLDPSFNISPVSSPMIAPI-----ITP 360
                S+FE  +VSNN L+S    SNLTEEVD VNLD    ISPVSSP IAP+     + P
Sbjct: 301  ----SEFEVISVSNNDLDSPPAKSNLTEEVDCVNLDQ--RISPVSSPTIAPLDADPSLPP 360

Query: 361  YDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELF---------SSSEETDCEDPQKES 420
            YDPKTNYLSPRPQFLHY PNRRINR  PDGR E+            S EETD ED  KE 
Sbjct: 361  YDPKTNYLSPRPQFLHYRPNRRINRYEPDGRLEDKLLSFANVSESESVEETDSEDSSKEL 420

Query: 421  DEVSSNESQMKEEEKEEEE--------VNVSIQGPTEVKKSSKPLLSRIFKISSLLLILF 480
            DE SSN SQM+EEE EEEE        +NVS Q PTEV+KS K  LSRIFKISSLLLILF
Sbjct: 421  DEASSNGSQMEEEEAEEEEEEEEEEDGINVSEQCPTEVQKSWKVSLSRIFKISSLLLILF 480

Query: 481  TACLSICVVNVHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHANSISFISDV 540
            TAC SI VVNVHDP+IF+R + LTM D SEI+E AKTNFNV V KLE+W+ NSISFISD+
Sbjct: 481  TACFSIYVVNVHDPSIFKRPSSLTMEDASEIYELAKTNFNVFVQKLEVWNVNSISFISDM 540

Query: 541  VFNFRGGPPLIHL-NQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMKERVIVDT 600
            VFNFRG  PLIH  NQTEFF    N +EQCLVLSHQ VW EEN L N MEAMK+R   D 
Sbjct: 541  VFNFRGALPLIHYENQTEFF----NMNEQCLVLSHQTVWGEENTL-NVMEAMKDRE-TDI 600

Query: 601  FEEPIDREGQNKEGQEQ--EEDAQEVEAIKVREIGI--ETFERESHNEEVEEES----FQ 660
            FEEPI+ E + +EG+    EE     +  +  EIGI  E  ERES  EE E+E      Q
Sbjct: 601  FEEPIEIEERQEEGEIDIFEELINIEKRQEEEEIGIFEEPVERESEKEEQEQEQQVDLSQ 660

Query: 661  EIEARTNDSADIEEENNEA-SEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIG 720
            EIEA       IE    E+ +EE L E+    +G G N                      
Sbjct: 661  EIEAMKMREIGIENSEKESQNEEELGEV--SFQGSGVN---------------------- 720

Query: 721  IETVERESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEKTVENFKASSSSDFK 780
                  E +N EV EEP ++ +EE    ++ + L E  EEEY+QEK+ +NF+ SSS DFK
Sbjct: 721  ---ANEEEKNGEVFEEPLEEINEEALKNSASDELCE--EEEYIQEKSEDNFRFSSSDDFK 780

Query: 781  LHDEI--EKAAATGETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGI 840
             HD+I  E AAATGET+   NTEFQYQSPPVSS P+E Q D E E GG+ +D+IRT TGI
Sbjct: 781  FHDQIKQEAAAATGETEVAKNTEFQYQSPPVSS-PAERQPDFEHEIGGRTIDVIRTETGI 840

Query: 841  SRDFTQNTAAIISAILLGLFLIIPAGLIYARKSSSRRTTSTAAIAEEQQEEPLLKDKKTN 900
            S DFTQ  A IISAILLGL L + AGLIY RKS S+    ++   E+++E+PL+     N
Sbjct: 841  SPDFTQTKAIIISAILLGLSL-VTAGLIYGRKSCSKPPPPSSIAEEQEKEQPLM-----N 900

Query: 901  QSLVEEEEEEALDDDDEDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRR 960
             S VEE+      DD+EDDM GEF  SETSS FQYSS+REGET+  K             
Sbjct: 901  TSRVEEK------DDEEDDMGGEFSISETSS-FQYSSMREGETKEDK------------- 910

Query: 961  ENSRRESIAEAATRSSEVQSHSHGRRKIRMENSRRESMASSSLDDYSVSSSASPSYGRFT 1015
                         + +EV+SHSHGRRK++ +NSRRESMASSSLD+YS+S+SASPSYG FT
Sbjct: 961  -------------KMNEVESHSHGRRKMK-KNSRRESMASSSLDEYSLSTSASPSYGSFT 910

BLAST of Cp4.1LG14g04750 vs. NCBI nr
Match: gi|449465121|ref|XP_004150277.1| (PREDICTED: uncharacterized protein LOC101223143 [Cucumis sativus])

HSP 1 Score: 692.6 bits (1786), Expect = 1.0e-195
Identity = 534/1052 (50.76%), Positives = 646/1052 (61.41%), Query Frame = 1

Query: 1    MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
            MALPSNRSSSPS ++GRT P SR+SEI NP+ RSFS NPFSK SI  + + LNPITPANS
Sbjct: 1    MALPSNRSSSPSFLSGRTSPNSRSSEISNPIRRSFSGNPFSKQSIVANPRGLNPITPANS 60

Query: 61   PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
            PSDYP +RNSV+RE  FTSRD  +KENGKDQSPK  RVRS              +  +SK
Sbjct: 61   PSDYP-RRNSVNRENSFTSRDISEKENGKDQSPKPVRVRS------------PIVGKSSK 120

Query: 121  IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
              +SP                    + ++S  +V+P  +       D N P        +
Sbjct: 121  HFMSPT-------------------ISAASKIAVSPRKKVL----GDRNEP--------A 180

Query: 181  TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
              ++ F G++  S       S  R    PE  A  +D+ S I P++       +SK++K 
Sbjct: 181  RSSISFSGMKSSS-----LNSVNRSLEAPE--ALESDTNSQIPPVS-------NSKTAKI 240

Query: 241  VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
            V FGGFEVISD  DDS+STYR+  +P    VT+AVE +       +S S  A    E S 
Sbjct: 241  VRFGGFEVISDSFDDSKSTYRYDLNPEM-VVTMAVETDMTSGNAQVSKSTNAVAPSEPSN 300

Query: 301  IMRFSDFE--AVSNNALESS-VNSNLTEEVDSVNLD--PSFNISPVSSPMIAPI-----I 360
                S+F   +VSNN L+S    SNLTEEVD VNLD   SF ISPVSSP IAP+     +
Sbjct: 301  ----SEFAVISVSNNDLDSPPAKSNLTEEVDCVNLDLDQSFKISPVSSPTIAPLDADPSL 360

Query: 361  TPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELF---------SSSEETDCEDPQK 420
             PYDPKTNYLSPRPQFLHY PNRRINR  PDGR EE            S EETD ED  K
Sbjct: 361  PPYDPKTNYLSPRPQFLHYRPNRRINRFEPDGRLEEKLLSFANVSESESVEETDSEDSSK 420

Query: 421  ESDEVSSNESQMKEEE----KEEEEVNVSIQGPTEVKKSSKPLLSRIFKISSLLLILFTA 480
            E DE SSNESQM+EEE    +EEE +NVS Q PT+V+KS K  +SRIFKISSLLLILFTA
Sbjct: 421  ELDEASSNESQMEEEEDEVEEEEEGINVSEQSPTKVQKSWKVSVSRIFKISSLLLILFTA 480

Query: 481  CLSICVVNVHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHANSISFISDVVF 540
            C S+ VVNVHDP+IF+R + LTM D SEI+E AKTNFNV V KLE+W+ NSISFISD+VF
Sbjct: 481  CFSLYVVNVHDPSIFKRPSSLTMEDASEIYELAKTNFNVFVQKLEVWNVNSISFISDMVF 540

Query: 541  NFRGGPPLIHL-NQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMKERVIVDTFE 600
            NFRGG PL+H  NQTEFF    N +EQCLVLSHQ VWEEE N++N MEAMK+    D FE
Sbjct: 541  NFRGGLPLVHYENQTEFF----NMNEQCLVLSHQTVWEEE-NILNVMEAMKDG-DTDIFE 600

Query: 601  EPIDREGQNKEGQEQEEDAQEVEAIKVR----EIGI--ETFERESHNEEVEEES----FQ 660
            EPI+ E + +E  E+ +  +E+  I+ R    EIGI  E  ERES NEE E+E      Q
Sbjct: 601  EPIEIEERQEE--EETDIFEELVGIEKRPEEEEIGIFEEPVERESENEEQEQEQQVDLLQ 660

Query: 661  EIEARTNDSADIEEENNEASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIGI 720
            EIEA       IE    E+  E   E +E +  +G +     ++E+  +     + EI  
Sbjct: 661  EIEAMKMREIGIENFERESQNE---EELEEVSFQGSDEVNANEEEKNGEVFEEPLEEINE 720

Query: 721  ETVERESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEKTVENFKASSSSDFKL 780
            ET E  + +E  E                        EEEY+QEK+ +NFK SS+ DFK 
Sbjct: 721  ETSENSASDELCE------------------------EEEYIQEKSEDNFKFSSTDDFKF 780

Query: 781  HDEI--EKAAATGETQEETNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATGIS 840
            HD+I  E AAATGET+   NTE QYQSPPV     E Q+D + E GG+ +D+IRT  GIS
Sbjct: 781  HDQIRQEAAAATGETEGAKNTELQYQSPPV-----ERQTDFDHEIGGRTIDVIRTEIGIS 840

Query: 841  RDFTQNTAAIISAILLGLFLIIPAGLIYARKSSSRRTTSTAAIAEEQQEEPLLKDKKTNQ 900
            RDFTQ  A IISAILLGL L + AGLIY RKS S+    + A  E+++E+PL+     N 
Sbjct: 841  RDFTQTKAIIISAILLGLSL-VTAGLIYGRKSGSKPPPLSIA-DEQKKEQPLM-----NM 900

Query: 901  SLVEEEEEEALDDDDEDDMAGEFCSSETSSFFQYSSVREGETEAAKRSSEFQSHSHVRRE 960
            S VEE+      DD+EDDM GEF  SETSS FQYSS+REGET+A K  +E +SHSHVRR+
Sbjct: 901  SRVEEK------DDEEDDMGGEFSISETSS-FQYSSMREGETKADKTLNEVESHSHVRRK 905

Query: 961  NSRRESIAEAATRSSEVQSHSHGRRKIRMENSRRESMASSSLDDYSVSSSASPSYGRFTT 1015
              +                           NSRRESMA SSLD+YS+S+SASPSYG FTT
Sbjct: 961  MKK---------------------------NSRRESMA-SSLDEYSLSTSASPSYGSFTT 905

BLAST of Cp4.1LG14g04750 vs. NCBI nr
Match: gi|1009150252|ref|XP_015892920.1| (PREDICTED: uncharacterized protein LOC107427091 [Ziziphus jujuba])

HSP 1 Score: 218.8 bits (556), Expect = 4.4e-53
Identity = 260/773 (33.64%), Positives = 384/773 (49.68%), Query Frame = 1

Query: 1   MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
           MALPSN+SSS S ++GR  P +RNSE+GNP+ RSF+ NPF+KPSI  + +  N  TP NS
Sbjct: 1   MALPSNKSSS-SPISGRANPNARNSEMGNPIRRSFTGNPFTKPSIVANPRGFNHNTPVNS 60

Query: 61  PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
           P     +RNSV R+ + T RD EDKENGKDQ+ K ++VRSP   K  KNFMS TISAASK
Sbjct: 61  PEHL--RRNSVGRDTIVTFRDYEDKENGKDQNSKPSKVRSPMGSKGTKNFMSPTISAASK 120

Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYP-- 180
           I  SP+KKIL +RNEP+R+S++FS  K  + +      E      + +  P    S P  
Sbjct: 121 ITQSPRKKILAERNEPLRASVTFSESKIPTFSQAIAESEHKEEDLTASLNPKTFKSEPAH 180

Query: 181 KSTKT-VRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKS 240
           ++T+T V F   ++ + S  +++S  R + +P                     A   SK+
Sbjct: 181 ETTRTSVSFSDSKIRTFSQTNADSERREEEDP--------------------TAPLDSKA 240

Query: 241 SKTVTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPE 300
           SKT      E + D      ST   G +  PE +     +N   E   ++      ++P 
Sbjct: 241 SKT------EPVHD------STEPLGSNNLPETLCEPQNSNVTEETDSVNLDPSFKISPP 300

Query: 301 ASKIMRFSDFEAVSNNALESSVNSNLTEEVDSVNLDPSFNISPVSSPMIAPIITPYDPKT 360
           A                  S  +S+L   V S++ DPS              + PYDPKT
Sbjct: 301 AI-----------------SYQSSSLFPTVASLDSDPS--------------MPPYDPKT 360

Query: 361 NYLSPRPQFLHYNPNRRI-----------NRPDGRF-EELFSSS---EETDCEDPQKESD 420
           NYLSPRPQFLHY PN R+            R D  F  E+FS S   EE   E  QKE +
Sbjct: 361 NYLSPRPQFLHYRPNPRVELYLLNKEREGKRLDDSFISEIFSDSDATEEAQSESSQKELE 420

Query: 421 EVSSNESQMKEEEKEEEEVNVSIQGPT----EVKKSSK-PLLSRIF---KISSLLLILFT 480
           +VSS+E  ++ +EK EEE+ VS   P     E+ +SSK     R F   K  +LL++L  
Sbjct: 421 DVSSSE-VIETKEKLEEELLVSEPSPAFMSEEIDESSKGETRPRFFWWKKFIALLMVLSF 480

Query: 481 ACLSICVVN--VHDPTIFERSTLLTMGDQSEIFESAKTNFNVLVGKLEIWHANSISFISD 540
           ACLSI + N  V D ++FE +T L + D  EI E  +T+++ L   L++W + S+S++++
Sbjct: 481 ACLSISITNSPVIDHSMFEGATFLKLYDHFEIAEFVRTSYDGLDRNLQVWSSKSMSYLAE 540

Query: 541 VVFNFRGGP----PLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLINAMEAMKERV 600
           ++ N  G       L + N T     DV K    +VL H N  E    +I +    + +V
Sbjct: 541 LISNLNGEKSKLGSLHYCNLTTLLMDDV-KSNGYMVLDHDN--EGSAEVIMSGSTGRRKV 600

Query: 601 IVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHNEEVEEESFQEIEA 660
            ++  EE         +GQ +  D++E+          ++F       E EE+   E E 
Sbjct: 601 HIEFSEE---------KGQPEIGDSEEI---------ADSFSEIHPIPECEEQVDLEAEP 660

Query: 661 RTNDSADIEEENNEASE-ESLQEIIEHIEGEGQNIEGQEQQE--EAQDT----EAMKVRE 720
           R N  +   EE ++ASE E ++ + +HI  + + +    + E  E Q+T    +   V E
Sbjct: 661 RANHISPDSEEVHQASEFEVVEPLFDHINPDTEKLHEAPESELVEPQETDVSGQVSMVTE 683

Query: 721 IGIETVE---RESQNEEVEEEPFQKDSEEENDEASEESLLEIVEEEYVQEKTV 732
               + E   R SQ  EV+ +    ++  E ++ SE S LE+V  +  ++ TV
Sbjct: 721 SATNSKEEPVRTSQAGEVQSK--VSEAVVELEDKSEISSLEVVPLKEAEDVTV 683

BLAST of Cp4.1LG14g04750 vs. NCBI nr
Match: gi|590687581|ref|XP_007042704.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 210.3 bits (534), Expect = 1.6e-50
Identity = 270/755 (35.76%), Positives = 372/755 (49.27%), Query Frame = 1

Query: 1   MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
           MA P+  SSS   +  RT P  + SEI +P+ RSFS NPF+KPSI T+ ++ NP TPANS
Sbjct: 1   MASPAKTSSSS--LPCRTNPNMKKSEISDPMRRSFSGNPFAKPSIVTNPRTFNPSTPANS 60

Query: 61  PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
           PSD+P +R+S  RE + + RD+ DKEN KDQ+PK TRVRSP   K  KNFMS TISAASK
Sbjct: 61  PSDFP-RRHSAGRESVASLRDS-DKENSKDQNPKPTRVRSPAPSKGSKNFMSPTISAASK 120

Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
           I  SP+KKIL +RNE VRSS+SFS +KS        TPE ++                  
Sbjct: 121 INASPRKKILVERNESVRSSVSFSDVKSLIKEDNESTPEIAL------------------ 180

Query: 181 TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
            K  R    +V S   ED E+T    LN + V + +D KS I+   +S      S + K 
Sbjct: 181 -KQKRVSSSDVKSVIMED-EATPEIGLNQKKV-SFSDVKSIIMADNQSTPV--ISVNQKK 240

Query: 241 VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
           VTF   +V S   DD EST + G            + N E    P   S    V  E  K
Sbjct: 241 VTFA--DVKSVVMDDDESTPQIG----------LKQKNVEV---PHDSSSSNHVYEEPLK 300

Query: 301 IMRFSDFEAVSNNALESSVNSNLTEEVDSVNLDPSFNISP-----VSSPMIAPI-----I 360
               +DF+   +      +   +TEE DSVN+DPSF ISP      S P++AP+     +
Sbjct: 301 --SNADFDYKESKHDSDLLPETVTEENDSVNVDPSFKISPRVSITPSCPILAPLDADPSM 360

Query: 361 TPYDPKTNYLSPRPQFLHYNPNRRIN----RPDGRFEELFSSSE--------ETDCEDPQ 420
            PYDPKTNYLSPRPQFLHY PN RI+    R   + EE F+S          ET C+  Q
Sbjct: 361 PPYDPKTNYLSPRPQFLHYRPNPRIDLYREREGKQLEEHFASESYSDTEVTGETQCDASQ 420

Query: 421 KESDEVSSNESQMKEEEKEEEEVNVSIQGP------TEVKKSSKPLLSRIFKISSLLLIL 480
           +ES+++SS E+   + E EEEE+  S + P       E  + SKP  S   K  + LL+L
Sbjct: 421 RESEDISSEETM--KGEGEEEELYASERNPIAHDMVEESLRMSKPRFSTRSKFIAFLLVL 480

Query: 481 FTACLSICVVNVHDPTIFERSTL----LTMGDQSEIFESAKTNFNVLVGKLEIWHANSIS 540
             A  SI V N   PT F  S L    L++    E+ E AK NF+     L+   A  +S
Sbjct: 481 AFAYFSILVAN--SPT-FAPSGLGDLSLSIQVPPEVSEFAKANFDRFTQYLQHLSARFLS 540

Query: 541 FISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLI---NAMEAMK 600
            +S+++ + R     +H     F Y +++     L+  H +    E +L+   + ++ ++
Sbjct: 541 CVSNIISSSRE----VH-RTVSFQYANLSH----LLEDHIS----EGHLLFDCSVVDPVR 600

Query: 601 ERVIVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHNEEVEEESFQE 660
           ER    T+ + I+ +    E  EQE   QE +  +  E        E  + E  +E+ Q 
Sbjct: 601 ER---GTYHQEIEADEAVDEDDEQEIKEQEDQESQAYE------NLELVSGEEPDEAQQG 660

Query: 661 IEARTNDSADIEEENNEASEESLQEIIEHIEGEGQN-----IEGQEQQEEAQDTEAM--- 713
           IEA   +   +E E NE  E + Q   EH      N     I    +  ++ +TE +   
Sbjct: 661 IEAEMIELDHLEAEENEGVEFAAQIDAEHQSNVNLNHLPSIIPQAAEVSKSGNTEGVDLK 684

BLAST of Cp4.1LG14g04750 vs. NCBI nr
Match: gi|590687585|ref|XP_007042705.1| (Uncharacterized protein isoform 2 [Theobroma cacao])

HSP 1 Score: 210.3 bits (534), Expect = 1.6e-50
Identity = 270/755 (35.76%), Positives = 372/755 (49.27%), Query Frame = 1

Query: 1   MALPSNRSSSPSMVTGRTGPISRNSEIGNPVYRSFSSNPFSKPSIATSLKSLNPITPANS 60
           MA P+  SSS   +  RT P  + SEI +P+ RSFS NPF+KPSI T+ ++ NP TPANS
Sbjct: 1   MASPAKTSSSS--LPCRTNPNMKKSEISDPMRRSFSGNPFAKPSIVTNPRTFNPSTPANS 60

Query: 61  PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
           PSD+P +R+S  RE + + RD+ DKEN KDQ+PK TRVRSP   K  KNFMS TISAASK
Sbjct: 61  PSDFP-RRHSAGRESVASLRDS-DKENSKDQNPKPTRVRSPAPSKGSKNFMSPTISAASK 120

Query: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISYPKS 180
           I  SP+KKIL +RNE VRSS+SFS +KS        TPE ++                  
Sbjct: 121 INASPRKKILVERNESVRSSVSFSDVKSLIKEDNESTPEIAL------------------ 180

Query: 181 TKTVRFGGVEVISGSYEDSESTYRYDLNPELVAAVTDSKSGIVPITKSAIAEGSSKSSKT 240
            K  R    +V S   ED E+T    LN + V + +D KS I+   +S      S + K 
Sbjct: 181 -KQKRVSSSDVKSVIMED-EATPEIGLNQKKV-SFSDVKSIIMADNQSTPV--ISVNQKK 240

Query: 241 VTFGGFEVISDFCDDSESTYRHGHHPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEASK 300
           VTF   +V S   DD EST + G            + N E    P   S    V  E  K
Sbjct: 241 VTFA--DVKSVVMDDDESTPQIG----------LKQKNVEV---PHDSSSSNHVYEEPLK 300

Query: 301 IMRFSDFEAVSNNALESSVNSNLTEEVDSVNLDPSFNISP-----VSSPMIAPI-----I 360
               +DF+   +      +   +TEE DSVN+DPSF ISP      S P++AP+     +
Sbjct: 301 --SNADFDYKESKHDSDLLPETVTEENDSVNVDPSFKISPRVSITPSCPILAPLDADPSM 360

Query: 361 TPYDPKTNYLSPRPQFLHYNPNRRIN----RPDGRFEELFSSSE--------ETDCEDPQ 420
            PYDPKTNYLSPRPQFLHY PN RI+    R   + EE F+S          ET C+  Q
Sbjct: 361 PPYDPKTNYLSPRPQFLHYRPNPRIDLYREREGKQLEEHFASESYSDTEVTGETQCDASQ 420

Query: 421 KESDEVSSNESQMKEEEKEEEEVNVSIQGP------TEVKKSSKPLLSRIFKISSLLLIL 480
           +ES+++SS E+   + E EEEE+  S + P       E  + SKP  S   K  + LL+L
Sbjct: 421 RESEDISSEETM--KGEGEEEELYASERNPIAHDMVEESLRMSKPRFSTRSKFIAFLLVL 480

Query: 481 FTACLSICVVNVHDPTIFERSTL----LTMGDQSEIFESAKTNFNVLVGKLEIWHANSIS 540
             A  SI V N   PT F  S L    L++    E+ E AK NF+     L+   A  +S
Sbjct: 481 AFAYFSILVAN--SPT-FAPSGLGDLSLSIQVPPEVSEFAKANFDRFTQYLQHLSARFLS 540

Query: 541 FISDVVFNFRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLI---NAMEAMK 600
            +S+++ + R     +H     F Y +++     L+  H +    E +L+   + ++ ++
Sbjct: 541 CVSNIISSSRE----VH-RTVSFQYANLSH----LLEDHIS----EGHLLFDCSVVDPVR 600

Query: 601 ERVIVDTFEEPIDREGQNKEGQEQEEDAQEVEAIKVREIGIETFERESHNEEVEEESFQE 660
           ER    T+ + I+ +    E  EQE   QE +  +  E        E  + E  +E+ Q 
Sbjct: 601 ER---GTYHQEIEADEAVDEDDEQEIKEQEDQESQAYE------NLELVSGEEPDEAQQG 660

Query: 661 IEARTNDSADIEEENNEASEESLQEIIEHIEGEGQN-----IEGQEQQEEAQDTEAM--- 713
           IEA   +   +E E NE  E + Q   EH      N     I    +  ++ +TE +   
Sbjct: 661 IEAEMIELDHLEAEENEGVEFAAQIDAEHQSNVNLNHLPSIIPQAAEVSKSGNTEGVDLK 684
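
The percentages in each HSP summary line above are the identity and positive counts divided by the alignment length. As a minimal sketch (the helper name `parse_hsp_summary` and the returned field names are hypothetical and not part of this database export), such a line can be parsed and its percentages recomputed like this:

```python
import re

# Hypothetical helper for illustration only. It matches the summary-line layout
# used on this page and accepts either spelling of the "Positives" label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return the counts from an HSP summary line plus recomputed percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals as on the page.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the summary line, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

summary = parse_hsp_summary(
    "Identity = 534/1052 (50.76%), Positives = 646/1052 (61.41%), Query Frame = 1"
)
print(summary["identity_pct"], summary["positive_pct"])  # 50.76 61.41
```

Comparing the recomputed and reported values is a quick consistency check when these records are collected in bulk.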

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
E5GBH8_CUCME      2.8e-203  52.23     Putative uncharacterized protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A0A0KUZ2_CUCSA  7.3e-196  50.76     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000550 PE=4 SV=1
A0A061E0N4_THECC  1.1e-50   35.76     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_007273 PE=4 SV=1
A0A061E2G4_THECC  1.1e-50   35.76     Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_007273 PE=4 SV=1
W9QZR6_9ROSA      3.8e-35   54.64     Uncharacterized protein OS=Morus notabilis GN=L484_003739 PE=4 SV=1
Match Name   E-value  Identity  Description
AT1G16630.1  5.8e-11  40.52     unknown protein
AT2G16270.1  5.4e-09  38.46     unknown protein
Match Name                         E-value   Identity  Description
gi|659108861|ref|XP_008454425.1|   4.0e-203  52.23     PREDICTED: gelsolin-related protein of 125 kDa-like [Cucumis melo]
gi|449465121|ref|XP_004150277.1|   1.0e-195  50.76     PREDICTED: uncharacterized protein LOC101223143 [Cucumis sativus]
gi|1009150252|ref|XP_015892920.1|  4.4e-53   33.64     PREDICTED: uncharacterized protein LOC107427091 [Ziziphus jujuba]
gi|590687581|ref|XP_007042704.1|   1.6e-50   35.76     Uncharacterized protein isoform 1 [Theobroma cacao]
gi|590687585|ref|XP_007042705.1|   1.6e-50   35.76     Uncharacterized protein isoform 2 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006096      glycolytic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016301      kinase activity
molecular_function  GO:0000287      magnesium ion binding
molecular_function  GO:0030955      potassium ion binding
molecular_function  GO:0004743      pyruvate kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g04750.1  Cp4.1LG14g04750.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  unknown  Coil           Coil                 coord: 626..646, scor
None      No IPR available  PANTHER  PTHR34775      FAMILY NOT NAMED     coord: 2..169, score: 9.5E-86; coord: 318..1014, score: 9.5E-86; coord: 195..259, score: 9.5
None      No IPR available  PANTHER  PTHR34775:SF3  SUBFAMILY NOT NAMED  coord: 195..259, score: 9.5E-86; coord: 2..169, score: 9.5E-86; coord: 318..1014, score: 9.5