Cp4.1LG08g12820 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG08g12820
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionFact complex subunit spt16
LocationCp4.1LG08 : 9263124 .. 9267973 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGGCGGTGCCAGCACAAATGTCGACGGTTCCGACGCCTGAAACGTGGTCAGATAGAAGCCAACTTCCTTCCGAGACGAATATACTCCGCCGCAGTGGATGAGAAAATGGCCGTGTCGAGCGTTGAGTGCGGAGGGCCGGATTCAGAAGCATGCCGCGTAATTAAGAAATTTCCCGGCGCAAGGACGGTGCTACATGGACGACATGTCGGTTCGAGTGATAAAAAATGTCTAATCAATTTTTTAATTTTTGTACATAATATCCGACAGGAATGGCGTGGAATTGGACTTACGAATGTGTAAAAACTTCTTCCTAATATACACGTTTTATAACTACGAGGCTGATGGCGTACGTAACAAGTCAGTCAAAATGAACAATATAAAGATATGAAAACCTCTCCCTAAAAGACGCGTTTTGAAATCGTGAGGCTGATAGTTATACGTAACGAGCCAAAGCGGACAACATTTACTAATGGTAGTGGGATTAGGCTTACAAGGGTTTGGAAACCTCTCCCTAGCAGACGCGCTGATGAATATACGTAACAAGTTAAAGTAAACAATATCTATTAACGATGGATCAGACTATTGAAAAGCAGGCAGGTAATACATATTAATATGCAAGTTAGATAGGACCATGGGCTAGGGATCCTATTTTTATAATCACACCTTACAACCAATCCAATTTTTCCTTTTATATTTATATTATTATTATTTAATAAATTTAATTAATTATTAGGACACGCATATGCATCATATTAAAACATGATCTTCAGTTTTTCTTTGGACCTGAATTTTCTTCTTCCCCTTCCCTTCTTCTTCCTCTTCTTTTCTCTGAAAACCCTAATTACACCTTTTTCTTTGTATAATTCCCTCCTCTACAGTACCAGAGATCAGGTATGTACTTTGCTTTTTTCATCTCTTTCTCTACGCTTTTTCCTTTGCAANTTTTATTTTCTGCATGTTTGTGTATTTTCGTTGTGTTCAGATTGCTTTATCTTAGTTCTGTTTGGGAATTGGATGTTGATATCGTTTTCTGATTCCGGAAACGTGAGTACTGTGTCCATGGATGAAATGCAGTTTGTATTCATGCGCTGATTATGCCCTTAAAACCTTGGGAGATTGGCAATCCATTATGTGAGATCTTCATCATATAGCAGATAGTGGCAGCTTCCAATTTAAAGTTGTACTCTTGATTACTGAAGAAATGTTTGATTGGGATGTTATTGTATGGCTTCCGAGTTTTCTAATTGATGTAAGAGTCTGATGGAATGTGCAGAGTGGAATTTGTGGTAGAATTAGAAGCTAATTTACTGTTCTCGTAATTATTTTCAGCGAAAGTAAATGTCAACTCAGTTTCTTGTCCTCGTGGAGAAACATTAGCGATTTATGTTTAAAGCTAGAATTTGGAAGGGGAAGAATGGAGGCTAACATCTGTGACGTCAATCACCTCGATTCCGATGTCCTTTTACCTCCAAGAAAGCGTCTTCTTGCTGGATTGAGAAAGAAAGGGGCCGATGGTGATGGTACTTTTAATGTGCCACCAGTTGCCTCCACCTCTTGTTCTCCCCCTCCCTCTCCTTCCTATGGCTTCACATCTATTGAATTCAATATACGGCTCAACAATCTGTTGAGTGCTCATTCAAATACTAACCTATCGCCCGAGGAGATAGTGGAGGCCTCAAGATCAGCGGCAGCTGCAGCTGTGAAGGCTGCAGAGGCTGCCAGGACAGCAGCTGAAGAGAAGGCTGCGATTGCAGCGAAGGCTGTTGCAGCTGCAAAGAGTGCCATGGACTTGGTTGCCTCGATTTCTGAAGAAGCAGCCTATAAAGAAATAAAACAGAGAAAGAACAAGCTGAAGAAACATGTCCCAGTTCAGTTTCTGTATACAAAATATCAACCTCTCGAGAATACCAGGACAGATGAAGAGATGGCCCGCAAATTACATCGGGCAATTAATAGCTCCCCAAGAATTTTGAAGAATTCATCTGGTTCTGATGCTAGAGGCCACAAACATAAGAAGTTGAAAACTTCACCTGGTTCTGAGAAAATTATGGTTTCCAATTGTGGCATCTCACAGGAGTTGGAACCTACTCCTACATGCAACGGGCATGCTAAAACTAACGAGGCCGACTCTGAATGTAGCTTTCAGGAAGTATACAAGCTCAAATCTGATGAGAAGACTATTAAATATGAGAAGAACAATCAGTCTCAGATGGATACTGGCGAAGCAGAGACAAGTCAGAAGGAGAAAAAAACGTGCGATGATATTAATGTCACTGGTAAAAAGAAGGGAAGAGTGAAGCTTAAAAAATTGTCGTTGAGTATTTGTTCTTTCAGGGATCAAAAAGCTCTCAAGGAAGACATGGACAATGGAAGTTCCCCAATATTGGCTGTGCAAAACACGGGGTCTCCCACACCCGAAAATGTTATTTTGCATTCTGTAGATAGCCTTAGCCCTACCCAAGGCGTGCTGCCAATAGATTCTACATCTTTATGGAAATGTCAGGAGTTCAAAGCTCCCCTTTCTGTCGAACAGAATAAAGTTATGCAGTCATGACCGGGAATGACATATGGTAAAAGGTTGCTGGCAATAGAAGTCACGGTCGAGGTAGGATCCATGCAGCTTTTCTTATAGAAACAGCGTTACTCCATCTGTCTTCCTTTTGGTCCTGCATCTTTTGTTTCACAATGAACTGGGATAGACCAATTGTAAATTTCATATTATGTGTTTGATTATTCTGCAGAGTTCAAAGGTATCCTTTACGATACACAGAGCTTGAAGTCCAAATAAGAGGGTTACAATAAACTGAATACTTGAACACTCCTTGTTTTACAGGAGGCATTTTTGTCTTGGTTTCTTTATATATGCTAAATATACTAAAAAGTAATCCCTATTGTATCTGTAGGTAAGATCTGTAGATGGTTGTTTTCATGGTCCTGCTACAGTTGTATCTATATATTACTTAACGACGTCCTCCTCGTTTTTAGGAGTTGTAGATTGAAAGAAATTGAAGAAAACTTCAAATAATTGTGTTCTACTTCTGGAAAGACTAAACCGTTTGCCTTTTGTTGGTCATCTAGTCTGATTATTGATCATCTGAGATTGAGATTGTATTTTCTACTTCTTTTTTTCCCTTTAAGAAGAGGATGTTCTTTCCTCTTGTAGCTTTGTAATTGCTTTTAATATATTAATGAAAGCTTCCCTTTTCGTTAATTGTTGTTCTTCTTTTAGTTGCAGCAGCTCTCCTCTGTTCTTATGTTTGCTATCACAGAGATGAGCTGCTGCAAAACTCTGTTACTGTTTGTAATGGGATTAGCTGGTTTTGAGCTCACGATATCTCTTTTTTGTAGATCGTAAAAGTCAGGTTCAGGTTTAAGGTTTTGTCTTGACATCAGTTGGAGGGACTCGAAGTCCCTTTCTTTAATGGGATTCCCTTTTGTGGGCTTGGTTTTTTTAATGCCGGTATATTCTTTCATTTTTATTCTCAACAGAAGTTCGATTCTTCATAAGAAATCGAACAAAACTATACTGTTTACTAATGTTCTTGTTCGTAGGGGTCGAGACAAATGTATCAACAAACGACAAAGATTTGAGGTTTGTAACTCAGACGTAAATGTCTTAAATACGATCATTTGTCTAAAGTTTATTAGATTTACCTTTTTATATTGAATGTCCCATGAATCAAGAACATAGGTTACTGAAACAATGCTTCAATTTCACCTCCTCTGGATCATGTAATCACTATTGTTTTATCTATTTTTTAAATCTAGTTTTTTTGTAAAAAAAAATACAAAATTGGTGAGAAAACAAACCAAAAAATATATATATATATATATATATATATCAAGCCAAAATTTTTGTAAAAAAAAATACAAAATTGGTGAGAAAACAAACCAAAAAAAAAAATATATATATATATATATATCAAGCCAAAATCCACATGAAGACTGCAGACCAAACAGCAAGAGCAAGAGCGGCTGCCAAATAAGTCCATAGCATGATAACAGAGCATTCCTCTTGACCAACGCCGAACAACTGAGTCATTGTGCCGATGGACATGGCCGGCGGCGTGGTGTATTGAACCATCAACAAGAAGTGGTAAAGAGGATCCGGCGGCAGAAATCCTAGCGAGTTTGCTCCTTTCACAACCAATATCCCTATTGCAGGCAGTGCAAAGTACCGAACCACAATCACTCCGATGATCGTTGTCGGCTTCACTGCTGACGACCTTAGCCCTTGAATCAGGTTGCCGCCGAGTATCAGCATGGTGCACGGGATTGTCCCATCTCTGTTTGTTATACATCACCACAACTCGTTTAGTAATGTTTTCATTTCTCATGCATATGGTAACGAACTCTAAAAAGTATACATACCCGAGTAGTTGAACAGAATCTTGTATCACACGTAACGGAGCATTATCTCCAATTACGAAGTTCCTAAGCCATGTCACCGCTCCAAAAATGAACCCCACAATCTAAAAATAGGAATATTTATCCAATTCAAGCATAACTCAATAGCTAAAACATCAGTGAGTATCTCAAAAGTCGATACTTACCGCTCCTAACGATGGAGGGGCCATTAACTCTTCAACAATGCTATGCAAAAATTCAAATGTTTTAGCCCAAATGGACGATGATTCCCGTTTCTCTAAGATGGCCACAGATTGTTGGGATTCCCTTGATGAGACTGAAACAGGAAGGCCTTCTTCTTCTCCACATTGTTCGTGGAGAAGTCGAGTTTGCAAGTCGCCATTGGAGGTGTGGTTGGGTGCTTTCACGTGCTCCTCCGGTGCTTCAAGTGCTTTCATTCGCAAAGACGACGTTTTCACAAGGTGAAAAGTGTGAGTCCA

mRNA sequence

ATGTGGCGGTGCCAGCACAAATGTCGACGGTTCCGACGCCTGAAACGTGGTCAGATAGAAGCCAACTTCCTTCCGAGACGAATATACTCCGCCGCAGTGGATGAGAAAATGGCCGTGTCGAGCGTTGAGTGCGGAGGGCCGGATTCAGAAGCATGCCGCGTAATTAAGAAATTTCCCGGCGCAAGGACGGTGCTACATGGACGACATGTCGTACCAGAGATCAGCGAAAGTAAATGTCAACTCAGTTTCTTGTCCTCGTGGAGAAACATTAGCGATTTATGTTTAAAGCTAGAATTTGGAAGGGGAAGAATGGAGGCTAACATCTGTGACGTCAATCACCTCGATTCCGATGTCCTTTTACCTCCAAGAAAGCGTCTTCTTGCTGGATTGAGAAAGAAAGGGGCCGATGGTGATGGTACTTTTAATGTGCCACCAGTTGCCTCCACCTCTTGTTCTCCCCCTCCCTCTCCTTCCTATGGCTTCACATCTATTGAATTCAATATACGGCTCAACAATCTGTTGAGTGCTCATTCAAATACTAACCTATCGCCCGAGGAGATAGTGGAGGCCTCAAGATCAGCGGCAGCTGCAGCTGTGAAGGCTGCAGAGGCTGCCAGGACAGCAGCTGAAGAGAAGGCTGCGATTGCAGCGAAGGCTGTTGCAGCTGCAAAGAGTGCCATGGACTTGGTTGCCTCGATTTCTGAAGAAGCAGCCTATAAAGAAATAAAACAGAGAAAGAACAAGCTGAAGAAACATGTCCCAGTTCAGTTTCTGTATACAAAATATCAACCTCTCGAGAATACCAGGACAGATGAAGAGATGGCCCGCAAATTACATCGGGCAATTAATAGCTCCCCAAGAATTTTGAAGAATTCATCTGGTTCTGATGCTAGAGGCCACAAACATAAGAAGTTGAAAACTTCACCTGGTTCTGAGAAAATTATGGTTTCCAATTGTGGCATCTCACAGGAGTTGGAACCTACTCCTACATGCAACGGGCATGCTAAAACTAACGAGGCCGACTCTGAATGTAGCTTTCAGGAAGTATACAAGCTCAAATCTGATGAGAAGACTATTAAATATGAGAAGAACAATCAGTCTCAGATGGATACTGGCGAAGCAGAGACAAGTCAGAAGGAGAAAAAAACGTGCGATGATATTAATGTCACTGGTAAAAAGAAGGGAAGAGTGAAGCTTAAAAAATTGTCGTTGAGTATTTGTTCTTTCAGGGATCAAAAAGCTCTCAAGGAAGACATGGACAATGGAAGTTCCCCAATATTGGCTGTGCAAAACACGGGGTCTCCCACACCCGAAAATGTTATTTTGCATTCTGTAGATAGCCTTAGCCCTACCCAAGGCGTGCTGCCAATAGATTCTACATCTTTATGGAAATGTCAGGAGTTCAAAGCTCCCCTTTCTGTCGAACAGAATAAAGTTATGCAGTCATGACCGGGAATGACATATGGTAAAAGGTTGCTGGCAATAGAAGTCACGGTCGAGATCGTAAAAGTCAGGTTCAGGTTTAAGGTTTTGTCTTGACATCAGTTGGAGGGACTCGAAGTCCCTTTCTTTAATGGGATTCCCTTTTGTGGGCTTGGTTTTTTTAATGCCGGGGTCGAGACAAATGTATCAACAAACGACAAAGATTTGAGGTTTGTAACTCAGACGTAAATGTCTTAAATACGATCATTTGTCTAAAGTTTATTAGATTTACCTTTTTATATTGAATGTCCCATGAATCAAGAACATAGGTTACTGAAACAATGCTTCAATTTCACCTCCTCTGGATCATGTAATCACTATTGTTTTATCTATTTTTTAAATCTAGTTTTTTTGTAAAAAAAAATACAAAATTGGTGAGAAAACAAACCAAAAAATATATATATATATATATATATATATCAAGCCAAAATTTTTGTAAAAAAAAATACAAAATTGGTGAGAAAACAAACCAAAAAAAAAAATATATATATATATATATATCAAGCCAAAATCCACATGAAGACTGCAGACCAAACAGCAAGAGCAAGAGCGGCTGCCAAATAAGTCCATAGCATGATAACAGAGCATTCCTCTTGACCAACGCCGAACAACTGAGTCATTGTGCCGATGGACATGGCCGGCGGCGTGGTGTATTGAACCATCAACAAGAAGTGGTAAAGAGGATCCGGCGGCAGAAATCCTAGCGAGTTTGCTCCTTTCACAACCAATATCCCTATTGCAGGCAGTGCAAAGTACCGAACCACAATCACTCCGATGATCGTTGTCGGCTTCACTGCTGACGACCTTAGCCCTTGAATCAGGTTGCCGCCGAGTATCAGCATGGTGCACGGGATTGTCCCATCTCTGTTTGTTATACATCACCACAACTCGTTTAGTAATGTTTTCATTTCTCATGCATATGGTAACGAACTCTAAAAAGTATACATACCCGAGTAGTTGAACAGAATCTTGTATCACACGTAACGGAGCATTATCTCCAATTACGAAGTTCCTAAGCCATGTCACCGCTCCAAAAATGAACCCCACAATCTAAAAATAGGAATATTTATCCAATTCAAGCATAACTCAATAGCTAAAACATCAGTGAGTATCTCAAAAGTCGATACTTACCGCTCCTAACGATGGAGGGGCCATTAACTCTTCAACAATGCTATGCAAAAATTCAAATGTTTTAGCCCAAATGGACGATGATTCCCGTTTCTCTAAGATGGCCACAGATTGTTGGGATTCCCTTGATGAGACTGAAACAGGAAGGCCTTCTTCTTCTCCACATTGTTCGTGGAGAAGTCGAGTTTGCAAGTCGCCATTGGAGGTGTGGTTGGGTGCTTTCACGTGCTCCTCCGGTGCTTCAAGTGCTTTCATTCGCAAAGACGACGTTTTCACAAGGTGAAAAGTGTGAGTCCA

Coding sequence (CDS)

ATGTGGCGGTGCCAGCACAAATGTCGACGGTTCCGACGCCTGAAACGTGGTCAGATAGAAGCCAACTTCCTTCCGAGACGAATATACTCCGCCGCAGTGGATGAGAAAATGGCCGTGTCGAGCGTTGAGTGCGGAGGGCCGGATTCAGAAGCATGCCGCGTAATTAAGAAATTTCCCGGCGCAAGGACGGTGCTACATGGACGACATGTCGTACCAGAGATCAGCGAAAGTAAATGTCAACTCAGTTTCTTGTCCTCGTGGAGAAACATTAGCGATTTATGTTTAAAGCTAGAATTTGGAAGGGGAAGAATGGAGGCTAACATCTGTGACGTCAATCACCTCGATTCCGATGTCCTTTTACCTCCAAGAAAGCGTCTTCTTGCTGGATTGAGAAAGAAAGGGGCCGATGGTGATGGTACTTTTAATGTGCCACCAGTTGCCTCCACCTCTTGTTCTCCCCCTCCCTCTCCTTCCTATGGCTTCACATCTATTGAATTCAATATACGGCTCAACAATCTGTTGAGTGCTCATTCAAATACTAACCTATCGCCCGAGGAGATAGTGGAGGCCTCAAGATCAGCGGCAGCTGCAGCTGTGAAGGCTGCAGAGGCTGCCAGGACAGCAGCTGAAGAGAAGGCTGCGATTGCAGCGAAGGCTGTTGCAGCTGCAAAGAGTGCCATGGACTTGGTTGCCTCGATTTCTGAAGAAGCAGCCTATAAAGAAATAAAACAGAGAAAGAACAAGCTGAAGAAACATGTCCCAGTTCAGTTTCTGTATACAAAATATCAACCTCTCGAGAATACCAGGACAGATGAAGAGATGGCCCGCAAATTACATCGGGCAATTAATAGCTCCCCAAGAATTTTGAAGAATTCATCTGGTTCTGATGCTAGAGGCCACAAACATAAGAAGTTGAAAACTTCACCTGGTTCTGAGAAAATTATGGTTTCCAATTGTGGCATCTCACAGGAGTTGGAACCTACTCCTACATGCAACGGGCATGCTAAAACTAACGAGGCCGACTCTGAATGTAGCTTTCAGGAAGTATACAAGCTCAAATCTGATGAGAAGACTATTAAATATGAGAAGAACAATCAGTCTCAGATGGATACTGGCGAAGCAGAGACAAGTCAGAAGGAGAAAAAAACGTGCGATGATATTAATGTCACTGGTAAAAAGAAGGGAAGAGTGAAGCTTAAAAAATTGTCGTTGAGTATTTGTTCTTTCAGGGATCAAAAAGCTCTCAAGGAAGACATGGACAATGGAAGTTCCCCAATATTGGCTGTGCAAAACACGGGGTCTCCCACACCCGAAAATGTTATTTTGCATTCTGTAGATAGCCTTAGCCCTACCCAAGGCGTGCTGCCAATAGATTCTACATCTTTATGGAAATGTCAGGAGTTCAAAGCTCCCCTTTCTGTCGAACAGAATAAAGTTATGCAGTCATGA

Protein sequence

MWRCQHKCRRFRRLKRGQIEANFLPRRIYSAAVDEKMAVSSVECGGPDSEACRVIKKFPGARTVLHGRHVVPEISESKCQLSFLSSWRNISDLCLKLEFGRGRMEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTSIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDSTSLWKCQEFKAPLSVEQNKVMQS
BLAST of Cp4.1LG08g12820 vs. TrEMBL
Match: A0A0A0L0X5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G000190 PE=4 SV=1)

HSP 1 Score: 592.4 bits (1526), Expect = 4.9e-166
Identity = 321/379 (84.70%), Postives = 344/379 (90.77%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 163
           ME NICDVNHL+SDVLLPPRKRLLAGLRK+G DGDGTFN+PPVAS+SCSPPPSPSYGFTS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSSCSPPPSPSYGFTS 60

Query: 164 IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 223
           IEFNIRLN+LLSAHSN+NLSPEEIV+ASRSAAAAAVKAAEAAR AAEEKAAIAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVQASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 224 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 283
           KSAMDLVASISEEAAYKEI  RKNKLKKHVPVQ LYTKYQPLENT+TDEE+ARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQLLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 284 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSE 343
           SSPRILKNSSGSD R HKHKKLK+S  SEKI VSNCGISQ+L+PT TCNGHAK NE DSE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPTTTCNGHAKPNEVDSE 240

Query: 344 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLKKLS 403
           CSFQEVYKLK DEKT KYEK+N S  D GE ETSQKE K CDDI+VT KK+GRVKLKKL 
Sbjct: 241 CSFQEVYKLKPDEKTSKYEKSNPSLTDNGE-ETSQKE-KMCDDISVTIKKRGRVKLKKLP 300

Query: 404 LSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDSTSLW 463
           LSICSFRD+  LKEDM+NGSSPIL VQN GSPT E VILHSVD  SPT+GV+PIDSTS+W
Sbjct: 301 LSICSFRDKTTLKEDMNNGSSPILTVQNRGSPTSEKVILHSVD--SPTEGVMPIDSTSVW 360

Query: 464 KCQEFKAPLSVEQNKVMQS 483
           KCQEFKAPLSV+QNKV+QS
Sbjct: 361 KCQEFKAPLSVKQNKVVQS 375

BLAST of Cp4.1LG08g12820 vs. TrEMBL
Match: M5W8H2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007189mg PE=4 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 4.8e-105
Identity = 231/382 (60.47%), Postives = 281/382 (73.56%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVAST---SCSPPPSPSYG 163
           MEANICDVNHLD+DVLLPPRKRLLAGL+K+  DGDG+  +  VAS+   S S   S S  
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLKKQSPDGDGSSWLSLVASSASASASASASSSSS 60

Query: 164 FTSIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAV 223
            +S +FN RLNN+LS+H N NLSPEE+VE S SAAAAA KAAE AR AAEEKA IAAKAV
Sbjct: 61  PSSSDFNTRLNNILSSH-NPNLSPEELVEISNSAAAAAAKAAEDARAAAEEKAGIAAKAV 120

Query: 224 AAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHR 283
           AAAKSA++LVAS SE+   KE   +KNKLKKHVPVQ LY KYQP+EN + DEE+ARKLHR
Sbjct: 121 AAAKSALELVASFSEDVGCKEKYLKKNKLKKHVPVQLLYKKYQPIENCKKDEELARKLHR 180

Query: 284 AINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEA 343
           AINSSPRI KNSS +D++GHKHKK K    SEK+ VSN GI  E  P P CNGHA   + 
Sbjct: 181 AINSSPRISKNSSSTDSKGHKHKKPKIVHSSEKVRVSNGGIELEQNPEPACNGHAVAGKV 240

Query: 344 DSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLK 403
           + E +  E+YK K++EK  +++K  + +MD  EAE+SQ ++K+ DD+  +GKK+GRVKLK
Sbjct: 241 NVEGTILELYKNKAEEKAYRHDKTGRLEMDNAEAESSQMKEKSWDDVCTSGKKRGRVKLK 300

Query: 404 KLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDST 463
           KL LSIC+FRDQ   KE+MD   SP L V N G+PT     L  V+  S    +LPI++T
Sbjct: 301 KLPLSICTFRDQANPKEEMDARGSP-LTVINRGNPTAGKKPLFPVE--SSADSMLPIEAT 360

Query: 464 SLWKCQEFKAPLSVEQNKVMQS 483
            +WK Q+FKAP  V+QNKVMQS
Sbjct: 361 PVWKYQDFKAPACVKQNKVMQS 378

BLAST of Cp4.1LG08g12820 vs. TrEMBL
Match: W9RW22_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007067 PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.1e-104
Identity = 238/393 (60.56%), Postives = 277/393 (70.48%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCS----------- 163
           MEANICDVNHLD+DVLLPPRKRLLAGL+K+G+D DG+  +   ASTS S           
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLKKQGSDSDGSLLLSLAASTSASGSASTSASASA 60

Query: 164 ---PPPSPSYGFTSIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAA 223
              PP SPS    S EF+IRL+NLLS+H N+NLSP EIVEAS+SAAAAA KAAEAAR AA
Sbjct: 61  SAPPPSSPS----SSEFDIRLSNLLSSH-NSNLSPVEIVEASKSAAAAAAKAAEAARAAA 120

Query: 224 EEKAAIAAKAVAAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTR 283
           EEKAA+AAKAVAAAKSA+DLVAS SEEAA KE   +KNKLKKHVPVQ LY KYQP+EN +
Sbjct: 121 EEKAAVAAKAVAAAKSALDLVASFSEEAACKERHLKKNKLKKHVPVQLLYKKYQPIENCK 180

Query: 284 TDEEMARKLHRAINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTP 343
            DEE+ARKLHRAINSSPRI KNSSGSD +GHKHKK K S  SEK  VSN G+  E  P  
Sbjct: 181 KDEELARKLHRAINSSPRISKNSSGSDGKGHKHKKPKISSSSEKTRVSNGGVVLEQTPAS 240

Query: 344 TCNGHAKTNEADSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINV 403
           T NGHA   E + E   +E +K K DE T  Y++  QS +D GEA   Q E+KT DD   
Sbjct: 241 TSNGHAAAGEVNLESPTRESHKNKVDENTCTYDRAGQSDIDNGEAVFGQPEEKTWDDTLA 300

Query: 404 TGKKKGRVKLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLS 463
            GKK+GRVKLKKL LS C+FRDQ   KE+  +     L   N G+PT  N+ L S++   
Sbjct: 301 GGKKRGRVKLKKLPLSTCAFRDQANPKEEAADIRKTPLTDMNMGNPTAGNIPLFSME--P 360

Query: 464 PTQGVLPIDSTSLWKCQEFKAPLSVEQNKVMQS 483
               V+P ++ S+WK QEFK P  V+QNKVMQS
Sbjct: 361 SADNVMPQEAKSVWKFQEFKGPECVKQNKVMQS 386

BLAST of Cp4.1LG08g12820 vs. TrEMBL
Match: A0A061DFG5_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_000059 PE=4 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 1.8e-99
Identity = 222/387 (57.36%), Postives = 279/387 (72.09%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCS-------PPPS 163
           MEANICD+NHLD+DVLLPPRKRLLAG +K+ ++ +G+ + P VAS+S S       P PS
Sbjct: 2   MEANICDINHLDADVLLPPRKRLLAGFKKQASNANGSSDQPTVASSSSSLPSPSPSPSPS 61

Query: 164 PSYGFTSIEFNIRLNNLLSAH-SNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAI 223
           PS   +S + N  LNNLLS+H +N NLSPEEI+ ASR AA AA KAAEAAR AAEEKAAI
Sbjct: 62  PSPSTSSSDVNTHLNNLLSSHINNPNLSPEEILAASRVAAIAAAKAAEAARAAAEEKAAI 121

Query: 224 AAKAVAAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMA 283
           AAKAVAAAKSA+DLVA+ SEE   K+   +KNKLKKHVPVQ LY K+QP+EN RTDEE+A
Sbjct: 122 AAKAVAAAKSALDLVATFSEETVSKDRYLKKNKLKKHVPVQLLYKKHQPIENNRTDEELA 181

Query: 284 RKLHRAINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHA 343
            +LHRAINSSPRI KNS  S+ +GHKHK+ K+ P  EK  + N GI      + TCNG  
Sbjct: 182 HRLHRAINSSPRISKNSPTSEWKGHKHKRPKSLPTLEKTKIYNGGIVLGGSQSSTCNGDT 241

Query: 344 KTNEADSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKG 403
              E DSE S QE   +K++ K  KYEK+ QS++D GEAE++Q ++K C+D+   GK++G
Sbjct: 242 VAGEIDSEDSIQE--SVKAEAKGTKYEKSGQSELDNGEAESNQSKEKACEDVYSPGKRRG 301

Query: 404 RVKLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVL 463
           RVKLKKL LSICSFRD+   KE+    SSP L  +N G+P+     L S++    T GV+
Sbjct: 302 RVKLKKLPLSICSFRDRVNPKEETITKSSP-LTEKNMGNPSAAVKPLFSLE--PSTDGVI 361

Query: 464 PIDSTSLWKCQEFKAPLSVEQNKVMQS 483
            I+ T +WKCQ++KAP  ++QNKVMQS
Sbjct: 362 SIEGTPIWKCQDYKAPACIKQNKVMQS 383

BLAST of Cp4.1LG08g12820 vs. TrEMBL
Match: A0A067L991_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22400 PE=4 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.5e-98
Identity = 229/395 (57.97%), Postives = 275/395 (69.62%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSP---------- 163
           MEANICD+NHLD+DVLLPPRKRLLAG +K+ +DGD     P +AS+S S           
Sbjct: 1   MEANICDINHLDADVLLPPRKRLLAGFKKQSSDGDALLVPPTIASSSSSASPSSPSPSTP 60

Query: 164 ---PPSPSYGF-TSIEFNIRLNNLLSAH--SNTNLSPEEIVEASRSAAAAAVKAAEAART 223
              PPSP+    TS E   RLNNLLS+H  +N NLSPE+IVEAS+SAA AA KAAEAAR 
Sbjct: 61  SLSPPSPTAPSPTSSELQSRLNNLLSSHFRNNHNLSPEQIVEASKSAADAAAKAAEAARA 120

Query: 224 AAEEKAAIAAKAVAAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLEN 283
           AA+EKA +AAKA+ AAKSA+ LVAS  EEAA KE + +KNK KKHV VQ LY K+QP+EN
Sbjct: 121 AAQEKAILAAKAITAAKSALALVASFPEEAANKERQLKKNKQKKHVQVQLLYKKHQPIEN 180

Query: 284 TRTDEEMARKLHRAINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEP 343
            R DEE+ARKLHR INSSPRI KNSS SD +GHK KK K+SP SEK  VSN  ++    P
Sbjct: 181 YRDDEELARKLHRVINSSPRISKNSSSSDWKGHKSKKPKSSPTSEKTWVSNGSVAFGGNP 240

Query: 344 TPTCNGHAKTNEADSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDI 403
           +  CNGHA   E DSE S  EVY   +DEKT KYEK  Q ++D+GEAE+SQ ++K   D 
Sbjct: 241 SSICNGHAIAGELDSEGSIGEVYTSMADEKTSKYEKATQLEIDSGEAESSQSKEKMTGDA 300

Query: 404 NVTGKKKGRVKLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDS 463
           +  GKK+GR+KLKKL LSICS RDQ   K+D    SSP L  +N G+PT  N  L S+D 
Sbjct: 301 SSPGKKRGRLKLKKLPLSICSSRDQANPKDDSFLRSSP-LGDKNIGNPTTRNKPLFSMD- 360

Query: 464 LSPTQGVLPIDSTSLWKCQEFKAPLSVEQNKVMQS 483
                 ++PI+   + KCQEFKAP  V+QNKV+QS
Sbjct: 361 -PSADNMMPIEVAPMRKCQEFKAPACVKQNKVIQS 392

BLAST of Cp4.1LG08g12820 vs. TAIR10
Match: AT4G35510.1 (AT4G35510.1 unknown protein)

HSP 1 Score: 179.1 bits (453), Expect = 6.5e-45
Identity = 154/385 (40.00%), Postives = 217/385 (56.36%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 163
           ME N CD+N LDSD  LPPRKRLLAG +K  ++G    +    AS+S +   S S G +S
Sbjct: 1   METNPCDMNQLDSDSHLPPRKRLLAGFKKFNSNGINGSSPSDFASSSST---SNSNGSSS 60

Query: 164 IEFNIR--LNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVA 223
              N++  L NLLS+  N + SPEE+VEA+RSAAA AVKAA+AAR  A EKA I+AKA+A
Sbjct: 61  ASTNVQTHLGNLLSSPFNNDQSPEELVEATRSAAALAVKAAKAARAIANEKALISAKAIA 120

Query: 224 AAKSAMDLVASISEEAAY--KEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLH 283
           AAK A++LV S  +EA    KE   RKNK KKHVPV+ LY+K Q  +    ++++AR+LH
Sbjct: 121 AAKRALELVDSFPKEAMADCKERSPRKNKQKKHVPVELLYSKGQLRDE---EDDLARRLH 180

Query: 284 RAINSS--PRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKT 343
           RAI+++  PR+L+ S   +  G ++KK K +    K +V   G S  +  T +    A  
Sbjct: 181 RAIDNTSYPRVLRTS---EENGQRYKKQKKN----KSVVE--GGSSSIIVTGSMKDIAGV 240

Query: 344 NEADSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRV 403
            ++DS     E+ +   DE                 A++    +K+ ++ N   K++GRV
Sbjct: 241 VDSDSSYEGLEIARSNRDE-----------------ADSMLMMEKSGEESNSLVKRRGRV 300

Query: 404 KLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPI 463
           KLKKL LSIC+ R+Q       +NG+S       + SP P                +  I
Sbjct: 301 KLKKLPLSICNSRNQ-------ENGTS-------SASPLP------VAQPQEDGGAITVI 333

Query: 464 DSTSLWKCQEFKAPLSVEQNKVMQS 483
             +S WKCQ+ KAP  V+QNK ++S
Sbjct: 361 AGSSSWKCQDIKAPECVKQNKAVRS 333

BLAST of Cp4.1LG08g12820 vs. TAIR10
Match: AT2G17540.2 (AT2G17540.2 unknown protein)

HSP 1 Score: 106.3 bits (264), Expect = 5.4e-23
Identity = 122/380 (32.11%), Postives = 173/380 (45.53%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 163
           M  N C    L+SD LLPPRKRLLAG + + +        PP AS+S S   S     ++
Sbjct: 1   MATNAC----LESDSLLPPRKRLLAGFKNQNSRISNESPSPPFASSSSSTTTSNGSSASA 60

Query: 164 IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 223
           +     L++LL+  +    SPEE+ +AS++ AA AVK A+AAR  A EKA IA+KAVAAA
Sbjct: 61  VVHTHHLDHLLNDQTR---SPEELAQASKATAALAVKVAKAARATANEKAIIASKAVAAA 120

Query: 224 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 283
           K+A++L AS                               P   T + +E   +L RAIN
Sbjct: 121 KNALELFASF------------------------------PAAETVSCKEP--RLIRAIN 180

Query: 284 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCN-GHAKTNEADS 343
           +SPR+L     +D  GH++KK KT                    T T N G+      DS
Sbjct: 181 NSPRVL-----TDCSGHRNKKQKTL-------------------TSTMNDGNDVAGVVDS 240

Query: 344 ECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLKKL 403
           +   +   KL   + T+K                    +KT ++ +  GK++GRV   KL
Sbjct: 241 DDGTRVNGKLLLCDNTLK--------------------EKTEEESSSLGKRRGRV---KL 275

Query: 404 SLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDSTSL 463
            LS+C+++DQ       +NG             T E   L    S     GV  +     
Sbjct: 301 PLSMCAYKDQ-------ENGI------------TLEENNLSVSKSEGYNGGVAQVIERPS 275

Query: 464 WKCQEFKAPLSVEQNKVMQS 483
           WKCQ+ K+P  V+QNKV++S
Sbjct: 361 WKCQDLKSPECVKQNKVVRS 275

BLAST of Cp4.1LG08g12820 vs. TAIR10
Match: AT5G66000.1 (AT5G66000.1 unknown protein)

HSP 1 Score: 63.5 bits (153), Expect = 4.0e-10
Identity = 60/145 (41.38%), Postives = 86/145 (59.31%), Query Frame = 1

Query: 155 PSPSYGFTSIEFNIRLNNLLSAH-SNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKA 214
           P PS   T++++++ +  LL++H SN +L+P+EI +ASR  AAAA  AA+AAR  A+EKA
Sbjct: 17  PHPS---TNVDYHLSI--LLASHLSNPDLTPQEIADASRCTAAAAAIAAKAARATADEKA 76

Query: 215 AIAAKAVAAAKSAMDLVASISEEAAYKE---IKQRKNKLKKHVPVQFLYTKYQPLENTRT 274
           A AAKAVAAAK+A+DL+AS        +   + + K   KKHV    L++K         
Sbjct: 77  AAAAKAVAAAKTALDLIASFPPNQGLVQDACLHKDKKMKKKHVAADLLFSK--------- 136

Query: 275 DEEMARKLHRAINSSPRILKNSSGS 296
           D+ +  KL      S  I+ NSS S
Sbjct: 137 DDALPSKLQLGGVVSQGIVSNSSSS 147

BLAST of Cp4.1LG08g12820 vs. NCBI nr
Match: gi|659095199|ref|XP_008448452.1| (PREDICTED: uncharacterized protein LOC103490642 [Cucumis melo])

HSP 1 Score: 598.6 bits (1542), Expect = 9.7e-168
Identity = 324/379 (85.49%), Postives = 347/379 (91.56%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 163
           ME NICDVNHL+SDVLLPPRKRLLAGLRK+G DGDGTFN+PPVAS++CSPPPSPSYGFTS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSTCSPPPSPSYGFTS 60

Query: 164 IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 223
           IEFNIRLN+LLSAHSN+NLSPEEIVEASRSAAAAAVKAAEAAR AAEEKAAIAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 224 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 283
           KSAMDLVASISEEAAYKEI  RKNKLKKHVPVQFLYTKYQPLENT+TDEE+ARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQFLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 284 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSE 343
           SSPRILKNSSGSD R HKHKKLK+S  SEKI VSNCGISQ+L+P  TCNGHAK+NEADSE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPATTCNGHAKSNEADSE 240

Query: 344 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLKKLS 403
           CSFQEVYK K DEKT KYEKNNQS  D GE ETSQKE KTCDDI+VT KK+GRVKLKKL 
Sbjct: 241 CSFQEVYKPKPDEKTSKYEKNNQSLTDDGE-ETSQKE-KTCDDISVTIKKRGRVKLKKLP 300

Query: 404 LSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDSTSLW 463
           LSICSFRD+  LKEDM+NGSSPIL VQN GSPT E VILHSVD  SPT+GV+PIDSTS+W
Sbjct: 301 LSICSFRDKTTLKEDMNNGSSPILTVQNMGSPTSEKVILHSVD--SPTEGVMPIDSTSVW 360

Query: 464 KCQEFKAPLSVEQNKVMQS 483
           KCQEFKAPLSV+QNKV+QS
Sbjct: 361 KCQEFKAPLSVKQNKVVQS 375

BLAST of Cp4.1LG08g12820 vs. NCBI nr
Match: gi|778674564|ref|XP_011650244.1| (PREDICTED: uncharacterized protein LOC101214022 [Cucumis sativus])

HSP 1 Score: 592.4 bits (1526), Expect = 7.0e-166
Identity = 321/379 (84.70%), Postives = 344/379 (90.77%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 163
           ME NICDVNHL+SDVLLPPRKRLLAGLRK+G DGDGTFN+PPVAS+SCSPPPSPSYGFTS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSSCSPPPSPSYGFTS 60

Query: 164 IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 223
           IEFNIRLN+LLSAHSN+NLSPEEIV+ASRSAAAAAVKAAEAAR AAEEKAAIAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVQASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 224 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 283
           KSAMDLVASISEEAAYKEI  RKNKLKKHVPVQ LYTKYQPLENT+TDEE+ARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQLLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 284 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSE 343
           SSPRILKNSSGSD R HKHKKLK+S  SEKI VSNCGISQ+L+PT TCNGHAK NE DSE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPTTTCNGHAKPNEVDSE 240

Query: 344 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLKKLS 403
           CSFQEVYKLK DEKT KYEK+N S  D GE ETSQKE K CDDI+VT KK+GRVKLKKL 
Sbjct: 241 CSFQEVYKLKPDEKTSKYEKSNPSLTDNGE-ETSQKE-KMCDDISVTIKKRGRVKLKKLP 300

Query: 404 LSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDSTSLW 463
           LSICSFRD+  LKEDM+NGSSPIL VQN GSPT E VILHSVD  SPT+GV+PIDSTS+W
Sbjct: 301 LSICSFRDKTTLKEDMNNGSSPILTVQNRGSPTSEKVILHSVD--SPTEGVMPIDSTSVW 360

Query: 464 KCQEFKAPLSVEQNKVMQS 483
           KCQEFKAPLSV+QNKV+QS
Sbjct: 361 KCQEFKAPLSVKQNKVVQS 375

BLAST of Cp4.1LG08g12820 vs. NCBI nr
Match: gi|1009143995|ref|XP_015889560.1| (PREDICTED: uncharacterized protein LOC107424309 [Ziziphus jujuba])

HSP 1 Score: 393.7 bits (1010), Expect = 4.8e-106
Identity = 243/388 (62.63%), Postives = 285/388 (73.45%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFN---VPPVASTSCSPPPSPSYG 163
           MEANICDVNHLD+DVLLPPRKRLLAGL+K+ +DGD T     VP  +S + S   S S  
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLKKQNSDGDSTSQPSLVPAASSPASSSSASASVF 60

Query: 164 FTSI-----EFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAI 223
            +S+     EFN RLNNLLS+HS  NLSPEEIV+ASR AA+AA KAAE AR AAEEKAAI
Sbjct: 61  ASSLSPSSSEFNNRLNNLLSSHS-ANLSPEEIVDASRIAASAAAKAAETARAAAEEKAAI 120

Query: 224 AAKAVAAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMA 283
           AAKAVAAAKSA+DLVAS SEE A K+   RKNKLKKHVPVQ LY KYQP+EN + DEE+A
Sbjct: 121 AAKAVAAAKSALDLVASFSEEGASKDRYSRKNKLKKHVPVQLLYKKYQPIENCKKDEELA 180

Query: 284 RKLHRAINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHA 343
           RKLHRAINSSPR  KNSS SD RGHKHKK K S  SE+   SN  +  E   T  CNGHA
Sbjct: 181 RKLHRAINSSPRTCKNSSSSDLRGHKHKKPKISSTSERTR-SNGDMVLEQNRTSACNGHA 240

Query: 344 KTNEA-DSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKK 403
              E  DSE + +E YK K+DEK  +Y+K  Q +MD GEAE+SQ  +KT DDI+ +GKK+
Sbjct: 241 VAGEVDDSEGTIRESYKNKADEKACRYDKAGQLEMDYGEAESSQTNEKTWDDISTSGKKR 300

Query: 404 GRVKLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGV 463
           GRVKLKKL LSIC+FRDQ   KE+ D  S+P LA  N G+PT  N+ L ++D  S  + V
Sbjct: 301 GRVKLKKLPLSICTFRDQANPKEEADTRSTP-LADMNMGNPTGGNMPLFTMDPSS--EVV 360

Query: 464 LPIDSTSLWKCQEFKAPLSVEQNKVMQS 483
            PI++TS+WKCQ+F AP  V+QNKVMQS
Sbjct: 361 RPIEATSVWKCQQFNAPACVKQNKVMQS 383

BLAST of Cp4.1LG08g12820 vs. NCBI nr
Match: gi|595818494|ref|XP_007204359.1| (hypothetical protein PRUPE_ppa007189mg [Prunus persica])

HSP 1 Score: 389.8 bits (1000), Expect = 6.9e-105
Identity = 231/382 (60.47%), Postives = 281/382 (73.56%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVAST---SCSPPPSPSYG 163
           MEANICDVNHLD+DVLLPPRKRLLAGL+K+  DGDG+  +  VAS+   S S   S S  
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLKKQSPDGDGSSWLSLVASSASASASASASSSSS 60

Query: 164 FTSIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAV 223
            +S +FN RLNN+LS+H N NLSPEE+VE S SAAAAA KAAE AR AAEEKA IAAKAV
Sbjct: 61  PSSSDFNTRLNNILSSH-NPNLSPEELVEISNSAAAAAAKAAEDARAAAEEKAGIAAKAV 120

Query: 224 AAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHR 283
           AAAKSA++LVAS SE+   KE   +KNKLKKHVPVQ LY KYQP+EN + DEE+ARKLHR
Sbjct: 121 AAAKSALELVASFSEDVGCKEKYLKKNKLKKHVPVQLLYKKYQPIENCKKDEELARKLHR 180

Query: 284 AINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEA 343
           AINSSPRI KNSS +D++GHKHKK K    SEK+ VSN GI  E  P P CNGHA   + 
Sbjct: 181 AINSSPRISKNSSSTDSKGHKHKKPKIVHSSEKVRVSNGGIELEQNPEPACNGHAVAGKV 240

Query: 344 DSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLK 403
           + E +  E+YK K++EK  +++K  + +MD  EAE+SQ ++K+ DD+  +GKK+GRVKLK
Sbjct: 241 NVEGTILELYKNKAEEKAYRHDKTGRLEMDNAEAESSQMKEKSWDDVCTSGKKRGRVKLK 300

Query: 404 KLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLSPTQGVLPIDST 463
           KL LSIC+FRDQ   KE+MD   SP L V N G+PT     L  V+  S    +LPI++T
Sbjct: 301 KLPLSICTFRDQANPKEEMDARGSP-LTVINRGNPTAGKKPLFPVE--SSADSMLPIEAT 360

Query: 464 SLWKCQEFKAPLSVEQNKVMQS 483
            +WK Q+FKAP  V+QNKVMQS
Sbjct: 361 PVWKYQDFKAPACVKQNKVMQS 378

BLAST of Cp4.1LG08g12820 vs. NCBI nr
Match: gi|703125204|ref|XP_010103249.1| (hypothetical protein L484_007067 [Morus notabilis])

HSP 1 Score: 388.7 bits (997), Expect = 1.5e-104
Identity = 238/393 (60.56%), Postives = 277/393 (70.48%), Query Frame = 1

Query: 104 MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCS----------- 163
           MEANICDVNHLD+DVLLPPRKRLLAGL+K+G+D DG+  +   ASTS S           
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLKKQGSDSDGSLLLSLAASTSASGSASTSASASA 60

Query: 164 ---PPPSPSYGFTSIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAA 223
              PP SPS    S EF+IRL+NLLS+H N+NLSP EIVEAS+SAAAAA KAAEAAR AA
Sbjct: 61  SAPPPSSPS----SSEFDIRLSNLLSSH-NSNLSPVEIVEASKSAAAAAAKAAEAARAAA 120

Query: 224 EEKAAIAAKAVAAAKSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTR 283
           EEKAA+AAKAVAAAKSA+DLVAS SEEAA KE   +KNKLKKHVPVQ LY KYQP+EN +
Sbjct: 121 EEKAAVAAKAVAAAKSALDLVASFSEEAACKERHLKKNKLKKHVPVQLLYKKYQPIENCK 180

Query: 284 TDEEMARKLHRAINSSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTP 343
            DEE+ARKLHRAINSSPRI KNSSGSD +GHKHKK K S  SEK  VSN G+  E  P  
Sbjct: 181 KDEELARKLHRAINSSPRISKNSSGSDGKGHKHKKPKISSSSEKTRVSNGGVVLEQTPAS 240

Query: 344 TCNGHAKTNEADSECSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINV 403
           T NGHA   E + E   +E +K K DE T  Y++  QS +D GEA   Q E+KT DD   
Sbjct: 241 TSNGHAAAGEVNLESPTRESHKNKVDENTCTYDRAGQSDIDNGEAVFGQPEEKTWDDTLA 300

Query: 404 TGKKKGRVKLKKLSLSICSFRDQKALKEDMDNGSSPILAVQNTGSPTPENVILHSVDSLS 463
            GKK+GRVKLKKL LS C+FRDQ   KE+  +     L   N G+PT  N+ L S++   
Sbjct: 301 GGKKRGRVKLKKLPLSTCAFRDQANPKEEAADIRKTPLTDMNMGNPTAGNIPLFSME--P 360

Query: 464 PTQGVLPIDSTSLWKCQEFKAPLSVEQNKVMQS 483
               V+P ++ S+WK QEFK P  V+QNKVMQS
Sbjct: 361 SADNVMPQEAKSVWKFQEFKGPECVKQNKVMQS 386

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L0X5_CUCSA4.9e-16684.70Uncharacterized protein OS=Cucumis sativus GN=Csa_3G000190 PE=4 SV=1[more]
M5W8H2_PRUPE4.8e-10560.47Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007189mg PE=4 SV=1[more]
W9RW22_9ROSA1.1e-10460.56Uncharacterized protein OS=Morus notabilis GN=L484_007067 PE=4 SV=1[more]
A0A061DFG5_THECC1.8e-9957.36Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_000059 PE=4 SV=1[more]
A0A067L991_JATCU2.5e-9857.97Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G35510.16.5e-4540.00 unknown protein[more]
AT2G17540.25.4e-2332.11 unknown protein[more]
AT5G66000.14.0e-1041.38 unknown protein[more]
Match NameE-valueIdentityDescription
gi|659095199|ref|XP_008448452.1|9.7e-16885.49PREDICTED: uncharacterized protein LOC103490642 [Cucumis melo][more]
gi|778674564|ref|XP_011650244.1|7.0e-16684.70PREDICTED: uncharacterized protein LOC101214022 [Cucumis sativus][more]
gi|1009143995|ref|XP_015889560.1|4.8e-10662.63PREDICTED: uncharacterized protein LOC107424309 [Ziziphus jujuba][more]
gi|595818494|ref|XP_007204359.1|6.9e-10560.47hypothetical protein PRUPE_ppa007189mg [Prunus persica][more]
gi|703125204|ref|XP_010103249.1|1.5e-10460.56hypothetical protein L484_007067 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG08g12820.1Cp4.1LG08g12820.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35477FAMILY NOT NAMEDcoord: 105..482
score: 8.0E