Cp4.1LG08g05140 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g05140
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function (DUF789)
Location: Cp4.1LG08 : 1074670 .. 1078786 (-)
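
The location string above packs the sequence name, start, end, and strand into one field. A minimal Python sketch of how it could be unpacked follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as is common in GFF-style annotation); the page itself does not state this.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts.

    Assumption: START and END are 1-based, inclusive coordinates,
    so the feature length is END - START + 1.
    """
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG08 : 1074670 .. 1078786 (-)")
print(loc)
```

Under that assumption the feature spans 4117 bp on the minus strand of Cp4.1LG08.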
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGGGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCAACAGCAGCAGCAGCAGCAGCAGAAGCAAACTGCCTTGGATTCTAAGGAGGTTGCTGCTGCTACTACTGCGAGGATCGATGAGTTGGAGAAGAGTGAGGTTGATGAGTGTCGTTCTTGGTCCACTCGCTCGGATTGCTCTGTTTCTGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCTTGGAGTACACTACTCCCGTTGTTCCGGCTCAATGTTTTTCTCAATGTGAAATTTGGAGAACTCTCCATTTTTTTTTTTTTTNTTCGTGGTTTGGTTTTTAGGTCAATTGTTTCATTTTTTTTTGGTGGACTTGGAAGTGCCTTATTATGATGGAAATTGATGTTGGTTAGTCTTCGCGTATGAATTTCTGCGATTATGGTTCTTGTAATTATTAGGTTCAGGAAGATCTTATGGTTTAGTTGATTTATCTTGACGAAGATGTTGGAATCTTCCTTTTCGTTTTGCGTTTTATCTTCTTTGTTTTTCTTCAATGGATTTCTTTAGTATTATTCTTAAGGCCCCTCCCCCTTCTCTTTTGTGCTTCTACATTGTCTACTTGATCACTTTAACTTTAGTGTTAGTTTCTTCTTTTAATTTCCTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTGTCTCACTATGATATCAACGTCCTGTTAAGTTGCATACTGATCTTTTCTAAGTTGCTTTTATTTGGAATCTGAATTTACTTCAAATTTTCAAATCATTAAATTGCTCCGATGTTGGCCTTAGACGAGCCTAAAGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATACGGAGCGGGTATCCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCGGGCATACAACTCTATGTAGATCCATCAAAATCCTCTGCCTTAAGGTGAGGATTATTCATTGAGCTTCATGTTCTCTTATTTCTTATTCGGTTCTAGCAACGTATTTTGTTAACTTCCCCTTCTATTCTCATTTCCTATCTAATTATGATTTGTGATTACATTACATTCCTAGGAATGCTAGCTTCATTGTATTTATTGTTCTGCATCCTGATGCAATATGCAGATCTTTTTAATATTTAGTTCGTAAGCTTGTTATTGCCCTGGGATCTTCAATTGTGTTGAAAGGTGTTGGGTATTGATATATATGTATTATATATCACTCATCTTATGTGGATGTGTATTGAACAAATCCACTTTGGGTTTCACAATATATGCATATAATAGCTTGAATGTGTTGGTAATTTTGTTTTCCCCTCTTTCATATAATTATTTCTTTGCACAATGAATGTGTTGGCCATCATTTGTGGTTGAAAATATAGCAGTTGGCATGTCTTTATATGAGTTCATCATTTGTCTGAGACTAACTAACCTACTTTGCCGTTAATTATTTCACCGTCTCATTCAACTTCCTTTTTTATTTTCTTCTATTAAGCTTTTTCCTTACTGATCATTTATGTTTTCTTTTACAAAAATGCCTTATCCTAAATTATTTGTGAATAATAGATTTATTAATATGGATCTGGGGTTATTATTTTTTAGCTGGTAACCATTGCAAGGTCATGGGTTTTCTGGGTATCGTGAATTGGTATCATCATGTGTGATATTTACATGAAGTGTTGTTTAAAATTGAGTATATGGGA
GTAGCAAGTGAATTTGTAATTTGGTACTGTTTTTTGTAGTAGAAGGCGTGGTGCTGATAGTGATGCTGAGTCCTCAAAGGAAACAACTAGTGATGGAAGCAGTAATTGTGGGATGGAAAAAAAAACTAGTACGGCTCTTCAGGATGAGTGGATACAGGACTCCAGTGTTACTGGGTCACAGAGAGCGCTTCAAATGAATGTACCCTCTGCCGAGTCGTCAAGTGATGAAAGTGACTCTTGCTACCGTCAAGGTCAGCTTGTGTTCGAATACATGGAGCTTGATCCACCATTTTGTCGTGAACCACTAACTGATAAGGTGGGTGTATTTTTATTTAGGTGTAGTAGATGAGAGCACTCTTCATTTTTACTAACGTTAACTCGCATTCGACAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAAACATATAGAAGCTGTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATGTCTTGTCCCCTCTGGATTATATGTTACTCTATGATCCAGATCTGTTACTTTGTTCTGACATGGTAATTTTTTCTCTGAAGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTTGATGCTTGCTTTTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGTAGTTCCTCGCTTTATTTATTTTTATTTTTTTTGTTATGCATATATGTCATCAATCAATTCTCTGGGCCAGTTTATATTCTATGTGCCTTTTGTTCTGTTTAGTTTAAAATCAAAGTGGATTGGTTTATGGTAAATTGATTTCAATGGCTACAGTCATGTGTAGTTAGGTAAGATTGTAGTTTCGATCACCTGTAATGGCTGTATTATGGGAAAACATCAGCTGTGGCAGGAAGATTGAGTAGTATTTGGCTATAAAAAAGTTTATCGATTTAGTGTTCGGAATATAGTTTTAAAAACCTATTTTGTTAGTTAAGGCCGCAATATGCCTGTAAGACAATCAACTCATAACTTGCAGTTAACTTGTGCTATTGTGTACGAACTTCATTTACAAGATAGACATTGTAAGAACATTTTCAAATGGACATCATGAGGCTTGTGTTTGGAACAGGCATCGGCACCGACGGGTTGCAATTCCATTGGCCAAGAGTTCGAGAGGTGCACACTGCAAATTTGCCTCTCAAACTACAGTTGCCAACATTTGGACTTGCTTCCTACAAGTTCAAAATTTCGTTTTGGAATTCAACTGGTGTGGAGGAATGTCCAAAGGCTAACACATTGTGGCAAGATGCCGACAACTGGCTCAGGTCATTAAACGTGAACCATCCCGATTACAGATTTTTTGCATCTCATACTTCGTCCGGGAGATGATAAGGAAAAGAATGTTATACATGATGCTCAAATGTGGGATTACCAAAGAAACTCGCTTCTTCGAAGTGTCGTGAAAATTTATATGGGATCTATGGCATTTTCTGAATTTCTATATGTTGACCATATAAAGGGTTTTGTACAGGGAGAATGATGTTGATAGTAACATTACGGTTGAAATGAGGATTGGCTTTAGGATATAGTTATTATTCTGGGAGCGTCAATCCTCTCCCCCTATGGTGGTACTAGTCTCTATTCTTCCTTCATTGTGTTGGGATATGAGTGATGATGGGGATTTGATGATGGTTGTGGGCAACAACAGAGAAACTGCTGTCAATAACAACACCCCGCCCTGCCCTGCCCCGCCCCGCCCCGTCCCTTTTTTGCTCTTTTTACTTCAAATTGTGGGTTAGGTCTGTAACTTTCACTGCCCCCACGATATTGGTGAAAAGTATATCTAGTATGTTAACTGCAATGTCGCTGTGTATTAACAGAGTCTTTATCAGTTGATGTGATAAAGTGTAATTGGTTCTCCGTTTTAAGAGTATGTTGAATACATTTTCTACGTTTTCATTATTATGTAAACAGATGGAAGCCCACCACTTATGGATGGTGTCATGATTGTTGTGTTTGAAACTC
TTATATATAACAACGATCCATCATGTTTGAGATGGATATTTCAAGGTTTGTAGGGTGGTGTCTATACCTGAAAAAGTTTGAAGTGGTGTGTAGTAGCTGAATTTGGTTTTTACCATG

mRNA sequence

ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGGGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCAACAGCAGCAGCAGCAGCAGCAGAAGCAAACTGCCTTGGATTCTAAGGAGGTTGCTGCTGCTACTACTGCGAGGATCGATGAGTTGGAGAAGAGTGAGGTTGATGAGTGTCGTTCTTGGTCCACTCGCTCGGATTGCTCTGTTTCTGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCTTGGAGTACACTACTCCCGTTACGAGCCTAAAGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATACGGAGCGGGTATCCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCGGGCATACAACTCTATGTAGATCCATCAAAATCCTCTGCCTTAAGAAGGCGTGGTGCTGATAGTGATGCTGAGTCCTCAAAGGAAACAACTAGTGATGGAAGCAGTAATTGTGGGATGGAAAAAAAAACTAGTACGGCTCTTCAGGATGAGTGGATACAGGACTCCAGTGTTACTGGGTCACAGAGAGCGCTTCAAATGAATGTACCCTCTGCCGAGTCGTCAAGTGATGAAAGTGACTCTTGCTACCGTCAAGGTCAGCTTGTGTTCGAATACATGGAGCTTGATCCACCATTTTGTCGTGAACCACTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAAACATATAGAAGCTGTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTTGATGCTTGCTTTTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGCATCGGCACCGACGGGTTGCAATTCCATTGGCCAAGAGTTCGAGAGGTGCACACTGCAAATTTGCCTCTCAAACTACAGTTGCCAACATTTGGACTTGCTTCCTACAAGTTCAAAATTTCGTTTTGGAATTCAACTGGTGTGGAGGAATGTCCAAAGGCTAACACATTGTGGCAAGATGCCGACAACTGGCTCAGGTCATTAAACGTGAACCATCCCGATTACAGATTTTTTGCATCTCATACTTCGTCCGGGAGATGATAAGGAAAAGAATGTTATACATGATGCTCAAATGTGGGATTACCAAAGAAACTCGCTTCTTCGAAGTGTCGTGAAAATTTATATGGGATCTATGGCATTTTCTGAATTTCTATATGTTGACCATATAAAGGGTTTTGTACAGGGAGAATGATGTTGATAGTAACATTACGGTTGAAATGAGGATTGGCTTTAGGATATAGTTATTATTCTGGGAGCGTCAATCCTCTCCCCCTATGGTGGTACTAGTCTCTATTCTTCCTTCATTGTGTTGGGATATGAGTGATGATGGGGATTTGATGATGGTTGTGGGCAACAACAGAGAAACTGCTGTCAATAACAACACCCCGCCCTGCCCTGCCCCGCCCCGCCCCGTCCCTTTTTTGCTCTTTTTACTTCAAATTGTGGGTTAGGTCTGTAACTTTCACTGCCCCCACGATATTGGTGAAAAGTATATCTAGTATGTTAACTGCAATGTCGCTGTGTATTAACAGAGTCTTTATCAGTTGATGTGATAAAGTGTAATTGGTTCTCCGTTTTAAGAGTATGTTGAATACATTTTCTACGTTTTCATTATTATGTAAACAGATGGAAGCCCACCACTTATGGATGGTGTCATGATTGTTGTGTTTGAAACTCTTATATATAACAACGATCCATCATGTTTGAGATGGATATTTCAAGGTTTGTAGGGTGGTGTCTATACCTGAAAAAGTTTGAAGTGGTGTGTAGTAGCTGAATTTGGTTTTTACCATG

Coding sequence (CDS)

ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGGGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCAACAGCAGCAGCAGCAGCAGCAGAAGCAAACTGCCTTGGATTCTAAGGAGGTTGCTGCTGCTACTACTGCGAGGATCGATGAGTTGGAGAAGAGTGAGGTTGATGAGTGTCGTTCTTGGTCCACTCGCTCGGATTGCTCTGTTTCTGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCTTGGAGTACACTACTCCCGTTACGAGCCTAAAGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATACGGAGCGGGTATCCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCGGGCATACAACTCTATGTAGATCCATCAAAATCCTCTGCCTTAAGAAGGCGTGGTGCTGATAGTGATGCTGAGTCCTCAAAGGAAACAACTAGTGATGGAAGCAGTAATTGTGGGATGGAAAAAAAAACTAGTACGGCTCTTCAGGATGAGTGGATACAGGACTCCAGTGTTACTGGGTCACAGAGAGCGCTTCAAATGAATGTACCCTCTGCCGAGTCGTCAAGTGATGAAAGTGACTCTTGCTACCGTCAAGGTCAGCTTGTGTTCGAATACATGGAGCTTGATCCACCATTTTGTCGTGAACCACTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAAACATATAGAAGCTGTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTTGATGCTTGCTTTTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGCATCGGCACCGACGGGTTGCAATTCCATTGGCCAAGAGTTCGAGAGGTGCACACTGCAAATTTGCCTCTCAAACTACAGTTGCCAACATTTGGACTTGCTTCCTACAAGTTCAAAATTTCGTTTTGGAATTCAACTGGTGTGGAGGAATGTCCAAAGGCTAACACATTGTGGCAAGATGCCGACAACTGGCTCAGGTCATTAAACGTGAACCATCCCGATTACAGATTTTTTGCATCTCATACTTCGTCCGGGAGATGA

Protein sequence

MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPVTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR
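
The protein sequence above appears to be the conceptual translation of the CDS. As a rough check, the following Python sketch translates a nucleotide string with the standard genetic code (NCBI translation table 1), stopping at the first stop codon. The function name `translate` is hypothetical, and using the standard table is an assumption; the page does not say which table the database applied.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X for codons with N etc.
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGTCAGTCTCCGGTGGTGTTTCGATTGCC"))
print(translate("TCGTCCGGGAGATGA"))
```

The two fragments translate to MSVSGGVSIA and SSGR, which agree with the start and end of the protein sequence listed on this page.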
BLAST of Cp4.1LG08g05140 vs. TrEMBL
Match: A0A0A0L5V4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G165600 PE=4 SV=1)

HSP 1 Score: 710.3 bits (1832), Expect = 1.4e-201
Identity = 364/424 (85.85%), Positives = 384/424 (90.57%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAAT 60
           MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ       KQ+ALDSK+V AA 
Sbjct: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQP------KQSALDSKDVVAAA 60

Query: 61  TARIDELEK-SEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV--------TSLK 120
           T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ADSTNLDRFLE+TTP+        TSL+
Sbjct: 61  TSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLR 120

Query: 121 GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180
           GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP
Sbjct: 121 GWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180

Query: 181 SKSSAL-RRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMN 240
           SKSSAL RRRGADSDAESSKET+SDGSSN G EKKT TALQ+EWIQD +V GSQRALQMN
Sbjct: 181 SKSSALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMN 240

Query: 241 VPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSP 300
           VPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRF ELKTYRSCDLSP
Sbjct: 241 VPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSCDLSP 300

Query: 301 SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANL 360
           SSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHWPRVREV+TA+ 
Sbjct: 301 SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADC 360

Query: 361 PLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHT 415
           PLKLQLP FGLASYKFKI FWNSTG EEC KA++LWQDAD+WLR LNVNHPDYRFFASH 
Sbjct: 361 PLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN 418
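
The percentages in each HSP summary line are the reported match counts divided by the alignment length. A small Python sketch of that arithmetic is given here; the helper name `hsp_percentages` is hypothetical, and it simply reads every `count/length` fraction in the line, so it does not depend on the field labels.

```python
import re

def hsp_percentages(summary_line):
    """Return each 'count/length' fraction in the line as a percentage string."""
    fractions = re.findall(r"(\d+)/(\d+)", summary_line)
    return [f"{100 * int(n) / int(d):.2f}" for n, d in fractions]

line = "Identity = 364/424 (85.85%), Positives = 384/424 (90.57%), Query Frame = 1"
print(hsp_percentages(line))
```

For the hit above, 364 identical positions over an alignment of 424 columns gives 85.85%, and 384 positive-scoring positions gives 90.57%, matching the values shown.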

BLAST of Cp4.1LG08g05140 vs. TrEMBL
Match: F6H6U4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0077g01510 PE=4 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 1.7e-130
Identity = 261/434 (60.14%), Positives = 309/434 (71.20%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----QQQKQTALDSKE 60
           MS  GGVSI+R R  +RFY PPA+RR  QQQQQQQQQQQQQQQ     QQQ+Q  +  ++
Sbjct: 1   MSGQGGVSISRSRNGHRFYDPPAVRRNQQQQQQQQQQQQQQQQHQHQQQQQQQQMVHRQQ 60

Query: 61  VAAATT--ARIDELEKSEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV------ 120
           V       A  +   ++E+D+C S      CSVS +G+ DSTNLDRFLEYTTPV      
Sbjct: 61  VQKPLKHKAAAEPESRTELDDCSS-----TCSVSHQGLGDSTNLDRFLEYTTPVVPAQHF 120

Query: 121 --TSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGI 180
             TS++GWRN + +E  PYF+LGDLWESFKEWSAYGAG+PLLLNGS+SVVQYYVPYLSGI
Sbjct: 121 PKTSMRGWRNHQ-TEFHPYFILGDLWESFKEWSAYGAGVPLLLNGSESVVQYYVPYLSGI 180

Query: 181 QLYVDPSKSSA-LRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQ----DSSV 240
           QLY+DPSK S  LRR G +SDAESS+ET+SDGS++ G+E+  ++  Q  W Q    D + 
Sbjct: 181 QLYIDPSKPSPRLRRPGEESDAESSRETSSDGSADYGVERGANSVAQGPWSQQKLADVNT 240

Query: 241 TGSQRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPEL 300
               R    N P   S S E +     G L+FEY+E D P+ REPL DKI++LAS+FPEL
Sbjct: 241 QNMNRLSLRNKPFMGSPSGEGEVSNPPGLLLFEYLEQDSPYGREPLADKISVLASKFPEL 300

Query: 301 KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWP 360
           +TYRSCDL PSSW+SVAWYPIYRIP GPTLQ+LDACFLTFHSLST FQ   TD L F   
Sbjct: 301 RTYRSCDLLPSSWVSVAWYPIYRIPMGPTLQNLDACFLTFHSLSTPFQSANTDWLDFRGS 360

Query: 361 RVREVHTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNH 415
            V+EVH A +P KL L   GLA YKFK+S WN  GV ECPKAN+L + ADNWLRSL VNH
Sbjct: 361 SVQEVHGAGMPFKLSLSILGLAFYKFKVSVWNHNGVNECPKANSLLRAADNWLRSLQVNH 420

BLAST of Cp4.1LG08g05140 vs. TrEMBL
Match: V4S739_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10005076mg PE=4 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 4.4e-123
Identity = 247/425 (58.12%), Positives = 301/425 (70.82%), Query Frame = 1

Query: 1   MSVSGGVSIARI-RGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAA 60
           MS S  V I R  RGENRFY+PP +R+       QQ  ++QQ+++++KQ   + +E    
Sbjct: 1   MSGSRTVPIGRGGRGENRFYNPPHVRK-------QQLLRRQQEEEEEKQRVQEERERLRL 60

Query: 61  TTARIDELEKSEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV--------TSLK 120
                 E   +E DEC S S             +STNLDRFLEYTTPV        TS++
Sbjct: 61  EKLSCSEKRTTESDECESISN---------SFGESTNLDRFLEYTTPVVPSQYPPKTSVR 120

Query: 121 GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180
            WR++E +E  PYFVLGDLWESFKEWSAYGAG+PLLL+G++SV QYYVPYLSGIQLY+DP
Sbjct: 121 RWRSQE-TEFQPYFVLGDLWESFKEWSAYGAGVPLLLDGNESVKQYYVPYLSGIQLYIDP 180

Query: 181 SK-SSALRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQ----DSSVTGSQRA 240
           S+ SS LRR G +SDAESS+ET+SDGSS+ G E++ +  +   W Q    D+++      
Sbjct: 181 SRPSSRLRRPGEESDAESSRETSSDGSSDTGAERRINAFVHGTWSQQNIADANIQNFSGL 240

Query: 241 LQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC 300
              N P   SSS+ES+ C   GQL+FEY+E DPPF REPL DKI ILASRFPEL+TY+SC
Sbjct: 241 SLRNKPFVGSSSEESEICNPPGQLIFEYLEHDPPFSREPLADKIGILASRFPELRTYKSC 300

Query: 301 DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVH 360
           DL PSSWISVAWYPIYRIPTGPTLQ+LDACFLTFHSLST FQ   ++GL F    VREVH
Sbjct: 301 DLLPSSWISVAWYPIYRIPTGPTLQNLDACFLTFHSLSTPFQSTSSEGLHFPNLTVREVH 360

Query: 361 TANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFF 412
           +A++  KL LPTFGLASYKFK+SFWN  GV EC KAN+L + ADNWL+ L V+HPDY+FF
Sbjct: 361 SADMLYKLPLPTFGLASYKFKVSFWNCNGVYECQKANSLLRAADNWLKLLQVSHPDYKFF 408

BLAST of Cp4.1LG08g05140 vs. TrEMBL
Match: B9RZK3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0999300 PE=4 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 1.3e-122
Identity = 256/429 (59.67%), Positives = 306/429 (71.33%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAAT 60
           MS SGGVSIAR RGENRFY  P MR      Q+QQQQQQQQQQ+ Q+Q  L SK      
Sbjct: 1   MSGSGGVSIARTRGENRFYVSPGMR------QKQQQQQQQQQQKPQQQRPLISK------ 60

Query: 61  TARIDELEKSEVDECRSWSTR-SDCSVSDR-GV-ADSTNLDRFLEYTTPV--------TS 120
           +  ++  ++ E D+C S S+  S+CSVS R G+  +STNLDRFLEYTTPV        TS
Sbjct: 61  SCMVEVEKRGESDQCGSSSSSVSNCSVSGRVGIEGNSTNLDRFLEYTTPVVPAQYLPKTS 120

Query: 121 LKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYV 180
           ++GWRN E +E  PYFVLGDLWESFKEWSAYGAG+PLLLNGS++V+QYYVPYLSGIQLY+
Sbjct: 121 VRGWRNHE-TEHQPYFVLGDLWESFKEWSAYGAGVPLLLNGSETVMQYYVPYLSGIQLYI 180

Query: 181 DPSKSSA-LRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEW---IQDSSVTGSQR 240
           DP++ S  LRR G +SDAESS+++++DGSS+ G E+  +     +    I D+++    R
Sbjct: 181 DPARPSPRLRRPGEESDAESSRDSSTDGSSDYGAERGVNGVWGPQTQNNITDANIQSLNR 240

Query: 241 ALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRS 300
               N  S  SSSDE +     G+LVFEYME   PF REPL DKI+ L   FP LKT+RS
Sbjct: 241 LSLRNKHSRGSSSDEYEISNPPGRLVFEYMEHASPFTREPLADKISALTCNFPGLKTFRS 300

Query: 301 CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREV 360
           CDL PSSWISVAWYPIYRIP GPTLQ+LDACFLTFHSLST FQ + TD L F+    REV
Sbjct: 301 CDLLPSSWISVAWYPIYRIPMGPTLQNLDACFLTFHSLSTPFQSLNTDWLHFNGSSGREV 360

Query: 361 HTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRF 415
             A++P+KL L TFGLASYKFK+SFWN  G  EC K N+L + ADNWLR L V HPDY F
Sbjct: 361 SCADMPVKLPLATFGLASYKFKVSFWNPNGAYECQKVNSLLRAADNWLRLLQVYHPDYMF 416

BLAST of Cp4.1LG08g05140 vs. TrEMBL
Match: A0A0B0NEZ4_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_15127 PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 6.3e-122
Identity = 255/436 (58.49%), Positives = 305/436 (69.95%), Query Frame = 1

Query: 1   MSVSGGVSIARIRG-ENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAA 60
           MS SGGVSIAR RG E+RFY+PP MR+   QQQ QQQQQQQQQQQQ   T +  +E    
Sbjct: 1   MSGSGGVSIARSRGGESRFYNPPPMRK---QQQLQQQQQQQQQQQQMTTTTMQRREQRPP 60

Query: 61  TTARIDELEKSEVDECR------SWSTRSDCSVSDRGVADSTNLDRFLEYTTPV------ 120
             ++    ++++ ++C       S S+ S  + S     +STNLDRFLE+TTPV      
Sbjct: 61  LISKGSTEKRTDHEDCATLLPSSSSSSSSANNSSKTNYDNSTNLDRFLEFTTPVVSAQYL 120

Query: 121 --TSLKGWRNREVS-EAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG 180
             TS++G R RE   E PPYF L DLWESFKEWSAYGAG+PLLLNGSDSV+QYYVPYLSG
Sbjct: 121 PKTSIRGCRGRERGLECPPYFALKDLWESFKEWSAYGAGVPLLLNGSDSVMQYYVPYLSG 180

Query: 181 IQLYVDPSKSSALRRR-GADSDAESSKETTSDGS-SNCGMEKKTSTALQDEW----IQDS 240
           IQLYVD S+ S  +R  G +SD ESS+ET+SDGS S+ G+ ++ +  +   W    I D+
Sbjct: 181 IQLYVDQSRPSPRQRMPGEESDTESSRETSSDGSDSDYGVARRANNIVPGSWNQLDIADA 240

Query: 241 SVTGSQRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFP 300
           ++         N P   SSSDESD+C   GQL+FEY+E D PF REPL DKI++LAS+FP
Sbjct: 241 NIQRLNTLSLRNRPFGGSSSDESDTCNPLGQLIFEYLEHDQPFSREPLADKISVLASQFP 300

Query: 301 ELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFH 360
            L+TYRSCDLSPSSWISVAWYPIYRIP GPTLQ+LDACFLT+HSLST     GTDGL F 
Sbjct: 301 ALRTYRSCDLSPSSWISVAWYPIYRIPMGPTLQNLDACFLTYHSLSTPLPCNGTDGLPFR 360

Query: 361 WPRVREVHTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNV 415
              VRE H A++ LKL LPTFGLA YKFK+S WN  GV E  KAN+L Q ADNWLR L V
Sbjct: 361 GFNVREFHDADMSLKLPLPTFGLAFYKFKVSVWNPDGVNESQKANSLLQAADNWLRLLQV 420

BLAST of Cp4.1LG08g05140 vs. TAIR10
Match: AT4G16100.1 (AT4G16100.1 Protein of unknown function (DUF789))

HSP 1 Score: 337.8 bits (865), Expect = 9.4e-93
Identity = 207/420 (49.29%), Positives = 275/420 (65.48%), Query Frame = 1

Query: 11  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAATTARIDELEKS 70
           RIRGENRFY+PP MR+  Q++++++ + ++ +++++K   +  +++      +++E E  
Sbjct: 6   RIRGENRFYNPPPMRKLQQEREKKRLEAEEIEKEKKKAKEILDRKI------KVEEKEIK 65

Query: 71  EVDECRSWSTRSDCSVSDRGVADST-------NLDRFLEYTTPV--------TSLKGWRN 130
           + +EC +    SDCSV  R  + +T       NL RFL+ TTP+        TS KGWR 
Sbjct: 66  QPEECST----SDCSVPSRVSSTTTTTGTTSSNLGRFLDCTTPIVSTQHLPLTSSKGWRT 125

Query: 131 REVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS 190
           RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++ 
Sbjct: 126 RE-PEYRPYFLLNDLWDSFEEWSAYGVGVPLLLNGIDSVVQYYVPYLSGIQLYEDPSRAC 185

Query: 191 ALRRR-GADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMNVPSA 250
             RRR G +SD +S ++ +SDGS++C   ++ S  L              RA     P  
Sbjct: 186 TTRRRVGEESDGDSPRDMSSDGSNDC---RELSQNLY-------------RASLEEKPCI 245

Query: 251 ESSSDESD-SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSW 310
            SSSDES+ S    G+LVFEY+E   PF REPLTDKI+ L+S+FP L+TYRSCDLSPSSW
Sbjct: 246 GSSSDESEASSNSPGELVFEYLEGAMPFGREPLTDKISNLSSQFPALRTYRSCDLSPSSW 305

Query: 311 ISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLK 370
           +SVAWYPIYRIP G +LQ+LDACFLTFHSLST  +G   +  Q      + V +A LP  
Sbjct: 306 VSVAWYPIYRIPLGQSLQNLDACFLTFHSLSTPCRGTSNEEGQ---SSSKSVASAKLP-- 365

Query: 371 LQLPTFGLASYKFKISFWN-STGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTSS 413
             LPTFGLASYKFK+S W+  + V+E  +  TL + A+ WLR L V  PD+R F SH+ S
Sbjct: 366 --LPTFGLASYKFKLSEWSPESDVDENQRVGTLLRTAEEWLRRLKVILPDFRHFISHSGS 391

BLAST of Cp4.1LG08g05140 vs. TAIR10
Match: AT5G49220.1 (AT5G49220.1 Protein of unknown function (DUF789))

HSP 1 Score: 325.9 bits (834), Expect = 3.7e-89
Identity = 216/443 (48.76%), Positives = 266/443 (60.05%), Query Frame = 1

Query: 1   MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAA 60
           MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ +++Q++  + +  +D +   A
Sbjct: 1   MSGSGGVSIARTAIRGENRFYNPPPMRRMQQEAQLQQQIREKQRRDDEDEVLMDKERRKA 60

Query: 61  ATTARIDELEKSEVDECRSWSTRSDCSV----------SDRGVADSTNLDRFLEYTTPVT 120
           AT A     +   V E +S    S   V          S R ++D +NLDRFLE+TTPV 
Sbjct: 61  ATVAPRTTRKGLGVSESKSRVVVSGSEVCAGSSDSSSGSGRVLSDGSNLDRFLEHTTPVV 120

Query: 121 SLK------GW--RNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLN-----GSDSVV 180
             +       W  + RE S+   YFVL DLWESF EWSAYGAG+PL ++     G+DS V
Sbjct: 121 PARLFPMRSRWELKTRE-SDCHTYFVLEDLWESFAEWSAYGAGVPLEMHPLEMHGNDSTV 180

Query: 181 QYYVPYLSGIQLYVDPSKSSALRRRGADSDAESSKETTSDGSSNCGMEKKTSTA--LQDE 240
           QYYVPYLSGIQLYVDP K    + R    D E S    S+GSSN        +   L   
Sbjct: 181 QYYVPYLSGIQLYVDPLK----KPRNPVGDNEGS----SEGSSNSRTLPVDLSVGELNRI 240

Query: 241 WIQDSSVTGSQRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITIL 300
            ++D S+TGS             SS E++    QG+L+FEY+E +PPF REPL +KI+ L
Sbjct: 241 SLKDQSITGSL------------SSGEAEISNPQGRLLFEYLEYEPPFGREPLANKISDL 300

Query: 301 ASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTA--FQGIG 360
           ASR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFHSLSTA     +G
Sbjct: 301 ASRVPELMTYRSCDLLPSSWVSVSWYPIYRIPVGPTLQNLDACFLTFHSLSTAPPQSAMG 360

Query: 361 TDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADN 415
               Q                KL LPTFGLASYK K+S WN   ++E  K  +L Q AD 
Sbjct: 361 CSDSQ-------------PSTKLPLPTFGLASYKLKVSVWNQNRIQESQKMTSLLQAADK 409

BLAST of Cp4.1LG08g05140 vs. TAIR10
Match: AT1G15030.1 (AT1G15030.1 Protein of unknown function (DUF789))

HSP 1 Score: 281.2 bits (718), Expect = 1.0e-75
Identity = 167/361 (46.26%), Positives = 219/361 (60.66%), Query Frame = 1

Query: 66  ELEKSEVDE---CRSWSTRSDCSVSDR-----GVADSTNLDRFLEYTTPV--------TS 125
           +L+++++D    CRS  T+   + S         A S+N++RFL+  TP         T 
Sbjct: 10  QLQRAQIDVSYGCRSSHTKDRENGSALLKHHVSEASSSNVERFLDSVTPSVPAHYLSKTI 69

Query: 126 LKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY 185
           ++     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y
Sbjct: 70  VRERGGSDVESQVPYFLLGDVWESFAEWSAYGIGVPLTLNNNKDRVFQYYVPSLSGIQVY 129

Query: 186 VDP---SKSSALRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQR 245
            D    + S   RR+G +S+++  ++++S+GSS+   E +       E I       S R
Sbjct: 130 ADVDALTSSLQARRQGEESESDF-RDSSSEGSSS---ESERGLCYSKEQISARMDKLSLR 189

Query: 246 ALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRS 305
                    +SSSD+ +    QG+L+FEY+E D P+ REP  DK++ LASRFPELKT RS
Sbjct: 190 KEHQE----DSSSDDGEPLSSQGRLIFEYLERDLPYVREPFADKMSDLASRFPELKTLRS 249

Query: 306 CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREV 365
           CDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+HSL T FQG G      H  + RE 
Sbjct: 250 CDLLPSSWFSVAWYPIYKIPTGPTLKDLDACFLTYHSLHTPFQGPGVTTGSMHVVQPRES 309

Query: 366 HTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRF 407
                  K++LP FGLASYK + S W S G      AN+L+Q ADNWLR   VNHPD+ F
Sbjct: 310 VE-----KMELPVFGLASYKLRGSVWTSFGGSGHQLANSLFQAADNWLRLRQVNHPDFIF 357

BLAST of Cp4.1LG08g05140 vs. TAIR10
Match: AT2G01260.1 (AT2G01260.1 Protein of unknown function (DUF789))

HSP 1 Score: 271.9 bits (694), Expect = 6.3e-73
Identity = 162/363 (44.63%), Positives = 215/363 (59.23%), Query Frame = 1

Query: 63  RIDELEKSEVDECRSWSTRSDCSVSDRGVAD--STNLDRFLEYTTPV--------TSLKG 122
           RID+L +++ D     S+           +D  S+NLDRFLE  TP         T L+ 
Sbjct: 30  RIDQLRRAQSDVSNVPSSAPSPHKQQLEPSDLSSSNLDRFLESVTPSVPAQFLSKTLLRE 89

Query: 123 WR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVD 182
            R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y  
Sbjct: 90  RRADDDYNKLVPYFVLGDIWDSFAEWSAYGTGVPLVLNNNKDRVIQYYVPSLSAIQIYAH 149

Query: 183 PSK--SSALRRRGADSDAESSKETTSDGSSNCGMEKKTST----ALQDEWIQDSSVTGSQ 242
                SS   RR  DS     ++++SD SS+   E+ ++     +L+D+  +DSS     
Sbjct: 150 SHALDSSLKSRRPGDSSDSDFRDSSSDVSSDSDSERVSARVDCISLRDQHQEDSS----- 209

Query: 243 RALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYR 302
                        SD+ +    QG+L+FEY+E D P+ REP  DK+  LA++FPEL T R
Sbjct: 210 -------------SDDGEPLGSQGRLMFEYLERDLPYIREPFADKVLDLAAQFPELMTLR 269

Query: 303 SCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD-GLQFHWPRVR 362
           SCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G G++  +    PR  
Sbjct: 270 SCDLLRSSWFSVAWYPIYRIPTGPTLKDLDACFLTYHSLHTSFGGEGSEQSMSLTQPRES 329

Query: 363 EVHTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDY 407
           E        K+ LP FGLASYKF+ S W   G  E    N+L+Q AD WL S +V+HPD+
Sbjct: 330 E--------KMSLPVFGLASYKFRGSLWTPIGGSEHQLVNSLFQAADKWLHSCHVSHPDF 366

BLAST of Cp4.1LG08g05140 vs. TAIR10
Match: AT4G03420.1 (AT4G03420.1 Protein of unknown function (DUF789))

HSP 1 Score: 204.5 bits (519), Expect = 1.2e-52
Identity = 123/332 (37.05%), Positives = 177/332 (53.31%), Query Frame = 1

Query: 95  TNLDRFLEYTTPVTSLKGWRNREVS-----------EAPPYFVLGDLWESFKEWSAYGAG 154
           +NLDRFL  TTPV   +     E+            +   +F L DLW+ + EWSAYGAG
Sbjct: 7   SNLDRFLHCTTPVVPPQSLSKAEIRSLNRIWHPWERQKVEFFRLSDLWDCYDEWSAYGAG 66

Query: 155 IPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRRRGADSDAESSKETTSDGSSNCGME 214
           +P+ L+  +S+VQYYVPYLS IQ++   S+SS +R R    D E S+++ SD  S+    
Sbjct: 67  VPIRLSNGESLVQYYVPYLSAIQIFT--SRSSLIRLRDDSEDGE-SRDSFSDSYSDESES 126

Query: 215 KKTSTALQDEWIQDSSVTGSQRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCR 274
            K S    DE ++  ++      L  N               R G L  +Y E   P+ R
Sbjct: 127 DKLSRCASDEGLEHDAL------LHPN--------------DRLGYLYLQYFERSAPYAR 186

Query: 275 EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSL 334
            PL DKI  LA R+P L + RS DLSP+SW++VAWYPIY IP G T++ L  CFLT+H+L
Sbjct: 187 VPLMDKINELAQRYPGLMSLRSVDLSPASWMAVAWYPIYHIPMGRTIKDLSTCFLTYHTL 246

Query: 335 STAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKISFWNSTG--VEECPK 394
           S++FQ +  +       R+R+         + L  FGLA+YK + + W S     ++  +
Sbjct: 247 SSSFQDMEPEENGGEKERIRKEGEG-----VTLLPFGLATYKMQGNVWLSEDDQGQDQER 306

Query: 395 ANTLWQDADNWLRSLNVNHPDYRFFASHTSSG 414
             +L   AD+WL+ L V H D+ +F+     G
Sbjct: 307 VLSLLSVADSWLKQLRVQHHDFNYFSRMAHRG 310

BLAST of Cp4.1LG08g05140 vs. NCBI nr
Match: gi|659076890|ref|XP_008438917.1| (PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo])

HSP 1 Score: 715.3 bits (1845), Expect = 6.1e-203
Identity = 365/423 (86.29%), Positives = 385/423 (91.02%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAAT 60
           MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ  KQ+ALDSK+V AA 
Sbjct: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQP-KQSALDSKDVVAAA 60

Query: 61  TARIDELEK-SEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV--------TSLK 120
           T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ DSTNLDRFLE+TTP+        TSL+
Sbjct: 61  TSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDRFLEHTTPLVPAHCIPKTSLR 120

Query: 121 GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180
           GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP
Sbjct: 121 GWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180

Query: 181 SKSSALRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMNV 240
           SKS ALRRRGADSDAESSKET+SDGSSN G EKKT TALQ+EWIQD +  GSQRALQMNV
Sbjct: 181 SKSPALRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNV 240

Query: 241 PSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPS 300
           PS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Sbjct: 241 PSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSCDLSPS 300

Query: 301 SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLP 360
           SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHWPRVREV+TA+ P
Sbjct: 301 SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTALQGTSTDGLQFHWPRVREVYTADCP 360

Query: 361 LKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTS 415
           LKLQLP FGLASYKFKI FWNSTG EEC KA++LWQDAD+WLR LNVNHPDYRFFASH S
Sbjct: 361 LKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHNS 420

BLAST of Cp4.1LG08g05140 vs. NCBI nr
Match: gi|778678993|ref|XP_011651067.1| (PREDICTED: uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus])

HSP 1 Score: 714.9 bits (1844), Expect = 8.0e-203
Identity = 364/423 (86.05%), Positives = 384/423 (90.78%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAAT 60
           MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ       KQ+ALDSK+V AA 
Sbjct: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQP------KQSALDSKDVVAAA 60

Query: 61  TARIDELEK-SEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV--------TSLK 120
           T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ADSTNLDRFLE+TTP+        TSL+
Sbjct: 61  TSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLR 120

Query: 121 GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180
           GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP
Sbjct: 121 GWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180

Query: 181 SKSSALRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMNV 240
           SKSSALRRRGADSDAESSKET+SDGSSN G EKKT TALQ+EWIQD +V GSQRALQMNV
Sbjct: 181 SKSSALRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMNV 240

Query: 241 PSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPS 300
           PS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRF ELKTYRSCDLSPS
Sbjct: 241 PSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSCDLSPS 300

Query: 301 SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLP 360
           SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHWPRVREV+TA+ P
Sbjct: 301 SWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADCP 360

Query: 361 LKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTS 415
           LKLQLP FGLASYKFKI FWNSTG EEC KA++LWQDAD+WLR LNVNHPDYRFFASH S
Sbjct: 361 LKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHNS 417

BLAST of Cp4.1LG08g05140 vs. NCBI nr
Match: gi|659076888|ref|XP_008438916.1| (PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo])

HSP 1 Score: 710.7 bits (1833), Expect = 1.5e-201
Identity = 365/424 (86.08%), Positives = 385/424 (90.80%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAAT 60
           MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ  KQ+ALDSK+V AA 
Sbjct: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQP-KQSALDSKDVVAAA 60

Query: 61  TARIDELEK-SEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV--------TSLK 120
           T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ DSTNLDRFLE+TTP+        TSL+
Sbjct: 61  TSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDRFLEHTTPLVPAHCIPKTSLR 120

Query: 121 GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180
           GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP
Sbjct: 121 GWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180

Query: 181 SKSSAL-RRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMN 240
           SKS AL RRRGADSDAESSKET+SDGSSN G EKKT TALQ+EWIQD +  GSQRALQMN
Sbjct: 181 SKSPALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMN 240

Query: 241 VPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSP 300
           VPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRFPELKTYRSCDLSP
Sbjct: 241 VPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSCDLSP 300

Query: 301 SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANL 360
           SSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHWPRVREV+TA+ 
Sbjct: 301 SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTALQGTSTDGLQFHWPRVREVYTADC 360

Query: 361 PLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHT 415
           PLKLQLP FGLASYKFKI FWNSTG EEC KA++LWQDAD+WLR LNVNHPDYRFFASH 
Sbjct: 361 PLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN 420

BLAST of Cp4.1LG08g05140 vs. NCBI nr
Match: gi|778678990|ref|XP_004134231.2| (PREDICTED: uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus])

HSP 1 Score: 710.3 bits (1832), Expect = 2.0e-201
Identity = 364/424 (85.85%), Positives = 384/424 (90.57%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQKQTALDSKEVAAAT 60
           MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ       KQ+ALDSK+V AA 
Sbjct: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQP------KQSALDSKDVVAAA 60

Query: 61  TARIDELEK-SEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV--------TSLK 120
           T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ADSTNLDRFLE+TTP+        TSL+
Sbjct: 61  TSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSLR 120

Query: 121 GWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180
           GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP
Sbjct: 121 GWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDP 180

Query: 181 SKSSAL-RRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQDSSVTGSQRALQMN 240
           SKSSAL RRRGADSDAESSKET+SDGSSN G EKKT TALQ+EWIQD +V GSQRALQMN
Sbjct: 181 SKSSALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNVPGSQRALQMN 240

Query: 241 VPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSP 300
           VPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRF ELKTYRSCDLSP
Sbjct: 241 VPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFSELKTYRSCDLSP 300

Query: 301 SSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANL 360
           SSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHWPRVREV+TA+ 
Sbjct: 301 SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGISTDGLQFHWPRVREVYTADC 360

Query: 361 PLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHT 415
           PLKLQLP FGLASYKFKI FWNSTG EEC KA++LWQDAD+WLR LNVNHPDYRFFASH 
Sbjct: 361 PLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN 418
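
The identity and positives percentages reported in the HSP header for this match are the match counts divided by the alignment length (424 columns, which counts gap positions in either sequence). A minimal sketch of that arithmetic follows; the helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the Cucumis sativus isoform X1 match.
identity = percent(364, 424)
positives = percent(384, 424)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applied to the Vitis vinifera match (261 and 309 out of 434) gives the 60.14% and 71.20% shown in its header.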

BLAST of Cp4.1LG08g05140 vs. NCBI nr
Match: gi|731388295|ref|XP_010649549.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 15 isoform X2 [Vitis vinifera])

HSP 1 Score: 474.2 bits (1219), Expect = 2.4e-130
Identity = 261/434 (60.14%), Positives = 309/434 (71.20%), Query Frame = 1

Query: 1   MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQ-----QQQKQTALDSKE 60
           MS  GGVSI+R R  +RFY PPA+RR  QQQQQQQQQQQQQQQ     QQQ+Q  +  ++
Sbjct: 1   MSGQGGVSISRSRNGHRFYDPPAVRRNQQQQQQQQQQQQQQQQHQHQQQQQQQQMVHRQQ 60

Query: 61  VAAATT--ARIDELEKSEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTTPV------ 120
           V       A  +   ++E+D+C S      CSVS +G+ DSTNLDRFLEYTTPV      
Sbjct: 61  VQKPLKHKAAAEPESRTELDDCSS-----TCSVSHQGLGDSTNLDRFLEYTTPVVPAQHF 120

Query: 121 --TSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGI 180
             TS++GWRN + +E  PYF+LGDLWESFKEWSAYGAG+PLLLNGS+SVVQYYVPYLSGI
Sbjct: 121 PKTSMRGWRNHQ-TEFHPYFILGDLWESFKEWSAYGAGVPLLLNGSESVVQYYVPYLSGI 180

Query: 181 QLYVDPSKSSA-LRRRGADSDAESSKETTSDGSSNCGMEKKTSTALQDEWIQ----DSSV 240
           QLY+DPSK S  LRR G +SDAESS+ET+SDGS++ G+E+  ++  Q  W Q    D + 
Sbjct: 181 QLYIDPSKPSPRLRRPGEESDAESSRETSSDGSADYGVERGANSVAQGPWSQQKLADVNT 240

Query: 241 TGSQRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPEL 300
               R    N P   S S E +     G L+FEY+E D P+ REPL DKI++LAS+FPEL
Sbjct: 241 QNMNRLSLRNKPFMGSPSGEGEVSNPPGLLLFEYLEQDSPYGREPLADKISVLASKFPEL 300

Query: 301 KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWP 360
           +TYRSCDL PSSW+SVAWYPIYRIP GPTLQ+LDACFLTFHSLST FQ   TD L F   
Sbjct: 301 RTYRSCDLLPSSWVSVAWYPIYRIPMGPTLQNLDACFLTFHSLSTPFQSANTDWLDFRGS 360

Query: 361 RVREVHTANLPLKLQLPTFGLASYKFKISFWNSTGVEECPKANTLWQDADNWLRSLNVNH 415
            V+EVH A +P KL L   GLA YKFK+S WN  GV ECPKAN+L + ADNWLRSL VNH
Sbjct: 361 SVQEVHGAGMPFKLSLSILGLAFYKFKVSVWNHNGVNECPKANSLLRAADNWLRSLQVNH 420

The following BLAST results are available for this feature:
Match Name                        E-value   Identity (%)  Description
A0A0A0L5V4_CUCSA                  1.4e-201  85.85         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G165600 PE=4 SV=1
F6H6U4_VITVI                      1.7e-130  60.14         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0077g01510 PE=4 SV=...
V4S739_9ROSI                      4.4e-123  58.12         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10005076mg PE=4 SV=1
B9RZK3_RICCO                      1.3e-122  59.67         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0999300 PE=4 SV=1
A0A0B0NEZ4_GOSAR                  6.3e-122  58.49         Uncharacterized protein OS=Gossypium arboreum GN=F383_15127 PE=4 SV=1

Match Name                        E-value   Identity (%)  Description
AT4G16100.1                       9.4e-93   49.29         Protein of unknown function (DUF789)
AT5G49220.1                       3.7e-89   48.76         Protein of unknown function (DUF789)
AT1G15030.1                       1.0e-75   46.26         Protein of unknown function (DUF789)
AT2G01260.1                       6.3e-73   44.63         Protein of unknown function (DUF789)
AT4G03420.1                       1.2e-52   37.05         Protein of unknown function (DUF789)

Match Name                        E-value   Identity (%)  Description
gi|659076890|ref|XP_008438917.1|  6.1e-203  86.29         PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo]
gi|778678993|ref|XP_011651067.1|  8.0e-203  86.05         PREDICTED: uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus]
gi|659076888|ref|XP_008438916.1|  1.5e-201  86.08         PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo]
gi|778678990|ref|XP_004134231.2|  2.0e-201  85.85         PREDICTED: uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus]
gi|731388295|ref|XP_010649549.1|  2.4e-130  60.14         PREDICTED: mediator of RNA polymerase II transcription subunit 15 isoform X2 [Vi...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR008507  DUF789
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g05140.1  Cp4.1LG08g05140.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                     Source   Source Term    Source Description   Alignment
IPR008507  Protein of unknown function DUF789  PFAM     PF05623        DUF789               coord: 95..406; score: 2.8E
None       No IPR available                    unknown  Coil           Coil                 coord: 25..48; scor
None       No IPR available                    PANTHER  PTHR31343      FAMILY NOT NAMED     coord: 4..414; score: 5.2E
None       No IPR available                    PANTHER  PTHR31343:SF2  SUBFAMILY NOT NAMED  coord: 4..414; score: 5.2E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG08g05140  Cp4.1LG03g11610  Cucurbita pepo (Zucchini)  cpecpeB482