Cp4.1LG20g01550 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG20g01550
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionProtein sel-1
LocationCp4.1LG20 : 881822 .. 886886 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGTCGTATCTCTCAGAGCAGGATCTTTCTCGCAAATTCTACAGAGGCTTCTTCTTCTTCTTCCTTCTTCCTTCTTCCTCCTTCTTCGTGTTCTTGAAAAATTCAATCCCAATGCAGATTAGGATGCAACTGCAAGCTCGTAGGCTTCAGTTAATTTTTTTAATTATATGTCTATGTTCTTTTTTCATCAACGCTCGTCCGTTTCTCATTGTCATCTCTCAAGATGACCTCAAGGATGCTGCGTCGCCCGACGACTCCTCCGATTCAGCGAACTACGACTCGGCCGAGTGGGACGAGTTCGGGGAGCCTGAATCTCAAAACGCCGCCCAGGAGCTTGACCCCGGTACTTGGCGCCCGATTTTTGAACCGGATTCTCCCGCCTCTGACCCCGACGCGCCGGAGGATCTGTACTACACTGCGTTAGGGAAGATGATGACAGCGGTGAGCTCTGGTGACCTGAGGTTGATGGAGGATGCGGTAGGGGATATTGATCAAGTTGCGGTGGAGAGTGGGGATCCGCACGCGCAATCGGTGCTAGGGTTATTGTATGGAACGGGGATAATGAAGGAGACGAATAAGGCTAAGGCGTTTATGTATCATTATTTTGCCGCGGAAGGGAATAAGCAATCTAAGATGGCTCTTGCATATAGTTACTTCAGGCAAGAAGTAAGTGCGACCCTGCGAGCTCTTTAGTTATTGAATTGTGTTTAATTAATGAAGTAATTGTTGTTTTATCGGTAGGCATATATGGATTTGTTATTCTTTTACTTGTCGTGTTTCCAATGCGTAATTCTTTCTTGGTTGCAGATGTATGAAAAAGCAGTTAAGCTTTATGCTGAGTTAGCAGAGGTAGCCATTAATAGTTTATTAGTTTCCAAGGATTCTCCAGTAATTGAACCTGTGAGGATCCATAATGGAGCAGAGGAGAACAAGCAGGCCTTGAGAAAGTCTCGAGGAGAGGAGGACGAGGACTTCCAAATTTTGGAATATCAGGCGCAGAAGGGTAATGCTGGAGCCATGTATAGAATTGGGCTATTTTACTACTTTGGACTTAGAGGGTTGAGGCGTGATCATGCAAAAGCATTGATCTGGTTTTCCAAGGCCGTGGATAAAGGTGAACCAAAGTCTATGGAACTACTTGGAGAGATATATGCAAGGGGCGCCGGAGTCGAAAGGGATTATACTAAGGCACTTCAATGGCTAACTCGTGCATCCAAGCATCCATCATTTTCTGCTTATAATGGCATAGGATATTTATACGTTAAGGGTTATGGAGTGGAGAAAAACTTTACCAAGGTAAGGTTCCCATTATTTCTGTTGAAATTGCTATTTTGACTTCGAATTTCTGTCTCCATATGTTGCTGTAATTCTTGCCTCGGAGCTGTCTTAATTAGATACCATTAACTTAACCCACTATACCTCAGGCGTTACATTTTCCTGAATCATAAGTGAACTAGAGACTATTTGATTTAATGGTGTATTCAATTCGTATGGCAAGCTGAACGGTTGTTAGTCTCTTGTGCCCGTAGAAATCTGGTGGCTTATTTTCGTTTTTATGTTCTATTAGTTGAATTGCGCTTACCTATCAATTGTTGTCACCCTACCTTAAATGTGTGTATTTTAGTGGTTGTCTCTTGTTCTATTGACTGTTCATAGTAGTATACTATATTCTCAGAAGTTCTGTGCTTATCTTAGATTGTTTTAGCTTTATTATTTTGTCAGCCCCTAACTTCTGTTACCTTAAAAGATAGTTTTTTTTTTTTTCCTTTCTTTTTTGCAATGTCTGCCCTTTTGAATCTCTGATGGGCATTGAATTTTTCTTGGAGGATGTAACATGATCTTGCTCCCATTTTACCACCACCACATGCCCTAGTCGTTGCAAGATTTGGATCTTAATGCAACTTTTGAATGCATATCCTTTTCTTTGTATTTTCATGTTCTAAATGTTGTTTTCTTCAGTATATGTTGATATGGCACCTGAATTACTGCTTGTTTCATTTTTTTTCCCACTTCAATTCTCTGCCTAAATCGTGTATAACATACGTCATCATTGCATGTTCCCCACATGCTACGTCCTCTAATACAGGCCAAGGAGTACTTTGAAAAGGCTGCCAATAACGACGAGTCTGGTGGTCATTATAATCTAGGAGTTATGTATCTTAAAGGAATTGGGGTAAAGAGAGATGTGAAGAAGGCATGTACTCATTTTATAGTGGCTGCAAATGCTGGACAACCAAAGGCATTCTACCAGCTGGCGAAGATGTTTCATACTGGTGTTGGGCTCAAGAGGAACATTCCTATGGTAAATTCACTCAACTACATTTTCAAACCCCCCCCCCCCCCCCCCCCCCCNGAAAGGAGGGAAAGAAAACTTTTTTTGGGTTCTAATAATTGAAAAGTCTTGGTTTTCTGGAGCAAACGTTTTATTAAAAATAAAATCTGAACTTATTATTGGGATCCTAATAGACCACAGATTCTGTAAATAATTGAAGTTTCAACCGTTGCAGGCTAGTGCTTTATACAAATTAGTAGCTGAACGAGGGCCTTGGAGTTCATTGTCTAGATGGGCATTGGAATCATATCTGAAAAGTGATATTGGCAAGGCATTCTTCTTGTACGCAAGAATGGCTGAGCTAGGATATGAGGTGGCACAAAGCAATGCAGCATGGATACTTGACAAATATGGAGAACAAAGCATGTGTCTGGGAGAATCTGGCTTTTGCACAGATGCAGAACTACATCAGAGAGCTCATTCTCTTTGGTGGCAAGCATCCGAGCAGGGCAACGAACACGCTGCTTTACTGATTGGGGATGCATATTACTACGGTCGGGTATGTTCTGAGAGTATTTCTATTGAATCACAAATGTTTCAAAATCATGTGTTTGAGATTTTTTCATTTTTCCACGTTTCCATTTCCACCCAGGGAACCGACGTAGATTATGATCGTGCTGCTGAAGCATACATGCATGCTAAATCCCAACTAAATGCACAAGCCATGTTCAACCTTGGTTACATGCATGAACATGGTCTAGGCCTTCCGTTTGACCTTCACCTAGCCAAGCGTTACTATGACCAAGCTCTGGAACTTGATCCAGCTGCCAGGTTGCCAGTCAAGCTGGCCCTAGCAAGCCTGTGGTTAAGGAAGAATCATGCCGACAGTTTTCTGGTGGGTTCAATGAACTTTTCTTAATTCTTTTCTTGCAATAGTCCATTTTCCTGCCTTATTATAGACGACCTGGATATCAACTGAGGGTGTTTCTGATGTGTGAATTAGGTCCATGTGATCGATTCGTTGCCAGAAGTGTATCCAAAGATTGATATATGGGTGGAGGATGTACTGTTGGAGGAAGGAAATGCAACAATTCTAACTCTATTTGCCTGCCTCCTCACTGTCCTATATCTTAGAGAGCGGCAACGGAGGCATGCAGTTGTACGGGCTGCTGCTGCTGCTGCTGCTGCTGAGGCTGTGCCACTCCACCCCAATGATCACGTGGCTGCACAAAATTAAATCACTGCCTGTGTTGTCTTTTTCTTTTCAATGAGGTATGAGAATCTTTTCTTTAACGTAATGTACAGGAAATTAGAATTCAACTGCTTAGAGAGAGCTTTTGCGTTCTTGGGACTTTCACAACACAGAGTAGAGGAAGAACTTCAAGATGTTTATCACCGGGATCATAGAAAGCAAAGATTTTTATATATTTTTTTTCTGTGATTTATTTCGTGGCAATTGCATTGTAGATTTTTATGAGACTGCACTGTCTTTTGCTGATCCTACTAATTGAACATTTTTACCTGGGAATTGCATGAGTCTAAATTAGTTGAGCTATGATCCTACTAATTGAACATGTCTACATGGGAATTGCACGAGTCGTTACTTTCATGTATTCGTACGTTACTTTCATGTATTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTGCTCGTACGTTACTTTCATGTACTCGTACGTTACTTTCATGTACTCGTACATTAACATGGTTCCTCGTTGATGGTGAGGAATCTCAATATAACAAGGTTAACTTAGCTAACTGTGTTACAAAATAACAGGCACCAGAGTAGCGACAACTAATATGTATACATTGTCTTAACAGAAAACAGAGTATTTACAACTAATTATAAAAAATATCCTAAAAACAAGTTTATTTTCACACAGTTTGATCACTTGGCATTCTTCAACAAAATAGGAGTGAGCGCTGTCTTCCACTTGCTTGCTACCAAAAAATATCTGAAAGCTGCTTTTCGAATCAGCTTAACGAGAAAGCATAGTTGAACAAAATGGGAGGCGCGCGTTCCATCTATTAATCTGATGCTACTAGCCGAGTGCCAAATATTCTGTACTGTCTACTTCCCCACGCATCTTGTTTATCAAGGTAATAAGATTTTAGGCAGCACTGCACCCATGGAGGATTTCTTGTACTCGCTTCATTTTTCTCACTTGCTGAAACCCCAAGTAACTTGTTTCGCTTCCTTGTTAAAACATCAATTGATGGATCTTCACCAAAGAATTTGTCCTTGAGTGGATTATGATAAATGTGAAAAACAATCAGAAGAGAGCCAATAAGTACAACTACAGCCAATATTATGAAATCATGAACACAATGTTCTAGGGAATTATTATCTTACACGAACTAAGGGGAAAAAATGCACAACTACGCTACTGCAAGTAATGAGTGTATTGCTATGTCAAAACATATTTTAAAT

mRNA sequence

GGTCGTATCTCTCAGAGCAGGATCTTTCTCGCAAATTCTACAGAGGCTTCTTCTTCTTCTTCCTTCTTCCTTCTTCCTCCTTCTTCGTGTTCTTGAAAAATTCAATCCCAATGCAGATTAGGATGCAACTGCAAGCTCGTAGGCTTCAGTTAATTTTTTTAATTATATGTCTATGTTCTTTTTTCATCAACGCTCGTCCGTTTCTCATTGTCATCTCTCAAGATGACCTCAAGGATGCTGCGTCGCCCGACGACTCCTCCGATTCAGCGAACTACGACTCGGCCGAGTGGGACGAGTTCGGGGAGCCTGAATCTCAAAACGCCGCCCAGGAGCTTGACCCCGGTACTTGGCGCCCGATTTTTGAACCGGATTCTCCCGCCTCTGACCCCGACGCGCCGGAGGATCTGTACTACACTGCGTTAGGGAAGATGATGACAGCGGTGAGCTCTGGTGACCTGAGGTTGATGGAGGATGCGGTAGGGGATATTGATCAAGTTGCGGTGGAGAGTGGGGATCCGCACGCGCAATCGGTGCTAGGGTTATTGTATGGAACGGGGATAATGAAGGAGACGAATAAGGCTAAGGCGTTTATGTATCATTATTTTGCCGCGGAAGGGAATAAGCAATCTAAGATGGCTCTTGCATATAGTTACTTCAGGCAAGAAATGTATGAAAAAGCAGTTAAGCTTTATGCTGAGTTAGCAGAGGTAGCCATTAATAGTTTATTAGTTTCCAAGGATTCTCCAGTAATTGAACCTGTGAGGATCCATAATGGAGCAGAGGAGAACAAGCAGGCCTTGAGAAAGTCTCGAGGAGAGGAGGACGAGGACTTCCAAATTTTGGAATATCAGGCGCAGAAGGGTAATGCTGGAGCCATGTATAGAATTGGGCTATTTTACTACTTTGGACTTAGAGGGTTGAGGCGTGATCATGCAAAAGCATTGATCTGGTTTTCCAAGGCCGTGGATAAAGGTGAACCAAAGTCTATGGAACTACTTGGAGAGATATATGCAAGGGGCGCCGGAGTCGAAAGGGATTATACTAAGGCACTTCAATGGCTAACTCGTGCATCCAAGCATCCATCATTTTCTGCTTATAATGGCATAGGATATTTATACGTTAAGGGTTATGGAGTGGAGAAAAACTTTACCAAGGCCAAGGAGTACTTTGAAAAGGCTGCCAATAACGACGAGTCTGGTGGTCATTATAATCTAGGAGTTATGTATCTTAAAGGAATTGGGGTAAAGAGAGATGTGAAGAAGGCATGTACTCATTTTATAGTGGCTGCAAATGCTGGACAACCAAAGGCATTCTACCAGCTGGCGAAGATGTTTCATACTGGTGTTGGGCTCAAGAGGAACATTCCTATGGCTAGTGCTTTATACAAATTAGTAGCTGAACGAGGGCCTTGGAGTTCATTGTCTAGATGGGCATTGGAATCATATCTGAAAAGTGATATTGGCAAGGCATTCTTCTTGTACGCAAGAATGGCTGAGCTAGGATATGAGGTGGCACAAAGCAATGCAGCATGGATACTTGACAAATATGGAGAACAAAGCATGTGTCTGGGAGAATCTGGCTTTTGCACAGATGCAGAACTACATCAGAGAGCTCATTCTCTTTGGTGGCAAGCATCCGAGCAGGGCAACGAACACGCTGCTTTACTGATTGGGGATGCATATTACTACGGTCGGGGAACCGACGTAGATTATGATCGTGCTGCTGAAGCATACATGCATGCTAAATCCCAACTAAATGCACAAGCCATGTTCAACCTTGGTTACATGCATGAACATGGTCTAGGCCTTCCGTTTGACCTTCACCTAGCCAAGCGTTACTATGACCAAGCTCTGGAACTTGATCCAGCTGCCAGGTTGCCAGTCAAGCTGGCCCTAGCAAGCCTGTGGTTAAGGAAGAATCATGCCGACAGTTTTCTGGTCCATGTGATCGATTCGTTGCCAGAAGTGTATCCAAAGATTGATATATGGGTGGAGGATGTACTGTTGGAGGAAGGAAATGCAACAATTCTAACTCTATTTGCCTGCCTCCTCACTGTCCTATATCTTAGAGAGCGGCAACGGAGGCATGCAGTTGTACGGGCTGCTGCTGCTGCTGCTGCTGCTGAGGCTGTGCCACTCCACCCCAATGATCACGTGGCTGCACAAAATTAAATCACTGCCTGTGTTGTCTTTTTCTTTTCAATGAGTTTGATCACTTGGCATTCTTCAACAAAATAGGAGTGAGCGCTGTCTTCCACTTGCTTGCTACCAAAAAATATCTGAAAGCTGCTTTTCGAATCAGCTTAACGAGAAAGCATAGTTGAACAAAATGGGAGGCGCGCGTTCCATCTATTAATCTGATGCTACTAGCCGAGTGCCAAATATTCTGTACTGTCTACTTCCCCACGCATCTTGTTTATCAAGGTAATAAGATTTTAGGCAGCACTGCACCCATGGAGGATTTCTTGTACTCGCTTCATTTTTCTCACTTGCTGAAACCCCAAGTAACTTGTTTCGCTTCCTTGTTAAAACATCAATTGATGGATCTTCACCAAAGAATTTGTCCTTGAGTGGATTATGATAAATGTGAAAAACAATCAGAAGAGAGCCAATAAGTACAACTACAGCCAATATTATGAAATCATGAACACAATGTTCTAGGGAATTATTATCTTACACGAACTAAGGGGAAAAAATGCACAACTACGCTACTGCAAGTAATGAGTGTATTGCTATGTCAAAACATATTTTAAAT

Coding sequence (CDS)

ATGCAGATTAGGATGCAACTGCAAGCTCGTAGGCTTCAGTTAATTTTTTTAATTATATGTCTATGTTCTTTTTTCATCAACGCTCGTCCGTTTCTCATTGTCATCTCTCAAGATGACCTCAAGGATGCTGCGTCGCCCGACGACTCCTCCGATTCAGCGAACTACGACTCGGCCGAGTGGGACGAGTTCGGGGAGCCTGAATCTCAAAACGCCGCCCAGGAGCTTGACCCCGGTACTTGGCGCCCGATTTTTGAACCGGATTCTCCCGCCTCTGACCCCGACGCGCCGGAGGATCTGTACTACACTGCGTTAGGGAAGATGATGACAGCGGTGAGCTCTGGTGACCTGAGGTTGATGGAGGATGCGGTAGGGGATATTGATCAAGTTGCGGTGGAGAGTGGGGATCCGCACGCGCAATCGGTGCTAGGGTTATTGTATGGAACGGGGATAATGAAGGAGACGAATAAGGCTAAGGCGTTTATGTATCATTATTTTGCCGCGGAAGGGAATAAGCAATCTAAGATGGCTCTTGCATATAGTTACTTCAGGCAAGAAATGTATGAAAAAGCAGTTAAGCTTTATGCTGAGTTAGCAGAGGTAGCCATTAATAGTTTATTAGTTTCCAAGGATTCTCCAGTAATTGAACCTGTGAGGATCCATAATGGAGCAGAGGAGAACAAGCAGGCCTTGAGAAAGTCTCGAGGAGAGGAGGACGAGGACTTCCAAATTTTGGAATATCAGGCGCAGAAGGGTAATGCTGGAGCCATGTATAGAATTGGGCTATTTTACTACTTTGGACTTAGAGGGTTGAGGCGTGATCATGCAAAAGCATTGATCTGGTTTTCCAAGGCCGTGGATAAAGGTGAACCAAAGTCTATGGAACTACTTGGAGAGATATATGCAAGGGGCGCCGGAGTCGAAAGGGATTATACTAAGGCACTTCAATGGCTAACTCGTGCATCCAAGCATCCATCATTTTCTGCTTATAATGGCATAGGATATTTATACGTTAAGGGTTATGGAGTGGAGAAAAACTTTACCAAGGCCAAGGAGTACTTTGAAAAGGCTGCCAATAACGACGAGTCTGGTGGTCATTATAATCTAGGAGTTATGTATCTTAAAGGAATTGGGGTAAAGAGAGATGTGAAGAAGGCATGTACTCATTTTATAGTGGCTGCAAATGCTGGACAACCAAAGGCATTCTACCAGCTGGCGAAGATGTTTCATACTGGTGTTGGGCTCAAGAGGAACATTCCTATGGCTAGTGCTTTATACAAATTAGTAGCTGAACGAGGGCCTTGGAGTTCATTGTCTAGATGGGCATTGGAATCATATCTGAAAAGTGATATTGGCAAGGCATTCTTCTTGTACGCAAGAATGGCTGAGCTAGGATATGAGGTGGCACAAAGCAATGCAGCATGGATACTTGACAAATATGGAGAACAAAGCATGTGTCTGGGAGAATCTGGCTTTTGCACAGATGCAGAACTACATCAGAGAGCTCATTCTCTTTGGTGGCAAGCATCCGAGCAGGGCAACGAACACGCTGCTTTACTGATTGGGGATGCATATTACTACGGTCGGGGAACCGACGTAGATTATGATCGTGCTGCTGAAGCATACATGCATGCTAAATCCCAACTAAATGCACAAGCCATGTTCAACCTTGGTTACATGCATGAACATGGTCTAGGCCTTCCGTTTGACCTTCACCTAGCCAAGCGTTACTATGACCAAGCTCTGGAACTTGATCCAGCTGCCAGGTTGCCAGTCAAGCTGGCCCTAGCAAGCCTGTGGTTAAGGAAGAATCATGCCGACAGTTTTCTGGTCCATGTGATCGATTCGTTGCCAGAAGTGTATCCAAAGATTGATATATGGGTGGAGGATGTACTGTTGGAGGAAGGAAATGCAACAATTCTAACTCTATTTGCCTGCCTCCTCACTGTCCTATATCTTAGAGAGCGGCAACGGAGGCATGCAGTTGTACGGGCTGCTGCTGCTGCTGCTGCTGCTGAGGCTGTGCCACTCCACCCCAATGATCACGTGGCTGCACAAAATTAA

Protein sequence

MQIRMQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFGEPESQNAAQELDPGTWRPIFEPDSPASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDAVGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEGNKQSKMALAYSYFRQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDESGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVRAAAAAAAAEAVPLHPNDHVAAQN
BLAST of Cp4.1LG20g01550 vs. Swiss-Prot
Match: HRD3A_ARATH (ERAD-associated E3 ubiquitin-protein ligase component HRD3A OS=Arabidopsis thaliana GN=HRD3A PE=1 SV=1)

HSP 1 Score: 934.5 bits (2414), Expect = 6.7e-271
Identity = 469/674 (69.58%), Postives = 563/674 (83.53%), Query Frame = 1

Query: 14  LIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFGEPESQNAAQ 73
           L  L+     F ++ARP ++V+S DDL          D+   +S+++DEFGE E ++  +
Sbjct: 11  LSLLVFSFIEFGVHARPVVLVLSNDDLNSGGD-----DNGVGESSDFDEFGESEPKSE-E 70

Query: 74  ELDPGTWRPIFEPDSPASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDAVGDIDQVAVES 133
           ELDPG+WR IFEPD       +P+  YY+ L K+++A S G+ RLME+AV +I+  A  +
Sbjct: 71  ELDPGSWRSIFEPDDSTVQAASPQ--YYSGLKKILSAASEGNFRLMEEAVDEIE-AASSA 130

Query: 134 GDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYFRQEMYEKAVK 193
           GDPHAQS++G +YG G+M+E +K+K+F++H FAA G N QSKMALA++Y RQ+M++KAV+
Sbjct: 131 GDPHAQSIMGFVYGIGMMREKSKSKSFLHHNFAAAGGNMQSKMALAFTYLRQDMHDKAVQ 190

Query: 194 LYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGN 253
           LYAELAE A+NS L+SKDSPV+EP RIH+G EENK ALRKSRGEEDEDFQILEYQAQKGN
Sbjct: 191 LYAELAETAVNSFLISKDSPVVEPTRIHSGTEENKGALRKSRGEEDEDFQILEYQAQKGN 250

Query: 254 AGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVERDYTK 313
           A AMY+IGLFYYFGLRGLRRDH KAL WF KAVDKGEP+SMELLGEIYARGAGVER+YTK
Sbjct: 251 ANAMYKIGLFYYFGLRGLRRDHTKALHWFLKAVDKGEPRSMELLGEIYARGAGVERNYTK 310

Query: 314 ALQWLTRASKHPSFSAYNGIGYLYVKGYGVEK-NFTKAKEYFEKAANNDESGGHYNLGVM 373
           AL+WLT A+K   +SA+NGIGYLYVKGYGV+K N+TKA+EYFEKA +N++  GHYNLGV+
Sbjct: 311 ALEWLTLAAKEGLYSAFNGIGYLYVKGYGVDKKNYTKAREYFEKAVDNEDPSGHYNLGVL 370

Query: 374 YLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAER 433
           YLKGIGV RDV++A  +F VAANAGQPKAFYQLAKMFHTGVGLK+N+ MA++ YKLVAER
Sbjct: 371 YLKGIGVNRDVRQATKYFFVAANAGQPKAFYQLAKMFHTGVGLKKNLEMATSFYKLVAER 430

Query: 434 GPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGF 493
           GPWSSLSRWALE+YLK D+GKA  LY+RMAE+GYEVAQSNAAWILDKYGE+SMC+G SGF
Sbjct: 431 GPWSSLSRWALEAYLKGDVGKALILYSRMAEMGYEVAQSNAAWILDKYGERSMCMGVSGF 490

Query: 494 CTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLNAQ 553
           CTD E H+RAHSLWW+ASEQGNEHAALLIGDAYYYGRGT+ D+ RAAEAYMHAKSQ NAQ
Sbjct: 491 CTDKERHERAHSLWWRASEQGNEHAALLIGDAYYYGRGTERDFVRAAEAYMHAKSQSNAQ 550

Query: 554 AMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFLVH 613
           AMFNLGYMHEHG GLPFDLHLAKRYYD++L+ D AARLPV LALASLWLR+N+AD+ LV 
Sbjct: 551 AMFNLGYMHEHGQGLPFDLHLAKRYYDESLQSDAAARLPVTLALASLWLRRNYADTVLVR 610

Query: 614 VIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVRA-AAAAAA 673
           V+DSLPEVYPK++ W+E+V+ EEGNATILTLF CL+T+LYLRERQRR  VV A   AA  
Sbjct: 611 VVDSLPEVYPKVETWIENVVFEEGNATILTLFVCLITILYLRERQRRQVVVVADPVAADV 670

Query: 674 AEAVPLHPNDHVAA 685
           A+ +      H+AA
Sbjct: 671 AQPLDADVAQHLAA 675

BLAST of Cp4.1LG20g01550 vs. Swiss-Prot
Match: HRD3_ORYSJ (ERAD-associated E3 ubiquitin-protein ligase component HRD3 OS=Oryza sativa subsp. japonica GN=HRD3 PE=2 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 6.7e-247
Identity = 439/655 (67.02%), Postives = 524/655 (80.00%), Query Frame = 1

Query: 29  RPFLIVISQDD-LKDAASPDDSSDSANYDSAEWDEFGEPESQNAAQELDPGTWRPIFEPD 88
           RPF++V+S+DD LKD A    S  SA+ DS EWD+F + ES      L P +W P+ +P 
Sbjct: 45  RPFVLVLSRDDFLKDTAGAHPSLPSADADSDEWDDFDD-ESPATDPLLSPSSWVPLLDPA 104

Query: 89  S--PASD-PDAPED-LYYTALGKMMTAVSSGDLRLMEDAVGDIDQVAVESGDPHAQSVLG 148
           S  P+ D PD+P D L+   +  M++A S+GD      A   I+  A   G P AQS L 
Sbjct: 105 SASPSGDEPDSPSDALFVAGVRAMLSAASAGDDAAFATAAAQIEAAAT-GGHPGAQSALA 164

Query: 149 LLYGTGIMKETNKAKAFMYHYFAAE-GNKQSKMALAYSYFRQEMYEKAVKLYAELAEVAI 208
            L G G+ +  ++++AF+ H FAA+ G+ QSKMALAYS+FRQEMYE+AV LYAELAE A+
Sbjct: 165 FLSGAGMTRPASRSRAFLLHKFAADAGDLQSKMALAYSFFRQEMYEEAVTLYAELAEAAL 224

Query: 209 NSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGNAGAMYRIGLF 268
            S L+SK+ PVIEPVR+H+G EENK+ALRKSRGE+DEDFQI EYQAQ+GN  AM+++GL 
Sbjct: 225 TSSLISKEPPVIEPVRLHSGTEENKEALRKSRGEDDEDFQITEYQAQRGNTVAMHKLGLL 284

Query: 269 YYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVERDYTKALQWLTRASK 328
           YY+GLRG+RRD+ KA  WFSKAV+KG+ ++MELLGEIYARGAGVER+YT+A +WLT A+K
Sbjct: 285 YYYGLRGVRRDYGKAYHWFSKAVEKGDTRAMELLGEIYARGAGVERNYTEAYKWLTLAAK 344

Query: 329 HPSFSAYNGIGYLYVKGYGVEK-NFTKAKEYFEKAANNDESGGHYNLGVMYLKGIGVKRD 388
              +SAYNG+GYLYVKGYGVEK N TKAKE+FE AA + E GG+YNLGV+YLKGIGVKRD
Sbjct: 345 QQQYSAYNGLGYLYVKGYGVEKKNLTKAKEFFEIAAEHKEHGGYYNLGVLYLKGIGVKRD 404

Query: 389 VKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAERGPWSSLSRWA 448
           V  AC  F+ A NAGQPKA YQ+AK+F  GVGLKRN+ MA+ +YK VAERGPWSSLSRWA
Sbjct: 405 VMTACNFFLRAVNAGQPKAIYQVAKLFQKGVGLKRNLQMAAVMYKSVAERGPWSSLSRWA 464

Query: 449 LESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGFCTDAELHQRA 508
           LESYLK DIGKA  LY+RMA+LGYEVAQSNAAWILD+YG++S+C+GESGFCTD E H RA
Sbjct: 465 LESYLKGDIGKALLLYSRMADLGYEVAQSNAAWILDRYGDESICMGESGFCTDMERHLRA 524

Query: 509 HSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLNAQAMFNLGYMHE 568
           H+LWWQASEQGNEHAALLIGDAYYYGRG   DY+RAAEAYMHA+SQ NAQAMFNLGYMHE
Sbjct: 525 HALWWQASEQGNEHAALLIGDAYYYGRGVGRDYERAAEAYMHAQSQSNAQAMFNLGYMHE 584

Query: 569 HGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFLVHVIDSLPEVYP 628
           HG GLP DLHLAKRYYDQA+E+DPAA+LPV LAL SLW+RKN+  SFLVH IDSLPEVYP
Sbjct: 585 HGHGLPLDLHLAKRYYDQAVEVDPAAKLPVMLALTSLWIRKNYDGSFLVHFIDSLPEVYP 644

Query: 629 KIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVRAAAAAAAAEAVPL 677
            ++ WVEDVL++EGNATI TLFACL+TVLYLRERQRR A   AAA     +  P+
Sbjct: 645 VVEEWVEDVLMDEGNATIFTLFACLVTVLYLRERQRRQA---AAANPQQPDGAPI 694

BLAST of Cp4.1LG20g01550 vs. Swiss-Prot
Match: HRD3B_ARATH (ERAD-associated E3 ubiquitin-protein ligase component HRD3B OS=Arabidopsis thaliana GN=HRD3B PE=3 SV=1)

HSP 1 Score: 625.2 bits (1611), Expect = 8.7e-178
Identity = 338/611 (55.32%), Postives = 433/611 (70.87%), Query Frame = 1

Query: 26  INARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFGEPESQNAAQELDPGTWRPIFE 85
           + ARPF++V+S +DL    +     D+  Y+S+++DEFGE E ++  +ELDPG+WR IFE
Sbjct: 23  VQARPFVLVLSNEDLNGGFN-----DNGAYESSDFDEFGESEPKSE-EELDPGSWRRIFE 82

Query: 86  PDSPASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDAVGDIDQVAVESGDPHAQSVLGLL 145
            +       A    YY+ L K+++A S G+  LME+AV +ID  A  SGDPHAQSV+G +
Sbjct: 83  TNESTVHASASPQ-YYSGLHKILSAASEGNTTLMEEAVSEIDSSA-SSGDPHAQSVMGFV 142

Query: 146 YGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYFRQEMYEKAVKLYAELAEVAINS 205
           YG G+M+ET+++K+ ++H+FAA G N QSKMALA+ Y RQ MY+KAV+LYAELAE A+NS
Sbjct: 143 YGIGMMRETSRSKSILHHHFAAAGGNMQSKMALAFRYLRQNMYDKAVELYAELAETAVNS 202

Query: 206 LLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGNAGAMYRIGLFYY 265
            L+SKDSP+ EPVRIH G EENK ALRKSRGEEDEDFQILEYQA+KGN+ AM++IGLFYY
Sbjct: 203 FLISKDSPMAEPVRIHIGTEENKDALRKSRGEEDEDFQILEYQAEKGNSVAMHKIGLFYY 262

Query: 266 FGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVE-RDYTKALQWLTRASKH 325
           FGLRGLRRDHAKAL WFSKA   G       LG +Y +G GV+ R+YTKA ++   A+ +
Sbjct: 263 FGLRGLRRDHAKALYWFSKAEFNG-------LGYLYVKGYGVDKRNYTKAREYFEMAANN 322

Query: 326 PSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDESGGHYNLGVMYLKGIGVKRDVK 385
              S +  +G LY+KG GV+K+   A +YF  AAN  +    Y L  M+  G+G+ ++++
Sbjct: 323 EDPSGHYNLGVLYLKGTGVKKDVRHATKYFFVAANAGQPKAFYQLAKMFHTGVGLTKNLE 382

Query: 386 KACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAERGPWSSLSRWALE 445
            A T + + A  G   +  + A   +    LK ++  A                      
Sbjct: 383 MATTFYKLVAERGPWSSLSRWALEAY----LKGDVGKAF--------------------- 442

Query: 446 SYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGFCTDAELHQRAHS 505
                       LY+RM+ELGYEVAQSNAAWI+DKYGE+SMC+G  GFCTD E H RAHS
Sbjct: 443 -----------ILYSRMSELGYEVAQSNAAWIVDKYGERSMCMGVYGFCTDKERHDRAHS 502

Query: 506 LWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLNAQAMFNLGYMHEHG 565
           LWW+ASEQGNEHAALLIGDAYYYGRGT+ D+ RAAEAYM+AKSQ NAQAMFNLGYMHEHG
Sbjct: 503 LWWRASEQGNEHAALLIGDAYYYGRGTERDFVRAAEAYMYAKSQSNAQAMFNLGYMHEHG 562

Query: 566 LGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFLVHVIDSLPEVYPKI 625
            GLPFDLHLAKRYYDQAL+ D AA+LPV LALAS+W+R+N+AD+ LV V++SLPEV+ K+
Sbjct: 563 EGLPFDLHLAKRYYDQALQSDTAAKLPVTLALASVWVRRNYADTALVQVLNSLPEVHQKV 582

Query: 626 DIWVEDVLLEE 635
             WVE+ +LEE
Sbjct: 623 VEWVENGMLEE 582

BLAST of Cp4.1LG20g01550 vs. Swiss-Prot
Match: SE1L1_HUMAN (Protein sel-1 homolog 1 OS=Homo sapiens GN=SEL1L PE=1 SV=3)

HSP 1 Score: 288.5 bits (737), Expect = 1.9e-76
Identity = 201/566 (35.51%), Postives = 314/566 (55.48%), Query Frame = 1

Query: 103 ALGKMMTAVSSGD-LRLMEDAVGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFM 162
           AL ++  A+  GD L     A  ++ +   E G P  Q+ LG LY +G+   +++AKA +
Sbjct: 221 ALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVNSSQAKALV 280

Query: 163 YHYFAA-EGNKQSKMALAYSYFRQ----EMYEKAVKLYAELAEVAINSLLVSKDSPVIEP 222
           Y+ F A  GN  + M L Y Y+      +  E A+  Y  +A    + + ++  S V++ 
Sbjct: 281 YYTFGALGGNLIAHMVLGYRYWAGIGVLQSCESALTHYRLVANHVASDISLTGGS-VVQR 340

Query: 223 VRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAK 282
           +R+ +  E     +     EED   Q  ++ A+KG+  A   +G  +  G RG+ ++H +
Sbjct: 341 IRLPDEVEN--PGMNSGMLEEDL-IQYYQFLAEKGDVQAQVGLGQLHLHGGRGVEQNHQR 400

Query: 283 ALIWFSKAVDKGEPKSMELLGEIYARGAG-VERDYTKALQWLTRASKHPSFSAYNGIGYL 342
           A  +F+ A + G   +M  LG++Y+ G+  V +    AL +  +A+   +    +G+G  
Sbjct: 401 AFDYFNLAANAGNSHAMAFLGKMYSEGSDIVPQSNETALHYFKKAADMGNPVGQSGLGMA 460

Query: 343 YVKGYGVEKNFTKAKEYFEKAANNDESGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANA 402
           Y+ G GV+ N+  A +YF+KAA      G   LG MY  GIGVKRD K+A  +F +A+  
Sbjct: 461 YLYGRGVQVNYDLALKYFQKAAEQGWVDGQLQLGSMYYNGIGVKRDYKQALKYFNLASQG 520

Query: 403 GQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAERGPWSSLSRWALESYLKSDIGKAFF 462
           G   AFY LA+M  +G G+ R+   A  L+K V ERG WS     A  SY   D   A  
Sbjct: 521 GHILAFYNLAQMHASGTGVMRSCHTAVELFKNVCERGRWSERLMTAYNSYKDGDYNAAVI 580

Query: 463 LYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGFCTDAELHQRAHSLWWQASEQGNEH 522
            Y  +AE GYEVAQSNAA+ILD+   ++  +GE+      E + RA   W +A+ QG   
Sbjct: 581 QYLLLAEQGYEVAQSNAAFILDQ--REASIVGEN------ETYPRALLHWNRAASQGYTV 640

Query: 523 AALLIGDAYYYGRGTDVDYDRAAEAY-MHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAK 582
           A + +GD ++YG GTDVDY+ A   Y + ++ Q +AQAMFNLGYMHE GLG+  D+HLAK
Sbjct: 641 ARIKLGDYHFYGFGTDVDYETAFIHYRLASEQQHSAQAMFNLGYMHEKGLGIKQDIHLAK 700

Query: 583 RYYDQALELDPAARLPVKLALASLWLRKNHADSFLVHVID-SLPEVYPKIDIWVEDVLLE 642
           R+YD A E  P A++PV LAL  L +       FL ++ + ++ +++ ++D  ++ +L  
Sbjct: 701 RFYDMAAEASPDAQVPVFLALCKLGV-----VYFLQYIRETNIRDMFTQLD--MDQLLGP 760

Query: 643 EGNATILTLFACLL-TVLYLRERQRR 659
           E +  ++T+ A LL TV+  R+RQ +
Sbjct: 761 EWDLYLMTIIALLLGTVIAYRQRQHQ 767

BLAST of Cp4.1LG20g01550 vs. Swiss-Prot
Match: SE1L1_RAT (Protein sel-1 homolog 1 OS=Rattus norvegicus GN=Sel1l PE=2 SV=2)

HSP 1 Score: 287.3 bits (734), Expect = 4.3e-76
Identity = 200/566 (35.34%), Postives = 313/566 (55.30%), Query Frame = 1

Query: 103 ALGKMMTAVSSGDLRLME-DAVGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFM 162
           AL ++  A+  GD       A  ++ +   E G P  Q+ LG LY +G+   +++AKA +
Sbjct: 221 ALERVSYALLFGDYLTQNIQAAKEMFEKLTEEGSPKGQTGLGFLYASGLGVNSSQAKALV 280

Query: 163 YHYFAA-EGNKQSKMALAYSYFRQ----EMYEKAVKLYAELAEVAINSLLVSKDSPVIEP 222
           Y+ F A  GN  + M L Y Y+      +  E A+  Y  +A    + + ++  S V++ 
Sbjct: 281 YYTFGALGGNLIAHMVLGYRYWAGIGVLQSCESALTHYRLVANHVASDISLTGGS-VVQR 340

Query: 223 VRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAK 282
           +R+ +  E     +     EED   Q  ++ A+KG+  A   +G  +  G RG+ ++H +
Sbjct: 341 IRLPDEVEN--PGMNSGMLEEDL-IQYYQFLAEKGDVQAQVGLGQLHLHGGRGVEQNHQR 400

Query: 283 ALIWFSKAVDKGEPKSMELLGEIYARGAG-VERDYTKALQWLTRASKHPSFSAYNGIGYL 342
           A  +F+ A + G   +M  LG++Y+ G+  V +    AL +  +A+   +    +G+G  
Sbjct: 401 AFDYFNLAANAGNSHAMAFLGKMYSEGSDIVPQSNETALHYFKKAADMGNPVGQSGLGMA 460

Query: 343 YVKGYGVEKNFTKAKEYFEKAANNDESGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANA 402
           Y+ G GV+ N+  A +YF+KAA      G   LG MY  GIGVKRD K+A  +F +A+  
Sbjct: 461 YLYGRGVQVNYDLALKYFQKAAEQGWVDGQLQLGSMYYNGIGVKRDYKQALKYFNLASQG 520

Query: 403 GQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAERGPWSSLSRWALESYLKSDIGKAFF 462
           G   AFY LA+M  +G G+ R+   A  L+K V ERG WS     A  SY   D   A  
Sbjct: 521 GHILAFYNLAQMHASGTGVMRSCHTAVELFKNVCERGRWSERLMTAYNSYKDDDYNAAVV 580

Query: 463 LYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGFCTDAELHQRAHSLWWQASEQGNEH 522
            Y  +AE GYEVAQSNAA+ILD+   ++  +GE+      E + RA   W +A+ QG   
Sbjct: 581 QYLLLAEQGYEVAQSNAAFILDQ--REATIVGEN------ETYPRALLHWNRAASQGYTV 640

Query: 523 AALLIGDAYYYGRGTDVDYDRAAEAY-MHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAK 582
           A + +GD ++YG GTDVDY+ A   Y + ++ Q +AQAMFNLGYMHE GLG+  D+HLAK
Sbjct: 641 ARIKLGDYHFYGFGTDVDYETAFIHYRLASEQQHSAQAMFNLGYMHEKGLGIKQDIHLAK 700

Query: 583 RYYDQALELDPAARLPVKLALASLWLRKNHADSFLVHVID-SLPEVYPKIDIWVEDVLLE 642
           R+YD A E  P A++PV LAL  L +       FL ++ + ++ +++ ++D  ++ +L  
Sbjct: 701 RFYDMAAEASPDAQVPVFLALCKLGV-----VYFLQYIREANIRDLFTQLD--MDQLLGP 760

Query: 643 EGNATILTLFACLL-TVLYLRERQRR 659
           E +  ++T+ A LL TV+  R+RQ +
Sbjct: 761 EWDLYLMTIIALLLGTVIAYRQRQHQ 767

BLAST of Cp4.1LG20g01550 vs. TrEMBL
Match: A0A0A0LQX2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042850 PE=4 SV=1)

HSP 1 Score: 1261.9 bits (3264), Expect = 0.0e+00
Identity = 639/684 (93.42%), Postives = 651/684 (95.18%), Query Frame = 1

Query: 5   MQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFG 64
           MQLQ RRLQLIFLI+CL S FINARPFLIVISQDDLKD A PDDSSDSAN DSA+WDEFG
Sbjct: 1   MQLQTRRLQLIFLILCLSSLFINARPFLIVISQDDLKDGAPPDDSSDSANSDSADWDEFG 60

Query: 65  EPESQNAAQELDPGTWRPIFEPDSPAS--DPDAPEDLYYTALGKMMTAVSSGDLRLMEDA 124
           EPESQN+A ELDPG+WRPIFEPDS AS  D DAP+DLYYTALGKMM+AVSSGDLRLMEDA
Sbjct: 61  EPESQNSALELDPGSWRPIFEPDSTASASDSDAPQDLYYTALGKMMSAVSSGDLRLMEDA 120

Query: 125 VGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEGNKQSKMALAYSYF 184
           V DIDQ   ESGDPHAQSVLGLLYG GIMKETNKAKAFMYH+FAAEGNKQSKMALAY YF
Sbjct: 121 VADIDQAVAESGDPHAQSVLGLLYGMGIMKETNKAKAFMYHHFAAEGNKQSKMALAYIYF 180

Query: 185 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 244
           RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ
Sbjct: 181 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 240

Query: 245 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYAR 304
           ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKAL WFSKAV+KGEPKSMELLGEIYAR
Sbjct: 241 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALSWFSKAVEKGEPKSMELLGEIYAR 300

Query: 305 GAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDES 364
           GAGVERDYTKALQWLTRASK PSF+AYNG+GYLYVKGYGVEKN+TKAKEYFEKAA NDES
Sbjct: 301 GAGVERDYTKALQWLTRASKQPSFTAYNGMGYLYVKGYGVEKNYTKAKEYFEKAAENDES 360

Query: 365 GGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMAS 424
           GGHYNLGVMYLKGIGVKRDVKKACTHFI+AANAGQPKAFYQLAKMFHTGVGLKRNIPMAS
Sbjct: 361 GGHYNLGVMYLKGIGVKRDVKKACTHFIMAANAGQPKAFYQLAKMFHTGVGLKRNIPMAS 420

Query: 425 ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ 484
           ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ
Sbjct: 421 ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ 480

Query: 485 SMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM 544
           SMCLGESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM
Sbjct: 481 SMCLGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM 540

Query: 545 HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRK 604
           HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLAL SLWLR 
Sbjct: 541 HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALVSLWLRM 600

Query: 605 NHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVV 664
           NHADSFLVHVIDSLPEVYPKID WVEDVLLEEGNATILTLFACLLTVLYLRERQRRHA V
Sbjct: 601 NHADSFLVHVIDSLPEVYPKIDAWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAAV 660

Query: 665 RAAAAAAAAEAVPLHPNDHVAAQN 687
           R      AAEAVPLHPNDHV  QN
Sbjct: 661 R------AAEAVPLHPNDHVPPQN 678

BLAST of Cp4.1LG20g01550 vs. TrEMBL
Match: B9SWV9_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0566120 PE=4 SV=1)

HSP 1 Score: 1025.4 bits (2650), Expect = 3.2e-296
Identity = 518/680 (76.18%), Postives = 588/680 (86.47%), Query Frame = 1

Query: 3   IRMQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDE 62
           +R +L   R     +I+ L    + ARPF++++SQDDLKDA +  D S SA     EWDE
Sbjct: 1   MRRRLGTYRFTFSLIIVSLLPLSLTARPFVLLLSQDDLKDAPATVDDSSSATDSPPEWDE 60

Query: 63  FGEPESQNAAQELDPGTWRPIFEPDSPASDP---DAPEDLYYTALGKMMTAVSSGDLRLM 122
           FG+ +S+    ELDPG+WRPIFEPDS +S     D+    YY+ + KM+ +VS G +RLM
Sbjct: 61  FGDSDSK-PEHELDPGSWRPIFEPDSSSSSSSVEDSEMAEYYSGVEKMLASVSDGKVRLM 120

Query: 123 EDAVGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAE-GNKQSKMALA 182
           E+A  +I+  AV SG+PHAQSVLG LYG G MKE +KAKAF+YH+FAAE GN QSKMALA
Sbjct: 121 EEAAAEIESAAV-SGNPHAQSVLGFLYGLGQMKERDKAKAFLYHHFAAESGNMQSKMALA 180

Query: 183 YSYFRQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEED 242
           ++Y RQ+M++KAVKLYAELAEVA+NS L+SKDSPVIEPVRIHNGAEENK+ALRKSRGEED
Sbjct: 181 FTYSRQDMHDKAVKLYAELAEVAVNSFLISKDSPVIEPVRIHNGAEENKEALRKSRGEED 240

Query: 243 EDFQILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGE 302
           EDFQILEYQAQKGNAGAMY+IGLFYYFGLRGLRRDHAKAL WFSKAV KGEP+SMELLGE
Sbjct: 241 EDFQILEYQAQKGNAGAMYKIGLFYYFGLRGLRRDHAKALSWFSKAVKKGEPRSMELLGE 300

Query: 303 IYARGAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAAN 362
           IYARGAGVER+YTKAL+WLT ASK   +SAYNG+GYLYVKGYGVEKN+TKAKEYFEKAA+
Sbjct: 301 IYARGAGVERNYTKALEWLTLASKQQLYSAYNGMGYLYVKGYGVEKNYTKAKEYFEKAAH 360

Query: 363 NDESGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNI 422
           N+E+GGHYNLGVMYLKGIGVKRDVK AC +FIVAANAGQPKAFYQLAKMFHTGVGLK+++
Sbjct: 361 NEEAGGHYNLGVMYLKGIGVKRDVKLACKYFIVAANAGQPKAFYQLAKMFHTGVGLKKDL 420

Query: 423 PMASALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDK 482
            MA+ALYKLVAERGPWS+LSRWALESYLK D+GKAF LYARMAE+GYE+AQSNAAWILDK
Sbjct: 421 VMATALYKLVAERGPWSTLSRWALESYLKGDVGKAFLLYARMAEMGYEIAQSNAAWILDK 480

Query: 483 YGEQSMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAA 542
           YGE+SMC+GESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGT+ DY+RAA
Sbjct: 481 YGERSMCMGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTERDYERAA 540

Query: 543 EAYMHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASL 602
           EAYMHAKSQ NAQAMFNLGYMHEHG GLPFDLHLAKRYYDQALE+DPAA+LPV LAL SL
Sbjct: 541 EAYMHAKSQSNAQAMFNLGYMHEHGQGLPFDLHLAKRYYDQALEIDPAAKLPVTLALTSL 600

Query: 603 WLRKNHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRR 662
           W+R+N+ADSFLV++IDSLP VYPK++ WVE+V+LEEGNATILTLF CLLTVLYLRERQRR
Sbjct: 601 WVRRNYADSFLVNLIDSLPGVYPKVEAWVENVILEEGNATILTLFVCLLTVLYLRERQRR 660

Query: 663 HAVVRAAAAAAAAEAVPLHP 679
           H      A      AVP  P
Sbjct: 661 H------AGGVGEAAVPQQP 672

BLAST of Cp4.1LG20g01550 vs. TrEMBL
Match: A0A067JT43_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22718 PE=4 SV=1)

HSP 1 Score: 1016.5 bits (2627), Expect = 1.5e-293
Identity = 519/680 (76.32%), Postives = 588/680 (86.47%), Query Frame = 1

Query: 11  RLQLIFLIICLCSFFINARPFLIVISQDDLKDA-ASPDDSSDSANYDSAEWDEFGEPESQ 70
           RL L FLI+ L   ++NARPF++V+SQDDLKDA  S D   DS      EWDEFG+ +S+
Sbjct: 8   RLTLSFLILSLFPIYLNARPFVLVLSQDDLKDAPTSADSDGDSTAESPPEWDEFGDSDSK 67

Query: 71  NAAQELDPGTWRPIFEPDSPASDPDAPEDL----YYTALGKMMTAVSSGDLRLMEDAVGD 130
               ELDPG+WRPIFEPDS  S PD  ED     YY+ + KM++AVS G++RLME+A  +
Sbjct: 68  -PEHELDPGSWRPIFEPDS--SSPDITEDPEMAEYYSGVQKMLSAVSGGEVRLMEEATAE 127

Query: 131 IDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAE-GNKQSKMALAYSYFRQ 190
           I+  AV +G+PHAQSVLG LYG G M+E +KAKAF+YH+FAAE G+ QSKMALAY+Y RQ
Sbjct: 128 IEAAAV-AGNPHAQSVLGFLYGLGQMREWSKAKAFLYHHFAAEEGSMQSKMALAYTYTRQ 187

Query: 191 EMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQIL 250
           +MY+KAVKLYAELAEVA+NS L+SKDSPVIEPVRIHNGAEENK+ALRKSRGEEDEDFQIL
Sbjct: 188 DMYDKAVKLYAELAEVAVNSFLISKDSPVIEPVRIHNGAEENKEALRKSRGEEDEDFQIL 247

Query: 251 EYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGA 310
           EYQAQKGNAGAMY+IGLFYYFGLRGLRRDHAKAL+WFSKAV KGEP+SMELLGEIYARGA
Sbjct: 248 EYQAQKGNAGAMYKIGLFYYFGLRGLRRDHAKALLWFSKAVKKGEPRSMELLGEIYARGA 307

Query: 311 GVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEK-NFTKAKEYFEKAANNDESG 370
           GVER+YTKAL+WLT ASK   +SAYNG+GYLYVKGYGVEK N+TKAKEYFEKAA+NDE+G
Sbjct: 308 GVERNYTKALEWLTLASKQQLYSAYNGMGYLYVKGYGVEKKNYTKAKEYFEKAADNDEAG 367

Query: 371 GHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASA 430
           GHYNLGVMYLKGIGVKRDV+ A  +F+VAANAGQPKAFYQLAKMFHTGVG K+++ MA+A
Sbjct: 368 GHYNLGVMYLKGIGVKRDVRLARKYFVVAANAGQPKAFYQLAKMFHTGVGFKKDLTMATA 427

Query: 431 LYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQS 490
           LYKLVAERGPWS+LSRWALESYLK D+GKA  LY+RMAELGYE+AQSNAAWILDKYGE+S
Sbjct: 428 LYKLVAERGPWSTLSRWALESYLKGDVGKASVLYSRMAELGYEIAQSNAAWILDKYGERS 487

Query: 491 MCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMH 550
           MC+GESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGT+ DY+RAAEAYMH
Sbjct: 488 MCMGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTERDYERAAEAYMH 547

Query: 551 AKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKN 610
           AKSQ NAQAMFNLGYMHEHG GLP+DLHLAKRYYDQALE+DPAA+LPV LAL SLW+R+N
Sbjct: 548 AKSQSNAQAMFNLGYMHEHGQGLPYDLHLAKRYYDQALEIDPAAKLPVTLALTSLWVRRN 607

Query: 611 HADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVR 670
           +ADSFLV +IDSLP  YPK++ WVE+V++EEGNATILTL  CLLTVLYLRERQRRH V  
Sbjct: 608 YADSFLVDLIDSLPGFYPKVEAWVENVIMEEGNATILTLVVCLLTVLYLRERQRRHVV-- 667

Query: 671 AAAAAAAAEAVPLHPNDHVA 684
                     VP  P DH A
Sbjct: 668 -DVGVGDDGDVPPQPIDHAA 680

BLAST of Cp4.1LG20g01550 vs. TrEMBL
Match: M5WDY4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002386mg PE=4 SV=1)

HSP 1 Score: 1012.7 bits (2617), Expect = 2.1e-292
Identity = 511/682 (74.93%), Postives = 585/682 (85.78%), Query Frame = 1

Query: 5   MQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFG 64
           M    R+L    LI+ L    + ARP+ +V+S+DDL +  +P+   DS   DS EWDEFG
Sbjct: 1   MNFATRKLIFALLIVFLYPLSLFARPYFLVLSKDDLLN--TPNSPDDSTQNDSPEWDEFG 60

Query: 65  EPESQNAAQELDPGTWRPIFEPDSPASD-PDAPEDLYYTALGKMMTAVSSGDLRLMEDAV 124
           +  S  + +ELDPG+WRPIFEPD    D  + P++ YY+ + K++ +VSSGD  LM+DAV
Sbjct: 61  DSGSPQSEEELDPGSWRPIFEPDPFRPDLANNPDERYYSTVTKLIKSVSSGDTTLMDDAV 120

Query: 125 GDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYF 184
            +I++ A   G PHA+SVLG LY TG M++ NKAKAF YHYFA+EG N QSKMALAY+Y 
Sbjct: 121 SEIEESA-SRGLPHARSVLGFLYATGQMRKQNKAKAFTYHYFASEGGNMQSKMALAYTYS 180

Query: 185 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 244
           RQ+M++KAVKLY+ELAE A+NS L+SKDSPVIEPVRIHNGAEENK+ALRKSRGEEDEDFQ
Sbjct: 181 RQDMFDKAVKLYSELAEAAVNSFLISKDSPVIEPVRIHNGAEENKEALRKSRGEEDEDFQ 240

Query: 245 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYAR 304
           ILEYQAQKGN+ AMY+IGLFYYFGLRGLRRDHAKAL WF KA++KGEP++MELLGEIYAR
Sbjct: 241 ILEYQAQKGNSAAMYKIGLFYYFGLRGLRRDHAKALSWFLKALEKGEPRAMELLGEIYAR 300

Query: 305 GAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEK-NFTKAKEYFEKAANNDE 364
           GAGVER+YTKAL+WLT A+K   +SAYNG+GYLYVKGYGVEK N TKAKEYFEKAA+N++
Sbjct: 301 GAGVERNYTKALEWLTLAAKQELYSAYNGMGYLYVKGYGVEKKNLTKAKEYFEKAADNED 360

Query: 365 SGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMA 424
           +GGHYNLGVMYLKGIGV RDVK AC +FIVAANAGQPKAFYQL KMFHTGVGLK+N+P A
Sbjct: 361 AGGHYNLGVMYLKGIGVTRDVKLACQYFIVAANAGQPKAFYQLGKMFHTGVGLKKNLPRA 420

Query: 425 SALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGE 484
           + LYKLVAERGPW+SLSRWALESYLK D+GKAFFLY+RMAELGYEVAQSNAAWILDKYGE
Sbjct: 421 TVLYKLVAERGPWNSLSRWALESYLKGDMGKAFFLYSRMAELGYEVAQSNAAWILDKYGE 480

Query: 485 QSMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAY 544
           +SMC+GESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYG+GT+ DYDRAAEAY
Sbjct: 481 RSMCIGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGQGTERDYDRAAEAY 540

Query: 545 MHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLR 604
            HA+SQ NAQAMFNLGYMHEHG GLP DLHLAKRYYDQALE+D AA+LPV LAL SLW+R
Sbjct: 541 KHARSQSNAQAMFNLGYMHEHGQGLPLDLHLAKRYYDQALEIDQAAKLPVTLALTSLWIR 600

Query: 605 KNHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAV 664
           KN+AD FLVHVIDSLPEVYPK++ WV++VLLEEGNATILTLF CLLTVLYLRERQRRHAV
Sbjct: 601 KNYADGFLVHVIDSLPEVYPKVEEWVDNVLLEEGNATILTLFVCLLTVLYLRERQRRHAV 660

Query: 665 VRAAAAAAAAEAVPLHPNDHVA 684
                AA    AVP HPN+HVA
Sbjct: 661 -----AAPGGMAVPHHPNEHVA 674

BLAST of Cp4.1LG20g01550 vs. TrEMBL
Match: A0A0B0MWE3_GOSAR (Protein sel-1 OS=Gossypium arboreum GN=F383_26804 PE=4 SV=1)

HSP 1 Score: 1003.0 bits (2592), Expect = 1.7e-289
Identity = 508/674 (75.37%), Postives = 585/674 (86.80%), Query Frame = 1

Query: 12  LQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFGEPESQNA 71
           L L  L+  L  F + ARPF++V+SQDDLKD  + DD+S   +  S + D+FG    +  
Sbjct: 10  LFLFLLLFSLFPFPLLARPFVLVLSQDDLKDVQN-DDASPLDSDSSWDDDDFGGTHVK-P 69

Query: 72  AQELDPGTWRPIFEPDSPASDP--DAPEDLYYTALGKMMTAVSSGDLRLMEDAVGDIDQV 131
             ELDPG+WR +FEP +    P  D   D YY A+ K+++A ++GD RLME+A  +I+  
Sbjct: 70  DDELDPGSWRRLFEPPTTLLSPSHDTSLDSYYAAVQKIISASNNGDARLMEEAAAEIETA 129

Query: 132 AVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYFRQEMYE 191
           A   GDPHA+SVLG LYG G+M+E NKAKAF+ HYFAAEG N QSKMALAY+Y RQ+M+E
Sbjct: 130 ANADGDPHARSVLGFLYGMGMMRERNKAKAFLNHYFAAEGGNAQSKMALAYTYSRQDMHE 189

Query: 192 KAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQA 251
           KAVKLYAELAE A+NS L+SKDSPVIEP+RIHNGAEENK+AL+KSRGE+DEDFQILEYQA
Sbjct: 190 KAVKLYAELAETAVNSFLISKDSPVIEPIRIHNGAEENKEALKKSRGEDDEDFQILEYQA 249

Query: 252 QKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVER 311
           QKGNAGAMY++GLFYYFGLRGLRRDH+KAL+WF KAVDKGEP+S+ELLGEIYARGAG+ER
Sbjct: 250 QKGNAGAMYKMGLFYYFGLRGLRRDHSKALMWFLKAVDKGEPRSLELLGEIYARGAGIER 309

Query: 312 DYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDESGGHYNL 371
           +YTKAL+WL+ AS+H  +SAYNG+GYL+VKGYGVEKN+TKAKEYF+KAA+N+++GGHYNL
Sbjct: 310 NYTKALEWLSLASEHGLYSAYNGMGYLHVKGYGVEKNYTKAKEYFDKAADNEDAGGHYNL 369

Query: 372 GVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLV 431
           GVMYLKGIGVKRDVK AC  FIVAANAGQPKAFYQLAKMFHTGVGLK+N+PMA+ALYKLV
Sbjct: 370 GVMYLKGIGVKRDVKIACKCFIVAANAGQPKAFYQLAKMFHTGVGLKKNLPMATALYKLV 429

Query: 432 AERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGE 491
           AERGPWSSLSRWALESYLK D+GKAF LY+RMAELGYE+AQSNAAWILDKYGE+SMC+GE
Sbjct: 430 AERGPWSSLSRWALESYLKGDMGKAFLLYSRMAELGYEIAQSNAAWILDKYGERSMCMGE 489

Query: 492 SGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQL 551
           SG CTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGT  DY+RAAEAY HAKSQ 
Sbjct: 490 SGACTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTVRDYERAAEAYKHAKSQS 549

Query: 552 NAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSF 611
           NAQAMFNLGYMHEHG GLPFDLHLAKRYYDQALELDPAA+LPV LALASLW+RKN+ADSF
Sbjct: 550 NAQAMFNLGYMHEHGQGLPFDLHLAKRYYDQALELDPAAKLPVTLALASLWVRKNYADSF 609

Query: 612 LVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVRAAAAA 671
           LVH+IDSLPEVYP+++ WVE+V++EEGNATILTLF CLLTVLYLRERQRRHA+    AAA
Sbjct: 610 LVHIIDSLPEVYPRVEEWVENVIMEEGNATILTLFVCLLTVLYLRERQRRHAI----AAA 669

Query: 672 AAAEAVPLHPNDHV 683
            A E     PN+HV
Sbjct: 670 GAHE-----PNEHV 672

BLAST of Cp4.1LG20g01550 vs. TAIR10
Match: AT1G18260.1 (AT1G18260.1 HCP-like superfamily protein)

HSP 1 Score: 934.5 bits (2414), Expect = 3.8e-272
Identity = 469/674 (69.58%), Postives = 563/674 (83.53%), Query Frame = 1

Query: 14  LIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFGEPESQNAAQ 73
           L  L+     F ++ARP ++V+S DDL          D+   +S+++DEFGE E ++  +
Sbjct: 11  LSLLVFSFIEFGVHARPVVLVLSNDDLNSGGD-----DNGVGESSDFDEFGESEPKSE-E 70

Query: 74  ELDPGTWRPIFEPDSPASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDAVGDIDQVAVES 133
           ELDPG+WR IFEPD       +P+  YY+ L K+++A S G+ RLME+AV +I+  A  +
Sbjct: 71  ELDPGSWRSIFEPDDSTVQAASPQ--YYSGLKKILSAASEGNFRLMEEAVDEIE-AASSA 130

Query: 134 GDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYFRQEMYEKAVK 193
           GDPHAQS++G +YG G+M+E +K+K+F++H FAA G N QSKMALA++Y RQ+M++KAV+
Sbjct: 131 GDPHAQSIMGFVYGIGMMREKSKSKSFLHHNFAAAGGNMQSKMALAFTYLRQDMHDKAVQ 190

Query: 194 LYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGN 253
           LYAELAE A+NS L+SKDSPV+EP RIH+G EENK ALRKSRGEEDEDFQILEYQAQKGN
Sbjct: 191 LYAELAETAVNSFLISKDSPVVEPTRIHSGTEENKGALRKSRGEEDEDFQILEYQAQKGN 250

Query: 254 AGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVERDYTK 313
           A AMY+IGLFYYFGLRGLRRDH KAL WF KAVDKGEP+SMELLGEIYARGAGVER+YTK
Sbjct: 251 ANAMYKIGLFYYFGLRGLRRDHTKALHWFLKAVDKGEPRSMELLGEIYARGAGVERNYTK 310

Query: 314 ALQWLTRASKHPSFSAYNGIGYLYVKGYGVEK-NFTKAKEYFEKAANNDESGGHYNLGVM 373
           AL+WLT A+K   +SA+NGIGYLYVKGYGV+K N+TKA+EYFEKA +N++  GHYNLGV+
Sbjct: 311 ALEWLTLAAKEGLYSAFNGIGYLYVKGYGVDKKNYTKAREYFEKAVDNEDPSGHYNLGVL 370

Query: 374 YLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAER 433
           YLKGIGV RDV++A  +F VAANAGQPKAFYQLAKMFHTGVGLK+N+ MA++ YKLVAER
Sbjct: 371 YLKGIGVNRDVRQATKYFFVAANAGQPKAFYQLAKMFHTGVGLKKNLEMATSFYKLVAER 430

Query: 434 GPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGF 493
           GPWSSLSRWALE+YLK D+GKA  LY+RMAE+GYEVAQSNAAWILDKYGE+SMC+G SGF
Sbjct: 431 GPWSSLSRWALEAYLKGDVGKALILYSRMAEMGYEVAQSNAAWILDKYGERSMCMGVSGF 490

Query: 494 CTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLNAQ 553
           CTD E H+RAHSLWW+ASEQGNEHAALLIGDAYYYGRGT+ D+ RAAEAYMHAKSQ NAQ
Sbjct: 491 CTDKERHERAHSLWWRASEQGNEHAALLIGDAYYYGRGTERDFVRAAEAYMHAKSQSNAQ 550

Query: 554 AMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFLVH 613
           AMFNLGYMHEHG GLPFDLHLAKRYYD++L+ D AARLPV LALASLWLR+N+AD+ LV 
Sbjct: 551 AMFNLGYMHEHGQGLPFDLHLAKRYYDESLQSDAAARLPVTLALASLWLRRNYADTVLVR 610

Query: 614 VIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVRA-AAAAAA 673
           V+DSLPEVYPK++ W+E+V+ EEGNATILTLF CL+T+LYLRERQRR  VV A   AA  
Sbjct: 611 VVDSLPEVYPKVETWIENVVFEEGNATILTLFVCLITILYLRERQRRQVVVVADPVAADV 670

Query: 674 AEAVPLHPNDHVAA 685
           A+ +      H+AA
Sbjct: 671 AQPLDADVAQHLAA 675

BLAST of Cp4.1LG20g01550 vs. TAIR10
Match: AT1G73570.1 (AT1G73570.1 HCP-like superfamily protein)

HSP 1 Score: 625.2 bits (1611), Expect = 4.9e-179
Identity = 338/611 (55.32%), Postives = 433/611 (70.87%), Query Frame = 1

Query: 26  INARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFGEPESQNAAQELDPGTWRPIFE 85
           + ARPF++V+S +DL    +     D+  Y+S+++DEFGE E ++  +ELDPG+WR IFE
Sbjct: 23  VQARPFVLVLSNEDLNGGFN-----DNGAYESSDFDEFGESEPKSE-EELDPGSWRRIFE 82

Query: 86  PDSPASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDAVGDIDQVAVESGDPHAQSVLGLL 145
            +       A    YY+ L K+++A S G+  LME+AV +ID  A  SGDPHAQSV+G +
Sbjct: 83  TNESTVHASASPQ-YYSGLHKILSAASEGNTTLMEEAVSEIDSSA-SSGDPHAQSVMGFV 142

Query: 146 YGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYFRQEMYEKAVKLYAELAEVAINS 205
           YG G+M+ET+++K+ ++H+FAA G N QSKMALA+ Y RQ MY+KAV+LYAELAE A+NS
Sbjct: 143 YGIGMMRETSRSKSILHHHFAAAGGNMQSKMALAFRYLRQNMYDKAVELYAELAETAVNS 202

Query: 206 LLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQKGNAGAMYRIGLFYY 265
            L+SKDSP+ EPVRIH G EENK ALRKSRGEEDEDFQILEYQA+KGN+ AM++IGLFYY
Sbjct: 203 FLISKDSPMAEPVRIHIGTEENKDALRKSRGEEDEDFQILEYQAEKGNSVAMHKIGLFYY 262

Query: 266 FGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVE-RDYTKALQWLTRASKH 325
           FGLRGLRRDHAKAL WFSKA   G       LG +Y +G GV+ R+YTKA ++   A+ +
Sbjct: 263 FGLRGLRRDHAKALYWFSKAEFNG-------LGYLYVKGYGVDKRNYTKAREYFEMAANN 322

Query: 326 PSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDESGGHYNLGVMYLKGIGVKRDVK 385
              S +  +G LY+KG GV+K+   A +YF  AAN  +    Y L  M+  G+G+ ++++
Sbjct: 323 EDPSGHYNLGVLYLKGTGVKKDVRHATKYFFVAANAGQPKAFYQLAKMFHTGVGLTKNLE 382

Query: 386 KACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVAERGPWSSLSRWALE 445
            A T + + A  G   +  + A   +    LK ++  A                      
Sbjct: 383 MATTFYKLVAERGPWSSLSRWALEAY----LKGDVGKAF--------------------- 442

Query: 446 SYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGESGFCTDAELHQRAHS 505
                       LY+RM+ELGYEVAQSNAAWI+DKYGE+SMC+G  GFCTD E H RAHS
Sbjct: 443 -----------ILYSRMSELGYEVAQSNAAWIVDKYGERSMCMGVYGFCTDKERHDRAHS 502

Query: 506 LWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLNAQAMFNLGYMHEHG 565
           LWW+ASEQGNEHAALLIGDAYYYGRGT+ D+ RAAEAYM+AKSQ NAQAMFNLGYMHEHG
Sbjct: 503 LWWRASEQGNEHAALLIGDAYYYGRGTERDFVRAAEAYMYAKSQSNAQAMFNLGYMHEHG 562

Query: 566 LGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFLVHVIDSLPEVYPKI 625
            GLPFDLHLAKRYYDQAL+ D AA+LPV LALAS+W+R+N+AD+ LV V++SLPEV+ K+
Sbjct: 563 EGLPFDLHLAKRYYDQALQSDTAAKLPVTLALASVWVRRNYADTALVQVLNSLPEVHQKV 582

Query: 626 DIWVEDVLLEE 635
             WVE+ +LEE
Sbjct: 623 VEWVENGMLEE 582

BLAST of Cp4.1LG20g01550 vs. NCBI nr
Match: gi|449439463|ref|XP_004137505.1| (PREDICTED: ERAD-associated E3 ubiquitin-protein ligase component HRD3A [Cucumis sativus])

HSP 1 Score: 1261.9 bits (3264), Expect = 0.0e+00
Identity = 639/684 (93.42%), Postives = 651/684 (95.18%), Query Frame = 1

Query: 5   MQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFG 64
           MQLQ RRLQLIFLI+CL S FINARPFLIVISQDDLKD A PDDSSDSAN DSA+WDEFG
Sbjct: 1   MQLQTRRLQLIFLILCLSSLFINARPFLIVISQDDLKDGAPPDDSSDSANSDSADWDEFG 60

Query: 65  EPESQNAAQELDPGTWRPIFEPDSPAS--DPDAPEDLYYTALGKMMTAVSSGDLRLMEDA 124
           EPESQN+A ELDPG+WRPIFEPDS AS  D DAP+DLYYTALGKMM+AVSSGDLRLMEDA
Sbjct: 61  EPESQNSALELDPGSWRPIFEPDSTASASDSDAPQDLYYTALGKMMSAVSSGDLRLMEDA 120

Query: 125 VGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEGNKQSKMALAYSYF 184
           V DIDQ   ESGDPHAQSVLGLLYG GIMKETNKAKAFMYH+FAAEGNKQSKMALAY YF
Sbjct: 121 VADIDQAVAESGDPHAQSVLGLLYGMGIMKETNKAKAFMYHHFAAEGNKQSKMALAYIYF 180

Query: 185 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 244
           RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ
Sbjct: 181 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 240

Query: 245 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYAR 304
           ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKAL WFSKAV+KGEPKSMELLGEIYAR
Sbjct: 241 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALSWFSKAVEKGEPKSMELLGEIYAR 300

Query: 305 GAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDES 364
           GAGVERDYTKALQWLTRASK PSF+AYNG+GYLYVKGYGVEKN+TKAKEYFEKAA NDES
Sbjct: 301 GAGVERDYTKALQWLTRASKQPSFTAYNGMGYLYVKGYGVEKNYTKAKEYFEKAAENDES 360

Query: 365 GGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMAS 424
           GGHYNLGVMYLKGIGVKRDVKKACTHFI+AANAGQPKAFYQLAKMFHTGVGLKRNIPMAS
Sbjct: 361 GGHYNLGVMYLKGIGVKRDVKKACTHFIMAANAGQPKAFYQLAKMFHTGVGLKRNIPMAS 420

Query: 425 ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ 484
           ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ
Sbjct: 421 ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ 480

Query: 485 SMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM 544
           SMCLGESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM
Sbjct: 481 SMCLGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM 540

Query: 545 HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRK 604
           HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLAL SLWLR 
Sbjct: 541 HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALVSLWLRM 600

Query: 605 NHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVV 664
           NHADSFLVHVIDSLPEVYPKID WVEDVLLEEGNATILTLFACLLTVLYLRERQRRHA V
Sbjct: 601 NHADSFLVHVIDSLPEVYPKIDAWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAAV 660

Query: 665 RAAAAAAAAEAVPLHPNDHVAAQN 687
           R      AAEAVPLHPNDHV  QN
Sbjct: 661 R------AAEAVPLHPNDHVPPQN 678

BLAST of Cp4.1LG20g01550 vs. NCBI nr
Match: gi|659066862|ref|XP_008464831.1| (PREDICTED: protein sel-1 homolog 1 [Cucumis melo])

HSP 1 Score: 1256.9 bits (3251), Expect = 0.0e+00
Identity = 636/684 (92.98%), Postives = 653/684 (95.47%), Query Frame = 1

Query: 5   MQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEFG 64
           MQL+ RR+QLIFLI+CL S FINARPFLIVISQDDLKD A PDDSSDSAN DSA+WDEFG
Sbjct: 1   MQLETRRVQLIFLILCLSSLFINARPFLIVISQDDLKDGAPPDDSSDSANSDSADWDEFG 60

Query: 65  EPESQNAAQELDPGTWRPIFEPDS--PASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDA 124
           EPESQN+A ELDPG+WRPIFEPDS   ASD DAP+DLYYTALGKMM+AVSSGDLRLMEDA
Sbjct: 61  EPESQNSALELDPGSWRPIFEPDSLASASDSDAPQDLYYTALGKMMSAVSSGDLRLMEDA 120

Query: 125 VGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEGNKQSKMALAYSYF 184
           V DIDQ   ++GDPHAQSVLGLLYG GIMKETNKAKAFMYH+FAAEGNKQSKMALAY YF
Sbjct: 121 VADIDQAVADNGDPHAQSVLGLLYGMGIMKETNKAKAFMYHHFAAEGNKQSKMALAYIYF 180

Query: 185 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 244
           RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ
Sbjct: 181 RQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQ 240

Query: 245 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYAR 304
           ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKAL WFSKAV+KGEPKSMELLGEIYAR
Sbjct: 241 ILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALSWFSKAVEKGEPKSMELLGEIYAR 300

Query: 305 GAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAANNDES 364
           GAGVERDYTKALQWLTRASK PSF+AYNG+GYLYVKGYGVEKN+TKAKEYFEKAA NDES
Sbjct: 301 GAGVERDYTKALQWLTRASKQPSFTAYNGMGYLYVKGYGVEKNYTKAKEYFEKAAENDES 360

Query: 365 GGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMAS 424
           GGHYNLGVMYLKGIGVKRDVKKACTHFI+AANAGQPKAFYQLAKMFHTGVGLKRNIPMAS
Sbjct: 361 GGHYNLGVMYLKGIGVKRDVKKACTHFIMAANAGQPKAFYQLAKMFHTGVGLKRNIPMAS 420

Query: 425 ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ 484
           ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ
Sbjct: 421 ALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQ 480

Query: 485 SMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM 544
           SMCLGESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM
Sbjct: 481 SMCLGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYM 540

Query: 545 HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRK 604
           HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLAL SLWLRK
Sbjct: 541 HAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALVSLWLRK 600

Query: 605 NHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVV 664
           NHADSFLVHVIDSLPEVYPKID WVEDVLLEEGNATILTLFACLLTVLYLRERQRR A V
Sbjct: 601 NHADSFLVHVIDSLPEVYPKIDAWVEDVLLEEGNATILTLFACLLTVLYLRERQRRQAAV 660

Query: 665 RAAAAAAAAEAVPLHPNDHVAAQN 687
           RAAAAAA A   PLHPNDH+  QN
Sbjct: 661 RAAAAAAEA-VPPLHPNDHLPPQN 683

BLAST of Cp4.1LG20g01550 vs. NCBI nr
Match: gi|1009131209|ref|XP_015882716.1| (PREDICTED: ERAD-associated E3 ubiquitin-protein ligase component HRD3A-like [Ziziphus jujuba])

HSP 1 Score: 1029.6 bits (2661), Expect = 2.4e-297
Identity = 523/674 (77.60%), Postives = 591/674 (87.69%), Query Frame = 1

Query: 17  LIICLCSFFINARPFLIVISQDDLKDAA-SPDDSS--DSANYDSAEWDEFGEPESQNAAQ 76
           LI  L +  + ARPF++ +SQ+DLKD A SPDDSS  D +++DSAEWDEFG+ ++  + +
Sbjct: 14  LIFSLFTVSLYARPFVLFLSQEDLKDTANSPDDSSSSDPSHHDSAEWDEFGDSDAHKSDE 73

Query: 77  ELDPGTWRPIFEPDSPA-SDPDAPEDL-YYTALGKMMTAVSSGDLRLMEDAVGDIDQVAV 136
           ELDPG+WRPIFEPDS + +DP +  D  YY+ + KM+ AVSSG++ LM++A  +I+  AV
Sbjct: 74  ELDPGSWRPIFEPDSSSGADPGSVADAQYYSGVSKMVRAVSSGEVNLMDEAAKEIE-AAV 133

Query: 137 ESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAEG-NKQSKMALAYSYFRQEMYEKA 196
             G PHA+SVLG L G G M+E NKAKAFMYHYFAA+G N QSKMALAY+YF+QEM+EKA
Sbjct: 134 AVGHPHARSVLGFLNGAGQMRERNKAKAFMYHYFAADGGNMQSKMALAYTYFKQEMFEKA 193

Query: 197 VKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDFQILEYQAQK 256
           VKLY+ELAEVA+NS L+SKDSPVIEPVRIHNGAEENK+ALRKSRGEEDEDFQILEYQAQK
Sbjct: 194 VKLYSELAEVAVNSFLISKDSPVIEPVRIHNGAEENKEALRKSRGEEDEDFQILEYQAQK 253

Query: 257 GNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYARGAGVERDY 316
           GNAGAMY+IGLFYYFGLRGLRRDH KAL WF KAV+KGEP+SMELLGEIYARGAGVER+Y
Sbjct: 254 GNAGAMYKIGLFYYFGLRGLRRDHGKALSWFLKAVEKGEPRSMELLGEIYARGAGVERNY 313

Query: 317 TKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEK-NFTKAKEYFEKAANNDESGGHYNLG 376
           TKAL+WLT ASK   +SAYNG+GYLYVKGYGVEK N+TKAKEYFEKAA N+E+GGHYNLG
Sbjct: 314 TKALEWLTLASKQHLYSAYNGMGYLYVKGYGVEKKNYTKAKEYFEKAAENEEAGGHYNLG 373

Query: 377 VMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPMASALYKLVA 436
           VMY KGIGVKRDVK AC  FI+AANAGQPKAFYQLAKMFHTGVGLK+N+  A+ALYKLVA
Sbjct: 374 VMYFKGIGVKRDVKLACKCFIIAANAGQPKAFYQLAKMFHTGVGLKKNLATATALYKLVA 433

Query: 437 ERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYGEQSMCLGES 496
           ERGPWSSLSRWALESYLK DIGKAF LY+RMAELGYEVAQSNAAWILDKYGE+SMC+GES
Sbjct: 434 ERGPWSSLSRWALESYLKGDIGKAFLLYSRMAELGYEVAQSNAAWILDKYGERSMCMGES 493

Query: 497 GFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEAYMHAKSQLN 556
           GFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGT+ DYDRAA+AYMHA+ Q N
Sbjct: 494 GFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTERDYDRAADAYMHARYQSN 553

Query: 557 AQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWLRKNHADSFL 616
           AQAMFNLGYM+EHGLGLP DLHLAKRYYDQALE DPAA+LPV LALASLW+RKN+   FL
Sbjct: 554 AQAMFNLGYMYEHGLGLPLDLHLAKRYYDQALENDPAAKLPVTLALASLWIRKNYEGGFL 613

Query: 617 VHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHAVVRAAAAAA 676
           VHVIDSLPEVYPK++ WVE+VLLEEGNATILTLF CLLTVLYLRERQRR+    AAA   
Sbjct: 614 VHVIDSLPEVYPKVEAWVENVLLEEGNATILTLFVCLLTVLYLRERQRRNV---AAAGGH 673

Query: 677 AAEAVPLHPNDHVA 684
                P+HPN+H+A
Sbjct: 674 REMPAPIHPNEHIA 683

BLAST of Cp4.1LG20g01550 vs. NCBI nr
Match: gi|255579265|ref|XP_002530478.1| (PREDICTED: ERAD-associated E3 ubiquitin-protein ligase component HRD3A [Ricinus communis])

HSP 1 Score: 1025.4 bits (2650), Expect = 4.6e-296
Identity = 518/680 (76.18%), Postives = 588/680 (86.47%), Query Frame = 1

Query: 3   IRMQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDE 62
           +R +L   R     +I+ L    + ARPF++++SQDDLKDA +  D S SA     EWDE
Sbjct: 1   MRRRLGTYRFTFSLIIVSLLPLSLTARPFVLLLSQDDLKDAPATVDDSSSATDSPPEWDE 60

Query: 63  FGEPESQNAAQELDPGTWRPIFEPDSPASDP---DAPEDLYYTALGKMMTAVSSGDLRLM 122
           FG+ +S+    ELDPG+WRPIFEPDS +S     D+    YY+ + KM+ +VS G +RLM
Sbjct: 61  FGDSDSK-PEHELDPGSWRPIFEPDSSSSSSSVEDSEMAEYYSGVEKMLASVSDGKVRLM 120

Query: 123 EDAVGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAE-GNKQSKMALA 182
           E+A  +I+  AV SG+PHAQSVLG LYG G MKE +KAKAF+YH+FAAE GN QSKMALA
Sbjct: 121 EEAAAEIESAAV-SGNPHAQSVLGFLYGLGQMKERDKAKAFLYHHFAAESGNMQSKMALA 180

Query: 183 YSYFRQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEED 242
           ++Y RQ+M++KAVKLYAELAEVA+NS L+SKDSPVIEPVRIHNGAEENK+ALRKSRGEED
Sbjct: 181 FTYSRQDMHDKAVKLYAELAEVAVNSFLISKDSPVIEPVRIHNGAEENKEALRKSRGEED 240

Query: 243 EDFQILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGE 302
           EDFQILEYQAQKGNAGAMY+IGLFYYFGLRGLRRDHAKAL WFSKAV KGEP+SMELLGE
Sbjct: 241 EDFQILEYQAQKGNAGAMYKIGLFYYFGLRGLRRDHAKALSWFSKAVKKGEPRSMELLGE 300

Query: 303 IYARGAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVEKNFTKAKEYFEKAAN 362
           IYARGAGVER+YTKAL+WLT ASK   +SAYNG+GYLYVKGYGVEKN+TKAKEYFEKAA+
Sbjct: 301 IYARGAGVERNYTKALEWLTLASKQQLYSAYNGMGYLYVKGYGVEKNYTKAKEYFEKAAH 360

Query: 363 NDESGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNI 422
           N+E+GGHYNLGVMYLKGIGVKRDVK AC +FIVAANAGQPKAFYQLAKMFHTGVGLK+++
Sbjct: 361 NEEAGGHYNLGVMYLKGIGVKRDVKLACKYFIVAANAGQPKAFYQLAKMFHTGVGLKKDL 420

Query: 423 PMASALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDK 482
            MA+ALYKLVAERGPWS+LSRWALESYLK D+GKAF LYARMAE+GYE+AQSNAAWILDK
Sbjct: 421 VMATALYKLVAERGPWSTLSRWALESYLKGDVGKAFLLYARMAEMGYEIAQSNAAWILDK 480

Query: 483 YGEQSMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAA 542
           YGE+SMC+GESGFCTDAE HQRAHSLWWQASEQGNEHAALLIGDAYYYGRGT+ DY+RAA
Sbjct: 481 YGERSMCMGESGFCTDAERHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTERDYERAA 540

Query: 543 EAYMHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASL 602
           EAYMHAKSQ NAQAMFNLGYMHEHG GLPFDLHLAKRYYDQALE+DPAA+LPV LAL SL
Sbjct: 541 EAYMHAKSQSNAQAMFNLGYMHEHGQGLPFDLHLAKRYYDQALEIDPAAKLPVTLALTSL 600

Query: 603 WLRKNHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRR 662
           W+R+N+ADSFLV++IDSLP VYPK++ WVE+V+LEEGNATILTLF CLLTVLYLRERQRR
Sbjct: 601 WVRRNYADSFLVNLIDSLPGVYPKVEAWVENVILEEGNATILTLFVCLLTVLYLRERQRR 660

Query: 663 HAVVRAAAAAAAAEAVPLHP 679
           H      A      AVP  P
Sbjct: 661 H------AGGVGEAAVPQQP 672

BLAST of Cp4.1LG20g01550 vs. NCBI nr
Match: gi|1021024021|gb|KZM81809.1| (hypothetical protein DCAR_029422 [Daucus carota subsp. sativus])

HSP 1 Score: 1018.1 bits (2631), Expect = 7.3e-294
Identity = 516/684 (75.44%), Postives = 593/684 (86.70%), Query Frame = 1

Query: 4   RMQLQARRLQLIFLIICLCSFFINARPFLIVISQDDLKDAASPDDSSDSANYDSAEWDEF 63
           R++   + L ++ L++C+    I +RPF++V+SQDDLKD +  D  S      ++EWDEF
Sbjct: 4   RVKTHRKILTILTLVLCIYPVSIYSRPFVLVLSQDDLKDPSPSDPLSPEPADSNSEWDEF 63

Query: 64  GEPESQNAAQELDPGTWRPIFEPDS-PASDPDAPEDLYYTALGKMMTAVSSGDLRLMEDA 123
           G+ +S++   ELDPGTWRPIFEP+S P  DP+  ED YY+ + +MM AVS GD+R+ME+ 
Sbjct: 64  GDSDSKSD-DELDPGTWRPIFEPESDPTRDPNW-EDGYYSGVRRMMGAVSRGDVRMMEEG 123

Query: 124 VGDIDQVAVESGDPHAQSVLGLLYGTGIMKETNKAKAFMYHYFAAE-GNKQSKMALAYSY 183
           VG+I++ A   G  H QS++G LY  G+++E +KAK FMYHYFAAE GN QSKMALAY+Y
Sbjct: 124 VGEIEEAA-RGGHAHGQSMMGYLYNMGVLRERSKAKGFMYHYFAAEAGNMQSKMALAYTY 183

Query: 184 FRQEMYEKAVKLYAELAEVAINSLLVSKDSPVIEPVRIHNGAEENKQALRKSRGEEDEDF 243
            RQ+M++KAVKLYAELAEVA+NS L+SKDSPVIEPVRIHNGAEENK+ALRKSRGEEDEDF
Sbjct: 184 TRQDMHDKAVKLYAELAEVAVNSFLISKDSPVIEPVRIHNGAEENKEALRKSRGEEDEDF 243

Query: 244 QILEYQAQKGNAGAMYRIGLFYYFGLRGLRRDHAKALIWFSKAVDKGEPKSMELLGEIYA 303
           QILEYQAQKGNAGAMY+IG+FYYFGLRGLRRDH+KAL WF KAV+K EP+SMELLGEIYA
Sbjct: 244 QILEYQAQKGNAGAMYKIGIFYYFGLRGLRRDHSKALYWFLKAVEKEEPRSMELLGEIYA 303

Query: 304 RGAGVERDYTKALQWLTRASKHPSFSAYNGIGYLYVKGYGVE-KNFTKAKEYFEKAANND 363
           RGAGVER+YTKAL+WLT ASK   +SAYNG+GYLYVKGYGVE KNFTKAKEYFEKAA+ND
Sbjct: 304 RGAGVERNYTKALEWLTLASKQQLYSAYNGMGYLYVKGYGVEKKNFTKAKEYFEKAADND 363

Query: 364 ESGGHYNLGVMYLKGIGVKRDVKKACTHFIVAANAGQPKAFYQLAKMFHTGVGLKRNIPM 423
           E+GGHYNLGV+YLKGIGVKRDVK AC +FI+AANAGQPKAFYQLAKMFHTGVGLK+N+PM
Sbjct: 364 EAGGHYNLGVLYLKGIGVKRDVKVACKYFIIAANAGQPKAFYQLAKMFHTGVGLKKNLPM 423

Query: 424 ASALYKLVAERGPWSSLSRWALESYLKSDIGKAFFLYARMAELGYEVAQSNAAWILDKYG 483
           A+ALYKLVAERGPWSSLSRW+LE+YLK D+GKAF LY+RMAELGYEVAQSNAAWILDKYG
Sbjct: 424 ATALYKLVAERGPWSSLSRWSLEAYLKGDVGKAFLLYSRMAELGYEVAQSNAAWILDKYG 483

Query: 484 EQSMCLGESGFCTDAELHQRAHSLWWQASEQGNEHAALLIGDAYYYGRGTDVDYDRAAEA 543
           E+SMC+GESGFCTDAE HQ AHSLWWQASEQGNEHAALLIGDAYYYGRGT+ DYDRAAEA
Sbjct: 484 ERSMCMGESGFCTDAERHQIAHSLWWQASEQGNEHAALLIGDAYYYGRGTERDYDRAAEA 543

Query: 544 YMHAKSQLNAQAMFNLGYMHEHGLGLPFDLHLAKRYYDQALELDPAARLPVKLALASLWL 603
           YMHAKSQ NAQAMFNLGYMHEHG GLPFDLHLAKRYYDQA+E+DP+A+LPV LAL SLW+
Sbjct: 544 YMHAKSQSNAQAMFNLGYMHEHGQGLPFDLHLAKRYYDQAVEIDPSAKLPVTLALGSLWV 603

Query: 604 RKNHADSFLVHVIDSLPEVYPKIDIWVEDVLLEEGNATILTLFACLLTVLYLRERQRRHA 663
           RKN ADSFLV  IDSLP V+PK+ +WVE+V+LEEGNATILTLF CLLTVLYLRERQRR A
Sbjct: 604 RKNCADSFLVDFIDSLPYVFPKVQLWVENVVLEEGNATILTLFVCLLTVLYLRERQRRQA 663

Query: 664 VVRAAAAAAAAEAVPLH-PNDHVA 684
           VV       A EA  LH PND VA
Sbjct: 664 VV------VAGEAANLHQPNDPVA 678

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HRD3A_ARATH6.7e-27169.58ERAD-associated E3 ubiquitin-protein ligase component HRD3A OS=Arabidopsis thali... [more]
HRD3_ORYSJ6.7e-24767.02ERAD-associated E3 ubiquitin-protein ligase component HRD3 OS=Oryza sativa subsp... [more]
HRD3B_ARATH8.7e-17855.32ERAD-associated E3 ubiquitin-protein ligase component HRD3B OS=Arabidopsis thali... [more]
SE1L1_HUMAN1.9e-7635.51Protein sel-1 homolog 1 OS=Homo sapiens GN=SEL1L PE=1 SV=3[more]
SE1L1_RAT4.3e-7635.34Protein sel-1 homolog 1 OS=Rattus norvegicus GN=Sel1l PE=2 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0LQX2_CUCSA0.0e+0093.42Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042850 PE=4 SV=1[more]
B9SWV9_RICCO3.2e-29676.18Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0566120 PE=4 SV=1[more]
A0A067JT43_JATCU1.5e-29376.32Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22718 PE=4 SV=1[more]
M5WDY4_PRUPE2.1e-29274.93Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002386mg PE=4 SV=1[more]
A0A0B0MWE3_GOSAR1.7e-28975.37Protein sel-1 OS=Gossypium arboreum GN=F383_26804 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G18260.13.8e-27269.58 HCP-like superfamily protein[more]
AT1G73570.14.9e-17955.32 HCP-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449439463|ref|XP_004137505.1|0.0e+0093.42PREDICTED: ERAD-associated E3 ubiquitin-protein ligase component HRD3A [Cucumis ... [more]
gi|659066862|ref|XP_008464831.1|0.0e+0092.98PREDICTED: protein sel-1 homolog 1 [Cucumis melo][more]
gi|1009131209|ref|XP_015882716.1|2.4e-29777.60PREDICTED: ERAD-associated E3 ubiquitin-protein ligase component HRD3A-like [Ziz... [more]
gi|255579265|ref|XP_002530478.1|4.6e-29676.18PREDICTED: ERAD-associated E3 ubiquitin-protein ligase component HRD3A [Ricinus ... [more]
gi|1021024021|gb|KZM81809.1|7.3e-29475.44hypothetical protein DCAR_029422 [Daucus carota subsp. sativus][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR006597Sel1-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030433 ER-associated ubiquitin-dependent protein catabolic process
biological_process GO:0006888 ER to Golgi vesicle-mediated transport
biological_process GO:0042538 hyperosmotic salinity response
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0008150 biological_process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016020 membrane
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0016491 oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g01550.1Cp4.1LG20g01550.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006597Sel1-like repeatPFAMPF08238Sel1coord: 291..324
score: 2.2E-6coord: 136..169
score: 0.25coord: 447..465
score: 30.0coord: 327..360
score: 4.0E-6coord: 550..584
score: 1.0E-5coord: 402..432
score: 31.0coord: 365..396
score: 2.9E-4coord: 254..288
score: 7.8E-8coord: 515..538
score:
IPR006597Sel1-like repeatSMARTSM00671sel1coord: 253..289
score: 5.2E-8coord: 136..171
score: 1.1coord: 550..585
score: 8.9E-8coord: 326..361
score: 2.0E-5coord: 290..325
score: 5.5E-6coord: 514..549
score: 9.8coord: 362..397
score: 6.5E-7coord: 398..433
score: 0.
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 129..199
score: 3.6E-9coord: 445..584
score: 5.6E-24coord: 236..323
score: 4.1E-22coord: 328..436
score: 1.2
NoneNo IPR availablePANTHERPTHR11102SEL-1-LIKE PROTEINcoord: 107..606
score: 5.2E
NoneNo IPR availablePANTHERPTHR11102:SF84HCP-like superfamily protein-relatedcoord: 107..606
score: 5.2E
NoneNo IPR availableunknownSSF81901HCP-likecoord: 339..473
score: 2.22E-27coord: 234..369
score: 1.12E-30coord: 129..199
score: 9.55E-10coord: 445..586
score: 1.7