Cp4.1LG08g03800 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g03800
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: ARM repeat superfamily protein
Location: Cp4.1LG08 : 1911267 .. 1916205 (+)
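The Location field can be split into a sequence ID, a start, an end, and a strand. The following is a minimal sketch of that parsing, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The helper names `parse_location` and `span_length` are hypothetical and are not part of any database API.

```python
import re


def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG08 : 1911267 .. 1916205 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4939 bases on the plus strand of Cp4.1LG08.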
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CTTATGATTTGGGCCTATGTAGCTGTCGTGATCCATATATTTTCAGGCCCATTAAAATTCATGTCGCTTTCCTTGGTGGTTGAATAATTGAATTTATTCTGCATCGTTGTGCCCTAATTTAGACTTCAAGCTTTCTGATTGTTACACGATTGGGTCGCCATTGCTAAATCACTTGAAGTTCAGGGAGAAGAAGCCAGGCCATGCAGAAGAGAGAGCATAATAAGTTGGGTGGTAATGTTGGCGGCGTCTCGTCGGCGCCTCCGGCTAAGCGAGGTCGTCCATTCGGCAGCGTAAACAGCAACGCCGCCGCTGCAGTCGAGATCTTCGCTCCATCGGCACTGCTTGGCCCTTCTTTCCATGTTCATACTTCCTTCGCGGGTCTGTTAATCGAAACTCTTTTCACTTCGTCTCTTAATTTGCGGGACTCTTTACTGGAAAATCAGCTGAAAGTTCAGTTTTCGAGTTCTTATTTTATGTTGAGACGTTTATTTGATTGAGTTCCTTTTTTTTATTTATCTCTAAAATTCGTCACAAGAGCAGATCAAAACAATAAAAAGATAGTGTTGGCGCTACAGAGTGGCTTGAAGAGTGAATTGACATGGGCACTGAATACTCTAACTCTTCTCTCCTTCAAAGAGAAGGACGATATGCGCAGAGATTCTACTCCTCTAGCGAAAATTCCCGGCTTGCTTGATGCTCTTCTTCAAGTTGTATGTATTACATTGCCTCTCAGTCATTTTCTCGCGTTTTCTTTTGCTATGAATAAATATGATATTTTGGTTCTCTCGGGAAGATTTTATTGTCCTACCGTGTTCCATTTTGCTCATGAAATTGAATTTTGAAATTGGCAAGACTGATTTAAATCGTTCAAGTAAATCCCGGGATTAATGATATCATGATGTTACATCAAACAGATAGATGATTGGCGTGATATAGCACTTCCGAAGGATCTTGTAAAGAAGGCAAGGATCAGAACGTTAGGTGTAAATTCTTCTGTAACGGGATTTGGGAATGAATATGAGGCATTGGGCTCAAATGGGTATGCATCAAATGTCATTTTGTTATGTTTGCTCAGTTTACTGAATATTAGTTCTAAGTATGGTGTAAATTATGCCTGACAGCCTGAGACCTGGTTCTTCAGCTTCAGAGGTAACGGTTCACACTTCCAAATCATCTCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAGTCTGGATGACGAAGGCCGAGCAGAAAGACAGCAATGTGCTGTTTCTGCTTCAAATATTATCCGAAACTTTTCTTTCATGCCAGAGAATGAATCTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGATCATGTTACAGGTCAGAAGTTCGTGACCCTAATGAGCATTGCTTTTAGAGTTTCCTGTAGATATCTTATGTCATTAGTATTGAATGTTGCGTTTGAAATTTGAAGTGTCAAATTTGAGTATGTTTGTCTGAATTATGTCTGAAAGCTTTTATTGTTGACAGAGGATGATGAACTTGTTACAAATACACTAGAGACAATTGTGAATTTATCCCCACTCCTTGATCTTCGTATCTTTAGTTCATCAAAGCCGTCCTACATCAAAATAACGTGAGAATACTATTGATATTTTAGGCCGACTCGGGTTGATGATTAGTTCTAATAAGCTTGCTTTATACTTTTGTACCAGAGAAAAACGAGCAGTGGAAGGCATCATGGGAATGCTTGGATCTTCTGTCAAAGTCTGGCACTGTTCTGCTGCAGAATTACTCGGACGATTGATAATAAATCCCGACAACGAGCCTTTCCTTCTTCCCTTTATCCCCCAGGTTTGTTTCTTTAAAAACCCATGAACCTTCCGGTCTTCTGTCGTATTATGCCTGCTTTATCGTAACAACTTTTTGTTGAATCTTATTAGATACACAAGCGTTTAGTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGG
TGCACTGTATAACCTTGTTGAAGTTAATATGGACTGCAGATTAAAGCTGGCAAGCGAAAGATGGTAATTCTTTTCTTTTTCCTTTACCGCTCTTGCGAAAACAACCTCGAGTTCTTTCCTAAGCCACTTGACACGCAGTAGATACTGAAGTATATTGCCATCTTTTTCCTGTACTTCGAATTAGGGCGATCGATCGACTTCTTAAAGTAATCAAGATGCCTCATCCAGTTCCGGAAATATGCAGGAAAGCAGCAATGATATTGGAGAGTCTCGTATCTGAGCCACAGAACAAGGGTGTGCTCCTAGCATTTGAAAATGCATTTGCAGAAATACTCTTCTTAGATGGCAGATATTCAGATACATTCGCTAGGATATTGTATGAGCTAACTTCCAGACCAAACAATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGATCAGAAGTATCCTACACCCAATTGTCATCTTCGGCTCGAGATCACTGGACTAGTAGCTGCTAGATTCGAGCCTAGGACGAGATCGCTGGACCAGTAGTTGCTACCTATTCAAGCAAAGGACTCGACCCATGTCGTTGCTGTAAAACGTAAATATCAGCAGTCTACTAACTTACATATATATCTAATTCAAGCTATTGTATCTTCCAACTCTGTATTTCTACTGCTCACTTATAATCTCTTGTAAGTTACTTCTATTGGATATTGTATCTTCCAACTCTGTATTTCTAGCAGGTTTTATTGGATATTGTTAAAATATTTAAATTACAATTTTAGTCCCAGGTCCATGAACGAGTCTATCCCTTTTAACTTTAAGAGATAATATTTTCATTGTTACTTGATATGATTAAATTGTTAAAATATTTAAAGTGCAGCTTCATTGTCGGGCAGAATCAACAAGAAGGCAAATATCAGGTAGATTATTCGATTATATTTCCTCGGATTGTCTTGTTGTGATAGGGATATGATTGGAATTCGTCTTTTCTTTGTTTGTATCACCGAGACACCCGAAGGGCTGGCCATATGTACCTCGCCTCGGGAGACCCAGCCCATTCAATTCAAAATCGGATGAGCACCTACAACCAAGGGGAATGGAATGTCGACCACAGCGTGAGATGATCTTCTAATACGAGGACGAGTTATTGGTCAAGTCATGAAACTTTTGCATTTTGATCAACATGGCTGTACTTAAACTGACCAAAGTAGTGACTACTTCAACCGAAAAAGGGCGACACTAAACTTTTGCATTTTGATCAACATGGCTGTACTTAAACTGACCAAAGTAGTGTTTTGCATTTTGATCAACATGGCTATACTTAAACTGACCAAAGTAGTGACTACTTCAACCGAAAAAAGGGCGACACTAAACTTTTGCTGGCTGTACTTAAACTGACCAAAGTAGTGACTACTTCAACCGAAAAAGGGCAACACTAAACTTTTGCATTTTGATCAACATGGCTGTACTTAAACTGACCAAAGTAGTGTTTTGCATTTTGATCAACATGGCTATACTTAAACTGACCAAAGTAGTGACTACTTCAACCGAAAAAAGGGCGACACTAAACTTTTGCTGGCTGTACTTAAACTGACCAAAGTAGTGACTACTTCAACCGAAAAAAGGGCGACCCACCAAAGTATACGTTAATTACATCTAAATCTATAAAGATTAAATTCTATTTTGTTTTAAAGGGATTAAATTTAAAGTTAAATCCTTAAATATATTATGAAAGTTAGTTGAAAATTGTCAAATGCACGTGAAAAGTGCCCAACCTAAATCATATTAAATTTTCGTATCATACTGAACAACGTACATACTTTTTATCTACATCTTTCCATTTTGGAACGAATACAGACTTTTTATCACTAGATTTAGATATTAGACATCAACTGCATAATTCAATTATTTATCATATAATTATTAAAATATAACAGACATAAATATTAAAACAACTATGGTTAAAAGAAATTAAAAGAAGTAAAATAATGGGTCCATTAATTGGGTAG
TATAAATGAATATTACACCTTAAAAAAACAATAATTAATTAGCATCCAATATGAACAATATGCTGTGTGACACACAAAATAAACAAACAACTTAAAAGAAAACATTATTTAAAAAAAAATAAAAAAAGGACCCAAACAAATTTGGAGTTAAAACCCTAATGAATCTCAAAAGAATTCTTCTTCTTCTTCAAGCTCTATTAATGGCGGCTACTCTGGGAACCTCAACAGGGAACCCATCAATCTCATCCATCCATGGATTCTTCTCCTTCGCCGGATCGACGGTCCTCAACGCCCGCCATACTGCACTGTTACACTTGAAACCCGACCCGAACGCAATCTGCCATGTCCGATCCCCTTTCCGGATCCGCCCTTTCGCCTCACTGTAAGCCAATTCGTACCACAAAGAACTACTCGACGTGTTCCCGAATCTATTTAGAGTCATTCTCGACGGCTCCATGTGCCACTCGCTCAGATCCAGATTCTTCTCCAGTTCATCCAGCACCGCGCGGCCGCCGGAATGGATGCAGAAATGTTCGAACGCGAGCTTAAAATCAGGTATGTACGGCTTAATTCGTTTCATTTTGAGAATCTTTTTGGCGACGAGAGTCGCAAAGAAGAGAAGCTGCTCCGACATTGGGAGGACGAGTGGACCAAGAGTAGTGATGTTGGTCTTCAGGGCTTCGCCGGCAACCGCCATTAGGTCTTTGGAGAGGCGGACGCCGACACGGCCAGTGTCGTCTTCTTGTTGGAAGACGCAGTTATAGCATTTGTCATCGGAGCCTTTGTGGGTTCGGACGGTGTGGATGAGTTGGTACTTGGAGCGACGGCGGTCTGAAGGGCGGTTTGAGAGGAGGATGGCGGCGCCGCCCATACGGAAGAGGCAATTCGAAACGAGCATTGATCGGTCGTTTCCGAAGTACCAATTAAGCGTGATGTTCT

mRNA sequence

CTTATGATTTGGGCCTATGTAGCTGTCGTGATCCATATATTTTCAGGCCCATTAAAATTCATGTCGCTTTCCTTGGTGGTTGAATAATTGAATTTATTCTGCATCGTTGTGCCCTAATTTAGACTTCAAGCTTTCTGATTGTTACACGATTGGGTCGCCATTGCTAAATCACTTGAAGTTCAGGGAGAAGAAGCCAGGCCATGCAGAAGAGAGAGCATAATAAGTTGGGTGGTAATGTTGGCGGCGTCTCGTCGGCGCCTCCGGCTAAGCGAGGTCGTCCATTCGGCAGCGTAAACAGCAACGCCGCCGCTGCAGTCGAGATCTTCGCTCCATCGGCACTGCTTGGCCCTTCTTTCCATGTTCATACTTCCTTCGCGGATCAAAACAATAAAAAGATAGTGTTGGCGCTACAGAGTGGCTTGAAGAGTGAATTGACATGGGCACTGAATACTCTAACTCTTCTCTCCTTCAAAGAGAAGGACGATATGCGCAGAGATTCTACTCCTCTAGCGAAAATTCCCGGCTTGCTTGATGCTCTTCTTCAAGTTATAGATGATTGGCGTGATATAGCACTTCCGAAGGATCTTGTAAAGAAGGCAAGGATCAGAACGTTAGGTGTAAATTCTTCTGTAACGGGATTTGGGAATGAATATGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGCTTCAGAGGTAACGGTTCACACTTCCAAATCATCTCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAGTCTGGATGACGAAGGCCGAGCAGAAAGACAGCAATGTGCTGTTTCTGCTTCAAATATTATCCGAAACTTTTCTTTCATGCCAGAGAATGAATCTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGATCATGTTACAGAGGATGATGAACTTGTTACAAATACACTAGAGACAATTGTGAATTTATCCCCACTCCTTGATCTTCGTATCTTTAGTTCATCAAAGCCGTCCTACATCAAAATAACAGAAAAACGAGCAGTGGAAGGCATCATGGGAATGCTTGGATCTTCTGTCAAAGTCTGGCACTGTTCTGCTGCAGAATTACTCGGACGATTGATAATAAATCCCGACAACGAGCCTTTCCTTCTTCCCTTTATCCCCCAGATACACAAGCGTTTAGTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTTGTTGAAGTTAATATGGACTGCAGATTAAAGCTGGCAAGCGAAAGATGGGCGATCGATCGACTTCTTAAAGTAATCAAGATGCCTCATCCAGTTCCGGAAATATGCAGGAAAGCAGCAATGATATTGGAGAGTCTCGTATCTGAGCCACAGAACAAGGGTGTGCTCCTAGCATTTGAAAATGCATTTGCAGAAATACTCTTCTTAGATGGCAGATATTCAGATACATTCGCTAGGATATTGTATGAGCTAACTTCCAGACCAAACAATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGATCAGAAGTATCCTACACCCAATTGTCATCTTCGGCTCGAGATCACTGGACTAGTAGCTGCTAGATTCGAGCCTAGGACGAGATCGCTGGACCAGTAGTTGCTACACATAAATATTAAAACAACTATGGTTAAAAGAAATTAAAAGAAGTAAAATAATGGGTCCATTAATTGGGTAGTATAAATGAATATTACACCTTAAAAAAACAATAATTAATTAGCATCCAATATGAACAATATGCTGTGTGACACACAAAATAAACAAACAACTTAAAAGAAAACATTATTTAAAAAAAAATAAAAAAAAAGAATTCTTCTTCTTCTTCAAGCTCTATTAATGGCGGCTACTCTGGGAACCTCAACAGGGAACCCATCAATCTCATCCATCCATGGATTCTTCTCCTTCGCCGGATCGACGGTCCTCAACGCCCGCCATACTGCACTGTTACA
CTTGAAACCCGACCCGAACGCAATCTGCCATGTCCGATCCCCTTTCCGGATCCGCCCTTTCGCCTCACTGTAAGCCAATTCGTACCACAAAGAACTACTCGACGTGTTCCCGAATCTATTTAGAGTCATTCTCGACGGCTCCATGTGCCACTCGCTCAGATCCAGATTCTTCTCCAGTTCATCCAGCACCGCGCGGCCGCCGGAATGGATGCAGAAATGTTCGAACGCGAGCTTAAAATCAGGTATGTACGGCTTAATTCGTTTCATTTTGAGAATCTTTTTGGCGACGAGAGTCGCAAAGAAGAGAAGCTGCTCCGACATTGGGAGGACGAGTGGACCAAGAGTAGTGATGTTGGTCTTCAGGGCTTCGCCGGCAACCGCCATTAGGTCTTTGGAGAGGCGGACGCCGACACGGCCAGTGTCGTCTTCTTGTTGGAAGACGCAGTTATAGCATTTGTCATCGGAGCCTTTGTGGGTTCGGACGGTGTGGATGAGTTGGTACTTGGAGCGACGGCGGTCTGAAGGGCGGTTTGAGAGGAGGATGGCGGCGCCGCCCATACGGAAGAGGCAATTCGAAACGAGCATTGATCGGTCGTTTCCGAAGTACCAATTAAGCGTGATGTTCT

Coding sequence (CDS)

ATGCAGAAGAGAGAGCATAATAAGTTGGGTGGTAATGTTGGCGGCGTCTCGTCGGCGCCTCCGGCTAAGCGAGGTCGTCCATTCGGCAGCGTAAACAGCAACGCCGCCGCTGCAGTCGAGATCTTCGCTCCATCGGCACTGCTTGGCCCTTCTTTCCATGTTCATACTTCCTTCGCGGATCAAAACAATAAAAAGATAGTGTTGGCGCTACAGAGTGGCTTGAAGAGTGAATTGACATGGGCACTGAATACTCTAACTCTTCTCTCCTTCAAAGAGAAGGACGATATGCGCAGAGATTCTACTCCTCTAGCGAAAATTCCCGGCTTGCTTGATGCTCTTCTTCAAGTTATAGATGATTGGCGTGATATAGCACTTCCGAAGGATCTTGTAAAGAAGGCAAGGATCAGAACGTTAGGTGTAAATTCTTCTGTAACGGGATTTGGGAATGAATATGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGCTTCAGAGGTAACGGTTCACACTTCCAAATCATCTCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAGTCTGGATGACGAAGGCCGAGCAGAAAGACAGCAATGTGCTGTTTCTGCTTCAAATATTATCCGAAACTTTTCTTTCATGCCAGAGAATGAATCTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGATCATGTTACAGAGGATGATGAACTTGTTACAAATACACTAGAGACAATTGTGAATTTATCCCCACTCCTTGATCTTCGTATCTTTAGTTCATCAAAGCCGTCCTACATCAAAATAACAGAAAAACGAGCAGTGGAAGGCATCATGGGAATGCTTGGATCTTCTGTCAAAGTCTGGCACTGTTCTGCTGCAGAATTACTCGGACGATTGATAATAAATCCCGACAACGAGCCTTTCCTTCTTCCCTTTATCCCCCAGATACACAAGCGTTTAGTCGACCTTATGAGCATCCCAGCATTAGATGCACAAGCAGCAGCTGTTGGTGCACTGTATAACCTTGTTGAAGTTAATATGGACTGCAGATTAAAGCTGGCAAGCGAAAGATGGGCGATCGATCGACTTCTTAAAGTAATCAAGATGCCTCATCCAGTTCCGGAAATATGCAGGAAAGCAGCAATGATATTGGAGAGTCTCGTATCTGAGCCACAGAACAAGGGTGTGCTCCTAGCATTTGAAAATGCATTTGCAGAAATACTCTTCTTAGATGGCAGATATTCAGATACATTCGCTAGGATATTGTATGAGCTAACTTCCAGACCAAACAATAAAGTTGCTGCTGCTCAAGGAGTATGGGGCATGTGA

Protein sequence

MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAVEIFAPSALLGPSFHVHTSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRPGSSASEVTVHTSKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGMLGSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYNLVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFENAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM
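The protein sequence is the conceptual translation of the coding sequence. As a rough illustration, the sketch below translates a CDS with the standard genetic code, assuming the input is in frame, starts at the first codon, and contains only A, C, G, and T. The function name `translate` and the table layout are choices made for this example, not something taken from the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second, and third base.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCAGAAGAGAGAGCATAATAAG"))
```

The first eight codons give MQKREHNK, which agrees with the start of the listed protein sequence. The last seven codons of the CDS (CAAGGAGTATGGGGCATGTGA) give QGVWGM followed by the TGA stop codon.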
BLAST of Cp4.1LG08g03800 vs. Swiss-Prot
Match: LFR_ARATH (Armadillo repeat-containing protein LFR OS=Arabidopsis thaliana GN=LFR PE=2 SV=1)

HSP 1 Score: 672.2 bits (1733), Expect = 4.1e-192
Identity = 337/461 (73.10%), Positives = 395/461 (85.68%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSN------AAAAVEIFAPSALLGPSFHV 60
           MQKRE  K GGN GG SS PPAKRGRPFGS ++N      AAAA +  +PSALLGPS  V
Sbjct: 1   MQKRELGKSGGNSGG-SSGPPAKRGRPFGSTSANSAAAAAAAAAADAMSPSALLGPSLLV 60

Query: 61  HTSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALL 120
           H SF +QNN++IVLALQSGLKSE+TWALNTLTLLSFKEK+D+RRD  PLAKI GLLDALL
Sbjct: 61  HNSFVEQNNRRIVLALQSGLKSEVTWALNTLTLLSFKEKEDIRRDVMPLAKIAGLLDALL 120

Query: 121 QVIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGS---NGLRPGSSASEVT-- 180
            +IDDWRDIALPKDL +  R+RTLG N+SVTGFGNEY+AL S    G   GSSA+E    
Sbjct: 121 LIIDDWRDIALPKDLTRGTRVRTLGTNASVTGFGNEYDALASIQPPGSGIGSSAAEALGK 180

Query: 181 VHTSKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTL 240
             T K     WW++EDGLF+LDDEGR+E+Q CA++ASN+IRNFSFMP+NE +MAQHRH L
Sbjct: 181 KSTGKHQSSQWWMEEDGLFNLDDEGRSEKQMCAIAASNVIRNFSFMPDNEVVMAQHRHCL 240

Query: 241 ETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGML 300
           ETVFQCI DH+TED+ELVTN+LETIVNL+ L+DLRIFSS K SYI I EK+AV+ ++G+L
Sbjct: 241 ETVFQCIHDHMTEDEELVTNSLETIVNLAHLMDLRIFSSLKQSYININEKKAVQAVVGIL 300

Query: 301 GSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYN 360
            SSVK W+C+AAELLGRLIINPDNEPF+ P IPQIHKRL+DL+SI A+DAQAAAVGALYN
Sbjct: 301 NSSVKAWNCAAAELLGRLIINPDNEPFISPLIPQIHKRLIDLLSIQAVDAQAAAVGALYN 360

Query: 361 LVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFE 420
           LVEVNMDCRLKLASERWA+DRLLKVIK PHPVPE+CRKAAMILE+LVSEPQN+G+LLA+E
Sbjct: 361 LVEVNMDCRLKLASERWAVDRLLKVIKTPHPVPEVCRKAAMILENLVSEPQNRGLLLAYE 420

Query: 421 NAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           NAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Sbjct: 421 NAFAELLFQEGKYSDSFARILYELTARSNSRVASARGIWGM 460
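The percentages on each HSP summary line are the match counts divided by the alignment length, which includes gap columns. The sketch below recomputes them for the Swiss-Prot hit above. The function name `hsp_percentages` is hypothetical, and the regular expression accepts both "Positives" and the "Postives" spelling that this kind of output sometimes carries.

```python
import re


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    pattern = r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
    match = re.search(pattern, line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return 100 * ident / ident_len, 100 * pos / pos_len


line = "Identity = 337/461 (73.10%), Positives = 395/461 (85.68%), Query Frame = 1"
identity, positives = hsp_percentages(line)
print(f"{identity:.2f} {positives:.2f}")
```

For this hit the recomputed values, rounded to two decimals, are 73.10 and 85.68, matching the figures printed in the report.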

BLAST of Cp4.1LG08g03800 vs. TrEMBL
Match: A0A0A0L4L4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G144150 PE=4 SV=1)

HSP 1 Score: 805.4 bits (2079), Expect = 3.4e-230
Identity = 406/457 (88.84%), Positives = 426/457 (93.22%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAV-------EIFAPSALLGPSFH 60
           MQKR+ NKLGGNV G +SAPPAKRGRPFGSVNSNAAA         E  APS LLGPS H
Sbjct: 1   MQKRDQNKLGGNVSGGASAPPAKRGRPFGSVNSNAAAVAAAVAAGTETLAPSTLLGPSLH 60

Query: 61  VHTSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120
           +HTSFADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL
Sbjct: 61  IHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120

Query: 121 LQVIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRPGSSASEVTVHTS 180
           LQVIDDWRDIALP+DLVKK R+RTLG NSSVTGFGNE+EALGS+GLRP SS SE T H S
Sbjct: 121 LQVIDDWRDIALPRDLVKKQRVRTLGANSSVTGFGNEFEALGSDGLRPSSSVSESTGHAS 180

Query: 181 KSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVF 240
           K S R WWL+EDGLF+LDDEGRAERQQCAVSASNI+RNFSFMPENESIMA HRHTLETVF
Sbjct: 181 KPSSRPWWLEEDGLFNLDDEGRAERQQCAVSASNILRNFSFMPENESIMALHRHTLETVF 240

Query: 241 QCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGMLGSSV 300
           QCIEDHVTED+ELVTN LETIVNL+PLLDLRIFSS KPSYIKITEKRAVE IMGMLGS+V
Sbjct: 241 QCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSLKPSYIKITEKRAVEAIMGMLGSAV 300

Query: 301 KVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYNLVEV 360
           KVWHC+AAELLGRLIINPDNEPFLLPF+PQIHKRLVDLMSIPALDAQAAAVGALYNLVEV
Sbjct: 301 KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEV 360

Query: 361 NMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFENAFA 420
           NMDCR+KLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQN+G+LLA+ENAFA
Sbjct: 361 NMDCRIKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNRGLLLAYENAFA 420

Query: 421 EILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           EILF DGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Sbjct: 421 EILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 457

BLAST of Cp4.1LG08g03800 vs. TrEMBL
Match: A5AL12_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g04520 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 1.2e-206
Identity = 369/458 (80.57%), Positives = 407/458 (88.86%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSN--AAAAVEIFAPSALLGPSFHVHTSF 60
           MQKR+ +KLGG  GG ++ P AKRGRPFGS  SN  AAAA +  APS LLGPS HVH+SF
Sbjct: 1   MQKRDQSKLGGTAGGATT-PAAKRGRPFGSGGSNSAAAAAADAAAPSTLLGPSLHVHSSF 60

Query: 61  ADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVID 120
           ADQNNK+IVLALQSGLKSEL WA+N LTLLSFKEKDD+R+D+TPLAKIPGLLDALLQVID
Sbjct: 61  ADQNNKRIVLALQSGLKSELGWAINALTLLSFKEKDDVRKDATPLAKIPGLLDALLQVID 120

Query: 121 DWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRP----GSSASEVTV--HT 180
           DWRDIALPK+L K  R R LG NS VTGFGNEYEALGSN +      GSS SE +V  +T
Sbjct: 121 DWRDIALPKELAKAPRARLLGANSFVTGFGNEYEALGSNDVLSHPGSGSSISEASVQKNT 180

Query: 181 SKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETV 240
           +K  P  WWLDEDGLF+LD+EGRAE+QQCAV+ASNIIRNFSFMP+NE IMAQHRH LETV
Sbjct: 181 TKLRPSEWWLDEDGLFNLDEEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQHRHCLETV 240

Query: 241 FQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGMLGSS 300
           FQCIEDH+TED+ELVTN LETIVNL+PLLDLRIFSSSKPSYIKITEKRAV+ IMGMLGS+
Sbjct: 241 FQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVQAIMGMLGSA 300

Query: 301 VKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYNLVE 360
           VK WHC+AAELLGRLIINPDNEPFLLPF  QIHKRLVDL+S+PA+DAQAAAVGALYNL E
Sbjct: 301 VKAWHCAAAELLGRLIINPDNEPFLLPFASQIHKRLVDLLSLPAVDAQAAAVGALYNLAE 360

Query: 361 VNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFENAF 420
           VNMDCRLKLASERWAIDRLLKVIK PHPVPE+CRKAAMI+ESLVSEPQN+  LLA+ENAF
Sbjct: 361 VNMDCRLKLASERWAIDRLLKVIKTPHPVPEVCRKAAMIIESLVSEPQNRAQLLAYENAF 420

Query: 421 AEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           AEILF DGR+SDTFARILYELTSRPNNK+AAA+G+WGM
Sbjct: 421 AEILFSDGRHSDTFARILYELTSRPNNKMAAARGIWGM 457

BLAST of Cp4.1LG08g03800 vs. TrEMBL
Match: M5XRJ6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005335mg PE=4 SV=1)

HSP 1 Score: 723.4 bits (1866), Expect = 1.7e-205
Identity = 376/467 (80.51%), Positives = 407/467 (87.15%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAV----EIFAPSALLGPSFHVHT 60
           MQKRE +KLGG VGG +SAPPAKRGRPFGS  ++AAAA     E  APS LLGPS HVH+
Sbjct: 1   MQKREQSKLGG-VGGGASAPPAKRGRPFGSGGNSAAAAAAAAAETAAPSTLLGPSLHVHS 60

Query: 61  SFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQV 120
           SFADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMR+D+TPLAKIPGLLDALL V
Sbjct: 61  SFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDTTPLAKIPGLLDALLSV 120

Query: 121 IDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRP----GSSASEVTVH- 180
           IDDWRDIALPK+ VK  R+R LG N  VTGFGNEYEALGSNG  P    GSS+ E +V  
Sbjct: 121 IDDWRDIALPKEHVKAPRVRNLGANLLVTGFGNEYEALGSNGTLPLPGLGSSSQEASVLS 180

Query: 181 --TSKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTL 240
             T   S   WWLDEDGLF+LD+EGRAERQQCAV+ASNIIRNFSFMP+NE IMAQHRH L
Sbjct: 181 NVTKLRSSSEWWLDEDGLFNLDEEGRAERQQCAVAASNIIRNFSFMPDNEVIMAQHRHCL 240

Query: 241 ETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGML 300
           ETVFQCIED++TED+ELVTN LETIVNL+PLLDL IFSSSKPSYIKIT KRAV+ IMGML
Sbjct: 241 ETVFQCIEDYLTEDEELVTNALETIVNLAPLLDLGIFSSSKPSYIKITGKRAVQAIMGML 300

Query: 301 GSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALD------AQAAA 360
           GS VK WHC+AAELLGRLIINPDNE FLLPF+PQIHKRLVDLMS+P++D      AQAAA
Sbjct: 301 GSVVKTWHCAAAELLGRLIINPDNESFLLPFVPQIHKRLVDLMSLPSVDAQTAHGAQAAA 360

Query: 361 VGALYNLVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKG 420
           VGALYNL EVNMDCRLKLASERWAIDRLLKVIK PHPVPE+CRKAAMILESLVSEPQN+ 
Sbjct: 361 VGALYNLAEVNMDCRLKLASERWAIDRLLKVIKAPHPVPEVCRKAAMILESLVSEPQNRA 420

Query: 421 VLLAFENAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           +LLA+ENAFAEILF D RYSDTFARILYELTSRPNNKVAAA+GVWGM
Sbjct: 421 LLLAYENAFAEILFSDARYSDTFARILYELTSRPNNKVAAARGVWGM 466

BLAST of Cp4.1LG08g03800 vs. TrEMBL
Match: A0A0L9TTL1_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan02g003000 PE=4 SV=1)

HSP 1 Score: 722.6 bits (1864), Expect = 2.9e-205
Identity = 366/461 (79.39%), Positives = 406/461 (88.07%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAA---AVEIFAPSALLGPSFHVHTS 60
           MQKRE  K GG+ GG  +APPAKRGRPFGS +S A+A   A +  APS LLGPS HVH S
Sbjct: 1   MQKREQGKSGGSAGG-GAAPPAKRGRPFGSGSSGASASASAADSAAPSTLLGPSLHVHNS 60

Query: 61  FADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVI 120
           FADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMR+D+TPLAKIPGLLDALLQVI
Sbjct: 61  FADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 120

Query: 121 DDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSN------GLRPGSSASEVTVH 180
           DDWRDIALPK+L K  R+RTLG +S VTGFGNEY+ALGS       G+  GS+ +E T H
Sbjct: 121 DDWRDIALPKELAKSTRVRTLGASSVVTGFGNEYQALGSTSALHRPGVGSGSAGTESTQH 180

Query: 181 T--SKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTL 240
           +  +KS     WLDEDGLF+LDDEGR+E+QQCAV+ASNIIRNFSFMP+NE IMAQHRH L
Sbjct: 181 SGVTKSRFTELWLDEDGLFNLDDEGRSEKQQCAVAASNIIRNFSFMPDNEVIMAQHRHCL 240

Query: 241 ETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGML 300
           ET FQCIEDH+ EDDELVTN LETIVNL+PLLDLRIFSSSKPS+IKITEKRAV+ IMGML
Sbjct: 241 ETAFQCIEDHLVEDDELVTNALETIVNLAPLLDLRIFSSSKPSFIKITEKRAVQAIMGML 300

Query: 301 GSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYN 360
            S+VK WHC+AAELLGRLIINPDNEPFLLPF PQIHKRL+DL+S+PALDAQAAA+GALYN
Sbjct: 301 ESAVKAWHCAAAELLGRLIINPDNEPFLLPFFPQIHKRLIDLISMPALDAQAAAIGALYN 360

Query: 361 LVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFE 420
           L EVNMDCRLK+A+ERWAIDRLLKVIK PHPVPE+CRKAAMILESLVSEPQN+ +LLA+E
Sbjct: 361 LAEVNMDCRLKIANERWAIDRLLKVIKTPHPVPEVCRKAAMILESLVSEPQNRSLLLAYE 420

Query: 421 NAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           NAFAEILF DGRYSDTFARILYELTSRPNNKVA A+G+WGM
Sbjct: 421 NAFAEILFTDGRYSDTFARILYELTSRPNNKVATARGIWGM 460

BLAST of Cp4.1LG08g03800 vs. TrEMBL
Match: V7BLJ4_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_007G279400g PE=4 SV=1)

HSP 1 Score: 718.0 bits (1852), Expect = 7.2e-204
Identity = 365/461 (79.18%), Positives = 404/461 (87.64%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAV---EIFAPSALLGPSFHVHTS 60
           MQKRE  K GG+ GG  +APPAKRGRPFGS +S+A+AA    +  APS LLGPS HVH S
Sbjct: 32  MQKREQGKSGGSGGG-GAAPPAKRGRPFGSGSSSASAAASAADSAAPSTLLGPSLHVHNS 91

Query: 61  FADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVI 120
           FADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMR+D+TPLAKIPGLLDALLQ I
Sbjct: 92  FADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQAI 151

Query: 121 DDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSN------GLRPGSSASEVTVH 180
           DDWRDIALPK+L K  R+RTLG NS VTGFGNEY+ALGS       G+  GS+  E T H
Sbjct: 152 DDWRDIALPKELAKSTRVRTLGANSVVTGFGNEYQALGSTSALHRPGVGSGSAGIESTQH 211

Query: 181 T--SKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTL 240
           +  +KS     WLDEDGLF+LDDEGRAE+QQCAV+ASNIIRNFSFMP+NE IMAQHRH L
Sbjct: 212 SGVTKSRFTELWLDEDGLFNLDDEGRAEKQQCAVTASNIIRNFSFMPDNEVIMAQHRHCL 271

Query: 241 ETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGML 300
           ET FQCIEDH+ ED+ELVTN LETIVNL+PLLDLRIFSSSKPS+IKITEKRAV+ IMGML
Sbjct: 272 ETAFQCIEDHLVEDEELVTNALETIVNLAPLLDLRIFSSSKPSFIKITEKRAVQAIMGML 331

Query: 301 GSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYN 360
            S+VK WHC+AAELLGRLIINPDNEPFLLPF P IHKRL+DL+S+PALDAQAAA+GALYN
Sbjct: 332 ESAVKAWHCAAAELLGRLIINPDNEPFLLPFFPLIHKRLIDLISMPALDAQAAAIGALYN 391

Query: 361 LVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFE 420
           L EVNMDCRLK+A+ERWAIDRLLKVIK PHPVPE+CRKAAMILESLVSEPQN+ +LLA+E
Sbjct: 392 LAEVNMDCRLKIANERWAIDRLLKVIKTPHPVPEVCRKAAMILESLVSEPQNRSLLLAYE 451

Query: 421 NAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           NAFAEILF DGRYSDTFARILYELTSRPNNKVA A+G+WGM
Sbjct: 452 NAFAEILFTDGRYSDTFARILYELTSRPNNKVATARGIWGM 491

BLAST of Cp4.1LG08g03800 vs. TAIR10
Match: AT3G22990.1 (AT3G22990.1 ARM repeat superfamily protein)

HSP 1 Score: 672.2 bits (1733), Expect = 2.3e-193
Identity = 337/461 (73.10%), Positives = 395/461 (85.68%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSN------AAAAVEIFAPSALLGPSFHV 60
           MQKRE  K GGN GG SS PPAKRGRPFGS ++N      AAAA +  +PSALLGPS  V
Sbjct: 1   MQKRELGKSGGNSGG-SSGPPAKRGRPFGSTSANSAAAAAAAAAADAMSPSALLGPSLLV 60

Query: 61  HTSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALL 120
           H SF +QNN++IVLALQSGLKSE+TWALNTLTLLSFKEK+D+RRD  PLAKI GLLDALL
Sbjct: 61  HNSFVEQNNRRIVLALQSGLKSEVTWALNTLTLLSFKEKEDIRRDVMPLAKIAGLLDALL 120

Query: 121 QVIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGS---NGLRPGSSASEVT-- 180
            +IDDWRDIALPKDL +  R+RTLG N+SVTGFGNEY+AL S    G   GSSA+E    
Sbjct: 121 LIIDDWRDIALPKDLTRGTRVRTLGTNASVTGFGNEYDALASIQPPGSGIGSSAAEALGK 180

Query: 181 VHTSKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTL 240
             T K     WW++EDGLF+LDDEGR+E+Q CA++ASN+IRNFSFMP+NE +MAQHRH L
Sbjct: 181 KSTGKHQSSQWWMEEDGLFNLDDEGRSEKQMCAIAASNVIRNFSFMPDNEVVMAQHRHCL 240

Query: 241 ETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGML 300
           ETVFQCI DH+TED+ELVTN+LETIVNL+ L+DLRIFSS K SYI I EK+AV+ ++G+L
Sbjct: 241 ETVFQCIHDHMTEDEELVTNSLETIVNLAHLMDLRIFSSLKQSYININEKKAVQAVVGIL 300

Query: 301 GSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYN 360
            SSVK W+C+AAELLGRLIINPDNEPF+ P IPQIHKRL+DL+SI A+DAQAAAVGALYN
Sbjct: 301 NSSVKAWNCAAAELLGRLIINPDNEPFISPLIPQIHKRLIDLLSIQAVDAQAAAVGALYN 360

Query: 361 LVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFE 420
           LVEVNMDCRLKLASERWA+DRLLKVIK PHPVPE+CRKAAMILE+LVSEPQN+G+LLA+E
Sbjct: 361 LVEVNMDCRLKLASERWAVDRLLKVIKTPHPVPEVCRKAAMILENLVSEPQNRGLLLAYE 420

Query: 421 NAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           NAFAE+LF +G+YSD+FARILYELT+R N++VA+A+G+WGM
Sbjct: 421 NAFAELLFQEGKYSDSFARILYELTARSNSRVASARGIWGM 460

BLAST of Cp4.1LG08g03800 vs. NCBI nr
Match: gi|659076287|ref|XP_008438598.1| (PREDICTED: uncharacterized protein LOC103483658 [Cucumis melo])

HSP 1 Score: 815.5 bits (2105), Expect = 4.7e-233
Identity = 410/457 (89.72%), Positives = 430/457 (94.09%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAV-------EIFAPSALLGPSFH 60
           MQKR+ NKLGGNV G +SAPPAKRGRPFGSVNSNAAA         E  APS LLGPS H
Sbjct: 1   MQKRDQNKLGGNVSGGASAPPAKRGRPFGSVNSNAAAVAAAVAAGTETLAPSTLLGPSLH 60

Query: 61  VHTSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120
           +HTSFADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL
Sbjct: 61  IHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120

Query: 121 LQVIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRPGSSASEVTVHTS 180
           LQVIDDWRDIALP+DLVKK R+RTLG NSSVTGFGNE+EALGS+GLRPGSSASE T H S
Sbjct: 121 LQVIDDWRDIALPRDLVKKQRVRTLGANSSVTGFGNEFEALGSDGLRPGSSASESTGHAS 180

Query: 181 KSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVF 240
           K S RHWWL+EDGLF+LDDEGRAERQQCAVSASNI+RNFSFMPENESIMA HRHTLETVF
Sbjct: 181 KPSSRHWWLEEDGLFNLDDEGRAERQQCAVSASNILRNFSFMPENESIMALHRHTLETVF 240

Query: 241 QCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGMLGSSV 300
           QCIEDHVTED+ELVTN LETIVNL+PLLDLRIFSSSKPSYIKITEKRAVE IMGMLGS+V
Sbjct: 241 QCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVEAIMGMLGSAV 300

Query: 301 KVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYNLVEV 360
           KVWHC+AAELLGRLIINPDNEPFLLPF+PQIHKRLVDLMSIPALDAQAAAVGALYNLVEV
Sbjct: 301 KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEV 360

Query: 361 NMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFENAFA 420
           NMDCR+KLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQN+G+LLA+ENAFA
Sbjct: 361 NMDCRIKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNRGLLLAYENAFA 420

Query: 421 EILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           EILF DGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Sbjct: 421 EILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 457

BLAST of Cp4.1LG08g03800 vs. NCBI nr
Match: gi|778678350|ref|XP_011650953.1| (PREDICTED: armadillo repeat-containing protein LFR [Cucumis sativus])

HSP 1 Score: 805.4 bits (2079), Expect = 4.9e-230
Identity = 406/457 (88.84%), Positives = 426/457 (93.22%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAV-------EIFAPSALLGPSFH 60
           MQKR+ NKLGGNV G +SAPPAKRGRPFGSVNSNAAA         E  APS LLGPS H
Sbjct: 1   MQKRDQNKLGGNVSGGASAPPAKRGRPFGSVNSNAAAVAAAVAAGTETLAPSTLLGPSLH 60

Query: 61  VHTSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120
           +HTSFADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL
Sbjct: 61  IHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120

Query: 121 LQVIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRPGSSASEVTVHTS 180
           LQVIDDWRDIALP+DLVKK R+RTLG NSSVTGFGNE+EALGS+GLRP SS SE T H S
Sbjct: 121 LQVIDDWRDIALPRDLVKKQRVRTLGANSSVTGFGNEFEALGSDGLRPSSSVSESTGHAS 180

Query: 181 KSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETVF 240
           K S R WWL+EDGLF+LDDEGRAERQQCAVSASNI+RNFSFMPENESIMA HRHTLETVF
Sbjct: 181 KPSSRPWWLEEDGLFNLDDEGRAERQQCAVSASNILRNFSFMPENESIMALHRHTLETVF 240

Query: 241 QCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGMLGSSV 300
           QCIEDHVTED+ELVTN LETIVNL+PLLDLRIFSS KPSYIKITEKRAVE IMGMLGS+V
Sbjct: 241 QCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSLKPSYIKITEKRAVEAIMGMLGSAV 300

Query: 301 KVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYNLVEV 360
           KVWHC+AAELLGRLIINPDNEPFLLPF+PQIHKRLVDLMSIPALDAQAAAVGALYNLVEV
Sbjct: 301 KVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQAAAVGALYNLVEV 360

Query: 361 NMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFENAFA 420
           NMDCR+KLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQN+G+LLA+ENAFA
Sbjct: 361 NMDCRIKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNRGLLLAYENAFA 420

Query: 421 EILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           EILF DGRYSDTFARILYELTSRPNNKVAAAQGVWGM
Sbjct: 421 EILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 457

BLAST of Cp4.1LG08g03800 vs. NCBI nr
Match: gi|1009119735|ref|XP_015876541.1| (PREDICTED: armadillo repeat-containing protein LFR [Ziziphus jujuba])

HSP 1 Score: 731.9 bits (1888), Expect = 6.9e-208
Identity = 374/463 (80.78%), Positives = 411/463 (88.77%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAAAV-----EIFAPSALLGPSFHVH 60
           MQKRE +KLGG  GG ++APPAKRGRPFGS +SNAAAA      +  APS LLGPS HVH
Sbjct: 1   MQKREQSKLGGGAGG-AAAPPAKRGRPFGSTSSNAAAAAAAAAADTAAPSTLLGPSLHVH 60

Query: 61  TSFADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQ 120
           +SFADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDD R+DST LAKIPGLLDALLQ
Sbjct: 61  SSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDFRKDSTALAKIPGLLDALLQ 120

Query: 121 VIDDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNG------LRPGSSASEVT 180
           VIDDWRDIAL K+L+K+ R+RTLG NS VTGFG+EYEALGSNG      L  GSS +E +
Sbjct: 121 VIDDWRDIALSKELIKEPRVRTLGANSMVTGFGHEYEALGSNGGLLHSGLGSGSSVTEAS 180

Query: 181 V--HTSKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRH 240
              + +KS P  WWLDEDGLF+LD+EGRA++QQCAV+ASNIIRNFSFMPENE IMAQHRH
Sbjct: 181 TPNNVTKSRPSEWWLDEDGLFNLDEEGRAKKQQCAVAASNIIRNFSFMPENEMIMAQHRH 240

Query: 241 TLETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMG 300
            LETVFQCIED+VTED+ELVTN LETIVNL+P LDLRIFSSSKPSYIKITE RAV+ IMG
Sbjct: 241 CLETVFQCIEDYVTEDEELVTNALETIVNLAPYLDLRIFSSSKPSYIKITEIRAVQAIMG 300

Query: 301 MLGSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGAL 360
           +LGS+VK WHC+AAELLGRLIINPDN PFLL F+PQIHKRLVDLMS+PALDAQAAAVGAL
Sbjct: 301 VLGSTVKAWHCAAAELLGRLIINPDNGPFLLNFVPQIHKRLVDLMSLPALDAQAAAVGAL 360

Query: 361 YNLVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLA 420
           YNL EVNMDCRLKLASERWAIDRLLKVIK PHPVPE+CRKAAMILESLVSEPQ++ +LLA
Sbjct: 361 YNLAEVNMDCRLKLASERWAIDRLLKVIKAPHPVPEVCRKAAMILESLVSEPQSRALLLA 420

Query: 421 FENAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           +ENAFAEILF D RYSDTFARILYELTSRPNNKVAAA+GVWGM
Sbjct: 421 YENAFAEILFSDARYSDTFARILYELTSRPNNKVAAARGVWGM 462

BLAST of Cp4.1LG08g03800 vs. NCBI nr
Match: gi|225432860|ref|XP_002283908.1| (PREDICTED: armadillo repeat-containing protein LFR [Vitis vinifera])

HSP 1 Score: 727.2 bits (1876), Expect = 1.7e-206
Identity = 369/458 (80.57%), Positives = 407/458 (88.86%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSN--AAAAVEIFAPSALLGPSFHVHTSF 60
           MQKR+ +KLGG  GG ++ P AKRGRPFGS  SN  AAAA +  APS LLGPS HVH+SF
Sbjct: 1   MQKRDQSKLGGTAGGATT-PAAKRGRPFGSGGSNSAAAAAADAAAPSTLLGPSLHVHSSF 60

Query: 61  ADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVID 120
           ADQNNK+IVLALQSGLKSEL WA+N LTLLSFKEKDD+R+D+TPLAKIPGLLDALLQVID
Sbjct: 61  ADQNNKRIVLALQSGLKSELGWAINALTLLSFKEKDDVRKDATPLAKIPGLLDALLQVID 120

Query: 121 DWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSNGLRP----GSSASEVTV--HT 180
           DWRDIALPK+L K  R R LG NS VTGFGNEYEALGSN +      GSS SE +V  +T
Sbjct: 121 DWRDIALPKELAKAPRARLLGANSFVTGFGNEYEALGSNDVLSHPGSGSSISEASVQKNT 180

Query: 181 SKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTLETV 240
           +K  P  WWLDEDGLF+LD+EGRAE+QQCAV+ASNIIRNFSFMP+NE IMAQHRH LETV
Sbjct: 181 TKLRPSEWWLDEDGLFNLDEEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQHRHCLETV 240

Query: 241 FQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGMLGSS 300
           FQCIEDH+TED+ELVTN LETIVNL+PLLDLRIFSSSKPSYIKITEKRAV+ IMGMLGS+
Sbjct: 241 FQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKITEKRAVQAIMGMLGSA 300

Query: 301 VKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYNLVE 360
           VK WHC+AAELLGRLIINPDNEPFLLPF  QIHKRLVDL+S+PA+DAQAAAVGALYNL E
Sbjct: 301 VKAWHCAAAELLGRLIINPDNEPFLLPFASQIHKRLVDLLSLPAVDAQAAAVGALYNLAE 360

Query: 361 VNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFENAF 420
           VNMDCRLKLASERWAIDRLLKVIK PHPVPE+CRKAAMI+ESLVSEPQN+  LLA+ENAF
Sbjct: 361 VNMDCRLKLASERWAIDRLLKVIKTPHPVPEVCRKAAMIIESLVSEPQNRAQLLAYENAF 420

Query: 421 AEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           AEILF DGR+SDTFARILYELTSRPNNK+AAA+G+WGM
Sbjct: 421 AEILFSDGRHSDTFARILYELTSRPNNKMAAARGIWGM 457

BLAST of Cp4.1LG08g03800 vs. NCBI nr
Match: gi|951015969|ref|XP_014510992.1| (PREDICTED: armadillo repeat-containing protein LFR [Vigna radiata var. radiata])

HSP 1 Score: 724.2 bits (1868), Expect = 1.4e-205
Identity = 367/461 (79.61%), Positives = 407/461 (88.29%), Query Frame = 1

Query: 1   MQKREHNKLGGNVGGVSSAPPAKRGRPFGSVNSNAAA---AVEIFAPSALLGPSFHVHTS 60
           MQKRE  K GG+ GG  +APPAKRGRPFGS +S+A+A   A +  APS LLGPS HVH S
Sbjct: 1   MQKREQGKSGGSAGG-GAAPPAKRGRPFGSGSSSASASASAADSAAPSTLLGPSLHVHNS 60

Query: 61  FADQNNKKIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDALLQVI 120
           FADQNNK+IVLALQSGLKSELTWALNTLTLLSFKEKDDMR+D+TPLAKIPGLLDALLQVI
Sbjct: 61  FADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 120

Query: 121 DDWRDIALPKDLVKKARIRTLGVNSSVTGFGNEYEALGSN------GLRPGSSASEVTVH 180
           DDWRDIALPK+L K  R+RTLG +S VTGFGNEY+ALGS       G+  GS+ +E T H
Sbjct: 121 DDWRDIALPKELAKSTRVRTLGASSVVTGFGNEYQALGSTSALHRPGVGSGSAGTESTQH 180

Query: 181 T--SKSSPRHWWLDEDGLFSLDDEGRAERQQCAVSASNIIRNFSFMPENESIMAQHRHTL 240
           +  +KS     WLDEDGLF+LDDEGRAE+QQCAV+ASNIIRNFSFMP+NE IMAQHRH L
Sbjct: 181 SGVTKSRFTELWLDEDGLFNLDDEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQHRHCL 240

Query: 241 ETVFQCIEDHVTEDDELVTNTLETIVNLSPLLDLRIFSSSKPSYIKITEKRAVEGIMGML 300
           ET FQCIEDH+ EDDELVTN LETIVNL+PLLDLRIFSSSKPS+IKITEKRAV+ IMGML
Sbjct: 241 ETAFQCIEDHLVEDDELVTNALETIVNLAPLLDLRIFSSSKPSFIKITEKRAVQAIMGML 300

Query: 301 GSSVKVWHCSAAELLGRLIINPDNEPFLLPFIPQIHKRLVDLMSIPALDAQAAAVGALYN 360
            S+VK WHC+AAELLGRLIINPDNEPFLLPF PQIHKRL+DL+S+PALDAQAAA+GALYN
Sbjct: 301 ESAVKAWHCAAAELLGRLIINPDNEPFLLPFFPQIHKRLIDLISMPALDAQAAAIGALYN 360

Query: 361 LVEVNMDCRLKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQNKGVLLAFE 420
           L EVNMDCRLK+A+ERWAIDRLLKVIK PHPVPE+CRKAAMILESLVSEPQN+ +LLA+E
Sbjct: 361 LAEVNMDCRLKIANERWAIDRLLKVIKTPHPVPEVCRKAAMILESLVSEPQNRSLLLAYE 420

Query: 421 NAFAEILFLDGRYSDTFARILYELTSRPNNKVAAAQGVWGM 451
           NAFAEILF DGRYSDTFARILYELTSRPNNKVA A+G+WGM
Sbjct: 421 NAFAEILFTDGRYSDTFARILYELTSRPNNKVATARGIWGM 460

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
LFR_ARATH         4.1e-192  73.10     Armadillo repeat-containing protein LFR OS=Arabidopsis thaliana GN=LFR PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0L4L4_CUCSA  3.4e-230  88.84     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G144150 PE=4 SV=1
A5AL12_VITVI      1.2e-206  80.57     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g04520 PE=4 SV=...
M5XRJ6_PRUPE      1.7e-205  80.51     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005335mg PE=4 SV=1
A0A0L9TTL1_PHAAN  2.9e-205  79.39     Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan02g003000 PE=4 SV=1
V7BLJ4_PHAVU      7.2e-204  79.18     Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_007G279400g PE=4 SV=1

Match Name        E-value   Identity  Description
AT3G22990.1       2.3e-193  73.10     ARM repeat superfamily protein

Match Name                         E-value   Identity  Description
gi|659076287|ref|XP_008438598.1|   4.7e-233  89.72     PREDICTED: uncharacterized protein LOC103483658 [Cucumis melo]
gi|778678350|ref|XP_011650953.1|   4.9e-230  88.84     PREDICTED: armadillo repeat-containing protein LFR [Cucumis sativus]
gi|1009119735|ref|XP_015876541.1|  6.9e-208  80.78     PREDICTED: armadillo repeat-containing protein LFR [Ziziphus jujuba]
gi|225432860|ref|XP_002283908.1|   1.7e-206  80.57     PREDICTED: armadillo repeat-containing protein LFR [Vitis vinifera]
gi|951015969|ref|XP_014510992.1|   1.4e-205  79.61     PREDICTED: armadillo repeat-containing protein LFR [Vigna radiata var. radiata]

The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0090544  BAF-type complex
Vocabulary: Biological Process
Term        Definition
GO:0006338  chromatin remodeling
Vocabulary: Molecular Function
Term        Definition
GO:0005488  binding
Vocabulary: INTERPRO
Term        Definition
IPR021906   BAF250/Osa
IPR016024   ARM-type_fold
IPR011989   ARM-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0048653 anther development
biological_process GO:0006338 chromatin remodeling
biological_process GO:0009560 embryo sac egg cell differentiation
biological_process GO:0048366 leaf development
biological_process GO:0006312 mitotic recombination
cellular_component GO:0090544 BAF-type complex
cellular_component GO:0070603 SWI/SNF superfamily-type complex
molecular_function GO:0005488 binding
molecular_function GO:0003674 molecular_function
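For downstream use, the GO assignment rows above can be grouped by category. The following is a minimal sketch that assumes each row is whitespace-separated as "category accession term name"; the function name `group_go_terms` is hypothetical and not part of the database.

```python
from collections import defaultdict

# GO assignment rows copied from the listing above.
GO_LINES = """\
biological_process GO:0048653 anther development
biological_process GO:0006338 chromatin remodeling
biological_process GO:0009560 embryo sac egg cell differentiation
biological_process GO:0048366 leaf development
biological_process GO:0006312 mitotic recombination
cellular_component GO:0090544 BAF-type complex
cellular_component GO:0070603 SWI/SNF superfamily-type complex
molecular_function GO:0005488 binding
molecular_function GO:0003674 molecular_function
"""

def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split only twice so multi-word term names stay intact.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)

go_by_category = group_go_terms(GO_LINES)
for category, terms in go_by_category.items():
    print(category, len(terms))
```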

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g03800.1  Cp4.1LG08g03800.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                          Source   Source Term       Source Description                  Alignment
IPR011989  Armadillo-like helical                   GENE3D   G3DSA:1.25.10.10  -                                   coord: 110..124, score: 2.0E-14; coord: 201..409, score: 2.0
IPR016024  Armadillo-type fold                      unknown  SSF48371          ARM repeat                          coord: 179..424, score: 2.73E-21; coord: 101..117, score: 2.73
IPR021906  SWI/SNF-like complex subunit BAF250/Osa  PANTHER  PTHR12656         BRG-1 ASSOCIATED FACTOR 250 BAF250  coord: 1..450, score: 6.1E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG08g03800  Cp4.1LG03g10310  Cucurbita pepo (Zucchini)  cpecpeB482