Cp4.1LG09g01090 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g01090
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function (DUF2930)
Location: Cp4.1LG09 : 618001 .. 624867 (+)
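
The location field can be split into its parts and turned into a span length. The sketch below is only illustrative: `parse_location` and `span_length` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly.

```python
import re

def parse_location(location: str):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(location: str) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    _, start, end, _ = parse_location(location)
    return end - start + 1

print(parse_location("Cp4.1LG09 : 618001 .. 624867 (+)"))
print(span_length("Cp4.1LG09 : 618001 .. 624867 (+)"))  # 6867 under the assumption above
```

Under that assumption the gene spans 6867 bp on the forward strand of Cp4.1LG09.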
The following sequences are available for this feature:

Gene sequence (with intron)

TGGGATTGAGCAGGGAAGTGTCCGGCCCAATGATCCTCGGTGGAATTTAAGAGAAGTGTTTCTAATGGAATTTGTTTTCTCGATTGCCAAACGGGGAAAGATAATGAGCTGCACAATTCTCTCTCCCGACCCGTTAATTCGGTTGTCATCCACTCACCGATTTCGCGCTAAAAATGGGTCGAAGAGTCCAGTGATTACTGCTCGTCTTGACGATTCTAAGAACTCGCCCAATCGCCAACTCAATCTCTCTGTTCTTCGCTTCACACTTGGTATCTCTTTCTCTTCCTCTATTGTAAATGGGTAATTTCTGAAGGAATTTGATAACATTTCTTTGGAATGGGTTTAGGGATTCCTGGATTGGATGAGTCTTACTTACCCAGATGGATTGGTTATGGATTTGGTTCGCTTCTGCTTTTGAATCATTTTGTTGGTTCGAATTCAGCTGCTCCCATCACCTCAGCACAGTTAGTAAGCAATCACAGTTTTCTGGGAACGCCTTTGCTTTTAGGTTCTTGTTGGTGTTTTCGTTATTTCTGCTATTTGATGTGTTTTTTTCTTTTTTTTTTTTTGGTAGAGAACTGAGGCTTTAGGCATTTCCTTGGCTGCATTTTCTATTGCACTCCCCTACTTGGGAAAGTTTCTTGAGGTAACATACGAGATATTACATTATTGTCCTGATTATACAGTTATTTTTTAATGTTCTTCTTCTACTTGCTGGGAAACTTCCGTTAATCTCTGTAAGAGTGTGGAAAAAGGAGGATTCGAGGGACTTGAAACTGCTGGCTTGTGTAGTACTTAATGATCAAGTAGGGATTGTGATGGAAAACAAGAGACTGGGACGGGGGGAATATATATTTGATGGATCTTGTTAAAAGAGAATGACCTTAACGAGAGATGAACTTTTATGTGACCGGATTTGTAAAATAAAATTGTATCCCTTAAGAACAGGACGGGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCACTCTTGCAATTGGGTCGTTGTGATTACGGGTAGGATGTCTAATTGTCCAGGCGATCGATTTCGGTTGGGTGGATGCCTAATGACAATCCTATCTAATAAGTTCAAAGATGCTTCACCTGTTCAATCCAACTCTTCCCAAAGTTGATCCAGCCTGCCTTGAATTTGAATAAGATTGTCATTCTAACCCTGATGGTTTTCTAGACATTCTACGTGTAGTGTTCATCTCGGAATCTTCTTGAAGATTGAAGCCCTCTTATTCGTCCTCTATCTTGCGTAAATTTCTTTTCTAACGTCCTGTTATTGTTAATTCAATTTGGCTCAAATGTCACCTCAGATCTCTTTATCTAGCCTTCCGTCTCCGATAATGACCTATTTGGTAACTGCCCAAGCCCACAGCTAGCAGATATTGTCCTTTTTAGGCTTTTCCTTTCGGGCTTCCTCTTGAGGTTTTTAAAACGGTGTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCTTCGTTCTATTCTCCAACAATTCTCCTCTCCAACCAACGTGGGATCTCCCATATTTGCTAGGACCATACAAAAAAAGAAGTCTCACCACCTCCTTATATAAGATGTTCTCACATTGTTGACGAAGCCATTATCACTTTTTTGTTATACATTCTAGCATCATACCATTCCATGAGCTGCTGGCTTGCTTATTAGACTAACTTCATCAGGCACCTTGTTGATGTCTAGTTGAAGCATTGGGACTTTTGCTTGCAGGGTGCAGTTCCATCTGGCGAAGCTACCCTCCCTGAAGGTGCCGAGCAAATATTTGTCTTGTCACAAAATGTATCGGATAATGTGAAGGATGACTTGGCTTGGGCAACATACATCTTGCTGCGCAATACAAACAGTATATCAGTGGTATGGTATCTCACCAATTTTTGTAGTTAGAGAACACGCTGTGTTCAAAAGATTAAATTATTTAGCTTCCCTCGACACTGTCTAAA
TTTTAATTTTTGCATCAATTATTATCTTTGGAACCAGTTAATACAGATTCAAGGGGAGTTATGCGTTCGAGGATACTGGAATAGTCCAAATGATATATCAGGAACAGATTTACTTGCTTGGTTTGAGGAGCAGCTTCAGAGCATTGGCCTATCTGCATTAAATGATGCCCTCTACTTTCCTCAGATTTCAGGTATATGATTGCATTTTTTATGGTATTTTTTGCTCTCTTGCTGCACATGGAATAATGTCTTAGAAACGACCATAATTAGTGGAGAAAACGTTGTTATCTCCTTAAAAATCAACCGCTATCAATATTTGAAGACATTTGCAATGTCATATTGGTTTCTTTTCCTTATGGTGATGAGGTGTTTCTCCTGGTTGTTTTTCTGATGAAACATGCATTGATAGTAATAGTTAAACAAATGCTATGATGTTCTATGAAACCTTTCAAAATCATGGTTTTCATAACTTTTTCAGAATCTGGACTTTGGCAAATGCTGCCTAAGGGCACTCGCTCAGTTTTGGTACAGCCAGTGGCTCAAAATCTAAACCAAAGTGGCAATGAGATGGAAAAGATTGGAGGGTTCATATTGGTGGCTTCAAGTTTAAGTTATGCATTTAGTGATAAAGATAGAGCCTGGATAAGAGCTCTTGCTACCAAGTTTGATAATGGGGACATATCGGGGGGAAGTAAGTAATAAATCTCCAATATATTTTTTGGTTATTGACGTGCCTTTCTCAGAAGTCATGAGCATGGTACTGATACTCATTTTTGCCAATAAATTAGAGAATGTTACTATTTTTTTAGTTCAACAATCTTATTGACGCCCATTTCTTAGAGCTCACTAACCTTTGCATTGATTGACGTACAAAAATTACTTTTTGATGCAACTGTTTCGAGCAAGACATGGAAAGAACAGAAACGTGACGTTATGGATTTACTTAGAAACTTAGGAAAGGGATATAATCTGTATTTCAATGCTTTTTCTTATTTTTCACTGTCTTTGGGCCAAAATCTATGAGAGGTGCTTGTAATTTTCCCCTTAATATATTTTCTATCAACCTGCCTGTAACAGTCCAAGCTCACCGCTAGCAAATACGGTCCTCTTTGGGCTTTCCCTTTCGAGCTTCCTCTCAAGGCTTTGTAAGACGCATCTACGAGGGAGAGGTGTCCACACCCTTATAAACGGTGTTCCCTTCTCCTCCCCAACCGATGTGGGATCTCACAATCCACCCCCTTCGGGGACCCAGCGTCCTTGCTGGCACTCGTTCCCTTTTCCAATTGATGTGGGACCCCCCAATCCACCCCATTCGAGGCCCAGTGTCCTTGTTGGCACACCGTCTCATGTCCACCCCCTTCGAGGCTCAGCCTCCTTGTTGACACATTGCCCAGTGTCTGACTTTGATACCATTTACAACGGCCTAAGCCCGCCGCTAACAGATATTGTACTCTTTAGGTTTTCCCTTTCAGACTTCCTCTCATGGTTTTTTAGAGCGCGTCTACTAAGGAGAGGTTTCCACATCCTTAAAAATGGTGTTTCATTCTCCTCCCCAATCGATGTGGGATCTCACCCTGCCTTTTTGCGAAAATGGTTTTGTTCTAAATTCTCTTTTATTTATACCTAATGAAGTAAATTCTTGGAATAAAATTTGGAGTATCCTGCACCTAGTTGGTACATGTTGCCATTCGCGCTTGGGCAACCAAATGAATGCTTGTTACGAATTTGTATAGAACAGAAGCTTAGCAGATCTTGGCATGATAATTTGGACATGATGCAGCCTCAACATCTATTGAGAGCCTTTGATTTCTTGGGCTGATCTGCTGAACTCAACTCTTTTTCCTCAAATCTAATTTGTTGGGGTGAAAATGTAAGAATCTATTAGAATCTGTATGTTTGCGCGAAGAAAAATGTACATGAAAGAATATATGAAGAACAAATGAGGTCATACAAGTATCACAGTGACCCTTTTTAGAGGTACCCTTCTTTAAGAACTATAA
TTTAATTTCTCCATGTACACAAGGAAAATGAAAAACAAAAGTTATCTTTCAATTCTCTCTAACTTGAGCTCCTGACTCTTCTTTTTGTTATCAGGTCGAAAGTTCACTCTGTCCAATGAACAACGGAGATGAAGTTTCAGATTGAGCTACAAGGAGAGAGCCTTCGAAGCCTTATAAAAAATTCTCCCAGCCGTATACCAGATTCTGCCAAAGTAGCACAGAAAGGCGATAGTTTGACACAATACAGGATAAAGAAGATCATTCAAGAAAATTCGAGGTGCATATCCTGACGCGTGTTTAGGTACCTTAAGACGCACCACCTTTCTAATTGCCAATAACAAGCCTGGTATTAGACATTGCACGTTTGTAACATCATGTTTGAGAGTGTAAACCTGTGGTGCAAGCCAAAACCTACAACTCTGGTATGATATTGTCCACTTTGAGCACAAGCTTTCATGGCTTTGCTTACCAATAGAGATGTATTCCTTAGTTATAAACTCATGATCATCCTCTTAATTAGTCAATGTGAGACTCCCTCCCAACCACCACTCTGGTTGTAGAATGAGTAGAAAAGATTAAATTTACGAGTCTCTCGCAACTTTCCTAATGTTAGTTAGATAAGAACAGAAACACGATCTATAACTTATCCCTCCAGAAGTAGACATTGGTGCTGGAAGGATGCTCTGGAAGAACTAGGCTGTGCACTCGAACTCGATCTTCTCCAACATCTTGTCCCTTTGCCTGGTAGGAAAAAAGGAGAAATAAGCAAGCGAGTCTTTTTTCTAAATGGGCGTCTGAGTTTGATGTCTTTACGAGGTGAAAGTAAAGAGAATAAAAGCAGGTACATTATGATATATATTTTATTTCACATCATGTCTCAATTCTTAATGACTGATACATCAAATTAAATGAGGCAAGAGACTGCTCATGAATTCAGCTTACCAGTCTCCGGTTGAAATATCTTCTCTGTTGCAGTTCTGGCCCAAGTTTAAGAGGTTGTTAGCTATTTGAGATGCATCTGAAGATGGCACATCCTGTCACCAAAAAAGTCAGTATTTAATGCCCATCTGCTACATTTACTACTCTCTCTTAATGTCCTCTGGTATTTTTCATATTTTTTTAGTACAGCTTCATATGGTATTTTATAATACACTAATGGAGATTTGAAAGACTTGAATCCATGTACTTGATTCAGATGTTAAACTAAGCTAATGTAGGTGAGAGAGGAGTGCATGTTGATTCATCTATAGTTTAATGACAAAGGTCGCAGCGTTGTACGAATGCAATGAAAATGGAAAAGTATTGCCTATTTTTCTTCTTCTTTATGATTTTCTTTTCCTTTGAAACTGAAATAGCAGATTAGTTGATGATATGATTTGATTAACCTGTGCATTCTATCTGGATTCCACAGTCTCAACATTGCAGTAATGGGAGGAAGCTGAAATTGCAACTTGGTGGAGGAGGATAGAACCAATGGACTTAGTTGGGGTGACAAGCCAACCCTGTGACCAAATCCATTAAAGCAGAGAGAGAAAGAGAGGAGAGGGTTATCTTAGGAACCACACTTTCAAAATGGTATGATATTATACACTTTGAGCTCAAATGCTCATGGCTTTGCCTTTCGAGGCCTCGTACCAATGAAAATGTATTCATCATCCCTTTAATTAGCCAATGTGGCTCCCTCTCAACAATTCTCCCTTTGAATAAAGTATACCATAGAGCCTTTCCCGAGACCTATGGAGTCCTCGAACAGCCTCCCATTAATCGAGGATAAACTCTTTCTTTGGAGCCCTCGAACAAAGTACACTATTTGTTCGACACTTGAGAGTCATATTCGACTTTACTTTCGAGACTCACAACTTCTTTATCTGACACTTAAGGATTCTATTGACATGACTTTGATACCAGATAAGGAACCACAACTCTCCACAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTTTTCTTAAAATGCGTCGTACCAACGGAGATGTAT
TTCTTACTTATAAACCCATAATCATCTCCTTAATTAGCCAACGTGGGCTCCCTCCCAACAACCATCCTCAGCAGGTTGAATTAAAATATCTCGCCCTGCACTCACCATGCTTGCTTTCTCACAGAGATAATGCAGTCACTGTGTCAACTTTTATTGCAGGAACATAAACCTCACTTCTCATACCAAATGAAGTTGCCTAAACTGAAAAAAGAAACAAGGCAAAGATGAAGATTTGGGTCTCGAATATTTGCAGAGGAGATAGAAGGGCCCATCAGAGAAAAGAAAAAAAAAAGTTTGATGTTTGCCAAATGGGGAAGAAAGATGAAGATGCTACCTGCTTCACGTTCTCATATACTTTAGAAGGTTGGGTAAAATCAACAACTCCTGTTTCCTTTGACTGCAAATTAGCAATAAAATAGTAAAGTTATAAGACTTAAAGCCAGACAAGGCAATAGATAAAGTTCCAGAGGTTCCTAACCTAATTTCATTTGTACAGATCAGCAAAAACACAGTGAGGAGAGAAATGAAACAGTGGGGTTTCATTAAAAAAGAAAAGAGATCCAGGGCCGCCATTGTTGTTGTTGAACAAAAGAATGAGAGTTATTGCAGGTGAGTGTGGGGTTACCTTGATATTACTCTGAAGAGAAGCGAATTCAAGATGTGATAGTGGGTTGCTTTTTCTCTGAGTTTTTAGACACTGTAAAGGGAATGAGCAAGCCAGGGCCGCCATTGTTGTTGGTTTCAAAGCTCTGTTTCTTCTCTAATTTCTCAAAGAAGATAAGCCCATTTTTAATTGCAAATTTGAGGTCATGCGAAAGCGACAGGATCACAATTTTCAAAAACATTAATCTTTCATTTTTAAATATC

mRNA sequence

TGGGATTGAGCAGGGAAGTGTCCGGCCCAATGATCCTCGGTGGAATTTAAGAGAAGTGTTTCTAATGGAATTTGTTTTCTCGATTGCCAAACGGGGAAAGATAATGAGCTGCACAATTCTCTCTCCCGACCCGTTAATTCGGTTGTCATCCACTCACCGATTTCGCGCTAAAAATGGGTCGAAGAGTCCAGTGATTACTGCTCGTCTTGACGATTCTAAGAACTCGCCCAATCGCCAACTCAATCTCTCTGTTCTTCGCTTCACACTTGGGATTCCTGGATTGGATGAGTCTTACTTACCCAGATGGATTGGTTATGGATTTGGTTCGCTTCTGCTTTTGAATCATTTTGTTGGTTCGAATTCAGCTGCTCCCATCACCTCAGCACAGTTAAGAACTGAGGCTTTAGGCATTTCCTTGGCTGCATTTTCTATTGCACTCCCCTACTTGGGAAAGTTTCTTGAGGGTGCAGTTCCATCTGGCGAAGCTACCCTCCCTGAAGGTGCCGAGCAAATATTTGTCTTGTCACAAAATGTATCGGATAATGTGAAGGATGACTTGGCTTGGGCAACATACATCTTGCTGCGCAATACAAACAGTATATCAGTGTTAATACAGATTCAAGGGGAGTTATGCGTTCGAGGATACTGGAATAGTCCAAATGATATATCAGGAACAGATTTACTTGCTTGGTTTGAGGAGCAGCTTCAGAGCATTGGCCTATCTGCATTAAATGATGCCCTCTACTTTCCTCAGATTTCAGAATCTGGACTTTGGCAAATGCTGCCTAAGGGCACTCGCTCAGTTTTGGTACAGCCAGTGGCTCAAAATCTAAACCAAAGTGGCAATGAGATGGAAAAGATTGGAGGGTTCATATTGGTGGCTTCAAGTTTAAGTTATGCATTTAGTGATAAAGATAGAGCCTGGATAAGAGCTCTTGCTACCAAGTTTGATAATGGGGACATATCGGGGGGAAGTAAGTAATAAATCTCCAATATATTTTTTGGTTATTGACGTGCCTTTCTCAGAAGTCATGAGCATGGTACTGATACTCATTTTTGCCAATAAATTAGAGAATGTTACTATTTTTTTAGTTCAACAATCTTATTGACGCCCATTTCTTAGAGCTCACTAACCTTTGCATTGATTGACGTACAAAAATTACTTTTTGATGCAACTGTTTCGAGCAAGACATGGAAAGAACAGAAACGTGACGTTATGGATTTACTTAGAAACTTAGGAAAGGGATATAATCTGTATTTCAATGCTTTTTCTTATTTTTCACTGTCTTTGGGCCAAAATCTATGAGAGGTGCTTGTAATTTTCCCCTTAATATATTTTCTATCAACCTGCCTGTAACAGTCCAAGCTCACCGCTAGCAAATACGGTCCTCTTTGGGCTTTCCCTTTCGAGCTTCCTCTCAAGGCTTTGTAAGACGCATCTACGAGGGAGAGGTGTCCACACCCTTATAAACGGTGTTCCCTTCTCCTCCCCAACCGATGTGGGATCTCACAATCCACCCCCTTCGGGGACCCAGCGTCCTTGCTGGCACTCGTTCCCTTTTCCAATTGATGTGGGACCCCCCAATCCACCCCATTCGAGGCCCAGTGTCCTTGTTGGCACACCGTCTCATGTCCACCCCCTTCGAGGCTCAGCCTCCTTGTTGACACATTGCCCAGTGTCTGACTTTGATACCATTTACAACGGCCTAAGCCCGCCGCTAACAGATATTGTACTCTTTAGGTTTTCCCTTTCAGACTTCCTCTCATGGTTTTTTAGAGCGCGTCTACTAAGGAGAGGTTTCCACATCCTTAAAAATGGTGTTTCATTCTCCTCCCCAATCGATGTGGGATCTCACCCTGCCTTTTTGCGAAAATGGTTTTGTTCTAAATTCTCTTTTATTTATACCTAATGAAGTAAATTCTTGGAATAAAATTTGGAGTATCCTGCACCTAGTTGGTACATGTTGCCATTCGCGCTTGGGCAACCAAATGAATGCTTG
TTACGAATTTGTATAGAACAGAAGCTTAGCAGATCTTGGCATGATAATTTGGACATGATGCAGCCTCAACATCTATTGAGAGCCTTTGATTTCTTGGGCTGATCTGCTGAACTCAACTCTTTTTCCTCAAATCTAATTTGTTGGGGTGAAAATGTAAGAATCTATTAGAATCTGTATGTTTGCGCGAAGAAAAATGTACATGAAAGAATATATGAAGAACAAATGAGGTCATACAAGTATCACAGTGACCCTTTTTAGAGGTCGAAAGTTCACTCTGTCCAATGAACAACGGAGATGAAGTTTCAGATTGAGCTACAAGGAGAGAGCCTTCGAAGCCTTATAAAAAATTCTCCCAGCCGTATACCAGATTCTGCCAAAGTAGCACAGAAAGGCGATAGTTTGACACAATACAGGATAAAGAAGATCATTCAAGAAAATTCGAGGTGCATATCCTGACGCGTGTTTAGTTCTGGCCCAAGTTTAAGAGGTTGTTAGCTATTTGAGATGCATCTGAAGATGGCACATCCTGTCACCAAAAAAGTCAGTATTTAATGCCCATCTGCTACATTTACTACTCTCTCTTAATGTCCTCTGGTATTTTTCATATTTTTTTAGTACAGCTTCATATGGTATTTTATAATACACTAATGGAGATTTGAAAGACTTGAATCCATGTACTTGATTCAGATGTTAAACTAAGCTAATGTAGGTGAGAGAGGAGTGCATGTTGATTCATCTATAGTTTAATGACAAAGGTCGCAGCGTTGTACGAATGCAATGAAAATGGAAAAGTATTGCCTATTTTTCTTCTTCTTTATGATTTTCTTTTCCTTTGAAACTGAAATAGCAGATTAGTTGATGATATGATTTGATTAACCTGTGCATTCTATCTGGATTCCACAGTCTCAACATTGCAGTAATGGGAGGAAGCTGAAATTGCAACTTGGTGGAGGAGGATAGAACCAATGGACTTAGTTGGGGTGACAAGCCAACCCTGTGACCAAATCCATTAAAGCAGAGAGAGAAAGAGAGGAGAGGGTTATCTTAGGAACCACACTTTCAAAATGGTATGATATTATACACTTTGAGCTCAAATGCTCATGGCTTTGCCTTTCGAGGCCTCGTACCAATGAAAATGTATTCATCATCCCTTTAATTAGCCAATGTGGCTCCCTCTCAACAATTCTCCCTTTGAATAAAGTATACCATAGAGCCTTTCCCGAGACCTATGGAGTCCTCGAACAGCCTCCCATTAATCGAGGATAAACTCTTTCTTTGGAGCCCTCGAACAAAGTACACTATTTGTTCGACACTTGAGAGTCATATTCGACTTTACTTTCGAGACTCACAACTTCTTTATCTGACACTTAAGGATTCTATTGACATGACTTTGATACCAGATAAGGAACCACAACTCTCCACAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTTTTCTTAAAATGCGTCGTACCAACGGAGATGTATTTCTTACTTATAAACCCATAATCATCTCCTTAATTAGCCAACGTGGGCTCCCTCCCAACAACCATCCTCAGCAGGTTGAATTAAAATATCTCGCCCTGCACTCACCATGCTTGCTTTCTCACAGAGATAATGCAGTCACTGTGTCAACTTTTATTGCAGGAACATAAACCTCACTTCTCATACCAAATGAAGTTGCCTAAACTGAAAAAAGAAACAAGGCAAAGATGAAGATTTGGGTCTCGAATATTTGCAGAGGAGATAGAAGGGCCCATCAGAGAAAAGAAAAAAAAAAGTTTGATGTTTGCCAAATGGGGAAGAAAGATGAAGATGCTACCTGCTTCACGTTCTCATATACTTTAGAAGATCAGCAAAAACACAGTGAGGAGAGAAATGAAACAGTGGGGTTTCATTAAAAAAGAAAAGAGATCCAGGGCCGCCATTGTTGTTGTTGAACAAAAGAATGAGAGTTATTGCAGGTGAGTGTGGGGTTACCTTGATATTACTCTGA
AGAGAAGCGAATTCAAGATGTGATAGTGGGTTGCTTTTTCTCTGAGTTTTTAGACACTGTAAAGGGAATGAGCAAGCCAGGGCCGCCATTGTTGTTGGTTTCAAAGCTCTGTTTCTTCTCTAATTTCTCAAAGAAGATAAGCCCATTTTTAATTGCAAATTTGAGGTCATGCGAAAGCGACAGGATCACAATTTTCAAAAACATTAATCTTTCATTTTTAAATATC

Coding sequence (CDS)

ATGGAATTTGTTTTCTCGATTGCCAAACGGGGAAAGATAATGAGCTGCACAATTCTCTCTCCCGACCCGTTAATTCGGTTGTCATCCACTCACCGATTTCGCGCTAAAAATGGGTCGAAGAGTCCAGTGATTACTGCTCGTCTTGACGATTCTAAGAACTCGCCCAATCGCCAACTCAATCTCTCTGTTCTTCGCTTCACACTTGGGATTCCTGGATTGGATGAGTCTTACTTACCCAGATGGATTGGTTATGGATTTGGTTCGCTTCTGCTTTTGAATCATTTTGTTGGTTCGAATTCAGCTGCTCCCATCACCTCAGCACAGTTAAGAACTGAGGCTTTAGGCATTTCCTTGGCTGCATTTTCTATTGCACTCCCCTACTTGGGAAAGTTTCTTGAGGGTGCAGTTCCATCTGGCGAAGCTACCCTCCCTGAAGGTGCCGAGCAAATATTTGTCTTGTCACAAAATGTATCGGATAATGTGAAGGATGACTTGGCTTGGGCAACATACATCTTGCTGCGCAATACAAACAGTATATCAGTGTTAATACAGATTCAAGGGGAGTTATGCGTTCGAGGATACTGGAATAGTCCAAATGATATATCAGGAACAGATTTACTTGCTTGGTTTGAGGAGCAGCTTCAGAGCATTGGCCTATCTGCATTAAATGATGCCCTCTACTTTCCTCAGATTTCAGAATCTGGACTTTGGCAAATGCTGCCTAAGGGCACTCGCTCAGTTTTGGTACAGCCAGTGGCTCAAAATCTAAACCAAAGTGGCAATGAGATGGAAAAGATTGGAGGGTTCATATTGGTGGCTTCAAGTTTAAGTTATGCATTTAGTGATAAAGATAGAGCCTGGATAAGAGCTCTTGCTACCAAGTTTGATAATGGGGACATATCGGGGGGAAGTAAGTAA

Protein sequence

MEFVFSIAKRGKIMSCTILSPDPLIRLSSTHRFRAKNGSKSPVITARLDDSKNSPNRQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAVPSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWNSPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVAQNLNQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKFDNGDISGGSK
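
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 45 nucleotides and the last 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAATTTGTTTTCTCGATTGCCAAACGGGGAAAGATAATGAGC"  # first 45 nt of the CDS
cds_end = "GACATATCGGGGGGAAGTAAGTAA"                         # last 24 nt of the CDS

print(translate(cds_start))  # MEFVFSIAKRGKIMS
print(translate(cds_end))    # DISGGSK
```

Both fragments agree with the start (MEFVFSIAKRGKIMS...) and end (...DISGGSK) of the listed protein sequence.
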
BLAST of Cp4.1LG09g01090 vs. Swiss-Prot
Match: CCB2_ARATH (Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB2, chloroplastic OS=Arabidopsis thaliana GN=CCB2 PE=1 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 5.2e-82
Identity = 152/263 (57.79%), Positives = 201/263 (76.43%), Query Frame = 1

Query: 34  RAKNGSKSPVITARLDDSKNSPNRQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLN 93
           RA+  ++    T        + ++QLNLSVLRFT GIPG DESYLPRWIGYGFGSLLLLN
Sbjct: 19  RAQRSTRIFARTENDSPQSKTSDQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLLLLN 78

Query: 94  HFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAVPSGEATLPEGAEQIFVL 153
           HF   +++API+ +Q+R+EALG+SLAAFSIALPY+GKFL+G+V   + +LPE  EQ+FV+
Sbjct: 79  HF---SASAPISESQMRSEALGLSLAAFSIALPYIGKFLKGSVVE-QRSLPEEGEQVFVI 138

Query: 154 SQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWNSPNDISGTDLLAWFEEQ 213
           S N+ D++K+DLAWATY+LLRNT++I+VLI +QGELCVRGYWN P+ +S   L  WF+++
Sbjct: 139 SSNIGDSLKEDLAWATYVLLRNTSTIAVLISVQGELCVRGYWNCPDQMSKAQLHDWFKKK 198

Query: 214 LQSIGLSALNDALYFPQISESGL-WQMLPKGTRSVLVQPVAQNLNQSGNEMEKIGGFILV 273
           +  IGL+ + + LYFPQ + S L   +LP GTRS+ VQP+ QN     NE +K+ GF+LV
Sbjct: 199 VDEIGLADVKETLYFPQYAGSALSLDILPDGTRSLFVQPLVQNT----NEPQKVNGFLLV 258

Query: 274 ASSLSYAFSDKDRAWIRALATKF 296
           AS+  YA+SDKDRAWI A+A KF
Sbjct: 259 ASTAGYAYSDKDRAWIGAMAEKF 273
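
The percentages in each HSP header are the match count divided by the alignment length. The sketch below recomputes them from a header line; `recompute_percentages` is a hypothetical helper, and the line format it parses is taken from the headers on this page rather than from any formal BLAST output specification.

```python
import re

# Matches fields such as "Identity = 152/263 (57.79%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def recompute_percentages(header_line: str):
    """Return {label: (reported_percent, recomputed_percent)} for each a/b field."""
    result = {}
    for label, matches, length, reported in FIELD.findall(header_line):
        recomputed = round(100.0 * int(matches) / int(length), 2)
        result[label] = (float(reported), recomputed)
    return result

line = "Identity = 152/263 (57.79%), Positives = 201/263 (76.43%), Query Frame = 1"
for label, (reported, recomputed) in recompute_percentages(line).items():
    print(label, reported, recomputed)
```

For the Swiss-Prot hit above, 152/263 and 201/263 reproduce the reported 57.79% identity and 76.43% positives.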

BLAST of Cp4.1LG09g01090 vs. TrEMBL
Match: A0A0A0LBA9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G180340 PE=4 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 1.0e-129
Identity = 236/284 (83.10%), Positives = 257/284 (90.49%), Query Frame = 1

Query: 14  MSCTILSPDPLIRLSSTHRFRAKNGSKSPVITARLDDSKNSPNRQLNLSVLRFTLGIPGL 73
           M  +I SP PL +L+S   FRAK+ +K P I+ARLDDSKNS N+QLNLSVLRFTLGIPGL
Sbjct: 1   MISSIPSPSPLNQLTSALSFRAKSKTKGPAISARLDDSKNSANQQLNLSVLRFTLGIPGL 60

Query: 74  DESYLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLE 133
           DESYLPRWIGYGFGSLLLLNHFVGSNSAA  T AQLRTEALGISLAAFSIALPYLGKFL+
Sbjct: 61  DESYLPRWIGYGFGSLLLLNHFVGSNSAALTTPAQLRTEALGISLAAFSIALPYLGKFLK 120

Query: 134 GAVPSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRG 193
           GA+PSGEA LPEG EQIF+LSQ +SDN+K+D+AWATYILLRNTNSISVLIQ QG LCVRG
Sbjct: 121 GALPSGEAILPEGTEQIFLLSQILSDNLKEDIAWATYILLRNTNSISVLIQTQGALCVRG 180

Query: 194 YWNSPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVA 253
           YWNSPNDIS  DLLAWFEEQLQSIGLSAL DA+YFPQISESGLWQMLPKGTRSVLVQPV 
Sbjct: 181 YWNSPNDISSADLLAWFEEQLQSIGLSALKDAVYFPQISESGLWQMLPKGTRSVLVQPVV 240

Query: 254 QNLNQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKFDN 298
           QNL QSGNE++ +GGFIL+ASSLSYAFSDKDRAWIRA+A KFD+
Sbjct: 241 QNLKQSGNEVQNMGGFILLASSLSYAFSDKDRAWIRAVANKFDD 284

BLAST of Cp4.1LG09g01090 vs. TrEMBL
Match: W9RWP2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015739 PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 5.9e-101
Identity = 194/294 (65.99%), Positives = 230/294 (78.23%), Query Frame = 1

Query: 14  MSCTILSPDPLIRLSSTHRFRAKNGSKS-PVITARLDDSKNS--PNRQLNLSVLRFTLGI 73
           MS +I+S  PL +L     F  ++ ++   VI++RLDD+  S  PN QLNLSVLRFTLGI
Sbjct: 1   MSNSIVSLSPLSQLKIPTGFGTRSSTRRFSVISSRLDDNSRSGQPNPQLNLSVLRFTLGI 60

Query: 74  PGLDESYLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGK 133
           PGLDESYLPRWIGYGFGSLL+LNHFVGSNS   ITSAQLRTEALG+SLAAFSI LPYLGK
Sbjct: 61  PGLDESYLPRWIGYGFGSLLVLNHFVGSNSVTDITSAQLRTEALGLSLAAFSIVLPYLGK 120

Query: 134 FL---------EGAVPSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISV 193
           FL         +GA P  + T+PEG+EQIF+LS+NVS+  K+DLAWATYILLRNTN+++V
Sbjct: 121 FLKLYEDEKYLQGATPMDQTTIPEGSEQIFMLSENVSNTEKEDLAWATYILLRNTNTMAV 180

Query: 194 LIQIQGELCVRGYWNSPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLP 253
           LI IQGELCVRGYWN+P D+S TDLL WF  Q++  G+S + D LYFPQIS+SGLW +LP
Sbjct: 181 LISIQGELCVRGYWNTPTDVSKTDLLDWFGRQIEQFGISDVKDTLYFPQISDSGLWDILP 240

Query: 254 KGTRSVLVQPVAQNLNQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
           KGTRSVLVQPV Q  + S   ME   GFILVAS++SYA++ KDRAWI ALA KF
Sbjct: 241 KGTRSVLVQPVPQVPDSSDKTMETNQGFILVASTISYAYNVKDRAWIGALAKKF 294

BLAST of Cp4.1LG09g01090 vs. TrEMBL
Match: M5X1L8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009484mg PE=4 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 1.0e-100
Identity = 184/279 (65.95%), Positives = 230/279 (82.44%), Query Frame = 1

Query: 19  LSPDPLIRLSSTHRFRAKN-GSKSPVITARLDDSKNSPNR-QLNLSVLRFTLGIPGLDES 78
           LSP+ LI+L    +FRA+N  +    ++ARLD+SK+S    QLNLSVLRFTLGIPGLDES
Sbjct: 8   LSPNSLIQLKIPPKFRARNCRTNFSAVSARLDNSKSSSAEPQLNLSVLRFTLGIPGLDES 67

Query: 79  YLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAV 138
           YLPRWIGYGFGSLL+LNHF GS S A  T AQLRTEALG+SLAAFSIALPYLG+FL+GA 
Sbjct: 68  YLPRWIGYGFGSLLILNHFAGSISPASTTPAQLRTEALGLSLAAFSIALPYLGRFLKGAT 127

Query: 139 PSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWN 198
           P  + ++P G EQIFV+SQNVS+  K+DLAWATYILLRNTN+I+V+I I+ ELCVRGYWN
Sbjct: 128 PMDQTSIPRGCEQIFVISQNVSNTQKEDLAWATYILLRNTNTIAVIISIRNELCVRGYWN 187

Query: 199 SPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVAQNL 258
            P+D+S T++LAWFE+Q++SIGLS + + LY  QI +SGLW+MLP+GTRS+LVQP+ Q L
Sbjct: 188 IPDDVSKTNVLAWFEKQIESIGLSDVKETLYLSQIEDSGLWEMLPQGTRSLLVQPIVQVL 247

Query: 259 NQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
             S NE++K  GF+++ASS+ YA+SDKD+AWI A+A KF
Sbjct: 248 PSSDNEIQKSEGFVMLASSMRYAYSDKDKAWIGAIANKF 286

BLAST of Cp4.1LG09g01090 vs. TrEMBL
Match: A0A067JER9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26153 PE=4 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 1.9e-99
Identity = 186/279 (66.67%), Positives = 225/279 (80.65%), Query Frame = 1

Query: 19  LSPDPLIRLSSTHRFRAKNGSKSPVITARLDDSKNSPNRQ--LNLSVLRFTLGIPGLDES 78
           LS  PLI+L+   +FRAK   KS VI +R+D+S+   N+Q  LNLS+LRFT GIPGLDES
Sbjct: 4   LSIHPLIQLNIRPKFRAKVTRKSLVIASRIDNSQTRENQQQELNLSILRFTFGIPGLDES 63

Query: 79  YLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAV 138
           YLPRWIGYGFGSLLLLNHF+GSNSAA  +  QLRTEALGISLAAFSIALP+ G+FL+G  
Sbjct: 64  YLPRWIGYGFGSLLLLNHFLGSNSAA--SPPQLRTEALGISLAAFSIALPFFGRFLKGVR 123

Query: 139 PSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWN 198
           P  +A LP GAEQIF++S+N+ D  K+DLAWATY+LLRNTN+I+VLI IQG LCVRGYW 
Sbjct: 124 PMDQAALPGGAEQIFLMSENIFDTQKEDLAWATYVLLRNTNTIAVLISIQGGLCVRGYWK 183

Query: 199 SPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVAQNL 258
           +P+++S   LL WF +Q+ SIGL  L D LYFPQ +ESGLW+MLPKGTRS+LV+PV Q  
Sbjct: 184 TPDNLSKAQLLDWFLKQIDSIGLFDLRDTLYFPQTAESGLWEMLPKGTRSLLVEPVHQAR 243

Query: 259 NQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
            +S NEMEKI GF+L+ASS+ YA+ DKDRAWIRA+  KF
Sbjct: 244 AKSANEMEKIEGFVLLASSMEYAYGDKDRAWIRAVTNKF 280

BLAST of Cp4.1LG09g01090 vs. TrEMBL
Match: B9ID30_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s13640g PE=4 SV=2)

HSP 1 Score: 359.8 bits (922), Expect = 3.4e-96
Identity = 180/278 (64.75%), Positives = 219/278 (78.78%), Query Frame = 1

Query: 19  LSPDPLIRLSSTHRFRAKNGSKSPVITARLDDSKNS-PNRQLNLSVLRFTLGIPGLDESY 78
           LS  PLI+L + H+FRAK   KS  I A  D+ ++    +QLNLSVLRFT GIPGLDESY
Sbjct: 4   LSIHPLIQLKTHHQFRAKKTRKSIAIHASSDNPQSQRQQQQLNLSVLRFTFGIPGLDESY 63

Query: 79  LPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAVP 138
           LPRWIGYGFGSLL+LNHF+GSN     T AQLRTE LG+SLAAFS ALPY G+FL+GA P
Sbjct: 64  LPRWIGYGFGSLLILNHFLGSNPDT--TQAQLRTEVLGLSLAAFSAALPYFGRFLKGATP 123

Query: 139 SGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWNS 198
             + TLP+ AEQIF +SQN+SD  K+DLAWATYILLRNTN+I+VLI IQGELCVRGYW +
Sbjct: 124 VDQGTLPQDAEQIFAMSQNISDAQKEDLAWATYILLRNTNTIAVLISIQGELCVRGYWKT 183

Query: 199 PNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVAQNLN 258
            + +S  ++L WF+EQ+++IGLS + D LYFPQ +ES +W+MLP+GTRS+LV+PV Q   
Sbjct: 184 SDKMSKDEVLDWFKEQIENIGLSDVKDTLYFPQTTESEIWEMLPEGTRSLLVEPVLQATV 243

Query: 259 QSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
           QSGN+ E   GFIL+ASS+ YA+SDKDRAWIRA   KF
Sbjct: 244 QSGNKTENNEGFILLASSIGYAYSDKDRAWIRATGNKF 279

BLAST of Cp4.1LG09g01090 vs. TAIR10
Match: AT5G52110.1 (AT5G52110.1 Protein of unknown function (DUF2930))

HSP 1 Score: 305.8 bits (782), Expect = 2.9e-83
Identity = 152/263 (57.79%), Positives = 201/263 (76.43%), Query Frame = 1

Query: 34  RAKNGSKSPVITARLDDSKNSPNRQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLN 93
           RA+  ++    T        + ++QLNLSVLRFT GIPG DESYLPRWIGYGFGSLLLLN
Sbjct: 19  RAQRSTRIFARTENDSPQSKTSDQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLLLLN 78

Query: 94  HFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAVPSGEATLPEGAEQIFVL 153
           HF   +++API+ +Q+R+EALG+SLAAFSIALPY+GKFL+G+V   + +LPE  EQ+FV+
Sbjct: 79  HF---SASAPISESQMRSEALGLSLAAFSIALPYIGKFLKGSVVE-QRSLPEEGEQVFVI 138

Query: 154 SQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWNSPNDISGTDLLAWFEEQ 213
           S N+ D++K+DLAWATY+LLRNT++I+VLI +QGELCVRGYWN P+ +S   L  WF+++
Sbjct: 139 SSNIGDSLKEDLAWATYVLLRNTSTIAVLISVQGELCVRGYWNCPDQMSKAQLHDWFKKK 198

Query: 214 LQSIGLSALNDALYFPQISESGL-WQMLPKGTRSVLVQPVAQNLNQSGNEMEKIGGFILV 273
           +  IGL+ + + LYFPQ + S L   +LP GTRS+ VQP+ QN     NE +K+ GF+LV
Sbjct: 199 VDEIGLADVKETLYFPQYAGSALSLDILPDGTRSLFVQPLVQNT----NEPQKVNGFLLV 258

Query: 274 ASSLSYAFSDKDRAWIRALATKF 296
           AS+  YA+SDKDRAWI A+A KF
Sbjct: 259 ASTAGYAYSDKDRAWIGAMAEKF 273

BLAST of Cp4.1LG09g01090 vs. NCBI nr
Match: gi|659077375|ref|XP_008439172.1| (PREDICTED: uncharacterized protein LOC103484048 isoform X1 [Cucumis melo])

HSP 1 Score: 480.7 bits (1236), Expect = 1.9e-132
Identity = 243/284 (85.56%), Positives = 259/284 (91.20%), Query Frame = 1

Query: 14  MSCTILSPDPLIRLSSTHRFRAKNGSKSPVITARLDDSKNSPNRQLNLSVLRFTLGIPGL 73
           M  +I SP PL +L+S   FRAK+  K+P I+ARLDDSKNS N+QLNLSVLRFTLGIPGL
Sbjct: 1   MISSIPSPSPLNQLTSALPFRAKSKMKAPAISARLDDSKNSANQQLNLSVLRFTLGIPGL 60

Query: 74  DESYLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLE 133
           DESYLPRWIGYGFGSLLLLNHFVGSNSAA  T AQLRTEALGISLAAFSIALPYLGKFL+
Sbjct: 61  DESYLPRWIGYGFGSLLLLNHFVGSNSAALTTPAQLRTEALGISLAAFSIALPYLGKFLK 120

Query: 134 GAVPSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRG 193
           GAVPSGEATLPEG EQIF+LSQ VSDN+K+D+AWATYILLRNTNSISVLIQ QG LCVRG
Sbjct: 121 GAVPSGEATLPEGTEQIFLLSQIVSDNLKEDIAWATYILLRNTNSISVLIQTQGALCVRG 180

Query: 194 YWNSPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVA 253
           YWNSPNDIS  DLLAWFEEQLQSIGLSAL DA+YFPQISESGLWQMLPKGTRSVLVQPV 
Sbjct: 181 YWNSPNDISSADLLAWFEEQLQSIGLSALKDAVYFPQISESGLWQMLPKGTRSVLVQPVV 240

Query: 254 QNLNQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKFDN 298
           QNL QSGNEMEK+GGFIL+ASSLSYAFSDKDRAWIRALA KFD+
Sbjct: 241 QNLKQSGNEMEKMGGFILLASSLSYAFSDKDRAWIRALANKFDD 284

BLAST of Cp4.1LG09g01090 vs. NCBI nr
Match: gi|449446061|ref|XP_004140790.1| (PREDICTED: uncharacterized protein LOC101219803 isoform X1 [Cucumis sativus])

HSP 1 Score: 471.1 bits (1211), Expect = 1.5e-129
Identity = 236/284 (83.10%), Positives = 257/284 (90.49%), Query Frame = 1

Query: 14  MSCTILSPDPLIRLSSTHRFRAKNGSKSPVITARLDDSKNSPNRQLNLSVLRFTLGIPGL 73
           M  +I SP PL +L+S   FRAK+ +K P I+ARLDDSKNS N+QLNLSVLRFTLGIPGL
Sbjct: 1   MISSIPSPSPLNQLTSALSFRAKSKTKGPAISARLDDSKNSANQQLNLSVLRFTLGIPGL 60

Query: 74  DESYLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLE 133
           DESYLPRWIGYGFGSLLLLNHFVGSNSAA  T AQLRTEALGISLAAFSIALPYLGKFL+
Sbjct: 61  DESYLPRWIGYGFGSLLLLNHFVGSNSAALTTPAQLRTEALGISLAAFSIALPYLGKFLK 120

Query: 134 GAVPSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRG 193
           GA+PSGEA LPEG EQIF+LSQ +SDN+K+D+AWATYILLRNTNSISVLIQ QG LCVRG
Sbjct: 121 GALPSGEAILPEGTEQIFLLSQILSDNLKEDIAWATYILLRNTNSISVLIQTQGALCVRG 180

Query: 194 YWNSPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVA 253
           YWNSPNDIS  DLLAWFEEQLQSIGLSAL DA+YFPQISESGLWQMLPKGTRSVLVQPV 
Sbjct: 181 YWNSPNDISSADLLAWFEEQLQSIGLSALKDAVYFPQISESGLWQMLPKGTRSVLVQPVV 240

Query: 254 QNLNQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKFDN 298
           QNL QSGNE++ +GGFIL+ASSLSYAFSDKDRAWIRA+A KFD+
Sbjct: 241 QNLKQSGNEVQNMGGFILLASSLSYAFSDKDRAWIRAVANKFDD 284

BLAST of Cp4.1LG09g01090 vs. NCBI nr
Match: gi|703141003|ref|XP_010107400.1| (hypothetical protein L484_015739 [Morus notabilis])

HSP 1 Score: 375.6 bits (963), Expect = 8.5e-101
Identity = 194/294 (65.99%), Positives = 230/294 (78.23%), Query Frame = 1

Query: 14  MSCTILSPDPLIRLSSTHRFRAKNGSKS-PVITARLDDSKNS--PNRQLNLSVLRFTLGI 73
           MS +I+S  PL +L     F  ++ ++   VI++RLDD+  S  PN QLNLSVLRFTLGI
Sbjct: 1   MSNSIVSLSPLSQLKIPTGFGTRSSTRRFSVISSRLDDNSRSGQPNPQLNLSVLRFTLGI 60

Query: 74  PGLDESYLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGK 133
           PGLDESYLPRWIGYGFGSLL+LNHFVGSNS   ITSAQLRTEALG+SLAAFSI LPYLGK
Sbjct: 61  PGLDESYLPRWIGYGFGSLLVLNHFVGSNSVTDITSAQLRTEALGLSLAAFSIVLPYLGK 120

Query: 134 FL---------EGAVPSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISV 193
           FL         +GA P  + T+PEG+EQIF+LS+NVS+  K+DLAWATYILLRNTN+++V
Sbjct: 121 FLKLYEDEKYLQGATPMDQTTIPEGSEQIFMLSENVSNTEKEDLAWATYILLRNTNTMAV 180

Query: 194 LIQIQGELCVRGYWNSPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLP 253
           LI IQGELCVRGYWN+P D+S TDLL WF  Q++  G+S + D LYFPQIS+SGLW +LP
Sbjct: 181 LISIQGELCVRGYWNTPTDVSKTDLLDWFGRQIEQFGISDVKDTLYFPQISDSGLWDILP 240

Query: 254 KGTRSVLVQPVAQNLNQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
           KGTRSVLVQPV Q  + S   ME   GFILVAS++SYA++ KDRAWI ALA KF
Sbjct: 241 KGTRSVLVQPVPQVPDSSDKTMETNQGFILVASTISYAYNVKDRAWIGALAKKF 294

BLAST of Cp4.1LG09g01090 vs. NCBI nr
Match: gi|595864362|ref|XP_007211756.1| (hypothetical protein PRUPE_ppa009484mg [Prunus persica])

HSP 1 Score: 374.8 bits (961), Expect = 1.4e-100
Identity = 184/279 (65.95%), Positives = 230/279 (82.44%), Query Frame = 1

Query: 19  LSPDPLIRLSSTHRFRAKN-GSKSPVITARLDDSKNSPNR-QLNLSVLRFTLGIPGLDES 78
           LSP+ LI+L    +FRA+N  +    ++ARLD+SK+S    QLNLSVLRFTLGIPGLDES
Sbjct: 8   LSPNSLIQLKIPPKFRARNCRTNFSAVSARLDNSKSSSAEPQLNLSVLRFTLGIPGLDES 67

Query: 79  YLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAV 138
           YLPRWIGYGFGSLL+LNHF GS S A  T AQLRTEALG+SLAAFSIALPYLG+FL+GA 
Sbjct: 68  YLPRWIGYGFGSLLILNHFAGSISPASTTPAQLRTEALGLSLAAFSIALPYLGRFLKGAT 127

Query: 139 PSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWN 198
           P  + ++P G EQIFV+SQNVS+  K+DLAWATYILLRNTN+I+V+I I+ ELCVRGYWN
Sbjct: 128 PMDQTSIPRGCEQIFVISQNVSNTQKEDLAWATYILLRNTNTIAVIISIRNELCVRGYWN 187

Query: 199 SPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVAQNL 258
            P+D+S T++LAWFE+Q++SIGLS + + LY  QI +SGLW+MLP+GTRS+LVQP+ Q L
Sbjct: 188 IPDDVSKTNVLAWFEKQIESIGLSDVKETLYLSQIEDSGLWEMLPQGTRSLLVQPIVQVL 247

Query: 259 NQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
             S NE++K  GF+++ASS+ YA+SDKD+AWI A+A KF
Sbjct: 248 PSSDNEIQKSEGFVMLASSMRYAYSDKDKAWIGAIANKF 286

BLAST of Cp4.1LG09g01090 vs. NCBI nr
Match: gi|645234628|ref|XP_008223895.1| (PREDICTED: uncharacterized protein LOC103323666 [Prunus mume])

HSP 1 Score: 373.6 bits (958), Expect = 3.2e-100
Identity = 184/279 (65.95%), Positives = 228/279 (81.72%), Query Frame = 1

Query: 19  LSPDPLIRLSSTHRFRAKN-GSKSPVITARLDDSK-NSPNRQLNLSVLRFTLGIPGLDES 78
           LSP+ LI+L    +FRA+N  +    ++ARLD+SK NS   QLNLSVLRFTLGIPGLDES
Sbjct: 8   LSPNSLIQLKIPPKFRARNCRTNFSAVSARLDNSKSNSAEPQLNLSVLRFTLGIPGLDES 67

Query: 79  YLPRWIGYGFGSLLLLNHFVGSNSAAPITSAQLRTEALGISLAAFSIALPYLGKFLEGAV 138
           YLPRWIGYGFGSLL+LNHF GS S A  T AQLRTEALG+SLAAFSIALPYLG+FL+GA 
Sbjct: 68  YLPRWIGYGFGSLLILNHFAGSISPASTTPAQLRTEALGLSLAAFSIALPYLGRFLKGAT 127

Query: 139 PSGEATLPEGAEQIFVLSQNVSDNVKDDLAWATYILLRNTNSISVLIQIQGELCVRGYWN 198
           P  + ++P G EQ+FV+SQNVS+  K+DLAWATYILLRNTN+I+V+I I+ ELCVRGYWN
Sbjct: 128 PMDQTSIPRGCEQMFVISQNVSNTQKEDLAWATYILLRNTNTIAVIISIRNELCVRGYWN 187

Query: 199 SPNDISGTDLLAWFEEQLQSIGLSALNDALYFPQISESGLWQMLPKGTRSVLVQPVAQNL 258
            P+D+S T++L WFE+Q++SIGLS + + LY  QI +SGLW+MLP+GTRS+LVQP+ Q L
Sbjct: 188 IPDDVSKTNVLGWFEKQIKSIGLSDVKETLYLSQIEDSGLWEMLPQGTRSLLVQPIVQVL 247

Query: 259 NQSGNEMEKIGGFILVASSLSYAFSDKDRAWIRALATKF 296
             S NE++K  GF+L+ASS+ YA+ DKD+AWI ALA KF
Sbjct: 248 PSSDNEIQKSEGFVLLASSMRYAYGDKDKAWIGALANKF 286

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value   Identity (%)   Description
CCB2_ARATH   5.2e-82   57.79          Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB2, chloroplastic OS=Arabidop...

TrEMBL
Match Name         E-value    Identity (%)   Description
A0A0A0LBA9_CUCSA   1.0e-129   83.10          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G180340 PE=4 SV=1
W9RWP2_9ROSA       5.9e-101   65.99          Uncharacterized protein OS=Morus notabilis GN=L484_015739 PE=4 SV=1
M5X1L8_PRUPE       1.0e-100   65.95          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009484mg PE=4 SV=1
A0A067JER9_JATCU   1.9e-99    66.67          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26153 PE=4 SV=1
B9ID30_POPTR       3.4e-96    64.75          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s13640g PE=4 SV=2

TAIR10
Match Name    E-value   Identity (%)   Description
AT5G52110.1   2.9e-83   57.79          Protein of unknown function (DUF2930)

NCBI nr
Match Name                         E-value    Identity (%)   Description
gi|659077375|ref|XP_008439172.1|   1.9e-132   85.56          PREDICTED: uncharacterized protein LOC103484048 isoform X1 [Cucumis melo]
gi|449446061|ref|XP_004140790.1|   1.5e-129   83.10          PREDICTED: uncharacterized protein LOC101219803 isoform X1 [Cucumis sativus]
gi|703141003|ref|XP_010107400.1|   8.5e-101   65.99          hypothetical protein L484_015739 [Morus notabilis]
gi|595864362|ref|XP_007211756.1|   1.4e-100   65.95          hypothetical protein PRUPE_ppa009484mg [Prunus persica]
gi|645234628|ref|XP_008223895.1|   3.2e-100   65.95          PREDICTED: uncharacterized protein LOC103323666 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR021325   CCB2/CCB4
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0010190       cytochrome b6f complex assembly
biological_process   GO:0010207       photosystem II assembly
biological_process   GO:0008150       biological_process
cellular_component   GO:0009507       chloroplast
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG09g01090.1   Cp4.1LG09g01090.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                                       Source    Source Term     Source Description    Alignment
IPR021325   Cofactor assembly of complex C subunit B, CCB2/CCB4   PFAM      PF11152         DUF2930               coord: 78..295; score: 3.0
None        No IPR available                                      PANTHER   PTHR36403       FAMILY NOT NAMED      coord: 44..299; score: 1.1E
None        No IPR available                                      PANTHER   PTHR36403:SF1   SUBFAMILY NOT NAMED   coord: 44..299; score: 1.1E

The following gene(s) are paralogous to this gene:

None