Moc04g03770 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc04g03770
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Cellulose synthase
Location: chr4: 2364912 .. 2369989 (-)
RNA-Seq Expression: Moc04g03770
Synteny: Moc04g03770
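
The Location field can be turned into a genomic span with a few lines of code. The sketch below is illustrative only: `parse_location` and `span_length` are hypothetical helper names, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr4: 2364912 .. 2369989 (-)'.

    Returns (seqid, start, end, strand). Hypothetical helper, not part of
    any database API.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr4: 2364912 .. 2369989 (-)")
print(seqid, strand, span_length(start, end))  # chr4 - 5078
```

The "(-)" strand means the gene is read from the reverse strand, so the sequences listed for this feature correspond to the reverse complement of the reference chromosome over that span.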
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATCTTTAACCAATCAACCATCAAAGAAGGCCATTCGCAGCCCCGGCGGGTCTAATAATTCTACGAGCAATCGGGGTTCGGTTGGACAAACGGTTAAGTTTGCGCGAAGAACCTCGAGTGGCCGTTATGTTAGTTTGTCGAGAGAAGATCTCGACATGTCAGGAGAGGTTTCAGGAGATTATATTAACTACACTGTGCATATTCCTCCGACACCAGATAATCAACCGATGGACTCGTCAATAGCCACCAAAGCGGAGGAGCAATATGTCTCAAACTCGCTGTTTACTGGCGGATTCAATAGCGTGACGCGAGCGCATCTCATGGATAAGGTGATAGATTCAGAAGTGAGCCATCCCCAGATGGCTGGAGCTAAGGGCTCTTCGTGTGCCATGCCGGCTTGTGATGGCAAGGTCATGAAGGATGAGCGTGGCATCGATATCACCCCTTGTGAATGCAGGTTCAAATTGTCATTTTCAAATTAACTTACATTTTTAAAAGTTACTTAATTCTGAAAAAATTAGGTTAACGCTGATTTTCGTTCCAGTTAAAATGGAAATTTACTTTAGAAATAAAAACTTTAACCAGTTGAAGAAATGGTGATTTGAAATTGAGAAATGATTTTAAATTCTTTATCTTGTTTGGAGCGAGAATGATTTGAAACGAATTAAATTAAATTTCAAATTTGTTTGGATGTTCGCTGGAATTGAAAATAAAATAAAAATGTTTCTGGACTTTTAGTTTAAATTTATTAATTATAATTTAAAATATGACGAGGTTAAATAGGTAATTTATTTCGAAATTCTATGAAATCTTGCAAAATCCGATCATTTTGATCCGGAAATTTTCTTAAGCAAAAATAGTAAGATTTCGAATCATTCAAAATCATGTGGTACCAAACAGTGGAATTCATTTGAAATCCATCACGATTTTGAATTCCTTCAAAATCCAAGGTACTAAACTGAGCATAAAACAAGATCTAGACATAGAACCTCTCGTCCATTTATAAAGGCTGTAATAGACATATATAATAGTTAGTCTAATCTCTGAACATCTAGAAATAAGTTACAATAGGAGTTAAATCTAGTTGCTTGGTACGCAATTGAAGCACCCGGTGCACCTCAAAATTGTGTTTCATTTCATTTTAGGCTGGGAAGAAGTAGAATAGCTGGATCTTTTTATCTTCAAGTTTAATTATATTTTTAAACTAATCACTATTATTTTTATCACCATAAAACCCAATTGCTTCATCCAAACAAGTTATGATTTATTTTAAAAGAAACTAGCTTCGACGAACTTGGAATGTCTTTTACGTACTACTTATGATAGTGTATGCTTGTACTGAGGTGCAACATTATGTGTATCCAGGTTTCGAATATGTCGGGATTGTTATCTTGACGCCCAAAAGGACACAGGGCTATGTCCAGGATGTAAAGAGTTATACAAATCAGCCGATTATGATGATGATCCTAATGAATATTCAGGAGGAGCATTACAATTACAAGGGCCAGACGGGTCTAAAGGCGGGCAAAATATGTCAATGATGAAACTAAACCAAAGCGGAGAATTTGATCACAATAAGTGGTTGTTTGAGACAAAAGGAACTTATGGGGTTGGTAATGCATATTGGACACCAGAAGATGGCTATGCAGATGGTGGAGATGACAAGTTTCGTGATGGGGTGATGGAGTCAATGGATGCTTCTTGGAAACCGTTGAGTCGAACGTTTCCAATCCCAGCAAGCATCATCAGTCCTTATAGGTACTTTAGAACATGAAATGACAATATAGGAGTTCGAACATAGAGCCTCCTGATTTGATACAATCGTAGACAGGATTAAATCAACAATTATGTTATTGTTTTGCAGGTTGTTGATTTTGATTAGACTTGTGGTGTTGGGTTTCTTTTTGCATTGGAGAGTGAAACATCCAAATGAAGATGCAATATGGTTGTGGTTGATGTCAATTGTTTGTGAAATATGGTTCGCTTTTTCTTGGAT
TCTTGATCAGGTTCCGAAGTTGTGTCCCGTCAATCGTGCAACTGATCTTCAAGTGCTTCACGATAAGTTCGATGCACCGTGCCCATCGAATCCAACAGGTCGTTCCGACCTGCCTGGAGTTGACTTGTTTGTGTCGACTGCTGATCCTGAAAAGGAACCTGTTCTTGTCACTGCCAATACGATCTTATCTATTCTTGCCGTTGATTATCCGGTGGAAAAACTTGCTTGCTACATCTCTGATGATGGAGGTGCCCTGTTGACATTCGAGGCCATGGCAGAGGCAGCGAGCTTTGCAGATTTGTGGGTACCGTTTTGTCGTAAGCACGATATTGAGCCGAGAAATCCAGAGAGTTATTTTAGTTTAAAAGTGGATCCAACTAAGAACAAGAGTAGGTCAGATTTTGTGAAGGATAGAAGGAAGATCAAGCGAGAGTACGATGAGTTTAAGGTACGAACGAATGGGCTTCCTGACTCGATTAGGAGGCGATCAGATGCTTTCAATGCGAGAGAGGAGATGAAGATGTGGAAGCACATGAGAGAAACAGGGGCCGATGCCATGGAGCCGATTAAGGTTCAAAAAGCGACATGGATGGCTGATGGCACCCATTGGCCTGGAACTTGGGGTGTCTCTGCTAGTGACCATTCTAAGGGTGATCATGCTGGAATTCTTCAGGTATAAACAAATCTCTATTTGATTTAAGTATGACCAAAATTACTTTTGTCGTTATCAATATCTTTTTCAATAGGTGATGTTGAAGCCTCCGAGCCCCGATCCATTAATAGGGAGCACGGACGAGAAGATCATAGACTTCAGTGATGTAGACACTCGTCTTCCGATGTTCGTGTATGTGTCTCGAGAGAAACGACCCGGATATGATCATAACAAGAAGGCAGGTGCCATGAATGCACTTGTACGAGCTTCAGCCGTTTTGTCAAATGGTCCTTTCATTCTCAACCTTGATTGTGACCATTACATCTACAATTGCAAAGCCATTCGCGAAGGAATGTGCTTCATGATGGACCGTGGGGGAGAAGATATATGTTACATTCAATTCCCTCAAAGATTTGAAGGTATTGATCCTTCTGATCGATATGCCAATCATAATACTGTCTTCTTTGATGGCAACATGCGAGCACTTGATGGCATACAAGGTCCGGTTTACGTTGGGACTGGTTGTATGTTTAGGCGGTTCGCGCTCTATGGTTTCGACCCGCCGCAACCCGATAAGATTAAGCATAAAAATAATGATCAAGCAGAAACACAACCTTTGCAAGCCACTGATTTTGATCCTGATCTTGATGTGAATCTACTTCCCAAACGTTTTGGCAATTCTAACATGTTGGCTGAGTCAATACTGGTTGCAGAATTCCAAGGTCGCCCTCTTGCTGATCATTCTGCAGTCAAGTATGGACGACCGCCTGGCGCCCTTCGAGTTCCACGTGAACCGCTCGATGCTGCAACGGTTGCTGAAGCCGTCTCGGTCATTTCCTGCTGGTATCATCAACTCGACTCAACTAAAAAATATAACTTAATCATACATTTTAGTATATTATGTTTAAAAATTTTCTATTTAGTCCCTTATTTTAACCTACTTCAATTATACCCTTTCTATTATGTTTCAATTATCATATTTGAGTTTGATATAAATTTAGAGAAACAAATATGCCACCTAGGTGATAAAGTAATTTTCTAATTCAGGCTTAAAATTTCTAAAAAGTTAGTAGACATTTGGATTTTTTTCTTTCTTAGGTGGCTTAATTGTTCTTTCAAATTTCTAAAACTTGCCTAGTATCATGTCACATAAGTTTAACATAGAGTCACATTTGACATCTACGAATGAAAATAGTACAATTTAAAAATAGGTTAAGATAAAATTATGTAAACTTTTTAAACACTATGGACTAAATTTAAAAAGAAAAAAAAAAATACACATGCATGCATACACAAATTAACACGCTGCTTATTGTCGTCTTTGACTTTATTTAATAGGTATGAGGAT
AAAACCGAATGGGGGGAACGAGTGGGATGGATTTATGGCTCGGTGACAGAAGATGTCGTTACCGGATACCGTATGCACAACCGTGGATGGCACTCAGTGTACTGCATTACCAAACGTGATGCTTTCCGTGGATCAGCTCCAATCAATCTCACTGATCGACTCCATCAAGTGCTCCGATGGGCGACAGGTTCGGTCGAGATTTTCTTCTCAAGAAACAATGCATTGCTCGCTTCTCGACGCCTTAAGCTTCTACAACGTCTCGCATATCTCAACGTTGGCATTTATCCCTTCACTTCGATCTTCCTCATCGTATATTGCTTCCTCCCCGCACTCTCCCTCTTCTCTGGCAACTTCATCGTTCAAACACTCAACGTCACATTCTTGGTCTACTTGTTGATCATAACAATCTGTCTAATCTCCCTAGCCATCCTGGAGGTGAAATGGTCAGGCATTGGGTTGGAAGAGTGGTGGAGAAACGAACAATTCTGGCTCATCTCAGGTAAGAAATAAAACATCCATCAACACATCCTAATATTCATAACATATTTTAAAGTCTTTTCATAAATTTGAGAACTTAACATTTCTATTGACATGAACCTTCTCATCTTTAATTCCAGGCACGTCTGCTCACTTAGCAGCTGTGGTACAGGGCCTGTTGAAGGTCATAGCCGGGATTGAAATCTCATTCACGCTTACATCAAAGTCTGCCGGGGACGAGAACGAGGACATATACGCGGATTTGTACTTGGTGAAATGGACTTCCTTGATGGTTCCTCCCATAGTGATTGCCATGATGAACATAATAGCCATGGGAGTGGCATTCTCAAGGACAATTTACAGCACAGTGCCCCAATGGAGCAAGTTCATAGGAGGGGCCTTCTTTAGCTTTTGGGTTCTGGCCCATTTGTACCCTTTTGCAAAAGGGTTGATGGGGAGGAGAGGGAAGACACCAACAATTGTCATAGTTTGGTCAGGTTTGATAGCCATTACACTTTCTTTGCTTTGGATAGCCATTAGCCCACCAAAATCAACTACTCCTGAAGCAGCTGTGGGAGGAGGAGGGTTTGAATTCCCATGA

mRNA sequence

ATGGCATCTTTAACCAATCAACCATCAAAGAAGGCCATTCGCAGCCCCGGCGGGTCTAATAATTCTACGAGCAATCGGGGTTCGGTTGGACAAACGGTTAAGTTTGCGCGAAGAACCTCGAGTGGCCGTTATGTTAGTTTGTCGAGAGAAGATCTCGACATGTCAGGAGAGGTTTCAGGAGATTATATTAACTACACTGTGCATATTCCTCCGACACCAGATAATCAACCGATGGACTCGTCAATAGCCACCAAAGCGGAGGAGCAATATGTCTCAAACTCGCTGTTTACTGGCGGATTCAATAGCGTGACGCGAGCGCATCTCATGGATAAGGTGATAGATTCAGAAGTGAGCCATCCCCAGATGGCTGGAGCTAAGGGCTCTTCGTGTGCCATGCCGGCTTGTGATGGCAAGGTCATGAAGGATGAGCGTGGCATCGATATCACCCCTTGTGAATGCAGGTTTCGAATATGTCGGGATTGTTATCTTGACGCCCAAAAGGACACAGGGCTATGTCCAGGATGTAAAGAGTTATACAAATCAGCCGATTATGATGATGATCCTAATGAATATTCAGGAGGAGCATTACAATTACAAGGGCCAGACGGGTCTAAAGGCGGGCAAAATATGTCAATGATGAAACTAAACCAAAGCGGAGAATTTGATCACAATAAGTGGTTGTTTGAGACAAAAGGAACTTATGGGGTTGGTAATGCATATTGGACACCAGAAGATGGCTATGCAGATGGTGGAGATGACAAGTTTCGTGATGGGGTGATGGAGTCAATGGATGCTTCTTGGAAACCGTTGAGTCGAACGTTTCCAATCCCAGCAAGCATCATCAGTCCTTATAGGTTGTTGATTTTGATTAGACTTGTGGTGTTGGGTTTCTTTTTGCATTGGAGAGTGAAACATCCAAATGAAGATGCAATATGGTTGTGGTTGATGTCAATTGTTTGTGAAATATGGTTCGCTTTTTCTTGGATTCTTGATCAGGTTCCGAAGTTGTGTCCCGTCAATCGTGCAACTGATCTTCAAGTGCTTCACGATAAGTTCGATGCACCGTGCCCATCGAATCCAACAGGTCGTTCCGACCTGCCTGGAGTTGACTTGTTTGTGTCGACTGCTGATCCTGAAAAGGAACCTGTTCTTGTCACTGCCAATACGATCTTATCTATTCTTGCCGTTGATTATCCGGTGGAAAAACTTGCTTGCTACATCTCTGATGATGGAGGTGCCCTGTTGACATTCGAGGCCATGGCAGAGGCAGCGAGCTTTGCAGATTTGTGGGTACCGTTTTGTCGTAAGCACGATATTGAGCCGAGAAATCCAGAGAGTTATTTTAGTTTAAAAGTGGATCCAACTAAGAACAAGAGTAGGTCAGATTTTGTGAAGGATAGAAGGAAGATCAAGCGAGAGTACGATGAGTTTAAGGTACGAACGAATGGGCTTCCTGACTCGATTAGGAGGCGATCAGATGCTTTCAATGCGAGAGAGGAGATGAAGATGTGGAAGCACATGAGAGAAACAGGGGCCGATGCCATGGAGCCGATTAAGGTTCAAAAAGCGACATGGATGGCTGATGGCACCCATTGGCCTGGAACTTGGGGTGTCTCTGCTAGTGACCATTCTAAGGGTGATCATGCTGGAATTCTTCAGGTGATGTTGAAGCCTCCGAGCCCCGATCCATTAATAGGGAGCACGGACGAGAAGATCATAGACTTCAGTGATGTAGACACTCGTCTTCCGATGTTCGTGTATGTGTCTCGAGAGAAACGACCCGGATATGATCATAACAAGAAGGCAGGTGCCATGAATGCACTTGTACGAGCTTCAGCCGTTTTGTCAAATGGTCCTTTCATTCTCAACCTTGATTGTGACCATTACATCTACAATTGCAAAGCCATTCGCGAAGGAATGTGCTTCATGATGGACCGTGGGGGAGAAGATATATGTTACATTCAATTCCCTCAAAGATTTGAAGGTATTGATCCTTCTGA
TCGATATGCCAATCATAATACTGTCTTCTTTGATGGCAACATGCGAGCACTTGATGGCATACAAGGTCCGGTTTACGTTGGGACTGGTTGTATGTTTAGGCGGTTCGCGCTCTATGGTTTCGACCCGCCGCAACCCGATAAGATTAAGCATAAAAATAATGATCAAGCAGAAACACAACCTTTGCAAGCCACTGATTTTGATCCTGATCTTGATGTGAATCTACTTCCCAAACGTTTTGGCAATTCTAACATGTTGGCTGAGTCAATACTGGTTGCAGAATTCCAAGGTCGCCCTCTTGCTGATCATTCTGCAGTCAAGTATGGACGACCGCCTGGCGCCCTTCGAGTTCCACGTGAACCGCTCGATGCTGCAACGGTTGCTGAAGCCGTCTCGGTCATTTCCTGCTGGTATGAGGATAAAACCGAATGGGGGGAACGAGTGGGATGGATTTATGGCTCGGTGACAGAAGATGTCGTTACCGGATACCGTATGCACAACCGTGGATGGCACTCAGTGTACTGCATTACCAAACGTGATGCTTTCCGTGGATCAGCTCCAATCAATCTCACTGATCGACTCCATCAAGTGCTCCGATGGGCGACAGGTTCGGTCGAGATTTTCTTCTCAAGAAACAATGCATTGCTCGCTTCTCGACGCCTTAAGCTTCTACAACGTCTCGCATATCTCAACGTTGGCATTTATCCCTTCACTTCGATCTTCCTCATCGTATATTGCTTCCTCCCCGCACTCTCCCTCTTCTCTGGCAACTTCATCGTTCAAACACTCAACGTCACATTCTTGGTCTACTTGTTGATCATAACAATCTGTCTAATCTCCCTAGCCATCCTGGAGGTGAAATGGTCAGGCATTGGGTTGGAAGAGTGGTGGAGAAACGAACAATTCTGGCTCATCTCAGGCACGTCTGCTCACTTAGCAGCTGTGGTACAGGGCCTGTTGAAGGTCATAGCCGGGATTGAAATCTCATTCACGCTTACATCAAAGTCTGCCGGGGACGAGAACGAGGACATATACGCGGATTTGTACTTGGTGAAATGGACTTCCTTGATGGTTCCTCCCATAGTGATTGCCATGATGAACATAATAGCCATGGGAGTGGCATTCTCAAGGACAATTTACAGCACAGTGCCCCAATGGAGCAAGTTCATAGGAGGGGCCTTCTTTAGCTTTTGGGTTCTGGCCCATTTGTACCCTTTTGCAAAAGGGTTGATGGGGAGGAGAGGGAAGACACCAACAATTGTCATAGTTTGGTCAGGTTTGATAGCCATTACACTTTCTTTGCTTTGGATAGCCATTAGCCCACCAAAATCAACTACTCCTGAAGCAGCTGTGGGAGGAGGAGGGTTTGAATTCCCATGA

Coding sequence (CDS)

ATGGCATCTTTAACCAATCAACCATCAAAGAAGGCCATTCGCAGCCCCGGCGGGTCTAATAATTCTACGAGCAATCGGGGTTCGGTTGGACAAACGGTTAAGTTTGCGCGAAGAACCTCGAGTGGCCGTTATGTTAGTTTGTCGAGAGAAGATCTCGACATGTCAGGAGAGGTTTCAGGAGATTATATTAACTACACTGTGCATATTCCTCCGACACCAGATAATCAACCGATGGACTCGTCAATAGCCACCAAAGCGGAGGAGCAATATGTCTCAAACTCGCTGTTTACTGGCGGATTCAATAGCGTGACGCGAGCGCATCTCATGGATAAGGTGATAGATTCAGAAGTGAGCCATCCCCAGATGGCTGGAGCTAAGGGCTCTTCGTGTGCCATGCCGGCTTGTGATGGCAAGGTCATGAAGGATGAGCGTGGCATCGATATCACCCCTTGTGAATGCAGGTTTCGAATATGTCGGGATTGTTATCTTGACGCCCAAAAGGACACAGGGCTATGTCCAGGATGTAAAGAGTTATACAAATCAGCCGATTATGATGATGATCCTAATGAATATTCAGGAGGAGCATTACAATTACAAGGGCCAGACGGGTCTAAAGGCGGGCAAAATATGTCAATGATGAAACTAAACCAAAGCGGAGAATTTGATCACAATAAGTGGTTGTTTGAGACAAAAGGAACTTATGGGGTTGGTAATGCATATTGGACACCAGAAGATGGCTATGCAGATGGTGGAGATGACAAGTTTCGTGATGGGGTGATGGAGTCAATGGATGCTTCTTGGAAACCGTTGAGTCGAACGTTTCCAATCCCAGCAAGCATCATCAGTCCTTATAGGTTGTTGATTTTGATTAGACTTGTGGTGTTGGGTTTCTTTTTGCATTGGAGAGTGAAACATCCAAATGAAGATGCAATATGGTTGTGGTTGATGTCAATTGTTTGTGAAATATGGTTCGCTTTTTCTTGGATTCTTGATCAGGTTCCGAAGTTGTGTCCCGTCAATCGTGCAACTGATCTTCAAGTGCTTCACGATAAGTTCGATGCACCGTGCCCATCGAATCCAACAGGTCGTTCCGACCTGCCTGGAGTTGACTTGTTTGTGTCGACTGCTGATCCTGAAAAGGAACCTGTTCTTGTCACTGCCAATACGATCTTATCTATTCTTGCCGTTGATTATCCGGTGGAAAAACTTGCTTGCTACATCTCTGATGATGGAGGTGCCCTGTTGACATTCGAGGCCATGGCAGAGGCAGCGAGCTTTGCAGATTTGTGGGTACCGTTTTGTCGTAAGCACGATATTGAGCCGAGAAATCCAGAGAGTTATTTTAGTTTAAAAGTGGATCCAACTAAGAACAAGAGTAGGTCAGATTTTGTGAAGGATAGAAGGAAGATCAAGCGAGAGTACGATGAGTTTAAGGTACGAACGAATGGGCTTCCTGACTCGATTAGGAGGCGATCAGATGCTTTCAATGCGAGAGAGGAGATGAAGATGTGGAAGCACATGAGAGAAACAGGGGCCGATGCCATGGAGCCGATTAAGGTTCAAAAAGCGACATGGATGGCTGATGGCACCCATTGGCCTGGAACTTGGGGTGTCTCTGCTAGTGACCATTCTAAGGGTGATCATGCTGGAATTCTTCAGGTGATGTTGAAGCCTCCGAGCCCCGATCCATTAATAGGGAGCACGGACGAGAAGATCATAGACTTCAGTGATGTAGACACTCGTCTTCCGATGTTCGTGTATGTGTCTCGAGAGAAACGACCCGGATATGATCATAACAAGAAGGCAGGTGCCATGAATGCACTTGTACGAGCTTCAGCCGTTTTGTCAAATGGTCCTTTCATTCTCAACCTTGATTGTGACCATTACATCTACAATTGCAAAGCCATTCGCGAAGGAATGTGCTTCATGATGGACCGTGGGGGAGAAGATATATGTTACATTCAATTCCCTCAAAGATTTGAAGGTATTGATCCTTCTGA
TCGATATGCCAATCATAATACTGTCTTCTTTGATGGCAACATGCGAGCACTTGATGGCATACAAGGTCCGGTTTACGTTGGGACTGGTTGTATGTTTAGGCGGTTCGCGCTCTATGGTTTCGACCCGCCGCAACCCGATAAGATTAAGCATAAAAATAATGATCAAGCAGAAACACAACCTTTGCAAGCCACTGATTTTGATCCTGATCTTGATGTGAATCTACTTCCCAAACGTTTTGGCAATTCTAACATGTTGGCTGAGTCAATACTGGTTGCAGAATTCCAAGGTCGCCCTCTTGCTGATCATTCTGCAGTCAAGTATGGACGACCGCCTGGCGCCCTTCGAGTTCCACGTGAACCGCTCGATGCTGCAACGGTTGCTGAAGCCGTCTCGGTCATTTCCTGCTGGTATGAGGATAAAACCGAATGGGGGGAACGAGTGGGATGGATTTATGGCTCGGTGACAGAAGATGTCGTTACCGGATACCGTATGCACAACCGTGGATGGCACTCAGTGTACTGCATTACCAAACGTGATGCTTTCCGTGGATCAGCTCCAATCAATCTCACTGATCGACTCCATCAAGTGCTCCGATGGGCGACAGGTTCGGTCGAGATTTTCTTCTCAAGAAACAATGCATTGCTCGCTTCTCGACGCCTTAAGCTTCTACAACGTCTCGCATATCTCAACGTTGGCATTTATCCCTTCACTTCGATCTTCCTCATCGTATATTGCTTCCTCCCCGCACTCTCCCTCTTCTCTGGCAACTTCATCGTTCAAACACTCAACGTCACATTCTTGGTCTACTTGTTGATCATAACAATCTGTCTAATCTCCCTAGCCATCCTGGAGGTGAAATGGTCAGGCATTGGGTTGGAAGAGTGGTGGAGAAACGAACAATTCTGGCTCATCTCAGGCACGTCTGCTCACTTAGCAGCTGTGGTACAGGGCCTGTTGAAGGTCATAGCCGGGATTGAAATCTCATTCACGCTTACATCAAAGTCTGCCGGGGACGAGAACGAGGACATATACGCGGATTTGTACTTGGTGAAATGGACTTCCTTGATGGTTCCTCCCATAGTGATTGCCATGATGAACATAATAGCCATGGGAGTGGCATTCTCAAGGACAATTTACAGCACAGTGCCCCAATGGAGCAAGTTCATAGGAGGGGCCTTCTTTAGCTTTTGGGTTCTGGCCCATTTGTACCCTTTTGCAAAAGGGTTGATGGGGAGGAGAGGGAAGACACCAACAATTGTCATAGTTTGGTCAGGTTTGATAGCCATTACACTTTCTTTGCTTTGGATAGCCATTAGCCCACCAAAATCAACTACTCCTGAAGCAGCTGTGGGAGGAGGAGGGTTTGAATTCCCATGA

Protein sequence

MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSGDYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHPQMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYKSADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAYWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLHWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNPTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAMAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVRTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP
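
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first and last few codons of the CDS above are used as inputs.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Raises ValueError if the length is not a multiple of three; stops at
    the first stop codon when to_stop is True.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCATCTTTAACCAATCAACCA"))  # MASLTNQP (start of the CDS)
print(translate("GAATTCCCATGA"))              # EFP (end of the CDS, stop codon dropped)
```

Running the same function over the full CDS should reproduce the protein sequence listed here, provided the CDS is in frame and ends in a single stop codon.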
Homology
BLAST of Moc04g03770 vs. NCBI nr
Match: XP_022135687.1 (cellulose synthase-like protein D4 [Momordica charantia])

HSP 1 Score: 2302.7 bits (5966), Expect = 0.0e+00
Identity = 1125/1125 (100.00%), Positives = 1125/1125 (100.00%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY 240
            SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY
Sbjct: 181  SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY 240

Query: 241  WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 300
            WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH
Sbjct: 241  WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 300

Query: 301  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 360
            WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP
Sbjct: 301  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 360

Query: 361  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 420
            TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM
Sbjct: 361  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 420

Query: 421  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 480
            AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR
Sbjct: 421  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 480

Query: 481  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSA 540
            TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSA
Sbjct: 481  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSA 540

Query: 541  SDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNK 600
            SDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNK
Sbjct: 541  SDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNK 600

Query: 601  KAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRF 660
            KAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRF
Sbjct: 601  KAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRF 660

Query: 661  EGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNN 720
            EGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNN
Sbjct: 661  EGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNN 720

Query: 721  DQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGA 780
            DQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGA
Sbjct: 721  DQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGA 780

Query: 781  LRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVY 840
            LRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVY
Sbjct: 781  LRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVY 840

Query: 841  CITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGI 900
            CITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGI
Sbjct: 841  CITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGI 900

Query: 901  YPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLE 960
            YPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLE
Sbjct: 901  YPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLE 960

Query: 961  EWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWT 1020
            EWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWT
Sbjct: 961  EWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWT 1020

Query: 1021 SLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRR 1080
            SLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRR
Sbjct: 1021 SLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRR 1080

Query: 1081 GKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1125
            GKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP
Sbjct: 1081 GKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1125

BLAST of Moc04g03770 vs. NCBI nr
Match: XP_038888576.1 (cellulose synthase-like protein D4 [Benincasa hispida])

HSP 1 Score: 2141.3 bits (5547), Expect = 0.0e+00
Identity = 1042/1126 (92.54%), Positives = 1084/1126 (96.27%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MASLTNQPSKKAIRSPGGS NSTSNRGS GQTVKFARRTSSGRYVSLSREDLDMSGE+SG
Sbjct: 1    MASLTNQPSKKAIRSPGGSANSTSNRGSSGQTVKFARRTSSGRYVSLSREDLDMSGEISG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSS+ATKAEEQYVSNSLFTGGFNSVTRAHLMD+VIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSVATKAEEQYVSNSLFTGGFNSVTRAHLMDRVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERG D TPCECRFRICR+C+ DA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGKDFTPCECRFRICRECHFDAIKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              DY+DD N+YS G LQLQGPDGSKGG QNMSMMKLNQ GEFDHNKWLFE+KGTYGVGNA
Sbjct: 181  MGDYEDDYNDYSNGTLQLQGPDGSKGGSQNMSMMKLNQGGEFDHNKWLFESKGTYGVGNA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            Y+   D Y +G DDKFR+G+MESMD  WKPLSRTFPIPASIISPYRLLIL+RLVVLGFFL
Sbjct: 241  YY---DDYDNGDDDKFREGMMESMDKPWKPLSRTFPIPASIISPYRLLILVRLVVLGFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDA+WLWLMSI+CEIWFAFSWILDQ+PKLCPVNR TDLQVL+DKFDAP PSN
Sbjct: 301  HWRVQHPNEDAVWLWLMSIICEIWFAFSWILDQIPKLCPVNRGTDLQVLYDKFDAPSPSN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVD+FVSTADPEKEPVLVTANTILSILA DYPVEKLACYISDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDMFVSTADPEKEPVLVTANTILSILAADYPVEKLACYISDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVS 540
            RTNGLPDSIRRRSDAFNAREEMKMWKHM+ETGADAMEPIKVQKATWMADGTHWPGTW V 
Sbjct: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMKETGADAMEPIKVQKATWMADGTHWPGTWVVP 540

Query: 541  ASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHN 600
            +SDHSKGDHAGILQVMLKPPS DPL+GS DEKIIDF+DVD RLPMFVYVSREKRPGYDHN
Sbjct: 541  SSDHSKGDHAGILQVMLKPPSHDPLMGSADEKIIDFTDVDIRLPMFVYVSREKRPGYDHN 600

Query: 601  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQR 660
            KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQR
Sbjct: 601  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQR 660

Query: 661  FEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKN 720
            FEGIDPSDRYANHNTVFFDGNMRALDG+QGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 
Sbjct: 661  FEGIDPSDRYANHNTVFFDGNMRALDGVQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK- 720

Query: 721  NDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPG 780
            +D +ETQPLQ+TDFDPDLDVNLLPKRFGNS MLA+SI +AEFQGRPLADHSAVKYGRPPG
Sbjct: 721  SDSSETQPLQSTDFDPDLDVNLLPKRFGNSTMLADSIPIAEFQGRPLADHSAVKYGRPPG 780

Query: 781  ALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV 840
            ALR+PR PLD+ATVAE+VSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV
Sbjct: 781  ALRLPRPPLDSATVAESVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV 840

Query: 841  YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVG 900
            YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLK LQRLAYLNVG
Sbjct: 841  YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKFLQRLAYLNVG 900

Query: 901  IYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGL 960
            IYPFTSIFLIVYCFLPALSLFSG FIVQTLNVTFL+YLLIIT+CLISLAILEVKWSGIGL
Sbjct: 901  IYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLIYLLIITVCLISLAILEVKWSGIGL 960

Query: 961  EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKW 1020
            EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGD+ EDIYADLYLVKW
Sbjct: 961  EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDDIEDIYADLYLVKW 1020

Query: 1021 TSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR 1080
            TSLMVPPIVIAMMNIIA+ VAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR
Sbjct: 1021 TSLMVPPIVIAMMNIIAIVVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR 1080

Query: 1081 RGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            RGKTPTIVIVWSGLIAITLSLLWIAI+PPK +  +AAVGGGGFEFP
Sbjct: 1081 RGKTPTIVIVWSGLIAITLSLLWIAINPPKPSAADAAVGGGGFEFP 1122
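
The percentages in each HSP summary line are the matched counts divided by the alignment length, which includes gap columns. The sketch below parses such a line and recomputes the values; `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern deliberately accepts both the "Positives" and "Postives" spellings seen in BLAST report renderings.

```python
import re

# Matches e.g. "Identity = 1042/1126 (92.54%)"; tolerates the "Postives" spelling.
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {'identity': (hits, length, pct), 'positives': (...)} for one line."""
    stats = {}
    for label, hits, length, pct in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(hits), int(length), float(pct))
    return stats

def percent(hits, length):
    """Recompute the reported percentage, rounded to two decimals."""
    return round(100.0 * hits / length, 2)

line = "Identity = 1042/1126 (92.54%), Positives = 1084/1126 (96.27%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity"])    # (1042, 1126, 92.54)
print(percent(1042, 1126))  # 92.54
print(percent(1084, 1126))  # 96.27
```

Because the denominator is the alignment length rather than the query length, a hit against the 1125-residue query can report 1126 or 1127 columns when the alignment contains gaps.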

BLAST of Moc04g03770 vs. NCBI nr
Match: XP_022989294.1 (cellulose synthase-like protein D4 [Cucurbita maxima])

HSP 1 Score: 2136.3 bits (5534), Expect = 0.0e+00
Identity = 1042/1127 (92.46%), Positives = 1081/1127 (95.92%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MA+LTNQPSKKAIRSPG S NSTS R + GQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MATLTNQPSKKAIRSPGASANSTSIRAASGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIA+KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIASKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERG DITPCECRFRICRDCYLDA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGKDITPCECRFRICRDCYLDALKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              DY++D NEYS  ALQL GPDGSKGG QNMSMMKLNQSGEFDHNKWLFE+KGTYGVG+A
Sbjct: 181  VGDYEEDSNEYS--ALQLHGPDGSKGGSQNMSMMKLNQSGEFDHNKWLFESKGTYGVGSA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            YWTP+DGY +GG+D F DG+ME+MD  WKPLSRTFPIPASIISPYRLLIL+RLVVL FFL
Sbjct: 241  YWTPDDGYGNGGNDNFGDGMMEAMDKPWKPLSRTFPIPASIISPYRLLILVRLVVLAFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSIVCEIWFAFSWILDQ+PKLCPVNRATDLQVL+DKFDAP P N
Sbjct: 301  HWRVQHPNEDAIWLWLMSIVCEIWFAFSWILDQIPKLCPVNRATDLQVLYDKFDAPSPLN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACY+SDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYVSDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGA-DAMEPIKVQKATWMADGTHWPGTWGV 540
            RTNGLPDSIRRRS+AFNAREEMKMWK M+E G  DAMEPIKVQKATWMADG+HWPGTW V
Sbjct: 481  RTNGLPDSIRRRSEAFNAREEMKMWKLMKEKGGPDAMEPIKVQKATWMADGSHWPGTWVV 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
               DHSKGDH+GILQVMLKPPS DPL+GSTDEKIIDF+DVD RLPMFVYVSREKRPGYDH
Sbjct: 541  PTGDHSKGDHSGILQVMLKPPSHDPLLGSTDEKIIDFTDVDIRLPMFVYVSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVR+SAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRSSAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDK+K  
Sbjct: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKMKQA 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
             NDQ ETQPLQ TDFDPDLDVNLLPKRFGNS MLAESILVAEFQGRP+ADH AVKYGRPP
Sbjct: 721  KNDQPETQPLQPTDFDPDLDVNLLPKRFGNSTMLAESILVAEFQGRPIADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDAATVAEAVSVISCWYEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGWHS
Sbjct: 781  GALRVPRQPLDAATVAEAVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWHS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA LA+RRLKLLQRLAYLNV
Sbjct: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAFLATRRLKLLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTSIFLIVYCFLPALSLFSGNFIVQ+LN TFL+YLLIITICLISLA+LEVKWSGIG
Sbjct: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQSLNATFLIYLLIITICLISLAVLEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKS+GDENEDIYADLYLVK
Sbjct: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSSGDENEDIYADLYLVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            WTSLMVPPIVIAMMNIIAM VAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WTSLMVPPIVIAMMNIIAMVVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            RRGKTPTIVIVWSGLIAITLSLLWIAISPPK+   +AA+GGGGFEFP
Sbjct: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKAEDADAAIGGGGFEFP 1125

BLAST of Moc04g03770 vs. NCBI nr
Match: XP_022928356.1 (cellulose synthase-like protein D4 [Cucurbita moschata] >KAG6589078.1 Cellulose synthase-like protein D4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2135.5 bits (5532), Expect = 0.0e+00
Identity = 1042/1127 (92.46%), Positives = 1080/1127 (95.83%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MA+LTNQPSKKAIRSPG S NSTS R + GQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MATLTNQPSKKAIRSPGASANSTSIRAASGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIA+KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIASKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERG DITPCECRFRICRDCYLDA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGKDITPCECRFRICRDCYLDALKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              DY++D NEYS  ALQL GPDGSKGG QNMSMMKLNQ+GEFDHNKWLFE+KGTYGVG+A
Sbjct: 181  VGDYEEDSNEYS--ALQLHGPDGSKGGSQNMSMMKLNQTGEFDHNKWLFESKGTYGVGSA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            YWTPEDGY +GG+D F DG+ME+MD  WKPLSRTFPIPASIISPYRLLIL+R VVLGFFL
Sbjct: 241  YWTPEDGYGNGGNDNFGDGMMEAMDKPWKPLSRTFPIPASIISPYRLLILVRFVVLGFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSIVCEIWFAFSWILDQ+PKLCPVNRATDLQVL+DKFDAP P N
Sbjct: 301  HWRVRHPNEDAIWLWLMSIVCEIWFAFSWILDQIPKLCPVNRATDLQVLYDKFDAPSPLN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACY+SDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYVSDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGA-DAMEPIKVQKATWMADGTHWPGTWGV 540
            RTNGLPDSIRRRS+AFNAREEMKMWK M+E G  DAMEPIKVQKATWMADG+HWPGTW V
Sbjct: 481  RTNGLPDSIRRRSEAFNAREEMKMWKLMKEKGGPDAMEPIKVQKATWMADGSHWPGTWVV 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
               DHSKGDH+GILQVMLKPPS DPL+GSTDEKIIDF+DVD RLPMFVYVSREKRPGYDH
Sbjct: 541  PTGDHSKGDHSGILQVMLKPPSHDPLLGSTDEKIIDFTDVDIRLPMFVYVSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVR+SAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRSSAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDK+K  
Sbjct: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKMKQA 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
             +DQ ETQPLQ TDFDPDLDVNLLPKRFGNS MLAESILVAEFQGRP+ADH AVKYGRPP
Sbjct: 721  KSDQPETQPLQPTDFDPDLDVNLLPKRFGNSTMLAESILVAEFQGRPIADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS
Sbjct: 781  GALRVPRQPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA LA+RRLK LQRLAYLNV
Sbjct: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAFLATRRLKFLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTSIFLIVYCFLPALSLFSGNFIVQ+LN TFL+YLLIITICLISLAILEVKWSGIG
Sbjct: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQSLNATFLIYLLIITICLISLAILEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKS+GDENEDIYADLYLVK
Sbjct: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSSGDENEDIYADLYLVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            WTSLMVPPIVIAMMNIIAM VAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WTSLMVPPIVIAMMNIIAMVVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            RRGKTPTIVIVWSGLIAITLSLLWIAISPPK+   +AA+GGGGFEFP
Sbjct: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKAEDADAAIGGGGFEFP 1125

BLAST of Moc04g03770 vs. NCBI nr
Match: XP_023529589.1 (cellulose synthase-like protein D4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2134.0 bits (5528), Expect = 0.0e+00
Identity = 1040/1127 (92.28%), Positives = 1081/1127 (95.92%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MA+LTNQPSKKAIRSPG S NSTS R + GQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MATLTNQPSKKAIRSPGASANSTSIRAASGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIA+KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIASKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERG DITPCECRFRICRDCYLDA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGKDITPCECRFRICRDCYLDALKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              DY++D NEYS  ALQL GPDGSKGG QNMSMMKLNQ+GEFDHNKWLFE+KGTYGVG+A
Sbjct: 181  VGDYEEDSNEYS--ALQLHGPDGSKGGSQNMSMMKLNQTGEFDHNKWLFESKGTYGVGSA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            YWTPEDGY +GG+D F DG+ME+MD  WKPLSRTFPIPASIISPYRLLIL+R VVLGFFL
Sbjct: 241  YWTPEDGYGNGGNDNFGDGMMEAMDKPWKPLSRTFPIPASIISPYRLLILVRFVVLGFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSIVCEIWFAFSWILDQ+PKLCPVNRATDLQVL+DKFDAP P N
Sbjct: 301  HWRVRHPNEDAIWLWLMSIVCEIWFAFSWILDQIPKLCPVNRATDLQVLYDKFDAPSPLN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACY+SDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYVSDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGA-DAMEPIKVQKATWMADGTHWPGTWGV 540
            RTNGLPDSIRRRS+AFNAREEMKMWK M+E G  DAMEPIKVQKATWMADG+HWPGTW V
Sbjct: 481  RTNGLPDSIRRRSEAFNAREEMKMWKLMKEKGGPDAMEPIKVQKATWMADGSHWPGTWVV 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
               DHSKGDH+GILQVMLKPPS DPL+GSTDEKIIDF+DVD RLPMFVY+SREKRPGYDH
Sbjct: 541  PTGDHSKGDHSGILQVMLKPPSHDPLLGSTDEKIIDFTDVDIRLPMFVYMSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVR+SAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRSSAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDK+K  
Sbjct: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKMKQA 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
             +DQ ETQPLQ TDFDPDLDVNLLPKRFGNS MLAESILVAEFQGRP+ADH AVKYGRPP
Sbjct: 721  KSDQPETQPLQPTDFDPDLDVNLLPKRFGNSTMLAESILVAEFQGRPIADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDAATVAEAVSVISCWYEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGWHS
Sbjct: 781  GALRVPRQPLDAATVAEAVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWHS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA+LA+RRLK LQRLAYLNV
Sbjct: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAILATRRLKFLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTSIFLIVYCFLPALSLFSGNFIVQ+LN TFL+YLLIITICLISLAILEVKWSGIG
Sbjct: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQSLNATFLIYLLIITICLISLAILEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKS+GDENEDIYADLYLVK
Sbjct: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSSGDENEDIYADLYLVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            WTSLMVPPIVIAMMNIIAM VAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WTSLMVPPIVIAMMNIIAMVVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            RRGKTPTIVIVWSGLIAITLSLLWIAISPPK+   +AA+GGGGFEFP
Sbjct: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKAEDADAAIGGGGFEFP 1125
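
The identity and positives percentages in each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch for recomputing them from a summary line; the function name `recompute_hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST toolkit or of this database.

```python
import re

# Hypothetical helper: parse one HSP summary line in the layout shown on this
# page and recompute the percentages from the raw counts.
# The pattern also tolerates the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def recompute_hsp_percentages(line):
    """Return {label: (matches, alignment_length, recomputed_percent)}."""
    stats = {}
    for label, matches, length, _reported in HSP_STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        percent = round(100 * int(matches) / int(length), 2)
        stats[key] = (int(matches), int(length), percent)
    return stats


if __name__ == "__main__":
    summary = ("Identity = 1040/1127 (92.28%), "
               "Positives = 1081/1127 (95.92%), Query Frame = 0")
    print(recompute_hsp_percentages(summary))
```

The denominator is the alignment length including gap columns, which is why it can exceed the query length.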

BLAST of Moc04g03770 vs. ExPASy Swiss-Prot
Match: Q9SZL9 (Cellulose synthase-like protein D4 OS=Arabidopsis thaliana OX=3702 GN=CSLD4 PE=2 SV=1)

HSP 1 Score: 1898.2 bits (4916), Expect = 0.0e+00
Identity = 920/1130 (81.42%), Positives = 1011/1130 (89.47%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MAS   Q SKK        NNS S     GQTVKFARRTSSGRYVSLSR+++++SGE+SG
Sbjct: 1    MASTPPQTSKKV------RNNSGS-----GQTVKFARRTSSGRYVSLSRDNIELSGELSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DY NYTVHIPPTPDNQPM    ATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDS+V+HP
Sbjct: 61   DYSNYTVHIPPTPDNQPM----ATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSDVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDG VMKDERG D+ PCECRF+ICRDC++DAQK+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGNVMKDERGKDVMPCECRFKICRDCFMDAQKETGLCPGCKEQYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG--QNMSMMKLNQSGEFDHNKWLFETKGTYGVGN 240
              D DDD  +YS GAL L  P   + G   NMSMMK NQ+GEFDHN+WLFET+GTYG GN
Sbjct: 181  IGDLDDDTPDYSSGALPLPAPGKDQRGNNNNMSMMKRNQNGEFDHNRWLFETQGTYGYGN 240

Query: 241  AYWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFF 300
            AYW  ++ Y D  D+  R G++E+ D  W+PLSR  PIPA+IISPYRLLI+IR VVL FF
Sbjct: 241  AYWPQDEMYGDDMDEGMRGGMVETADKPWRPLSRRIPIPAAIISPYRLLIVIRFVVLCFF 300

Query: 301  LHWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPS 360
            L WR+++PNEDAIWLWLMSI+CE+WF FSWILDQ+PKLCP+NR+TDL+VL DKFD P PS
Sbjct: 301  LTWRIRNPNEDAIWLWLMSIICELWFGFSWILDQIPKLCPINRSTDLEVLRDKFDMPSPS 360

Query: 361  NPTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFE 420
            NPTGRSDLPG+DLFVSTADPEKEP LVTANTILSILAVDYPVEK++CY+SDDGGALL+FE
Sbjct: 361  NPTGRSDLPGIDLFVSTADPEKEPPLVTANTILSILAVDYPVEKVSCYLSDDGGALLSFE 420

Query: 421  AMAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFK 480
            AMAEAASFADLWVPFCRKH+IEPRNP+SYFSLK+DPTKNKSR DFVKDRRKIKREYDEFK
Sbjct: 421  AMAEAASFADLWVPFCRKHNIEPRNPDSYFSLKIDPTKNKSRIDFVKDRRKIKREYDEFK 480

Query: 481  VRTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGV 540
            VR NGLPDSIRRRSDAFNAREEMK  K MRE+G D  EP+KV KATWMADGTHWPGTW  
Sbjct: 481  VRINGLPDSIRRRSDAFNAREEMKALKQMRESGGDPTEPVKVPKATWMADGTHWPGTWAA 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
            S  +HSKGDHAGILQVMLKPPS DPLIG++D+K+IDFSD DTRLPMFVYVSREKRPGYDH
Sbjct: 541  STREHSKGDHAGILQVMLKPPSSDPLIGNSDDKVIDFSDTDTRLPMFVYVSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVRASA+LSNGPFILNLDCDHYIYNCKA+REGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRASAILSNGPFILNLDCDHYIYNCKAVREGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYAN+NTVFFDGNMRALDG+QGPVYVGTG MFRRFALYGFDPP PDK+  K
Sbjct: 661  RFEGIDPSDRYANNNTVFFDGNMRALDGVQGPVYVGTGTMFRRFALYGFDPPNPDKLLEK 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
               ++ET+ L  +DFDPDLDV  LPKRFGNS +LAESI +AEFQGRPLADH AVKYGRPP
Sbjct: 721  K--ESETEALTTSDFDPDLDVTQLPKRFGNSTLLAESIPIAEFQGRPLADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDA TVAE+VSVISCWYEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGW S
Sbjct: 781  GALRVPRDPLDATTVAESVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRD+FRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA+LAS+RLK LQRLAYLNV
Sbjct: 841  VYCITKRDSFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAILASKRLKFLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTS+FLI+YCFLPA SLFSG FIV+TL+++FLVYLL+ITICLI LA+LEVKWSGIG
Sbjct: 901  GIYPFTSLFLILYCFLPAFSLFSGQFIVRTLSISFLVYLLMITICLIGLAVLEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQ+WLISGTS+HL AVVQG+LKVIAGIEISFTLT+KS GD+NEDIYADLY+VK
Sbjct: 961  LEEWWRNEQWWLISGTSSHLYAVVQGVLKVIAGIEISFTLTTKSGGDDNEDIYADLYIVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            W+SLM+PPIVIAM+NIIA+ VAF RTIY  VPQWSK IGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WSSLMIPPIVIAMVNIIAIVVAFIRTIYQAVPQWSKLIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAA---VGGGGFEFP 1126
            RRGKTPTIV VW+GLIAIT+SLLW AI+P  +T P AA   VGGGGF+FP
Sbjct: 1081 RRGKTPTIVFVWAGLIAITISLLWTAINP--NTGPAAAAEGVGGGGFQFP 1111

BLAST of Moc04g03770 vs. ExPASy Swiss-Prot
Match: Q7EZW6 (Cellulose synthase-like protein D3 OS=Oryza sativa subsp. japonica OX=39947 GN=CSLD3 PE=2 SV=2)

HSP 1 Score: 1700.6 bits (4403), Expect = 0.0e+00
Identity = 835/1154 (72.36%), Positives = 956/1154 (82.84%), Query Frame = 0

Query: 4    LTNQPSKKAIRSPGG-----SNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEV 63
            ++  P KKAIR+ GG       ++   RG  GQ VKFARRTSSGRYVSLSRED+DM GE+
Sbjct: 1    MSTGPGKKAIRNAGGVGGGAGPSAGGPRGPAGQAVKFARRTSSGRYVSLSREDIDMEGEL 60

Query: 64   SGDYINYTVHIPPTPDNQPM-----DSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVI 123
            + DY NYTV IPPTPDNQPM      +S+A KAEEQYVSNSLFTGGFNS TRAHLMDKVI
Sbjct: 61   AADYTNYTVQIPPTPDNQPMLNGAEPASVAMKAEEQYVSNSLFTGGFNSATRAHLMDKVI 120

Query: 124  DSEVSHPQMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCP 183
            +S VSHPQMAGAKGS CAMPACDG  M++ERG D+ PCEC F+ICRDCYLDAQKD  +CP
Sbjct: 121  ESSVSHPQMAGAKGSRCAMPACDGSAMRNERGEDVDPCECHFKICRDCYLDAQKDGCICP 180

Query: 184  GCKELYKSADY-DDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKG 243
            GCKE YK  +Y DDDP++   G L L GP    GG N S++  NQ+GEFDHN+WLFE+ G
Sbjct: 181  GCKEHYKIGEYADDDPHD---GKLHLPGPG---GGGNKSLLARNQNGEFDHNRWLFESSG 240

Query: 244  TYGVGNAYWTPEDGYADGGDDKF----------RDGVMESMDASWKPLSRTFPIPASIIS 303
            TYG GNA+W     Y D  DD              G        +KPL+R  P+P S+IS
Sbjct: 241  TYGYGNAFWPKGGMYDDDLDDDVDKLGGDGGGGGGGGPLPEQKPFKPLTRKIPMPTSVIS 300

Query: 304  PYRLLILIRLVVLGFFLHWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRA 363
            PYR+ I+IR+ VL F+L WR+++PN +A+WLW MSIVCE+WFAFSW+LD +PK+ PVNR+
Sbjct: 301  PYRIFIVIRMFVLLFYLTWRIRNPNMEALWLWGMSIVCELWFAFSWLLDMLPKVNPVNRS 360

Query: 364  TDLQVLHDKFDAPCPSNPTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEK 423
            TDL VL +KF+ P PSNP GRSDLPG+D+FVSTADPEKEPVL TA TILSILAVDYPVEK
Sbjct: 361  TDLAVLKEKFETPSPSNPHGRSDLPGLDVFVSTADPEKEPVLTTATTILSILAVDYPVEK 420

Query: 424  LACYISDDGGALLTFEAMAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSD 483
            LACY+SDDGGALLTFEAMAEAASFA++WVPFC+KHDIEPRNP+SYFS+K DPTK K R+D
Sbjct: 421  LACYVSDDGGALLTFEAMAEAASFANVWVPFCKKHDIEPRNPDSYFSVKGDPTKGKRRND 480

Query: 484  FVKDRRKIKREYDEFKVRTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQK 543
            FVKDRR++KRE+DEFKVR NGLPDSIRRRSDAFNARE+MKM KH+RETGAD  E  KV+K
Sbjct: 481  FVKDRRRVKREFDEFKVRINGLPDSIRRRSDAFNAREDMKMLKHLRETGADPSEQPKVKK 540

Query: 544  ATWMADGTHWPGTWGVSASDHSKGDHAGILQVMLKPPSPDPLIG-STDEKIIDFSDVDTR 603
            ATWMADG+HWPGTW  SA DH+KG+HAGILQVMLKPPSPDPL G   D+++IDFSDVD R
Sbjct: 541  ATWMADGSHWPGTWAASAPDHAKGNHAGILQVMLKPPSPDPLYGMHDDDQMIDFSDVDIR 600

Query: 604  LPMFVYVSREKRPGYDHNKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMC 663
            LPM VY+SREKRPGYDHNKKAGAMNALVR SAV+SNGPF+LN DCDHYI N +A+RE MC
Sbjct: 601  LPMLVYMSREKRPGYDHNKKAGAMNALVRCSAVMSNGPFMLNFDCDHYINNAQAVREAMC 660

Query: 664  FMMDRGGEDICYIQFPQRFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRR 723
            F MDRGGE I YIQFPQRFEGIDPSDRYAN+NTVFFDGNMRALDG+QGP+YVGTGCMFRR
Sbjct: 661  FFMDRGGERIAYIQFPQRFEGIDPSDRYANNNTVFFDGNMRALDGLQGPMYVGTGCMFRR 720

Query: 724  FALYGFDPPQ----------PDKIKHKNNDQAETQPLQATDFDPDLDVNLLPKRFGNSNM 783
            FA+YGFDPP+            K+    + +++TQ L+A DFD +L  +L+P+RFGNS+ 
Sbjct: 721  FAVYGFDPPRTAEYTGWLFTKKKVTTFKDPESDTQTLKAEDFDAELTSHLVPRRFGNSSP 780

Query: 784  LAESILVAEFQGRPLADHSAVKYGRPPGALRVPREPLDAATVAEAVSVISCWYEDKTEWG 843
               SI VAEFQ RPLADH AV +GRP GAL VPR PLD  TVAEAVSVISCWYEDKTEWG
Sbjct: 781  FMASIPVAEFQARPLADHPAVLHGRPSGALTVPRPPLDPPTVAEAVSVISCWYEDKTEWG 840

Query: 844  ERVGWIYGSVTEDVVTGYRMHNRGWHSVYCITKRDAFRGSAPINLTDRLHQVLRWATGSV 903
            +RVGWIYGSVTEDVVTGYRMHNRGW SVYCITKRDAF G+APINLTDRLHQVLRWATGSV
Sbjct: 841  DRVGWIYGSVTEDVVTGYRMHNRGWRSVYCITKRDAFLGTAPINLTDRLHQVLRWATGSV 900

Query: 904  EIFFSRNNALLASRRLKLLQRLAYLNVGIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNV 963
            EIFFSRNNA LASR+L LLQR++YLNVGIYPFTSIFL+VYCF+PALSLFSG FIVQ L++
Sbjct: 901  EIFFSRNNAFLASRKLMLLQRISYLNVGIYPFTSIFLLVYCFIPALSLFSGFFIVQKLDI 960

Query: 964  TFLVYLLIITICLISLAILEVKWSGIGLEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAG 1023
             FL YLL +TI L++L ILEVKWSGI LE+WWRNEQFWLISGTSAHL AVVQGLLKV+AG
Sbjct: 961  AFLCYLLTMTITLVALGILEVKWSGIELEDWWRNEQFWLISGTSAHLYAVVQGLLKVMAG 1020

Query: 1024 IEISFTLTSKSAGDENEDIYADLYLVKWTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQ 1083
            IEISFTLT+K+A D+NEDIYADLY+VKW+SL++PPI I M+NIIA+  AF+RTIYS  P+
Sbjct: 1021 IEISFTLTAKAAADDNEDIYADLYIVKWSSLLIPPITIGMVNIIAIAFAFARTIYSDNPR 1080

Query: 1084 WSKFIGGAFFSFWVLAHLYPFAKGLMGRRGKTPTIVIVWSGLIAITLSLLWIAISPPKST 1126
            W KFIGG FFSFWVLAHL PFAKGLMGRRGKTPTIV VWSGL++IT+SLLW+AISPP++ 
Sbjct: 1081 WGKFIGGGFFSFWVLAHLNPFAKGLMGRRGKTPTIVFVWSGLLSITVSLLWVAISPPEAN 1140

BLAST of Moc04g03770 vs. ExPASy Swiss-Prot
Match: Q9LFL0 (Cellulose synthase-like protein D2 OS=Arabidopsis thaliana OX=3702 GN=CSLD2 PE=3 SV=1)

HSP 1 Score: 1659.4 bits (4296), Expect = 0.0e+00
Identity = 815/1138 (71.62%), Positives = 940/1138 (82.60%), Query Frame = 0

Query: 19   SNNS---TSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG-DYINYTVHIPPTPD 78
            SNNS      R   G +VKFA+RTSSGRY++ SR+DLD   E+ G D+++YTVHIPPTPD
Sbjct: 15   SNNSDIQEPGRPPAGHSVKFAQRTSSGRYINYSRDDLD--SELGGQDFMSYTVHIPPTPD 74

Query: 79   NQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHPQMAGAKGSSCAMPA 138
            NQPMD SI+ K EEQYV+NS+FTGGF S TRAHLM KVI++E +HPQMAG+KGSSCA+P 
Sbjct: 75   NQPMDPSISQKVEEQYVANSMFTGGFKSNTRAHLMHKVIETEPNHPQMAGSKGSSCAIPG 134

Query: 139  CDGKVMKDERGIDITPCECRFRICRDCYLDAQK-DTGLCPGCKELYKSADYDDDPNEYSG 198
            CD KVM DERG D+ PCEC F+ICRDC++DA K   G+CPGCKE YK+    D  +E   
Sbjct: 135  CDAKVMSDERGQDLLPCECDFKICRDCFIDAVKTGGGICPGCKEPYKNTHLTDQVDENGQ 194

Query: 199  GALQLQGPDGSKGGQNMSMMK--------LNQSGEFDHNKWLFETKGTYGVGNAYWTPED 258
                L G  GSK  + +SM+K         +Q+G+FDHN+WLFET GTYG GNA+WT + 
Sbjct: 195  QRPMLPGGGGSKMERRLSMVKSTNKSALMRSQTGDFDHNRWLFETTGTYGYGNAFWTKDG 254

Query: 259  GYADGGD-DKFRDGV-MESMD---ASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 318
             +  G D D   DG+ ME+ D     W+PL+R   IPA +ISPYRLLI IR+VVL  FL 
Sbjct: 255  DFGSGKDGDGDGDGMGMEAQDLMSRPWRPLTRKLKIPAGVISPYRLLIFIRIVVLALFLT 314

Query: 319  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 378
            WRVKH N DA+WLW MS+VCE+WFA SW+LDQ+PKLCP+NRATDLQVL +KF+ P  SNP
Sbjct: 315  WRVKHQNPDAVWLWGMSVVCELWFALSWLLDQLPKLCPINRATDLQVLKEKFETPTASNP 374

Query: 379  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 438
            TG+SDLPG D+FVSTADPEKEP LVTANTILSILA +YPVEKL+CY+SDDGGALLTFEAM
Sbjct: 375  TGKSDLPGFDVFVSTADPEKEPPLVTANTILSILAAEYPVEKLSCYVSDDGGALLTFEAM 434

Query: 439  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 498
            AEAASFA++WVPFCRKH IEPRNP+SYFSLK DP KNK +SDFVKDRR++KRE+DEFKVR
Sbjct: 435  AEAASFANIWVPFCRKHAIEPRNPDSYFSLKRDPYKNKVKSDFVKDRRRVKREFDEFKVR 494

Query: 499  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGAD-AMEPIKVQKATWMADGTHWPGTWGVS 558
             N LPDSIRRRSDA++AREE+K  K  R+   D  MEP+K+ KATWMADGTHWPGTW  S
Sbjct: 495  VNSLPDSIRRRSDAYHAREEIKAMKMQRQNRDDEPMEPVKIPKATWMADGTHWPGTWLTS 554

Query: 559  ASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHN 618
            ASDH+KGDHAGI+QVMLKPPS +PL G   E  +D +DVD RLP+ VYVSREKRPGYDHN
Sbjct: 555  ASDHAKGDHAGIIQVMLKPPSDEPLHG-VSEGFLDLTDVDIRLPLLVYVSREKRPGYDHN 614

Query: 619  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQR 678
            KKAGAMNALVRASA++SNGPFILNLDCDHYIYN +A+REGMCFMMDRGG+ +CY+QFPQR
Sbjct: 615  KKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSEALREGMCFMMDRGGDRLCYVQFPQR 674

Query: 679  FEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKI---- 738
            FEGIDPSDRYANHNTVFFD NMRALDG+ GPVYVGTGC+FRR ALYGF+PP+        
Sbjct: 675  FEGIDPSDRYANHNTVFFDVNMRALDGLMGPVYVGTGCLFRRIALYGFNPPRSKDFSPSC 734

Query: 739  -------KHKNNDQAETQPLQATDF-DPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLA 798
                     K N   E + L+ +D+ D +++++L+PK+FGNS  L +SI VAEFQGRPLA
Sbjct: 735  WSCCFPRSKKKNIPEENRALRMSDYDDEEMNLSLVPKKFGNSTFLIDSIPVAEFQGRPLA 794

Query: 799  DHSAVKYGRPPGALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVT 858
            DH AVK GRPPGAL +PRE LDA+TVAEA++VISCWYEDKTEWG R+GWIYGSVTEDVVT
Sbjct: 795  DHPAVKNGRPPGALTIPRELLDASTVAEAIAVISCWYEDKTEWGSRIGWIYGSVTEDVVT 854

Query: 859  GYRMHNRGWHSVYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRL 918
            GYRMHNRGW SVYC+TKRDAFRG+APINLTDRLHQVLRWATGSVEIFFSRNNALLAS ++
Sbjct: 855  GYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALLASSKM 914

Query: 919  KLLQRLAYLNVGIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISL 978
            K+LQR+AYLNVGIYPFTSIFLIVYCFLPALSLFSG FIVQTLNVTFLVYLLII+I L  L
Sbjct: 915  KILQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLVYLLIISITLCLL 974

Query: 979  AILEVKWSGIGLEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDEN 1038
            A+LE+KWSGI LEEWWRNEQFWLI GTSAHLAAV+QGLLKV+AG+EISFTLTSKS GD+ 
Sbjct: 975  ALLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVVAGVEISFTLTSKSGGDDI 1034

Query: 1039 EDIYADLYLVKWTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLA 1098
            +D +ADLY+VKWTSLM+PPI I M+N+IA+ V FSRTIYS VPQWSK IGG FFSFWVLA
Sbjct: 1035 DDEFADLYMVKWTSLMIPPITIIMVNLIAIAVGFSRTIYSVVPQWSKLIGGVFFSFWVLA 1094

Query: 1099 HLYPFAKGLMGRRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            HLYPFAKGLMGRRG+TPTIV VWSGL+AIT+SLLW+AI+PP   T      GG F FP
Sbjct: 1095 HLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWVAINPPAGNTEI----GGNFSFP 1145

BLAST of Moc04g03770 vs. ExPASy Swiss-Prot
Match: Q9M9M4 (Cellulose synthase-like protein D3 OS=Arabidopsis thaliana OX=3702 GN=CSLD3 PE=1 SV=1)

HSP 1 Score: 1651.7 bits (4276), Expect = 0.0e+00
Identity = 804/1122 (71.66%), Positives = 926/1122 (82.53%), Query Frame = 0

Query: 29   VGQTVKFARRTSSGRYVSLSREDLDMSGEVSGDYINYTVHIPPTPDNQPMDSSIATKAEE 88
            V  +V FARRT SGRYV+ SR+DLD S   S D   Y+VHIPPTPDNQPMD SI+ K EE
Sbjct: 30   VSNSVTFARRTPSGRYVNYSRDDLD-SELGSVDLTGYSVHIPPTPDNQPMDPSISQKVEE 89

Query: 89   QYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHPQMAGAKGSSCAMPACDGKVMKDERGIDI 148
            QYVSNSLFTGGFNSVTRAHLM+KVID+E SHPQMAGAKGSSCA+P CD KVM DERG D+
Sbjct: 90   QYVSNSLFTGGFNSVTRAHLMEKVIDTETSHPQMAGAKGSSCAVPGCDVKVMSDERGQDL 149

Query: 149  TPCECRFRICRDCYLDAQKDTGLCPGCKELYKSADYDDDPNEYSGGALQLQGP-DGSKGG 208
             PCEC F+ICRDC++DA K  G+CPGCKE Y++ D  D  +        L  P  GSK  
Sbjct: 150  LPCECDFKICRDCFMDAVKTGGMCPGCKEPYRNTDLADFADNNKQQRPMLPPPAGGSKMD 209

Query: 209  QNMSMMK-------LNQSGEFDHNKWLFETKGTYGVGNAYWTPEDGYADGGDDKFRD-GV 268
            + +S+MK        +Q+G+FDHN+WLFET GTYG GNA+WT +  +    D      G 
Sbjct: 210  RRLSLMKSTKSGLMRSQTGDFDHNRWLFETSGTYGFGNAFWTKDGNFGSDKDGNGHGMGP 269

Query: 269  MESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLHWRVKHPNEDAIWLWLMSIV 328
             + M   W+PL+R   IPA++ISPYRLLILIR+VVL  FL WR+KH N DAIWLW MS+V
Sbjct: 270  QDLMSRPWRPLTRKLQIPAAVISPYRLLILIRIVVLALFLMWRIKHKNPDAIWLWGMSVV 329

Query: 329  CEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNPTGRSDLPGVDLFVSTADPE 388
            CE+WFA SW+LDQ+PKLCP+NRATDL VL +KF+ P PSNPTG+SDLPG+D+FVSTADPE
Sbjct: 330  CELWFALSWLLDQLPKLCPINRATDLNVLKEKFETPTPSNPTGKSDLPGLDMFVSTADPE 389

Query: 389  KEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAMAEAASFADLWVPFCRKHDI 448
            KEP LVT+NTILSILA DYPVEKLACY+SDDGGALLTFEAMAEAASFA++WVPFCRKH+I
Sbjct: 390  KEPPLVTSNTILSILAADYPVEKLACYVSDDGGALLTFEAMAEAASFANMWVPFCRKHNI 449

Query: 449  EPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVRTNGLPDSIRRRSDAFNARE 508
            EPRNP+SYFSLK DP KNK ++DFVKDRR++KREYDEFKVR N LPDSIRRRSDA++ARE
Sbjct: 450  EPRNPDSYFSLKRDPYKNKVKADFVKDRRRVKREYDEFKVRINSLPDSIRRRSDAYHARE 509

Query: 509  EMKMWKHMRET-GADAMEPIKVQKATWMADGTHWPGTWGVSASDHSKGDHAGILQVMLKP 568
            E+K  K  R+    + +EP+K+ KATWMADGTHWPGTW  S  DHS+ DHAGI+QVMLKP
Sbjct: 510  EIKAMKLQRQNRDEEIVEPVKIPKATWMADGTHWPGTWINSGPDHSRSDHAGIIQVMLKP 569

Query: 569  PSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNKKAGAMNALVRASAVLSNG 628
            PS +PL G   E  +D +DVD RLP+ VYVSREKRPGYDHNKKAGAMNALVRASA++SNG
Sbjct: 570  PSDEPLHG-VSEGFLDLTDVDIRLPLLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNG 629

Query: 629  PFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRFEGIDPSDRYANHNTVFFD 688
            PFILNLDCDHYIYN +A+REGMCFMMDRGG+ +CY+QFPQRFEGIDPSDRYANHNTVFFD
Sbjct: 630  PFILNLDCDHYIYNSQALREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFD 689

Query: 689  GNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQP------------DKIKHKNNDQAETQ 748
             NMRALDG+ GPVYVGTGC+FRR ALYGFDPP+              + K K+    E +
Sbjct: 690  VNMRALDGLMGPVYVGTGCLFRRIALYGFDPPRAKEHHPGFCSCCFSRKKKKSRVPEENR 749

Query: 749  PLQA---TDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGALRV 808
             L+    +D D +++++L+PK+FGNS  L +SI VAEFQGRPLADH AV+ GRPPGAL +
Sbjct: 750  SLRMGGDSDDDEEMNLSLVPKKFGNSTFLIDSIPVAEFQGRPLADHPAVQNGRPPGALTI 809

Query: 809  PREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVYCIT 868
            PRE LDA+TVAEA++VISCWYEDKTEWG R+GWIYGSVTEDVVTGYRMHNRGW SVYC+T
Sbjct: 810  PRELLDASTVAEAIAVISCWYEDKTEWGSRIGWIYGSVTEDVVTGYRMHNRGWKSVYCVT 869

Query: 869  KRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGIYPF 928
            KRDAFRG+APINLTDRLHQVLRWATGSVEIFFSRNNA  AS R+K+LQR+AYLNVGIYPF
Sbjct: 870  KRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNAFFASPRMKILQRIAYLNVGIYPF 929

Query: 929  TSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLEEWW 988
            TS FLIVYCFLPALSLFSG FIVQTLNVTFLVYLLII+I L  LA+LE+KWSGI LEEWW
Sbjct: 930  TSFFLIVYCFLPALSLFSGQFIVQTLNVTFLVYLLIISITLCLLALLEIKWSGISLEEWW 989

Query: 989  RNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWTSLM 1048
            RNEQFWLI GTSAHLAAV+QGLLKV+AGIEISFTLTSKS G++ +D +ADLY+VKWTSLM
Sbjct: 990  RNEQFWLIGGTSAHLAAVIQGLLKVVAGIEISFTLTSKSGGEDVDDEFADLYIVKWTSLM 1049

Query: 1049 VPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRRGKT 1108
            +PPI I M+N+IA+ V FSRTIYS +PQWSK IGG FFSFWVLAHLYPFAKGLMGRRG+T
Sbjct: 1050 IPPITIMMVNLIAIAVGFSRTIYSVIPQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRGRT 1109

Query: 1109 PTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            PTIV VWSGL+AIT+SLLW+AI+PP  +T      GG F FP
Sbjct: 1110 PTIVYVWSGLVAITISLLWVAINPPAGSTQI----GGSFTFP 1145

BLAST of Moc04g03770 vs. ExPASy Swiss-Prot
Match: A2YU42 (Cellulose synthase-like protein D2 OS=Oryza sativa subsp. indica OX=39946 GN=CSLD2 PE=3 SV=1)

HSP 1 Score: 1645.2 bits (4259), Expect = 0.0e+00
Identity = 814/1155 (70.48%), Positives = 931/1155 (80.61%), Query Frame = 0

Query: 14   RSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDL--------DMSGEVSGDYINY 73
            ++PGG        G     V FARRT SGRYVS SR+DL        DMS E   +++NY
Sbjct: 30   QAPGG--------GGDRPMVTFARRTHSGRYVSYSRDDLDSELGNSGDMSPESGQEFLNY 89

Query: 74   TVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHPQMAGA 133
             V IP TPDNQPMD +I+ + EEQYVSNSLFTGGFNSVTRAHLMDKVI+SE SHPQMAGA
Sbjct: 90   HVTIPATPDNQPMDPAISARVEEQYVSNSLFTGGFNSVTRAHLMDKVIESEASHPQMAGA 149

Query: 134  KGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYKSADYD 193
            KGSSCA+  CD KVM DERG DI PCEC F+IC DC+ DA K+ G CPGCK+ YK+ + D
Sbjct: 150  KGSSCAINGCDAKVMSDERGDDILPCECDFKICADCFADAVKNGGACPGCKDPYKATELD 209

Query: 194  DDPNEYSGGALQLQGPDG----SKGGQNMSMMK------LNQSGEFDHNKWLFETKGTYG 253
            D     +   L L  P G    S+  + +S+M+       +Q+G++DHN+WLFETKGTYG
Sbjct: 210  DVVG--ARPTLSLPPPPGGLPASRMERRLSIMRSQKAMTRSQTGDWDHNRWLFETKGTYG 269

Query: 254  VGNAYWTPEDGYADGGDDKFRDGV-------MESMDASWKPLSRTFPIPASIISPYRLLI 313
             GNA W  E+   +GG      G+        E     W+PL+R   IPA ++SPYRLLI
Sbjct: 270  YGNAIWPKENEVDNGGGGGGGGGLGGGDGQPAEFTSKPWRPLTRKLKIPAGVLSPYRLLI 329

Query: 314  LIRLVVLGFFLHWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVL 373
            LIR+ VLG FL WR+KH NEDA+WLW MS+VCE+WF  SW+LDQ+PKLCPVNRATDL VL
Sbjct: 330  LIRMAVLGLFLAWRIKHKNEDAMWLWGMSVVCELWFGLSWLLDQLPKLCPVNRATDLAVL 389

Query: 374  HDKFDAPCPSNPTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYIS 433
             DKF+ P PSNP GRSDLPG+D+FVSTADPEKEP LVTANTILSILA DYPVEKL+CY+S
Sbjct: 390  KDKFETPTPSNPNGRSDLPGLDIFVSTADPEKEPPLVTANTILSILAADYPVEKLSCYVS 449

Query: 434  DDGGALLTFEAMAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRR 493
            DDGGALLTFEAMAEAASFA++WVPFCRKHDIEPRNPESYF+LK DP KNK RSDFVKDRR
Sbjct: 450  DDGGALLTFEAMAEAASFANMWVPFCRKHDIEPRNPESYFNLKRDPYKNKVRSDFVKDRR 509

Query: 494  KIKREYDEFKVRTNGLPDSIRRRSDAFNAREEMKMWKHMRETGA-DAMEPIKVQKATWMA 553
            ++KREYDEFKVR N LPDSIRRRSDA++AREE+K  K  RE    D +E +K+ KATWMA
Sbjct: 510  RVKREYDEFKVRINSLPDSIRRRSDAYHAREEIKAMKRQREAALDDVVEAVKIPKATWMA 569

Query: 554  DGTHWPGTWGVSASDHSKGDHAGILQVMLKPPSPDPLIGSTDE--KIIDFSDVDTRLPMF 613
            DGTHWPGTW   +++H++GDHAGI+QVMLKPPS DPL G++ E  + +DF++VD RLPM 
Sbjct: 570  DGTHWPGTWIQPSAEHARGDHAGIIQVMLKPPSDDPLYGTSSEEGRPLDFTEVDIRLPML 629

Query: 614  VYVSREKRPGYDHNKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMD 673
            VYVSREKRPGYDHNKKAGAMNALVR+SAV+SNGPFILNLDCDHY+YN +A REGMCFMMD
Sbjct: 630  VYVSREKRPGYDHNKKAGAMNALVRSSAVMSNGPFILNLDCDHYVYNSQAFREGMCFMMD 689

Query: 674  RGGEDICYIQFPQRFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALY 733
            RGG+ I Y+QFPQRFEGIDPSDRYANHNTVFFD NMRALDGI GPVYVGTGC+FRR ALY
Sbjct: 690  RGGDRIGYVQFPQRFEGIDPSDRYANHNTVFFDVNMRALDGIMGPVYVGTGCLFRRIALY 749

Query: 734  GFDP--------------PQPDKIKHKNNDQAETQPLQATDF-DPDLDVNLLPKRFGNSN 793
            GFDP              PQ  K+K       E Q L+  DF D +++++  PK+FGNSN
Sbjct: 750  GFDPPRSKEHSGCCSCCFPQRRKVKTSTVASEERQALRMADFDDEEMNMSQFPKKFGNSN 809

Query: 794  MLAESILVAEFQGRPLADHSAVKYGRPPGALRVPREPLDAATVAEAVSVISCWYEDKTEW 853
             L  SI +AEFQGRPLADH  VK GRPPGAL VPR+ LDA+TVAEA+SVISCWYEDKTEW
Sbjct: 810  FLINSIPIAEFQGRPLADHPGVKNGRPPGALTVPRDLLDASTVAEAISVISCWYEDKTEW 869

Query: 854  GERVGWIYGSVTEDVVTGYRMHNRGWHSVYCITKRDAFRGSAPINLTDRLHQVLRWATGS 913
            G+RVGWIYGSVTEDVVTGYRMHNRGW SVYC+TKRDAFRG+APINLTDRLHQVLRWATGS
Sbjct: 870  GQRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGS 929

Query: 914  VEIFFSRNNALLASRRLKLLQRLAYLNVGIYPFTSIFLIVYCFLPALSLFSGNFIVQTLN 973
            VEIFFSRNNALLASR++K LQR+AYLNVGIYPFTSIFLIVYCFLPALSLFSG FIV+TLN
Sbjct: 930  VEIFFSRNNALLASRKMKFLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVRTLN 989

Query: 974  VTFLVYLLIITICLISLAILEVKWSGIGLEEWWRNEQFWLISGTSAHLAAVVQGLLKVIA 1033
            VTFL YLL+IT+ +  LA+LE+KWSGI LEEWWRNEQFWLI GTSAHLAAV+QGLLKVIA
Sbjct: 990  VTFLTYLLVITLTMCMLAVLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVIA 1049

Query: 1034 GIEISFTLTSKSAGDENEDIYADLYLVKWTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVP 1093
            GIEISFTLTSKS GDE +D +ADLY+VKWTSLM+PPIVI M+N+IA+ V FSRTIYS +P
Sbjct: 1050 GIEISFTLTSKSGGDEADDEFADLYIVKWTSLMIPPIVIMMVNLIAIAVGFSRTIYSEIP 1109

Query: 1094 QWSKFIGGAFFSFWVLAHLYPFAKGLMGRRGKTPTIVIVWSGLIAITLSLLWIAISPPKS 1126
            QWSK +GG FFSFWVLAHLYPFAKGLMGRRG+TPTIV VWSGL+AIT+SLLW+AI+PP  
Sbjct: 1110 QWSKLLGGVFFSFWVLAHLYPFAKGLMGRRGRTPTIVFVWSGLLAITISLLWVAINPPSQ 1169

BLAST of Moc04g03770 vs. ExPASy TrEMBL
Match: A0A6J1C3F4 (cellulose synthase-like protein D4 OS=Momordica charantia OX=3673 GN=LOC111007586 PE=4 SV=1)

HSP 1 Score: 2302.7 bits (5966), Expect = 0.0e+00
Identity = 1125/1125 (100.00%), Positives = 1125/1125 (100.00%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY 240
            SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY
Sbjct: 181  SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY 240

Query: 241  WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 300
            WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH
Sbjct: 241  WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 300

Query: 301  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 360
            WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP
Sbjct: 301  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 360

Query: 361  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 420
            TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM
Sbjct: 361  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 420

Query: 421  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 480
            AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR
Sbjct: 421  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 480

Query: 481  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSA 540
            TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSA
Sbjct: 481  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVSA 540

Query: 541  SDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNK 600
            SDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNK
Sbjct: 541  SDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNK 600

Query: 601  KAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRF 660
            KAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRF
Sbjct: 601  KAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRF 660

Query: 661  EGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNN 720
            EGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNN
Sbjct: 661  EGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKNN 720

Query: 721  DQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGA 780
            DQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGA
Sbjct: 721  DQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGA 780

Query: 781  LRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVY 840
            LRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVY
Sbjct: 781  LRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVY 840

Query: 841  CITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGI 900
            CITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGI
Sbjct: 841  CITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGI 900

Query: 901  YPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLE 960
            YPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLE
Sbjct: 901  YPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLE 960

Query: 961  EWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWT 1020
            EWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWT
Sbjct: 961  EWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWT 1020

Query: 1021 SLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRR 1080
            SLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRR
Sbjct: 1021 SLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRR 1080

Query: 1081 GKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            GKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP
Sbjct: 1081 GKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1125

BLAST of Moc04g03770 vs. ExPASy TrEMBL
Match: A0A6J1JNY4 (cellulose synthase-like protein D4 OS=Cucurbita maxima OX=3661 GN=LOC111486408 PE=4 SV=1)

HSP 1 Score: 2136.3 bits (5534), Expect = 0.0e+00
Identity = 1042/1127 (92.46%), Positives = 1081/1127 (95.92%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MA+LTNQPSKKAIRSPG S NSTS R + GQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MATLTNQPSKKAIRSPGASANSTSIRAASGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIA+KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIASKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERG DITPCECRFRICRDCYLDA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGKDITPCECRFRICRDCYLDALKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              DY++D NEYS  ALQL GPDGSKGG QNMSMMKLNQSGEFDHNKWLFE+KGTYGVG+A
Sbjct: 181  VGDYEEDSNEYS--ALQLHGPDGSKGGSQNMSMMKLNQSGEFDHNKWLFESKGTYGVGSA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            YWTP+DGY +GG+D F DG+ME+MD  WKPLSRTFPIPASIISPYRLLIL+RLVVL FFL
Sbjct: 241  YWTPDDGYGNGGNDNFGDGMMEAMDKPWKPLSRTFPIPASIISPYRLLILVRLVVLAFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSIVCEIWFAFSWILDQ+PKLCPVNRATDLQVL+DKFDAP P N
Sbjct: 301  HWRVQHPNEDAIWLWLMSIVCEIWFAFSWILDQIPKLCPVNRATDLQVLYDKFDAPSPLN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACY+SDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYVSDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGA-DAMEPIKVQKATWMADGTHWPGTWGV 540
            RTNGLPDSIRRRS+AFNAREEMKMWK M+E G  DAMEPIKVQKATWMADG+HWPGTW V
Sbjct: 481  RTNGLPDSIRRRSEAFNAREEMKMWKLMKEKGGPDAMEPIKVQKATWMADGSHWPGTWVV 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
               DHSKGDH+GILQVMLKPPS DPL+GSTDEKIIDF+DVD RLPMFVYVSREKRPGYDH
Sbjct: 541  PTGDHSKGDHSGILQVMLKPPSHDPLLGSTDEKIIDFTDVDIRLPMFVYVSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVR+SAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRSSAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDK+K  
Sbjct: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKMKQA 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
             NDQ ETQPLQ TDFDPDLDVNLLPKRFGNS MLAESILVAEFQGRP+ADH AVKYGRPP
Sbjct: 721  KNDQPETQPLQPTDFDPDLDVNLLPKRFGNSTMLAESILVAEFQGRPIADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDAATVAEAVSVISCWYEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGWHS
Sbjct: 781  GALRVPRQPLDAATVAEAVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWHS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA LA+RRLKLLQRLAYLNV
Sbjct: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAFLATRRLKLLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTSIFLIVYCFLPALSLFSGNFIVQ+LN TFL+YLLIITICLISLA+LEVKWSGIG
Sbjct: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQSLNATFLIYLLIITICLISLAVLEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKS+GDENEDIYADLYLVK
Sbjct: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSSGDENEDIYADLYLVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            WTSLMVPPIVIAMMNIIAM VAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WTSLMVPPIVIAMMNIIAMVVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            RRGKTPTIVIVWSGLIAITLSLLWIAISPPK+   +AA+GGGGFEFP
Sbjct: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKAEDADAAIGGGGFEFP 1125

BLAST of Moc04g03770 vs. ExPASy TrEMBL
Match: A0A6J1EK32 (cellulose synthase-like protein D4 OS=Cucurbita moschata OX=3662 GN=LOC111435210 PE=4 SV=1)

HSP 1 Score: 2135.5 bits (5532), Expect = 0.0e+00
Identity = 1042/1127 (92.46%), Positives = 1080/1127 (95.83%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MA+LTNQPSKKAIRSPG S NSTS R + GQTVKFARRTSSGRYVSLSREDLDMSGEVSG
Sbjct: 1    MATLTNQPSKKAIRSPGASANSTSIRAASGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPMDSSIA+KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMDSSIASKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDGKVMKDERG DITPCECRFRICRDCYLDA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGKVMKDERGKDITPCECRFRICRDCYLDALKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              DY++D NEYS  ALQL GPDGSKGG QNMSMMKLNQ+GEFDHNKWLFE+KGTYGVG+A
Sbjct: 181  VGDYEEDSNEYS--ALQLHGPDGSKGGSQNMSMMKLNQTGEFDHNKWLFESKGTYGVGSA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            YWTPEDGY +GG+D F DG+ME+MD  WKPLSRTFPIPASIISPYRLLIL+R VVLGFFL
Sbjct: 241  YWTPEDGYGNGGNDNFGDGMMEAMDKPWKPLSRTFPIPASIISPYRLLILVRFVVLGFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSIVCEIWFAFSWILDQ+PKLCPVNRATDLQVL+DKFDAP P N
Sbjct: 301  HWRVRHPNEDAIWLWLMSIVCEIWFAFSWILDQIPKLCPVNRATDLQVLYDKFDAPSPLN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACY+SDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYVSDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGA-DAMEPIKVQKATWMADGTHWPGTWGV 540
            RTNGLPDSIRRRS+AFNAREEMKMWK M+E G  DAMEPIKVQKATWMADG+HWPGTW V
Sbjct: 481  RTNGLPDSIRRRSEAFNAREEMKMWKLMKEKGGPDAMEPIKVQKATWMADGSHWPGTWVV 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
               DHSKGDH+GILQVMLKPPS DPL+GSTDEKIIDF+DVD RLPMFVYVSREKRPGYDH
Sbjct: 541  PTGDHSKGDHSGILQVMLKPPSHDPLLGSTDEKIIDFTDVDIRLPMFVYVSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVR+SAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRSSAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDK+K  
Sbjct: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKMKQA 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
             +DQ ETQPLQ TDFDPDLDVNLLPKRFGNS MLAESILVAEFQGRP+ADH AVKYGRPP
Sbjct: 721  KSDQPETQPLQPTDFDPDLDVNLLPKRFGNSTMLAESILVAEFQGRPIADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS
Sbjct: 781  GALRVPRQPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA LA+RRLK LQRLAYLNV
Sbjct: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAFLATRRLKFLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTSIFLIVYCFLPALSLFSGNFIVQ+LN TFL+YLLIITICLISLAILEVKWSGIG
Sbjct: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQSLNATFLIYLLIITICLISLAILEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKS+GDENEDIYADLYLVK
Sbjct: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSSGDENEDIYADLYLVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            WTSLMVPPIVIAMMNIIAM VAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WTSLMVPPIVIAMMNIIAMVVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            RRGKTPTIVIVWSGLIAITLSLLWIAISPPK+   +AA+GGGGFEFP
Sbjct: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKAEDADAAIGGGGFEFP 1125

BLAST of Moc04g03770 vs. ExPASy TrEMBL
Match: A0A0A0K6B5 (Cellulose synthase OS=Cucumis sativus OX=3659 GN=Csa_7G029410 PE=4 SV=1)

HSP 1 Score: 2125.9 bits (5507), Expect = 0.0e+00
Identity = 1035/1127 (91.84%), Positives = 1084/1127 (96.18%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MASLTNQPSKKAIRSPGGS N+TSNRGS GQTVKFARRTSSGRYVSLSREDLDMSGE+SG
Sbjct: 1    MASLTNQPSKKAIRSPGGSTNATSNRGSSGQTVKFARRTSSGRYVSLSREDLDMSGEISG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPM+SS+ +KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDS+V+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMESSVISKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSQVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSC MPACDGKVMKD+RG D+TPCECRFRICR+C++DA K+TGLCPGCKE Y+
Sbjct: 121  QMAGAKGSSCGMPACDGKVMKDDRGQDMTPCECRFRICRECHIDAAKETGLCPGCKEPYR 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
            + D DDDPN+YS G LQL+GPDGSKGG QNMSMMKLNQ G+FDHNKWLFE+KGTYGVGNA
Sbjct: 181  TGDIDDDPNDYSNGTLQLKGPDGSKGGSQNMSMMKLNQGGDFDHNKWLFESKGTYGVGNA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            Y+   D Y DG DDKFR+G++ESMD  WKPLSRTFPIPASIISPYRLLIL+RLVVLGFFL
Sbjct: 241  YF---DDY-DGEDDKFREGMLESMDKPWKPLSRTFPIPASIISPYRLLILVRLVVLGFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSI+CEIWFAFSWILDQ+PKLCPVNRATDLQVLHDKFDAP PSN
Sbjct: 301  HWRVQHPNEDAIWLWLMSIICEIWFAFSWILDQIPKLCPVNRATDLQVLHDKFDAPSPSN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVD+FVSTADPEKEPVLVTANTILSILA DYPVEKLACYISDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDMFVSTADPEKEPVLVTANTILSILAADYPVEKLACYISDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVS 540
            RTNGLPDSIRRRSDAFNAREEMKMWKHM+ETGADAMEPIKVQKATWMADG+HWPGTW V 
Sbjct: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMKETGADAMEPIKVQKATWMADGSHWPGTWVVP 540

Query: 541  ASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHN 600
            + DHSKGDHAGILQVMLKPPS DPL+GS DEKI+DF+DVD RLPMFVYVSREKRPGYDHN
Sbjct: 541  SGDHSKGDHAGILQVMLKPPSHDPLMGSADEKIVDFTDVDIRLPMFVYVSREKRPGYDHN 600

Query: 601  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQR 660
            KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQR
Sbjct: 601  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQR 660

Query: 661  FEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKN 720
            FEGIDPSDRYANHNTVFFDGNMRALDG+QGPVYVGTGCMFRRFALYGFDPPQPDK K K 
Sbjct: 661  FEGIDPSDRYANHNTVFFDGNMRALDGVQGPVYVGTGCMFRRFALYGFDPPQPDKTKPK- 720

Query: 721  NDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPG 780
            ND AETQPL++TDFDPDLDVNLLPKRFGNSNMLA+SI VAEFQGRPLADHSAVKYGRPPG
Sbjct: 721  NDSAETQPLRSTDFDPDLDVNLLPKRFGNSNMLADSIPVAEFQGRPLADHSAVKYGRPPG 780

Query: 781  ALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV 840
            ALR+PR PLDA TVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV
Sbjct: 781  ALRLPRPPLDAPTVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV 840

Query: 841  YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVG 900
            YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVG
Sbjct: 841  YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVG 900

Query: 901  IYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGL 960
            IYPFTSIFLIVYCFLPALSLFSG FIVQTLNVTFL+YLLIIT+CLISLAILEVKWSGIGL
Sbjct: 901  IYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLIYLLIITVCLISLAILEVKWSGIGL 960

Query: 961  EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKW 1020
            EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKS+GD+ EDIYADLYLVKW
Sbjct: 961  EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSSGDDVEDIYADLYLVKW 1020

Query: 1021 TSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR 1080
            TSLMVPPIVIAMMNIIAM VAFSRTIYS+VPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR
Sbjct: 1021 TSLMVPPIVIAMMNIIAMAVAFSRTIYSSVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR 1080

Query: 1081 RGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVG-GGGFEFP 1126
            RGKTPTIVIVWSGLIAITLSLLWIAI+PPK +  +AAVG GGGF+FP
Sbjct: 1081 RGKTPTIVIVWSGLIAITLSLLWIAINPPKPSAEDAAVGAGGGFQFP 1122

BLAST of Moc04g03770 vs. ExPASy TrEMBL
Match: A0A5A7V1C4 (Cellulose synthase-like protein D4 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold54G00100 PE=4 SV=1)

HSP 1 Score: 2123.6 bits (5501), Expect = 0.0e+00
Identity = 1037/1127 (92.01%), Positives = 1081/1127 (95.92%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MASLTNQPSKKAIRSPGGS N+TSNRGS GQTVKFARRTSSGRYVSLSREDLDMSGE+SG
Sbjct: 1    MASLTNQPSKKAIRSPGGSTNATSNRGSSGQTVKFARRTSSGRYVSLSREDLDMSGEISG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DYINYTVHIPPTPDNQPM+SS+ +KAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEV+HP
Sbjct: 61   DYINYTVHIPPTPDNQPMESSVISKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSC MPACDGKVMKD+RG DITPCECRF+ICRDC+LDA K+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCGMPACDGKVMKDDRGQDITPCECRFKICRDCHLDAVKETGLCPGCKEPYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG-QNMSMMKLNQSGEFDHNKWLFETKGTYGVGNA 240
              D+DDD N+YS G LQLQGPDGSKGG QNMSMMKLNQ GEFDHNKWLFE+KGTYGVGNA
Sbjct: 181  IGDFDDDSNDYSNGTLQLQGPDGSKGGSQNMSMMKLNQGGEFDHNKWLFESKGTYGVGNA 240

Query: 241  YWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFL 300
            Y+   D Y DG DDKFR+G+MESMD  WKPLSRTFPIPASIISPYRLLIL+RLVVLGFFL
Sbjct: 241  YY---DDY-DGEDDKFREGMMESMDKPWKPLSRTFPIPASIISPYRLLILVRLVVLGFFL 300

Query: 301  HWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSN 360
            HWRV+HPNEDAIWLWLMSIVCEIWFAFSWILDQ+PKLCPVNRATDLQVLHDKFDAP PSN
Sbjct: 301  HWRVQHPNEDAIWLWLMSIVCEIWFAFSWILDQIPKLCPVNRATDLQVLHDKFDAPSPSN 360

Query: 361  PTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEA 420
            PTGRSDLPGVD+FVSTADPEKEPVLVTANTILSILA DYPVEKLACYISDDGGALLTFEA
Sbjct: 361  PTGRSDLPGVDMFVSTADPEKEPVLVTANTILSILAADYPVEKLACYISDDGGALLTFEA 420

Query: 421  MAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480
            MAEAASFADLWVPFCRKH+IEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV
Sbjct: 421  MAEAASFADLWVPFCRKHNIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKV 480

Query: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGVS 540
            RTNGLPDSIRRRSDAFNAREEMKMWKHM+ETGADAMEPIKVQKATWMADG+HWPGTW V 
Sbjct: 481  RTNGLPDSIRRRSDAFNAREEMKMWKHMKETGADAMEPIKVQKATWMADGSHWPGTWVVP 540

Query: 541  ASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHN 600
            + DHSKGDHAGILQVMLKPPS DPL+GS DEKIIDF+DVD RLPMFVYVSREKRPGYDHN
Sbjct: 541  SGDHSKGDHAGILQVMLKPPSHDPLMGSVDEKIIDFTDVDIRLPMFVYVSREKRPGYDHN 600

Query: 601  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQR 660
            KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAI+EGMCFMMDRGGEDICYIQFPQR
Sbjct: 601  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIKEGMCFMMDRGGEDICYIQFPQR 660

Query: 661  FEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHKN 720
            FEGIDPSDRYANHNTVFFDGNMRALDG+QGPVYVGTGCMFRRFALYGFDPPQPDKI HK 
Sbjct: 661  FEGIDPSDRYANHNTVFFDGNMRALDGVQGPVYVGTGCMFRRFALYGFDPPQPDKITHK- 720

Query: 721  NDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPG 780
            ND AETQPL++++ DPDLDVNLLPKRFGNS MLA+SI VAEFQGRPLADHSAVKYGRPPG
Sbjct: 721  NDSAETQPLRSSELDPDLDVNLLPKRFGNSTMLADSIPVAEFQGRPLADHSAVKYGRPPG 780

Query: 781  ALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV 840
            ALR+PR PLDA TVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV
Sbjct: 781  ALRLPRPPLDAVTVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSV 840

Query: 841  YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVG 900
            YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLK LQRLAYLNVG
Sbjct: 841  YCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKFLQRLAYLNVG 900

Query: 901  IYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGL 960
            IYPFTSIFLIVYCFLPALSLFSG FIVQTLNVTFL+YLLIIT+CLISLAILEVKWSGIGL
Sbjct: 901  IYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLIYLLIITVCLISLAILEVKWSGIGL 960

Query: 961  EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKW 1020
            EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGD+ +DIYADLYLVKW
Sbjct: 961  EEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDDVDDIYADLYLVKW 1020

Query: 1021 TSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR 1080
            TSLMVPPIVIAMMNIIA+ VAFSRTIYS+VPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR
Sbjct: 1021 TSLMVPPIVIAMMNIIAIAVAFSRTIYSSVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGR 1080

Query: 1081 RGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVG-GGGFEFP 1126
            RGKTPTIVIVWSGLIAITLSLLWIAI+PPK +  +AAVG GGGF+FP
Sbjct: 1081 RGKTPTIVIVWSGLIAITLSLLWIAINPPKPSAEDAAVGAGGGFQFP 1122

BLAST of Moc04g03770 vs. TAIR 10
Match: AT4G38190.1 (cellulose synthase like D4 )

HSP 1 Score: 1898.2 bits (4916), Expect = 0.0e+00
Identity = 920/1130 (81.42%), Positives = 1011/1130 (89.47%), Query Frame = 0

Query: 1    MASLTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG 60
            MAS   Q SKK        NNS S     GQTVKFARRTSSGRYVSLSR+++++SGE+SG
Sbjct: 1    MASTPPQTSKKV------RNNSGS-----GQTVKFARRTSSGRYVSLSRDNIELSGELSG 60

Query: 61   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 120
            DY NYTVHIPPTPDNQPM    ATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDS+V+HP
Sbjct: 61   DYSNYTVHIPPTPDNQPM----ATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSDVTHP 120

Query: 121  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 180
            QMAGAKGSSCAMPACDG VMKDERG D+ PCECRF+ICRDC++DAQK+TGLCPGCKE YK
Sbjct: 121  QMAGAKGSSCAMPACDGNVMKDERGKDVMPCECRFKICRDCFMDAQKETGLCPGCKEQYK 180

Query: 181  SADYDDDPNEYSGGALQLQGPDGSKGG--QNMSMMKLNQSGEFDHNKWLFETKGTYGVGN 240
              D DDD  +YS GAL L  P   + G   NMSMMK NQ+GEFDHN+WLFET+GTYG GN
Sbjct: 181  IGDLDDDTPDYSSGALPLPAPGKDQRGNNNNMSMMKRNQNGEFDHNRWLFETQGTYGYGN 240

Query: 241  AYWTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFF 300
            AYW  ++ Y D  D+  R G++E+ D  W+PLSR  PIPA+IISPYRLLI+IR VVL FF
Sbjct: 241  AYWPQDEMYGDDMDEGMRGGMVETADKPWRPLSRRIPIPAAIISPYRLLIVIRFVVLCFF 300

Query: 301  LHWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPS 360
            L WR+++PNEDAIWLWLMSI+CE+WF FSWILDQ+PKLCP+NR+TDL+VL DKFD P PS
Sbjct: 301  LTWRIRNPNEDAIWLWLMSIICELWFGFSWILDQIPKLCPINRSTDLEVLRDKFDMPSPS 360

Query: 361  NPTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFE 420
            NPTGRSDLPG+DLFVSTADPEKEP LVTANTILSILAVDYPVEK++CY+SDDGGALL+FE
Sbjct: 361  NPTGRSDLPGIDLFVSTADPEKEPPLVTANTILSILAVDYPVEKVSCYLSDDGGALLSFE 420

Query: 421  AMAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFK 480
            AMAEAASFADLWVPFCRKH+IEPRNP+SYFSLK+DPTKNKSR DFVKDRRKIKREYDEFK
Sbjct: 421  AMAEAASFADLWVPFCRKHNIEPRNPDSYFSLKIDPTKNKSRIDFVKDRRKIKREYDEFK 480

Query: 481  VRTNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEPIKVQKATWMADGTHWPGTWGV 540
            VR NGLPDSIRRRSDAFNAREEMK  K MRE+G D  EP+KV KATWMADGTHWPGTW  
Sbjct: 481  VRINGLPDSIRRRSDAFNAREEMKALKQMRESGGDPTEPVKVPKATWMADGTHWPGTWAA 540

Query: 541  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 600
            S  +HSKGDHAGILQVMLKPPS DPLIG++D+K+IDFSD DTRLPMFVYVSREKRPGYDH
Sbjct: 541  STREHSKGDHAGILQVMLKPPSSDPLIGNSDDKVIDFSDTDTRLPMFVYVSREKRPGYDH 600

Query: 601  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 660
            NKKAGAMNALVRASA+LSNGPFILNLDCDHYIYNCKA+REGMCFMMDRGGEDICYIQFPQ
Sbjct: 601  NKKAGAMNALVRASAILSNGPFILNLDCDHYIYNCKAVREGMCFMMDRGGEDICYIQFPQ 660

Query: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIKHK 720
            RFEGIDPSDRYAN+NTVFFDGNMRALDG+QGPVYVGTG MFRRFALYGFDPP PDK+  K
Sbjct: 661  RFEGIDPSDRYANNNTVFFDGNMRALDGVQGPVYVGTGTMFRRFALYGFDPPNPDKLLEK 720

Query: 721  NNDQAETQPLQATDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPP 780
               ++ET+ L  +DFDPDLDV  LPKRFGNS +LAESI +AEFQGRPLADH AVKYGRPP
Sbjct: 721  K--ESETEALTTSDFDPDLDVTQLPKRFGNSTLLAESIPIAEFQGRPLADHPAVKYGRPP 780

Query: 781  GALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHS 840
            GALRVPR+PLDA TVAE+VSVISCWYEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGW S
Sbjct: 781  GALRVPRDPLDATTVAESVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRS 840

Query: 841  VYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNV 900
            VYCITKRD+FRGSAPINLTDRLHQVLRWATGSVEIFFSRNNA+LAS+RLK LQRLAYLNV
Sbjct: 841  VYCITKRDSFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNAILASKRLKFLQRLAYLNV 900

Query: 901  GIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIG 960
            GIYPFTS+FLI+YCFLPA SLFSG FIV+TL+++FLVYLL+ITICLI LA+LEVKWSGIG
Sbjct: 901  GIYPFTSLFLILYCFLPAFSLFSGQFIVRTLSISFLVYLLMITICLIGLAVLEVKWSGIG 960

Query: 961  LEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVK 1020
            LEEWWRNEQ+WLISGTS+HL AVVQG+LKVIAGIEISFTLT+KS GD+NEDIYADLY+VK
Sbjct: 961  LEEWWRNEQWWLISGTSSHLYAVVQGVLKVIAGIEISFTLTTKSGGDDNEDIYADLYIVK 1020

Query: 1021 WTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMG 1080
            W+SLM+PPIVIAM+NIIA+ VAF RTIY  VPQWSK IGGAFFSFWVLAHLYPFAKGLMG
Sbjct: 1021 WSSLMIPPIVIAMVNIIAIVVAFIRTIYQAVPQWSKLIGGAFFSFWVLAHLYPFAKGLMG 1080

Query: 1081 RRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAA---VGGGGFEFP 1126
            RRGKTPTIV VW+GLIAIT+SLLW AI+P  +T P AA   VGGGGF+FP
Sbjct: 1081 RRGKTPTIVFVWAGLIAITISLLWTAINP--NTGPAAAAEGVGGGGFQFP 1111

BLAST of Moc04g03770 vs. TAIR 10
Match: AT5G16910.1 (cellulose-synthase like D2 )

HSP 1 Score: 1659.4 bits (4296), Expect = 0.0e+00
Identity = 815/1138 (71.62%), Positives = 940/1138 (82.60%), Query Frame = 0

Query: 19   SNNS---TSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSG-DYINYTVHIPPTPD 78
            SNNS      R   G +VKFA+RTSSGRY++ SR+DLD   E+ G D+++YTVHIPPTPD
Sbjct: 15   SNNSDIQEPGRPPAGHSVKFAQRTSSGRYINYSRDDLD--SELGGQDFMSYTVHIPPTPD 74

Query: 79   NQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHPQMAGAKGSSCAMPA 138
            NQPMD SI+ K EEQYV+NS+FTGGF S TRAHLM KVI++E +HPQMAG+KGSSCA+P 
Sbjct: 75   NQPMDPSISQKVEEQYVANSMFTGGFKSNTRAHLMHKVIETEPNHPQMAGSKGSSCAIPG 134

Query: 139  CDGKVMKDERGIDITPCECRFRICRDCYLDAQK-DTGLCPGCKELYKSADYDDDPNEYSG 198
            CD KVM DERG D+ PCEC F+ICRDC++DA K   G+CPGCKE YK+    D  +E   
Sbjct: 135  CDAKVMSDERGQDLLPCECDFKICRDCFIDAVKTGGGICPGCKEPYKNTHLTDQVDENGQ 194

Query: 199  GALQLQGPDGSKGGQNMSMMK--------LNQSGEFDHNKWLFETKGTYGVGNAYWTPED 258
                L G  GSK  + +SM+K         +Q+G+FDHN+WLFET GTYG GNA+WT + 
Sbjct: 195  QRPMLPGGGGSKMERRLSMVKSTNKSALMRSQTGDFDHNRWLFETTGTYGYGNAFWTKDG 254

Query: 259  GYADGGD-DKFRDGV-MESMD---ASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 318
             +  G D D   DG+ ME+ D     W+PL+R   IPA +ISPYRLLI IR+VVL  FL 
Sbjct: 255  DFGSGKDGDGDGDGMGMEAQDLMSRPWRPLTRKLKIPAGVISPYRLLIFIRIVVLALFLT 314

Query: 319  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 378
            WRVKH N DA+WLW MS+VCE+WFA SW+LDQ+PKLCP+NRATDLQVL +KF+ P  SNP
Sbjct: 315  WRVKHQNPDAVWLWGMSVVCELWFALSWLLDQLPKLCPINRATDLQVLKEKFETPTASNP 374

Query: 379  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 438
            TG+SDLPG D+FVSTADPEKEP LVTANTILSILA +YPVEKL+CY+SDDGGALLTFEAM
Sbjct: 375  TGKSDLPGFDVFVSTADPEKEPPLVTANTILSILAAEYPVEKLSCYVSDDGGALLTFEAM 434

Query: 439  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 498
            AEAASFA++WVPFCRKH IEPRNP+SYFSLK DP KNK +SDFVKDRR++KRE+DEFKVR
Sbjct: 435  AEAASFANIWVPFCRKHAIEPRNPDSYFSLKRDPYKNKVKSDFVKDRRRVKREFDEFKVR 494

Query: 499  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGAD-AMEPIKVQKATWMADGTHWPGTWGVS 558
             N LPDSIRRRSDA++AREE+K  K  R+   D  MEP+K+ KATWMADGTHWPGTW  S
Sbjct: 495  VNSLPDSIRRRSDAYHAREEIKAMKMQRQNRDDEPMEPVKIPKATWMADGTHWPGTWLTS 554

Query: 559  ASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHN 618
            ASDH+KGDHAGI+QVMLKPPS +PL G   E  +D +DVD RLP+ VYVSREKRPGYDHN
Sbjct: 555  ASDHAKGDHAGIIQVMLKPPSDEPLHG-VSEGFLDLTDVDIRLPLLVYVSREKRPGYDHN 614

Query: 619  KKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQR 678
            KKAGAMNALVRASA++SNGPFILNLDCDHYIYN +A+REGMCFMMDRGG+ +CY+QFPQR
Sbjct: 615  KKAGAMNALVRASAIMSNGPFILNLDCDHYIYNSEALREGMCFMMDRGGDRLCYVQFPQR 674

Query: 679  FEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKI---- 738
            FEGIDPSDRYANHNTVFFD NMRALDG+ GPVYVGTGC+FRR ALYGF+PP+        
Sbjct: 675  FEGIDPSDRYANHNTVFFDVNMRALDGLMGPVYVGTGCLFRRIALYGFNPPRSKDFSPSC 734

Query: 739  -------KHKNNDQAETQPLQATDF-DPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLA 798
                     K N   E + L+ +D+ D +++++L+PK+FGNS  L +SI VAEFQGRPLA
Sbjct: 735  WSCCFPRSKKKNIPEENRALRMSDYDDEEMNLSLVPKKFGNSTFLIDSIPVAEFQGRPLA 794

Query: 799  DHSAVKYGRPPGALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVT 858
            DH AVK GRPPGAL +PRE LDA+TVAEA++VISCWYEDKTEWG R+GWIYGSVTEDVVT
Sbjct: 795  DHPAVKNGRPPGALTIPRELLDASTVAEAIAVISCWYEDKTEWGSRIGWIYGSVTEDVVT 854

Query: 859  GYRMHNRGWHSVYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRL 918
            GYRMHNRGW SVYC+TKRDAFRG+APINLTDRLHQVLRWATGSVEIFFSRNNALLAS ++
Sbjct: 855  GYRMHNRGWKSVYCVTKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNALLASSKM 914

Query: 919  KLLQRLAYLNVGIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISL 978
            K+LQR+AYLNVGIYPFTSIFLIVYCFLPALSLFSG FIVQTLNVTFLVYLLII+I L  L
Sbjct: 915  KILQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLVYLLIISITLCLL 974

Query: 979  AILEVKWSGIGLEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDEN 1038
            A+LE+KWSGI LEEWWRNEQFWLI GTSAHLAAV+QGLLKV+AG+EISFTLTSKS GD+ 
Sbjct: 975  ALLEIKWSGISLEEWWRNEQFWLIGGTSAHLAAVLQGLLKVVAGVEISFTLTSKSGGDDI 1034

Query: 1039 EDIYADLYLVKWTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLA 1098
            +D +ADLY+VKWTSLM+PPI I M+N+IA+ V FSRTIYS VPQWSK IGG FFSFWVLA
Sbjct: 1035 DDEFADLYMVKWTSLMIPPITIIMVNLIAIAVGFSRTIYSVVPQWSKLIGGVFFSFWVLA 1094

Query: 1099 HLYPFAKGLMGRRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            HLYPFAKGLMGRRG+TPTIV VWSGL+AIT+SLLW+AI+PP   T      GG F FP
Sbjct: 1095 HLYPFAKGLMGRRGRTPTIVYVWSGLVAITISLLWVAINPPAGNTEI----GGNFSFP 1145

BLAST of Moc04g03770 vs. TAIR 10
Match: AT3G03050.1 (cellulose synthase-like D3 )

HSP 1 Score: 1651.7 bits (4276), Expect = 0.0e+00
Identity = 804/1122 (71.66%), Positives = 926/1122 (82.53%), Query Frame = 0

Query: 29   VGQTVKFARRTSSGRYVSLSREDLDMSGEVSGDYINYTVHIPPTPDNQPMDSSIATKAEE 88
            V  +V FARRT SGRYV+ SR+DLD S   S D   Y+VHIPPTPDNQPMD SI+ K EE
Sbjct: 30   VSNSVTFARRTPSGRYVNYSRDDLD-SELGSVDLTGYSVHIPPTPDNQPMDPSISQKVEE 89

Query: 89   QYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHPQMAGAKGSSCAMPACDGKVMKDERGIDI 148
            QYVSNSLFTGGFNSVTRAHLM+KVID+E SHPQMAGAKGSSCA+P CD KVM DERG D+
Sbjct: 90   QYVSNSLFTGGFNSVTRAHLMEKVIDTETSHPQMAGAKGSSCAVPGCDVKVMSDERGQDL 149

Query: 149  TPCECRFRICRDCYLDAQKDTGLCPGCKELYKSADYDDDPNEYSGGALQLQGP-DGSKGG 208
             PCEC F+ICRDC++DA K  G+CPGCKE Y++ D  D  +        L  P  GSK  
Sbjct: 150  LPCECDFKICRDCFMDAVKTGGMCPGCKEPYRNTDLADFADNNKQQRPMLPPPAGGSKMD 209

Query: 209  QNMSMMK-------LNQSGEFDHNKWLFETKGTYGVGNAYWTPEDGYADGGDDKFRD-GV 268
            + +S+MK        +Q+G+FDHN+WLFET GTYG GNA+WT +  +    D      G 
Sbjct: 210  RRLSLMKSTKSGLMRSQTGDFDHNRWLFETSGTYGFGNAFWTKDGNFGSDKDGNGHGMGP 269

Query: 269  MESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLHWRVKHPNEDAIWLWLMSIV 328
             + M   W+PL+R   IPA++ISPYRLLILIR+VVL  FL WR+KH N DAIWLW MS+V
Sbjct: 270  QDLMSRPWRPLTRKLQIPAAVISPYRLLILIRIVVLALFLMWRIKHKNPDAIWLWGMSVV 329

Query: 329  CEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNPTGRSDLPGVDLFVSTADPE 388
            CE+WFA SW+LDQ+PKLCP+NRATDL VL +KF+ P PSNPTG+SDLPG+D+FVSTADPE
Sbjct: 330  CELWFALSWLLDQLPKLCPINRATDLNVLKEKFETPTPSNPTGKSDLPGLDMFVSTADPE 389

Query: 389  KEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAMAEAASFADLWVPFCRKHDI 448
            KEP LVT+NTILSILA DYPVEKLACY+SDDGGALLTFEAMAEAASFA++WVPFCRKH+I
Sbjct: 390  KEPPLVTSNTILSILAADYPVEKLACYVSDDGGALLTFEAMAEAASFANMWVPFCRKHNI 449

Query: 449  EPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVRTNGLPDSIRRRSDAFNARE 508
            EPRNP+SYFSLK DP KNK ++DFVKDRR++KREYDEFKVR N LPDSIRRRSDA++ARE
Sbjct: 450  EPRNPDSYFSLKRDPYKNKVKADFVKDRRRVKREYDEFKVRINSLPDSIRRRSDAYHARE 509

Query: 509  EMKMWKHMRET-GADAMEPIKVQKATWMADGTHWPGTWGVSASDHSKGDHAGILQVMLKP 568
            E+K  K  R+    + +EP+K+ KATWMADGTHWPGTW  S  DHS+ DHAGI+QVMLKP
Sbjct: 510  EIKAMKLQRQNRDEEIVEPVKIPKATWMADGTHWPGTWINSGPDHSRSDHAGIIQVMLKP 569

Query: 569  PSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDHNKKAGAMNALVRASAVLSNG 628
            PS +PL G   E  +D +DVD RLP+ VYVSREKRPGYDHNKKAGAMNALVRASA++SNG
Sbjct: 570  PSDEPLHG-VSEGFLDLTDVDIRLPLLVYVSREKRPGYDHNKKAGAMNALVRASAIMSNG 629

Query: 629  PFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQRFEGIDPSDRYANHNTVFFD 688
            PFILNLDCDHYIYN +A+REGMCFMMDRGG+ +CY+QFPQRFEGIDPSDRYANHNTVFFD
Sbjct: 630  PFILNLDCDHYIYNSQALREGMCFMMDRGGDRLCYVQFPQRFEGIDPSDRYANHNTVFFD 689

Query: 689  GNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQP------------DKIKHKNNDQAETQ 748
             NMRALDG+ GPVYVGTGC+FRR ALYGFDPP+              + K K+    E +
Sbjct: 690  VNMRALDGLMGPVYVGTGCLFRRIALYGFDPPRAKEHHPGFCSCCFSRKKKKSRVPEENR 749

Query: 749  PLQA---TDFDPDLDVNLLPKRFGNSNMLAESILVAEFQGRPLADHSAVKYGRPPGALRV 808
             L+    +D D +++++L+PK+FGNS  L +SI VAEFQGRPLADH AV+ GRPPGAL +
Sbjct: 750  SLRMGGDSDDDEEMNLSLVPKKFGNSTFLIDSIPVAEFQGRPLADHPAVQNGRPPGALTI 809

Query: 809  PREPLDAATVAEAVSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVYCIT 868
            PRE LDA+TVAEA++VISCWYEDKTEWG R+GWIYGSVTEDVVTGYRMHNRGW SVYC+T
Sbjct: 810  PRELLDASTVAEAIAVISCWYEDKTEWGSRIGWIYGSVTEDVVTGYRMHNRGWKSVYCVT 869

Query: 869  KRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGIYPF 928
            KRDAFRG+APINLTDRLHQVLRWATGSVEIFFSRNNA  AS R+K+LQR+AYLNVGIYPF
Sbjct: 870  KRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSRNNAFFASPRMKILQRIAYLNVGIYPF 929

Query: 929  TSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLEEWW 988
            TS FLIVYCFLPALSLFSG FIVQTLNVTFLVYLLII+I L  LA+LE+KWSGI LEEWW
Sbjct: 930  TSFFLIVYCFLPALSLFSGQFIVQTLNVTFLVYLLIISITLCLLALLEIKWSGISLEEWW 989

Query: 989  RNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTLTSKSAGDENEDIYADLYLVKWTSLM 1048
            RNEQFWLI GTSAHLAAV+QGLLKV+AGIEISFTLTSKS G++ +D +ADLY+VKWTSLM
Sbjct: 990  RNEQFWLIGGTSAHLAAVIQGLLKVVAGIEISFTLTSKSGGEDVDDEFADLYIVKWTSLM 1049

Query: 1049 VPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRRGKT 1108
            +PPI I M+N+IA+ V FSRTIYS +PQWSK IGG FFSFWVLAHLYPFAKGLMGRRG+T
Sbjct: 1050 IPPITIMMVNLIAIAVGFSRTIYSVIPQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRGRT 1109

Query: 1109 PTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVGGGGFEFP 1126
            PTIV VWSGL+AIT+SLLW+AI+PP  +T      GG F FP
Sbjct: 1110 PTIVYVWSGLVAITISLLWVAINPPAGSTQI----GGSFTFP 1145

BLAST of Moc04g03770 vs. TAIR 10
Match: AT2G33100.1 (cellulose synthase-like D1 )

HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 716/1143 (62.64%), Positives = 840/1143 (73.49%), Query Frame = 0

Query: 4    LTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSR-EDLDMSGEVSG-- 63
            + + P KK +      N+ +S+     Q VKF RRTSSGR VSLSR +D+D+SG+ SG  
Sbjct: 1    MASSPPKKTL------NSQSSSLSRPPQAVKFGRRTSSGRIVSLSRDDDMDVSGDYSGQN 60

Query: 64   DYINYTVHIPPTPDNQPMDSSIATKAEEQYVSNSLFTGGFNSVTRAHLMDKVIDSEVSHP 123
            DYINYTV +PPTPDNQP  SS +T                              SE    
Sbjct: 61   DYINYTVLMPPTPDNQPAGSSGST------------------------------SESKGD 120

Query: 124  QMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDAQKDTGLCPGCKELYK 183
               G  G        DG  M ++        E R  + +                     
Sbjct: 121  ANRGGGGG-------DGPKMGNK-------LERRLSVMK--------------------- 180

Query: 184  SADYDDDPNEYSGGALQLQGPDGSKGGQNMSMMKLNQSGEFDHNKWLFETKGTYGVGNAY 243
                                        N SM+  +Q+G+FDHN+WLFE+KG YG+GNA+
Sbjct: 181  --------------------------SNNKSMLLRSQTGDFDHNRWLFESKGKYGIGNAF 240

Query: 244  WTPEDGYADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISPYRLLILIRLVVLGFFLH 303
            W+ ED   DGG  K      + +D  WKPL+R   IPA I+SPYRLLI+IRLV++ FFL 
Sbjct: 241  WSEEDDTYDGGVSK-----SDFLDKPWKPLTRKVQIPAKILSPYRLLIVIRLVIVFFFLW 300

Query: 304  WRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRATDLQVLHDKFDAPCPSNP 363
            WR+ +PNEDA+WLW +SIVCEIWFAFSWILD +PKL P+NRATDL  LHDKF+ P PSNP
Sbjct: 301  WRITNPNEDAMWLWGLSIVCEIWFAFSWILDILPKLNPINRATDLAALHDKFEQPSPSNP 360

Query: 364  TGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKLACYISDDGGALLTFEAM 423
            TGRSDLPGVD+FVSTADPEKEP LVTANT+LSILAVDYP+EKL+ YISDDGGA+LTFEAM
Sbjct: 361  TGRSDLPGVDVFVSTADPEKEPPLVTANTLLSILAVDYPIEKLSAYISDDGGAILTFEAM 420

Query: 424  AEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDFVKDRRKIKREYDEFKVR 483
            AEA  FA+ WVPFCRKHDIEPRNP+SYFS+K DPTKNK R DFVKDRR IKREYDEFKVR
Sbjct: 421  AEAVRFAEYWVPFCRKHDIEPRNPDSYFSIKKDPTKNKKRQDFVKDRRWIKREYDEFKVR 480

Query: 484  TNGLPDSIRRRSDAFNAREEMKMWKHMRETGADAMEP--IKVQKATWMADGTHWPGTWGV 543
             NGLP+ I++R++ FN REE+K  +  RE     + P  ++V KATWMADGTHWPGTW  
Sbjct: 481  INGLPEQIKKRAEQFNMREELKEKRIAREKNGGVLPPDGVEVVKATWMADGTHWPGTWFE 540

Query: 544  SASDHSKGDHAGILQVMLKPPSPDPLIGSTDEKIIDFSDVDTRLPMFVYVSREKRPGYDH 603
               DHSKGDHAGILQ+M K P  +P++G  +E  +DF+ +D R+PMF YVSREKRPG+DH
Sbjct: 541  PKPDHSKGDHAGILQIMSKVPDLEPVMGGPNEGALDFTGIDIRVPMFAYVSREKRPGFDH 600

Query: 604  NKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREGMCFMMDRGGEDICYIQFPQ 663
            NKKAGAMN +VRASA+LSNG FILNLDCDHYIYN KAI+EGMCFMMDRGG+ ICYIQFPQ
Sbjct: 601  NKKAGAMNGMVRASAILSNGAFILNLDCDHYIYNSKAIKEGMCFMMDRGGDRICYIQFPQ 660

Query: 664  RFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMFRRFALYGFDPPQPDKIK-- 723
            RFEGIDPSDRYANHNTVFFDGNMRALDG+QGPVYVGTGCMFRR+ALYGF+PP+ ++    
Sbjct: 661  RFEGIDPSDRYANHNTVFFDGNMRALDGLQGPVYVGTGCMFRRYALYGFNPPRANEYSGV 720

Query: 724  ---------HKNNDQAETQPLQATDF---------DPDLDVNLLPKRFGNSNMLAESILV 783
                     H       +Q  QA+D          DPDL    LPK+FGNS M  ++I V
Sbjct: 721  FGQEKAPAMHVRTQSQASQTSQASDLESDTQPLNDDPDLG---LPKKFGNSTMFTDTIPV 780

Query: 784  AEFQGRPLADHSAVKYGRPPGALRVPREPLDAATVAEAVSVISCWYEDKTEWGERVGWIY 843
            AE+QGRPLADH +VK GRPPGAL +PR PLDA TVAEA++VISCWYED TEWG+R+GWIY
Sbjct: 781  AEYQGRPLADHMSVKNGRPPGALLLPRPPLDAPTVAEAIAVISCWYEDNTEWGDRIGWIY 840

Query: 844  GSVTEDVVTGYRMHNRGWHSVYCITKRDAFRGSAPINLTDRLHQVLRWATGSVEIFFSRN 903
            GSVTEDVVTGYRMHNRGW SVYCITKRDAFRG+APINLTDRLHQVLRWATGSVEIFFS+N
Sbjct: 841  GSVTEDVVTGYRMHNRGWRSVYCITKRDAFRGTAPINLTDRLHQVLRWATGSVEIFFSKN 900

Query: 904  NALLASRRLKLLQRLAYLNVGIYPFTSIFLIVYCFLPALSLFSGNFIVQTLNVTFLVYLL 963
            NA+ A+RRLK LQR+AYLNVGIYPFTSIFL+VYCFLPAL LFSG FIVQ+L++ FL YLL
Sbjct: 901  NAMFATRRLKFLQRVAYLNVGIYPFTSIFLVVYCFLPALCLFSGKFIVQSLDIHFLSYLL 960

Query: 964  IITICLISLAILEVKWSGIGLEEWWRNEQFWLISGTSAHLAAVVQGLLKVIAGIEISFTL 1023
             IT+ L  +++LEVKWSGIGLEEWWRNEQFWLI GTSAHLAAVVQGLLKVIAGIEISFTL
Sbjct: 961  CITVTLTLISLLEVKWSGIGLEEWWRNEQFWLIGGTSAHLAAVVQGLLKVIAGIEISFTL 1020

Query: 1024 TSKSAGDENEDIYADLYLVKWTSLMVPPIVIAMMNIIAMGVAFSRTIYSTVPQWSKFIGG 1083
            TSK++G++ +DI+ADLY+VKWT L + P+ I ++N++A+ +  SRTIYS +PQW K +GG
Sbjct: 1021 TSKASGEDEDDIFADLYIVKWTGLFIMPLTIIIVNLVAIVIGASRTIYSVIPQWGKLMGG 1033

Query: 1084 AFFSFWVLAHLYPFAKGLMGRRGKTPTIVIVWSGLIAITLSLLWIAISPPKSTTPEAAVG 1122
             FFS WVL H+YPFAKGLMGRRGK PTIV VWSGL++IT+SLLWI ISPP   +     G
Sbjct: 1081 IFFSLWVLTHMYPFAKGLMGRRGKVPTIVYVWSGLVSITVSLLWITISPPDDVS-----G 1033

BLAST of Moc04g03770 vs. TAIR 10
Match: AT1G02730.1 (cellulose synthase-like D5 )

HSP 1 Score: 1384.4 bits (3582), Expect = 0.0e+00
Identity = 707/1153 (61.32%), Positives = 854/1153 (74.07%), Query Frame = 0

Query: 4    LTNQPSKKAIRSPGGSNNSTSNRGSVGQTVKFARRTSSGRYVSLSREDLDMSGEVSGDYI 63
            +TNQ S  + R+   ++ S+ NR S G           GRY S+S EDL      S   +
Sbjct: 40   ITNQNSPLSSRATRRTSISSGNRRSNG---------DEGRYCSMSVEDLTAETTNSECVL 99

Query: 64   NYTVHIPPTPDNQPMDSSIATKAEE---------QYVSNSLFTGGFNSVTRAHLMDKVID 123
            +YTVHIPPTPD+Q + +S  ++ +E          ++S ++FTGGF SVTR H    VID
Sbjct: 100  SYTVHIPPTPDHQTVFASQESEEDEMLKGNSNQKSFLSGTIFTGGFKSVTRGH----VID 159

Query: 124  SEVSHPQMAGAKGSSCAMPACDGKVMKDERGIDITPCECRFRICRDCYLDA-QKDTGLCP 183
              +         G  C +  CD KV+          CEC FRICRDCY D      G CP
Sbjct: 160  CSMDRADPEKKSGQICWLKGCDEKVVHGR-------CECGFRICRDCYFDCITSGGGNCP 219

Query: 184  GCKELYKSADYDDDP----NEYSGGALQLQGPDGSKGGQNMSMMK----LNQSGEFDHNK 243
            GCKE Y+  D +DDP     +    A  L     SK  + +S++K     NQ+G+FDH +
Sbjct: 220  GCKEPYR--DINDDPETEEEDEEDEAKPLPQMGESKLDKRLSVVKSFKAQNQAGDFDHTR 279

Query: 244  WLFETKGTYGVGNAYWTPEDGY--ADGGDDKFRDGVMESMDASWKPLSRTFPIPASIISP 303
            WLFETKGTYG GNA W P+DGY    GG     +   E  + S +PL+R   + A+IISP
Sbjct: 280  WLFETKGTYGYGNAVW-PKDGYGIGSGGGGNGYETPPEFGERSKRPLTRKVSVSAAIISP 339

Query: 304  YRLLILIRLVVLGFFLHWRVKHPNEDAIWLWLMSIVCEIWFAFSWILDQVPKLCPVNRAT 363
            YRLLI +RLV LG FL WRV+HPN +A+WLW MS  CE+WFA SW+LDQ+PKLCPVNR T
Sbjct: 340  YRLLIALRLVALGLFLTWRVRHPNREAMWLWGMSTTCELWFALSWLLDQLPKLCPVNRLT 399

Query: 364  DLQVLHDKFDAPCPSNPTGRSDLPGVDLFVSTADPEKEPVLVTANTILSILAVDYPVEKL 423
            DL VL ++F++P   NP GRSDLPG+D+FVSTADPEKEP LVTANTILSILAVDYPVEKL
Sbjct: 400  DLGVLKERFESPNLRNPKGRSDLPGIDVFVSTADPEKEPPLVTANTILSILAVDYPVEKL 459

Query: 424  ACYISDDGGALLTFEAMAEAASFADLWVPFCRKHDIEPRNPESYFSLKVDPTKNKSRSDF 483
            ACY+SDDGGALLTFEA+A+ ASFA  WVPFCRKH+IEPRNPE+YF  K +  KNK R DF
Sbjct: 460  ACYLSDDGGALLTFEALAQTASFASTWVPFCRKHNIEPRNPEAYFGQKRNFLKNKVRLDF 519

Query: 484  VKDRRKIKREYDEFKVRTNGLPDSIRRRSDAFNAREEMKMWKHMRE--TGADAMEPIKVQ 543
            V++RR++KREYDEFKVR N LP++IRRRSDA+N  EE++  K   E   G +  E + V 
Sbjct: 520  VRERRRVKREYDEFKVRINSLPEAIRRRSDAYNVHEELRAKKKQMEMMMGNNPQETVIVP 579

Query: 544  KATWMADGTHWPGTWGVSASDHSKGDHAGILQVMLKPPSPDPLIG--STDEKIIDFSDVD 603
            KATWM+DG+HWPGTW    +D+S+GDHAGI+Q ML PP+ +P+ G  +  E +ID +DVD
Sbjct: 580  KATWMSDGSHWPGTWSSGETDNSRGDHAGIIQAMLAPPNAEPVYGAEADAENLIDTTDVD 639

Query: 604  TRLPMFVYVSREKRPGYDHNKKAGAMNALVRASAVLSNGPFILNLDCDHYIYNCKAIREG 663
             RLPM VYVSREKRPGYDHNKKAGAMNALVR SA++SNGPFILNLDCDHYIYN  A+REG
Sbjct: 640  IRLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYIYNSMALREG 699

Query: 664  MCFMMDRGGEDICYIQFPQRFEGIDPSDRYANHNTVFFDGNMRALDGIQGPVYVGTGCMF 723
            MCFM+DRGG+ ICY+QFPQRFEGIDP+DRYANHNTVFFD +MRALDG+QGP+YVGTGC+F
Sbjct: 700  MCFMLDRGGDRICYVQFPQRFEGIDPNDRYANHNTVFFDVSMRALDGLQGPMYVGTGCIF 759

Query: 724  RRFALYGFDPPQP------------------DKIKHKNNDQAET----QPLQATDFDPDL 783
            RR ALYGF PP+                    K   K +D+       +  +  + D D+
Sbjct: 760  RRTALYGFSPPRATEHHGWLGRRKVKISLRRPKAMMKKDDEVSLPINGEYNEEENDDGDI 819

Query: 784  DVNLLPKRFGNSNMLAESILVAEFQGRPLAD-HSAVKYGRPPGALRVPREPLDAATVAEA 843
            +  LLPKRFGNSN    SI VAE+QGR + D     K  RP G+L VPREPLDAATVAEA
Sbjct: 820  ESLLLPKRFGNSNSFVASIPVAEYQGRLIQDLQGKGKNSRPAGSLAVPREPLDAATVAEA 879

Query: 844  VSVISCWYEDKTEWGERVGWIYGSVTEDVVTGYRMHNRGWHSVYCITKRDAFRGSAPINL 903
            +SVISC+YEDKTEWG+RVGWIYGSVTEDVVTGYRMHNRGW S+YC+TKRDAFRG+APINL
Sbjct: 880  ISVISCFYEDKTEWGKRVGWIYGSVTEDVVTGYRMHNRGWRSIYCVTKRDAFRGTAPINL 939

Query: 904  TDRLHQVLRWATGSVEIFFSRNNALLASRRLKLLQRLAYLNVGIYPFTSIFLIVYCFLPA 963
            TDRLHQVLRWATGSVEIFFSRNNA+ A+RR+K LQR+AY NVG+YPFTS+FLIVYC LPA
Sbjct: 940  TDRLHQVLRWATGSVEIFFSRNNAIFATRRMKFLQRVAYFNVGMYPFTSLFLIVYCILPA 999

Query: 964  LSLFSGNFIVQTLNVTFLVYLLIITICLISLAILEVKWSGIGLEEWWRNEQFWLISGTSA 1023
            +SLFSG FIVQ+L++TFL+YLL IT+ L  L++LE+KWSGI L EWWRNEQFW+I GTSA
Sbjct: 1000 ISLFSGQFIVQSLDITFLIYLLSITLTLCMLSLLEIKWSGITLHEWWRNEQFWVIGGTSA 1059

Query: 1024 HLAAVVQGLLKVIAGIEISFTLTSK-SAGDENEDIYADLYLVKWTSLMVPPIVIAMMNII 1083
            H AAV+QGLLKVIAG++ISFTLTSK SA ++ +D +ADLY+VKW+ LMVPP+ I M+N+I
Sbjct: 1060 HPAAVLQGLLKVIAGVDISFTLTSKSSAPEDGDDEFADLYVVKWSFLMVPPLTIMMVNMI 1119

Query: 1084 AMGVAFSRTIYSTVPQWSKFIGGAFFSFWVLAHLYPFAKGLMGRRGKTPTIVIVWSGLIA 1109
            A+ V  +RT+YS  PQWSK +GG FFSFWVL HLYPFAKGLMGRRG+ PTIV VWSGL++
Sbjct: 1120 AIAVGLARTLYSPFPQWSKLVGGVFFSFWVLCHLYPFAKGLMGRRGRVPTIVFVWSGLLS 1169
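
The identity and positives percentages in the HSP headers above are the ratio of matching columns to the aligned length. The following is a minimal sketch that recomputes them from the counts shown for the two TAIR 10 hits; the helper name `percent` and the dictionary layout are illustrative assumptions, not part of the database output.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the two TAIR 10 HSP headers above.
hsps = {
    "AT2G33100.1": {"identical": 716, "positive": 840, "length": 1143},
    "AT1G02730.1": {"identical": 707, "positive": 854, "length": 1153},
}

for name, h in hsps.items():
    identity = percent(h["identical"], h["length"])
    positives = percent(h["positive"], h["length"])
    print(f"{name}: identity {identity:.2f}%, positives {positives:.2f}%")
```

The aligned length (1143 or 1153) includes gap columns, which is why it can exceed the number of query residues covered by the alignment.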

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022135687.1 | 0.0e+00 | 100.00 | cellulose synthase-like protein D4 [Momordica charantia]
XP_038888576.1 | 0.0e+00 | 92.54 | cellulose synthase-like protein D4 [Benincasa hispida]
XP_022989294.1 | 0.0e+00 | 92.46 | cellulose synthase-like protein D4 [Cucurbita maxima]
XP_022928356.1 | 0.0e+00 | 92.46 | cellulose synthase-like protein D4 [Cucurbita moschata] >KAG6589078.1 Cellulose ...
XP_023529589.1 | 0.0e+00 | 92.28 | cellulose synthase-like protein D4 [Cucurbita pepo subsp. pepo]

Match Name | E-value | Identity (%) | Description
Q9SZL9 | 0.0e+00 | 81.42 | Cellulose synthase-like protein D4 OS=Arabidopsis thaliana OX=3702 GN=CSLD4 PE=2...
Q7EZW6 | 0.0e+00 | 72.36 | Cellulose synthase-like protein D3 OS=Oryza sativa subsp. japonica OX=39947 GN=C...
Q9LFL0 | 0.0e+00 | 71.62 | Cellulose synthase-like protein D2 OS=Arabidopsis thaliana OX=3702 GN=CSLD2 PE=3...
Q9M9M4 | 0.0e+00 | 71.66 | Cellulose synthase-like protein D3 OS=Arabidopsis thaliana OX=3702 GN=CSLD3 PE=1...
A2YU42 | 0.0e+00 | 70.48 | Cellulose synthase-like protein D2 OS=Oryza sativa subsp. indica OX=39946 GN=CSL...

Match Name | E-value | Identity (%) | Description
A0A6J1C3F4 | 0.0e+00 | 100.00 | cellulose synthase-like protein D4 OS=Momordica charantia OX=3673 GN=LOC11100758...
A0A6J1JNY4 | 0.0e+00 | 92.46 | cellulose synthase-like protein D4 OS=Cucurbita maxima OX=3661 GN=LOC111486408 P...
A0A6J1EK32 | 0.0e+00 | 92.46 | cellulose synthase-like protein D4 OS=Cucurbita moschata OX=3662 GN=LOC111435210...
A0A0A0K6B5 | 0.0e+00 | 91.84 | Cellulose synthase OS=Cucumis sativus OX=3659 GN=Csa_7G029410 PE=4 SV=1
A0A5A7V1C4 | 0.0e+00 | 92.01 | Cellulose synthase-like protein D4 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C...

Match Name | E-value | Identity (%) | Description
AT4G38190.1 | 0.0e+00 | 81.42 | cellulose synthase like D4
AT5G16910.1 | 0.0e+00 | 71.62 | cellulose-synthase like D2
AT3G03050.1 | 0.0e+00 | 71.66 | cellulose synthase-like D3
AT2G33100.1 | 0.0e+00 | 62.64 | cellulose synthase-like D1
AT1G02730.1 | 0.0e+00 | 61.32 | cellulose synthase-like D5

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR005150 | Cellulose synthase | PFAM | PF03552 | Cellulose_synt | coord: 719..1113; e-value: 1.4E-192; score: 642.1
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 113..199; e-value: 1.2E-10; score: 43.2
None | No IPR available | PFAM | PF14570 | zf-RING_4 | coord: 132..180; e-value: 7.0E-14; score: 51.3
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..35
None | No IPR available | PANTHER | PTHR13301 | X-BOX TRANSCRIPTION FACTOR-RELATED | coord: 75..1122
None | No IPR available | PANTHER | PTHR13301:SF40 | CELLULOSE SYNTHASE-LIKE PROTEIN D4 | coord: 75..1122
None | No IPR available | SUPERFAMILY | 57850 | RING/U-box | coord: 120..183
IPR029044 | Nucleotide-diphospho-sugar transferases | GENE3D | 3.90.550.10 | Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A | coord: 566..710; e-value: 7.2E-13; score: 50.1
IPR029044 | Nucleotide-diphospho-sugar transferases | SUPERFAMILY | 53448 | Nucleotide-diphospho-sugar transferases | coord: 386..916
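
The coord values listed above can be compared directly to see how the predicted domains nest along the protein. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); the `Hit` type and the helpers `length` and `contains` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    source: str
    term: str
    start: int
    end: int


# Coordinates copied from the InterPro annotation rows above.
hits = [
    Hit("PFAM", "PF03552", 719, 1113),
    Hit("GENE3D", "3.30.40.10", 113, 199),
    Hit("PFAM", "PF14570", 132, 180),
    Hit("SUPERFAMILY", "57850", 120, 183),
    Hit("GENE3D", "3.90.550.10", 566, 710),
    Hit("SUPERFAMILY", "53448", 386, 916),
]


def length(hit: Hit) -> int:
    """Number of residues covered, assuming inclusive 1-based coordinates."""
    return hit.end - hit.start + 1


def contains(outer: Hit, inner: Hit) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


for hit in hits:
    print(f"{hit.source} {hit.term}: {hit.start}..{hit.end} ({length(hit)} residues)")

# The Pfam zf-RING_4 hit falls inside the Gene3D zinc finger region.
print(contains(hits[1], hits[2]))
```

Under the stated assumption, the Pfam Cellulose_synt hit (719..1113) spans 395 residues.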

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc04g03770.1 | Moc04g03770.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0030244 | cellulose biosynthetic process
biological_process | GO:0097502 | mannosylation
biological_process | GO:0009833 | plant-type primary cell wall biogenesis
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005886 | plasma membrane
cellular_component | GO:0016020 | membrane
molecular_function | GO:0016760 | cellulose synthase (UDP-forming) activity
molecular_function | GO:0051753 | mannan synthase activity