Cp4.1LG19g01780 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG19g01780
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: GRAS family transcription factor
Location: Cp4.1LG19 : 1489557 .. 1493789 (+)
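
As a rough illustration, the location value can be split into sequence ID, start, end, and strand, and the genomic span computed from it. This is a minimal sketch: `parse_location` is a hypothetical helper name, not part of any database API, and the length calculation assumes 1-based inclusive coordinates (as in GFF3), which this page does not state.

```python
import re

def parse_location(text):
    """Split a value like 'Cp4.1LG19 : 1489557 .. 1493789 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cp4.1LG19 : 1489557 .. 1493789 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that coordinate assumption, the span works out to 4,233 bp on the plus strand.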
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CGGGTCGATTGTGTTCCTCCATTTCCGCCATGGATGCGACAGTGAAGAAGCTCTGCTCAAACTGTAATGTGAAGAAGAAGAAAGAAGAAGGAATCTATCTCTCTCCTCTTCCATTTTAATACTGTTTTCTTTTCAGTTCCTGATCTTCTCGATTTCGCACCACGATTTTCTTTTCAAAAAGCCACCTCTCAGATGCATCCAAAATCCATGGCTTCCGCGTACTTCGCATTATTAAAAAGCATGAAGGCATAGGGGCTCGTGTTCTTGCTCGAGAGAGAGAAGAGGAATCCTCTAGTTGCTGTCCAAATCTTCGTGCCGTTTTCTGCTTGGATTTCGCTTCTGTTTCCAAATCTTACAAATCTGCGTTAATGGTAGCCATTACTCCTGCTACCTCCTTCTCCTTTATGAAGAACACCGATCGTCGCCATTGCCATATCTTCTCTACCTCCAATGGCGACTTACGCTTTGCTTGGCGACTCCACTGTCCGCGTTAATGGCGGTTTCGATGACGGTTCTCTGACCAGCAACTCGACGAACAGCAACGGCAGCGAAGAACTTAATCAACAGACTGTTCAAGTTCCGGTTCAGGTTTCTCAACCGCCGCCGCGATTACCGCCTGGAAAAATGGTGCGGAAGAGAATCGCGTCGGAGATAGAGATCGAAGAACTCGACGGCGGAGGCGGCGGTACCGCCGCCGTTCATCCACGGTTTTGCCGGCGGAGTTTGGCTTCCGATCGTCCTTTTGCAGGTGGAGAAAATAAGACGAATGAGAATGTGGATAACTATTGTTCTTCTTCAAACCCTAGCCATGGCGCTAACCACTCCACTGTGCATAATTTAACCGCTCTGACGTCAGTTGTGGTCGCAGGGTCAAATTTATCAAATCCTCCGTCTGGTTCTGATGCTACGGCTTCTTCCACTACCTCCAACGTCTCGTCTCTTATTGACAGTACGCTTCCCGTTCTTCGTCCTCAGCCCCACCATCGCCATTTGCAGAATCCTGCAGTCTGTGGTTTTTCTGGTTTACCTTTGTTCCCACCCGAATCGAATCACCACCACCACCACAACAAGTTAAATTCTCGCAATAATCCTTTCCCCATTCCTAATCCATGTCAGGTTGTTCTTCATAATCCTCCAACTTCTACAACGACCTCCATTATCGCCGCCGCTTCTACTCCGATGGATGATTCCTCCGCCACTGCTTGGATCGACGGTATTATCAAGGATTTAATCCACAGCTCCACTGCCGTATCCATTCCTCAGCTTATTCAGAACGTTCGTGATATTATCTACCCTTGTAACCCCAATCTTGCGAACCTTCTTGAGTTTCGTCTTCGTACTTTGACAGAACCTAACGTTCCTAACTTCGCCGCTGAGGATCAGCGTGTTAGGAAATCGCCCTTGCCGTCACCAGCGCCGGTGGGCGGCTCGGGGCTGCAGCAGAGGCAGTTCAATCAAGAACATGAGCAAGAACAGGATTGTTCTGGGTTGAAGCTTAATCTCGATTCTTCTCTGCATAATCTTCCTAATTTCTCATCTCAGCCTCCGTTTCATGACCCATACCTCCACTGGGGAGCCACTCCTACCCCAGCCCCCACTCCCTCCGTCGCCACAACCGGCGGCGAGGTTCCCGGTCATCAACTTAATCTCTCTTCTGTTCCACATTCATCACTTATTCCTCTAAACCATATCCCTTCTAAGCCACAGCCAGAACAGCCAAACTCCTGTCCGGTCAATGTGAAGGCAGCGGCAGCGGCGGCCGCACAGCCATCGCCCTCGCCGCCGACGAGTAATGATCCTTCAACGACTGCGTTACTGATTAGAGAGATAAAAGAGGAGATGAGGCAACAGAAGAGAGACGAAGAAGGGTTACACCTCTTGACTTTGCTTCTTCAATGTGCAGAAGCCGTTTCTGCCGATAATTTAGAAGAAGCCAATAAGATGCTACTGGAAATCTCGGAGTTGTCGACGCCCTTCGGTACATCGGCGCAAAGAGTGGC
GGCGTATTTCTCAGAAGCAATGTCGGCGAGGCTTGTGAGCTCATGTTTAGGCATATACGCCGCTTTGCCGCCGACGTTATTGCCCCATACACACAGCCAGAAGATAGCCTCAGCCTTTCAAGTCTTCAATGGCATAAGCCCATTTGTCAAATTCTCACACTTCACAGCCAATCAAGCCATTCAAGAAGCCTTCGAAAGGGAAGAGAGAGTTCACATCATAGATCTAGACATCATGCAAGGCCTTCAATGGCCAGGCCTCTTCCACATCTTGGCGTCTCGGCCGGGTGGGCCGCCCTATGTCCGCCTCACTGGGCTGGGGACCTCTCAGGAAGTTCTTGAGGCCACTGGCAAACGCCTCACTGAATTCGCCGAGAAGCTTGGCCTTCCGTTTGATTTCTTTCCTGTTGCAGATAAAATTGGCAATCTGGACTTGGAGAGGCTCAACGTGAGCCAAAGAGAAGCCGTTGCCGTCCATTGGATGCAGCATTCTCTTTACGAAGTCACTGGTTCTGATTCCAATACGCTATGGCTTCTGCAGAGGTATTACTAGCTTACACAAACCTTCAAGCTTTTCGTTTGATCTTCTTCTTTGATGGGTTTCCCTGTTAATTACTTTATCATAAGCATGTGTTAGGAATAAACATGGATTAAGAATCACCGCTCTCCACATGATATTGTGGAAATATTGTCAACTTTTGAGCATAAGCTTTCATGATTTTGCTTTTAATTTTCTTTACTTGTAAACCTATAATCATTCCCTTAATTAGCCGATGTGGGACGCCCCACCAATAATCCTCAATAATTCTTCTCTCGAACAAAGTATACTGTAGAGCCTCCTTTGAGACCTATGGAGCCCTCAAATAGCCTCCCCTTACTTTGAGCCTCGACTCCTTTCTCTGTAGCCCTCGAACAAAGTACACCATTTATTCGCCATTTGAGTCACTTTTTGACCACATTATCGAGGCTCACAATTCGTTGTTTGACATTTGAGAATTCTAGTTAAAGTCACGGAATCACGACTCTCATATGATATTGCCCACTTTGAATATAAGCTTTCATAGTTTTGTTTTTGGTTTTTTCCAAATGCCTCGTACGAATAGAGATAGTATTCCTTTCTTATGAACCCATGATGTGTAAGACTCCATGAAGAGTGATTCTTAGTCTTACACTTTACCTCCATTAACGCCTTGCTTTGTTTCAAAATGATGGATAGTCTTCAACAGTTTAAATCAAGCAAAAAGTTGACTGACTAATTCAATGTTATCTCTGTGCAATGCACCGGAAAAAGCTGCACTCTCTCTCATCTCATATTTGATATTTTCACTGAATAGACCATTCATTTGTTTACGTGTTGCTGATGAGTACATATTAAAAAACATTGCCATCTCTGTCTTAAACTGATACCAGATTGGCTCCAAAAGTGGTGACGGTGGTCGAACAAGATCTGAGCCACACAGGCTCCTTCTTGGGAAGATTCGTAGAGGCCATTCATTACTATTCAGCACTGTTTGACTCTTTGGGTGTCAGCTATGGCGAAGAGAGTGAAGAGAGGCACTTAGTGGAGCAGCTACTGTTATCAAGGGAGATCAGAAACGTGCTGGCCGTCGGAGGACCGTCGAGGAGCGGCGAAGTGAAGTTCCAAAACTGGAGAGAAAAGCTGCAACAATCTGGGTTTAAGGGCATTTCTCTCGCCGGTAATGCTGCAACTCAGGCCACTCTTCTCCTCGGAATGTTCCCTTCCGATGGGTATACGCTTGTGGAAGACAATGGGACTTTGAAACTTGGGTGGAAGGATCTATGCTTGCTGACAGCGTCGGCTTGGAAGCCGCCGTTTCATCACCATGCTGCCGGCAACCACATTCCCAGGTACTGACATTTTTCCATTTTTGATTCTATATAGTGGTGTTGATTTTGTTTGATAATATCGTCATAATCATTATCATCATTATTCCCTTATTTTTATGGCTTTTTTTTAACATTCTTCTTCCTCTTCTTCTTGT
TTCAACCCTTTTTCCCATCTCCAATGTATACCAAATTATCTGTGCCTTTTGAATAATGAACTTTCTGTTTCTAACTCATTATAATGTCGACTGAAAAATTAATATTTTGTTTTCATTACTTTTAAATTCAATAAATGTAAAGTACAGACATTAGGAGTACCGAAGAAGTTGATTATTGTTGAGCCATGTTTGTTATTAAAGTCAAACTTTCAACTTTTTTGAATTAGTGGACG

mRNA sequence

CGGGTCGATTGTGTTCCTCCATTTCCGCCATGGATGCGACAGTGAAGAAGCTCTGCTCAAACTGTAATGTGAAGAAGAAGAAAGAAGAAGGAATCTATCTCTCTCCTCTTCCATTTTAATACTGTTTTCTTTTCAGTTCCTGATCTTCTCGATTTCGCACCACGATTTTCTTTTCAAAAAGCCACCTCTCAGATGCATCCAAAATCCATGGCTTCCGCGTACTTCGCATTATTAAAAAGCATGAAGGCATAGGGGCTCGTGTTCTTGCTCGAGAGAGAGAAGAGGAATCCTCTAGTTGCTGTCCAAATCTTCGTGCCGTTTTCTGCTTGGATTTCGCTTCTGTTTCCAAATCTTACAAATCTGCGTTAATGGTAGCCATTACTCCTGCTACCTCCTTCTCCTTTATGAAGAACACCGATCGTCGCCATTGCCATATCTTCTCTACCTCCAATGGCGACTTACGCTTTGCTTGGCGACTCCACTGTCCGCGTTAATGGCGGTTTCGATGACGGTTCTCTGACCAGCAACTCGACGAACAGCAACGGCAGCGAAGAACTTAATCAACAGACTGTTCAAGTTCCGGTTCAGGTTTCTCAACCGCCGCCGCGATTACCGCCTGGAAAAATGGTGCGGAAGAGAATCGCGTCGGAGATAGAGATCGAAGAACTCGACGGCGGAGGCGGCGGTACCGCCGCCGTTCATCCACGGTTTTGCCGGCGGAGTTTGGCTTCCGATCGTCCTTTTGCAGGTGGAGAAAATAAGACGAATGAGAATGTGGATAACTATTGTTCTTCTTCAAACCCTAGCCATGGCGCTAACCACTCCACTGTGCATAATTTAACCGCTCTGACGTCAGTTGTGGTCGCAGGGTCAAATTTATCAAATCCTCCGTCTGGTTCTGATGCTACGGCTTCTTCCACTACCTCCAACGTCTCGTCTCTTATTGACAGTACGCTTCCCGTTCTTCGTCCTCAGCCCCACCATCGCCATTTGCAGAATCCTGCAGTCTGTGGTTTTTCTGGTTTACCTTTGTTCCCACCCGAATCGAATCACCACCACCACCACAACAAGTTAAATTCTCGCAATAATCCTTTCCCCATTCCTAATCCATGTCAGGTTGTTCTTCATAATCCTCCAACTTCTACAACGACCTCCATTATCGCCGCCGCTTCTACTCCGATGGATGATTCCTCCGCCACTGCTTGGATCGACGGTATTATCAAGGATTTAATCCACAGCTCCACTGCCGTATCCATTCCTCAGCTTATTCAGAACGTTCGTGATATTATCTACCCTTGTAACCCCAATCTTGCGAACCTTCTTGAGTTTCGTCTTCGTACTTTGACAGAACCTAACGTTCCTAACTTCGCCGCTGAGGATCAGCGTGTTAGGAAATCGCCCTTGCCGTCACCAGCGCCGGTGGGCGGCTCGGGGCTGCAGCAGAGGCAGTTCAATCAAGAACATGAGCAAGAACAGGATTGTTCTGGGTTGAAGCTTAATCTCGATTCTTCTCTGCATAATCTTCCTAATTTCTCATCTCAGCCTCCGTTTCATGACCCATACCTCCACTGGGGAGCCACTCCTACCCCAGCCCCCACTCCCTCCGTCGCCACAACCGGCGGCGAGGTTCCCGGTCATCAACTTAATCTCTCTTCTGTTCCACATTCATCACTTATTCCTCTAAACCATATCCCTTCTAAGCCACAGCCAGAACAGCCAAACTCCTGTCCGGTCAATGTGAAGGCAGCGGCAGCGGCGGCCGCACAGCCATCGCCCTCGCCGCCGACGAGTAATGATCCTTCAACGACTGCGTTACTGATTAGAGAGATAAAAGAGGAGATGAGGCAACAGAAGAGAGACGAAGAAGGGTTACACCTCTTGACTTTGCTTCTTCAATGTGCAGAAGCCGTTTCTGCCGATAATTTAGAAGAAGCCAATAAGATGCTACTGGAAATCTCGGAGTTGTCGACGCCCTTCGGTACATCGGCGCAAAGAGTGGC
GGCGTATTTCTCAGAAGCAATGTCGGCGAGGCTTGTGAGCTCATGTTTAGGCATATACGCCGCTTTGCCGCCGACGTTATTGCCCCATACACACAGCCAGAAGATAGCCTCAGCCTTTCAAGTCTTCAATGGCATAAGCCCATTTGTCAAATTCTCACACTTCACAGCCAATCAAGCCATTCAAGAAGCCTTCGAAAGGGAAGAGAGAGTTCACATCATAGATCTAGACATCATGCAAGGCCTTCAATGGCCAGGCCTCTTCCACATCTTGGCGTCTCGGCCGGGTGGGCCGCCCTATGTCCGCCTCACTGGGCTGGGGACCTCTCAGGAAGTTCTTGAGGCCACTGGCAAACGCCTCACTGAATTCGCCGAGAAGCTTGGCCTTCCGTTTGATTTCTTTCCTGTTGCAGATAAAATTGGCAATCTGGACTTGGAGAGGCTCAACGTGAGCCAAAGAGAAGCCGTTGCCGTCCATTGGATGCAGCATTCTCTTTACGAAGTCACTGGTTCTGATTCCAATACGCTATGGCTTCTGCAGAGATTGGCTCCAAAAGTGGTGACGGTGGTCGAACAAGATCTGAGCCACACAGGCTCCTTCTTGGGAAGATTCGTAGAGGCCATTCATTACTATTCAGCACTGTTTGACTCTTTGGGTGTCAGCTATGGCGAAGAGAGTGAAGAGAGGCACTTAGTGGAGCAGCTACTGTTATCAAGGGAGATCAGAAACGTGCTGGCCGTCGGAGGACCGTCGAGGAGCGGCGAAGTGAAGTTCCAAAACTGGAGAGAAAAGCTGCAACAATCTGGGTTTAAGGGCATTTCTCTCGCCGGTAATGCTGCAACTCAGGCCACTCTTCTCCTCGGAATGTTCCCTTCCGATGGGTATACGCTTGTGGAAGACAATGGGACTTTGAAACTTGGGTGGAAGGATCTATGCTTGCTGACAGCGTCGGCTTGGAAGCCGCCGTTTCATCACCATGCTGCCGGCAACCACATTCCCAGGTACTGACATTTTTCCATTTTTGATTCTATATAGTGGTGTTGATTTTGTTTGATAATATCGTCATAATCATTATCATCATTATTCCCTTATTTTTATGGCTTTTTTTTAACATTCTTCTTCCTCTTCTTCTTGTTTCAACCCTTTTTCCCATCTCCAATGTATACCAAATTATCTGTGCCTTTTGAATAATGAACTTTCTGTTTCTAACTCATTATAATGTCGACTGAAAAATTAATATTTTGTTTTCATTACTTTTAAATTCAATAAATGTAAAGTACAGACATTAGGAGTACCGAAGAAGTTGATTATTGTTGAGCCATGTTTGTTATTAAAGTCAAACTTTCAACTTTTTTGAATTAGTGGACG

Coding sequence (CDS)

ATGGCGACTTACGCTTTGCTTGGCGACTCCACTGTCCGCGTTAATGGCGGTTTCGATGACGGTTCTCTGACCAGCAACTCGACGAACAGCAACGGCAGCGAAGAACTTAATCAACAGACTGTTCAAGTTCCGGTTCAGGTTTCTCAACCGCCGCCGCGATTACCGCCTGGAAAAATGGTGCGGAAGAGAATCGCGTCGGAGATAGAGATCGAAGAACTCGACGGCGGAGGCGGCGGTACCGCCGCCGTTCATCCACGGTTTTGCCGGCGGAGTTTGGCTTCCGATCGTCCTTTTGCAGGTGGAGAAAATAAGACGAATGAGAATGTGGATAACTATTGTTCTTCTTCAAACCCTAGCCATGGCGCTAACCACTCCACTGTGCATAATTTAACCGCTCTGACGTCAGTTGTGGTCGCAGGGTCAAATTTATCAAATCCTCCGTCTGGTTCTGATGCTACGGCTTCTTCCACTACCTCCAACGTCTCGTCTCTTATTGACAGTACGCTTCCCGTTCTTCGTCCTCAGCCCCACCATCGCCATTTGCAGAATCCTGCAGTCTGTGGTTTTTCTGGTTTACCTTTGTTCCCACCCGAATCGAATCACCACCACCACCACAACAAGTTAAATTCTCGCAATAATCCTTTCCCCATTCCTAATCCATGTCAGGTTGTTCTTCATAATCCTCCAACTTCTACAACGACCTCCATTATCGCCGCCGCTTCTACTCCGATGGATGATTCCTCCGCCACTGCTTGGATCGACGGTATTATCAAGGATTTAATCCACAGCTCCACTGCCGTATCCATTCCTCAGCTTATTCAGAACGTTCGTGATATTATCTACCCTTGTAACCCCAATCTTGCGAACCTTCTTGAGTTTCGTCTTCGTACTTTGACAGAACCTAACGTTCCTAACTTCGCCGCTGAGGATCAGCGTGTTAGGAAATCGCCCTTGCCGTCACCAGCGCCGGTGGGCGGCTCGGGGCTGCAGCAGAGGCAGTTCAATCAAGAACATGAGCAAGAACAGGATTGTTCTGGGTTGAAGCTTAATCTCGATTCTTCTCTGCATAATCTTCCTAATTTCTCATCTCAGCCTCCGTTTCATGACCCATACCTCCACTGGGGAGCCACTCCTACCCCAGCCCCCACTCCCTCCGTCGCCACAACCGGCGGCGAGGTTCCCGGTCATCAACTTAATCTCTCTTCTGTTCCACATTCATCACTTATTCCTCTAAACCATATCCCTTCTAAGCCACAGCCAGAACAGCCAAACTCCTGTCCGGTCAATGTGAAGGCAGCGGCAGCGGCGGCCGCACAGCCATCGCCCTCGCCGCCGACGAGTAATGATCCTTCAACGACTGCGTTACTGATTAGAGAGATAAAAGAGGAGATGAGGCAACAGAAGAGAGACGAAGAAGGGTTACACCTCTTGACTTTGCTTCTTCAATGTGCAGAAGCCGTTTCTGCCGATAATTTAGAAGAAGCCAATAAGATGCTACTGGAAATCTCGGAGTTGTCGACGCCCTTCGGTACATCGGCGCAAAGAGTGGCGGCGTATTTCTCAGAAGCAATGTCGGCGAGGCTTGTGAGCTCATGTTTAGGCATATACGCCGCTTTGCCGCCGACGTTATTGCCCCATACACACAGCCAGAAGATAGCCTCAGCCTTTCAAGTCTTCAATGGCATAAGCCCATTTGTCAAATTCTCACACTTCACAGCCAATCAAGCCATTCAAGAAGCCTTCGAAAGGGAAGAGAGAGTTCACATCATAGATCTAGACATCATGCAAGGCCTTCAATGGCCAGGCCTCTTCCACATCTTGGCGTCTCGGCCGGGTGGGCCGCCCTATGTCCGCCTCACTGGGCTGGGGACCTCTCAGGAAGTTCTTGAGGCCACTGGCAAACGCCTCACTGAATTCGCCGAGAAGCTTGGCCTTCCGTTTGATTTCTTTCCTGTTGCAGATAAAATTGGCAATCTGGACTTGGAGAGGCTCAACGTGAG
CCAAAGAGAAGCCGTTGCCGTCCATTGGATGCAGCATTCTCTTTACGAAGTCACTGGTTCTGATTCCAATACGCTATGGCTTCTGCAGAGATTGGCTCCAAAAGTGGTGACGGTGGTCGAACAAGATCTGAGCCACACAGGCTCCTTCTTGGGAAGATTCGTAGAGGCCATTCATTACTATTCAGCACTGTTTGACTCTTTGGGTGTCAGCTATGGCGAAGAGAGTGAAGAGAGGCACTTAGTGGAGCAGCTACTGTTATCAAGGGAGATCAGAAACGTGCTGGCCGTCGGAGGACCGTCGAGGAGCGGCGAAGTGAAGTTCCAAAACTGGAGAGAAAAGCTGCAACAATCTGGGTTTAAGGGCATTTCTCTCGCCGGTAATGCTGCAACTCAGGCCACTCTTCTCCTCGGAATGTTCCCTTCCGATGGGTATACGCTTGTGGAAGACAATGGGACTTTGAAACTTGGGTGGAAGGATCTATGCTTGCTGACAGCGTCGGCTTGGAAGCCGCCGTTTCATCACCATGCTGCCGGCAACCACATTCCCAGGTACTGA

Protein sequence

MATYALLGDSTVRVNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKMVRKRIASEIEIEELDGGGGGTAAVHPRFCRRSLASDRPFAGGENKTNENVDNYCSSSNPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVLRPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTSTTTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLEFRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQEQDCSGLKLNLDSSLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQLNLSSVPHSSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPPTSNDPSTTALLIREIKEEMRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGWKDLCLLTASAWKPPFHHHAAGNHIPRY
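
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code (NCBI table 1), laid out in T, C, A, G order
# for the first, second, and third codon positions.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGACTTACGCTTTGCTTGGC"))  # first 8 codons of the CDS: MATYALLG
print(translate("AGGTACTGA"))                 # last 3 codons of the CDS: RY
```

Both fragments agree with the start (MATYALLG) and end (RY) of the protein sequence listed above. Passing the full CDS, with its line break removed, should reproduce the whole protein if the table assumption holds.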
BLAST of Cp4.1LG19g01780 vs. Swiss-Prot
Match: SCR_PEA (Protein SCARECROW OS=Pisum sativum GN=SCR PE=2 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 1.3e-228
Identity = 471/863 (54.58%), Positives = 580/863 (67.21%), Query Frame = 1

Query: 17  GFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKMVRKRIASEIEIEELDGG 76
           G   G+ T + TN+N +   +  + +    + Q  P     K++RKR+ASE+E++  +  
Sbjct: 9   GVGGGNTTPDETNNNSTSNSSNISTEDFHNMPQQQPHHSERKLLRKRMASEMELQLHNNN 68

Query: 77  GGGTAAVHPRFCRR-----SLASDRPF----------------AGGENKTNENVDNYCSS 136
                  + RF RR     SL    P                 +G  N  N N +NY   
Sbjct: 69  NNND---YHRFSRRTNNTSSLNCSLPATTQKGVTTTTTTTLASSGNNNNNNNNNNNYHYH 128

Query: 137 SNPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVLRPQ 196
           +N ++   ++  +N+ AL+   VA  N       ++ +     S+ SS ++++       
Sbjct: 129 NNNNNSIINNNNNNV-ALSRDNVAIQNFPTVTVTTNYSTMLLPSSCSSNLNNSSTSAANY 188

Query: 197 PHHRHL----QN--PAVCGFSGLPLFPPESNHHH--HHNKLNSRNNPFPIPNPCQVVLHN 256
            H++      QN  P +CGFSGLPLFP ++N  +  ++N  N+RNN              
Sbjct: 189 THYQQPLVEEQNTLPEICGFSGLPLFPSQNNQTNRTNNNSSNNRNN-------------- 248

Query: 257 PPTSTTTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNL 316
             T+T   +++++ +  + S+ T WIDGI+KDLIH+S +VSIPQLI NVR+IIYPCNPNL
Sbjct: 249 --TNTVVDVVSSSPSMEETSATTNWIDGILKDLIHTSNSVSIPQLINNVREIIYPCNPNL 308

Query: 317 ANLLEFRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQEQDCSGL 376
           A +LE RLR LTEPN        +R R S   S   V G+ L     N         S +
Sbjct: 309 ALVLEHRLRLLTEPN----TCVPERKRNSTEQSGVNVNGNVLAASNVNN--------SSV 368

Query: 377 KLN------LDSSLH--NLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQLN 436
           KL       + +SLH  +     +Q    + + +WGAT                   Q+N
Sbjct: 369 KLMNRVDDVVPTSLHFSDSSTLLNQNQNQNMFPNWGAT-------------------QIN 428

Query: 437 LSSVPHSSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPPTSNDPSTTALLIRE 496
            ++ P  SL+ L   P   Q +Q +    + +  A A         T+   S    L R+
Sbjct: 429 NNNNPSVSLVTLPSQPLSTQQDQQHQLQQHPEDLAPAT--------TTTTTSAELALARK 488

Query: 497 IKEEMRQQ-KRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAA 556
            KEE+++Q K+DEEGLHLLTLLLQCAEAVSA+NLE+ANKMLLEIS+LSTPFGTSAQRVAA
Sbjct: 489 KKEEIKEQKKKDEEGLHLLTLLLQCAEAVSAENLEQANKMLLEISQLSTPFGTSAQRVAA 548

Query: 557 YFSEAMSARLVSSCLGIYAALPPTLLPHT-HSQKIASAFQVFNGISPFVKFSHFTANQAI 616
           YFSEA+SARLVSSCLGIYA LP  +  HT H+QK+ASAFQVFNGISPFVKFSHFTANQAI
Sbjct: 549 YFSEAISARLVSSCLGIYATLP--VSSHTPHNQKVASAFQVFNGISPFVKFSHFTANQAI 608

Query: 617 QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLT 676
           QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTS E LEATGKRL+
Sbjct: 609 QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSMETLEATGKRLS 668

Query: 677 EFAEKLGLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQR 736
           +FA KLGLPF+FFPVA+K+GN+D+E+LNVS+ EAVAVHW+QHSLY+VTGSD+NTLWLLQR
Sbjct: 669 DFANKLGLPFEFFPVAEKVGNIDVEKLNVSKSEAVAVHWLQHSLYDVTGSDTNTLWLLQR 728

Query: 737 LAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREI 796
           LAPKVVTVVEQDLS+ GSFLGRFVEAIHYYSALFDSLG SYGEESEERH+VEQ LLSREI
Sbjct: 729 LAPKVVTVVEQDLSNAGSFLGRFVEAIHYYSALFDSLGSSYGEESEERHVVEQQLLSREI 788

Query: 797 RNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDN 841
           RNVLAVGGPSRSGE+KF NWREKLQQ GF+G+SLAGNAATQA+LLLGMFPS+GYTLVEDN
Sbjct: 789 RNVLAVGGPSRSGEIKFHNWREKLQQCGFRGVSLAGNAATQASLLLGMFPSEGYTLVEDN 810
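
The identity and positives percentages in each HSP header are simple ratios over the alignment length, and they can be recomputed from the counts. The sketch below parses the statistics line of the SCR_PEA hit as an example. The regular expression and the name `parse_hsp_stats` are assumptions made for this illustration, not part of any BLAST tool, and the pattern accepts either spelling of the second label.

```python
import re

# Matches lines such as:
#   Identity = 471/863 (54.58%), Positives = 580/863 (67.21%), Query Frame = 1
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return the match counts and percentages recomputed from them."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, _, positive, pos_length, _ = m.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        # Percentages are derived from the counts, not copied from the line.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 471/863 (54.58%), Positives = 580/863 (67.21%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For this hit the recomputed values match the reported 54.58% identity and 67.21% positives.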

BLAST of Cp4.1LG19g01780 vs. Swiss-Prot
Match: SCR_IPONI (Protein SCARECROW OS=Ipomoea nil GN=SCR PE=1 SV=1)

HSP 1 Score: 774.2 bits (1998), Expect = 1.4e-222
Identity = 461/792 (58.21%), Positives = 541/792 (68.31%), Query Frame = 1

Query: 58  KMVRKRIASEIEIEELDGGGGGTAAVHPRFCRRS--LASDRPFAGGENKTNENVDNYCSS 117
           KMVRKR ASE+E++      GG  + H RF RR+  L  D    G         DN   +
Sbjct: 58  KMVRKRAASEMELQI-----GGGISEHGRFLRRNAPLLGDLRVCGTNFGGGAGGDNGGGN 117

Query: 118 S---NPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVL 177
           S   + SH  NH  V+N + +         ++ PP+ ++ + +ST+          LP  
Sbjct: 118 SLGVSVSH-PNHVVVNNYSTM--------QIAPPPTSTNLSVTSTSDATHLAYMEQLPPN 177

Query: 178 RPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTST 237
            PQ          +C FSGLPLFP  S      N   +   P P+P           + +
Sbjct: 178 EPQAPL------PLCVFSGLPLFPAPSRAR---NAAGAALQPAPLPVTA--------SGS 237

Query: 238 TTSIIAAASTPM-DDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLL 297
              + +++   M D+ +A AWIDGIIKDLIH ST VSIPQLIQNVR+II+PCNPNLA LL
Sbjct: 238 AIGVNSSSGGGMGDNGTAMAWIDGIIKDLIHISTHVSIPQLIQNVREIIHPCNPNLAALL 297

Query: 298 EFRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQEQDCSGLKLNL 357
           E+RLR+LT       AA D        P  A V     ++        Q QD       +
Sbjct: 298 EYRLRSLTTAA----AAAD--------PLAANVYDDWRRKETLQP---QSQDA------I 357

Query: 358 DSSLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQLNLSSVPHSSLIPL 417
              LH LP+  + PP       W  T  PA   + ATT      HQL  ++      +P+
Sbjct: 358 THPLH-LPDSMTPPP-------WEITLPPAA--AAATTR-----HQLRDNNPSSLPFVPV 417

Query: 418 NHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPPTSNDPSTTALL-----IREIKEEMRQ 477
                  Q +QP       +  + + +Q   SPP S + +  AL+     +R  KEE+ Q
Sbjct: 418 PSSDRLDQQQQPGRMDNEKQPESQSQSQ---SPPASENTAAAALIRTESIMRREKEELEQ 477

Query: 478 QKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSA 537
           QK+DEEGLHLLTLLLQCAEAV+ADNL+EAN+MLL++SELSTP+GTSAQRVAAYFSEAMSA
Sbjct: 478 QKKDEEGLHLLTLLLQCAEAVAADNLDEANRMLLQVSELSTPYGTSAQRVAAYFSEAMSA 537

Query: 538 RLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREER 597
           RLV+SCLGIYA+ P   LP + +QK+ASAFQVFNGISPFVKFSHFTANQAIQEAFERE+R
Sbjct: 538 RLVNSCLGIYASAPLNALPLSLNQKMASAFQVFNGISPFVKFSHFTANQAIQEAFEREDR 597

Query: 598 VHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLP 657
           VHIIDLDIMQGLQWPGLFHILASRPGGPP VRLTGLGTS E LEATGKRL++FA+KLGLP
Sbjct: 598 VHIIDLDIMQGLQWPGLFHILASRPGGPPLVRLTGLGTSMEALEATGKRLSDFAQKLGLP 657

Query: 658 FDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVV 717
           F+FFPVADK+GNLD +RLNV++REAVAVHW+QHSLY+VTGSD+NTLWLLQRLAPKVVTVV
Sbjct: 658 FEFFPVADKVGNLDPQRLNVNKREAVAVHWLQHSLYDVTGSDTNTLWLLQRLAPKVVTVV 717

Query: 718 EQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGP 777
           EQDLSH GSFLGRFVEAIHYYSALFDSLG  YGEESEERH VEQ LLSREIRNVLAVGGP
Sbjct: 718 EQDLSHAGSFLGRFVEAIHYYSALFDSLGACYGEESEERHAVEQQLLSREIRNVLAVGGP 777

Query: 778 SRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGWKD 837
           SRSGEVKF NWREK QQSGF+G+SLAGNAA QATLLLGMF SDGYTL EDNG LKLGWKD
Sbjct: 778 SRSGEVKFNNWREKFQQSGFRGVSLAGNAAAQATLLLGMFHSDGYTLAEDNGALKLGWKD 779

Query: 838 LCLLTASAWKPP 839
           LCLLTASAW+PP
Sbjct: 838 LCLLTASAWRPP 779

BLAST of Cp4.1LG19g01780 vs. Swiss-Prot
Match: SCR_ARATH (Protein SCARECROW OS=Arabidopsis thaliana GN=SCR PE=1 SV=1)

HSP 1 Score: 672.5 bits (1734), Expect = 5.9e-192
Identity = 342/445 (76.85%), Positives = 386/445 (86.74%), Query Frame = 1

Query: 408 LIPLNHIPSKPQPEQ----------PNSCPVNVKAA--AAAAAQPSPSPPTSNDPST--- 467
           L  +++ PS PQ +Q          P   P+  +    ++  A P P   T+  P+    
Sbjct: 207 LYQISNNPSPPQQQQQHQQQQQQHKPPPPPIQQQERENSSTDAPPQPETVTATVPAVQTN 266

Query: 468 TALLIREIKEEMRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTS 527
           TA  +RE KEE+++QK+DEEGLHLLTLLLQCAEAVSADNLEEANK+LLEIS+LSTP+GTS
Sbjct: 267 TAEALRERKEEIKRQKQDEEGLHLLTLLLQCAEAVSADNLEEANKLLLEISQLSTPYGTS 326

Query: 528 AQRVAAYFSEAMSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFT 587
           AQRVAAYFSEAMSARL++SCLGIYAALP   +P THS K+ SAFQVFNGISP VKFSHFT
Sbjct: 327 AQRVAAYFSEAMSARLLNSCLGIYAALPSRWMPQTHSLKMVSAFQVFNGISPLVKFSHFT 386

Query: 588 ANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEAT 647
           ANQAIQEAFE+E+ VHIIDLDIMQGLQWPGLFHILASRPGGPP+VRLTGLGTS E L+AT
Sbjct: 387 ANQAIQEAFEKEDSVHIIDLDIMQGLQWPGLFHILASRPGGPPHVRLTGLGTSMEALQAT 446

Query: 648 GKRLTEFAEKLGLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTL 707
           GKRL++FA+KLGLPF+F P+A+K+GNLD ERLNV +REAVAVHW+QHSLY+VTGSD++TL
Sbjct: 447 GKRLSDFADKLGLPFEFCPLAEKVGNLDTERLNVRKREAVAVHWLQHSLYDVTGSDAHTL 506

Query: 708 WLLQRLAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLL 767
           WLLQRLAPKVVTVVEQDLSH GSFLGRFVEAIHYYSALFDSLG SYGEESEERH+VEQ L
Sbjct: 507 WLLQRLAPKVVTVVEQDLSHAGSFLGRFVEAIHYYSALFDSLGASYGEESEERHVVEQQL 566

Query: 768 LSREIRNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYT 827
           LS+EIRNVLAVGGPSRSGEVKF++WREK+QQ GFKGISLAGNAATQATLLLGMFPSDGYT
Sbjct: 567 LSKEIRNVLAVGGPSRSGEVKFESWREKMQQCGFKGISLAGNAATQATLLLGMFPSDGYT 626

Query: 828 LVEDNGTLKLGWKDLCLLTASAWKP 838
           LV+DNGTLKLGWKDL LLTASAW P
Sbjct: 627 LVDDNGTLKLGWKDLSLLTASAWTP 651

BLAST of Cp4.1LG19g01780 vs. Swiss-Prot
Match: SCR_MAIZE (Protein SCARECROW OS=Zea mays GN=SCR PE=2 SV=1)

HSP 1 Score: 659.4 bits (1700), Expect = 5.1e-188
Identity = 370/596 (62.08%), Positives = 427/596 (71.64%), Query Frame = 1

Query: 247 SSATAWIDGIIKDLIHSS--TAVSIPQLIQNVRDIIYPCNPNLANLLEFRLRTLTEPNVP 306
           +S TAW+DGII+D+I SS   AVSI QLI NVR+II+PCNP LA+LLE RLR+L      
Sbjct: 138 ASTTAWVDGIIRDIIGSSGGAAVSITQLIHNVREIIHPCNPGLASLLELRLRSL------ 197

Query: 307 NFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQEQDC-SGLKLNLDSSLHNLPNFSS 366
             AA+      +PLP P        Q +Q    H       +GL L     L +      
Sbjct: 198 -LAAD-----PAPLPPPP-------QPQQHALLHGAPAAAPAGLTLPPPPPLPDKRRHEH 257

Query: 367 QPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQLNLSSVPHSSLIPLNHIPSKPQPEQP 426
            PP           P PAP                                P  P  E+ 
Sbjct: 258 PPPCQQQQQE---EPHPAPQS------------------------------PKAPTAEET 317

Query: 427 NSCPVNVKAAAAAAAQPSPSPPTSNDPSTTALLIREIKEEMRQQKRDEEGLHLLTLLLQC 486
            +     +AAAAAAA+                   E KEE R+++RDEEGLHLLTLLLQC
Sbjct: 318 AAAAAAAQAAAAAAAK-------------------ERKEEQRRKQRDEEGLHLLTLLLQC 377

Query: 487 AEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCLGIYAALPP-- 546
           AEAV+ADNL++A++ LLEI+EL+TPFGTS QRVAAYF+EAMSARLVSSCLG+YA LPP  
Sbjct: 378 AEAVNADNLDDAHQTLLEIAELATPFGTSTQRVAAYFAEAMSARLVSSCLGLYAPLPPGS 437

Query: 547 TLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWP 606
                 H  ++A+AFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWP
Sbjct: 438 PAAARLHG-RVAAAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWP 497

Query: 607 GLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFFPVADKIGNLDL 666
           GLFHILASRPGGPP VRLTGLG S E LEATGKRL++FA+ LGLPF+F  VA+K GN+D 
Sbjct: 498 GLFHILASRPGGPPRVRLTGLGASMEALEATGKRLSDFADTLGLPFEFCAVAEKAGNVDP 557

Query: 667 ERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVVEQDLSHTGSFLGRFV 726
           E+L V++REAVAVHW+ HSLY+VTGSDSNTLWL+QRLAPKVVT+VEQDLSH+GSFL RFV
Sbjct: 558 EKLGVTRREAVAVHWLHHSLYDVTGSDSNTLWLIQRLAPKVVTMVEQDLSHSGSFLARFV 617

Query: 727 EAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGPSRSGEVKFQNWREKL 786
           EAIHYYSALFDSL  SYGE+S ERH+VEQ LLSREIRNVLAVGGP+R+G+VKF +WREKL
Sbjct: 618 EAIHYYSALFDSLDASYGEDSPERHVVEQQLLSREIRNVLAVGGPARTGDVKFGSWREKL 661

Query: 787 QQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGWKDLCLLTASAWKP 838
            QSGF+  SLAG+AA QA+LLLGMFPSDGYTLVE+NG LKLGWKDLCLLTASAW+P
Sbjct: 678 AQSGFRAASLAGSAAAQASLLLGMFPSDGYTLVEENGALKLGWKDLCLLTASAWRP 661

BLAST of Cp4.1LG19g01780 vs. Swiss-Prot
Match: SCR2_ORYSI (Protein SCARECROW 2 OS=Oryza sativa subsp. indica GN=SCR2 PE=3 SV=2)

HSP 1 Score: 620.9 bits (1600), Expect = 2.0e-176
Identity = 384/709 (54.16%), Positives = 452/709 (63.75%), Query Frame = 1

Query: 139 AGSNLSNPPSGSDATASS-TTSNVSSLIDSTLPVLRPQPHHRHLQNPAVCGFSGLPLFPP 198
           + S L  P S S AT SS + S+ S  I S LP L P  HH             L L+  
Sbjct: 3   SSSLLLFPSSSSSATHSSYSPSSSSHAITSLLPPL-PSDHH-------------LLLYLD 62

Query: 199 ESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTSTTT---SIIAAASTPMDDSSATAWID 258
               HH    +  +     +  P       PP    T   S + AA+      SA+A + 
Sbjct: 63  HQEQHHLAAAMVRKRPASDMDLP-------PPRRHVTGDLSDVTAAAAGAPTLSASAQLP 122

Query: 259 GIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLEFRLRTLTEPNVPNFAAEDQRVR 318
            +            +P       D+  P  P    +         E   P+ A  D  +R
Sbjct: 123 AL---------PTQLPAFHHTDMDLAAPAPPAPQQV------AAGEGGPPSTAWVDGIIR 182

Query: 319 KSPLPSPAPVGGSGLQQRQFNQEHEQEQDC-----SGLKLNLDSSLHNLPNFSSQPPFHD 378
                S A V  + L     +   E  + C     S L+L L S L++ P     PP H 
Sbjct: 183 DIIASSGAAVSVAQL----IHNVREIIRPCNPDLASILELRLRSLLNSDPAPPPPPPSHP 242

Query: 379 PYLHWGATPTPAPTPSVATTGGEVPGHQLNLSSVPHSSLIPLNHIPSKPQPEQPNSCPVN 438
             L   AT  P P  SVA      P         P          P++PQ  +P +    
Sbjct: 243 ALLPPDATAPPPPPTSVAALPPPPPAQPDKRRREPQCQ----EQEPNQPQSPKPPTAEET 302

Query: 439 VKAAAAAAAQPSPSPPTSNDPSTTALLIREIKEEMRQQKRDEEGLHLLTLLLQCAEAVSA 498
             AAAAAAA               A   +E KEE R+++RDEEGLHLLTLLLQCAE+V+A
Sbjct: 303 AAAAAAAAA-------------AAAAAAKERKEEQRRKQRDEEGLHLLTLLLQCAESVNA 362

Query: 499 DNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCLGIYAALP-PTLLPHTH 558
           DNL+EA++ LLEI+EL+TPFGTS QRVAAYF+EAMSARLVSSCLG+YA LP P+      
Sbjct: 363 DNLDEAHRALLEIAELATPFGTSTQRVAAYFAEAMSARLVSSCLGLYAPLPSPSPAGARV 422

Query: 559 SQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILA 618
             ++A+AFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILA
Sbjct: 423 HGRVAAAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILA 482

Query: 619 SRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFFPVADKIGNLDLERLNVSQ 678
           SRPGGPP VRLTGLG S E LEATGKRL++FA+ LGLPF+F PVADK GNLD E+L V++
Sbjct: 483 SRPGGPPRVRLTGLGASMEALEATGKRLSDFADTLGLPFEFCPVADKAGNLDPEKLGVTR 542

Query: 679 REAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVVEQDLSHTGSFLGRFVEAIHYYS 738
           REAVAVHW++HSLY+VTGSDSNTLWL+QRLAPKVVT+VEQDLSH+GSFL RFVEAIHYYS
Sbjct: 543 REAVAVHWLRHSLYDVTGSDSNTLWLIQRLAPKVVTMVEQDLSHSGSFLARFVEAIHYYS 602

Query: 739 ALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGPSRSGEVKFQNWREKLQQSGFKG 798
           ALFDSL  SY E+S ERH+VEQ LLSREIRNVLAVGGP+R+G+VKF +WREKL QSGF+ 
Sbjct: 603 ALFDSLDASYSEDSPERHVVEQQLLSREIRNVLAVGGPARTGDVKFGSWREKLAQSGFRV 654

Query: 799 ISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGWKDLCLLTASAWKP 838
            SLAG+AA QA LLLGMFPSDGYTL+E+NG LKLGWKDLCLLTASAW+P
Sbjct: 663 SSLAGSAAAQAALLLGMFPSDGYTLIEENGALKLGWKDLCLLTASAWRP 654

BLAST of Cp4.1LG19g01780 vs. TrEMBL
Match: A0A0A0KWH9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G196810 PE=3 SV=1)

HSP 1 Score: 1436.8 bits (3718), Expect = 0.0e+00
Identity = 759/875 (86.74%), Positives = 788/875 (90.06%), Query Frame = 1

Query: 1   MATYALLGDSTVR-VNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKM 60
           MA YALL DST R VNGGFDD  LTS STNSNGS+ELN Q +   VQV  P PRLP GKM
Sbjct: 1   MAAYALLNDSTPRGVNGGFDDSPLTSASTNSNGSDELNHQQI---VQV--PQPRLPVGKM 60

Query: 61  VRKRIASEIEIEELDGGGGG-------TAAVHPRFCRRSLASDRPFAGGENKTNENVDNY 120
           VRKRIASE+EIE LD GGGG       T AVHPRFCRR+LASDRPF  GENKTN N   Y
Sbjct: 61  VRKRIASEMEIEGLDSGGGGGGGGSGGTTAVHPRFCRRTLASDRPF--GENKTNVN---Y 120

Query: 121 CSSSNPSHGANHSTV-HNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPV 180
           CSSSNPSHG NHSTV HNLTALTSVV+ GSNLSNPPSGSDAT SSTTSN ++L+DSTLPV
Sbjct: 121 CSSSNPSHGGNHSTVVHNLTALTSVVIEGSNLSNPPSGSDATVSSTTSN-NNLLDSTLPV 180

Query: 181 LRPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTS 240
           LRPQPHH HLQNPAVCGFSGLPLFPPESNHHH  NKLN+RNNPFP+PNP QV+LHNPPT+
Sbjct: 181 LRPQPHHHHLQNPAVCGFSGLPLFPPESNHHH--NKLNTRNNPFPLPNPSQVLLHNPPTT 240

Query: 241 TTTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLL 300
            TTSIIAAAS+PMDDSSATAWIDGIIKDLIHSSTA+SIPQLIQNVR+IIYPCNPNLANLL
Sbjct: 241 ATTSIIAAASSPMDDSSATAWIDGIIKDLIHSSTAISIPQLIQNVREIIYPCNPNLANLL 300

Query: 301 EFRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQE-HEQEQDCSGLKLN 360
           EFRLRTLT+P+VPNFA ED RVRKSPLP PAPV G GLQQRQFNQE HEQE DCSGLKLN
Sbjct: 301 EFRLRTLTDPSVPNFATEDHRVRKSPLPLPAPVAGLGLQQRQFNQEQHEQEHDCSGLKLN 360

Query: 361 LDS-SLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGE----VPGH-QLNLSSVP 420
           LDS SLHNL NF SQPPFH+PYL WGATP P PTPS A  G +    +PGH QLNLSSV 
Sbjct: 361 LDSTSLHNLSNFPSQPPFHEPYLQWGATPPPVPTPSAAAAGEDALQRLPGHHQLNLSSVT 420

Query: 421 HSSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPP-TSNDPSTTALLIREIKEE 480
            SSL+ LNH+PSKPQ EQ NSC       AAAAAQP+P+PP TSN+PS TALLIREIKEE
Sbjct: 421 PSSLVSLNHVPSKPQSEQQNSC-----TKAAAAAQPAPAPPSTSNNPSATALLIREIKEE 480

Query: 481 MRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEA 540
           MRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEA
Sbjct: 481 MRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEA 540

Query: 541 MSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFER 600
           MSARLVSSCLGIYAALPP+L+PHTHSQKIASAFQ+FNGISPFVKFSHFTANQAIQEAFER
Sbjct: 541 MSARLVSSCLGIYAALPPSLVPHTHSQKIASAFQIFNGISPFVKFSHFTANQAIQEAFER 600

Query: 601 EERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKL 660
           EERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKL
Sbjct: 601 EERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKL 660

Query: 661 GLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVV 720
           GLPFDFFPVADKIGNLDLERLNVS+REAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVV
Sbjct: 661 GLPFDFFPVADKIGNLDLERLNVSKREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVV 720

Query: 721 TVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAV 780
           TVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQ LLSREIRNVLAV
Sbjct: 721 TVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQQLLSREIRNVLAV 780

Query: 781 GGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLG 840
           GGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLG
Sbjct: 781 GGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLG 840

Query: 841 WKDLCLLTASAWKPPFHHHAA-------GNHIPRY 852
           WKDLCLLTASAWKPPFHHHAA        NHIPRY
Sbjct: 841 WKDLCLLTASAWKPPFHHHAAAAAAAVTNNHIPRY 857

BLAST of Cp4.1LG19g01780 vs. TrEMBL
Match: Q5NDC9_CUCSA (SCARECROW OS=Cucumis sativus GN=scr PE=2 SV=1)

HSP 1 Score: 1361.3 bits (3522), Expect = 0.0e+00
Identity = 729/875 (83.31%), Positives = 760/875 (86.86%), Query Frame = 1

Query: 1   MATYALLGDSTVR-VNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKM 60
           MA YALL DST R VNGGFDD  LTS STNSNGS+ELN Q +   VQV  P PRLP GKM
Sbjct: 1   MAAYALLNDSTPRGVNGGFDDSPLTSASTNSNGSDELNHQQI---VQV--PQPRLPVGKM 60

Query: 61  VRKRIASEIEIEELDGGGGGTAAVHPRF--CR-----RSLASDRPFAGGENKTNENVDNY 120
           VRKRIASE+EIE LD GGGG      R+  C      RSLASDRP      K        
Sbjct: 61  VRKRIASEMEIEGLDSGGGGGGGGSRRYYCCSSTVLPRSLASDRPL----EKIRRIGIIV 120

Query: 121 CSSSNPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVL 180
              +            NLTALTSVV+ GSNLSNPPSGSDAT SSTTSN ++L+DSTLPVL
Sbjct: 121 LLQTLAMAATTPLLCINLTALTSVVIEGSNLSNPPSGSDATVSSTTSN-NNLLDSTLPVL 180

Query: 181 RPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTST 240
           RPQPHH HLQNPAVCGFSGLPLFPPESNHHH  NKLN+RNNPFP+PNP QV+LHNPPT+ 
Sbjct: 181 RPQPHHHHLQNPAVCGFSGLPLFPPESNHHH--NKLNTRNNPFPLPNPSQVLLHNPPTTA 240

Query: 241 TTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLE 300
           TTSIIAAAS+PMDDSSATAWIDGIIKDLIHSSTA+SIPQLIQNVR+IIYPCNPNLANLLE
Sbjct: 241 TTSIIAAASSPMDDSSATAWIDGIIKDLIHSSTAISIPQLIQNVREIIYPCNPNLANLLE 300

Query: 301 FRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQE-HEQEQDCSGLKLNL 360
           FRLRTLT+P+VPNFA ED RVRKSPLP PAPV G GLQQRQFNQE HEQE DCSGLKLNL
Sbjct: 301 FRLRTLTDPSVPNFATEDHRVRKSPLPLPAPVAGLGLQQRQFNQEQHEQEHDCSGLKLNL 360

Query: 361 DS-SLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGE----VPGH-QLNLSSVPH 420
           DS SLHNL NF SQPPFH+PYL WGATP P PTPS A  G +    +PGH QLN+SSV  
Sbjct: 361 DSTSLHNLSNFPSQPPFHEPYLQWGATPPPVPTPSAAAAGEDALQRLPGHHQLNISSVTP 420

Query: 421 SSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPP-TSNDPSTTALLIREIKEEM 480
           SSL+ LNH+PSKPQ EQ NSC       AAAAAQP+P+PP TSN+PS TALLIREIKEEM
Sbjct: 421 SSLVSLNHVPSKPQSEQQNSC-----TKAAAAAQPAPAPPSTSNNPSATALLIREIKEEM 480

Query: 481 RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM 540
           RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM
Sbjct: 481 RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM 540

Query: 541 SARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFERE 600
           SARLVSSCLGIYAALPP+L+PHTHSQKIASAFQ+FNGISPFVKFSHFTANQAIQEAFERE
Sbjct: 541 SARLVSSCLGIYAALPPSLVPHTHSQKIASAFQIFNGISPFVKFSHFTANQAIQEAFERE 600

Query: 601 ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG 660
           ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG
Sbjct: 601 ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG 660

Query: 661 LPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT 720
           LPFDFFPVADKIGNLDLERLNVS+REAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT
Sbjct: 661 LPFDFFPVADKIGNLDLERLNVSKREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT 720

Query: 721 VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVG 780
           VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQ LLSREIRNVLAVG
Sbjct: 721 VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQQLLSREIRNVLAVG 780

Query: 781 GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW 840
           GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW
Sbjct: 781 GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW 840

Query: 841 KDLCLLTASAWKPPFHHHAA--------GNHIPRY 852
           KDLCLLTASAWKPPFHHHAA         NHIPRY
Sbjct: 841 KDLCLLTASAWKPPFHHHAAAAAAAAVTNNHIPRY 858

BLAST of Cp4.1LG19g01780 vs. TrEMBL
Match: F6HMQ2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0056g00050 PE=3 SV=1)

HSP 1 Score: 862.4 bits (2227), Expect = 4.5e-247
Identity = 517/874 (59.15%), Positives = 597/874 (68.31%), Query Frame = 1

Query: 2   ATYALLGDSTVRVNGGFDDGS----LTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPG 61
           A  ALLGD+   ++     G+    LTS S +S G ++LN    +               
Sbjct: 3   AACALLGDNGREMDANGSAGASLTPLTSTSISS-GCDQLNHHFQRA-------------- 62

Query: 62  KMVRKRIASEIEIEELDGGGGGTAAVHPRFCRRSLASDRP-----FAGG-------ENKT 121
           KMVRKR ASE+E++        T + H RF RR + +  P       GG        N  
Sbjct: 63  KMVRKRTASEVELQ--------TGSYH-RFSRRPITAMNPNPLHDMGGGGSSLSFPSNNI 122

Query: 122 NENVDNYCSSS---NPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVS 181
           +   DN  S+S   N +H  NHST+                   P  +++T +S+T+   
Sbjct: 123 SSRDDNSNSNSATPNSTHVPNHSTIS------------------PCSTNSTVTSSTN--L 182

Query: 182 SLIDSTLPVLRPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQ 241
           + ID+  P+  PQP       PAVCGFSGLPLFPPE N +      ++   P P   P  
Sbjct: 183 AYIDTLAPL--PQP-------PAVCGFSGLPLFPPERNRNTSGTLASAAFLPAPAVPPL- 242

Query: 242 VVLHNPPTSTTTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYP 301
                PP+             M+D++ATAWIDGI+KDLIHSST V IPQLIQNVR+II+P
Sbjct: 243 ----TPPS-------------MEDTTATAWIDGILKDLIHSSTNVPIPQLIQNVREIIHP 302

Query: 302 CNPNLANLLEFRLRTLTEPN-VPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQE 361
           CNPNLA++LE+RLR+LT+PN +PN+    +R RK   P        GL +    Q   Q 
Sbjct: 303 CNPNLASILEYRLRSLTDPNPIPNY---PERRRKDGPP-------VGLPRAYQQQGQVQV 362

Query: 362 QDCSGLKLNLDSSLHN----LPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQ 421
              SGLKL LDS L N    LP+ S+     + YL+WG   T  PT +       +  HQ
Sbjct: 363 SSSSGLKLYLDSGLDNLHYSLPD-SAASHVMNHYLNWGL--TQPPTTTADGQAQHLSDHQ 422

Query: 422 LNLSSVPHSSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPPTSNDPSTTALLI 481
            + SSV     +   H P   QP+QP + P + + A AAA         +  P++ A++ 
Sbjct: 423 ASPSSVAPVLSLNQVHPPQPAQPQQPQNSPQSAEPAGAAAT-------ITTAPTSAAIVT 482

Query: 482 REIKEEMRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVA 541
           +E KEE RQQKRDEEGLHLLTLLLQCAEAVSADN EEANKMLLEISELSTPFGTSAQRVA
Sbjct: 483 KEKKEETRQQKRDEEGLHLLTLLLQCAEAVSADNFEEANKMLLEISELSTPFGTSAQRVA 542

Query: 542 AYFSEAMSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAI 601
           AYFSEAMSARLVSSCLGIYA LP       HSQK+ SAFQVFNGISPFVKFSHFTANQAI
Sbjct: 543 AYFSEAMSARLVSSCLGIYATLPTV----PHSQKLVSAFQVFNGISPFVKFSHFTANQAI 602

Query: 602 QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLT 661
           QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPP+VRLTGLGTS E LEATGKRLT
Sbjct: 603 QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPFVRLTGLGTSMEALEATGKRLT 662

Query: 662 EFAEKLGLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQR 721
           +FAEKLGLPF+FFPVA+K+GNLD ERLNVS+REAVAVHW+QHSLY+VTGSD+NTLWLLQR
Sbjct: 663 DFAEKLGLPFEFFPVAEKVGNLDPERLNVSKREAVAVHWLQHSLYDVTGSDTNTLWLLQR 722

Query: 722 LAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREI 781
           LAPKVVTVVEQDLSH GSFLGRFVEAIHYYSALFDSLG SYGEESE+RH VEQ LLSREI
Sbjct: 723 LAPKVVTVVEQDLSHAGSFLGRFVEAIHYYSALFDSLGASYGEESEQRHAVEQQLLSREI 781

Query: 782 RNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDN 841
           RNVLAVGGPSRSG+VKF NWREKLQQSGF+ +SLAGNAATQATLLLGMFPSDGYTLVEDN
Sbjct: 783 RNVLAVGGPSRSGDVKFNNWREKLQQSGFRVVSLAGNAATQATLLLGMFPSDGYTLVEDN 781

Query: 842 GTLKLGWKDLCLLTASAWKP--------PFHHHA 844
           GTLKLGWKDLCLLTASAW+P        P HH+A
Sbjct: 843 GTLKLGWKDLCLLTASAWRPFHAAATTTPTHHYA 781

BLAST of Cp4.1LG19g01780 vs. TrEMBL
Match: A0A061ELM0_THECC (GRAS family transcription factor isoform 2 OS=Theobroma cacao GN=TCM_017746 PE=3 SV=1)

HSP 1 Score: 855.5 bits (2209), Expect = 5.4e-245
Identity = 501/857 (58.46%), Positives = 595/857 (69.43%), Query Frame = 1

Query: 1   MATYALLGDSTVRVNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKMV 60
           MA   L+G++   +NG        SNS  S  +   N  T +              GKM+
Sbjct: 1   MAACDLVGENGSEING-------CSNSRESPVTSASNSSTSE--------------GKMM 60

Query: 61  RKRIASEIEIEELDGGGGGTAAVHPRFCRRSLASDRPFAGGENKTNENVDNYCSSSNPSH 120
           RKR+ASEI             A + RF RRSL S  P    EN     +    +++NP+ 
Sbjct: 61  RKRMASEI-------------ADYHRFPRRSLPSHPP---SENMGCSFLAAATTANNPNP 120

Query: 121 GANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVLRPQPHHRH 180
             N+ST++    + + ++  +NL+   SG  A   +TTSN++ +    L    P P    
Sbjct: 121 LLNYSTMN----MNTTIIPSANLTAVTSGGPAFLCTTTSNITCI--DNLSTTNPPP---- 180

Query: 181 LQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTSTTTSI-IAA 240
              PAVCGFSGLPLFPP   + +                    V  +  T+TT  + +  
Sbjct: 181 ---PAVCGFSGLPLFPPTDRNRN-------------------TVAASTTTATTAPVALTP 240

Query: 241 ASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLEFRLRTLT 300
            S  MDD+SATAWIDGII+DLIH+S+ VSIPQLIQNVR+IIYPCNPNLA LLE+RLR+L 
Sbjct: 241 ISNSMDDTSATAWIDGIIRDLIHTSSNVSIPQLIQNVREIIYPCNPNLAALLEYRLRSLM 300

Query: 301 EPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQEQDCSGLKLNLDSSLHNLP 360
           +P       E +R    P+  PA     GL  R  +Q  +Q+   SGL LNLDS+L ++P
Sbjct: 301 DP------LERRRKETPPVHLPA-----GLIPRHHSQHQQQQHGSSGLTLNLDSALDSVP 360

Query: 361 NFS-SQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQLNLS-SVPHSSLIPLNHIPSK 420
           N+S ++      YL+WG TP P  + S AT   +   +Q++ S S P   ++ LN    +
Sbjct: 361 NYSFTESCAMSQYLNWGITPLPI-SNSAATGSNQHHHNQISSSPSAPTPPVLSLNQTQHQ 420

Query: 421 PQ-PEQPNSCPV---NVKAAAAAAAQPSPSPPTSNDPSTTALLIREIKEEMRQQKRDEEG 480
           PQ P Q    P+   N           + + PTS   +  A  +R+ KEE+RQQKRDEEG
Sbjct: 421 PQVPHQAQEQPLPEENSSPVEKTTTSTTTTTPTSTVQAVQACSVRDRKEELRQQKRDEEG 480

Query: 481 LHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCL 540
           LHLLTLLLQCAEAVSA+N EEAN+MLLE+S+LSTPFGTSAQRVAAYFSEAMSARLVSSCL
Sbjct: 481 LHLLTLLLQCAEAVSANNFEEANRMLLELSQLSTPFGTSAQRVAAYFSEAMSARLVSSCL 540

Query: 541 GIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLD 600
           GI A LP   +P +H+QK+ SAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLD
Sbjct: 541 GISAELPS--IPQSHTQKMVSAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLD 600

Query: 601 IMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFFPVA 660
           IMQGLQWPGLFHILASRPGGPP+VRLTGLGTS E LEATGKRL++FA+KLGLPF+F PVA
Sbjct: 601 IMQGLQWPGLFHILASRPGGPPHVRLTGLGTSLEALEATGKRLSDFADKLGLPFEFCPVA 660

Query: 661 DKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVVEQDLSHT 720
           +K+GNL+ ERLNVS+REAVAVHW+QHSLY+VTGSD+NTLWLLQRLAPKVVTVVEQDLSH 
Sbjct: 661 EKVGNLEPERLNVSKREAVAVHWLQHSLYDVTGSDTNTLWLLQRLAPKVVTVVEQDLSHA 720

Query: 721 GSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGPSRSGEVK 780
           GSFLG FVEAIHYYSALFDSLG SYGEESEERH+VEQ LLS+EIRNVLA+GGPSRS EVK
Sbjct: 721 GSFLGTFVEAIHYYSALFDSLGASYGEESEERHVVEQQLLSKEIRNVLALGGPSRSEEVK 774

Query: 781 FQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGWKDLCLLTAS 840
           F NWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNG LKLGWKDLCLLTAS
Sbjct: 781 FHNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGALKLGWKDLCLLTAS 774

Query: 841 AWKPPFHHHAAGNHIPR 851
           AW+P +   A+   I R
Sbjct: 841 AWRPFYASAASATTIHR 774

BLAST of Cp4.1LG19g01780 vs. TrEMBL
Match: A0A061EF07_THECC (GRAS family transcription factor isoform 1 OS=Theobroma cacao GN=TCM_017746 PE=3 SV=1)

HSP 1 Score: 855.5 bits (2209), Expect = 5.4e-245
Identity = 501/857 (58.46%), Positives = 595/857 (69.43%), Query Frame = 1

Query: 1   MATYALLGDSTVRVNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKMV 60
           MA   L+G++   +NG        SNS  S  +   N  T +              GKM+
Sbjct: 19  MAACDLVGENGSEING-------CSNSRESPVTSASNSSTSE--------------GKMM 78

Query: 61  RKRIASEIEIEELDGGGGGTAAVHPRFCRRSLASDRPFAGGENKTNENVDNYCSSSNPSH 120
           RKR+ASEI             A + RF RRSL S  P    EN     +    +++NP+ 
Sbjct: 79  RKRMASEI-------------ADYHRFPRRSLPSHPP---SENMGCSFLAAATTANNPNP 138

Query: 121 GANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVLRPQPHHRH 180
             N+ST++    + + ++  +NL+   SG  A   +TTSN++ +    L    P P    
Sbjct: 139 LLNYSTMN----MNTTIIPSANLTAVTSGGPAFLCTTTSNITCI--DNLSTTNPPP---- 198

Query: 181 LQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTSTTTSI-IAA 240
              PAVCGFSGLPLFPP   + +                    V  +  T+TT  + +  
Sbjct: 199 ---PAVCGFSGLPLFPPTDRNRN-------------------TVAASTTTATTAPVALTP 258

Query: 241 ASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLEFRLRTLT 300
            S  MDD+SATAWIDGII+DLIH+S+ VSIPQLIQNVR+IIYPCNPNLA LLE+RLR+L 
Sbjct: 259 ISNSMDDTSATAWIDGIIRDLIHTSSNVSIPQLIQNVREIIYPCNPNLAALLEYRLRSLM 318

Query: 301 EPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQEQDCSGLKLNLDSSLHNLP 360
           +P       E +R    P+  PA     GL  R  +Q  +Q+   SGL LNLDS+L ++P
Sbjct: 319 DP------LERRRKETPPVHLPA-----GLIPRHHSQHQQQQHGSSGLTLNLDSALDSVP 378

Query: 361 NFS-SQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQLNLS-SVPHSSLIPLNHIPSK 420
           N+S ++      YL+WG TP P  + S AT   +   +Q++ S S P   ++ LN    +
Sbjct: 379 NYSFTESCAMSQYLNWGITPLPI-SNSAATGSNQHHHNQISSSPSAPTPPVLSLNQTQHQ 438

Query: 421 PQ-PEQPNSCPV---NVKAAAAAAAQPSPSPPTSNDPSTTALLIREIKEEMRQQKRDEEG 480
           PQ P Q    P+   N           + + PTS   +  A  +R+ KEE+RQQKRDEEG
Sbjct: 439 PQVPHQAQEQPLPEENSSPVEKTTTSTTTTTPTSTVQAVQACSVRDRKEELRQQKRDEEG 498

Query: 481 LHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCL 540
           LHLLTLLLQCAEAVSA+N EEAN+MLLE+S+LSTPFGTSAQRVAAYFSEAMSARLVSSCL
Sbjct: 499 LHLLTLLLQCAEAVSANNFEEANRMLLELSQLSTPFGTSAQRVAAYFSEAMSARLVSSCL 558

Query: 541 GIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLD 600
           GI A LP   +P +H+QK+ SAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLD
Sbjct: 559 GISAELPS--IPQSHTQKMVSAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLD 618

Query: 601 IMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFFPVA 660
           IMQGLQWPGLFHILASRPGGPP+VRLTGLGTS E LEATGKRL++FA+KLGLPF+F PVA
Sbjct: 619 IMQGLQWPGLFHILASRPGGPPHVRLTGLGTSLEALEATGKRLSDFADKLGLPFEFCPVA 678

Query: 661 DKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVVEQDLSHT 720
           +K+GNL+ ERLNVS+REAVAVHW+QHSLY+VTGSD+NTLWLLQRLAPKVVTVVEQDLSH 
Sbjct: 679 EKVGNLEPERLNVSKREAVAVHWLQHSLYDVTGSDTNTLWLLQRLAPKVVTVVEQDLSHA 738

Query: 721 GSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGPSRSGEVK 780
           GSFLG FVEAIHYYSALFDSLG SYGEESEERH+VEQ LLS+EIRNVLA+GGPSRS EVK
Sbjct: 739 GSFLGTFVEAIHYYSALFDSLGASYGEESEERHVVEQQLLSKEIRNVLALGGPSRSEEVK 792

Query: 781 FQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGWKDLCLLTAS 840
           F NWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNG LKLGWKDLCLLTAS
Sbjct: 799 FHNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGALKLGWKDLCLLTAS 792

Query: 841 AWKPPFHHHAAGNHIPR 851
           AW+P +   A+   I R
Sbjct: 859 AWRPFYASAASATTIHR 792

BLAST of Cp4.1LG19g01780 vs. TAIR10
Match: AT3G54220.1 (AT3G54220.1 GRAS family transcription factor)

HSP 1 Score: 672.5 bits (1734), Expect = 3.3e-193
Identity = 342/445 (76.85%), Positives = 386/445 (86.74%), Query Frame = 1

Query: 408 LIPLNHIPSKPQPEQ----------PNSCPVNVKAA--AAAAAQPSPSPPTSNDPST--- 467
           L  +++ PS PQ +Q          P   P+  +    ++  A P P   T+  P+    
Sbjct: 207 LYQISNNPSPPQQQQQHQQQQQQHKPPPPPIQQQERENSSTDAPPQPETVTATVPAVQTN 266

Query: 468 TALLIREIKEEMRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTS 527
           TA  +RE KEE+++QK+DEEGLHLLTLLLQCAEAVSADNLEEANK+LLEIS+LSTP+GTS
Sbjct: 267 TAEALRERKEEIKRQKQDEEGLHLLTLLLQCAEAVSADNLEEANKLLLEISQLSTPYGTS 326

Query: 528 AQRVAAYFSEAMSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFT 587
           AQRVAAYFSEAMSARL++SCLGIYAALP   +P THS K+ SAFQVFNGISP VKFSHFT
Sbjct: 327 AQRVAAYFSEAMSARLLNSCLGIYAALPSRWMPQTHSLKMVSAFQVFNGISPLVKFSHFT 386

Query: 588 ANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEAT 647
           ANQAIQEAFE+E+ VHIIDLDIMQGLQWPGLFHILASRPGGPP+VRLTGLGTS E L+AT
Sbjct: 387 ANQAIQEAFEKEDSVHIIDLDIMQGLQWPGLFHILASRPGGPPHVRLTGLGTSMEALQAT 446

Query: 648 GKRLTEFAEKLGLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTL 707
           GKRL++FA+KLGLPF+F P+A+K+GNLD ERLNV +REAVAVHW+QHSLY+VTGSD++TL
Sbjct: 447 GKRLSDFADKLGLPFEFCPLAEKVGNLDTERLNVRKREAVAVHWLQHSLYDVTGSDAHTL 506

Query: 708 WLLQRLAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLL 767
           WLLQRLAPKVVTVVEQDLSH GSFLGRFVEAIHYYSALFDSLG SYGEESEERH+VEQ L
Sbjct: 507 WLLQRLAPKVVTVVEQDLSHAGSFLGRFVEAIHYYSALFDSLGASYGEESEERHVVEQQL 566

Query: 768 LSREIRNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYT 827
           LS+EIRNVLAVGGPSRSGEVKF++WREK+QQ GFKGISLAGNAATQATLLLGMFPSDGYT
Sbjct: 567 LSKEIRNVLAVGGPSRSGEVKFESWREKMQQCGFKGISLAGNAATQATLLLGMFPSDGYT 626

Query: 828 LVEDNGTLKLGWKDLCLLTASAWKP 838
           LV+DNGTLKLGWKDL LLTASAW P
Sbjct: 627 LVDDNGTLKLGWKDLSLLTASAWTP 651

BLAST of Cp4.1LG19g01780 vs. TAIR10
Match: AT5G41920.1 (AT5G41920.1 GRAS family transcription factor)

HSP 1 Score: 423.7 bits (1088), Expect = 2.7e-118
Identity = 223/396 (56.31%), Positives = 285/396 (71.97%), Query Frame = 1

Query: 445 PTSNDPSTTALLIREIKEEMRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISE 504
           P+S+DPS+    I   +E +         + LL+LLLQCAE V+ D+L EA+ +L EISE
Sbjct: 11  PSSDDPSSAKRRIEFPEETLEND--GAAAIKLLSLLLQCAEYVATDHLREASTLLSEISE 70

Query: 505 LSTPFGTSAQRVAAYFSEAMSARLVSSCL-GIYAALPPTLLPHTHSQKIASAFQVFNGIS 564
           + +PFG+S +RV AYF++A+  R++SS L G  + L    L    SQKI SA Q +N +S
Sbjct: 71  ICSPFGSSPERVVAYFAQALQTRVISSYLSGACSPLSEKPLTVVQSQKIFSALQTYNSVS 130

Query: 565 PFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLG 624
           P +KFSHFTANQAI +A + E+ VHIIDLD+MQGLQWP LFHILASRP     +R+TG G
Sbjct: 131 PLIKFSHFTANQAIFQALDGEDSVHIIDLDVMQGLQWPALFHILASRPRKLRSIRITGFG 190

Query: 625 TSQEVLEATGKRLTEFAEKLGLPFDFFPVADKIGNL-DLERLNVSQREAVAVHWMQHSLY 684
           +S ++L +TG+RL +FA  L LPF+F P+   IGNL D  +L   Q EAV VHWMQH LY
Sbjct: 191 SSSDLLASTGRRLADFASSLNLPFEFHPIEGIIGNLIDPSQLATRQGEAVVVHWMQHRLY 250

Query: 685 EVTGSDSNTLWLLQRLAPKVVTVVEQDLSHT--GSFLGRFVEAIHYYSALFDSLGVSYGE 744
           +VTG++  TL +L+RL P ++TVVEQ+LS+   GSFLGRFVEA+HYYSALFD+LG   GE
Sbjct: 251 DVTGNNLETLEILRRLKPNLITVVEQELSYDDGGSFLGRFVEALHYYSALFDALGDGLGE 310

Query: 745 ESEERHLVEQLLLSREIRNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQAT 804
           ES ER  VEQ++L  EIRN++A GG    G  K   W+E+L + GF+ +SL GN ATQA 
Sbjct: 311 ESGERFTVEQIVLGTEIRNIVAHGG----GRRKRMKWKEELSRVGFRPVSLRGNPATQAG 370

Query: 805 LLLGMFPSDGYTLVEDNGTLKLGWKDLCLLTASAWK 837
           LLLGM P +GYTLVE+NGTL+LGWKDL LLTASAWK
Sbjct: 371 LLLGMLPWNGYTLVEENGTLRLGWKDLSLLTASAWK 400

BLAST of Cp4.1LG19g01780 vs. TAIR10
Match: AT1G14920.1 (AT1G14920.1 GRAS family transcription factor family protein)

HSP 1 Score: 233.8 bits (595), Expect = 3.9e-61
Identity = 144/380 (37.89%), Positives = 216/380 (56.84%), Query Frame = 1

Query: 471 EEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVS 530
           E G+ L+  LL CAEAV  +NL  A  ++ +I  L+     + ++VA YF+EA++ R+  
Sbjct: 164 ENGVRLVHALLACAEAVQKENLTVAEALVKQIGFLAVSQIGAMRKVATYFAEALARRIYR 223

Query: 531 SCLGIYAALPPTLLPHTHSQKIASAFQV-FNGISPFVKFSHFTANQAIQEAFEREERVHI 590
                   L P+  P  HS  ++   Q+ F    P++KF+HFTANQAI EAF+ ++RVH+
Sbjct: 224 --------LSPSQSPIDHS--LSDTLQMHFYETCPYLKFAHFTANQAILEAFQGKKRVHV 283

Query: 591 IDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLG----TSQEVLEATGKRLTEFAEKLGL 650
           ID  + QGLQWP L   LA RPGGPP  RLTG+G     + + L   G +L   AE + +
Sbjct: 284 IDFSMSQGLQWPALMQALALRPGGPPVFRLTGIGPPAPDNFDYLHEVGCKLAHLAEAIHV 343

Query: 651 PFDFFP-VADKIGNLDLERLNV--SQREAVAVH--WMQHSLYEVTGSDSNTLWLLQRLAP 710
            F++   VA+ + +LD   L +  S+ E+VAV+  +  H L    G+    L ++ ++ P
Sbjct: 344 EFEYRGFVANTLADLDASMLELRPSEIESVAVNSVFELHKLLGRPGAIDKVLGVVNQIKP 403

Query: 711 KVVTVVEQDLSHTGS-FLGRFVEAIHYYSALFDSL-GVSYGEESEERHLVEQLLLSREIR 770
           ++ TVVEQ+ +H    FL RF E++HYYS LFDSL GV  G++     ++ ++ L ++I 
Sbjct: 404 EIFTVVEQESNHNSPIFLDRFTESLHYYSTLFDSLEGVPSGQDK----VMSEVYLGKQIC 463

Query: 771 NVLAVGGPSR-SGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMF-PSDGYTLVED 830
           NV+A  GP R         WR +   +GF    +  NA  QA++LL +F   +GY + E 
Sbjct: 464 NVVACDGPDRVERHETLSQWRNRFGSAGFAAAHIGSNAFKQASMLLALFNGGEGYRVEES 523

Query: 831 NGTLKLGWKDLCLLTASAWK 837
           +G L LGW    L+  SAWK
Sbjct: 524 DGCLMLGWHTRPLIATSAWK 529

BLAST of Cp4.1LG19g01780 vs. TAIR10
Match: AT1G63100.1 (AT1G63100.1 GRAS family transcription factor)

HSP 1 Score: 232.6 bits (592), Expect = 8.7e-61
Identity = 141/385 (36.62%), Positives = 212/385 (55.06%), Query Frame = 1

Query: 471 EEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFG-TSAQRVAAYFSEAMSARLV 530
           +    L+ LL  C +A+ + N+   N  +    +L++P G T   R+ AY+ EA++ R+ 
Sbjct: 269 QRDFELVNLLTGCLDAIRSRNIAAINHFIARTGDLASPRGRTPMTRLIAYYIEALALRVA 328

Query: 531 SSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHI 590
                I+   PP     T   +  +A +  N ++P  KF HFTAN+ +  AFE +ERVHI
Sbjct: 329 RMWPHIFHIAPPREFDRTVEDESGNALRFLNQVTPIPKFIHFTANEMLLRAFEGKERVHI 388

Query: 591 IDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDF 650
           ID DI QGLQWP  F  LASR   P +VR+TG+G S+  L  TG RL  FAE + L F+F
Sbjct: 389 IDFDIKQGLQWPSFFQSLASRINPPHHVRITGIGESKLELNETGDRLHGFAEAMNLQFEF 448

Query: 651 FPVADKIGNLDLERLNVSQREAVAVH---WMQHSLYEVTGSD-SNTLWLLQRLAPKVVTV 710
            PV D++ ++ L  L+V + E+VAV+    M  +LY+ TG+   + L L++   P  + +
Sbjct: 449 HPVVDRLEDVRLWMLHVKEGESVAVNCVMQMHKTLYDGTGAAIRDFLGLIRSTNPIALVL 508

Query: 711 VEQDLSHTGSFL-GRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVG 770
            EQ+  H    L  R   ++ YYSA+FD++  +   +S  R  VE++L  REIRN++A  
Sbjct: 509 AEQEAEHNSEQLETRVCNSLKYYSAMFDAIHTNLATDSLMRVKVEEMLFGREIRNIVACE 568

Query: 771 GPSR-SGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSD--GYTLV----EDN 830
           G  R    V F++WR  L+Q GF+ + ++     Q+ +LL M+ SD  G+  V    EDN
Sbjct: 569 GSHRQERHVGFRHWRRMLEQLGFRSLGVSEREVLQSKMLLRMYGSDNEGFFNVERSDEDN 628

Query: 831 -------GTLKLGWKDLCLLTASAW 836
                  G + L W +  L T SAW
Sbjct: 629 GGEGGRGGGVTLRWSEQPLYTISAW 653

BLAST of Cp4.1LG19g01780 vs. TAIR10
Match: AT1G66350.1 (AT1G66350.1 RGA-like 1)

HSP 1 Score: 231.1 bits (588), Expect = 2.5e-60
Identity = 141/373 (37.80%), Positives = 217/373 (58.18%), Query Frame = 1

Query: 471 EEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVS 530
           E G+ L+  LL CAEAV  +NL+ A+ ++  +  L++    + ++VA YF+E ++ R+  
Sbjct: 147 ETGVRLVHALLACAEAVQQNNLKLADALVKHVGLLASSQAGAMRKVATYFAEGLARRIYR 206

Query: 531 SCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHII 590
               IY      L   + + +I      F    P++KF+HFTANQAI E F   E+VH+I
Sbjct: 207 ----IYPRDDVALSSFSDTLQIH-----FYESCPYLKFAHFTANQAILEVFATAEKVHVI 266

Query: 591 DLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFF 650
           DL +  GLQWP L   LA RP GPP  RLTG+G S   ++  G +L + A  +G+ F+F 
Sbjct: 267 DLGLNHGLQWPALIQALALRPNGPPDFRLTGIGYSLTDIQEVGWKLGQLASTIGVNFEFK 326

Query: 651 PVA-DKIGNLDLERLNVSQ-REAVAVH--WMQHSLYEVTGSDSNTLWLLQRLAPKVVTVV 710
            +A + + +L  E L++    E+VAV+  +  H L    GS    L  ++ + P ++TVV
Sbjct: 327 SIALNNLSDLKPEMLDIRPGLESVAVNSVFELHRLLAHPGSIDKFLSTIKSIRPDIMTVV 386

Query: 711 EQDLSHTGS-FLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGG 770
           EQ+ +H G+ FL RF E++HYYS+LFDSL    G  S++R ++ +L L R+I N++A  G
Sbjct: 387 EQEANHNGTVFLDRFTESLHYYSSLFDSL---EGPPSQDR-VMSELFLGRQILNLVACEG 446

Query: 771 PSRSGEVKFQN-WREKLQQSGFKGISLAGNAATQATLLLGMFP-SDGYTLVEDNGTLKLG 830
             R    +  N WR +    GFK +S+  NA  QA++LL ++  +DGY + E+ G L LG
Sbjct: 447 EDRVERHETLNQWRNRFGLGGFKPVSIGSNAYKQASMLLALYAGADGYNVEENEGCLLLG 506

Query: 831 WKDLCLLTASAWK 837
           W+   L+  SAW+
Sbjct: 507 WQTRPLIATSAWR 506

BLAST of Cp4.1LG19g01780 vs. NCBI nr
Match: gi|700198807|gb|KGN53965.1| (hypothetical protein Csa_4G196810 [Cucumis sativus])

HSP 1 Score: 1436.8 bits (3718), Expect = 0.0e+00
Identity = 759/875 (86.74%), Positives = 788/875 (90.06%), Query Frame = 1

Query: 1   MATYALLGDSTVR-VNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKM 60
           MA YALL DST R VNGGFDD  LTS STNSNGS+ELN Q +   VQV  P PRLP GKM
Sbjct: 1   MAAYALLNDSTPRGVNGGFDDSPLTSASTNSNGSDELNHQQI---VQV--PQPRLPVGKM 60

Query: 61  VRKRIASEIEIEELDGGGGG-------TAAVHPRFCRRSLASDRPFAGGENKTNENVDNY 120
           VRKRIASE+EIE LD GGGG       T AVHPRFCRR+LASDRPF  GENKTN N   Y
Sbjct: 61  VRKRIASEMEIEGLDSGGGGGGGGSGGTTAVHPRFCRRTLASDRPF--GENKTNVN---Y 120

Query: 121 CSSSNPSHGANHSTV-HNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPV 180
           CSSSNPSHG NHSTV HNLTALTSVV+ GSNLSNPPSGSDAT SSTTSN ++L+DSTLPV
Sbjct: 121 CSSSNPSHGGNHSTVVHNLTALTSVVIEGSNLSNPPSGSDATVSSTTSN-NNLLDSTLPV 180

Query: 181 LRPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTS 240
           LRPQPHH HLQNPAVCGFSGLPLFPPESNHHH  NKLN+RNNPFP+PNP QV+LHNPPT+
Sbjct: 181 LRPQPHHHHLQNPAVCGFSGLPLFPPESNHHH--NKLNTRNNPFPLPNPSQVLLHNPPTT 240

Query: 241 TTTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLL 300
            TTSIIAAAS+PMDDSSATAWIDGIIKDLIHSSTA+SIPQLIQNVR+IIYPCNPNLANLL
Sbjct: 241 ATTSIIAAASSPMDDSSATAWIDGIIKDLIHSSTAISIPQLIQNVREIIYPCNPNLANLL 300

Query: 301 EFRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQE-HEQEQDCSGLKLN 360
           EFRLRTLT+P+VPNFA ED RVRKSPLP PAPV G GLQQRQFNQE HEQE DCSGLKLN
Sbjct: 301 EFRLRTLTDPSVPNFATEDHRVRKSPLPLPAPVAGLGLQQRQFNQEQHEQEHDCSGLKLN 360

Query: 361 LDS-SLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGE----VPGH-QLNLSSVP 420
           LDS SLHNL NF SQPPFH+PYL WGATP P PTPS A  G +    +PGH QLNLSSV 
Sbjct: 361 LDSTSLHNLSNFPSQPPFHEPYLQWGATPPPVPTPSAAAAGEDALQRLPGHHQLNLSSVT 420

Query: 421 HSSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPP-TSNDPSTTALLIREIKEE 480
            SSL+ LNH+PSKPQ EQ NSC       AAAAAQP+P+PP TSN+PS TALLIREIKEE
Sbjct: 421 PSSLVSLNHVPSKPQSEQQNSC-----TKAAAAAQPAPAPPSTSNNPSATALLIREIKEE 480

Query: 481 MRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEA 540
           MRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEA
Sbjct: 481 MRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEA 540

Query: 541 MSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFER 600
           MSARLVSSCLGIYAALPP+L+PHTHSQKIASAFQ+FNGISPFVKFSHFTANQAIQEAFER
Sbjct: 541 MSARLVSSCLGIYAALPPSLVPHTHSQKIASAFQIFNGISPFVKFSHFTANQAIQEAFER 600

Query: 601 EERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKL 660
           EERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKL
Sbjct: 601 EERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKL 660

Query: 661 GLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVV 720
           GLPFDFFPVADKIGNLDLERLNVS+REAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVV
Sbjct: 661 GLPFDFFPVADKIGNLDLERLNVSKREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVV 720

Query: 721 TVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAV 780
           TVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQ LLSREIRNVLAV
Sbjct: 721 TVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQQLLSREIRNVLAV 780

Query: 781 GGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLG 840
           GGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLG
Sbjct: 781 GGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLG 840

Query: 841 WKDLCLLTASAWKPPFHHHAA-------GNHIPRY 852
           WKDLCLLTASAWKPPFHHHAA        NHIPRY
Sbjct: 841 WKDLCLLTASAWKPPFHHHAAAAAAAVTNNHIPRY 857

BLAST of Cp4.1LG19g01780 vs. NCBI nr
Match: gi|659126706|ref|XP_008463324.1| (PREDICTED: LOW QUALITY PROTEIN: protein SCARECROW-like [Cucumis melo])

HSP 1 Score: 1365.9 bits (3534), Expect = 0.0e+00
Identity = 730/873 (83.62%), Positives = 760/873 (87.06%), Query Frame = 1

Query: 1   MATYALLGDSTVR-VNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKM 60
           MA YALL DST R VNGGFDD  LTS STNSNGS+ELN Q +   VQV  P PRLP GKM
Sbjct: 1   MAAYALLNDSTPRGVNGGFDDSPLTSASTNSNGSDELNHQQI---VQV--PQPRLPVGKM 60

Query: 61  VRKRIASEIEIEELDGGGGGTAAVHPRFC-------RRSLASDRPFAGGENKTNENVDNY 120
           VRKRIASE+EIE LD GGGG      R C        RSLASDRP    E      +   
Sbjct: 61  VRKRIASEMEIEGLDSGGGGGGGGGGRCCCCSSTVLPRSLASDRPL---EKIRRIXIIVL 120

Query: 121 CSSSNPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVL 180
              +            NLTALTSVV+ GSNLSNPPSGSDAT SSTTSN ++L+DSTLPVL
Sbjct: 121 LLQTLAMAATTPLLCXNLTALTSVVIEGSNLSNPPSGSDATVSSTTSN-NNLLDSTLPVL 180

Query: 181 RPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTST 240
           RPQPHH HLQNPAVCGFSGLPLFPPESNHHH  NKLN+RNNPFP+PNP QV+LHNPPT+ 
Sbjct: 181 RPQPHHHHLQNPAVCGFSGLPLFPPESNHHH--NKLNTRNNPFPLPNPSQVLLHNPPTTA 240

Query: 241 TTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLE 300
           TTSIIAAAS+PMDDSSATAWIDGIIKDLIHSSTA+SIPQLIQNVR+IIYPCNPNLANLLE
Sbjct: 241 TTSIIAAASSPMDDSSATAWIDGIIKDLIHSSTAISIPQLIQNVREIIYPCNPNLANLLE 300

Query: 301 FRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQE-HEQEQDCSGLKLNL 360
           FRLRTLT+P+VPNFA ED RVRKSPLP PAPV G GLQQRQFNQE HEQE DCSGLKLNL
Sbjct: 301 FRLRTLTDPSVPNFATEDHRVRKSPLPLPAPVAGLGLQQRQFNQEQHEQEHDCSGLKLNL 360

Query: 361 DS-SLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGE----VPGH-QLNLSSVPH 420
           DS SLHNL NF SQPPFH+PYL WGATP P PTPS A  G +    +PGH QLNLSSV  
Sbjct: 361 DSTSLHNLSNFPSQPPFHEPYLQWGATPPPVPTPSAAAAGEDALQRLPGHHQLNLSSVTP 420

Query: 421 SSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPP-TSNDPSTTALLIREIKEEM 480
           SSL+PLNH+PSKPQ EQ NS        AAAAAQP+P+PP TSN+PS TALLIREIKEEM
Sbjct: 421 SSLVPLNHVPSKPQSEQQNS-----STKAAAAAQPAPAPPSTSNNPSATALLIREIKEEM 480

Query: 481 RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM 540
           RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM
Sbjct: 481 RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM 540

Query: 541 SARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFERE 600
           SARLVSSCLGIYAALPP+L+PHTHSQKIASAFQ+FNGISPFVKFSHFTANQAIQEAFERE
Sbjct: 541 SARLVSSCLGIYAALPPSLVPHTHSQKIASAFQIFNGISPFVKFSHFTANQAIQEAFERE 600

Query: 601 ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG 660
           ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG
Sbjct: 601 ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG 660

Query: 661 LPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT 720
           LPFDFFPVADKIGNLDLERLNVS+REAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT
Sbjct: 661 LPFDFFPVADKIGNLDLERLNVSKREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT 720

Query: 721 VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVG 780
           VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQ LLSREIRNVLAVG
Sbjct: 721 VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQQLLSREIRNVLAVG 780

Query: 781 GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW 840
           GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW
Sbjct: 781 GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW 840

Query: 841 KDLCLLTASAWKPPFHHHAA------GNHIPRY 852
           KDLCLLTASAWKPPFHHHAA       NHIPRY
Sbjct: 841 KDLCLLTASAWKPPFHHHAAAAVAVTNNHIPRY 857

BLAST of Cp4.1LG19g01780 vs. NCBI nr
Match: gi|821595353|ref|NP_001295787.1| (protein SCARECROW 1 [Cucumis sativus])

HSP 1 Score: 1361.3 bits (3522), Expect = 0.0e+00
Identity = 729/875 (83.31%), Positives = 760/875 (86.86%), Query Frame = 1

Query: 1   MATYALLGDSTVR-VNGGFDDGSLTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPGKM 60
           MA YALL DST R VNGGFDD  LTS STNSNGS+ELN Q +   VQV  P PRLP GKM
Sbjct: 1   MAAYALLNDSTPRGVNGGFDDSPLTSASTNSNGSDELNHQQI---VQV--PQPRLPVGKM 60

Query: 61  VRKRIASEIEIEELDGGGGGTAAVHPRF--CR-----RSLASDRPFAGGENKTNENVDNY 120
           VRKRIASE+EIE LD GGGG      R+  C      RSLASDRP      K        
Sbjct: 61  VRKRIASEMEIEGLDSGGGGGGGGSRRYYCCSSTVLPRSLASDRPL----EKIRRIGIIV 120

Query: 121 CSSSNPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLIDSTLPVL 180
              +            NLTALTSVV+ GSNLSNPPSGSDAT SSTTSN ++L+DSTLPVL
Sbjct: 121 LLQTLAMAATTPLLCINLTALTSVVIEGSNLSNPPSGSDATVSSTTSN-NNLLDSTLPVL 180

Query: 181 RPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQVVLHNPPTST 240
           RPQPHH HLQNPAVCGFSGLPLFPPESNHHH  NKLN+RNNPFP+PNP QV+LHNPPT+ 
Sbjct: 181 RPQPHHHHLQNPAVCGFSGLPLFPPESNHHH--NKLNTRNNPFPLPNPSQVLLHNPPTTA 240

Query: 241 TTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYPCNPNLANLLE 300
           TTSIIAAAS+PMDDSSATAWIDGIIKDLIHSSTA+SIPQLIQNVR+IIYPCNPNLANLLE
Sbjct: 241 TTSIIAAASSPMDDSSATAWIDGIIKDLIHSSTAISIPQLIQNVREIIYPCNPNLANLLE 300

Query: 301 FRLRTLTEPNVPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQE-HEQEQDCSGLKLNL 360
           FRLRTLT+P+VPNFA ED RVRKSPLP PAPV G GLQQRQFNQE HEQE DCSGLKLNL
Sbjct: 301 FRLRTLTDPSVPNFATEDHRVRKSPLPLPAPVAGLGLQQRQFNQEQHEQEHDCSGLKLNL 360

Query: 361 DS-SLHNLPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGE----VPGH-QLNLSSVPH 420
           DS SLHNL NF SQPPFH+PYL WGATP P PTPS A  G +    +PGH QLN+SSV  
Sbjct: 361 DSTSLHNLSNFPSQPPFHEPYLQWGATPPPVPTPSAAAAGEDALQRLPGHHQLNISSVTP 420

Query: 421 SSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPP-TSNDPSTTALLIREIKEEM 480
           SSL+ LNH+PSKPQ EQ NSC       AAAAAQP+P+PP TSN+PS TALLIREIKEEM
Sbjct: 421 SSLVSLNHVPSKPQSEQQNSC-----TKAAAAAQPAPAPPSTSNNPSATALLIREIKEEM 480

Query: 481 RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM 540
           RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM
Sbjct: 481 RQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAM 540

Query: 541 SARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAIQEAFERE 600
           SARLVSSCLGIYAALPP+L+PHTHSQKIASAFQ+FNGISPFVKFSHFTANQAIQEAFERE
Sbjct: 541 SARLVSSCLGIYAALPPSLVPHTHSQKIASAFQIFNGISPFVKFSHFTANQAIQEAFERE 600

Query: 601 ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG 660
           ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG
Sbjct: 601 ERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLG 660

Query: 661 LPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT 720
           LPFDFFPVADKIGNLDLERLNVS+REAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT
Sbjct: 661 LPFDFFPVADKIGNLDLERLNVSKREAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVT 720

Query: 721 VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVG 780
           VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQ LLSREIRNVLAVG
Sbjct: 721 VVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQQLLSREIRNVLAVG 780

Query: 781 GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW 840
           GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW
Sbjct: 781 GPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDNGTLKLGW 840

Query: 841 KDLCLLTASAWKPPFHHHAA--------GNHIPRY 852
           KDLCLLTASAWKPPFHHHAA         NHIPRY
Sbjct: 841 KDLCLLTASAWKPPFHHHAAAAAAAAVTNNHIPRY 858

BLAST of Cp4.1LG19g01780 vs. NCBI nr
Match: gi|645221238|ref|XP_008244193.1| (PREDICTED: protein SCARECROW-like [Prunus mume])

HSP 1 Score: 864.4 bits (2232), Expect = 1.7e-247
Identity = 518/891 (58.14%), Positives = 603/891 (67.68%), Query Frame = 1

Query: 1   MATYALLGD------------STVRVNGGFDDGSLTSNS-TNSNGSEELNQQTVQVPVQV 60
           MA  ALLGD            + +   GG     +TS + +NS GS    Q   Q   Q 
Sbjct: 1   MAACALLGDHNGEHISGNGSSNNISHGGGSPSCPMTSTTNSNSQGSSVEQQPPRQHQNQQ 60

Query: 61  SQPPPRLPPGKMVRKRIASEIEIEELDGGGGGTAAVHPRFCRRS--LASDRPFAGGENKT 120
            Q        KMVRKR+A EIE++        +A+ + R  RRS  + ++ P      K 
Sbjct: 61  QQQRQSTEGSKMVRKRMACEIEVQNYPTSRNTSASDYMRLSRRSSSIINNNPNPNA-TKV 120

Query: 121 NENVDNYCSSSNPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVSSLI 180
           N N   Y     P++      V + T LT++  AG  LS P S S ++A+++ +N   + 
Sbjct: 121 NNNSMVY-----PNYSTMLLPVPSSTNLTTLTSAGGALS-PASASASSAAASAANWGPID 180

Query: 181 DSTLPVLRPQ----PHHRHLQ----NPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPI 240
             +L          PH   LQ     PAVCGFSGLPLFPPE                   
Sbjct: 181 PLSLHHHHQSGALPPHQLQLQPKTLTPAVCGFSGLPLFPPEKT----------------- 240

Query: 241 PNPCQVVLHNPPTSTTTSIIAAASTPMDDSS-ATAWIDGIIKDLIHSSTAVSIPQLIQNV 300
                      P++ +T+  ++ S  M+DSS ATAWIDGIIKDLIHSST VSIPQLI NV
Sbjct: 241 ----------TPSNQSTATPSSISISMEDSSSATAWIDGIIKDLIHSSTNVSIPQLIHNV 300

Query: 301 RDIIYPCNPNLANLLEFRLRTLTEPN-----VPNF---AAEDQRVRKSPLPSPAPVGGSG 360
           R+II+PCNPNLA+LLE+RLR+++EP      +PNF      + R R+  L          
Sbjct: 301 REIIFPCNPNLASLLEYRLRSISEPPPPPPPIPNFNPTTVPELRRRRETLQ--------- 360

Query: 361 LQQRQFNQEHEQEQDCSGLKLNLDSS-LHNLPNFSSQPPF------------HDPYLH-W 420
           LQQ+Q    H   Q    LKLNLDS+ LH++  F++                +D YLH W
Sbjct: 361 LQQQQNQHHHHHHQGPGALKLNLDSAALHDVAIFTNPTTVETASVATHVMNSNDLYLHSW 420

Query: 421 -----GATPTPAPTPSVATTGGEVPGHQLNLSSVPHSSLIPLNHIPSKPQPEQPNSCPVN 480
                GA PTP    + + T    P    N  ++ H+    L +  S   P   ++ P  
Sbjct: 421 TGGGGGAGPTPI---TCSQTNPHHPNSPFN-QAIHHTQDKQLEN-SSSSSPAAESTTPTA 480

Query: 481 VKAAAAAAAQPSPSPPTSNDPSTTALLIREIKEEMRQQKRDEEGLHLLTLLLQCAEAVSA 540
             A   A   P+P P T   PS    LIRE KEEMRQQKRDEEGLHLLTLLLQCAEAVSA
Sbjct: 481 APATTTATTTPTPPPTT---PSAAVSLIRERKEEMRQQKRDEEGLHLLTLLLQCAEAVSA 540

Query: 541 DNLEEANKMLLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCLGIYAALPPTLLPHTHS 600
           DN +EA K+LLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCLGIYA+LPP+ +P +H+
Sbjct: 541 DNFDEATKILLEISELSTPFGTSAQRVAAYFSEAMSARLVSSCLGIYASLPPSYVPISHT 600

Query: 601 QKIASAFQVFNGISPFVKFSHFTANQAIQEAFEREERVHIIDLDIMQGLQWPGLFHILAS 660
           QK+ SAFQVFNGISPFVKFSHFTANQAIQEAFERE+RVHI+DLDIMQGLQWPGLFHILAS
Sbjct: 601 QKMVSAFQVFNGISPFVKFSHFTANQAIQEAFEREDRVHIVDLDIMQGLQWPGLFHILAS 660

Query: 661 RPGGPPYVRLTGLGTSQEVLEATGKRLTEFAEKLGLPFDFFPVADKIGNLDLERLNVSQR 720
           RPGGPPYVRLTGLGTS E LEATGKRL++FA+KLGLPF+FFPVA+K+G+LD ERLN+S+R
Sbjct: 661 RPGGPPYVRLTGLGTSMEALEATGKRLSDFADKLGLPFEFFPVAEKVGSLDPERLNISKR 720

Query: 721 EAVAVHWMQHSLYEVTGSDSNTLWLLQRLAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSA 780
           EAVAVHW+QHSLY+VTGSDSNTLWLLQRLAPKVVTVVEQDLSH GSFLGRFVEAIHYYSA
Sbjct: 721 EAVAVHWLQHSLYDVTGSDSNTLWLLQRLAPKVVTVVEQDLSHAGSFLGRFVEAIHYYSA 780

Query: 781 LFDSLGVSYGEESEERHLVEQLLLSREIRNVLAVGGPSRSGEVKFQNWREKLQQSGFKGI 840
           LFDSLG SYGEESEERH+VEQ LLSREIRNVLAVGGPSRSGEVKF NWREK QQSGF+GI
Sbjct: 781 LFDSLGASYGEESEERHVVEQQLLSREIRNVLAVGGPSRSGEVKFHNWREKFQQSGFRGI 840

BLAST of Cp4.1LG19g01780 vs. NCBI nr
Match: gi|225439035|ref|XP_002264349.1| (PREDICTED: protein SCARECROW [Vitis vinifera])

HSP 1 Score: 862.4 bits (2227), Expect = 6.4e-247
Identity = 517/874 (59.15%), Positives = 597/874 (68.31%), Query Frame = 1

Query: 2   ATYALLGDSTVRVNGGFDDGS----LTSNSTNSNGSEELNQQTVQVPVQVSQPPPRLPPG 61
           A  ALLGD+   ++     G+    LTS S +S G ++LN    +               
Sbjct: 3   AACALLGDNGREMDANGSAGASLTPLTSTSISS-GCDQLNHHFQRA-------------- 62

Query: 62  KMVRKRIASEIEIEELDGGGGGTAAVHPRFCRRSLASDRP-----FAGG-------ENKT 121
           KMVRKR ASE+E++        T + H RF RR + +  P       GG        N  
Sbjct: 63  KMVRKRTASEVELQ--------TGSYH-RFSRRPITAMNPNPLHDMGGGGSSLSFPSNNI 122

Query: 122 NENVDNYCSSS---NPSHGANHSTVHNLTALTSVVVAGSNLSNPPSGSDATASSTTSNVS 181
           +   DN  S+S   N +H  NHST+                   P  +++T +S+T+   
Sbjct: 123 SSRDDNSNSNSATPNSTHVPNHSTIS------------------PCSTNSTVTSSTN--L 182

Query: 182 SLIDSTLPVLRPQPHHRHLQNPAVCGFSGLPLFPPESNHHHHHNKLNSRNNPFPIPNPCQ 241
           + ID+  P+  PQP       PAVCGFSGLPLFPPE N +      ++   P P   P  
Sbjct: 183 AYIDTLAPL--PQP-------PAVCGFSGLPLFPPERNRNTSGTLASAAFLPAPAVPPL- 242

Query: 242 VVLHNPPTSTTTSIIAAASTPMDDSSATAWIDGIIKDLIHSSTAVSIPQLIQNVRDIIYP 301
                PP+             M+D++ATAWIDGI+KDLIHSST V IPQLIQNVR+II+P
Sbjct: 243 ----TPPS-------------MEDTTATAWIDGILKDLIHSSTNVPIPQLIQNVREIIHP 302

Query: 302 CNPNLANLLEFRLRTLTEPN-VPNFAAEDQRVRKSPLPSPAPVGGSGLQQRQFNQEHEQE 361
           CNPNLA++LE+RLR+LT+PN +PN+    +R RK   P        GL +    Q   Q 
Sbjct: 303 CNPNLASILEYRLRSLTDPNPIPNY---PERRRKDGPP-------VGLPRAYQQQGQVQV 362

Query: 362 QDCSGLKLNLDSSLHN----LPNFSSQPPFHDPYLHWGATPTPAPTPSVATTGGEVPGHQ 421
              SGLKL LDS L N    LP+ S+     + YL+WG   T  PT +       +  HQ
Sbjct: 363 SSSSGLKLYLDSGLDNLHYSLPD-SAASHVMNHYLNWGL--TQPPTTTADGQAQHLSDHQ 422

Query: 422 LNLSSVPHSSLIPLNHIPSKPQPEQPNSCPVNVKAAAAAAAQPSPSPPTSNDPSTTALLI 481
            + SSV     +   H P   QP+QP + P + + A AAA         +  P++ A++ 
Sbjct: 423 ASPSSVAPVLSLNQVHPPQPAQPQQPQNSPQSAEPAGAAAT-------ITTAPTSAAIVT 482

Query: 482 REIKEEMRQQKRDEEGLHLLTLLLQCAEAVSADNLEEANKMLLEISELSTPFGTSAQRVA 541
           +E KEE RQQKRDEEGLHLLTLLLQCAEAVSADN EEANKMLLEISELSTPFGTSAQRVA
Sbjct: 483 KEKKEETRQQKRDEEGLHLLTLLLQCAEAVSADNFEEANKMLLEISELSTPFGTSAQRVA 542

Query: 542 AYFSEAMSARLVSSCLGIYAALPPTLLPHTHSQKIASAFQVFNGISPFVKFSHFTANQAI 601
           AYFSEAMSARLVSSCLGIYA LP       HSQK+ SAFQVFNGISPFVKFSHFTANQAI
Sbjct: 543 AYFSEAMSARLVSSCLGIYATLPTV----PHSQKLVSAFQVFNGISPFVKFSHFTANQAI 602

Query: 602 QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPYVRLTGLGTSQEVLEATGKRLT 661
           QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPP+VRLTGLGTS E LEATGKRLT
Sbjct: 603 QEAFEREERVHIIDLDIMQGLQWPGLFHILASRPGGPPFVRLTGLGTSMEALEATGKRLT 662

Query: 662 EFAEKLGLPFDFFPVADKIGNLDLERLNVSQREAVAVHWMQHSLYEVTGSDSNTLWLLQR 721
           +FAEKLGLPF+FFPVA+K+GNLD ERLNVS+REAVAVHW+QHSLY+VTGSD+NTLWLLQR
Sbjct: 663 DFAEKLGLPFEFFPVAEKVGNLDPERLNVSKREAVAVHWLQHSLYDVTGSDTNTLWLLQR 722

Query: 722 LAPKVVTVVEQDLSHTGSFLGRFVEAIHYYSALFDSLGVSYGEESEERHLVEQLLLSREI 781
           LAPKVVTVVEQDLSH GSFLGRFVEAIHYYSALFDSLG SYGEESE+RH VEQ LLSREI
Sbjct: 723 LAPKVVTVVEQDLSHAGSFLGRFVEAIHYYSALFDSLGASYGEESEQRHAVEQQLLSREI 781

Query: 782 RNVLAVGGPSRSGEVKFQNWREKLQQSGFKGISLAGNAATQATLLLGMFPSDGYTLVEDN 841
           RNVLAVGGPSRSG+VKF NWREKLQQSGF+ +SLAGNAATQATLLLGMFPSDGYTLVEDN
Sbjct: 783 RNVLAVGGPSRSGDVKFNNWREKLQQSGFRVVSLAGNAATQATLLLGMFPSDGYTLVEDN 781

Query: 842 GTLKLGWKDLCLLTASAWKP--------PFHHHA 844
           GTLKLGWKDLCLLTASAW+P        P HH+A
Sbjct: 843 GTLKLGWKDLCLLTASAWRPFHAAATTTPTHHYA 781
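
The percentages in the HSP summary lines above are the match counts divided by the alignment length (gaps included). A minimal sketch that recomputes them for the Prunus mume and Vitis vinifera hits is shown below; the function name `percent` is an illustrative choice and is not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP summary lines of the two hits above:
# (identical positions, positive-scoring positions, alignment length).
prunus = (percent(518, 891), percent(603, 891))
vitis = (percent(517, 874), percent(597, 874))

print("Prunus mume:", prunus)
print("Vitis vinifera:", vitis)
```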

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
SCR_PEA	1.3e-228	54.58	Protein SCARECROW OS=Pisum sativum GN=SCR PE=2 SV=1
SCR_IPONI	1.4e-222	58.21	Protein SCARECROW OS=Ipomoea nil GN=SCR PE=1 SV=1
SCR_ARATH	5.9e-192	76.85	Protein SCARECROW OS=Arabidopsis thaliana GN=SCR PE=1 SV=1
SCR_MAIZE	5.1e-188	62.08	Protein SCARECROW OS=Zea mays GN=SCR PE=2 SV=1
SCR2_ORYSI	2.0e-176	54.16	Protein SCARECROW 2 OS=Oryza sativa subsp. indica GN=SCR2 PE=3 SV=2
Match Name	E-value	Identity	Description
A0A0A0KWH9_CUCSA	0.0e+00	86.74	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G196810 PE=3 SV=1
Q5NDC9_CUCSA	0.0e+00	83.31	SCARECROW OS=Cucumis sativus GN=scr PE=2 SV=1
F6HMQ2_VITVI	4.5e-247	59.15	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0056g00050 PE=3 SV=...
A0A061ELM0_THECC	5.4e-245	58.46	GRAS family transcription factor isoform 2 OS=Theobroma cacao GN=TCM_017746 PE=3...
A0A061EF07_THECC	5.4e-245	58.46	GRAS family transcription factor isoform 1 OS=Theobroma cacao GN=TCM_017746 PE=3...
Match Name	E-value	Identity	Description
AT3G54220.1	3.3e-193	76.85	GRAS family transcription factor
AT5G41920.1	2.7e-118	56.31	GRAS family transcription factor
AT1G14920.1	3.9e-61	37.89	GRAS family transcription factor family protein
AT1G63100.1	8.7e-61	36.62	GRAS family transcription factor
AT1G66350.1	2.5e-60	37.80	RGA-like 1
Match Name	E-value	Identity	Description
gi|700198807|gb|KGN53965.1|	0.0e+00	86.74	hypothetical protein Csa_4G196810 [Cucumis sativus]
gi|659126706|ref|XP_008463324.1|	0.0e+00	83.62	PREDICTED: LOW QUALITY PROTEIN: protein SCARECROW-like [Cucumis melo]
gi|821595353|ref|NP_001295787.1|	0.0e+00	83.31	protein SCARECROW 1 [Cucumis sativus]
gi|645221238|ref|XP_008244193.1|	1.7e-247	58.14	PREDICTED: protein SCARECROW-like [Prunus mume]
gi|225439035|ref|XP_002264349.1|	6.4e-247	59.15	PREDICTED: protein SCARECROW [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR005202	TF_GRAS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0044763 single-organism cellular process
biological_process GO:0044767 single-organism developmental process
biological_process GO:0008356 asymmetric cell division
biological_process GO:0090610 bundle sheath cell fate specification
biological_process GO:0009630 gravitropism
biological_process GO:0048366 leaf development
biological_process GO:0051457 maintenance of protein location in nucleus
biological_process GO:0009956 radial pattern formation
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG19g01780.1	Cp4.1LG19g01780.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005202	Transcription factor GRAS	PFAM	PF03514	GRAS	coord: 477..836, score: 3.2E
IPR005202	Transcription factor GRAS	PROFILE	PS50985	GRAS	coord: 450..816, score: 61
None	No IPR available	PANTHER	PTHR31636	FAMILY NOT NAMED	coord: 35..130, score: 0.0; coord: 266..837, score:
None	No IPR available	PANTHER	PTHR31636:SF12	PROTEIN SCARECROW	coord: 35..130, score: 0.0; coord: 266..837, score: