Cp4.1LG15g01400 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG15g01400
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionNHL repeat-containing protein 2
LocationCp4.1LG15 : 1115804 .. 1124540 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAGTTAGCGAATAGCAAACAACCGCTCAAACACTGACGCCGTCGCAAACAACAGCTAAAACCCTGCGGGCCGCAACCTCTGAAACCCTGCCGCCGTGTAAGCCGCCGCTAGTAGCTTACCGTCGACCTGATGTCATTTCGTTCTTCTGCAACGAATATGGCTTTCAGGTTCCGGCGACTCAGAGAAATCTCAAAGTCGTTGCCTCAATTTTACTCAGGTACTCCTCCGCCGATCCGTGTTCCATCTTAATTTGAATTTGTTTCGAATAGCTTCGCTTGCTTGGTTGCATCTAGTCATCTTGAATTTTTTCTTTATCTGATTGCAATTTTTCTTTGTTGGAGTCAGGATGATGGTTTCTTTATCTGATTGCAATTTTTCTTTGTTGGAGTCAGGATATTACCATCAGCATCATCATAGGCATGCTGTTAGCTCATTGCCATTTTCCGTTGCTCCATCTTACGTTTCTGAAGGAGTTGAGAGAAGGATTTTAGAGAGTGGTCGCCACCTTCTGCGGTATATATGATTCATATGTTTACAGCTTTGCTAGATTTAAATTTTGTATCTCTGCTTGTACTCCATGAATTATTACAGCTTTACGAAGGAGAACCAGTTGATTAAAATGTTTTACGAAGGAGAACCAGTTGCTTAAAATGTTTCAAGTGGTTATGAGTGTTGTTAGACTATAACTATTTGGGAGAATTTACCCCGAGAGAGAGCCATTAAGATATGATGGCCTTTAACAATGAAAGAAACTTTCAAACTTGATGGCCTTGTCGTCGAACATATGTCACTTCTCCCCACCCAAATCAACCAAAAAAAAACCTATTTAGATATGAGAGCCTTAATCTTCAACTTTAAGTCTGTTATGCCTAAGCGGCCTTCAGACTTTGGAAGCTTAACCTTTTGGCAACGAATGAGATGAGGTGTTTCTTTATCCCGACCACCATTCTACATAAAGTTGAGGTAGTATTTTTTGGCTTTGTTGGGAGATAAAATAAAAGCAGTAATAGTAAGATGGGGTAATGGTGAGAGAAGATGCGAGAGAGCCTCTTAAAATAAGCCCAAGTACCATTGCGCCAGGGGCACTTTTTAGCTTCATGTTTTTAATAAAGGGCCAACCACTCCATTTTTCAAAACACCATCTCATTATGCTAAAATCACCTCCTAAAATCCATTGCTTCCTGTAAAATACATTCAACTATCTCAGCTTTAATGAAAAATTCCCGAGGGTTCTTTGGAATTTTCTTTGATCTTTGTTTTGTTTAGTACAAAAATGTGGTATATGTTGAATGACTTTTACTTGGGCCATGAAGATACCATATAGTAATAGTAGAAGCAGTATACTAAAGTTAGTTGTGTTCTTCAATAATGTTTTCACTGTTGATCTTGAGTTTGTTAGGTGGTACTGTTACTTGGTTCGTAGTTTCAATCTTGTAAATTTTGTTTGTCTATCTGACTTATATTCCACAGTCATTTTCAGGTTTTCCACAACAACGGAGCTGCAGTGCGAGTCTTCTCCCGCAAATGATGTTTTATCCTTCATTAAGTCAACCCTTGATGAATCTGAAGGTTGTACATTTTCCTTTTTCTTCGGACCTCCTTTTCTTTTATATCAATACCCCTACCTGCCATCATAGTAAAGGTTTAAAGTTATGTTTAGACAAGCCATGATATTTTGAAATATAATCATTTTTTTCTTGAAAGAAAATCATTTAATATATTTTTATATGGAGATTTTACATGTTGTTTTGTTACTTTTTGTTTTCTGAAATAGTAAGCTGCTGAACCTCAGAAGCACAATTTACACAATTATAAGATGATTTTCCAGAGTTTGTACTTTACGATTTGAAAAGTTCCTTTTAAAATTGCAATAGAATGCATTTTCTCGTTTATTTATTTTTGTGCAAATGTGCAGGTCCTAACCACTATTGGTTGAATATATGTGATGGAAATAAAGGAATATCTGAGAAGGATGGAATCTACTTAATTCTTGCTGATCAATTTCTAGAAATGACGAGTTCGGATTCTGTTGTTTTGGTCGAAAATGTAAAATTCCTTCAGCATAGGTAATGGAAGTTCATTTTAGGATCTCACTACTGCTGTAGTGCAGTAGACTGATCTGGAATTTTCTTTCTTTATAGGTTTCCTCAGCTTCATGTGATTGGGCTTCAGTGTTCCAATACTCTATCTGTCGCTGAAAAAAGTGGGATGATCCAATTTATAATGAGGGAATATGTTTCCTTTCCCATTTTGTTATCCAATAAGATTTTTGAGGTGAGTTTAAAGTGTTGCTTTCTCTTCCTTATGTATGTATTTAGATTGAGAGATAGAATAGGGAATCGATGATTTTTCTTGCCTCCTGGGTTATTGTTGGGACTTGTATAACATTATGTCAAACCTCCAATTACAAAAGTATATTCTAGGAAGGGTNAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTTTAGGGTATAGTGTTTGGCAGTTATTGTATTTAGGACTGTGTTTGACTGATATTGTGATTAAAAGTTTGATCTTTTCTTTAAATACGTTTGGTTTGATCTTCTGTAGAATGGGATATGGGATATGTGCAGTTTTCTTCTCAATCGAAGTAATGTAGGGAGAGAAGATTGGCTCTCTTGAACTGTTTAATTAGCGAAAATTGGCCCTCTTGAACTGATTAATTTGTGATAGGTTGTAATCTCTTTCTTTTTCGTCAATAATATTTAAAAGAGATGTTTATCACATTTCATTTGAGATGGGTATCAACTATAATGTGATAAAATGTGATAAACGTTCTAAAAGATGGTAAAACGGCAGAAGGGTGAAATCAGTTCATGTCATACTTGTTGTATCGGGGCCCTAGTCTTTTTATTATATGGTTTGCTCATCACCCATCATTCTAATCTCTGAACTGGCTTTGCTGGCTTTGCCCACTATGAGGACAGAATTGGATGCTTAAACTCAAAACTTAAGAGAAAATCTCCCTTTTCATCATGGCCACCCCTTGGAGGTTCTAGGCATTGATATGTTAACGTCTCATTTTAATCAACTGGAATGATTTTCATGAGTTTTTCATCATCAAAGAAAGTCAATGGCACTTAGATGAGTATATTTTTTTGTCTTTTCTCTTGAAATATTTATCTTTGAAATTCTATTATTTCAAATCTTTAGAAGACTATTTTCCCCCTGTTTTGAACATTTACTGCAGGAGTGTCCAGTTCTAATACGGTTGACTGATTCAGTTTCTGAATCTTTGTTGTGCAAATAGATGGGGAGGAGGCTCTGTTATATTATCTCCAAGGACTTCAGTAATCCTTTGCTCATCAGTGAGAGGGACACAGACCTTAATGTTCTTCGGAAAGGTATGCATCTTTTTTCTTGCTCTTGGCGCTTTTACTGCAACATTATTTTTAATTACTATTGAGTCTGGATTCTGGTCATTTTGATGCATTTTATCCTCGTAGCTTGCTGTTGTTATTTATTTATTTATTTATTTTTGGATAAAAGACAATTCAATGCTGAATATGTCTCTTTGTTTTCCATAACATCTTTGTTTGTGTTAGTAGTCATTTAATTGATGTCTACGTTATGCATGCATATATATTTCATAGTGTTCATTATTCTCACACCGTATGCACATTCAGCTATTGAGGAGTTGCTTGAACCAGAAAATGAGAAATCTGGTCTGCCGAATATTGGGAGAACTACTTATCTGAAACATGCGGAGATCATTAAAGAACCATATTCATGTTCGTTCATGCAGAATTTTATTCTCCACTTTCCAGGTACAGCTTTCAAAGAGATGTTTTCCTCATGTCCTATATTCCCTAGAGTAAACATCCCTTGAAGCAAAGCTTTTCTCTCTACAAAGGAGTTACTACCACTTGGAAGACTCTTTTTCATGGCAATCAACATTTTATAATGTCTTCGATCAATTTCCACTGTCACAGGCTGTATATCTGCTGATGAAAAGGGTGGCCGACTCTTCCTTTCTGATAGCAATCATAACCGGATTGTTATATTTAATGGCAACGGGAAGATCCTGGACATGGTATGTTTAATATTTAAATATTGATCCAGTCGGTGATGATTAGGTACCAGATATCCCTGTTTTTCTAGGTATATTAATAACTCGGTGAGTAAAAAAAAGGATAAGGTATTCTATTCATTATTTATAATTGCAGTTACTTTTCTTCCTCAGTTCTCTCTATTCCTTGGATGCCACAAAAATACCGTCATAAATAGTGATAATGATGGTTGTGTCATTAGAACTTGAAACATTATGATTATAATTTTAGGACCTTAATTGAACGGGACCTGTTCTATAGGTTGCACTACTGCATCCTTGAGTTCTATTTAGTTCTGCTTCTAACATAAAGGTGTAGTTTCAAGACTGTTGATTCTTTCAATCATTCTAGACTTCTACACTATTTATTTACCATACCTTATATCTTAAAACTGATTACCGTATTATTGCATTACATGGTTGACATGCTTACATTTTTATATGTCAGATTGGTTCTTATCCAGGTTTTGAGGACGGAGAATTTGAATTGGTCAAATTAGCTCGTCCAGCAGCTTCCTTTTATCATGCTACTCAGGATTGCCTGTATTTTGTGGACTCTGAGGTACGCACTAGTTCATCCCGAAAATTATTTGAACTTTCAAGTCCATGTGAGACCATTATCAGAATTAGCTTTAGTCCATTGTGTAAGCCGCAGATTCTATATTTCCGTCTGCAAGTAAAAGTTCGAAATGATGAGAACAAGATATCCGTGTGAAATAAGGTGACAGGCTGGAGACCTTTTTTCTTTTGTCAAAAAACATCGCCATAAAATTAAAATCATTGTTTTTTTTTAATGTTTCCTTTGAACAAGTGACCCCAAATTATGTTTCTGCTCATTTGTAGACATTTTAGTTGTTTGTTCTTGAAAATTCCACTACCTATCTGTCTGTGTTAACTTCAATGATATTAGTCAATTTATCTCTCAGCGGCTCTAATAGTCTCTTAAATTTTTGCTGTTATTTAATTTTGACTATCCAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCGTTGTTGAAACTCTTTATCCAGCAAACTACTCAAGTACGAAGAGTACTAAGTTATGGAGCTGGATCAAGGACCGACTTGGTTTGGGAAGCATTCCCGACAGAGAAGTGGAAGATTTCAATCCGCAGTCTCTGATGTTTCCTTGGCACATGATTAGATATATGGATGATAGATTATTAATTTTAAATCGCAGGTAGAATGGAAGTGTGCACAATACATTATTTTATTTACTCAACAAGTTGATCATTTGTTATATGTTTTTACAGCTTTTCCCCATGCTGTACATGATACCATATATTTCTTATCTGAAGTAGAGTGTTTACAGCACNTCTATTTAGTTCTGCTTCTAACATAAAGGTGTAGTTTCAAGACCGTTGGTTCGTTCAATCATTTTAGACTAATACACTATTTATTTACCATCCTTATATCTCACAACTGATTAGATTACCGCATTATTGCATTACATTGTTGACATGCTTCCATTTTTATATGTCAGATTGGTTCTTATCCAGGTTTTGAGGACGGAGAATTTGAATTGGTCAAATTAGCTCGTCCAGCAGATTCCTTTTATCATGCTACTCAGGATTGCCTGTATTTTGTGGACTCTGAGGTACGCACTAGTTCATCCCGACAATTATTTGAACTTTCAAGTCCATGTGAGACTATTGATTACACAATGGATGACGTGACAATCAATATGAGTGCTGGTTCTTATAATCTACGTTGGTTTTGCTTCATTTGTTTGCAGTCTTGGGACACTATGGACCATGGATTTGGCTTCAGGAAAAATTATTGAAGTTGTTAGAGGTACAGAATTTATAGATTCCTTTCTTAACCCTGATGTCGATGGGAAATTTTAATTAGAAGATTGGTGTTATGAATAGGGCATTCAAAGATTATGGAGAACTATGGGCAGTTGTTCATGGACAGAGTGTCTGTTCTGAAACAGATACCTGATGGTATGTTGCAGCTGCAAAGAGATGCAAAGACTGTCACAGGGGAGCTACCGTCCTTGGATCTTTTATCTTCTCTAACACCCTTTCAAAATTGCCTAATCATTTGCGACTCAGGTACCAATATTTCAATCGATGGATTGTTTTTGTGCTTAACATATCATCTTTTACTAGTTTTTTTAGCCCGTACTTCTCTCTTGAAGTTGGACAGGTAATTATGAAATATAATAGAGAATGACCACTCTCGAGTTCGTAGGTTAAAAGTTTAAAACCCCAGCTTGGATTACCGTATTGGTTTGCTCCACCTCCAGAGAAGGTTATTTCCACGTAAGTGTATTTACTTGATTTGGGTGTCTGCAGCTATGTTTATGTTTCTGACATTGATAGAATTTAGTGACTATGGGCCTAGTTACTGGATGGGGTTAATTTATATTCTACAAACTCATTTACCATCATCATCATCATCTTGGTGTTGTAGGTCTCCTTTTTTGCGTTTGTTCTGTTTTTAACCTCATTAACTTTTTGTAGATCCATTTTTGTTGATACTGCAGTGCCGATAGTTTCCAAGGAGCAGGGATTGATCATATTCACTTTTTCAGACTGCTGCCTGGTACTTTCTTATTGGAGATTGACGTGTTAGATTTACGATTTCCTCGATAGCAGTTTAGTTCAGTCGTAGCTCTAGTATGGTTTTCCACTTGTACCTGGTTTAGGAGATTTTATGTGTTACAATATCTGTCTTCATTGTAGACCTAGTCTTGTTAATTATACGATTTTTCACTGTGCATACAGGTAAGGTTGGTATACATATCAATGTTGATCTTCCTACAGATATTGAACTAGTGGAATCAATACAAGAAGACAGCATATGGCGTCAGACAAGAGGAACTGCAACTGAAATTTCAATCGTCGAGAAAGTTTCTGGGCTCTCAGAAAAGGTAACCTTGCTACGCATTTTTTGTTCAAGTGACATTTTCTTGTTTCTAGAATATCTGTTAGCGTTCGTGTGGTGTAGGTCGGTTCGGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCCTTTTCACCACAAGAATCAGAAGTGGTGGAAGAAGATAATATAAGAGCTGTTAACCATATTGGAGACGATAGGTTTCAAATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAGGTATGAATTCTATCTTTTCTTTATTTTAGGAAACAGAGTTACATCATTAATTAAGCAAACTGGCACTGGCATCTCTTCTCATGTATAGTCATCAGTTTTGTCCCTCTCTAATTAATTAGTTCACATTTGCATAGTTATTTTTGTTGAGGATTGTTGGGAGGGAGTCCACACATTGGCTAATTAAGAGGCTCCATTGGTATGAGGCTTTTTGGGGAAACCAAAAGCAATGCTACGAGAGCTTATGCTCGAAGTGGACAATATCACATCATTGTGGAGATTCGTGGTTCTTAACATCGTATCAAAGCCATGTCTAACAAAGGGTGTACTTTGTTCGAGGGTTCTAGAGAAGGAGTCGAGCCTTGATTAAGGGGGGTTGTTCGAGAGCCCCATAGGTCTCAGGGGAGGCACTATGGTATACTTTGTTTGAGGTGAGGATTGTTGGGAGAGGAGTCTCATATCGTCTAATTAAAGGGTCAATCATGAGTCTATAAGTAAGGAATACATCTCCGGGGGGTTGTTCGAGAGCCCCATAGGTCTCAGGGGAGGCACTATGGTATACTTTGTTTGAGGTGAGGATTGTTGGGAGAGGAGTCTCATATCGTCTAATTAAAGGGTCAATCATGAGTCTATAAGTAAGGAATACATCTCCATGGTATGAGGCCTTTTGGAGAAACCCAAAGCAACACCACGAGTGCTTATACTCAAAGTGGACAATATCATATTATCGTGGAGAATTGTGGTTCCTAACAATTTCAACCTAAAGAGGGTCCATCTTTTCACATGCACGCTTGCACACACCTCCACTTGAGTTGTGGTTTATTTTCAGTTGATAATTCCTCGCACCTCGAAACTGTTTTAGGAAAAATAAGACGGAAGCTGAAACGTAGCCACATAATTTTGAAACAGGTTATAGTATATGCAGCCGTATATTTAAGGCTTAGAAGAGACCAAGATTGGGAAGGCAATGGTGACAAACAAGCAGCAGCAGCAGCAGCCAGGATAGCAGATTTATTGTACCCAGGAAGTAGAGGGAAGAAGATAAAAGAGAGCTGCATTCAGTTCCTCCTTGTAAACTGTAAACGAGACCTGAGAGAGGTGATTTTTGTGAAACCTTTGCATGTGAGGATAAAGCTGGATACTATGGCGCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTTACAGACTCCTCTGTAGAACTCAATTTATCTCTTGCCTCCTAAGAATAGATCATTATTTTGGTACCTTTTGTGGCTTATTGTTTTCTTATTCTATGCTCATTCTATTGTTCTAATGGTGGAACAATATTGTTTATTATTATTAATTTCTTAAATTATTGGATATTAGGTCCTCCACATTATTCTCATGATCGATTCCTCCATCCCACCGTTCGTTTTTAATTGTTGGAACAATATTGTTAATTTTCGATCGTT

mRNA sequence

CAAGTTAGCGAATAGCAAACAACCGCTCAAACACTGACGCCGTCGCAAACAACAGCTAAAACCCTGCGGGCCGCAACCTCTGAAACCCTGCCGCCGTGTAAGCCGCCGCTAGTAGCTTACCGTCGACCTGATGTCATTTCGTTCTTCTGCAACGAATATGGCTTTCAGGTTCCGGCGACTCAGAGAAATCTCAAAGTCGTTGCCTCAATTTTACTCAGGATATTACCATCAGCATCATCATAGGCATGCTGTTAGCTCATTGCCATTTTCCGTTGCTCCATCTTACGTTTCTGAAGGAGTTGAGAGAAGGATTTTAGAGAGTGGTCGCCACCTTCTGCGGTTTTCCACAACAACGGAGCTGCAGTGCGAGTCTTCTCCCGCAAATGATGTTTTATCCTTCATTAAGTCAACCCTTGATGAATCTGAAGGTCCTAACCACTATTGGTTGAATATATGTGATGGAAATAAAGGAATATCTGAGAAGGATGGAATCTACTTAATTCTTGCTGATCAATTTCTAGAAATGACGAGTTCGGATTCTGTTGTTTTGGTCGAAAATGTAAAATTCCTTCAGCATAGGTTTCCTCAGCTTCATGTGATTGGGCTTCAGTGTTCCAATACTCTATCTGTCGCTGAAAAAAGTGGGATGATCCAATTTATAATGAGGGAATATGTTTCCTTTCCCATTTTGTTATCCAATAAGATTTTTGAGATGGGGAGGAGGCTCTGTTATATTATCTCCAAGGACTTCAGTAATCCTTTGCTCATCAGTGAGAGGGACACAGACCTTAATGTTCTTCGGAAAGCTATTGAGGAGTTGCTTGAACCAGAAAATGAGAAATCTGGTCTGCCGAATATTGGGAGAACTACTTATCTGAAACATGCGGAGATCATTAAAGAACCATATTCATGTTCGTTCATGCAGAATTTTATTCTCCACTTTCCAGGCTGTATATCTGCTGATGAAAAGGGTGGCCGACTCTTCCTTTCTGATAGCAATCATAACCGGATTGTTATATTTAATGGCAACGGGAAGATCCTGGACATGATTGGTTCTTATCCAGGTTTTGAGGACGGAGAATTTGAATTGGTCAAATTAGCTCGTCCAGCAGCTTCCTTTTATCATGCTACTCAGGATTGCCTGTATTTTGTGGACTCTGAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCGTTGTTGAAACTCTTTATCCAGCAAACTACTCAAGTACGAAGAGTACTAAGTTATGGAGCTGGATCAAGGACCGACTTGGTTTGGGAAGCATTCCCGACAGAGAAGTGGAAGATTTCAATCCGCAGTCTCTGATGTTTCCTTGGCACATGATTAGATATATGGATGATAGATTATTAATTTTAAATCGCAGTCTTGGGACACTATGGACCATGGATTTGGCTTCAGGAAAAATTATTGAAGTTGTTAGAGGGCATTCAAAGATTATGGAGAACTATGGGCAGTTGTTCATGGACAGAGTGTCTGTTCTGAAACAGATACCTGATGGTATGTTGCAGCTGCAAAGAGATGCAAAGACTGTCACAGGGGAGCTACCGTCCTTGGATCTTTTATCTTCTCTAACACCCTTTCAAAATTGCCTAATCATTTGCGACTCAGGTAAGGTTGGTATACATATCAATGTTGATCTTCCTACAGATATTGAACTAGTGGAATCAATACAAGAAGACAGCATATGGCGTCAGACAAGAGGAACTGCAACTGAAATTTCAATCGTCGAGAAAGTTTCTGGGCTCTCAGAAAAGGTCGGTTCGGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCCTTTTCACCACAAGAATCAGAAGTGGTGGAAGAAGATAATATAAGAGCTGTTAACCATATTGGAGACGATAGGTTTCAAATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAGGTTATAGTATATGCAGCCGTATATTTAAGGCTTAGAAGAGACCAAGATTGGGAAGGCAATGGTGACAAACAAGCAGCAGCAGCAGCAGCCAGGATAGCAGATTTATTGTACCCAGGAAGTAGAGGGAAGAAGATAAAAGAGAGCTGCATTCAGTTCCTCCTTGTAAACTGTAAACGAGACCTGAGAGAGGTGATTTTTGTGAAACCTTTGCATGTGAGGATAAAGCTGGATACTATGGCGCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTTACAGACTCCTCTGTAGAACTCAATTTATCTCTTGCCTCCTAAGAATAGATCATTATTTTGGTACCTTTTGTGGCTTATTGTTTTCTTATTCTATGCTCATTCTATTGTTCTAATGGTGGAACAATATTGTTTATTATTATTAATTTCTTAAATTATTGGATATTAGGTCCTCCACATTATTCTCATGATCGATTCCTCCATCCCACCGTTCGTTTTTAATTGTTGGAACAATATTGTTAATTTTCGATCGTT

Coding sequence (CDS)

ATGTCATTTCGTTCTTCTGCAACGAATATGGCTTTCAGGTTCCGGCGACTCAGAGAAATCTCAAAGTCGTTGCCTCAATTTTACTCAGGATATTACCATCAGCATCATCATAGGCATGCTGTTAGCTCATTGCCATTTTCCGTTGCTCCATCTTACGTTTCTGAAGGAGTTGAGAGAAGGATTTTAGAGAGTGGTCGCCACCTTCTGCGGTTTTCCACAACAACGGAGCTGCAGTGCGAGTCTTCTCCCGCAAATGATGTTTTATCCTTCATTAAGTCAACCCTTGATGAATCTGAAGGTCCTAACCACTATTGGTTGAATATATGTGATGGAAATAAAGGAATATCTGAGAAGGATGGAATCTACTTAATTCTTGCTGATCAATTTCTAGAAATGACGAGTTCGGATTCTGTTGTTTTGGTCGAAAATGTAAAATTCCTTCAGCATAGGTTTCCTCAGCTTCATGTGATTGGGCTTCAGTGTTCCAATACTCTATCTGTCGCTGAAAAAAGTGGGATGATCCAATTTATAATGAGGGAATATGTTTCCTTTCCCATTTTGTTATCCAATAAGATTTTTGAGATGGGGAGGAGGCTCTGTTATATTATCTCCAAGGACTTCAGTAATCCTTTGCTCATCAGTGAGAGGGACACAGACCTTAATGTTCTTCGGAAAGCTATTGAGGAGTTGCTTGAACCAGAAAATGAGAAATCTGGTCTGCCGAATATTGGGAGAACTACTTATCTGAAACATGCGGAGATCATTAAAGAACCATATTCATGTTCGTTCATGCAGAATTTTATTCTCCACTTTCCAGGCTGTATATCTGCTGATGAAAAGGGTGGCCGACTCTTCCTTTCTGATAGCAATCATAACCGGATTGTTATATTTAATGGCAACGGGAAGATCCTGGACATGATTGGTTCTTATCCAGGTTTTGAGGACGGAGAATTTGAATTGGTCAAATTAGCTCGTCCAGCAGCTTCCTTTTATCATGCTACTCAGGATTGCCTGTATTTTGTGGACTCTGAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCGTTGTTGAAACTCTTTATCCAGCAAACTACTCAAGTACGAAGAGTACTAAGTTATGGAGCTGGATCAAGGACCGACTTGGTTTGGGAAGCATTCCCGACAGAGAAGTGGAAGATTTCAATCCGCAGTCTCTGATGTTTCCTTGGCACATGATTAGATATATGGATGATAGATTATTAATTTTAAATCGCAGTCTTGGGACACTATGGACCATGGATTTGGCTTCAGGAAAAATTATTGAAGTTGTTAGAGGGCATTCAAAGATTATGGAGAACTATGGGCAGTTGTTCATGGACAGAGTGTCTGTTCTGAAACAGATACCTGATGGTATGTTGCAGCTGCAAAGAGATGCAAAGACTGTCACAGGGGAGCTACCGTCCTTGGATCTTTTATCTTCTCTAACACCCTTTCAAAATTGCCTAATCATTTGCGACTCAGGTAAGGTTGGTATACATATCAATGTTGATCTTCCTACAGATATTGAACTAGTGGAATCAATACAAGAAGACAGCATATGGCGTCAGACAAGAGGAACTGCAACTGAAATTTCAATCGTCGAGAAAGTTTCTGGGCTCTCAGAAAAGGTCGGTTCGGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCCTTTTCACCACAAGAATCAGAAGTGGTGGAAGAAGATAATATAAGAGCTGTTAACCATATTGGAGACGATAGGTTTCAAATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAGGTTATAGTATATGCAGCCGTATATTTAAGGCTTAGAAGAGACCAAGATTGGGAAGGCAATGGTGACAAACAAGCAGCAGCAGCAGCAGCCAGGATAGCAGATTTATTGTACCCAGGAAGTAGAGGGAAGAAGATAAAAGAGAGCTGCATTCAGTTCCTCCTTGTAAACTGTAAACGAGACCTGAGAGAGGTGATTTTTGTGAAACCTTTGCATGTGAGGATAAAGCTGGATACTATGGCGCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTTACAGACTCCTCTGTAGAACTCAATTTATCTCTTGCCTCCTAA

Protein sequence

MSFRSSATNMAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEGVERRILESGRHLLRFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLILADQFLEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFPILLSNKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGRTTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVETLYPANYSSTKSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLLILNRSLGTLWTMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKTVTGELPSLDLLSSLTPFQNCLIICDSGKVGIHINVDLPTDIELVESIQEDSIWRQTRGTATEISIVEKVSGLSEKVGSAQQWYDELDSLAFSPQESEVVEEDNIRAVNHIGDDRFQIECAVNTSPGTSEVIVYAAVYLRLRRDQDWEGNGDKQAAAAAARIADLLYPGSRGKKIKESCIQFLLVNCKRDLREVIFVKPLHVRIKLDTMAHPKADNSKGIILTDSSVELNLSLAS
BLAST of Cp4.1LG15g01400 vs. Swiss-Prot
Match: NHLC2_CHICK (NHL repeat-containing protein 2 OS=Gallus gallus GN=NHLRC2 PE=2 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 5.5e-10
Identity = 47/174 (27.01%), Postives = 73/174 (41.95%), Query Frame = 1

Query: 269 LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPA 328
           L FPG ++ D+ G RL ++D+ H+RI++   NG+IL  IG    G +DG F       P 
Sbjct: 218 LLFPGKVTVDKSGERLVIADTGHHRILVTLKNGQILHTIGGPNSGRKDGRFSEAAFNSPQ 277

Query: 329 ASFYHATQDCLYFVDSENHAIRKADLGKRVVETLYPANYSSTKSTKLWSWIKDRLGLGSI 388
                   + +Y  D+ENH IRK DL   +V T+                    +G+  +
Sbjct: 278 G--VAIKNNVIYVADTENHLIRKIDLELEIVTTV------------------AGIGIQGV 337

Query: 389 PDREVEDFNPQSLMFPWHMI-------RYMDDRLLILNRSLGTLWTMDLASGKI 435
                     Q +  PW ++          DD L I    +  +W + L  GK+
Sbjct: 338 DKEGGAKGEEQPISSPWDVVFGNSVSGTQEDDVLWIAMAGIHQVWALMLEGGKL 371

BLAST of Cp4.1LG15g01400 vs. Swiss-Prot
Match: NHLC2_MOUSE (NHL repeat-containing protein 2 OS=Mus musculus GN=Nhlrc2 PE=1 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 8.0e-09
Identity = 35/94 (37.23%), Postives = 49/94 (52.13%), Query Frame = 1

Query: 269 LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPA 328
           L FPG ++ D   GRL ++D+ H+RI++   NG+I   IG   PG +DG F       P 
Sbjct: 223 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGRIQSSIGGPNPGRKDGMFSESSFNSPQ 282

Query: 329 ASFYHATQDCLYFVDSENHAIRKADLGKRVVETL 362
                   + +Y  D+ENH IRK DL    V T+
Sbjct: 283 G--VAIADNVIYVADTENHLIRKIDLEAEKVTTV 314

BLAST of Cp4.1LG15g01400 vs. Swiss-Prot
Match: NHLC2_BOVIN (NHL repeat-containing protein 2 OS=Bos taurus GN=NHLRC2 PE=2 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 1.1e-07
Identity = 35/94 (37.23%), Postives = 49/94 (52.13%), Query Frame = 1

Query: 269 LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPA 328
           L FPG I+ D    RL ++D+ H+RI++   NG+I   IG   PG +DG F       P 
Sbjct: 223 LLFPGKITVDHVSNRLVIADTGHHRILVVWKNGQIQYSIGGPNPGRKDGIFSESSFNSPQ 282

Query: 329 ASFYHATQDCLYFVDSENHAIRKADLGKRVVETL 362
                   + +Y  D+ENH IRK DL   +V T+
Sbjct: 283 G--VAIMNNIIYVADTENHLIRKIDLEAEMVSTV 314

BLAST of Cp4.1LG15g01400 vs. TrEMBL
Match: A0A0A0LBM4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G627650 PE=4 SV=1)

HSP 1 Score: 538.9 bits (1387), Expect = 9.4e-150
Identity = 267/335 (79.70%), Postives = 295/335 (88.06%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEGVERRILESGRHLL 69
           MAFRFRRL+EIS+S+PQ YS +YHQHH R+ VSSL  SVAP  VSE + RR+  +GR+  
Sbjct: 1   MAFRFRRLKEISRSIPQIYSEFYHQHHRRYGVSSLALSVAPFRVSERIGRRLFYNGRYFT 60

Query: 70  RFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLILADQF 129
           RFSTTTELQCESSP +D+ SFIKSTLDESEGPNHYWLN  + NK I E+DG YLILA+QF
Sbjct: 61  RFSTTTELQCESSPTSDIFSFIKSTLDESEGPNHYWLNTSNENKVIFEEDGKYLILANQF 120

Query: 130 LEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFPILLS 189
           LEMTSSDSVVLVENVKFLQ RFP LHVIG QCS+TLSVAEKS MIQFIMREY+SFPILLS
Sbjct: 121 LEMTSSDSVVLVENVKFLQQRFPHLHVIGFQCSSTLSVAEKSDMIQFIMREYISFPILLS 180

Query: 190 NKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGRTTYL 249
           NKIFE+    CYIISKD SNPLL+SER  DL++LRKAIEEL EPENEKSGL N+G+TTYL
Sbjct: 181 NKIFEVAG--CYIISKDLSNPLLVSERGMDLSILRKAIEELHEPENEKSGLSNMGKTTYL 240

Query: 250 KHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS 309
           K AE+IKEP SCSFM NF+LH+PGCISADE+GGRLFLSDSNHNRIVIFN  GKILDMIGS
Sbjct: 241 KQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVIFNSYGKILDMIGS 300

Query: 310 YPGFEDGEFELVKLARPAASFYHATQDCLYFVDSE 345
           YPGFEDGEFELVKLARPAASFYH+TQ+CLYFVDSE
Sbjct: 301 YPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSE 333

BLAST of Cp4.1LG15g01400 vs. TrEMBL
Match: Q9M8Z4_ARATH (F17A9.23 protein OS=Arabidopsis thaliana GN=F17A9.23 PE=4 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 6.3e-146
Identity = 289/657 (43.99%), Postives = 416/657 (63.32%), Query Frame = 1

Query: 64  SGRHLLRFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYL 123
           S R + R S+++     SSP  D+LSFIK++LD+ EGP+H+WLN   GNK + +  G Y+
Sbjct: 19  SVRRVSRASSSSS-SSPSSPHVDLLSFIKASLDKLEGPSHHWLNRDFGNKQLFKDKGTYV 78

Query: 124 ILADQFLEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAE-KSGMIQFIMREYV 183
           +LA   L+ TS D     E +K LQ R P +  +G+  S+   +A+ ++ + + I++EY+
Sbjct: 79  VLAGHLLDGTS-DLSGFFEKLKLLQQRSPGVCFMGIHFSDQARIADDRTALAELILKEYL 138

Query: 184 SFPILLSNKIF-EMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLP 243
           +FP+LLS K F +    + YI+ KDF NPL+  E+D D+  + KA++ LL  + EKS   
Sbjct: 139 TFPVLLSEKEFPKTSGEVRYIVFKDFKNPLIYEEKDLDIASVVKALDSLLTQDTEKSKSV 198

Query: 244 NIGRTTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNG 303
            +   T+ K AE IKE +  SF Q+ +L+FPGCISADE G RLFLSD+NH+RI+IF  +G
Sbjct: 199 RLFTNTWSKQAEAIKESHFPSFFQDLLLYFPGCISADEVGDRLFLSDTNHHRIIIFENSG 258

Query: 304 KILDMIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVETL 363
           KI+D IG +PGFEDG+FE  K+ RP  + Y   +DCLY VDSE            V++  
Sbjct: 259 KIVDSIGCFPGFEDGDFESAKMLRPTGTLYDEAEDCLYIVDSE------------VIK-- 318

Query: 364 YPANYSSTKSTKLWSWIKDRLGLGSIPDREV------EDFNPQSLMFPWHMIRYMDDRLL 423
                   K+  LWSWI +++GLG   D  V      E+F+ +SL+FPWH+++  D+ LL
Sbjct: 319 --------KTGGLWSWIMEKMGLGKDDDTTVDADTKSEEFDARSLLFPWHILKRDDESLL 378

Query: 424 ILNRSLGTLWTMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKT 483
           ++N+S   LW ++ ASG+I EVV G SKI+E  GQ   +++SVL+ +P   LQ Q  A  
Sbjct: 379 VINKSFSKLWIINFASGEIEEVVEGFSKIIEICGQSITEKLSVLEHMPSNWLQQQTAAIA 438

Query: 484 VTGELPSLDLLSSLTPFQNCLIICDSGKVGIHINVDLPTDIELVESIQEDSIWRQTRGTA 543
              E PS  LLSS T   + +++ D GK+ I +N+++P   ELVE IQE  IWRQTRG  
Sbjct: 439 SFKEQPSASLLSSFTKLGDDIVMTDIGKISIRLNIEIPPCTELVEPIQESCIWRQTRGAI 498

Query: 544 TEISIVEKVSGLSEKVGSAQQWYDELDSLA---FSPQESEVVEEDNIR--AVNHIGDDRF 603
           +E S        SEK+G +QQWYDELDSLA    +P+ +E  EE+++    V+   D R 
Sbjct: 499 SEFSSAGSAVEPSEKIGVSQQWYDELDSLAKEIANPEAAEEEEEEDVNPSEVDREEDGRI 558

Query: 604 QIECAVNTSPGTSEVIVYAAVYLRLRRDQDWEGNGDKQAAAAAARIADLLYPGSRGKKIK 663
            I+C V TSPG+SE+IVYAA+YLRL R+++ E    ++    A +IA +L P      +K
Sbjct: 559 HIDCPVKTSPGSSELIVYAALYLRLARNEETESATQEE---LARKIAKILKPVRNITTMK 618

Query: 664 ESCIQFLLVNCKRDLREVIFVKPLHVRIKLDTMAHPKADNSKGIILTDSSVELNLSL 708
           E     LL   KR+LR+++F+KP+HVRI+LD+  HPKADNS+ +ILTDSSVE+++SL
Sbjct: 619 EDLFVNLLSKSKRELRDIVFIKPMHVRIRLDSKDHPKADNSRDVILTDSSVEVDVSL 648

BLAST of Cp4.1LG15g01400 vs. TrEMBL
Match: B9GNJ6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s24280g PE=4 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 6.3e-146
Identity = 271/490 (55.31%), Postives = 354/490 (72.24%), Query Frame = 1

Query: 224 RKAIEELLEPENEK--SGL--PNIGRTTYLKHAEIIKEPYSCSFMQNFILHFPGCISADE 283
           R  IEEL   EN    +G+  P + +TT+ K AE+IKEPY CS +QN +L+FPGC+SADE
Sbjct: 43  RNVIEELNVQENMNFDNGISRPKL-KTTWAKQAEVIKEPYMCSPLQNLLLYFPGCVSADE 102

Query: 284 KGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQDCLY 343
            G RLFLSDSNH+RI++ +GNGKILD IGS PGFEDGEFE  KLARPAASFY   +DCLY
Sbjct: 103 SGNRLFLSDSNHHRIIVSDGNGKILDSIGSGPGFEDGEFESAKLARPAASFYDDEEDCLY 162

Query: 344 FVDSENHAIRKADLGKRVVETLYPANYSSTKSTKLWSWIKDRLGLGSIPDREVEDFNPQS 403
            VDSENHAIR+ADL  RV+ET+YP ++S  K+  +W+WI D+LG     D + E+F+ Q 
Sbjct: 163 IVDSENHAIRRADLESRVLETVYPKSFSK-KNNSIWTWIMDKLGSRINVDAKSEEFDSQP 222

Query: 404 LMFPWHMIRYMDDRLLILNRSLGTLWTMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVL 463
           L+FPWH+++ +D+  LI++RS  TLW +DL SG++ E ++G   I+E  GQL   +VS+L
Sbjct: 223 LVFPWHLLKSVDNTFLIISRSFETLWVIDLVSGEMKECIKGFPNILETCGQLITGKVSLL 282

Query: 464 KQIPDGMLQLQRDAKTVTGELPSLDLLSSLTPFQNCLIICDSGKVGIHINVDLPTDIELV 523
           KQ+P   L+ Q D      E P   L+S+LT F+N +++CD+G+V I +N+D+P D ELV
Sbjct: 283 KQLPIDYLKQQTDVNCSLKEFPYATLVSNLTTFENDIVLCDTGRVDIRLNIDIPMDTELV 342

Query: 524 ESIQEDSIWRQTRGTATEISIVEKVSGLSEKVGSAQQWYDELDSLAFS-PQESEVVEEDN 583
           E +QE  IWRQ RG+AT I   E V G SEK G +QQWYDELD+LAFS P      EED+
Sbjct: 343 EPLQEGCIWRQARGSATVILGAEDVVGSSEKAGVSQQWYDELDNLAFSTPGLEMATEEDS 402

Query: 584 IRAVNHIGDDRFQIECAVNTSPGTSEVIVYAAVYLRLRRDQDWEGNGDKQAAAAAARIAD 643
             +  +  D+R  I+CAVNTSPGTSE+I++AA+YL+LRR  D E  G ++    AARIAD
Sbjct: 403 ATSDVNYQDERLHIDCAVNTSPGTSELIIHAALYLKLRRHLDLEEGGQQK---HAARIAD 462

Query: 644 LLYPGSRGKKIKESCIQFLL-VNCKRDLREVIFVKPLHVRIKLDTMAHPKADNSKGIILT 703
           +L PG  G   K+SCIQ LL  NC  +LR++IFVKPLH+RI LDT+ HPKADNSK IILT
Sbjct: 463 ILNPGRGGGLEKDSCIQLLLKSNC--NLRDLIFVKPLHLRINLDTLDHPKADNSKDIILT 522

Query: 704 DSSVELNLSL 708
           DS++E+N+SL
Sbjct: 523 DSAIEVNVSL 525

BLAST of Cp4.1LG15g01400 vs. TrEMBL
Match: B9S538_RICCO (Catalytic, putative OS=Ricinus communis GN=RCOM_1720980 PE=4 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 1.4e-129
Identity = 247/506 (48.81%), Postives = 351/506 (69.37%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSL--PFSV--APSYVSEGVERR-ILES 69
           M+ R++R R+ISK LP+   G  +Q H     +S+   FS    P+   +G+    IL  
Sbjct: 1   MSRRYQRFRQISKLLPRILPG--NQLHKCRKATSIHSAFSTISVPNSDYDGMRNTLILHR 60

Query: 70  GRHLLRFSTTTELQCESS--PANDVLSFIKSTLDESEGPNHYWLNICDG-NKGISEKDGI 129
           G +  R+ST +E+  ES+  P +DVLSFIKST +E +GPNH WLN  D  +K    KDGI
Sbjct: 61  GFYSQRYSTISEVSHESNSPPVDDVLSFIKSTFNELQGPNHCWLNKVDHRDKDCLNKDGI 120

Query: 130 YLILADQFLEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAE-KSGMIQFIMRE 189
           +L++A Q +   +S  V ++E +K +Q RFPQL ++G  C +++  A+ ++ +++ IM+E
Sbjct: 121 FLLIAGQLIN--NSQIVFMIEKIKSIQQRFPQLCIVGFHCGSSIGSADDRTRLVELIMKE 180

Query: 190 YVSFPILLSNKIF-EMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPEN--EK 249
           +++FP+LLS+K F +M    CYI+ KDF N ++  +RD D+ +L KAIEEL   +N    
Sbjct: 181 FLTFPVLLSSKNFLQMENGACYILFKDFKNSVIYHDRDLDIEILNKAIEELHMQQNGYTN 240

Query: 250 SGLPNIG--RTTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIV 309
           SG+ N+   +++++K AE+ KEP S SF+QN +L+FPGC+SADE G RLFLSDSNH+RI+
Sbjct: 241 SGISNLRDLKSSWVKQAEVTKEPCSSSFLQNLVLYFPGCVSADESGDRLFLSDSNHHRII 300

Query: 310 IFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGK 369
           IF+GNGKI+D IGS PGFEDGEFE  KL RPAASFYH ++DCLY VD+EN AIR+AD+ +
Sbjct: 301 IFDGNGKIMDSIGSCPGFEDGEFESAKLLRPAASFYHNSEDCLYIVDAENQAIRRADMER 360

Query: 370 RVVETLYPANYSSTKSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLL 429
           RV+ETLYP    S  ++ +W+WI +++G G   D + ++F+ Q LMFPWH+ + +DD LL
Sbjct: 361 RVLETLYPTCSISKNNSSVWTWIVNKMGFGRNSDMKSKEFDSQLLMFPWHLFKSVDDSLL 420

Query: 430 ILNRSLGTLWTMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKT 489
           I+NRS  +LW MDLASGKI E++RG  KI+E  GQL  ++VS+LKQ+P+  LQ Q DA  
Sbjct: 421 IINRSFESLWIMDLASGKIKEIIRGFPKILETCGQLITEKVSLLKQMPNDWLQQQIDASC 480

Query: 490 VTGELPSLDLLSSLTPFQNCLIICDS 502
               LP   LLSS+T FQN LI+CD+
Sbjct: 481 SPEGLPFASLLSSVTTFQNHLIMCDT 502

BLAST of Cp4.1LG15g01400 vs. TrEMBL
Match: A0A067FEX4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g004302mg PE=4 SV=1)

HSP 1 Score: 465.7 bits (1197), Expect = 1.0e-127
Identity = 242/501 (48.30%), Postives = 331/501 (66.07%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEGVERRILESGRHLL 69
           MA R   LR IS+ LP  YSG + Q   R  ++SL  S+          R   +  R+ +
Sbjct: 1   MALRNGCLRRISRFLPHIYSGSHFQQSRRAIINSLASSLIT------FPRECEQISRNGV 60

Query: 70  RFSTTTELQC---ESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLILA 129
            FS +T  Q    ES   +D LSFI+ST +E +GP+H W NI + N    ++ G +L+LA
Sbjct: 61  NFSFSTIAQASPAESLSQSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLA 120

Query: 130 DQFLEMTSS-----DSVVLVENVKFLQHRFPQLHVIG-LQCSNTLSVAEKSGMIQFIMRE 189
            +F++   S      +VV  E VK +Q  FPQL VIG L   +T+S  +++ +++ +M+E
Sbjct: 121 GRFVDNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKE 180

Query: 190 YVSFPILLSNKIF-EMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSG 249
           Y++FPILLSNK F +M    CY++SKDF N  +  E   D+ +L KA+EEL+  + E S 
Sbjct: 181 YITFPILLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSS 240

Query: 250 LPNIGRTTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNG 309
            P+  + T+ K AE++KEP++CS ++N +LHFPGCISADE G RLFLSDSNH+RI++F+G
Sbjct: 241 SPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDG 300

Query: 310 NGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVE 369
           NGKILD IGS PGFEDGEFE  KL RPAASFYH   DCLY VDSENHAIR+AD+G+RV+E
Sbjct: 301 NGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLE 360

Query: 370 TLYPANYSSTKSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLLILNR 429
           T+YP +  S K+  LW+WI ++LG     D + E  +PQSL+FPWH+++  DD LLI+NR
Sbjct: 361 TVYPTSGISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINR 420

Query: 430 SLGTLWTMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKTVTGE 489
           S  TLW MDLASG+I E V+G SK++E  G L M++V +LKQ+P   L  Q D+     E
Sbjct: 421 SFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKE 480

Query: 490 LPSLDLLSSLTPFQNCLIICD 501
           LP   L+SS   FQN +++CD
Sbjct: 481 LPYAGLISSSIAFQNHILLCD 495

BLAST of Cp4.1LG15g01400 vs. TAIR10
Match: AT3G07060.1 (AT3G07060.1 NHL domain-containing protein)

HSP 1 Score: 374.4 bits (960), Expect = 1.6e-103
Identity = 210/508 (41.34%), Postives = 310/508 (61.02%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEG-------VERRIL 69
           M+ R   L++IS    +  S   H    R ++++    +APS   +        VE+R  
Sbjct: 1   MSNRSLHLKKISWLSSRILSDNVHGRFRR-SITTPATCLAPSLDGDMNIGSKTLVEQRFS 60

Query: 70  ESGRHLLRFS--TTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDG 129
                + R S  +++     SSP  D+LSFIK++LD+ EGP+H+WLN   GNK + +  G
Sbjct: 61  RGFASVRRVSRASSSSSSSPSSPHVDLLSFIKASLDKLEGPSHHWLNRDFGNKQLFKDKG 120

Query: 130 IYLILADQFLEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAE-KSGMIQFIMR 189
            Y++LA   L+ TS D     E +K LQ R P +  +G+  S+   +A+ ++ + + I++
Sbjct: 121 TYVVLAGHLLDGTS-DLSGFFEKLKLLQQRSPGVCFMGIHFSDQARIADDRTALAELILK 180

Query: 190 EYVSFPILLSNKIF-EMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKS 249
           EY++FP+LLS K F +    + YI+ KDF NPL+  E+D D+  + KA++ LL  + EKS
Sbjct: 181 EYLTFPVLLSEKEFPKTSGEVRYIVFKDFKNPLIYEEKDLDIASVVKALDSLLTQDTEKS 240

Query: 250 GLPNIGRTTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFN 309
               +   T+ K AE IKE +  SF Q+ +L+FPGCISADE G RLFLSD+NH+RI+IF 
Sbjct: 241 KSVRLFTNTWSKQAEAIKESHFPSFFQDLLLYFPGCISADEVGDRLFLSDTNHHRIIIFE 300

Query: 310 GNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVV 369
            +GKI+D IG +PGFEDG+FE  K+ RP  + Y   +DCLY VDSENHAIR+A++  RV+
Sbjct: 301 NSGKIVDSIGCFPGFEDGDFESAKMLRPTGTLYDEAEDCLYIVDSENHAIRRANINSRVL 360

Query: 370 ETLYPANYSSTKSTKLWSWIKDRLGLGSIPDREV------EDFNPQSLMFPWHMIRYMDD 429
           ET+YP     T    LWSWI +++GLG   D  V      E+F+ +SL+FPWH+++  D+
Sbjct: 361 ETVYPKVIKKTGG--LWSWIMEKMGLGKDDDTTVDADTKSEEFDARSLLFPWHILKRDDE 420

Query: 430 RLLILNRSLGTLWTMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRD 489
            LL++N+S   LW ++ ASG+I EVV G SKI+E  GQ   +++SVL+ +P   LQ Q  
Sbjct: 421 SLLVINKSFSKLWIINFASGEIEEVVEGFSKIIEICGQSITEKLSVLEHMPSNWLQQQTA 480

Query: 490 AKTVTGELPSLDLLSSLTPFQNCLIICD 501
           A     E PS  LLSS T   + +++ D
Sbjct: 481 AIASFKEQPSASLLSSFTKLGDDIVMTD 504

BLAST of Cp4.1LG15g01400 vs. TAIR10
Match: AT1G56500.1 (AT1G56500.1 haloacid dehalogenase-like hydrolase family protein)

HSP 1 Score: 84.3 bits (207), Expect = 3.2e-16
Identity = 42/95 (44.21%), Postives = 57/95 (60.00%), Query Frame = 1

Query: 269 LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSY--PGFEDGEFELVKLARP 328
           L FPG ++ D    RLF+SDSNHNRI++ +  G  +  IGS    GF+DG FE     RP
Sbjct: 566 LKFPGKLAIDTLNNRLFISDSNHNRIIVTDLEGNFIVQIGSSGEEGFQDGSFEDAAFNRP 625

Query: 329 AASFYHATQDCLYFVDSENHAIRKADLGKRVVETL 362
               Y+A ++ LY  D+ENHA+R+ D     V+TL
Sbjct: 626 QGLAYNAKKNLLYVADTENHALREIDFVNERVQTL 660

BLAST of Cp4.1LG15g01400 vs. NCBI nr
Match: gi|659130990|ref|XP_008465459.1| (PREDICTED: uncharacterized protein LOC103503064 isoform X1 [Cucumis melo])

HSP 1 Score: 819.7 bits (2116), Expect = 4.0e-234
Identity = 402/501 (80.24%), Postives = 441/501 (88.02%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEGVERRILESGRHLL 69
           MAFRFRRL+EIS+SLPQ YSGYYHQHHHR+ VSSL  SVAP +VSEG++RR+ ++GRH  
Sbjct: 1   MAFRFRRLKEISRSLPQIYSGYYHQHHHRYGVSSLVLSVAPFHVSEGIDRRLFDNGRHFT 60

Query: 70  RFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLILADQF 129
           RFSTTTELQCESSP ND+ SFI STLDESEGPNHYWLN  +GNKGI E+DG+YLILA+QF
Sbjct: 61  RFSTTTELQCESSPINDIFSFINSTLDESEGPNHYWLNTSNGNKGIFEEDGMYLILANQF 120

Query: 130 LEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFPILLS 189
           LEMTSSDS+ LVENVKFLQ RFP LHVIG QC +TLSVAEKS MIQFIMREY+SFPILLS
Sbjct: 121 LEMTSSDSIDLVENVKFLQQRFPHLHVIGFQCYSTLSVAEKSAMIQFIMREYISFPILLS 180

Query: 190 NKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGRTTYL 249
           NKIFE+    C IISKD SNPLL+ ERD DL++L KAIEEL EPENEKSGL N G+TTYL
Sbjct: 181 NKIFEVAG--CCIISKDLSNPLLVCERDMDLSILCKAIEELHEPENEKSGLSNKGKTTYL 240

Query: 250 KHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS 309
           K AE+IKEP SCSFM NF+LH+PGCISADE+GGRLFLSDSNHNRIVI N  GKILDMIGS
Sbjct: 241 KQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVILNSYGKILDMIGS 300

Query: 310 YPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVETLYPANYSST 369
           YPGFEDGEFELVKLARPAASFYH+TQ+CLYFVDSENHAIRKADLGKRVVETLYP NYS+ 
Sbjct: 301 YPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSENHAIRKADLGKRVVETLYPENYSNK 360

Query: 370 KSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLLILNRSLGTLWTMDL 429
           KST+LWSWI D+ GLGSIPDREVEDFNPQSLMFPWH+IRYMDDRLLILNRSLGTLWTMDL
Sbjct: 361 KSTQLWSWIMDKFGLGSIPDREVEDFNPQSLMFPWHLIRYMDDRLLILNRSLGTLWTMDL 420

Query: 430 ASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKTVTGELPSLDLLSSL 489
            SGKIIEVVRG S+IMENYG L MDR+SVLKQIPDG LQ   DA   TG  P +DLLSSL
Sbjct: 421 VSGKIIEVVRGLSRIMENYGHLIMDRLSVLKQIPDGTLQQPSDANIATGGSPYMDLLSSL 480

Query: 490 TPFQNCLIICDS-GKVGIHIN 510
           TPF+NC+IICDS G+V +  N
Sbjct: 481 TPFKNCIIICDSVGQVVLKYN 499

BLAST of Cp4.1LG15g01400 vs. NCBI nr
Match: gi|778682118|ref|XP_011651649.1| (PREDICTED: uncharacterized protein LOC101209700 isoform X1 [Cucumis sativus])

HSP 1 Score: 803.1 bits (2073), Expect = 3.8e-229
Identity = 397/501 (79.24%), Postives = 438/501 (87.43%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEGVERRILESGRHLL 69
           MAFRFRRL+EIS+S+PQ YS +YHQHH R+ VSSL  SVAP  VSE + RR+  +GR+  
Sbjct: 1   MAFRFRRLKEISRSIPQIYSEFYHQHHRRYGVSSLALSVAPFRVSERIGRRLFYNGRYFT 60

Query: 70  RFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLILADQF 129
           RFSTTTELQCESSP +D+ SFIKSTLDESEGPNHYWLN  + NK I E+DG YLILA+QF
Sbjct: 61  RFSTTTELQCESSPTSDIFSFIKSTLDESEGPNHYWLNTSNENKVIFEEDGKYLILANQF 120

Query: 130 LEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFPILLS 189
           LEMTSSDSVVLVENVKFLQ RFP LHVIG QCS+TLSVAEKS MIQFIMREY+SFPILLS
Sbjct: 121 LEMTSSDSVVLVENVKFLQQRFPHLHVIGFQCSSTLSVAEKSDMIQFIMREYISFPILLS 180

Query: 190 NKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGRTTYL 249
           NKIFE+    CYIISKD SNPLL+SER  DL++LRKAIEEL EPENEKSGL N+G+TTYL
Sbjct: 181 NKIFEVAG--CYIISKDLSNPLLVSERGMDLSILRKAIEELHEPENEKSGLSNMGKTTYL 240

Query: 250 KHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS 309
           K AE+IKEP SCSFM NF+LH+PGCISADE+GGRLFLSDSNHNRIVIFN  GKILDMIGS
Sbjct: 241 KQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVIFNSYGKILDMIGS 300

Query: 310 YPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVETLYPANYSST 369
           YPGFEDGEFELVKLARPAASFYH+TQ+CLYFVDSENHAIRKADLGKRVVETLYP NYS+ 
Sbjct: 301 YPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSENHAIRKADLGKRVVETLYPENYSNK 360

Query: 370 KSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLLILNRSLGTLWTMDL 429
           KST+ WSWI D+ GLGSIPDREV+DFNPQS+MFPWHMIRYMDDRLLILNRSLGTLWTMDL
Sbjct: 361 KSTQFWSWIMDKFGLGSIPDREVKDFNPQSIMFPWHMIRYMDDRLLILNRSLGTLWTMDL 420

Query: 430 ASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKTVTGELPSLDLLSSL 489
            SGKIIEVVRG S+IME+YGQL MDR+SV+KQIPDGMLQ   DA    G  P LDLLSSL
Sbjct: 421 VSGKIIEVVRGLSRIMESYGQLIMDRLSVIKQIPDGMLQRPSDANIAIGGSPYLDLLSSL 480

Query: 490 TPFQNCLIICDS-GKVGIHIN 510
           T F+NC+IICDS G+V +  N
Sbjct: 481 TSFENCIIICDSVGQVVLKCN 499

BLAST of Cp4.1LG15g01400 vs. NCBI nr
Match: gi|778682121|ref|XP_011651650.1| (PREDICTED: uncharacterized protein LOC101209700 isoform X2 [Cucumis sativus])

HSP 1 Score: 731.9 bits (1888), Expect = 1.1e-207
Identity = 361/445 (81.12%), Postives = 393/445 (88.31%), Query Frame = 1

Query: 66  RHLLRFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLIL 125
           R   RFSTTTELQCESSP +D+ SFIKSTLDESEGPNHYWLN  + NK I E+DG YLIL
Sbjct: 6   RVFFRFSTTTELQCESSPTSDIFSFIKSTLDESEGPNHYWLNTSNENKVIFEEDGKYLIL 65

Query: 126 ADQFLEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFP 185
           A+QFLEMTSSDSVVLVENVKFLQ RFP LHVIG QCS+TLSVAEKS MIQFIMREY+SFP
Sbjct: 66  ANQFLEMTSSDSVVLVENVKFLQQRFPHLHVIGFQCSSTLSVAEKSDMIQFIMREYISFP 125

Query: 186 ILLSNKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGR 245
           ILLSNKIFE+    CYIISKD SNPLL+SER  DL++LRKAIEEL EPENEKSGL N+G+
Sbjct: 126 ILLSNKIFEVAG--CYIISKDLSNPLLVSERGMDLSILRKAIEELHEPENEKSGLSNMGK 185

Query: 246 TTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILD 305
           TTYLK AE+IKEP SCSFM NF+LH+PGCISADE+GGRLFLSDSNHNRIVIFN  GKILD
Sbjct: 186 TTYLKQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVIFNSYGKILD 245

Query: 306 MIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVETLYPAN 365
           MIGSYPGFEDGEFELVKLARPAASFYH+TQ+CLYFVDSENHAIRKADLGKRVVETLYP N
Sbjct: 246 MIGSYPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSENHAIRKADLGKRVVETLYPEN 305

Query: 366 YSSTKSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLLILNRSLGTLW 425
           YS+ KST+ WSWI D+ GLGSIPDREV+DFNPQS+MFPWHMIRYMDDRLLILNRSLGTLW
Sbjct: 306 YSNKKSTQFWSWIMDKFGLGSIPDREVKDFNPQSIMFPWHMIRYMDDRLLILNRSLGTLW 365

Query: 426 TMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKTVTGELPSLDL 485
           TMDL SGKIIEVVRG S+IME+YGQL MDR+SV+KQIPDGMLQ   DA    G  P LDL
Sbjct: 366 TMDLVSGKIIEVVRGLSRIMESYGQLIMDRLSVIKQIPDGMLQRPSDANIAIGGSPYLDL 425

Query: 486 LSSLTPFQNCLIICDS-GKVGIHIN 510
           LSSLT F+NC+IICDS G+V +  N
Sbjct: 426 LSSLTSFENCIIICDSVGQVVLKCN 448

BLAST of Cp4.1LG15g01400 vs. NCBI nr
Match: gi|659130992|ref|XP_008465461.1| (PREDICTED: uncharacterized protein LOC103503064 isoform X2 [Cucumis melo])

HSP 1 Score: 730.3 bits (1884), Expect = 3.2e-207
Identity = 360/445 (80.90%), Postives = 390/445 (87.64%), Query Frame = 1

Query: 66  RHLLRFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLIL 125
           R   RFSTTTELQCESSP ND+ SFI STLDESEGPNHYWLN  +GNKGI E+DG+YLIL
Sbjct: 18  RVFFRFSTTTELQCESSPINDIFSFINSTLDESEGPNHYWLNTSNGNKGIFEEDGMYLIL 77

Query: 126 ADQFLEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFP 185
           A+QFLEMTSSDS+ LVENVKFLQ RFP LHVIG QC +TLSVAEKS MIQFIMREY+SFP
Sbjct: 78  ANQFLEMTSSDSIDLVENVKFLQQRFPHLHVIGFQCYSTLSVAEKSAMIQFIMREYISFP 137

Query: 186 ILLSNKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGR 245
           ILLSNKIFE+    C IISKD SNPLL+ ERD DL++L KAIEEL EPENEKSGL N G+
Sbjct: 138 ILLSNKIFEVAG--CCIISKDLSNPLLVCERDMDLSILCKAIEELHEPENEKSGLSNKGK 197

Query: 246 TTYLKHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILD 305
           TTYLK AE+IKEP SCSFM NF+LH+PGCISADE+GGRLFLSDSNHNRIVI N  GKILD
Sbjct: 198 TTYLKQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVILNSYGKILD 257

Query: 306 MIGSYPGFEDGEFELVKLARPAASFYHATQDCLYFVDSENHAIRKADLGKRVVETLYPAN 365
           MIGSYPGFEDGEFELVKLARPAASFYH+TQ+CLYFVDSENHAIRKADLGKRVVETLYP N
Sbjct: 258 MIGSYPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSENHAIRKADLGKRVVETLYPEN 317

Query: 366 YSSTKSTKLWSWIKDRLGLGSIPDREVEDFNPQSLMFPWHMIRYMDDRLLILNRSLGTLW 425
           YS+ KST+LWSWI D+ GLGSIPDREVEDFNPQSLMFPWH+IRYMDDRLLILNRSLGTLW
Sbjct: 318 YSNKKSTQLWSWIMDKFGLGSIPDREVEDFNPQSLMFPWHLIRYMDDRLLILNRSLGTLW 377

Query: 426 TMDLASGKIIEVVRGHSKIMENYGQLFMDRVSVLKQIPDGMLQLQRDAKTVTGELPSLDL 485
           TMDL SGKIIEVVRG S+IMENYG L MDR+SVLKQIPDG LQ   DA   TG  P +DL
Sbjct: 378 TMDLVSGKIIEVVRGLSRIMENYGHLIMDRLSVLKQIPDGTLQQPSDANIATGGSPYMDL 437

Query: 486 LSSLTPFQNCLIICDS-GKVGIHIN 510
           LSSLTPF+NC+IICDS G+V +  N
Sbjct: 438 LSSLTPFKNCIIICDSVGQVVLKYN 460

BLAST of Cp4.1LG15g01400 vs. NCBI nr
Match: gi|700203221|gb|KGN58354.1| (hypothetical protein Csa_3G627650 [Cucumis sativus])

HSP 1 Score: 538.9 bits (1387), Expect = 1.3e-149
Identity = 267/335 (79.70%), Postives = 295/335 (88.06%), Query Frame = 1

Query: 10  MAFRFRRLREISKSLPQFYSGYYHQHHHRHAVSSLPFSVAPSYVSEGVERRILESGRHLL 69
           MAFRFRRL+EIS+S+PQ YS +YHQHH R+ VSSL  SVAP  VSE + RR+  +GR+  
Sbjct: 1   MAFRFRRLKEISRSIPQIYSEFYHQHHRRYGVSSLALSVAPFRVSERIGRRLFYNGRYFT 60

Query: 70  RFSTTTELQCESSPANDVLSFIKSTLDESEGPNHYWLNICDGNKGISEKDGIYLILADQF 129
           RFSTTTELQCESSP +D+ SFIKSTLDESEGPNHYWLN  + NK I E+DG YLILA+QF
Sbjct: 61  RFSTTTELQCESSPTSDIFSFIKSTLDESEGPNHYWLNTSNENKVIFEEDGKYLILANQF 120

Query: 130 LEMTSSDSVVLVENVKFLQHRFPQLHVIGLQCSNTLSVAEKSGMIQFIMREYVSFPILLS 189
           LEMTSSDSVVLVENVKFLQ RFP LHVIG QCS+TLSVAEKS MIQFIMREY+SFPILLS
Sbjct: 121 LEMTSSDSVVLVENVKFLQQRFPHLHVIGFQCSSTLSVAEKSDMIQFIMREYISFPILLS 180

Query: 190 NKIFEMGRRLCYIISKDFSNPLLISERDTDLNVLRKAIEELLEPENEKSGLPNIGRTTYL 249
           NKIFE+    CYIISKD SNPLL+SER  DL++LRKAIEEL EPENEKSGL N+G+TTYL
Sbjct: 181 NKIFEVAG--CYIISKDLSNPLLVSERGMDLSILRKAIEELHEPENEKSGLSNMGKTTYL 240

Query: 250 KHAEIIKEPYSCSFMQNFILHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS 309
           K AE+IKEP SCSFM NF+LH+PGCISADE+GGRLFLSDSNHNRIVIFN  GKILDMIGS
Sbjct: 241 KQAEMIKEPNSCSFMHNFLLHYPGCISADEEGGRLFLSDSNHNRIVIFNSYGKILDMIGS 300

Query: 310 YPGFEDGEFELVKLARPAASFYHATQDCLYFVDSE 345
           YPGFEDGEFELVKLARPAASFYH+TQ+CLYFVDSE
Sbjct: 301 YPGFEDGEFELVKLARPAASFYHSTQNCLYFVDSE 333

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NHLC2_CHICK5.5e-1027.01NHL repeat-containing protein 2 OS=Gallus gallus GN=NHLRC2 PE=2 SV=1[more]
NHLC2_MOUSE8.0e-0937.23NHL repeat-containing protein 2 OS=Mus musculus GN=Nhlrc2 PE=1 SV=1[more]
NHLC2_BOVIN1.1e-0737.23NHL repeat-containing protein 2 OS=Bos taurus GN=NHLRC2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LBM4_CUCSA9.4e-15079.70Uncharacterized protein OS=Cucumis sativus GN=Csa_3G627650 PE=4 SV=1[more]
Q9M8Z4_ARATH6.3e-14643.99F17A9.23 protein OS=Arabidopsis thaliana GN=F17A9.23 PE=4 SV=1[more]
B9GNJ6_POPTR6.3e-14655.31Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s24280g PE=4 SV=1[more]
B9S538_RICCO1.4e-12948.81Catalytic, putative OS=Ricinus communis GN=RCOM_1720980 PE=4 SV=1[more]
A0A067FEX4_CITSI1.0e-12748.30Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g004302mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G07060.11.6e-10341.34 NHL domain-containing protein[more]
AT1G56500.13.2e-1644.21 haloacid dehalogenase-like hydrolase family protein[more]
Match NameE-valueIdentityDescription
gi|659130990|ref|XP_008465459.1|4.0e-23480.24PREDICTED: uncharacterized protein LOC103503064 isoform X1 [Cucumis melo][more]
gi|778682118|ref|XP_011651649.1|3.8e-22979.24PREDICTED: uncharacterized protein LOC101209700 isoform X1 [Cucumis sativus][more]
gi|778682121|ref|XP_011651650.1|1.1e-20781.12PREDICTED: uncharacterized protein LOC101209700 isoform X2 [Cucumis sativus][more]
gi|659130992|ref|XP_008465461.1|3.2e-20780.90PREDICTED: uncharacterized protein LOC103503064 isoform X2 [Cucumis melo][more]
gi|700203221|gb|KGN58354.1|1.3e-14979.70hypothetical protein Csa_3G627650 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR0110426-blade_b-propeller_TolB-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g01400.1Cp4.1LG15g01400.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011042Six-bladed beta-propeller, TolB-likeGENE3DG3DSA:2.120.10.30coord: 269..365
score: 3.2E-14coord: 400..442
score: 3.2
NoneNo IPR availablePANTHERPTHR24104FAMILY NOT NAMEDcoord: 70..709
score: 4.0E
NoneNo IPR availablePANTHERPTHR24104:SF10NHL DOMAIN-CONTAINING PROTEINcoord: 70..709
score: 4.0E
NoneNo IPR availableunknownSSF63825YWTD domaincoord: 265..367
score: 8.11E-14coord: 400..441
score: 8.11

The following gene(s) are paralogous to this gene:

None