Cp4.1LG01g08830 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g08830
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DNA polymerase III subunit gamma/tau
Location: Cp4.1LG01 : 4711575 .. 4716742 (+)
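
The Location field can be turned into a genomic span with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page; the helper name `parse_location` is illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cp4.1LG01 : 4711575 .. 4716742 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 5168 bp on the plus strand of Cp4.1LG01.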
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGAAGTTCGAGTTTCTGATCCTAGTAAGCTTCATTTGAAAAAGGAACTTACTCAAATCCGCAAGGCTGCTCGTGTTCTCCGGGATCCTGGTACTACCTCGTCTTGGAAGTCTCCGCTTAACTCTTCTAGATCTGTATTGGCTGCGGTGCCGGGTGGAGCGTCTTCTTCTTTGAACAAGAACTTGGAAAGTGAGACCAGGAGGCATAGTGGCCAATCCCAACTGGACGCCATTGTTCCTCCTCGAAATGAAAATCGGAATCCCAAGGACAAGAAGATATACCTCTACAACTGGAAGAGCCATAAATCATCAAGCGAAAAGAGTGTTATCCATCAGAAGGAAGACCGTGATGGCAACAACGGTACTAATGATGGGTCTTATTCAGTTCCGGGGCTCAGTCTTGATGATAGCTTGAGTGATGCTCGAAATGGAGGCGACTCAAAGAGCGACACCTACTTGGGAGATCTCTGTTCTTCAATGGTCTTCAGGTGCGGTGATGCAAATCTAGTGTCATATGGCGGACCATTGGCCAAACGGGCTTCTGCGGTCAAGAAAAAGAGTAAGAAGCATTGTTCCCATTTGGATGTTTTGTCGAGACATCGACAAAAGGGTCCTGTTCTTGGTAGGAAATTGTTGGAGGGCCATCCTTCGTTGTCTATTAGTTTCAGCCAGGATGATTCGATCGAGCAGTCTGATGACACCGAAGATTACTCTAACTCAGAGGATTTCAGACGATATTCTGCGGCTTCCCCTTTACTATTGAAGCTCCACCCATCTGCTAAGTTATTGAGAAATCATCGAAAAGAGGACTCTTCCTATTCTTATAGCACCCCAGCTTTATCTACTAGTTCTTATAATAGGTATGTTAATAACAACCCAAGTACTGTTGGGTCTTGGGAAGGCACCACAACTTCGATTAATGATGCAGATGATGAAGTGGATGATCAATTAGATTTTCCTGGTCGTCAGGGATGTGGTATTCCTTGCTATTGGTCAAAGCGGACGCCAAAGCATAGAGGAGTTTGTGGAGGTTGTTGCTCTCCTTCACTTTCTGATACCTGGAGAAGGAAGGGAAGTAGCATTTTGTTTGGTAGTCAATCTATTTATTCTAGGCGCAAATCATTAAATTCCAGTAACCGAAGACTTACTTCAGGAAGTGCTCGAGGGGTCCTCCCATTGCTTACTAACAGTGCAGATGGCAGAGTTGGTTCATCGATTGGAACCGGGAGAAGTGATGATGAACTGTCTACTAACTTTGGGGAGCTTGATTTGGAGGCTCTGAGTAGGTTAGATGGACGAAGATGGTCAAGTTGCAGGAGTCATGAAGGGCTAGAGATTGTTGCTTTAAATGGGGAGGTAGAGGAGGGAAGTACACCAGAAAGTACAACAAGTTTCAGCCAGAAGTATAGACCGATGTTTTTTAATGAACTGATAGGTCAGAATATAGTGGTACAATCACTTATAAATGCTATTTCAAGGGGACGGATTGCTCCTGTTTATCTTTTCCAAGGCCCACGGGGTACTGGAAAAACAACAGCAGCAAGGATTTTTGCTGCTGCGTTGAATTGTTTGGCCCCGGAGGAAAATAAGCCATGTGGATACTGCAGAGAATGCACTGATTTCATGTCTGGCAAACAAAAGGATCTCTTGGAAATTGATGGAACAAATAGGAAGGGAATAGATAGAATTAGATACCAATTAAAAAAGTTATCATCCGGGTCATCTTCAGCCTTCTTGAGATATAAAGTTTTTCTCATTGATGAGTGTCATTTGTTGCCCTCTAAGGCGTGGCTCACATTTCTCAAATTCTTTGAAGAACCTCCTCAACGTGTTGTCTTCATATTCATAACTACTGATCTTGACAGTATACCCCGTACCATTCAGTCAAGGTGTCAGAAGTACATATTTAACAAAATAAAAGATTGTGACATGGTAGAAAGACTTAAAAGAATTTCTGCAGAGGAGAACTTGGATGCTGATTTGGATGCATTGGATTT
GATAGCTATGAATGCTGATGGTTCACTTAGAGATGCTGAAACAATGTTGGAACAATTGAGTTTGTTAGGGAAAAGGATAACAATATCTCTGGTTAATGAACTTGTGAGCACAGTCCCTATTCTTTCGCTTTCATGTTACCTGGTTTGATTTGATTTTATTCTTTCCACTTTTTGAAAATAGTTCTTCATATTTATCACTTGTTTTCCTCTTTCATAGGTTGGCATTGTTTCTGATGAAAAGTTGCTTGAGCTTTTAGCGCTAGCAATGTCTTCAAACACCGCAGAAACAGTTAAAAGAGCAAGAGAGTTGATGGATTCTGGGGTTGATCCGCTGGTTTTGATGTCTCAGCTCGCCAGCTTGATTATGGACATTATTGCTGGAACCTACAACATTATTGATCCTAAAGACAGTGCTTCAATATTTTGTGGACGCAGTTGTGAGTATTTTGCACCGTCTTCTGATGTCATTAAATCAAAATCATGCTGGTGTTATGTTTAAGATATGATATACTTATGAGATTATAAGAACACAATTTTTTATTATTTCGAACTGTTAATGTCTGCTTGAATAATTGGGTTTGATGTGATTAGAGGCCCCTAATACGGTAAACCTATGTTATTGAACTCTTAATGTACCTGCATCAACTCTTCTTTTCATGGTCAGTTTTAAACATGAATTTCCATTTCTTCCTTGTAATGAAGATTACTTGTTCATCCATAAGTTGTACTGGTGGGGAAATTCTTCCGTTATAAATATGATTTTTTTGTTTTTCTGTATTTTATTTGATTATTAACTGTACAATGTTCGTTTTGATTTGATTATCCTTATAATGAAAGATGATGATGTTGTCCTCTGAAGTGAAAAGAATATCTTTCTAGCTAATATTGATCTACGAAATTAGTATAACAAATGTTCTGGAGATTGCTACGATTATCCTAAATTGACGTCTTTTGTGGCTTTCCTTCAGTAAGCGAAACCGAAGTGGAAAGATTGAAGCATGCTCTTAAGTTTCTTTCGGAGGCCGAGAAGCAGTTGAGAGTTTCCAGTGAGCGTTCAACTTGGTTCACAGCAACTCTTTTGCAACTTGGTTCCATATCTTCTCTAGATTTCACTCCGACAGGCAGCAATAGGAGACAGAGCTGCAAGACAACTGACGATGATCCATCAACTACTTCAAATGGGACAATTGGCTACAAACAAAAGTCATTTTCTCATCTTATACCAAAGTTAGGTTCCCCTGCATCTTTGTGCAACCTGAAAAATGGCAATTATAATAATCAAGGGGATTTGTCGCCTATGGTTGATAGTTTGAGTAACAACCCCAAGCCCACGCATAAACAGTTCATGGAGGGTAAAAACTCTTTTTCGCGTGATGATGCTACTCTTAGAAATATGGTTTTCAGATGCAAAAACTCAGAAAAGTTGGATAACATCTGGGTGCATTGTATTGAAAGATGCCACTCAAAGACGTTGCGGCAGCTATTGTATGCTTATGGGAAGCTTTTGTCCCTTTCGGAATCTGAAGGTAAGGACCAGAACCATTGATTGTCTTCCGAAATTGTACCCTTTCTCGTTTGATAGGTGCTTAATTTCTTCTGCTTGCAGATACCCTTATTGCATACGTTGCCTTTGAGGACGCAGATATCAAATCCAGAGCTGAAAGGTTTCTGAGCAGTATCACAAATTCTATGGAGATGGTTCTTAGATGCAATGTAGAGGTTAGAATCATTTTGTTACCAGATGGTGAGACTTCTATTAATGGTATGACTGCAGCCAAGTCGTCCGAAGGTGTAGAACACGAACTTGTTGATAAAGAAAGGAAAATTGCCAATCTTAATGCAATGGAGGGCTATTCTAGCCGCTCTTTGATTCTGGATGGAACATATCAAGCAACCTCTGATTCATCGCAGCTACCATCCGAAAGTAACAATCAAATAGACGGTTCGAGGGACAGGAGACAGGAAATCCCGATGCAGAGAATAGAATCAATTATTCGTGA
ACAAAGGTTGGAAACTGCCTGGTTACAGGCCATGGAAAAAGGCACACCTGGATCTTTGAGTCGTTTGAAACCCGAGAAGAATCAAGTCCTGCCTCAAGATGGTTCATACTATAAAGATCAAACGGAAGAAATGAATTCAACAGGGGACTCCTCTCGGAAATGGGATGATGAATTGAACCGTGAGCTTAAAGTTCTGAAGGCTAATGAAGAGCTAATTGCCCAGAAGGAGCAGGTTGGCAGACGGGTGGACCGCTATGCTATCTCCCCAAGTATACTGCACGATGGCGGCATGGTGGGAAATGCAAACAAGGATAACCTGTGAGTTTCTCTAGCTTTCGATTGAATCACTGAGTTTTCTCGACTTGAATTTCATTATGTTTTTGCTGAGTAGAGGATGAACAACGTAAAATATGGTTTTCTTATGCATATTTCCATGGGAGGATCATCAAAGTTTATCCTAAATGATCTCAACTCTTAATTTTAACAAGATCCATGCCTCCATTTGTTTTAAATACAGGGGATATGAATCAAGCTCAGCGGTTGGTGGTTGTAGTGGATTGTTCTGCTGGAACAACAGCAAATCCCATAAAAGGGGAAAGGTAATTTCCATAGTTCACTTTTCCTTTTTATGAACATTTAACTTGACGTTTAGTTGGGAGTCATTGATGTCTGTTTTTTCTGTTGGTGATGAGCAGGTAAGAACCAACCATGGTCGGTCACGCAGTGGAAGATTTTCACTGTTTGGGGAGTGTGGGAAGTCGAGGAATTTCGGGAGTCGATCTAGACGATAAACATGACTCTGGTAATCGTTTTGTGTCTGTAATCTTGTGTTTTTTGGTACAATTCTTTATGGGCATAGGGATGGAGGGGGAAGTAGTTGATGTAAAGTTCTCATTAATCGAGCCAATAAATGGTCAGTAATGAACATTGGAGTGTGCCTAATTGGAGGGGGAAGCTTTGGTTCCAAATAGAATTCTTTTTGACTTATTTGTAAAGAATCTCAGCCTATCACTATCTTAAAAAGATGTATACTCTGTCGGTTTAGTTAGATAAGGAAATCTTAGAACTCAACATCTCCATATCTTCACAGAATGGGCCTTTCCTCTTTTGGGGGAAAATGTTAGTTTTGACCTTTTGAACTAAACAAATAAAATGATTTTTTGGGAAT

mRNA sequence

ATGGCCGAAGTTCGAGTTTCTGATCCTAGTAAGCTTCATTTGAAAAAGGAACTTACTCAAATCCGCAAGGCTGCTCGTGTTCTCCGGGATCCTGGTACTACCTCGTCTTGGAAGTCTCCGCTTAACTCTTCTAGATCTGTATTGGCTGCGGTGCCGGGTGGAGCGTCTTCTTCTTTGAACAAGAACTTGGAAAGTGAGACCAGGAGGCATAGTGGCCAATCCCAACTGGACGCCATTGTTCCTCCTCGAAATGAAAATCGGAATCCCAAGGACAAGAAGATATACCTCTACAACTGGAAGAGCCATAAATCATCAAGCGAAAAGAGTGTTATCCATCAGAAGGAAGACCGTGATGGCAACAACGGTACTAATGATGGGTCTTATTCAGTTCCGGGGCTCAGTCTTGATGATAGCTTGAGTGATGCTCGAAATGGAGGCGACTCAAAGAGCGACACCTACTTGGGAGATCTCTGTTCTTCAATGGTCTTCAGGTGCGGTGATGCAAATCTAGTGTCATATGGCGGACCATTGGCCAAACGGGCTTCTGCGGTCAAGAAAAAGAGTAAGAAGCATTGTTCCCATTTGGATGTTTTGTCGAGACATCGACAAAAGGGTCCTGTTCTTGGTAGGAAATTGTTGGAGGGCCATCCTTCGTTGTCTATTAGTTTCAGCCAGGATGATTCGATCGAGCAGTCTGATGACACCGAAGATTACTCTAACTCAGAGGATTTCAGACGATATTCTGCGGCTTCCCCTTTACTATTGAAGCTCCACCCATCTGCTAAGTTATTGAGAAATCATCGAAAAGAGGACTCTTCCTATTCTTATAGCACCCCAGCTTTATCTACTAGTTCTTATAATAGGTATGTTAATAACAACCCAAGTACTGTTGGGTCTTGGGAAGGCACCACAACTTCGATTAATGATGCAGATGATGAAGTGGATGATCAATTAGATTTTCCTGGTCGTCAGGGATGTGGTATTCCTTGCTATTGGTCAAAGCGGACGCCAAAGCATAGAGGAGTTTGTGGAGGTTGTTGCTCTCCTTCACTTTCTGATACCTGGAGAAGGAAGGGAAGTAGCATTTTGTTTGGTAGTCAATCTATTTATTCTAGGCGCAAATCATTAAATTCCAGTAACCGAAGACTTACTTCAGGAAGTGCTCGAGGGGTCCTCCCATTGCTTACTAACAGTGCAGATGGCAGAGTTGGTTCATCGATTGGAACCGGGAGAAGTGATGATGAACTGTCTACTAACTTTGGGGAGCTTGATTTGGAGGCTCTGAGTAGGTTAGATGGACGAAGATGGTCAAGTTGCAGGAGTCATGAAGGGCTAGAGATTGTTGCTTTAAATGGGGAGGTAGAGGAGGGAAGTACACCAGAAAGTACAACAAGTTTCAGCCAGAAGTATAGACCGATGTTTTTTAATGAACTGATAGGTCAGAATATAGTGGTACAATCACTTATAAATGCTATTTCAAGGGGACGGATTGCTCCTGTTTATCTTTTCCAAGGCCCACGGGGTACTGGAAAAACAACAGCAGCAAGGATTTTTGCTGCTGCGTTGAATTGTTTGGCCCCGGAGGAAAATAAGCCATGTGGATACTGCAGAGAATGCACTGATTTCATGTCTGGCAAACAAAAGGATCTCTTGGAAATTGATGGAACAAATAGGAAGGGAATAGATAGAATTAGATACCAATTAAAAAAGTTATCATCCGGGTCATCTTCAGCCTTCTTGAGATATAAAGTTTTTCTCATTGATGAGTGTCATTTGTTGCCCTCTAAGGCGTGGCTCACATTTCTCAAATTCTTTGAAGAACCTCCTCAACGTGTTGTCTTCATATTCATAACTACTGATCTTGACAGTATACCCCGTACCATTCAGTCAAGGTGTCAGAAGTACATATTTAACAAAATAAAAGATTGTGACATGGTAGAAAGACTTAAAAGAATTTCTGCAGAGGAGAACTTGGATGCTGATTTGGATGCATTGGATTT
GATAGCTATGAATGCTGATGGTTCACTTAGAGATGCTGAAACAATGTTGGAACAATTGAGTTTGTTAGGGAAAAGGATAACAATATCTCTGGTTAATGAACTTGTGAGCACAGTCCCTATTCTTTCGCTTTCATGTTACCTGGTTGGCATTGTTTCTGATGAAAAGTTGCTTGAGCTTTTAGCGCTAGCAATGTCTTCAAACACCGCAGAAACAGTTAAAAGAGCAAGAGAGTTGATGGATTCTGGGGTTGATCCGCTGGTTTTGATGTCTCAGCTCGCCAGCTTGATTATGGACATTATTGCTGGAACCTACAACATTATTGATCCTAAAGACAGTGCTTCAATATTTTGTGGACGCAGTTTAAGCGAAACCGAAGTGGAAAGATTGAAGCATGCTCTTAAGTTTCTTTCGGAGGCCGAGAAGCAGTTGAGAGTTTCCAGTGAGCGTTCAACTTGGTTCACAGCAACTCTTTTGCAACTTGGTTCCATATCTTCTCTAGATTTCACTCCGACAGGCAGCAATAGGAGACAGAGCTGCAAGACAACTGACGATGATCCATCAACTACTTCAAATGGGACAATTGGCTACAAACAAAAGTCATTTTCTCATCTTATACCAAAGTTAGGTTCCCCTGCATCTTTGTGCAACCTGAAAAATGGCAATTATAATAATCAAGGGGATTTGTCGCCTATGGTTGATAGTTTGAGTAACAACCCCAAGCCCACGCATAAACAGTTCATGGAGGGTAAAAACTCTTTTTCGCGTGATGATGCTACTCTTAGAAATATGGTTTTCAGATGCAAAAACTCAGAAAAGTTGGATAACATCTGGGTGCATTGTATTGAAAGATGCCACTCAAAGACGTTGCGGCAGCTATTGTATGCTTATGGGAAGCTTTTGTCCCTTTCGGAATCTGAAGATACCCTTATTGCATACGTTGCCTTTGAGGACGCAGATATCAAATCCAGAGCTGAAAGGTTTCTGAGCAGTATCACAAATTCTATGGAGATGGTTCTTAGATGCAATGTAGAGGTTAGAATCATTTTGTTACCAGATGGTGAGACTTCTATTAATGGTATGACTGCAGCCAAGTCGTCCGAAGGTGTAGAACACGAACTTGTTGATAAAGAAAGGAAAATTGCCAATCTTAATGCAATGGAGGGCTATTCTAGCCGCTCTTTGATTCTGGATGGAACATATCAAGCAACCTCTGATTCATCGCAGCTACCATCCGAAAGTAACAATCAAATAGACGGTTCGAGGGACAGGAGACAGGAAATCCCGATGCAGAGAATAGAATCAATTATTCGTGAACAAAGGTTGGAAACTGCCTGGTTACAGGCCATGGAAAAAGGCACACCTGGATCTTTGAGTCGTTTGAAACCCGAGAAGAATCAAGTCCTGCCTCAAGATGGTTCATACTATAAAGATCAAACGGAAGAAATGAATTCAACAGGGGACTCCTCTCGGAAATGGGATGATGAATTGAACCGTGAGCTTAAAGTTCTGAAGGCTAATGAAGAGCTAATTGCCCAGAAGGAGCAGGTTGGCAGACGGGTGGACCGCTATGCTATCTCCCCAAGTATACTGCACGATGGCGGCATGGTGGGAAATGCAAACAAGGATAACCTGGGATATGAATCAAGCTCAGCGGTTGGTGGTTGTAGTGGATTGTTCTGCTGGAACAACAGCAAATCCCATAAAAGGGGAAAGGTAAGAACCAACCATGGTCGGTCACGCAGTGGAAGATTTTCACTGTTTGGGGAGTGTGGGAAGTCGAGGAATTTCGGGAGTCGATCTAGACGATAAACATGACTCTGGTAATCGTTTTGTGTCTGTAATCTTGTGTTTTTTGGTACAATTCTTTATGGGCATAGGGATGGAGGGGGAAGTAGTTGATGTAAAGTTCTCATTAATCGAGCCAATAAATGGTCAGTAATGAACATTGGAGTGTGCCTAATTGGAGGGGGAAGCTTTGGTTCCAAATAGAATTCTTTTT
GACTTATTTGTAAAGAATCTCAGCCTATCACTATCTTAAAAAGATGTATACTCTGTCGGTTTAGTTAGATAAGGAAATCTTAGAACTCAACATCTCCATATCTTCACAGAATGGGCCTTTCCTCTTTTGGGGGAAAATGTTAGTTTTGACCTTTTGAACTAAACAAATAAAATGATTTTTTGGGAAT

Coding sequence (CDS)

ATGGCCGAAGTTCGAGTTTCTGATCCTAGTAAGCTTCATTTGAAAAAGGAACTTACTCAAATCCGCAAGGCTGCTCGTGTTCTCCGGGATCCTGGTACTACCTCGTCTTGGAAGTCTCCGCTTAACTCTTCTAGATCTGTATTGGCTGCGGTGCCGGGTGGAGCGTCTTCTTCTTTGAACAAGAACTTGGAAAGTGAGACCAGGAGGCATAGTGGCCAATCCCAACTGGACGCCATTGTTCCTCCTCGAAATGAAAATCGGAATCCCAAGGACAAGAAGATATACCTCTACAACTGGAAGAGCCATAAATCATCAAGCGAAAAGAGTGTTATCCATCAGAAGGAAGACCGTGATGGCAACAACGGTACTAATGATGGGTCTTATTCAGTTCCGGGGCTCAGTCTTGATGATAGCTTGAGTGATGCTCGAAATGGAGGCGACTCAAAGAGCGACACCTACTTGGGAGATCTCTGTTCTTCAATGGTCTTCAGGTGCGGTGATGCAAATCTAGTGTCATATGGCGGACCATTGGCCAAACGGGCTTCTGCGGTCAAGAAAAAGAGTAAGAAGCATTGTTCCCATTTGGATGTTTTGTCGAGACATCGACAAAAGGGTCCTGTTCTTGGTAGGAAATTGTTGGAGGGCCATCCTTCGTTGTCTATTAGTTTCAGCCAGGATGATTCGATCGAGCAGTCTGATGACACCGAAGATTACTCTAACTCAGAGGATTTCAGACGATATTCTGCGGCTTCCCCTTTACTATTGAAGCTCCACCCATCTGCTAAGTTATTGAGAAATCATCGAAAAGAGGACTCTTCCTATTCTTATAGCACCCCAGCTTTATCTACTAGTTCTTATAATAGGTATGTTAATAACAACCCAAGTACTGTTGGGTCTTGGGAAGGCACCACAACTTCGATTAATGATGCAGATGATGAAGTGGATGATCAATTAGATTTTCCTGGTCGTCAGGGATGTGGTATTCCTTGCTATTGGTCAAAGCGGACGCCAAAGCATAGAGGAGTTTGTGGAGGTTGTTGCTCTCCTTCACTTTCTGATACCTGGAGAAGGAAGGGAAGTAGCATTTTGTTTGGTAGTCAATCTATTTATTCTAGGCGCAAATCATTAAATTCCAGTAACCGAAGACTTACTTCAGGAAGTGCTCGAGGGGTCCTCCCATTGCTTACTAACAGTGCAGATGGCAGAGTTGGTTCATCGATTGGAACCGGGAGAAGTGATGATGAACTGTCTACTAACTTTGGGGAGCTTGATTTGGAGGCTCTGAGTAGGTTAGATGGACGAAGATGGTCAAGTTGCAGGAGTCATGAAGGGCTAGAGATTGTTGCTTTAAATGGGGAGGTAGAGGAGGGAAGTACACCAGAAAGTACAACAAGTTTCAGCCAGAAGTATAGACCGATGTTTTTTAATGAACTGATAGGTCAGAATATAGTGGTACAATCACTTATAAATGCTATTTCAAGGGGACGGATTGCTCCTGTTTATCTTTTCCAAGGCCCACGGGGTACTGGAAAAACAACAGCAGCAAGGATTTTTGCTGCTGCGTTGAATTGTTTGGCCCCGGAGGAAAATAAGCCATGTGGATACTGCAGAGAATGCACTGATTTCATGTCTGGCAAACAAAAGGATCTCTTGGAAATTGATGGAACAAATAGGAAGGGAATAGATAGAATTAGATACCAATTAAAAAAGTTATCATCCGGGTCATCTTCAGCCTTCTTGAGATATAAAGTTTTTCTCATTGATGAGTGTCATTTGTTGCCCTCTAAGGCGTGGCTCACATTTCTCAAATTCTTTGAAGAACCTCCTCAACGTGTTGTCTTCATATTCATAACTACTGATCTTGACAGTATACCCCGTACCATTCAGTCAAGGTGTCAGAAGTACATATTTAACAAAATAAAAGATTGTGACATGGTAGAAAGACTTAAAAGAATTTCTGCAGAGGAGAACTTGGATGCTGATTTGGATGCATTGGATTT
GATAGCTATGAATGCTGATGGTTCACTTAGAGATGCTGAAACAATGTTGGAACAATTGAGTTTGTTAGGGAAAAGGATAACAATATCTCTGGTTAATGAACTTGTGAGCACAGTCCCTATTCTTTCGCTTTCATGTTACCTGGTTGGCATTGTTTCTGATGAAAAGTTGCTTGAGCTTTTAGCGCTAGCAATGTCTTCAAACACCGCAGAAACAGTTAAAAGAGCAAGAGAGTTGATGGATTCTGGGGTTGATCCGCTGGTTTTGATGTCTCAGCTCGCCAGCTTGATTATGGACATTATTGCTGGAACCTACAACATTATTGATCCTAAAGACAGTGCTTCAATATTTTGTGGACGCAGTTTAAGCGAAACCGAAGTGGAAAGATTGAAGCATGCTCTTAAGTTTCTTTCGGAGGCCGAGAAGCAGTTGAGAGTTTCCAGTGAGCGTTCAACTTGGTTCACAGCAACTCTTTTGCAACTTGGTTCCATATCTTCTCTAGATTTCACTCCGACAGGCAGCAATAGGAGACAGAGCTGCAAGACAACTGACGATGATCCATCAACTACTTCAAATGGGACAATTGGCTACAAACAAAAGTCATTTTCTCATCTTATACCAAAGTTAGGTTCCCCTGCATCTTTGTGCAACCTGAAAAATGGCAATTATAATAATCAAGGGGATTTGTCGCCTATGGTTGATAGTTTGAGTAACAACCCCAAGCCCACGCATAAACAGTTCATGGAGGGTAAAAACTCTTTTTCGCGTGATGATGCTACTCTTAGAAATATGGTTTTCAGATGCAAAAACTCAGAAAAGTTGGATAACATCTGGGTGCATTGTATTGAAAGATGCCACTCAAAGACGTTGCGGCAGCTATTGTATGCTTATGGGAAGCTTTTGTCCCTTTCGGAATCTGAAGATACCCTTATTGCATACGTTGCCTTTGAGGACGCAGATATCAAATCCAGAGCTGAAAGGTTTCTGAGCAGTATCACAAATTCTATGGAGATGGTTCTTAGATGCAATGTAGAGGTTAGAATCATTTTGTTACCAGATGGTGAGACTTCTATTAATGGTATGACTGCAGCCAAGTCGTCCGAAGGTGTAGAACACGAACTTGTTGATAAAGAAAGGAAAATTGCCAATCTTAATGCAATGGAGGGCTATTCTAGCCGCTCTTTGATTCTGGATGGAACATATCAAGCAACCTCTGATTCATCGCAGCTACCATCCGAAAGTAACAATCAAATAGACGGTTCGAGGGACAGGAGACAGGAAATCCCGATGCAGAGAATAGAATCAATTATTCGTGAACAAAGGTTGGAAACTGCCTGGTTACAGGCCATGGAAAAAGGCACACCTGGATCTTTGAGTCGTTTGAAACCCGAGAAGAATCAAGTCCTGCCTCAAGATGGTTCATACTATAAAGATCAAACGGAAGAAATGAATTCAACAGGGGACTCCTCTCGGAAATGGGATGATGAATTGAACCGTGAGCTTAAAGTTCTGAAGGCTAATGAAGAGCTAATTGCCCAGAAGGAGCAGGTTGGCAGACGGGTGGACCGCTATGCTATCTCCCCAAGTATACTGCACGATGGCGGCATGGTGGGAAATGCAAACAAGGATAACCTGGGATATGAATCAAGCTCAGCGGTTGGTGGTTGTAGTGGATTGTTCTGCTGGAACAACAGCAAATCCCATAAAAGGGGAAAGGTAAGAACCAACCATGGTCGGTCACGCAGTGGAAGATTTTCACTGTTTGGGGAGTGTGGGAAGTCGAGGAATTTCGGGAGTCGATCTAGACGATAA

Protein sequence

MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSLNKNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKEDRDGNNGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVSYGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQSDDTEDYSNSEDFRRYSAASPLLLKLHPSAKLLRNHRKEDSSYSYSTPALSTSSYNRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGCCSPSLSDTWRRKGSSILFGSQSIYSRRKSLNSSNRRLTSGSARGVLPLLTNSADGRVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSSCRSHEGLEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIPKLGSPASLCNLKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKNSFSRDDATLRNMVFRCKNSEKLDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSMEMVLRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKERKIANLNAMEGYSSRSLILDGTYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLSRLKPEKNQVLPQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKANEELIAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGRSRSGRFSLFGECGKSRNFGSRSRR
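
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The following is a minimal sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative. It translates only the first 30 and the last 24 nucleotides of the CDS listed above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS above.
print(translate("ATGGCCGAAGTTCGAGTTTCTGATCCTAGT"))
# Last 24 nt of the CDS above, ending in the TAA stop codon.
print(translate("TTCGGGAGTCGATCTAGACGATAA"))
```

The two fragments translate to `MAEVRVSDPS` and `FGSRSRR`, which agree with the start and end of the protein sequence listed above.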
BLAST of Cp4.1LG01g08830 vs. Swiss-Prot
Match: STI_ARATH (Protein STICHEL OS=Arabidopsis thaliana GN=STI PE=1 SV=2)

HSP 1 Score: 1234.6 bits (3193), Expect = 0.0e+00
Identity = 730/1300 (56.15%), Positives = 899/1300 (69.15%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSLN 60
            M+  RVSD SKLHLKKELTQIRKA RVLRDPGTTSSWKSPL+SSRSV             
Sbjct: 1    MSGSRVSDLSKLHLKKELTQIRKAGRVLRDPGTTSSWKSPLDSSRSVAL----------- 60

Query: 61   KNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKEDRDGN 120
              LE+   R+ G S    I    + NR  K+KK++LYNWK+ KSSSEKS + +    +  
Sbjct: 61   --LETPASRNGGSSSQFPIRGESSTNRRGKEKKVFLYNWKTQKSSSEKSGLAKNGKEEEE 120

Query: 121  NGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCS-SMVFRCGDANLVSYGGPLAK 180
               +  S++   ++ DD +SDARNGGDS    Y  ++ S SM FRC D NL S G    +
Sbjct: 121  EEEDASSWTQASVNDDDDVSDARNGGDS----YRREIQSASMGFRCRDTNLASQGVSKMR 180

Query: 181  RAS--AVKKKSKKHCS--HLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQSDDT 240
            +++  + KKKSKK  S   LD LS+++ +  ++ R    G                SDDT
Sbjct: 181  KSNVGSCKKKSKKKISSSRLDCLSKYQPRDDIVARNCNAG----------------SDDT 240

Query: 241  ED-YSNSEDFRRYSAASPLLLKL------HPSAKLLR-NHRKEDSSYSY-STPALSTSSY 300
            E+  SNSED R+ + ASPLLLKL        S++LLR N+RKEDSS +Y STPALSTSSY
Sbjct: 241  EEELSNSEDLRKVTGASPLLLKLKQKNWSRSSSRLLRANNRKEDSSCTYNSTPALSTSSY 300

Query: 301  NRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGC 360
            N Y   NPSTVGSW+GTTTS+ND DDE+DD LD PGRQGCGIPCYW+K+  KHRG C  C
Sbjct: 301  NMYAVRNPSTVGSWDGTTTSVNDGDDELDDNLDLPGRQGCGIPCYWTKKAMKHRGGCRSC 360

Query: 361  CSPSLSDTWRRKGSSILFGSQSIYSRRKSLNS---SNRRLTSGSARGVLPLLTNSADGRV 420
            CSPS SDT RR GSSIL GSQS+Y R    +S   S +++   SA+GVLPLL+   DGR 
Sbjct: 361  CSPSFSDTLRRTGSSILCGSQSVYRRHNRHSSGGYSKQKIACRSAQGVLPLLSYGGDGRG 420

Query: 421  GSSIGTGRSDDELSTNFGELDLEALSRLDGRRWS-SCRSHEGLEIVALNGEVEEGSTPES 480
            GSS+GTG SDDELSTN+GELDLEA SRLDGRRWS S RS +GLE VAL+GE EEGSTPE+
Sbjct: 421  GSSLGTGLSDDELSTNYGELDLEAQSRLDGRRWSTSYRSQDGLEAVALDGEEEEGSTPET 480

Query: 481  TTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAAL 540
              SFSQKYRPMFF ELIGQ+IVVQSL+NA+ R RIAPVYLFQGPRGTGKT+ ARIF+AAL
Sbjct: 481  IRSFSQKYRPMFFEELIGQSIVVQSLMNAVKRSRIAPVYLFQGPRGTGKTSTARIFSAAL 540

Query: 541  NCLAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRY 600
            NC+A EE KPCGYC+EC DFMSGK KD  E+DG N+KG D++RY LK L +        Y
Sbjct: 541  NCVATEEMKPCGYCKECNDFMSGKSKDFWELDGANKKGADKVRYLLKNLPTILPRNSSMY 600

Query: 601  KVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKD 660
            KVF+IDECHLLPSK WL+FLKF E P Q+VVFIFITTDL+++PRTIQSRCQK++F+K+KD
Sbjct: 601  KVFVIDECHLLPSKTWLSFLKFLENPLQKVVFIFITTDLENVPRTIQSRCQKFLFDKLKD 660

Query: 661  CDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELV 720
             D+V RLK+I+++ENLD DL ALDLIAMNADGSLRDAETMLEQLSLLGKRIT +LVNE  
Sbjct: 661  SDIVVRLKKIASDENLDVDLHALDLIAMNADGSLRDAETMLEQLSLLGKRITTALVNE-- 720

Query: 721  STVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASL 780
                       LVG+VSDEKLLELL LA+SS+TAETVKRAREL+D G DP+VLMSQLASL
Sbjct: 721  -----------LVGVVSDEKLLELLELALSSDTAETVKRARELLDLGADPIVLMSQLASL 780

Query: 781  IMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTA 840
            IMDIIAGTY ++D K S + F GR+L+E ++E LKHALK LSEAEKQLRVS++RSTWFTA
Sbjct: 781  IMDIIAGTYKVVDEKYSNAFFDGRNLTEADMEGLKHALKLLSEAEKQLRVSNDRSTWFTA 840

Query: 841  TLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIPKLGSPASLC 900
            TLLQLGS+ S   T TGS+RRQS + TDDDP++ S   + YKQ+       K  SPAS+ 
Sbjct: 841  TLLQLGSMPSPGTTHTGSSRRQSSRATDDDPASVSREVMAYKQRIGGLHFSKSASPASVI 900

Query: 901  NLKNGNYNNQGDLSPMVDSLSNN--PKPTHKQFMEGKNSF-SRDDATLRNMVFRCKNSEK 960
              +NGN++++    P    + NN     +  Q +E + S  S +++    M+   ++SEK
Sbjct: 901  K-RNGNHSHEA--KPFSRVIDNNCYKSSSSSQMIESEGSIASHENSIASTMMLNQRSSEK 960

Query: 961  LDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSIT 1020
            L++IW  CIERCHSKTLRQLLY +GKL+S+SE E  L+AY+AF + DIK RAERFLSSIT
Sbjct: 961  LNDIWRKCIERCHSKTLRQLLYTHGKLISISEVEGILVAYIAFGENDIKLRAERFLSSIT 1020

Query: 1021 NSMEMVLRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKE--RKIANLNAMEGYS 1080
            NS+EMVLR +VEVRIILLP+ E  +           V H+    E   K  +LN + G  
Sbjct: 1021 NSIEMVLRRSVEVRIILLPETELLV-----------VPHQTRKPEMTNKSGHLNNIAG-- 1080

Query: 1081 SRSLILDGTYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAME 1140
                              L +E++ ++  S + R ++PMQRIESIIREQRLETAWLQ  +
Sbjct: 1081 ------------------LNAETDVEVGSSVESRSKLPMQRIESIIREQRLETAWLQTAD 1140

Query: 1141 KGTPGSLSRLKPEKNQVLPQDGSYYK-DQTEEMNSTGDSSRKWDDELNRELKVLKANEEL 1200
            K TPGS+ R+KPE+NQ+LPQ+ +Y + +    ++S+G ++ +W DELN E+K+LK  +  
Sbjct: 1141 KDTPGSIIRVKPERNQILPQEDTYRQTNVASAISSSGLTTHQWVDELNNEVKLLKIGDNG 1200

Query: 1201 IAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNL-GYESSSAVGGCSGLFCWNNSKSHK 1260
              Q+   G R     +SPS+LHD    GN NKDNL GYES S   GC+ LFCWN  K+ +
Sbjct: 1201 ELQENLTGTRGQHCPLSPSLLHDTNF-GN-NKDNLGGYESGSGRVGCNILFCWNTKKTQR 1218

Query: 1261 RGKVRTNHG------RSRSGRFSLFGECGKSRNFGSRSRR 1270
            R K +   G      R+R  RFSLF  C K R      RR
Sbjct: 1261 RSKSKQVKGTPVRSRRNRKSRFSLFNGCAKPRKAEGNIRR 1218
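
The percentages on each HSP summary line are the quoted counts divided by the alignment length. The sketch below recomputes them from the summary line of the STI_ARATH hit as a consistency check. The helper name `hsp_percentages` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling sometimes seen in this report format.

```python
import re

# Matches e.g. "Identity = 730/1300 (56.15%), Positives = 899/1300 (69.15%)".
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (computed identity %, computed positives %, reported identity %, reported positives %)."""
    m = HSP_PATTERN.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    computed_ident = round(100 * int(ident) / int(ident_len), 2)
    computed_pos = round(100 * int(pos) / int(pos_len), 2)
    return computed_ident, computed_pos, float(ident_pct), float(pos_pct)

result = hsp_percentages(
    "Identity = 730/1300 (56.15%), Positives = 899/1300 (69.15%), Query Frame = 1"
)
print(result)
```

For this hit the recomputed values agree with the reported 56.15% identity and 69.15% positives.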

BLAST of Cp4.1LG01g08830 vs. Swiss-Prot
Match: STIL1_ARATH (Protein STICHEL-like 1 OS=Arabidopsis thaliana GN=At1g14460 PE=1 SV=1)

HSP 1 Score: 934.9 bits (2415), Expect = 9.5e-271
Identity = 562/1043 (53.88%), Positives = 706/1043 (67.69%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRK-AARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSL 60
            M+ +R+SDPSKLHLKKELT IRK A++ LRDPGTTSSWKSPL SSR V+          L
Sbjct: 1    MSGLRISDPSKLHLKKELTHIRKVASKGLRDPGTTSSWKSPLTSSRFVVEPPASNNVEIL 60

Query: 61   NKNLESETRRHSGQSQLDAIVPPR---NENRNPKDKKIYLYNWKSHKSSSEKSVIHQKED 120
            + N            QLD+  P       N   K+KK++LYNWK+ ++SSEK+       
Sbjct: 61   SNN------------QLDSQFPSSRVFGNNGKEKEKKVFLYNWKTQRTSSEKT------- 120

Query: 121  RDGNNGTNDGSYSVPGLS----LDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVS 180
                 G ++ S+    L+     DD +SDARNGGDS  +                    +
Sbjct: 121  ----EGEDETSWIQASLNDDDDDDDDVSDARNGGDSCLEE-------------------T 180

Query: 181  YGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQS 240
                + +++  +KKKSK+    LD+    +             H +  +S  +D    +S
Sbjct: 181  RSASMIRKSGFIKKKSKE----LDLSIGRKSTAKARNFPSHHLHVASGLSVVRD----ES 240

Query: 241  DDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLR-NHRKEDSSYS-YSTPALSTS 300
            D+TED+SNSE+F     +SPLLLKL        S+K LR   ++EDSS++  STPALSTS
Sbjct: 241  DETEDFSNSENFPT-KVSSPLLLKLKRKNWSRSSSKFLRGTSKREDSSHTCNSTPALSTS 300

Query: 301  SYNRYVNNNPSTVGSWEGTTTSINDADDEV-DDQLDFPGRQGCGIPCYWSKRTPKHRGVC 360
            SYN Y   NPSTVGSWE       D DDE+ DD LDF GRQGCGIP YW+KR  KHRG C
Sbjct: 301  SYNMYGIRNPSTVGSWE-------DGDDELDDDNLDFKGRQGCGIPFYWTKRNLKHRGGC 360

Query: 361  GGCCSPSLSDTWRRKGSSILFGSQSIYSRRK--SLNSSNRRLTSGSARGVLPLLTNSADG 420
              CCSPS SDT RRKGSSIL GSQS+Y R +  S   + ++L   SA+GVLPLL    D 
Sbjct: 361  RSCCSPSFSDTLRRKGSSILCGSQSVYRRHRHSSGRFNKQKLALRSAKGVLPLLKYGGDS 420

Query: 421  RVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSS-CRSHEGLEIVALNGEVEEGSTP 480
            R GSSIG G SDD+LST+FGE+DLEA SRLDGRRWSS C+S +G        E E GSTP
Sbjct: 421  RGGSSIGIGYSDDDLSTDFGEIDLEAQSRLDGRRWSSCCKSQDG----EREEEEEGGSTP 480

Query: 481  ESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAA 540
            ES  S SQKY+PMFF+ELIGQ+IVVQSL+NA+ +GR+A VYLFQGPRGTGKT+ ARI +A
Sbjct: 481  ESIQSLSQKYKPMFFDELIGQSIVVQSLMNAVKKGRVAHVYLFQGPRGTGKTSTARILSA 540

Query: 541  ALNC-LAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAF 600
            ALNC +  EE KPCGYC+EC+D+M GK +DLLE+D   + G +++RY LKKL + +  + 
Sbjct: 541  ALNCDVVTEEMKPCGYCKECSDYMLGKSRDLLELDAGKKNGAEKVRYLLKKLLTLAPQSS 600

Query: 601  LRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNK 660
             RYKVF+IDECHLLPS+ WL+ LKF E P Q+ VF+ ITTDLD++PRTIQSRCQKYIFNK
Sbjct: 601  QRYKVFVIDECHLLPSRTWLSLLKFLENPLQKFVFVCITTDLDNVPRTIQSRCQKYIFNK 660

Query: 661  IKDCDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVN 720
            ++D D+V RL++I+++ENLD +  ALDLIA+NADGSLRDAETMLEQLSL+GKRIT+ LVN
Sbjct: 661  VRDGDIVVRLRKIASDENLDVESQALDLIALNADGSLRDAETMLEQLSLMGKRITVDLVN 720

Query: 721  ELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQL 780
            E             LVG+VSD+KLLELL LA+SS+TAETVK+AREL+D G DP+++MSQL
Sbjct: 721  E-------------LVGVVSDDKLLELLELALSSDTAETVKKARELLDLGADPILMMSQL 780

Query: 781  ASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTW 840
            ASLIMDIIAG Y  +D K S +    R+L+E ++ERLKHALK LSEAEKQLRVS++RSTW
Sbjct: 781  ASLIMDIIAGAYKALDEKYSEAFLDRRNLTEADLERLKHALKLLSEAEKQLRVSTDRSTW 840

Query: 841  FTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIPKLGSPA 900
            F ATLLQLGS+ S   T TGS+RRQS + T++   + S   I YKQ+S         SP 
Sbjct: 841  FIATLLQLGSMPSPGTTHTGSSRRQSSRATEE---SISREVIAYKQRS-GLQCSNTASPT 900

Query: 901  SLCNLKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKNSF-SRDDATLRNMVFRCKNSE 960
            S+   K+GN   +  LS            +  + +E   S  S DD T   M   C+NSE
Sbjct: 901  SI--RKSGNLVREVKLS-----------SSSSEVLESDTSMASHDDTTASTMTLTCRNSE 951

Query: 961  KLDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSI 1020
            KL++IW+ C++RCHSKTL+QLLYA+GKLLS+SE E  L+AY+AF + +IK+RAERF+SSI
Sbjct: 961  KLNDIWIKCVDRCHSKTLKQLLYAHGKLLSISEVEGILVAYIAFGEGEIKARAERFVSSI 951

Query: 1021 TNSMEMVLRCNVEVRIILLPDGE 1022
            TNS+EMVLR NVEVRIILL + E
Sbjct: 1021 TNSIEMVLRRNVEVRIILLSETE 951

BLAST of Cp4.1LG01g08830 vs. Swiss-Prot
Match: STIL2_ARATH (Protein STICHEL-like 2 OS=Arabidopsis thaliana GN=At4g24790 PE=2 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 4.0e-88
Identity = 194/408 (47.55%), Positives = 264/408 (64.71%), Query Frame = 1

Query: 465 SFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNC 524
           S SQK+RP  F+EL+GQ +VV+ L++ I RGRI  VYLF GPRGTGKT+ ++IFAAALNC
Sbjct: 240 SLSQKFRPKSFDELVGQEVVVKCLLSTILRGRITSVYLFHGPRGTGKTSTSKIFAAALNC 299

Query: 525 LAPE-ENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYK 584
           L+    ++PCG C EC  + SG+ +D++E D         +R  +K  S    S+  R+K
Sbjct: 300 LSQAAHSRPCGLCSECKSYFSGRGRDVMETDSGKLNRPSYLRSLIKSASLPPVSS--RFK 359

Query: 585 VFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDC 644
           VF+IDEC LL  + W T L   +   Q  VFI +T++L+ +PR + SR QKY F+K+ D 
Sbjct: 360 VFIIDECQLLCQETWGTLLNSLDNFSQHSVFILVTSELEKLPRNVLSRSQKYHFSKVCDA 419

Query: 645 DMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVS 704
           D+  +L +I  EE +D D  A+D IA  +DGSLRDAE ML+QLSLLGKRIT SL  +L+ 
Sbjct: 420 DISTKLAKICIEEGIDFDQGAVDFIASKSDGSLRDAEIMLDQLSLLGKRITTSLAYKLI- 479

Query: 705 TVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLI 764
                       G+VSD++LL+LL LAMSS+T+ TV RARELM S +DP+ L+SQLA++I
Sbjct: 480 ------------GVVSDDELLDLLDLAMSSDTSNTVIRARELMRSKIDPMQLISQLANVI 539

Query: 765 MDIIAGTYNIIDPKDSASI----FCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTW 824
           MDIIAG     + ++S+S     F  R  SE E+++L++ALK LS+AEK LR S  ++TW
Sbjct: 540 MDIIAG-----NSQESSSATRLRFLTRHTSEEEMQKLRNALKILSDAEKHLRASKNQTTW 599

Query: 825 FTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKS 868
            T  LLQL +  S  F    + R Q  K  D + S+TS+G  G   KS
Sbjct: 600 LTVALLQLSNTDSSSFATDENGRNQINK--DVELSSTSSGCPGDVIKS 625

BLAST of Cp4.1LG01g08830 vs. Swiss-Prot
Match: STIL4_ARATH (Protein STICHEL-like 4 OS=Arabidopsis thaliana GN=At5g45720 PE=2 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 3.2e-85
Identity = 208/542 (38.38%), Positives = 300/542 (55.35%), Query Frame = 1

Query: 326 CGIPCYWSKRTPKHRG-----VCGGCCSPSLSDTWRRKGSSILFGSQSIYSRRKSLNSSN 385
           CGIP  WS+    HRG     + G   S  +SD+  RKG +       ++S         
Sbjct: 240 CGIPFNWSRI--HHRGKTFLDIAGRSLSCGISDSKGRKGEA----GTPMFSD-------- 299

Query: 386 RRLTSGSARGVLPLLTNSADGRVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSSCR 445
              +S S R  LPLL +SAD           +++ +    GEL + A + L   + S   
Sbjct: 300 ---SSSSDREALPLLVDSAD-----------NEEWVHDYSGELGIFADNLLKNGKDS--- 359

Query: 446 SHEGLEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPV 505
                    + G+           SF+QKY P  F +L+GQN+VVQ+L NAI++ R+  +
Sbjct: 360 ---------VIGKKSSRKNTRWHQSFTQKYAPRTFRDLLGQNLVVQALSNAIAKRRVGLL 419

Query: 506 YLFQGPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEI------D 565
           Y+F GP GTGKT+ AR+FA ALNC + E++KPCG C  C  +  GK + + E+      D
Sbjct: 420 YVFHGPNGTGKTSCARVFARALNCHSTEQSKPCGVCSSCVSYDDGKNRYIREMGPVKSFD 479

Query: 566 GTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVF 625
             N      IR Q K+             V + D+C  + +  W T  K  +  P+RVVF
Sbjct: 480 FENLLDKTNIRQQQKQ-----------QLVLIFDDCDTMSTDCWNTLSKIVDRAPRRVVF 539

Query: 626 IFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNADG 685
           + + + LD +P  I SRCQK+ F K+KD D+++ L+ I+++E +D D DAL L+A  +DG
Sbjct: 540 VLVCSSLDVLPHIIVSRCQKFFFPKLKDVDIIDSLQLIASKEEIDIDKDALKLVASRSDG 599

Query: 686 SLRDAETMLEQLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSN 745
           SLRDAE  LEQLSLLG RI++ LV E+             VG++SDEKL++LL LA+S++
Sbjct: 600 SLRDAEMTLEQLSLLGTRISVPLVQEM-------------VGLISDEKLVDLLDLALSAD 659

Query: 746 TAETVKRARELMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVE 805
           T  TVK  R +M++G++PL LMSQLA++I DI+AG+Y+    +     F  + LS+ ++E
Sbjct: 660 TVNTVKNLRIIMETGLEPLALMSQLATVITDILAGSYDFTKDQCKRKFFRRQPLSKEDME 717

Query: 806 RLKHALKFLSEAEKQLRVSSERSTWFTATLLQLGSISS--LDFTPTGSNRRQSCKTTDDD 855
           +LK ALK LSE+EKQLRVS+++ TW TA LLQL       L  + +          TD D
Sbjct: 720 KLKQALKTLSESEKQLRVSNDKLTWLTAALLQLAPDKQYLLPHSSSADASFNHTPLTDSD 717

BLAST of Cp4.1LG01g08830 vs. Swiss-Prot
Match: STIL3_ARATH (Protein STICHEL-like 3 OS=Arabidopsis thaliana GN=At4g18820 PE=3 SV=1)

HSP 1 Score: 306.2 bits (783), Expect = 1.6e-81
Identity = 211/515 (40.97%), Positives = 293/515 (56.89%), Query Frame = 1

Query: 324 QGCGIPCYWSKRTPKHRGV-----CGGCCSPSLSDT-WRRKG-SSILFGSQSIYSRRKSL 383
           + CGIP  WS+    HRG       G   S  +SD+   RKG ++   GS  +  +    
Sbjct: 298 KACGIPFNWSRI--HHRGKTFLDKAGRSLSCGMSDSKGGRKGETNERNGSDKMMIQSDDD 357

Query: 384 NSSNRRLTSGSARGVLPLLTNSA--DGRVGSSIGT-GRSDDELSTNFGELDLEALSRLDG 443
           +SS      GS    LPLL +S   DG V    G  G   D L  N  + DL +  R  G
Sbjct: 358 SSS----FIGSDGEALPLLVDSGENDGWVHDYSGELGIFADSLLKNDEDSDLASEGR-SG 417

Query: 444 RRWSSCRSHEGLEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAIS 503
            +    +SH       +N         +S T   +KY P  F +L+GQN+VVQ+L NA++
Sbjct: 418 EKKHKKKSH-------VNARHRHRQQHQSLT---EKYTPKTFRDLLGQNLVVQALSNAVA 477

Query: 504 RGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEI 563
           R ++  +Y+F GP GTGKT+ ARIFA ALNC + E+ KPCG C  C     GK  ++ E+
Sbjct: 478 RRKLGLLYVFHGPNGTGKTSCARIFARALNCHSMEQPKPCGTCSSCVSHDMGKSWNIREV 537

Query: 564 DGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEP-PQRV 623
                   ++I   L      SS +    +VF+ D+C  L S  W    K  +   P+ V
Sbjct: 538 GPVGNYDFEKIMDLLDGNVMVSSQS---PRVFIFDDCDTLSSDCWNALSKVVDRAAPRHV 597

Query: 624 VFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNA 683
           VFI + + LD +P  I SRCQK+ F K+KD D+V  L+ I+++E ++ D DAL LIA  +
Sbjct: 598 VFILVCSSLDVLPHVIISRCQKFFFPKLKDADIVYSLQWIASKEEIEIDKDALKLIASRS 657

Query: 684 DGSLRDAETMLEQLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMS 743
           DGSLRDAE  LEQLSLLG+RI++ LV EL             VG+VSDEKL++LL LA+S
Sbjct: 658 DGSLRDAEMTLEQLSLLGQRISVPLVQEL-------------VGLVSDEKLVDLLDLALS 717

Query: 744 SNTAETVKRARELMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETE 803
           ++T  TVK  R +M++ V+PL LMSQLA++I DI+AG+Y+    +     F  + L + +
Sbjct: 718 ADTVNTVKNLRTIMETSVEPLALMSQLATVITDILAGSYDFTKDQHKRKFFRRQPLPKED 777

Query: 804 VERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 828
           +E+L+ ALK LSEAEKQLRVS+++ TW TA LLQL
Sbjct: 778 MEKLRQALKTLSEAEKQLRVSNDKLTWLTAALLQL 779

BLAST of Cp4.1LG01g08830 vs. TrEMBL
Match: A0A0A0L847_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G113330 PE=4 SV=1)

HSP 1 Score: 2131.3 bits (5521), Expect = 0.0e+00
Identity = 1122/1284 (87.38%), Positives = 1175/1284 (91.51%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAA-----VPGGA 60
            MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPL+SSRSV+AA     V GGA
Sbjct: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLSSSRSVMAATATAVVAGGA 60

Query: 61   SSSLNKNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKE 120
            SSSLNKNLE ETRR+SGQSQLDAIVP RNENRNPKDKKIYLYNWKSHKSSSEKS   Q E
Sbjct: 61   SSSLNKNLECETRRYSGQSQLDAIVPLRNENRNPKDKKIYLYNWKSHKSSSEKSATLQNE 120

Query: 121  DRDGNNGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVSYGG 180
            D DGN+  NDGSYSVPG+SLD SLSDARNGGDSKSDTYLGDL SSMVFRCGDANLVSY G
Sbjct: 121  DHDGNDDNNDGSYSVPGVSLDGSLSDARNGGDSKSDTYLGDLYSSMVFRCGDANLVSYSG 180

Query: 181  PLAKRASAVKKKSKKHCSHLDVLSRHRQKGP--VLGRKLLEGHPSLSISFSQDDSIEQSD 240
            P AKR SA KKKSKKHCSHLDVLSRH+QKGP  ++GRKLLEGHPSLSI+FSQDDSIEQSD
Sbjct: 181  PSAKRTSAFKKKSKKHCSHLDVLSRHQQKGPGPLMGRKLLEGHPSLSINFSQDDSIEQSD 240

Query: 241  DTEDYSNSEDFRRYSAASPLLLKL-----HPSAKLLRNHRKEDSSYSYSTPALSTSSYNR 300
            DTEDYSNSEDFRRYSAASPLLLKL     HPS+K LRN RKEDSSYSYSTPALSTSSYNR
Sbjct: 241  DTEDYSNSEDFRRYSAASPLLLKLKHKSFHPSSKFLRNSRKEDSSYSYSTPALSTSSYNR 300

Query: 301  YVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGCCS 360
            YVN NPSTVGSW+GTTTSINDADDEVDD+LDFPGRQGCGIPCYWSKRTPKHRG+CG CCS
Sbjct: 301  YVNRNPSTVGSWDGTTTSINDADDEVDDRLDFPGRQGCGIPCYWSKRTPKHRGICGSCCS 360

Query: 361  PSLSDTWRRKGSSILFGSQSIYSRRKSLNSSNRRLTSGSARGVLPLLTNSADGRVGSSIG 420
            PSLSDT RRKGSSILFGSQSIYSRRKS+NSS RR  SGSARGVLPLLTNSADG VGSSIG
Sbjct: 361  PSLSDTLRRKGSSILFGSQSIYSRRKSINSSKRRFASGSARGVLPLLTNSADGGVGSSIG 420

Query: 421  TGRSDDELSTNFGELDLEALSRLDGRRW-SSCRSHEGLEIVALNGEVEEGSTPESTTSFS 480
            TGRSDDELSTNFGELDLEALSRLDGRRW SSCRSHEGLEIVALNGEVE G TPEST SFS
Sbjct: 421  TGRSDDELSTNFGELDLEALSRLDGRRWSSSCRSHEGLEIVALNGEVEGGGTPESTRSFS 480

Query: 481  QKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAP 540
            QKY+PMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKT AARIFAAALNCLAP
Sbjct: 481  QKYKPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTAAARIFAAALNCLAP 540

Query: 541  EENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLI 600
            EENKPCGYCRECTDFM+GKQKDLLE+DGTN+KGID+IRYQLK LSSG SSAF RYK+FL+
Sbjct: 541  EENKPCGYCRECTDFMAGKQKDLLEVDGTNKKGIDKIRYQLKLLSSGQSSAFFRYKIFLV 600

Query: 601  DECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVE 660
            DECHLLPSKAWL FLK FEEPPQRVVFIFITTDLDS+PRTIQSRCQKY+FNKIKDCDMVE
Sbjct: 601  DECHLLPSKAWLAFLKLFEEPPQRVVFIFITTDLDSVPRTIQSRCQKYLFNKIKDCDMVE 660

Query: 661  RLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVSTVPI 720
            RLKRISA+ENLD DLDALDLIAMNADGSLRDAETMLEQLSLLGKRIT SLVNE       
Sbjct: 661  RLKRISADENLDVDLDALDLIAMNADGSLRDAETMLEQLSLLGKRITTSLVNE------- 720

Query: 721  LSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII 780
                  LVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII
Sbjct: 721  ------LVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII 780

Query: 781  AGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 840
            AGTYNIID KD ASIF GRSLSE EVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL
Sbjct: 781  AGTYNIIDTKDGASIFGGRSLSEAEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 840

Query: 841  GSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLI-PKLGSPASLCNLKN 900
            GSISS DFT TGS+RRQSCKTTDDDPS+TSNGTI YKQKSF+ L+ P LGSP SLCNLKN
Sbjct: 841  GSISSPDFTQTGSSRRQSCKTTDDDPSSTSNGTIAYKQKSFAQLMPPNLGSPTSLCNLKN 900

Query: 901  GNYNNQGDLSPMVDSLSNNPKPTHKQFMEGK-NSFSRDDATLRNMVFRCKNSEKLDNIWV 960
            GNYNNQ D+ PMVD+L  N KPTHKQF+EGK +SFSR+D TLRNMVFR KNSEKL++IWV
Sbjct: 901  GNYNNQADMVPMVDNLIYNSKPTHKQFIEGKDSSFSREDVTLRNMVFRSKNSEKLNSIWV 960

Query: 961  HCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSMEMV 1020
            HCIERCHSKTLRQLLYA+GKLLS+SESE TLIAYVAFED DIKSRAERFLSSITNSMEMV
Sbjct: 961  HCIERCHSKTLRQLLYAHGKLLSISESEGTLIAYVAFEDVDIKSRAERFLSSITNSMEMV 1020

Query: 1021 LRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKERKIANLNAMEGYSSRSLILDG 1080
            LRCNVEVRIILLPDGE S    TAAK SEGVE    DKER+ +NLNAMEGYS+RSL+LD 
Sbjct: 1021 LRCNVEVRIILLPDGEAS----TAAKLSEGVE---PDKERRTSNLNAMEGYSNRSLMLDA 1080

Query: 1081 TYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS 1140
            TYQ+TSDSSQLP+ESN+Q DGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS
Sbjct: 1081 TYQSTSDSSQLPTESNHQNDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS 1140

Query: 1141 RLKPEKNQVLPQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKANEELIAQKEQVGR 1200
            RLKPEKNQVLPQDGSYYKDQ +EMNST DSSRKW+DELNRELKVLK  ++++AQKEQVGR
Sbjct: 1141 RLKPEKNQVLPQDGSYYKDQMDEMNSTEDSSRKWEDELNRELKVLKVGDDILAQKEQVGR 1200

Query: 1201 RVDRYAISPSILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGR 1260
            R DRYAISPSILHDG MVGN+NKDNLGYESSSA GGCSGLFCWN+SK HKR KVR NH R
Sbjct: 1201 RADRYAISPSILHDGSMVGNSNKDNLGYESSSAAGGCSGLFCWNSSKPHKRAKVRANHVR 1260

Query: 1261 SRSGRFSLFGECGKSRNFGSRSRR 1270
            SR+GRFSLFGECGKSRN GSR RR
Sbjct: 1261 SRNGRFSLFGECGKSRNSGSRFRR 1264

BLAST of Cp4.1LG01g08830 vs. TrEMBL
Match: A0A067K8J9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13362 PE=4 SV=1)

HSP 1 Score: 1464.1 bits (3789), Expect = 0.0e+00
Identity = 825/1312 (62.88%), Positives = 989/1312 (75.38%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSV----LAAVPGGAS 60
            M+E+RVSDPS+LHLKKELTQIRKAAR+LRDPGTTSSWKSPL+SSRS     LAA    ++
Sbjct: 1    MSEMRVSDPSRLHLKKELTQIRKAARLLRDPGTTSSWKSPLSSSRSAVAATLAATASTSA 60

Query: 61   SSLNKNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKED 120
            S   + LE+E    +  S LD+    RN N N K+K+++LYNWK+ KSSSEKS + + E 
Sbjct: 61   SVWKQQLENENVIPNN-SHLDSHF--RN-NGNGKEKRVFLYNWKNQKSSSEKSAMAKNEA 120

Query: 121  RDGNNGTNDGSYSVPGL--SLDDSLSDARN-GGDSKSDTYLGDL-CSSMVFRCGDANLVS 180
                    D  Y    +  SLDDSLSDARN G DSKSDTY+G+   SSM+FRC DA+LVS
Sbjct: 121  --------DEDYESRSIQESLDDSLSDARNVGADSKSDTYVGESRSSSMIFRCRDASLVS 180

Query: 181  YGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQS 240
               P  +RA  +KKKSKK  +HLD+LSR++QK   L R+LL+ HPS+++   +DD +EQS
Sbjct: 181  ---PSMRRAMGIKKKSKKTNTHLDILSRYQQKEMNL-RRLLKSHPSMALGLGRDDYVEQS 240

Query: 241  DDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLRNHRKEDSSYSYSTPALSTSSY 300
            DDTE+YSNSED R+ S ASPLL+KL      H  +KLLRN RKEDSS +YSTPALSTSSY
Sbjct: 241  DDTEEYSNSEDLRKISGASPLLIKLKHKNWSHSPSKLLRNSRKEDSSCTYSTPALSTSSY 300

Query: 301  NRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGC 360
            NRY   NPSTVGSW+  TTS+ND DDE DD LD PGRQGCGIPCYWSKRTP+HRG CG C
Sbjct: 301  NRYCIRNPSTVGSWDAATTSLNDGDDEEDDHLDLPGRQGCGIPCYWSKRTPRHRGPCGSC 360

Query: 361  CSPSLSDTWRRKGSSILFGSQSIYSRRK--SLNSSNRRLTSGSARGVLPLLTNSADGRVG 420
            CSPSLSDT RRKG+SIL GSQS+Y RR+  S  S+ RR+TS S +G+LPLL NS D R G
Sbjct: 361  CSPSLSDTIRRKGTSILCGSQSMYHRRRRSSSISNKRRITSRSGQGLLPLLANSED-RGG 420

Query: 421  SSIGTGRSDDELSTNFGELDLEALSRLDGRRWSSCRSHEGLEIVALNGEVEEGSTPESTT 480
            SSI TG SDDELSTNFGELDLEALSRLDGRRWSSCRS +GLEIVALNG+ EE  TPE+  
Sbjct: 421  SSIETGNSDDELSTNFGELDLEALSRLDGRRWSSCRSQDGLEIVALNGDGEEEDTPENIR 480

Query: 481  SFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNC 540
            S SQKY+P+FF+E+IGQNIVVQSLINA+SRGRIAPVYLFQGPRGTGKT+ ARIFA+ALNC
Sbjct: 481  SLSQKYKPLFFSEVIGQNIVVQSLINAVSRGRIAPVYLFQGPRGTGKTSTARIFASALNC 540

Query: 541  LAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKV 600
            ++ EE KPCGYCREC+DF+SGK +DL E+DGTN+KGID++ + LKK+S    +   RYK+
Sbjct: 541  MSTEETKPCGYCRECSDFISGKTRDLWEVDGTNKKGIDKVSHLLKKVSQWPPTGSSRYKI 600

Query: 601  FLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCD 660
            FLIDECHLLPSK WL FLKF EEPPQRVVFIFITTD D++PRT+QSRCQKY+F+KIKD D
Sbjct: 601  FLIDECHLLPSKMWLAFLKFLEEPPQRVVFIFITTDPDNVPRTVQSRCQKYLFSKIKDGD 660

Query: 661  MVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVST 720
            +V RL++ISAEENLD +LDALDLIAMNADGSLRD+ETML+QLSLLGKRIT SLVNE    
Sbjct: 661  IVARLRKISAEENLDVELDALDLIAMNADGSLRDSETMLDQLSLLGKRITTSLVNE---- 720

Query: 721  VPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIM 780
                     LVG+V DEKLLELL L+MSS+TAETVKRAR+LMDSGVDP+VLMSQLASLIM
Sbjct: 721  ---------LVGVVPDEKLLELLELSMSSDTAETVKRARDLMDSGVDPMVLMSQLASLIM 780

Query: 781  DIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATL 840
            DIIAGTYN++D K S S F GRSL+E E+ERLKHALK LSEAEKQLRVSS+RSTWFTATL
Sbjct: 781  DIIAGTYNVVDAKHSNSFFGGRSLTEAELERLKHALKLLSEAEKQLRVSSDRSTWFTATL 840

Query: 841  LQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKS-FSHLIPKLGSPASLCN 900
            LQLGS+ S D T + S+RRQS +TT++DPS+TS     YKQKS   +L  +  SPASL  
Sbjct: 841  LQLGSVPSPDLTQSSSSRRQSSRTTEEDPSSTSREVTIYKQKSDAQYLSRRSSSPASLYK 900

Query: 901  LKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKNS-FSRDDATLRNMVFRCKNSEKLDN 960
              N N                + KP   + M  + S  S DD  +  M+FR +N++KLD+
Sbjct: 901  AINEN-----------SEFGFSSKPLPSRTMHSRTSTASWDDELVETMLFRYRNADKLDH 960

Query: 961  IWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSM 1020
            IW  CI +CHS TLRQLL+A+GKL S+SE E  L+ YVAF D DIK+RAERF+SSITNS+
Sbjct: 961  IWEKCIAKCHSNTLRQLLHAHGKLFSISELEGILVVYVAFGDEDIKARAERFMSSITNSI 1020

Query: 1021 EMVLRCNVEVRIILLPDGETSIN--GMTAAKSSEGVEHELV-DKERKIANLNAMEGYS-- 1080
            EMVLRCNVEVRIIL+PDG  S+N    +  +  +  E  L  ++ERK  + N + GYS  
Sbjct: 1021 EMVLRCNVEVRIILVPDGVDSMNCVNQSELQGQKRAEATLANEQERKENSSNLLNGYSDS 1080

Query: 1081 -SRSLIL--------------------DGTYQATSDSSQLPSESNNQIDGSRDRRQEIPM 1140
               SL L                    +  +Q+T+ S++LP + + +  G R+R+QE+PM
Sbjct: 1081 QQESLKLSRGSFNDLESKLKGGSSNLRESPFQSTALSTELPPDPDAENGGVRERKQELPM 1140

Query: 1141 QRIESIIREQRLETAWLQAMEKGTPGSLSRLKPEKNQVLPQDGSYYKDQTEEMNSTGDSS 1200
            QRIESIIREQRLETAWLQA EKGTPGSLSRLKPEKNQVLPQ+ +Y ++Q E  +S G SS
Sbjct: 1141 QRIESIIREQRLETAWLQAAEKGTPGSLSRLKPEKNQVLPQEDNYRQNQMESASSMGLSS 1200

Query: 1201 RKWDDELNRELKVLKANEELIAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNLGYESS 1260
            + W+DELN ELKVLK  + ++  K+Q+G+R DRY ISPS+LHD  +VG  N +NLGYESS
Sbjct: 1201 QHWEDELNHELKVLKMEDRMVVYKDQIGKRADRYPISPSLLHDNNLVGYPNNENLGYESS 1260

Query: 1261 SAVGGCSGLFCWNNSKSHKRGKVRTNHGRSR--SGRFSLFGECGKSRNFGSR 1267
            SA GGCSGL CWN ++S K GK +    RSR  SGRF+LFGECGK +   +R
Sbjct: 1261 SASGGCSGLLCWNANRSLK-GKAKGTSVRSRHKSGRFTLFGECGKHKKAENR 1270

BLAST of Cp4.1LG01g08830 vs. TrEMBL
Match: A0A0D2VK62_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G090600 PE=4 SV=1)

HSP 1 Score: 1461.0 bits (3781), Expect = 0.0e+00
Identity = 827/1335 (61.95%), Positives = 990/1335 (74.16%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSLN 60
            M+++R+ DPS+LHLKKELTQIRKAARVLRDPGTTSSWKSP+NSSRSV AAV  G  S+  
Sbjct: 1    MSDLRMPDPSRLHLKKELTQIRKAARVLRDPGTTSSWKSPINSSRSVAAAVAAGTGSTST 60

Query: 61   -----KNLESET-RRHSGQSQLDAIVPP-----------RNENRNPKDKKIYLYNWKSHK 120
                  +L SE+  R +G ++LD  + P            N N N KDK+++LYNW+S K
Sbjct: 61   CTASRNHLGSESLSRSNGNARLDLSLLPFRVESNGHGRITNSNGNEKDKRVFLYNWRSQK 120

Query: 121  SSSEKSVIHQKEDRDGNNGTNDGSYS---VPGLSLDDSLSDARNGGDSKSDTYLGDLCS- 180
            SSS        +D D ++G +DG  S   + G   ++SLSDAR  GDSKSDT LG+  S 
Sbjct: 121  SSSVNVDDDGDDDDDFDDG-DDGDQSSSWIQGSVDENSLSDARKCGDSKSDTCLGESRSA 180

Query: 181  SMVFRCGDANLVSYGGPLAKRASAVKKKSKKHCSHLDVLSRHRQK------GPVLGRKLL 240
            SM+FRC DANLVS   P AKR     K SKK+ S+ DV SR+ QK        V  RKLL
Sbjct: 181  SMLFRCRDANLVSLVTPSAKRMLGANKNSKKNGSNFDVFSRYEQKKNGVNRNSVYSRKLL 240

Query: 241  EGHPSLSISFSQDDSIEQSDDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLRNH 300
            + HP+L++S  +DDS++QSDDTEDYSNSEDFR+ S ASPLLLKL      H S++LL+  
Sbjct: 241  KAHPALALSLGRDDSVDQSDDTEDYSNSEDFRKISGASPLLLKLKPKNWPHSSSRLLKAD 300

Query: 301  RKEDSSYSYSTPALSTSSYNRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCG 360
            RKEDSSYSYSTPALSTSSYN+Y N+NPS VGSW+ TTTS+ND DD+VDD LD PGRQGCG
Sbjct: 301  RKEDSSYSYSTPALSTSSYNKYFNHNPSVVGSWDATTTSLNDGDDDVDDPLDLPGRQGCG 360

Query: 361  IPCYWSKRTPKHRGVCGGCCSPSLSDTWRRKGSSILFGSQSIYSR-RKSLNSSNRRLTS- 420
            IPCYW+KRTPKHR VCG C SPSLSDT RRKGSSIL GSQS+Y R R+SL+ SN+R  + 
Sbjct: 361  IPCYWTKRTPKHRVVCGSCYSPSLSDTLRRKGSSILCGSQSMYHRHRRSLSLSNKRKNAL 420

Query: 421  GSARGVLPLLTNSADGRVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSS-CRSHEG 480
             SA+GVLPLL+NSADGR GSSIGT  SDDELSTNFGELDLEALSRLDGRRWSS CRS +G
Sbjct: 421  RSAQGVLPLLSNSADGRGGSSIGTRCSDDELSTNFGELDLEALSRLDGRRWSSSCRSQDG 480

Query: 481  LEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQ 540
            LEIVAL GE EE  TPE+  S SQKY+PMFF+ELIGQNIVVQSL+NA+S+GRIAP YLFQ
Sbjct: 481  LEIVALTGEAEEEGTPENIKSLSQKYKPMFFDELIGQNIVVQSLMNAVSKGRIAPFYLFQ 540

Query: 541  GPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRI 600
            GPRGTGKT+ ARIF+AALNC   +++KPCG C ECT+F+SGK+++  E D TNR+GIDR+
Sbjct: 541  GPRGTGKTSTARIFSAALNCQTTDDDKPCGCCTECTEFISGKRREFWEFDSTNRRGIDRV 600

Query: 601  RYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSI 660
            RY LK LS+G +S+  RYKVF+IDECHLLPSK WL  LKF E+PP R+VFIFITTDLD++
Sbjct: 601  RYLLKSLSTGLASSSSRYKVFVIDECHLLPSKIWLALLKFLEDPPPRLVFIFITTDLDNV 660

Query: 661  PRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLE 720
            PRT+QSRCQKY+FNKIKD D++ RL+++SA+ENL+ + DALDLIA+NADGSLRDAETML+
Sbjct: 661  PRTVQSRCQKYLFNKIKDGDIMARLRKMSADENLEVESDALDLIALNADGSLRDAETMLD 720

Query: 721  QLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARE 780
            QLSLLGKRIT SLVNE             LVG+VSDEKLLELL LAMSS+TAETVKRARE
Sbjct: 721  QLSLLGKRITASLVNE-------------LVGVVSDEKLLELLELAMSSDTAETVKRARE 780

Query: 781  LMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLS 840
            LMDSGVDP+VLMSQLASLIMDIIAGTYNI+D K S S F GR+L+E EVERLK ALK LS
Sbjct: 781  LMDSGVDPMVLMSQLASLIMDIIAGTYNIVDSKYSHSFFGGRALTEAEVERLKDALKLLS 840

Query: 841  EAEKQLRVSSERSTWFTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYK 900
            EAEKQLRVSSERSTWFTATLLQLGS+ S D + +GS+RRQS KT +DD  +TS   I YK
Sbjct: 841  EAEKQLRVSSERSTWFTATLLQLGSLPSPDLSQSGSSRRQSSKTIEDDLQSTSREAIAYK 900

Query: 901  QKSFSHLIPKLGSPASLCNLKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGK-NSFSRD 960
             KS +  +P   + ASL    NGN   QG+L   +D   +N K +H ++++G     + D
Sbjct: 901  PKSGTQCMPWKSTSASLQKSVNGNSTRQGELVSRIDGYGSNSKTSHGRYLDGSATPAACD 960

Query: 961  DATLRNMVFRCKNSEKLDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFE 1020
            ++   NM+  C+NSEKLD+IW  CI +CHSKTLRQLL A+GKLLSL+E E  LIAY+AF 
Sbjct: 961  NSQNGNMILACRNSEKLDDIWAKCINKCHSKTLRQLLLAHGKLLSLAEDEGVLIAYLAFA 1020

Query: 1021 DADIKSRAERFLSSITNSMEMVLRCNVEVRIILLPDGETSINGMTAAKSSEGVEH----E 1080
            D DIKSRAERFLSSITNS+E+V+R NVEVRIILL D   S+N    A+  E ++      
Sbjct: 1021 DGDIKSRAERFLSSITNSIEIVMRRNVEVRIILLADVGISLNLANPAEMLESLQQVEAVA 1080

Query: 1081 LVDKERKIANLNAMEGYSSRSL-------------ILDGTYQATSDSS-----------Q 1140
             +  ERK    N ++G SS  L              L+G  +   D S           +
Sbjct: 1081 GIGSERKAIPKNVLDGISSLDLHQESRKVSKGSFSDLEGKLRGVQDYSNYSSQSIVRTPE 1140

Query: 1141 LPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLSRLKPEKNQVL 1200
            L +E  + ID S++ RQEIPMQRIESIIREQRLETAWLQA EKGTPGSLSRLKPEKNQVL
Sbjct: 1141 LLAEGKDDIDSSKESRQEIPMQRIESIIREQRLETAWLQAAEKGTPGSLSRLKPEKNQVL 1200

Query: 1201 PQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKANEELIAQKEQVGRRVDRYAISPS 1260
            PQ+  Y +     M+S+  SS++WDDELNRELK+LK N+    QK+Q+GRR D Y +SPS
Sbjct: 1201 PQE-VYRQSNLGSMDSSAFSSQQWDDELNRELKILKTNDGQEIQKDQLGRRADHYPMSPS 1260

Query: 1261 ILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGRS-RSGRFSLF 1270
            +LH+     N +K+NLGYES S  GGCSGLFCWNNSK  +R K +    RS R+ RFSLF
Sbjct: 1261 LLHN----SNLSKENLGYESGSGTGGCSGLFCWNNSKPRRRAKAKGTPVRSCRTRRFSLF 1316

BLAST of Cp4.1LG01g08830 vs. TrEMBL
Match: A0A0D2UR74_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G090600 PE=4 SV=1)

HSP 1 Score: 1459.1 bits (3776), Expect = 0.0e+00
Identity = 826/1334 (61.92%), Positives = 989/1334 (74.14%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSLN 60
            M+++R+ DPS+LHLKKELTQIRKAARVLRDPGTTSSWKSP+NSSRSV AAV  G  S+  
Sbjct: 1    MSDLRMPDPSRLHLKKELTQIRKAARVLRDPGTTSSWKSPINSSRSVAAAVAAGTGSTST 60

Query: 61   -----KNLESET-RRHSGQSQLDAIVPP-----------RNENRNPKDKKIYLYNWKSHK 120
                  +L SE+  R +G ++LD  + P            N N N KDK+++LYNW+S K
Sbjct: 61   CTASRNHLGSESLSRSNGNARLDLSLLPFRVESNGHGRITNSNGNEKDKRVFLYNWRSQK 120

Query: 121  SSSEKSVIHQKEDRDGNNGTNDGSYS---VPGLSLDDSLSDARNGGDSKSDTYLGDLCS- 180
            SSS        +D D ++G +DG  S   + G   ++SLSDAR  GDSKSDT LG+  S 
Sbjct: 121  SSSVNVDDDGDDDDDFDDG-DDGDQSSSWIQGSVDENSLSDARKCGDSKSDTCLGESRSA 180

Query: 181  SMVFRCGDANLVSYGGPLAKRASAVKKKSKKHCSHLDVLSRHRQK------GPVLGRKLL 240
            SM+FRC DANLVS   P AKR     K SKK+ S+ DV SR+ QK        V  RKLL
Sbjct: 181  SMLFRCRDANLVSLVTPSAKRMLGANKNSKKNGSNFDVFSRYEQKKNGVNRNSVYSRKLL 240

Query: 241  EGHPSLSISFSQDDSIEQSDDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLRNH 300
            + HP+L++S  +DDS++QSDDTEDYSNSEDFR+ S ASPLLLKL      H S++LL+  
Sbjct: 241  KAHPALALSLGRDDSVDQSDDTEDYSNSEDFRKISGASPLLLKLKPKNWPHSSSRLLKAD 300

Query: 301  RKEDSSYSYSTPALSTSSYNRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCG 360
            RKEDSSYSYSTPALSTSSYN+Y N+NPS VGSW+ TTTS+ND DD+VDD LD PGRQGCG
Sbjct: 301  RKEDSSYSYSTPALSTSSYNKYFNHNPSVVGSWDATTTSLNDGDDDVDDPLDLPGRQGCG 360

Query: 361  IPCYWSKRTPKHRGVCGGCCSPSLSDTWRRKGSSILFGSQSIYSR-RKSLNSSNRRLTS- 420
            IPCYW+KRTPKHR VCG C SPSLSDT RRKGSSIL GSQS+Y R R+SL+ SN+R  + 
Sbjct: 361  IPCYWTKRTPKHRVVCGSCYSPSLSDTLRRKGSSILCGSQSMYHRHRRSLSLSNKRKNAL 420

Query: 421  GSARGVLPLLTNSADGRVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSS-CRSHEG 480
             SA+GVLPLL+NSADGR GSSIGT  SDDELSTNFGELDLEALSRLDGRRWSS CRS +G
Sbjct: 421  RSAQGVLPLLSNSADGRGGSSIGTRCSDDELSTNFGELDLEALSRLDGRRWSSSCRSQDG 480

Query: 481  LEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQ 540
            LEIVAL GE EE  TPE+  S SQKY+PMFF+ELIGQNIVVQSL+NA+S+GRIAP YLFQ
Sbjct: 481  LEIVALTGEAEEEGTPENIKSLSQKYKPMFFDELIGQNIVVQSLMNAVSKGRIAPFYLFQ 540

Query: 541  GPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRI 600
            GPRGTGKT+ ARIF+AALNC   +++KPCG C ECT+F+SGK+++  E D TNR+GIDR+
Sbjct: 541  GPRGTGKTSTARIFSAALNCQTTDDDKPCGCCTECTEFISGKRREFWEFDSTNRRGIDRV 600

Query: 601  RYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSI 660
            RY LK LS+G +S+  RYKVF+IDECHLLPSK WL  LKF E+PP R+VFIFITTDLD++
Sbjct: 601  RYLLKSLSTGLASSSSRYKVFVIDECHLLPSKIWLALLKFLEDPPPRLVFIFITTDLDNV 660

Query: 661  PRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLE 720
            PRT+QSRCQKY+FNKIKD D++ RL+++SA+ENL+ + DALDLIA+NADGSLRDAETML+
Sbjct: 661  PRTVQSRCQKYLFNKIKDGDIMARLRKMSADENLEVESDALDLIALNADGSLRDAETMLD 720

Query: 721  QLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARE 780
            QLSLLGKRIT SLVNE             LVG+VSDEKLLELL LAMSS+TAETVKRARE
Sbjct: 721  QLSLLGKRITASLVNE-------------LVGVVSDEKLLELLELAMSSDTAETVKRARE 780

Query: 781  LMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLS 840
            LMDSGVDP+VLMSQLASLIMDIIAGTYNI+D K S S F GR+L+E EVERLK ALK LS
Sbjct: 781  LMDSGVDPMVLMSQLASLIMDIIAGTYNIVDSKYSHSFFGGRALTEAEVERLKDALKLLS 840

Query: 841  EAEKQLRVSSERSTWFTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYK 900
            EAEKQLRVSSERSTWFTATLLQLGS+ S D + +GS+RRQS KT +DD  +TS   I YK
Sbjct: 841  EAEKQLRVSSERSTWFTATLLQLGSLPSPDLSQSGSSRRQSSKTIEDDLQSTSREAIAYK 900

Query: 901  QKSFSHLIPKLGSPASLCNLKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGK-NSFSRD 960
             KS +  +P   + ASL    NGN   QG+L   +D   +N K +H ++++G     + D
Sbjct: 901  PKSGTQCMPWKSTSASLQKSVNGNSTRQGELVSRIDGYGSNSKTSHGRYLDGSATPAACD 960

Query: 961  DATLRNMVFRCKNSEKLDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFE 1020
            ++   NM+  C+NSEKLD+IW  CI +CHSKTLRQLL A+GKLLSL+E E  LIAY+AF 
Sbjct: 961  NSQNGNMILACRNSEKLDDIWAKCINKCHSKTLRQLLLAHGKLLSLAEDEGVLIAYLAFA 1020

Query: 1021 DADIKSRAERFLSSITNSMEMVLRCNVEVRIILLPDGETSINGMTAAKSSEGVEH----E 1080
            D DIKSRAERFLSSITNS+E+V+R NVEVRIILL D   S+N    A+  E ++      
Sbjct: 1021 DGDIKSRAERFLSSITNSIEIVMRRNVEVRIILLADVGISLNLANPAEMLESLQQVEAVA 1080

Query: 1081 LVDKERKIANLNAMEGYSSRSL-------------ILDGTYQATSDSS-----------Q 1140
             +  ERK    N ++G SS  L              L+G  +   D S           +
Sbjct: 1081 GIGSERKAIPKNVLDGISSLDLHQESRKVSKGSFSDLEGKLRGVQDYSNYSSQSIVRTPE 1140

Query: 1141 LPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLSRLKPEKNQVL 1200
            L +E  + ID S++ RQEIPMQRIESIIREQRLETAWLQA EKGTPGSLSRLKPEKNQVL
Sbjct: 1141 LLAEGKDDIDSSKESRQEIPMQRIESIIREQRLETAWLQAAEKGTPGSLSRLKPEKNQVL 1200

Query: 1201 PQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKANEELIAQKEQVGRRVDRYAISPS 1260
            PQ+  Y +     M+S+  SS++WDDELNRELK+LK N+    QK+Q+GRR D Y +SPS
Sbjct: 1201 PQE-VYRQSNLGSMDSSAFSSQQWDDELNRELKILKTNDGQEIQKDQLGRRADHYPMSPS 1260

Query: 1261 ILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGRS-RSGRFSLF 1269
            +LH+     N +K+NLGYES S  GGCSGLFCWNNSK  +R K +    RS R+ RFSLF
Sbjct: 1261 LLHN----SNLSKENLGYESGSGTGGCSGLFCWNNSKPRRRAKAKGTPVRSCRTRRFSLF 1315

BLAST of Cp4.1LG01g08830 vs. TrEMBL
Match: A0A067DQP8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000818mg PE=4 SV=1)

HSP 1 Score: 1442.9 bits (3734), Expect = 0.0e+00
Identity = 810/1307 (61.97%), Positives = 982/1307 (75.13%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSLN 60
            MAE+R     +L LKKELTQIRKAAR LRDPGTTSSWKSPL+SSRS+ AAV   ++S   
Sbjct: 1    MAEMR----GRLQLKKELTQIRKAAR-LRDPGTTSSWKSPLSSSRSLAAAVAAASASGSA 60

Query: 61   KNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKEDRD-- 120
              + +  ++   +   D  V   N N N K+K+++L NWK+ KSSSE S + + +D D  
Sbjct: 61   WKINNNNKQLVDE---DNNVSINNGNVNGKEKRVFLCNWKNQKSSSETSAVARNDDDDID 120

Query: 121  GNNGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVSYGGPLA 180
             ++  ++GS SV   S+DDSLSDARNGGDSKSDTYLG+  +S +FRC DANLVS   P  
Sbjct: 121  VDDDEDEGSSSVIE-SVDDSLSDARNGGDSKSDTYLGENRASSIFRCRDANLVSVATPAM 180

Query: 181  KRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQSDDTEDY 240
            KRA A K+KSK+H +  D L+R++QK  +L R       S ++   +D+S+EQSDDTEDY
Sbjct: 181  KRAMAAKRKSKRHKTLSDSLTRYQQKQIILARN------SAALGLGRDESVEQSDDTEDY 240

Query: 241  SNSEDFRRYSAASPLLLKL------HPSAKLLRNHRKEDSSYSYSTPALSTSSYNRYVNN 300
             NSEDFR+YS ASPLLLKL      H S+KLL+  RKEDSSYSYSTPALSTSSYNRYVN 
Sbjct: 241  CNSEDFRKYSGASPLLLKLKHKNWSHSSSKLLKGGRKEDSSYSYSTPALSTSSYNRYVNR 300

Query: 301  NPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGCCSPSLS 360
            NPST+GSW+ TT S+ND DD +DD LD PGRQGCGIPCYWSKRTPKHRGVCG CCSPSLS
Sbjct: 301  NPSTIGSWDATTASLNDNDDAMDDHLDLPGRQGCGIPCYWSKRTPKHRGVCGSCCSPSLS 360

Query: 361  DTWRRKGSSILFGSQSIYS--RRKSLNSSNRRLTSGSARGVLPLLTNSADGRVGSSIGTG 420
            DT RRKGSSIL GSQ++Y   RR S  S+ RR+ S SA+GVLPLL N+ DGR GSSIGTG
Sbjct: 361  DTLRRKGSSILCGSQTMYHGRRRSSSVSNKRRMASRSAQGVLPLLANNGDGRAGSSIGTG 420

Query: 421  RSDDELSTNFGELDLEALSRLDGRRWSS-CRSHEGLEIVALNGEVEEGSTPESTTSFSQK 480
            RSDDELSTNFGELDLEALSRLDGRRWSS CRS +GLEIVALNGE EEG   E+  S SQK
Sbjct: 421  RSDDELSTNFGELDLEALSRLDGRRWSSSCRSQDGLEIVALNGEEEEGVL-ENIRSLSQK 480

Query: 481  YRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAPEE 540
            Y+P+FF+ELIGQNIVVQSL+N ISRGRIAPVYLFQGPRGTGKT+ A+IF+AALNC+A ++
Sbjct: 481  YKPIFFDELIGQNIVVQSLVNTISRGRIAPVYLFQGPRGTGKTSTAKIFSAALNCVATDQ 540

Query: 541  NKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLIDE 600
             KPCGYCREC DF+SGK ++ +E+DGTN+KG+DR+RY LK LS+G  SA  R+KVF+IDE
Sbjct: 541  TKPCGYCRECNDFISGKSRNFMEVDGTNKKGLDRVRYILKHLSAGLPSASPRFKVFVIDE 600

Query: 601  CHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVERL 660
            CHLLPSK WL FLKF EEPPQRVVFIFITTD+D++PR+IQSRCQKY+FNKIKD D+V RL
Sbjct: 601  CHLLPSKTWLAFLKFLEEPPQRVVFIFITTDIDNVPRSIQSRCQKYLFNKIKDGDIVARL 660

Query: 661  KRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVSTVPILS 720
            ++ISAEENL+ + DALDLIA+NADGSLRDAETML+QLSLLGKRIT SLVNE         
Sbjct: 661  RKISAEENLNVEPDALDLIALNADGSLRDAETMLDQLSLLGKRITSSLVNE--------- 720

Query: 721  LSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDIIAG 780
                LVG+VS+EKLLELL LAMSS+TAETVKRARELMDSGVDP+VLMSQLASLIMDIIAG
Sbjct: 721  ----LVGVVSEEKLLELLELAMSSDTAETVKRARELMDSGVDPMVLMSQLASLIMDIIAG 780

Query: 781  TYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQLGS 840
            TY I           GRSL+E E+ERLKHALK LSEAEKQLR+SSER TWFTATLLQLGS
Sbjct: 781  TYTI----------GGRSLTEAELERLKHALKLLSEAEKQLRLSSERCTWFTATLLQLGS 840

Query: 841  ISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIPKLG-SPASLCNLKNGN 900
            + S D T +GS+RRQS +TT++DPS+TS   + YK+ S    +P+   SPASL    NGN
Sbjct: 841  MHSPDLTQSGSSRRQSSRTTEEDPSSTSREAVVYKRMSGPQYMPQNAVSPASLREPVNGN 900

Query: 901  YNNQGDLSPMVDSLSNNPKPTHKQFME-GKNSFSRDDATLRNMVFRCKNSEKLDNIWVHC 960
              + G++   +D  ++  KP+H +  + G  + S++   + N +  C+NSEKL  IW  C
Sbjct: 901  SRHLGEVLSRIDGHNSYSKPSHSRLKDAGALAVSQNGNIVGNTIITCRNSEKLGEIWAQC 960

Query: 961  IERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSMEMVLR 1020
            IERCHSKTL+QLL  +GKLLS+SE E  LIAYVAF D DIKSRAERFLSSITNS+E VLR
Sbjct: 961  IERCHSKTLKQLLQVHGKLLSISEVERVLIAYVAFGDGDIKSRAERFLSSITNSIETVLR 1020

Query: 1021 CNVEVRIILLPDGETSINGMTAAKSSEGVEH----ELVDKERKIANLNAMEGYS------ 1080
             NVEVRIILLPDGE SI+   + +  +G++       +++E K    NA + YS      
Sbjct: 1021 RNVEVRIILLPDGEASIHHGISNELPKGLKKTETTAAIEREGKALCSNANDNYSDSDSQQ 1080

Query: 1081 ---------SRSLI--LDGTYQATSDSSQ---LPSESNNQIDGSRDRRQEIPMQRIESII 1140
                     SR     L+G ++   D S    L ++ N++I  ++ RRQEIPMQRIESII
Sbjct: 1081 IPVNVARKVSRGSFNELEGKFKGEDDHSNCSPLFADGNSEISSTKGRRQEIPMQRIESII 1140

Query: 1141 REQRLETAWLQAMEKGTPGSLSRLKPEKNQVLPQDGSYYKDQTEEMNSTGDSSRKWDDEL 1200
            REQRLETAWLQA EKG PGSL  L+PEKNQVLPQ+  Y ++  E + S+G SS++W+DEL
Sbjct: 1141 REQRLETAWLQATEKGAPGSLGHLRPEKNQVLPQEDIYRQNHMESLLSSGLSSQQWEDEL 1200

Query: 1201 NRELKVLKANEELIAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNLGYESSSAVGGCS 1260
            N+ELK+LK NE+ + +K++ G++ + Y I PS+LHD   +GN +K+N GYES S  GGCS
Sbjct: 1201 NQELKILKLNEDRVLKKDENGKKGENYPILPSLLHDSSFMGNFSKENQGYESGSQAGGCS 1260

Query: 1261 GLFCWNNSKSHKRGKVRTNHGRSR-SGRFSLFGECGKSRNFGSRSRR 1270
            GLFCWNN+K HK+GKV+    RSR  G FSLF +C K++   SR RR
Sbjct: 1261 GLFCWNNTKPHKKGKVKGTPVRSRKGGHFSLFVDCTKAKKSESRLRR 1268

BLAST of Cp4.1LG01g08830 vs. TAIR10
Match: AT2G02480.1 (AT2G02480.1 AAA-type ATPase family protein)

HSP 1 Score: 1234.6 bits (3193), Expect = 0.0e+00
Identity = 730/1300 (56.15%), Positives = 899/1300 (69.15%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSLN 60
            M+  RVSD SKLHLKKELTQIRKA RVLRDPGTTSSWKSPL+SSRSV             
Sbjct: 1    MSGSRVSDLSKLHLKKELTQIRKAGRVLRDPGTTSSWKSPLDSSRSVAL----------- 60

Query: 61   KNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKEDRDGN 120
              LE+   R+ G S    I    + NR  K+KK++LYNWK+ KSSSEKS + +    +  
Sbjct: 61   --LETPASRNGGSSSQFPIRGESSTNRRGKEKKVFLYNWKTQKSSSEKSGLAKNGKEEEE 120

Query: 121  NGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCS-SMVFRCGDANLVSYGGPLAK 180
               +  S++   ++ DD +SDARNGGDS    Y  ++ S SM FRC D NL S G    +
Sbjct: 121  EEEDASSWTQASVNDDDDVSDARNGGDS----YRREIQSASMGFRCRDTNLASQGVSKMR 180

Query: 181  RAS--AVKKKSKKHCS--HLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQSDDT 240
            +++  + KKKSKK  S   LD LS+++ +  ++ R    G                SDDT
Sbjct: 181  KSNVGSCKKKSKKKISSSRLDCLSKYQPRDDIVARNCNAG----------------SDDT 240

Query: 241  ED-YSNSEDFRRYSAASPLLLKL------HPSAKLLR-NHRKEDSSYSY-STPALSTSSY 300
            E+  SNSED R+ + ASPLLLKL        S++LLR N+RKEDSS +Y STPALSTSSY
Sbjct: 241  EEELSNSEDLRKVTGASPLLLKLKQKNWSRSSSRLLRANNRKEDSSCTYNSTPALSTSSY 300

Query: 301  NRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGC 360
            N Y   NPSTVGSW+GTTTS+ND DDE+DD LD PGRQGCGIPCYW+K+  KHRG C  C
Sbjct: 301  NMYAVRNPSTVGSWDGTTTSVNDGDDELDDNLDLPGRQGCGIPCYWTKKAMKHRGGCRSC 360

Query: 361  CSPSLSDTWRRKGSSILFGSQSIYSRRKSLNS---SNRRLTSGSARGVLPLLTNSADGRV 420
            CSPS SDT RR GSSIL GSQS+Y R    +S   S +++   SA+GVLPLL+   DGR 
Sbjct: 361  CSPSFSDTLRRTGSSILCGSQSVYRRHNRHSSGGYSKQKIACRSAQGVLPLLSYGGDGRG 420

Query: 421  GSSIGTGRSDDELSTNFGELDLEALSRLDGRRWS-SCRSHEGLEIVALNGEVEEGSTPES 480
            GSS+GTG SDDELSTN+GELDLEA SRLDGRRWS S RS +GLE VAL+GE EEGSTPE+
Sbjct: 421  GSSLGTGLSDDELSTNYGELDLEAQSRLDGRRWSTSYRSQDGLEAVALDGEEEEGSTPET 480

Query: 481  TTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAAL 540
              SFSQKYRPMFF ELIGQ+IVVQSL+NA+ R RIAPVYLFQGPRGTGKT+ ARIF+AAL
Sbjct: 481  IRSFSQKYRPMFFEELIGQSIVVQSLMNAVKRSRIAPVYLFQGPRGTGKTSTARIFSAAL 540

Query: 541  NCLAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRY 600
            NC+A EE KPCGYC+EC DFMSGK KD  E+DG N+KG D++RY LK L +        Y
Sbjct: 541  NCVATEEMKPCGYCKECNDFMSGKSKDFWELDGANKKGADKVRYLLKNLPTILPRNSSMY 600

Query: 601  KVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKD 660
            KVF+IDECHLLPSK WL+FLKF E P Q+VVFIFITTDL+++PRTIQSRCQK++F+K+KD
Sbjct: 601  KVFVIDECHLLPSKTWLSFLKFLENPLQKVVFIFITTDLENVPRTIQSRCQKFLFDKLKD 660

Query: 661  CDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELV 720
             D+V RLK+I+++ENLD DL ALDLIAMNADGSLRDAETMLEQLSLLGKRIT +LVNE  
Sbjct: 661  SDIVVRLKKIASDENLDVDLHALDLIAMNADGSLRDAETMLEQLSLLGKRITTALVNE-- 720

Query: 721  STVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASL 780
                       LVG+VSDEKLLELL LA+SS+TAETVKRAREL+D G DP+VLMSQLASL
Sbjct: 721  -----------LVGVVSDEKLLELLELALSSDTAETVKRARELLDLGADPIVLMSQLASL 780

Query: 781  IMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTA 840
            IMDIIAGTY ++D K S + F GR+L+E ++E LKHALK LSEAEKQLRVS++RSTWFTA
Sbjct: 781  IMDIIAGTYKVVDEKYSNAFFDGRNLTEADMEGLKHALKLLSEAEKQLRVSNDRSTWFTA 840

Query: 841  TLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIPKLGSPASLC 900
            TLLQLGS+ S   T TGS+RRQS + TDDDP++ S   + YKQ+       K  SPAS+ 
Sbjct: 841  TLLQLGSMPSPGTTHTGSSRRQSSRATDDDPASVSREVMAYKQRIGGLHFSKSASPASVI 900

Query: 901  NLKNGNYNNQGDLSPMVDSLSNN--PKPTHKQFMEGKNSF-SRDDATLRNMVFRCKNSEK 960
              +NGN++++    P    + NN     +  Q +E + S  S +++    M+   ++SEK
Sbjct: 901  K-RNGNHSHEA--KPFSRVIDNNCYKSSSSSQMIESEGSIASHENSIASTMMLNQRSSEK 960

Query: 961  LDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSIT 1020
            L++IW  CIERCHSKTLRQLLY +GKL+S+SE E  L+AY+AF + DIK RAERFLSSIT
Sbjct: 961  LNDIWRKCIERCHSKTLRQLLYTHGKLISISEVEGILVAYIAFGENDIKLRAERFLSSIT 1020

Query: 1021 NSMEMVLRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKE--RKIANLNAMEGYS 1080
            NS+EMVLR +VEVRIILLP+ E  +           V H+    E   K  +LN + G  
Sbjct: 1021 NSIEMVLRRSVEVRIILLPETELLV-----------VPHQTRKPEMTNKSGHLNNIAG-- 1080

Query: 1081 SRSLILDGTYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAME 1140
                              L +E++ ++  S + R ++PMQRIESIIREQRLETAWLQ  +
Sbjct: 1081 ------------------LNAETDVEVGSSVESRSKLPMQRIESIIREQRLETAWLQTAD 1140

Query: 1141 KGTPGSLSRLKPEKNQVLPQDGSYYK-DQTEEMNSTGDSSRKWDDELNRELKVLKANEEL 1200
            K TPGS+ R+KPE+NQ+LPQ+ +Y + +    ++S+G ++ +W DELN E+K+LK  +  
Sbjct: 1141 KDTPGSIIRVKPERNQILPQEDTYRQTNVASAISSSGLTTHQWVDELNNEVKLLKIGDNG 1200

Query: 1201 IAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNL-GYESSSAVGGCSGLFCWNNSKSHK 1260
              Q+   G R     +SPS+LHD    GN NKDNL GYES S   GC+ LFCWN  K+ +
Sbjct: 1201 ELQENLTGTRGQHCPLSPSLLHDTNF-GN-NKDNLGGYESGSGRVGCNILFCWNTKKTQR 1218

Query: 1261 RGKVRTNHG------RSRSGRFSLFGECGKSRNFGSRSRR 1270
            R K +   G      R+R  RFSLF  C K R      RR
Sbjct: 1261 RSKSKQVKGTPVRSRRNRKSRFSLFNGCAKPRKAEGNIRR 1218

BLAST of Cp4.1LG01g08830 vs. TAIR10
Match: AT1G14460.1 (AT1G14460.1 AAA-type ATPase family protein)

HSP 1 Score: 934.9 bits (2415), Expect = 5.3e-272
Identity = 562/1043 (53.88%), Positives = 706/1043 (67.69%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRK-AARVLRDPGTTSSWKSPLNSSRSVLAAVPGGASSSL 60
            M+ +R+SDPSKLHLKKELT IRK A++ LRDPGTTSSWKSPL SSR V+          L
Sbjct: 1    MSGLRISDPSKLHLKKELTHIRKVASKGLRDPGTTSSWKSPLTSSRFVVEPPASNNVEIL 60

Query: 61   NKNLESETRRHSGQSQLDAIVPPR---NENRNPKDKKIYLYNWKSHKSSSEKSVIHQKED 120
            + N            QLD+  P       N   K+KK++LYNWK+ ++SSEK+       
Sbjct: 61   SNN------------QLDSQFPSSRVFGNNGKEKEKKVFLYNWKTQRTSSEKT------- 120

Query: 121  RDGNNGTNDGSYSVPGLS----LDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVS 180
                 G ++ S+    L+     DD +SDARNGGDS  +                    +
Sbjct: 121  ----EGEDETSWIQASLNDDDDDDDDVSDARNGGDSCLEE-------------------T 180

Query: 181  YGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQS 240
                + +++  +KKKSK+    LD+    +             H +  +S  +D    +S
Sbjct: 181  RSASMIRKSGFIKKKSKE----LDLSIGRKSTAKARNFPSHHLHVASGLSVVRD----ES 240

Query: 241  DDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLR-NHRKEDSSYS-YSTPALSTS 300
            D+TED+SNSE+F     +SPLLLKL        S+K LR   ++EDSS++  STPALSTS
Sbjct: 241  DETEDFSNSENFPT-KVSSPLLLKLKRKNWSRSSSKFLRGTSKREDSSHTCNSTPALSTS 300

Query: 301  SYNRYVNNNPSTVGSWEGTTTSINDADDEV-DDQLDFPGRQGCGIPCYWSKRTPKHRGVC 360
            SYN Y   NPSTVGSWE       D DDE+ DD LDF GRQGCGIP YW+KR  KHRG C
Sbjct: 301  SYNMYGIRNPSTVGSWE-------DGDDELDDDNLDFKGRQGCGIPFYWTKRNLKHRGGC 360

Query: 361  GGCCSPSLSDTWRRKGSSILFGSQSIYSRRK--SLNSSNRRLTSGSARGVLPLLTNSADG 420
              CCSPS SDT RRKGSSIL GSQS+Y R +  S   + ++L   SA+GVLPLL    D 
Sbjct: 361  RSCCSPSFSDTLRRKGSSILCGSQSVYRRHRHSSGRFNKQKLALRSAKGVLPLLKYGGDS 420

Query: 421  RVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSS-CRSHEGLEIVALNGEVEEGSTP 480
            R GSSIG G SDD+LST+FGE+DLEA SRLDGRRWSS C+S +G        E E GSTP
Sbjct: 421  RGGSSIGIGYSDDDLSTDFGEIDLEAQSRLDGRRWSSCCKSQDG----EREEEEEGGSTP 480

Query: 481  ESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAA 540
            ES  S SQKY+PMFF+ELIGQ+IVVQSL+NA+ +GR+A VYLFQGPRGTGKT+ ARI +A
Sbjct: 481  ESIQSLSQKYKPMFFDELIGQSIVVQSLMNAVKKGRVAHVYLFQGPRGTGKTSTARILSA 540

Query: 541  ALNC-LAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAF 600
            ALNC +  EE KPCGYC+EC+D+M GK +DLLE+D   + G +++RY LKKL + +  + 
Sbjct: 541  ALNCDVVTEEMKPCGYCKECSDYMLGKSRDLLELDAGKKNGAEKVRYLLKKLLTLAPQSS 600

Query: 601  LRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNK 660
             RYKVF+IDECHLLPS+ WL+ LKF E P Q+ VF+ ITTDLD++PRTIQSRCQKYIFNK
Sbjct: 601  QRYKVFVIDECHLLPSRTWLSLLKFLENPLQKFVFVCITTDLDNVPRTIQSRCQKYIFNK 660

Query: 661  IKDCDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVN 720
            ++D D+V RL++I+++ENLD +  ALDLIA+NADGSLRDAETMLEQLSL+GKRIT+ LVN
Sbjct: 661  VRDGDIVVRLRKIASDENLDVESQALDLIALNADGSLRDAETMLEQLSLMGKRITVDLVN 720

Query: 721  ELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQL 780
            E             LVG+VSD+KLLELL LA+SS+TAETVK+AREL+D G DP+++MSQL
Sbjct: 721  E-------------LVGVVSDDKLLELLELALSSDTAETVKKARELLDLGADPILMMSQL 780

Query: 781  ASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTW 840
            ASLIMDIIAG Y  +D K S +    R+L+E ++ERLKHALK LSEAEKQLRVS++RSTW
Sbjct: 781  ASLIMDIIAGAYKALDEKYSEAFLDRRNLTEADLERLKHALKLLSEAEKQLRVSTDRSTW 840

Query: 841  FTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIPKLGSPA 900
            F ATLLQLGS+ S   T TGS+RRQS + T++   + S   I YKQ+S         SP 
Sbjct: 841  FIATLLQLGSMPSPGTTHTGSSRRQSSRATEE---SISREVIAYKQRS-GLQCSNTASPT 900

Query: 901  SLCNLKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKNSF-SRDDATLRNMVFRCKNSE 960
            S+   K+GN   +  LS            +  + +E   S  S DD T   M   C+NSE
Sbjct: 901  SI--RKSGNLVREVKLS-----------SSSSEVLESDTSMASHDDTTASTMTLTCRNSE 951

Query: 961  KLDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSI 1020
            KL++IW+ C++RCHSKTL+QLLYA+GKLLS+SE E  L+AY+AF + +IK+RAERF+SSI
Sbjct: 961  KLNDIWIKCVDRCHSKTLKQLLYAHGKLLSISEVEGILVAYIAFGEGEIKARAERFVSSI 951

Query: 1021 TNSMEMVLRCNVEVRIILLPDGE 1022
            TNS+EMVLR NVEVRIILL + E
Sbjct: 1021 TNSIEMVLRRNVEVRIILLSETE 951

BLAST of Cp4.1LG01g08830 vs. TAIR10
Match: AT4G24790.1 (AT4G24790.1 AAA-type ATPase family protein)

HSP 1 Score: 328.2 bits (840), Expect = 2.3e-89
Identity = 194/408 (47.55%), Positives = 264/408 (64.71%), Query Frame = 1

Query: 465 SFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNC 524
           S SQK+RP  F+EL+GQ +VV+ L++ I RGRI  VYLF GPRGTGKT+ ++IFAAALNC
Sbjct: 240 SLSQKFRPKSFDELVGQEVVVKCLLSTILRGRITSVYLFHGPRGTGKTSTSKIFAAALNC 299

Query: 525 LAPE-ENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYK 584
           L+    ++PCG C EC  + SG+ +D++E D         +R  +K  S    S+  R+K
Sbjct: 300 LSQAAHSRPCGLCSECKSYFSGRGRDVMETDSGKLNRPSYLRSLIKSASLPPVSS--RFK 359

Query: 585 VFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDC 644
           VF+IDEC LL  + W T L   +   Q  VFI +T++L+ +PR + SR QKY F+K+ D 
Sbjct: 360 VFIIDECQLLCQETWGTLLNSLDNFSQHSVFILVTSELEKLPRNVLSRSQKYHFSKVCDA 419

Query: 645 DMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVS 704
           D+  +L +I  EE +D D  A+D IA  +DGSLRDAE ML+QLSLLGKRIT SL  +L+ 
Sbjct: 420 DISTKLAKICIEEGIDFDQGAVDFIASKSDGSLRDAEIMLDQLSLLGKRITTSLAYKLI- 479

Query: 705 TVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLI 764
                       G+VSD++LL+LL LAMSS+T+ TV RARELM S +DP+ L+SQLA++I
Sbjct: 480 ------------GVVSDDELLDLLDLAMSSDTSNTVIRARELMRSKIDPMQLISQLANVI 539

Query: 765 MDIIAGTYNIIDPKDSASI----FCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTW 824
           MDIIAG     + ++S+S     F  R  SE E+++L++ALK LS+AEK LR S  ++TW
Sbjct: 540 MDIIAG-----NSQESSSATRLRFLTRHTSEEEMQKLRNALKILSDAEKHLRASKNQTTW 599

Query: 825 FTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKS 868
            T  LLQL +  S  F    + R Q  K  D + S+TS+G  G   KS
Sbjct: 600 LTVALLQLSNTDSSSFATDENGRNQINK--DVELSSTSSGCPGDVIKS 625

BLAST of Cp4.1LG01g08830 vs. TAIR10
Match: AT5G45720.1 (AT5G45720.1 AAA-type ATPase family protein)

HSP 1 Score: 318.5 bits (815), Expect = 1.8e-86
Identity = 208/542 (38.38%), Positives = 300/542 (55.35%), Query Frame = 1

Query: 326 CGIPCYWSKRTPKHRG-----VCGGCCSPSLSDTWRRKGSSILFGSQSIYSRRKSLNSSN 385
           CGIP  WS+    HRG     + G   S  +SD+  RKG +       ++S         
Sbjct: 240 CGIPFNWSRI--HHRGKTFLDIAGRSLSCGISDSKGRKGEA----GTPMFSD-------- 299

Query: 386 RRLTSGSARGVLPLLTNSADGRVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWSSCR 445
              +S S R  LPLL +SAD           +++ +    GEL + A + L   + S   
Sbjct: 300 ---SSSSDREALPLLVDSAD-----------NEEWVHDYSGELGIFADNLLKNGKDS--- 359

Query: 446 SHEGLEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPV 505
                    + G+           SF+QKY P  F +L+GQN+VVQ+L NAI++ R+  +
Sbjct: 360 ---------VIGKKSSRKNTRWHQSFTQKYAPRTFRDLLGQNLVVQALSNAIAKRRVGLL 419

Query: 506 YLFQGPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEI------D 565
           Y+F GP GTGKT+ AR+FA ALNC + E++KPCG C  C  +  GK + + E+      D
Sbjct: 420 YVFHGPNGTGKTSCARVFARALNCHSTEQSKPCGVCSSCVSYDDGKNRYIREMGPVKSFD 479

Query: 566 GTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVF 625
             N      IR Q K+             V + D+C  + +  W T  K  +  P+RVVF
Sbjct: 480 FENLLDKTNIRQQQKQ-----------QLVLIFDDCDTMSTDCWNTLSKIVDRAPRRVVF 539

Query: 626 IFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNADG 685
           + + + LD +P  I SRCQK+ F K+KD D+++ L+ I+++E +D D DAL L+A  +DG
Sbjct: 540 VLVCSSLDVLPHIIVSRCQKFFFPKLKDVDIIDSLQLIASKEEIDIDKDALKLVASRSDG 599

Query: 686 SLRDAETMLEQLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSN 745
           SLRDAE  LEQLSLLG RI++ LV E+             VG++SDEKL++LL LA+S++
Sbjct: 600 SLRDAEMTLEQLSLLGTRISVPLVQEM-------------VGLISDEKLVDLLDLALSAD 659

Query: 746 TAETVKRARELMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVE 805
           T  TVK  R +M++G++PL LMSQLA++I DI+AG+Y+    +     F  + LS+ ++E
Sbjct: 660 TVNTVKNLRIIMETGLEPLALMSQLATVITDILAGSYDFTKDQCKRKFFRRQPLSKEDME 717

Query: 806 RLKHALKFLSEAEKQLRVSSERSTWFTATLLQLGSISS--LDFTPTGSNRRQSCKTTDDD 855
           +LK ALK LSE+EKQLRVS+++ TW TA LLQL       L  + +          TD D
Sbjct: 720 KLKQALKTLSESEKQLRVSNDKLTWLTAALLQLAPDKQYLLPHSSSADASFNHTPLTDSD 717

BLAST of Cp4.1LG01g08830 vs. TAIR10
Match: AT4G18820.1 (AT4G18820.1 AAA-type ATPase family protein)

HSP 1 Score: 306.2 bits (783), Expect = 9.3e-83
Identity = 211/515 (40.97%), Positives = 293/515 (56.89%), Query Frame = 1

Query: 324 QGCGIPCYWSKRTPKHRGV-----CGGCCSPSLSDT-WRRKG-SSILFGSQSIYSRRKSL 383
           + CGIP  WS+    HRG       G   S  +SD+   RKG ++   GS  +  +    
Sbjct: 298 KACGIPFNWSRI--HHRGKTFLDKAGRSLSCGMSDSKGGRKGETNERNGSDKMMIQSDDD 357

Query: 384 NSSNRRLTSGSARGVLPLLTNSA--DGRVGSSIGT-GRSDDELSTNFGELDLEALSRLDG 443
           +SS      GS    LPLL +S   DG V    G  G   D L  N  + DL +  R  G
Sbjct: 358 SSS----FIGSDGEALPLLVDSGENDGWVHDYSGELGIFADSLLKNDEDSDLASEGR-SG 417

Query: 444 RRWSSCRSHEGLEIVALNGEVEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAIS 503
            +    +SH       +N         +S T   +KY P  F +L+GQN+VVQ+L NA++
Sbjct: 418 EKKHKKKSH-------VNARHRHRQQHQSLT---EKYTPKTFRDLLGQNLVVQALSNAVA 477

Query: 504 RGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEI 563
           R ++  +Y+F GP GTGKT+ ARIFA ALNC + E+ KPCG C  C     GK  ++ E+
Sbjct: 478 RRKLGLLYVFHGPNGTGKTSCARIFARALNCHSMEQPKPCGTCSSCVSHDMGKSWNIREV 537

Query: 564 DGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEP-PQRV 623
                   ++I   L      SS +    +VF+ D+C  L S  W    K  +   P+ V
Sbjct: 538 GPVGNYDFEKIMDLLDGNVMVSSQS---PRVFIFDDCDTLSSDCWNALSKVVDRAAPRHV 597

Query: 624 VFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNA 683
           VFI + + LD +P  I SRCQK+ F K+KD D+V  L+ I+++E ++ D DAL LIA  +
Sbjct: 598 VFILVCSSLDVLPHVIISRCQKFFFPKLKDADIVYSLQWIASKEEIEIDKDALKLIASRS 657

Query: 684 DGSLRDAETMLEQLSLLGKRITISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMS 743
           DGSLRDAE  LEQLSLLG+RI++ LV EL             VG+VSDEKL++LL LA+S
Sbjct: 658 DGSLRDAEMTLEQLSLLGQRISVPLVQEL-------------VGLVSDEKLVDLLDLALS 717

Query: 744 SNTAETVKRARELMDSGVDPLVLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETE 803
           ++T  TVK  R +M++ V+PL LMSQLA++I DI+AG+Y+    +     F  + L + +
Sbjct: 718 ADTVNTVKNLRTIMETSVEPLALMSQLATVITDILAGSYDFTKDQHKRKFFRRQPLPKED 777

Query: 804 VERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 828
           +E+L+ ALK LSEAEKQLRVS+++ TW TA LLQL
Sbjct: 778 MEKLRQALKTLSEAEKQLRVSNDKLTWLTAALLQL 779
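
Each HSP summary above reports identities and positives as a count over the alignment length, followed by the percentage. The following is a minimal Python sketch, not part of the original report, for reading those summary lines and recomputing the percentages. The function name `parse_hsp_summary` and the returned field names are hypothetical, and the pattern assumes the exact line layout shown in this report (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Pattern for the summary line layout used in this report (an assumption:
# other BLAST output flavours word this line differently).
HSP_SUMMARY_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages, or None if no match.

    Hypothetical helper for illustration only.
    """
    match = HSP_SUMMARY_RE.search(line)
    if match is None:
        return None
    identities, id_length = int(match.group(1)), int(match.group(2))
    positives, pos_length = int(match.group(4)), int(match.group(5))
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": id_length,
        # Percentages recomputed from the counts, rounded to two decimals
        # as in the report.
        "identity_pct": round(100.0 * identities / id_length, 2),
        "positives_pct": round(100.0 * positives / pos_length, 2),
        # Percentages as printed, kept for cross-checking.
        "reported_identity_pct": float(match.group(3)),
        "reported_positives_pct": float(match.group(6)),
    }


if __name__ == "__main__":
    example = ("Identity = 211/515 (40.97%), Postives = 293/515 (56.89%), "
               "Query Frame = 1")
    print(parse_hsp_summary(example))
```

Comparing the recomputed and reported percentages is a quick consistency check when these reports are copied or reformatted.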

BLAST of Cp4.1LG01g08830 vs. NCBI nr
Match: gi|659102544|ref|XP_008452189.1| (PREDICTED: protein STICHEL [Cucumis melo])

HSP 1 Score: 2136.3 bits (5534), Expect = 0.0e+00
Identity = 1128/1284 (87.85%), Positives = 1177/1284 (91.67%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAA-----VPGGA 60
            MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPL+SSRSV+AA     V GGA
Sbjct: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLSSSRSVMAATATAVVAGGA 60

Query: 61   SSSLNKNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKE 120
            SSSLNKNLE +TRR+SGQSQL+AIVP RNENRNPKDKKIYLYNWKSHKSSSEKS   Q E
Sbjct: 61   SSSLNKNLECDTRRYSGQSQLEAIVPLRNENRNPKDKKIYLYNWKSHKSSSEKSATLQNE 120

Query: 121  DRDGNNGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVSYGG 180
            DRDGN+  NDGSYSVPG+SLD SLSDARNGGDSKSDTYLGDL SSMVFRCGDANLVSY G
Sbjct: 121  DRDGNDDNNDGSYSVPGVSLDGSLSDARNGGDSKSDTYLGDLYSSMVFRCGDANLVSYSG 180

Query: 181  PLAKRASAVKKKSKKHCSHLDVLSRHRQKGP--VLGRKLLEGHPSLSISFSQDDSIEQSD 240
            P AKR SA KKKSKKHCSHLDVLSRH+QKGP  +LGRKLLEGHPSLSI+FSQDDSIEQSD
Sbjct: 181  PSAKRTSAFKKKSKKHCSHLDVLSRHQQKGPGPLLGRKLLEGHPSLSINFSQDDSIEQSD 240

Query: 241  DTEDYSNSEDFRRYSAASPLLLKL-----HPSAKLLRNHRKEDSSYSYSTPALSTSSYNR 300
            DTEDYSNSEDFRRYSAASPLLLKL     HPS+K LRN RKEDSSYSYSTPALSTSSYNR
Sbjct: 241  DTEDYSNSEDFRRYSAASPLLLKLKHKSFHPSSKFLRNSRKEDSSYSYSTPALSTSSYNR 300

Query: 301  YVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGCCS 360
            YVN NPSTVGSW+GTTTSINDADDEVDD+LDFPGRQGCGIPCYWSKRTPKHRG+CG CCS
Sbjct: 301  YVNRNPSTVGSWDGTTTSINDADDEVDDRLDFPGRQGCGIPCYWSKRTPKHRGICGSCCS 360

Query: 361  PSLSDTWRRKGSSILFGSQSIYSRRKSLNSSNRRLTSGSARGVLPLLTNSADGRVGSSIG 420
            PSLSDT RRKGSSILFGSQSIYSRRKS+NSS RR  SGSARGVLPLLTNSADG VGSSIG
Sbjct: 361  PSLSDTLRRKGSSILFGSQSIYSRRKSINSSKRRFASGSARGVLPLLTNSADGGVGSSIG 420

Query: 421  TGRSDDELSTNFGELDLEALSRLDGRRW-SSCRSHEGLEIVALNGEVEEGSTPESTTSFS 480
            TGRSDDELSTNFGELDLEALSRLDGRRW SSCRSHEGLEIVALNGEVE G TPEST SFS
Sbjct: 421  TGRSDDELSTNFGELDLEALSRLDGRRWSSSCRSHEGLEIVALNGEVEGGGTPESTRSFS 480

Query: 481  QKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAP 540
            QKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKT AARIFAAALNCLAP
Sbjct: 481  QKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTAAARIFAAALNCLAP 540

Query: 541  EENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLI 600
            EENKPCGYCRECTDFM+GKQKDLLE+DGTN+KGIDRIRYQLK LSSG SSAFLRYKVFLI
Sbjct: 541  EENKPCGYCRECTDFMAGKQKDLLEVDGTNKKGIDRIRYQLKMLSSGQSSAFLRYKVFLI 600

Query: 601  DECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVE 660
            DECHLLPSKAWL FLKFFEEPPQRVVFIFITTDLDS+PRTIQSRCQKY+FNKIKDCDMVE
Sbjct: 601  DECHLLPSKAWLAFLKFFEEPPQRVVFIFITTDLDSVPRTIQSRCQKYLFNKIKDCDMVE 660

Query: 661  RLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVSTVPI 720
            RLKRISA+ENLD DLDALDLIAMNADGSLRDAETMLEQLSLLGKRIT SLVNE       
Sbjct: 661  RLKRISADENLDVDLDALDLIAMNADGSLRDAETMLEQLSLLGKRITTSLVNE------- 720

Query: 721  LSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII 780
                  LVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII
Sbjct: 721  ------LVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII 780

Query: 781  AGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 840
            AGTYNIID KDSASIF GRSLSE EVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL
Sbjct: 781  AGTYNIIDTKDSASIFGGRSLSEAEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 840

Query: 841  GSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLI-PKLGSPASLCNLKN 900
            GSISS DFT TGS+RRQSCKTTDDDPS+TSNGTI YKQKSF+ L+ P LGSPASLCNLKN
Sbjct: 841  GSISSPDFTQTGSSRRQSCKTTDDDPSSTSNGTIAYKQKSFAQLMPPNLGSPASLCNLKN 900

Query: 901  GNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKN-SFSRDDATLRNMVFRCKNSEKLDNIWV 960
            GNYNNQ D+  MVD+L  N KPTHKQF+EGK+ SFSR+D TLRNMV R KNSEKL++IWV
Sbjct: 901  GNYNNQADMVSMVDNLIYNSKPTHKQFIEGKDLSFSREDVTLRNMVVRSKNSEKLNSIWV 960

Query: 961  HCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSMEMV 1020
            HCIERCHSKTLRQLLYA+GKLLS+SESE TLIAY+AFED DIKSRAERFLSSITN MEMV
Sbjct: 961  HCIERCHSKTLRQLLYAHGKLLSISESEGTLIAYIAFEDVDIKSRAERFLSSITNFMEMV 1020

Query: 1021 LRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKERKIANLNAMEGYSSRSLILDG 1080
            LRCNVEVRIILLPDGE S    TAAK SEGVE    DKERK +N NAMEGYS+RSL+LD 
Sbjct: 1021 LRCNVEVRIILLPDGEAS----TAAKLSEGVE---PDKERKTSNPNAMEGYSNRSLMLDA 1080

Query: 1081 TYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS 1140
            TYQ+TSDSSQLP+ESN+Q DGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS
Sbjct: 1081 TYQSTSDSSQLPAESNHQNDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS 1140

Query: 1141 RLKPEKNQVLPQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKANEELIAQKEQVGR 1200
            RLKPEKNQVLPQDGSYYKDQ +EMNSTG SSRKW+DELNRELKVLK  ++++AQKEQVGR
Sbjct: 1141 RLKPEKNQVLPQDGSYYKDQMDEMNSTGGSSRKWEDELNRELKVLKVGDDILAQKEQVGR 1200

Query: 1201 RVDRYAISPSILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGR 1260
            R DRYAISPSILHDG MVGN+NKDNLGYESSSA GGCSGLFCWNNSK HKRGKVR NH R
Sbjct: 1201 RADRYAISPSILHDGSMVGNSNKDNLGYESSSAAGGCSGLFCWNNSKPHKRGKVRANHVR 1260

Query: 1261 SRSGRFSLFGECGKSRNFGSRSRR 1270
            SR+GRFSLFGECGKSRN GSR RR
Sbjct: 1261 SRNGRFSLFGECGKSRNSGSRFRR 1264

BLAST of Cp4.1LG01g08830 vs. NCBI nr
Match: gi|449431904|ref|XP_004133740.1| (PREDICTED: protein STICHEL [Cucumis sativus])

HSP 1 Score: 2131.3 bits (5521), Expect = 0.0e+00
Identity = 1122/1284 (87.38%), Positives = 1175/1284 (91.51%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAA-----VPGGA 60
            MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPL+SSRSV+AA     V GGA
Sbjct: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLSSSRSVMAATATAVVAGGA 60

Query: 61   SSSLNKNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKE 120
            SSSLNKNLE ETRR+SGQSQLDAIVP RNENRNPKDKKIYLYNWKSHKSSSEKS   Q E
Sbjct: 61   SSSLNKNLECETRRYSGQSQLDAIVPLRNENRNPKDKKIYLYNWKSHKSSSEKSATLQNE 120

Query: 121  DRDGNNGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDLCSSMVFRCGDANLVSYGG 180
            D DGN+  NDGSYSVPG+SLD SLSDARNGGDSKSDTYLGDL SSMVFRCGDANLVSY G
Sbjct: 121  DHDGNDDNNDGSYSVPGVSLDGSLSDARNGGDSKSDTYLGDLYSSMVFRCGDANLVSYSG 180

Query: 181  PLAKRASAVKKKSKKHCSHLDVLSRHRQKGP--VLGRKLLEGHPSLSISFSQDDSIEQSD 240
            P AKR SA KKKSKKHCSHLDVLSRH+QKGP  ++GRKLLEGHPSLSI+FSQDDSIEQSD
Sbjct: 181  PSAKRTSAFKKKSKKHCSHLDVLSRHQQKGPGPLMGRKLLEGHPSLSINFSQDDSIEQSD 240

Query: 241  DTEDYSNSEDFRRYSAASPLLLKL-----HPSAKLLRNHRKEDSSYSYSTPALSTSSYNR 300
            DTEDYSNSEDFRRYSAASPLLLKL     HPS+K LRN RKEDSSYSYSTPALSTSSYNR
Sbjct: 241  DTEDYSNSEDFRRYSAASPLLLKLKHKSFHPSSKFLRNSRKEDSSYSYSTPALSTSSYNR 300

Query: 301  YVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGCCS 360
            YVN NPSTVGSW+GTTTSINDADDEVDD+LDFPGRQGCGIPCYWSKRTPKHRG+CG CCS
Sbjct: 301  YVNRNPSTVGSWDGTTTSINDADDEVDDRLDFPGRQGCGIPCYWSKRTPKHRGICGSCCS 360

Query: 361  PSLSDTWRRKGSSILFGSQSIYSRRKSLNSSNRRLTSGSARGVLPLLTNSADGRVGSSIG 420
            PSLSDT RRKGSSILFGSQSIYSRRKS+NSS RR  SGSARGVLPLLTNSADG VGSSIG
Sbjct: 361  PSLSDTLRRKGSSILFGSQSIYSRRKSINSSKRRFASGSARGVLPLLTNSADGGVGSSIG 420

Query: 421  TGRSDDELSTNFGELDLEALSRLDGRRW-SSCRSHEGLEIVALNGEVEEGSTPESTTSFS 480
            TGRSDDELSTNFGELDLEALSRLDGRRW SSCRSHEGLEIVALNGEVE G TPEST SFS
Sbjct: 421  TGRSDDELSTNFGELDLEALSRLDGRRWSSSCRSHEGLEIVALNGEVEGGGTPESTRSFS 480

Query: 481  QKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNCLAP 540
            QKY+PMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKT AARIFAAALNCLAP
Sbjct: 481  QKYKPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTAAARIFAAALNCLAP 540

Query: 541  EENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKVFLI 600
            EENKPCGYCRECTDFM+GKQKDLLE+DGTN+KGID+IRYQLK LSSG SSAF RYK+FL+
Sbjct: 541  EENKPCGYCRECTDFMAGKQKDLLEVDGTNKKGIDKIRYQLKLLSSGQSSAFFRYKIFLV 600

Query: 601  DECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCDMVE 660
            DECHLLPSKAWL FLK FEEPPQRVVFIFITTDLDS+PRTIQSRCQKY+FNKIKDCDMVE
Sbjct: 601  DECHLLPSKAWLAFLKLFEEPPQRVVFIFITTDLDSVPRTIQSRCQKYLFNKIKDCDMVE 660

Query: 661  RLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVSTVPI 720
            RLKRISA+ENLD DLDALDLIAMNADGSLRDAETMLEQLSLLGKRIT SLVNE       
Sbjct: 661  RLKRISADENLDVDLDALDLIAMNADGSLRDAETMLEQLSLLGKRITTSLVNE------- 720

Query: 721  LSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII 780
                  LVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII
Sbjct: 721  ------LVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIMDII 780

Query: 781  AGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 840
            AGTYNIID KD ASIF GRSLSE EVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL
Sbjct: 781  AGTYNIIDTKDGASIFGGRSLSEAEVERLKHALKFLSEAEKQLRVSSERSTWFTATLLQL 840

Query: 841  GSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLI-PKLGSPASLCNLKN 900
            GSISS DFT TGS+RRQSCKTTDDDPS+TSNGTI YKQKSF+ L+ P LGSP SLCNLKN
Sbjct: 841  GSISSPDFTQTGSSRRQSCKTTDDDPSSTSNGTIAYKQKSFAQLMPPNLGSPTSLCNLKN 900

Query: 901  GNYNNQGDLSPMVDSLSNNPKPTHKQFMEGK-NSFSRDDATLRNMVFRCKNSEKLDNIWV 960
            GNYNNQ D+ PMVD+L  N KPTHKQF+EGK +SFSR+D TLRNMVFR KNSEKL++IWV
Sbjct: 901  GNYNNQADMVPMVDNLIYNSKPTHKQFIEGKDSSFSREDVTLRNMVFRSKNSEKLNSIWV 960

Query: 961  HCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSMEMV 1020
            HCIERCHSKTLRQLLYA+GKLLS+SESE TLIAYVAFED DIKSRAERFLSSITNSMEMV
Sbjct: 961  HCIERCHSKTLRQLLYAHGKLLSISESEGTLIAYVAFEDVDIKSRAERFLSSITNSMEMV 1020

Query: 1021 LRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKERKIANLNAMEGYSSRSLILDG 1080
            LRCNVEVRIILLPDGE S    TAAK SEGVE    DKER+ +NLNAMEGYS+RSL+LD 
Sbjct: 1021 LRCNVEVRIILLPDGEAS----TAAKLSEGVE---PDKERRTSNLNAMEGYSNRSLMLDA 1080

Query: 1081 TYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS 1140
            TYQ+TSDSSQLP+ESN+Q DGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS
Sbjct: 1081 TYQSTSDSSQLPTESNHQNDGSRDRRQEIPMQRIESIIREQRLETAWLQAMEKGTPGSLS 1140

Query: 1141 RLKPEKNQVLPQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKANEELIAQKEQVGR 1200
            RLKPEKNQVLPQDGSYYKDQ +EMNST DSSRKW+DELNRELKVLK  ++++AQKEQVGR
Sbjct: 1141 RLKPEKNQVLPQDGSYYKDQMDEMNSTEDSSRKWEDELNRELKVLKVGDDILAQKEQVGR 1200

Query: 1201 RVDRYAISPSILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGR 1260
            R DRYAISPSILHDG MVGN+NKDNLGYESSSA GGCSGLFCWN+SK HKR KVR NH R
Sbjct: 1201 RADRYAISPSILHDGSMVGNSNKDNLGYESSSAAGGCSGLFCWNSSKPHKRAKVRANHVR 1260

Query: 1261 SRSGRFSLFGECGKSRNFGSRSRR 1270
            SR+GRFSLFGECGKSRN GSR RR
Sbjct: 1261 SRNGRFSLFGECGKSRNSGSRFRR 1264

BLAST of Cp4.1LG01g08830 vs. NCBI nr
Match: gi|1000966104|ref|XP_015574794.1| (PREDICTED: protein STICHEL [Ricinus communis])

HSP 1 Score: 1469.1 bits (3802), Expect = 0.0e+00
Identity = 827/1318 (62.75%), Positives = 991/1318 (75.19%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSVLAAV---PGGASS 60
            M+E+RVSDPS+LHLKKELTQIRKAARVLRDPGTTSSWKSP++SSRS  AA       AS+
Sbjct: 1    MSEMRVSDPSRLHLKKELTQIRKAARVLRDPGTTSSWKSPISSSRSAAAATLAAAAAAST 60

Query: 61   SLNKNLESET---RRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQK 120
            S  K  ++E      H+  S +D+    RN   N K+K+++LYNWK+ KSSSEKS I + 
Sbjct: 61   SAWKQFDNENVIPNGHNSNSHMDSYF--RN---NGKEKRVFLYNWKTQKSSSEKSAIARN 120

Query: 121  E-DRDGNNGTNDGSYSVPGLSLDDSLSDARNGGDSKSDTYLGDL-CSSMVFRCGDANLVS 180
            + D D        S SV   S+DDSLSDARN  DSKSDTYLGD   SSM+FRC DANLVS
Sbjct: 121  DLDEDYE------SRSVQD-SVDDSLSDARNAADSKSDTYLGDSRSSSMIFRCRDANLVS 180

Query: 181  YGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQS 240
               P  +RA  +KKKSKK  +HLD+LSR++QK   L R+LL+ HPS+++   ++DS+EQS
Sbjct: 181  ---PSMRRAMGIKKKSKKTDTHLDILSRYQQKEINL-RRLLKSHPSIALGLGREDSVEQS 240

Query: 241  DDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLRNHRKEDSSYSYSTPALSTSSY 300
            DDTEDYSNSED R+ S ASPLL+KL      H  +KLLR  RKEDSSY+YSTPALSTSSY
Sbjct: 241  DDTEDYSNSEDLRKISGASPLLIKLKHKRWSHSPSKLLRISRKEDSSYTYSTPALSTSSY 300

Query: 301  NRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGC 360
            NRY N+NPSTVGSW+GTT S+ND DDEVDD LD PGRQGCGIPCYWSKRTP+HRGVCG C
Sbjct: 301  NRYCNHNPSTVGSWDGTTASVNDGDDEVDDHLDLPGRQGCGIPCYWSKRTPRHRGVCGSC 360

Query: 361  CSPSLSDTWRRKGSSILFGSQSIYSRRKSLNS--SNRRLTSGSARGVLPLLTNSADGRVG 420
            CSPSLSDT +RKG+S+L G QS+Y RR   +S  + RR++S SA+G+LPLL NS DGR G
Sbjct: 361  CSPSLSDTIQRKGTSMLCGRQSMYHRRWHSSSVYNKRRISSRSAQGLLPLLANS-DGRGG 420

Query: 421  SSIGTGRSDDELSTNFGELDLEALSRLDGRRWSSCRSHEGLEIVALNGEVEEGSTPESTT 480
            SSIGTG SDDELSTNFGELDLEALSRLDGRRWSSCRS +GLEIVALNG+ EE  TPE+  
Sbjct: 421  SSIGTGNSDDELSTNFGELDLEALSRLDGRRWSSCRSQDGLEIVALNGDGEEEGTPENIR 480

Query: 481  SFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNC 540
            S SQKY+P+FF E+IGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKT+ ARIFA+ALNC
Sbjct: 481  SLSQKYKPLFFGEVIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTSTARIFASALNC 540

Query: 541  LAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKV 600
            ++ EE KPCGYCR+C+DF+SGK +DL E+DGTN+KGID++R+ LKK+S    +   RYKV
Sbjct: 541  ISTEETKPCGYCRDCSDFISGKARDLWEVDGTNKKGIDKVRHLLKKVSQWPPTGSSRYKV 600

Query: 601  FLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCD 660
            FLIDECHLLPSK WL FLKF EEPPQRVVFIFITTD D++PRT+QSRCQKY+FNKIKD D
Sbjct: 601  FLIDECHLLPSKMWLAFLKFLEEPPQRVVFIFITTDPDNVPRTVQSRCQKYLFNKIKDGD 660

Query: 661  MVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVST 720
            +V RL+++S+EENLD +LDALDLIA+NADGSLRDAETML+QLSLLGKRIT SLVNELV  
Sbjct: 661  IVARLRKVSSEENLDVELDALDLIALNADGSLRDAETMLDQLSLLGKRITTSLVNELV-- 720

Query: 721  VPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIM 780
                       G+V DEKLLELL L+MSS+TAETVKRAR+L+ SGVDPLVLMSQLASLIM
Sbjct: 721  -----------GVVPDEKLLELLELSMSSDTAETVKRARDLLHSGVDPLVLMSQLASLIM 780

Query: 781  DIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATL 840
            DIIAGT+N+ D K S S+F GRSL+E E+ERLKHALK LSEAEKQLRVSS+RSTWFTATL
Sbjct: 781  DIIAGTHNVADAKYSISLFGGRSLTEAELERLKHALKLLSEAEKQLRVSSDRSTWFTATL 840

Query: 841  LQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFS-HLIPKLGSPASLCN 900
            LQLGS+ S D T + S+RRQS +TT++DPS+ S     YKQKS + +L  +  SPASL  
Sbjct: 841  LQLGSVPSPDLTQSSSSRRQSSRTTEEDPSSASREVTVYKQKSDAQYLSRRSSSPASLYK 900

Query: 901  LKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKNSFSRDDATLRNMVFRCKNSEKLDNI 960
              NG  +++G+        ++  +P+H       +S SRDD  + +M  R +N+EKLD I
Sbjct: 901  AINGKSSHRGEFG-----FNSKLRPSHS-IDSCMSSASRDDELVESMPLRYRNAEKLDRI 960

Query: 961  WVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSME 1020
            W  CI  CHS TLRQLL+ +GKL SLSE E  L+ YVAF D DIK+RAERF+SSITNS+E
Sbjct: 961  WEKCIANCHSNTLRQLLHTHGKLFSLSEVEGALVVYVAFGDEDIKARAERFMSSITNSIE 1020

Query: 1021 MVLRCNVEVRIILLPDGETSINGM-------------TAAKSSEGVEH---------ELV 1080
            MVLRCNVEVRII +PDGE S+N +             T A   E   +         +  
Sbjct: 1021 MVLRCNVEVRIIFVPDGEDSMNCVNQSELQIQKQVEATMAIEQEKKANCVNPVNGYSDAQ 1080

Query: 1081 DKERKIAN---------LNAMEGYSSRSL-ILDGTYQATSDSSQLPSESNNQIDGSRDRR 1140
             + RK++          L    G   +SL +LD ++Q+TS S++L  E+N + DG ++  
Sbjct: 1081 QESRKLSRGSFNDLDSKLKGGSGDYLKSLTLLDSSFQSTSLSAELLPEANTESDGVKETG 1140

Query: 1141 QEIPMQRIESIIREQRLETAWLQAMEKGTPGSLSRLKPEKNQVLPQDGSYYKDQTEEMNS 1200
            QE+PMQRIESIIREQRLETAWLQA EKGTPGSLSRLKPEKNQVLPQ+    ++Q E  +S
Sbjct: 1141 QELPMQRIESIIREQRLETAWLQAAEKGTPGSLSRLKPEKNQVLPQEDCQ-QNQMESASS 1200

Query: 1201 TGDSSRKWDDELNRELKVLKANEELIAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNL 1260
               SS+ W+ ELN ELKVLK  E  +  K+Q+G+R D Y ISPS+LH    VGN NK++L
Sbjct: 1201 MALSSQHWEHELNDELKVLKMEERRVLHKDQIGKRADHYPISPSLLHGSNFVGNLNKESL 1260

Query: 1261 GYESSSAVGGCSGLFCWNNSKSHKRGKVRTNHGRSRSGRFSLFGECGKSRNFGSRSRR 1270
            GYESSSA GGCSGLFCWN +KSHK       + R + GRFSLFGECGK +   +R +R
Sbjct: 1261 GYESSSAGGGCSGLFCWNANKSHKVNGTPVRY-RGKGGRFSLFGECGKHKKTENRIKR 1280

BLAST of Cp4.1LG01g08830 vs. NCBI nr
Match: gi|802640476|ref|XP_012078831.1| (PREDICTED: protein STICHEL-like [Jatropha curcas])

HSP 1 Score: 1464.1 bits (3789), Expect = 0.0e+00
Identity = 825/1312 (62.88%), Positives = 989/1312 (75.38%), Query Frame = 1

Query: 1    MAEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPLNSSRSV----LAAVPGGAS 60
            M+E+RVSDPS+LHLKKELTQIRKAAR+LRDPGTTSSWKSPL+SSRS     LAA    ++
Sbjct: 1    MSEMRVSDPSRLHLKKELTQIRKAARLLRDPGTTSSWKSPLSSSRSAVAATLAATASTSA 60

Query: 61   SSLNKNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSSSEKSVIHQKED 120
            S   + LE+E    +  S LD+    RN N N K+K+++LYNWK+ KSSSEKS + + E 
Sbjct: 61   SVWKQQLENENVIPNN-SHLDSHF--RN-NGNGKEKRVFLYNWKNQKSSSEKSAMAKNEA 120

Query: 121  RDGNNGTNDGSYSVPGL--SLDDSLSDARN-GGDSKSDTYLGDL-CSSMVFRCGDANLVS 180
                    D  Y    +  SLDDSLSDARN G DSKSDTY+G+   SSM+FRC DA+LVS
Sbjct: 121  --------DEDYESRSIQESLDDSLSDARNVGADSKSDTYVGESRSSSMIFRCRDASLVS 180

Query: 181  YGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLLEGHPSLSISFSQDDSIEQS 240
               P  +RA  +KKKSKK  +HLD+LSR++QK   L R+LL+ HPS+++   +DD +EQS
Sbjct: 181  ---PSMRRAMGIKKKSKKTNTHLDILSRYQQKEMNL-RRLLKSHPSMALGLGRDDYVEQS 240

Query: 241  DDTEDYSNSEDFRRYSAASPLLLKL------HPSAKLLRNHRKEDSSYSYSTPALSTSSY 300
            DDTE+YSNSED R+ S ASPLL+KL      H  +KLLRN RKEDSS +YSTPALSTSSY
Sbjct: 241  DDTEEYSNSEDLRKISGASPLLIKLKHKNWSHSPSKLLRNSRKEDSSCTYSTPALSTSSY 300

Query: 301  NRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRTPKHRGVCGGC 360
            NRY   NPSTVGSW+  TTS+ND DDE DD LD PGRQGCGIPCYWSKRTP+HRG CG C
Sbjct: 301  NRYCIRNPSTVGSWDAATTSLNDGDDEEDDHLDLPGRQGCGIPCYWSKRTPRHRGPCGSC 360

Query: 361  CSPSLSDTWRRKGSSILFGSQSIYSRRK--SLNSSNRRLTSGSARGVLPLLTNSADGRVG 420
            CSPSLSDT RRKG+SIL GSQS+Y RR+  S  S+ RR+TS S +G+LPLL NS D R G
Sbjct: 361  CSPSLSDTIRRKGTSILCGSQSMYHRRRRSSSISNKRRITSRSGQGLLPLLANSED-RGG 420

Query: 421  SSIGTGRSDDELSTNFGELDLEALSRLDGRRWSSCRSHEGLEIVALNGEVEEGSTPESTT 480
            SSI TG SDDELSTNFGELDLEALSRLDGRRWSSCRS +GLEIVALNG+ EE  TPE+  
Sbjct: 421  SSIETGNSDDELSTNFGELDLEALSRLDGRRWSSCRSQDGLEIVALNGDGEEEDTPENIR 480

Query: 481  SFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTTAARIFAAALNC 540
            S SQKY+P+FF+E+IGQNIVVQSLINA+SRGRIAPVYLFQGPRGTGKT+ ARIFA+ALNC
Sbjct: 481  SLSQKYKPLFFSEVIGQNIVVQSLINAVSRGRIAPVYLFQGPRGTGKTSTARIFASALNC 540

Query: 541  LAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSSGSSSAFLRYKV 600
            ++ EE KPCGYCREC+DF+SGK +DL E+DGTN+KGID++ + LKK+S    +   RYK+
Sbjct: 541  MSTEETKPCGYCRECSDFISGKTRDLWEVDGTNKKGIDKVSHLLKKVSQWPPTGSSRYKI 600

Query: 601  FLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQKYIFNKIKDCD 660
            FLIDECHLLPSK WL FLKF EEPPQRVVFIFITTD D++PRT+QSRCQKY+F+KIKD D
Sbjct: 601  FLIDECHLLPSKMWLAFLKFLEEPPQRVVFIFITTDPDNVPRTVQSRCQKYLFSKIKDGD 660

Query: 661  MVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRITISLVNELVST 720
            +V RL++ISAEENLD +LDALDLIAMNADGSLRD+ETML+QLSLLGKRIT SLVNE    
Sbjct: 661  IVARLRKISAEENLDVELDALDLIAMNADGSLRDSETMLDQLSLLGKRITTSLVNE---- 720

Query: 721  VPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPLVLMSQLASLIM 780
                     LVG+V DEKLLELL L+MSS+TAETVKRAR+LMDSGVDP+VLMSQLASLIM
Sbjct: 721  ---------LVGVVPDEKLLELLELSMSSDTAETVKRARDLMDSGVDPMVLMSQLASLIM 780

Query: 781  DIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVSSERSTWFTATL 840
            DIIAGTYN++D K S S F GRSL+E E+ERLKHALK LSEAEKQLRVSS+RSTWFTATL
Sbjct: 781  DIIAGTYNVVDAKHSNSFFGGRSLTEAELERLKHALKLLSEAEKQLRVSSDRSTWFTATL 840

Query: 841  LQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKS-FSHLIPKLGSPASLCN 900
            LQLGS+ S D T + S+RRQS +TT++DPS+TS     YKQKS   +L  +  SPASL  
Sbjct: 841  LQLGSVPSPDLTQSSSSRRQSSRTTEEDPSSTSREVTIYKQKSDAQYLSRRSSSPASLYK 900

Query: 901  LKNGNYNNQGDLSPMVDSLSNNPKPTHKQFMEGKNS-FSRDDATLRNMVFRCKNSEKLDN 960
              N N                + KP   + M  + S  S DD  +  M+FR +N++KLD+
Sbjct: 901  AINEN-----------SEFGFSSKPLPSRTMHSRTSTASWDDELVETMLFRYRNADKLDH 960

Query: 961  IWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAERFLSSITNSM 1020
            IW  CI +CHS TLRQLL+A+GKL S+SE E  L+ YVAF D DIK+RAERF+SSITNS+
Sbjct: 961  IWEKCIAKCHSNTLRQLLHAHGKLFSISELEGILVVYVAFGDEDIKARAERFMSSITNSI 1020

Query: 1021 EMVLRCNVEVRIILLPDGETSIN--GMTAAKSSEGVEHELV-DKERKIANLNAMEGYS-- 1080
            EMVLRCNVEVRIIL+PDG  S+N    +  +  +  E  L  ++ERK  + N + GYS  
Sbjct: 1021 EMVLRCNVEVRIILVPDGVDSMNCVNQSELQGQKRAEATLANEQERKENSSNLLNGYSDS 1080

Query: 1081 -SRSLIL--------------------DGTYQATSDSSQLPSESNNQIDGSRDRRQEIPM 1140
               SL L                    +  +Q+T+ S++LP + + +  G R+R+QE+PM
Sbjct: 1081 QQESLKLSRGSFNDLESKLKGGSSNLRESPFQSTALSTELPPDPDAENGGVRERKQELPM 1140

Query: 1141 QRIESIIREQRLETAWLQAMEKGTPGSLSRLKPEKNQVLPQDGSYYKDQTEEMNSTGDSS 1200
            QRIESIIREQRLETAWLQA EKGTPGSLSRLKPEKNQVLPQ+ +Y ++Q E  +S G SS
Sbjct: 1141 QRIESIIREQRLETAWLQAAEKGTPGSLSRLKPEKNQVLPQEDNYRQNQMESASSMGLSS 1200

Query: 1201 RKWDDELNRELKVLKANEELIAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNLGYESS 1260
            + W+DELN ELKVLK  + ++  K+Q+G+R DRY ISPS+LHD  +VG  N +NLGYESS
Sbjct: 1201 QHWEDELNHELKVLKMEDRMVVYKDQIGKRADRYPISPSLLHDNNLVGYPNNENLGYESS 1260

Query: 1261 SAVGGCSGLFCWNNSKSHKRGKVRTNHGRSR--SGRFSLFGECGKSRNFGSR 1267
            SA GGCSGL CWN ++S K GK +    RSR  SGRF+LFGECGK +   +R
Sbjct: 1261 SASGGCSGLLCWNANRSLK-GKAKGTSVRSRHKSGRFTLFGECGKHKKAENR 1270

BLAST of Cp4.1LG01g08830 vs. NCBI nr
Match: gi|645223428|ref|XP_008218626.1| (PREDICTED: protein STICHEL-like [Prunus mume])

HSP 1 Score: 1463.4 bits (3787), Expect = 0.0e+00
Identity = 835/1301 (64.18%), Positives = 975/1301 (74.94%), Query Frame = 1

Query: 2    AEVRVSDPSKLHLKKELTQIRKAARVLRDPGTTSSWKSPL-NSSRSVLAAVPGGASSSLN 61
            A + VSD S+LHLKKELTQIRKAARVLRDPGTTSSW+SPL +SSRSV AA    A+++  
Sbjct: 11   AAMGVSDRSQLHLKKELTQIRKAARVLRDPGTTSSWRSPLASSSRSVGAAAAAAAAAAAA 70

Query: 62   KNLESETRRHSGQSQLDAIVPPRNENRNPKDKKIYLYNWKSHKSS----------SEKSV 121
                S T  ++G S       P     N  DK+++L+NWKS KSS           +   
Sbjct: 71   SATTSSTWNNNGNS-----TTPSGNRNNGSDKRVFLHNWKSSKSSRNNDNDDDDYGDGDY 130

Query: 122  IHQKEDRDGNNGTNDGSYSVPGLSLDDSLSDARN--GGDSKSDTYLGDLCSSMVFRCGDA 181
                +D D ++   D S SV  LS+DDSLSDAR    GDS+SDT      SSM+ R   A
Sbjct: 131  DDDDDDDDDDDDGIDASSSVAALSVDDSLSDARTVADGDSRSDTQTYSRSSSMMLRRRYA 190

Query: 182  NLVSYGGPLAKRASAVKKKSKKHCSHLDVLSRHRQKGPVLGRKLL------EGHPSLSIS 241
            +L+    P  K     KK SKK  +H D+LS+++QK  +LGR L+      EGHPS+++ 
Sbjct: 191  HLL----PPVKNT---KKTSKKTDTHSDLLSKYQQKELILGRNLVSSRKSVEGHPSMAVR 250

Query: 242  F--SQDDSIEQSDDTEDYSNSEDFRRYSAASPLLLKLHPS--AKLLRNH--RKEDSSYSY 301
               ++DD ++QSDDTEDY NSED RR S ASPLL KL     +K  R++  RKEDSSYSY
Sbjct: 251  SGRTRDDLVDQSDDTEDYCNSEDLRRISGASPLLSKLKKKNWSKFRRDNSIRKEDSSYSY 310

Query: 302  STPALSTSSYNRYVNNNPSTVGSWEGTTTSINDADDEVDDQLDFPGRQGCGIPCYWSKRT 361
            STPALSTSSYNRY   NPSTVGSW+GTTTS+ND DDEVDD L+FPGRQGCGIPCYWSKRT
Sbjct: 311  STPALSTSSYNRYHVRNPSTVGSWDGTTTSMNDGDDEVDDHLEFPGRQGCGIPCYWSKRT 370

Query: 362  PKHRGVCGGCCSPSLSDTWRRKGSSILFGSQSIYSRRK--SLNSSNRRLTSGSARGVLPL 421
            PKH+ + G CCSPSLSDT RRKGS I  GSQ+IY RR+  S  S  +R+ S SA+GVLPL
Sbjct: 371  PKHKSMYGSCCSPSLSDTLRRKGSIIFCGSQNIYPRRRQSSSGSHKQRIASRSAQGVLPL 430

Query: 422  LTNSADGRVGSSIGTGRSDDELSTNFGELDLEALSRLDGRRWS-SCRSHEGLEIVALNGE 481
            LTNS +GR GSS+GTGRSDDELSTNFGELDLEALSRLDGRRWS SCRS EGLEIV LNG 
Sbjct: 431  LTNSGEGRGGSSLGTGRSDDELSTNFGELDLEALSRLDGRRWSSSCRSQEGLEIVTLNGG 490

Query: 482  VEEGSTPESTTSFSQKYRPMFFNELIGQNIVVQSLINAISRGRIAPVYLFQGPRGTGKTT 541
             EE  +PE+  SFSQKY+PMFF EL+GQNIVVQSLINAI RGRIAPVYLFQGPRGTGKT+
Sbjct: 491  GEEEGSPENIRSFSQKYKPMFFGELVGQNIVVQSLINAIERGRIAPVYLFQGPRGTGKTS 550

Query: 542  AARIFAAALNCLAPEENKPCGYCRECTDFMSGKQKDLLEIDGTNRKGIDRIRYQLKKLSS 601
            AARIF A+LNCLAP+E KPCGYCREC+DF+SGK KDLLE+DGTN+KGID++RY LK LS 
Sbjct: 551  AARIFTASLNCLAPDETKPCGYCRECSDFVSGKNKDLLEVDGTNKKGIDKVRYLLKTLSM 610

Query: 602  GSSSAFLRYKVFLIDECHLLPSKAWLTFLKFFEEPPQRVVFIFITTDLDSIPRTIQSRCQ 661
               SA  RYKVF+IDECHLLPSK WL FLK+ EEPPQRVVFIFITTDLD++PRTIQSRCQ
Sbjct: 611  APPSASSRYKVFVIDECHLLPSKTWLAFLKYLEEPPQRVVFIFITTDLDNVPRTIQSRCQ 670

Query: 662  KYIFNKIKDCDMVERLKRISAEENLDADLDALDLIAMNADGSLRDAETMLEQLSLLGKRI 721
            KY+FNKIKD D+V RL++ISAEENLD + DAL+LIA+NADGSLRDAETML+QLSLLGKRI
Sbjct: 671  KYLFNKIKDSDIVARLRKISAEENLDVETDALELIALNADGSLRDAETMLDQLSLLGKRI 730

Query: 722  TISLVNELVSTVPILSLSCYLVGIVSDEKLLELLALAMSSNTAETVKRARELMDSGVDPL 781
            + SLVNE             LVG+VSDEKLLELL LAMSS+TAETVKRARELMDSGVDP+
Sbjct: 731  STSLVNE-------------LVGVVSDEKLLELLELAMSSDTAETVKRARELMDSGVDPM 790

Query: 782  VLMSQLASLIMDIIAGTYNIIDPKDSASIFCGRSLSETEVERLKHALKFLSEAEKQLRVS 841
            VLMSQLASLIMDIIAGTYNI D K   S F  R+L+E E+ERLKHALK LSEAEKQLRVS
Sbjct: 791  VLMSQLASLIMDIIAGTYNINDVKHD-SFFGDRNLTEAELERLKHALKILSEAEKQLRVS 850

Query: 842  SERSTWFTATLLQLGSISSLDFTPTGSNRRQSCKTTDDDPSTTSNGTIGYKQKSFSHLIP 901
            SERSTWFTATLLQLGS+ S D T + S RR SCKTT+DD S+ S     YKQ    +++ 
Sbjct: 851  SERSTWFTATLLQLGSMPSPDLTHSCS-RRHSCKTTEDDSSSASREAATYKQLEGQYMLH 910

Query: 902  KLGSPASLCNLKNGNYNNQGDLSPMVDSLSNNPKPTHKQFME-GKNSFSRDDATLRNMVF 961
            K  S ASL    NGN N+Q D     +    N K +H Q ME G ++ S D+    N++ 
Sbjct: 911  KSTSHASLQKTLNGNSNHQRDSLSRKNGFGFNTKLSHGQIMESGASTPSHDEDMAGNVIL 970

Query: 962  RCKNSEKLDNIWVHCIERCHSKTLRQLLYAYGKLLSLSESEDTLIAYVAFEDADIKSRAE 1021
            RC NSEKL+++W  CIERCHSKTLRQLL+++GKL+S+SE+E  L+AYVAFED  IKSRAE
Sbjct: 971  RCVNSEKLEDVWAQCIERCHSKTLRQLLHSHGKLVSISEAEGVLVAYVAFEDGSIKSRAE 1030

Query: 1022 RFLSSITNSMEMVLRCNVEVRIILLPDGETSINGMTAAKSSEGVEHELVDKERKIANLNA 1081
            RF+SSITNSME+VLR NVEVRI+ LP GE S+NG + A  S  V    +D+ERK    NA
Sbjct: 1031 RFVSSITNSMEVVLRRNVEVRIVHLPGGEASLNGPSPAHLSGTV--AAIDRERKRVGSNA 1090

Query: 1082 MEGYSSRSLILDGTYQATSDSSQLPSESNNQIDGSRDRRQEIPMQRIESIIREQRLETAW 1141
             +GYS+ SL LDGT+++TSDSS + +E N Q   +R+RRQEIPMQRIESIIR+QRLETAW
Sbjct: 1091 TDGYSNCSLFLDGTHKSTSDSSDVIAEGNAQTSATRERRQEIPMQRIESIIRDQRLETAW 1150

Query: 1142 LQAMEKGTPGSLSRLKPEKNQVLPQDGSYYKDQTEEMNSTGDSSRKWDDELNRELKVLKA 1201
            LQ  EKGTPGSLSRLKPEKNQVLPQDG YY+DQ E +NS   SS++W+D  N E+K+LK 
Sbjct: 1151 LQVAEKGTPGSLSRLKPEKNQVLPQDGIYYEDQMESLNSMRLSSQQWEDGSNHEVKILKV 1210

Query: 1202 NEELIAQKEQVGRRVDRYAISPSILHDGGMVGNANKDNLGYESSSAVGGCSGLFCWNNSK 1261
            N    AQK+Q GR+VDRY +SPS+LHD   VGN+NKDNLG ES S  GGCSG F   N+K
Sbjct: 1211 NSGRDAQKDQTGRKVDRYPMSPSLLHDSNFVGNSNKDNLGDESGSGKGGCSGFFRCYNTK 1270

Query: 1262 SHKRGKVRTN--HGRSRSG-RFSLFGECG-KSRNFGSRSRR 1270
              KRGKV+      + R G RFSLFGECG KSR   SR  R
Sbjct: 1271 PRKRGKVKGTAVAVQPRKGRRFSLFGECGKKSRKTESRHTR 1282

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
STI_ARATH | 0.0e+00 | 56.15 | Protein STICHEL OS=Arabidopsis thaliana GN=STI PE=1 SV=2
STIL1_ARATH | 9.5e-271 | 53.88 | Protein STICHEL-like 1 OS=Arabidopsis thaliana GN=At1g14460 PE=1 SV=1
STIL2_ARATH | 4.0e-88 | 47.55 | Protein STICHEL-like 2 OS=Arabidopsis thaliana GN=At4g24790 PE=2 SV=1
STIL4_ARATH | 3.2e-85 | 38.38 | Protein STICHEL-like 4 OS=Arabidopsis thaliana GN=At5g45720 PE=2 SV=1
STIL3_ARATH | 1.6e-81 | 40.97 | Protein STICHEL-like 3 OS=Arabidopsis thaliana GN=At4g18820 PE=3 SV=1
Match Name | E-value | Identity (%) | Description
A0A0A0L847_CUCSA | 0.0e+00 | 87.38 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G113330 PE=4 SV=1
A0A067K8J9_JATCU | 0.0e+00 | 62.88 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13362 PE=4 SV=1
A0A0D2VK62_GOSRA | 0.0e+00 | 61.95 | Uncharacterized protein OS=Gossypium raimondii GN=B456_011G090600 PE=4 SV=1
A0A0D2UR74_GOSRA | 0.0e+00 | 61.92 | Uncharacterized protein OS=Gossypium raimondii GN=B456_011G090600 PE=4 SV=1
A0A067DQP8_CITSI | 0.0e+00 | 61.97 | Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000818mg PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT2G02480.1 | 0.0e+00 | 56.15 | AAA-type ATPase family protein
AT1G14460.1 | 5.3e-272 | 53.88 | AAA-type ATPase family protein
AT4G24790.1 | 2.3e-89 | 47.55 | AAA-type ATPase family protein
AT5G45720.1 | 1.8e-86 | 38.38 | AAA-type ATPase family protein
AT4G18820.1 | 9.3e-83 | 40.97 | AAA-type ATPase family protein
Match Name | E-value | Identity (%) | Description
gi|659102544|ref|XP_008452189.1| | 0.0e+00 | 87.85 | PREDICTED: protein STICHEL [Cucumis melo]
gi|449431904|ref|XP_004133740.1| | 0.0e+00 | 87.38 | PREDICTED: protein STICHEL [Cucumis sativus]
gi|1000966104|ref|XP_015574794.1| | 0.0e+00 | 62.75 | PREDICTED: protein STICHEL [Ricinus communis]
gi|802640476|ref|XP_012078831.1| | 0.0e+00 | 62.88 | PREDICTED: protein STICHEL-like [Jatropha curcas]
gi|645223428|ref|XP_008218626.1| | 0.0e+00 | 64.18 | PREDICTED: protein STICHEL-like [Prunus mume]
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term | Definition
GO:0009360 | DNA polymerase III complex
Vocabulary: Molecular Function
Term | Definition
GO:0005524 | ATP binding
GO:0003887 | DNA-directed DNA polymerase activity
GO:0003677 | DNA binding
Vocabulary: Biological Process
Term | Definition
GO:0006260 | DNA replication
Vocabulary: INTERPRO
Term | Definition
IPR027417 | P-loop_NTPase
IPR012763 | DNA_pol_III_sug/sutau
IPR008921 | DNA_pol3_clamp-load_cplx_C
IPR003593 | AAA+_ATPase
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0071897 | DNA biosynthetic process
biological_process | GO:0006260 | DNA replication
cellular_component | GO:0009360 | DNA polymerase III complex
molecular_function | GO:0005524 | ATP binding
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0003887 | DNA-directed DNA polymerase activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG01g08830.1 | Cp4.1LG01g08830.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003593 | AAA+ ATPase domain | SMART | SM00382 | AAA_5 | coord: 497..641 score: 6.
IPR008921 | DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal | unknown | SSF48019 | post-AAA+ oligomerization domain-like | coord: 724..825 score: 4.5
IPR012763 | DNA polymerase III, subunit gamma/ tau | TIGRFAMs | TIGR02397 | TIGR02397 | coord: 465..827 score: 1.4E
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | G3DSA:3.40.50.300 |  | coord: 462..628 score: 2.8
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | unknown | SSF52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 489..688 score: 5.01
None | No IPR available | unknown | Coil | Coil | coord: 1033..1053 scor
None | No IPR available | GENE3D | G3DSA:1.10.8.60 |  | coord: 629..702 score: 5.0
None | No IPR available | GENE3D | G3DSA:1.20.272.10 |  | coord: 716..827 score: 7.
None | No IPR available | PANTHER | PTHR11669 | REPLICATION FACTOR C / DNA POLYMERASE III GAMMA-TAU SUBUNIT | coord: 870..1017 score: 1.1E-254; coord: 450..842 score: 1.1E
None | No IPR available | PANTHER | PTHR11669:SF0 | PROTEIN STICHEL-RELATED | coord: 870..1017 score: 1.1E-254; coord: 450..842 score: 1.1E
None | No IPR available | PFAM | PF13177 | DNA_pol3_delta2 | coord: 480..640 score: 8.4
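
The `coord:` fields above give the residue range each signature covers on the protein. As a rough sketch, the ranges can be pulled out of an alignment cell and turned into span lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state. The helper names `parse_coords` and `span_length` are illustrative only.

```python
import re

# Matches "coord: START..END"; a cell may contain more than one match.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(cell: str) -> list[tuple[int, int]]:
    """Return every (start, end) residue range found in an alignment cell."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(cell)]


def span_length(start: int, end: int) -> int:
    """Length of a range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Cells copied from the SMART (SM00382) and PANTHER (PTHR11669) rows.
smart_cell = "coord: 497..641"
panther_cell = "coord: 870..1017 score: 1.1E-254 coord: 450..842"

smart_ranges = parse_coords(smart_cell)
panther_ranges = parse_coords(panther_cell)

for start, end in smart_ranges + panther_ranges:
    print(f"{start}..{end}: {span_length(start, end)} residues")
```

Scores are left out on purpose: several of them are truncated in this record, so only the coordinate ranges can be read reliably.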