Cp4.1LG01g20370 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g20370
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DIS3-like exonuclease 2
Location: Cp4.1LG01 : 17318154 .. 17324404 (+)
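The location value can be split into sequence ID, start, end, and strand. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates (as in GFF), which this record does not state.

```python
import re

def parse_location(value: str):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", value
    )
    if m is None:
        raise ValueError(f"unrecognized location: {value!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG01 : 17318154 .. 17324404 (+)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, strand, span)
```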
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
AGAAAGAAAGAAAAGGAGATGAAGAGAAACGGAAAGAAGTGGAAAAAGGGCGCGAAGTGGGCGCCATTGTCTTCGCCGACCGTTCATCGCGTCCCTTTCTTCTTCTTCTTCCACAGTCTTGCCAACACTAGCGATTTCTTTCTCAGCTCTCTTTCATGGCTGCTGTTTGATTCATTTATCATTTTCTTCTGTTGTGTTCTTGATATCCTTTTCCGAAGTCGGAAGATTGTCTTGTTATATCTTTCAAATTCGTTTGGATTTGAGGTTTTTTTTGTGGATTGTTGCCATTTCTTGTTCCGTTTGATTCTTCTTGCGTTTTCATAATTATGAGGGGATCTGTTGAGCAATCCACTTCCGAGAGGAACGAGGATGGCGGAAAGGAGTGGAAGAAGAAGGATCGTTCTAATTGGCGGTCTGAGCAGAAGGCCTCTATTTGGAGGGCAGGTGTTCTTCTCTTTTTAACTTGTTTTGCTTGATTTTAGTCTTAATTCGTGGGATTTTGCCGAATATGGATGTACTTTGATATATGGAAATTGTGATGGAAATTGTGAGATCTCAAGTCGATTGGAGAGGAGAGCGAAACATTCTTTATAAGTGTGTGGAAACTTCTCCCTTGATGCATTTTAAAAACATTGAGTGGAAGCATGAAAGGGAAAGATTCTCCAAAGAAGATAATATTTACTAGCGGTGGACTTGAACCATTACACATGGTATCAGAGCCAGACATTGGGTGATGTGTAAGCGAGGAGGCTGAACCCCGAAAAGCTGGACACAAGGCAGTGTGTCAACAAGAATGCTAGGCCCTGAAGGGGGTGGATTGTGAGATTCCACATTGATTGGGGAGGAGAACGAAACATTCATTATAAGGGCGTGAAAACTTCTCCCTAACAAGATATGTTTTAAAAACCTTGAAGGAAAGCTCTCTCCCTAGCATACTTATTTTAAAAATCTTGAGGGACAGTTCGAAAAGAAAAGTCTAAAAGAAAAGTCTAAAAGGACAATATTCGCAAGCAATGGGCTTGAACTTGCTATTTGTTGTTAAGTATTGCATTGAGACAAATTTCTCAGGGTGCGTTGATATGTGCTTATTCTAAATCAATAGTTTCAGATGTGTGTTTTGTTGCTCGGGAAGGTGCTTTTATGGTTTACTTTTGTTTCCCCTAAACCATCTTTCCACTCTAAACATGCCTTAACTTACAAGATATGTTGCCTCTTTTGGCAGTGTCTTGCAGTTCAGTCAATGAAATACCGAGGGAAGCATCAGAGTGCATGGAAAATTGTAGAATTGATGCTAACTTAACAGAACCCTCATTTTATTCTTCTTTGACTCAAGATGAATGTCAATCAAATCAGCCAACCGAGCCTGGTTTCACCGGAAGAAATAAGCTTTCCTTTAGTTCTTTGTCCCCTCTGCATATTGGCCAACAAGCAGAATGGTCACAAAATTTGAGAAACCAGCATCATTCCATGGATACTGGTCGATGGGCTATAACATCATGTCCGGAACAGATTGCCAGTGGAAGTATGCCTTGGATATCTATGAACCAGCATTCGCCTCCTGCTGATGTAAACAGTCAAAGGAAATATTTTACGTCTCACTGGCCCCTGGATGATGTTAATAGAGGCTTACAGGTCTGTTTGACATGCTGAATTTTCTTGATGCTAGTTGGTAGGGTGAGTTCTTTGTTCAAATGATATCAAATTTTGTGATAGGATGCTGCAATTTAGCTGTTGTTTCCACTTTTGCAGAAAGGCGACATATTTAAAGCTTTGTTTCGTATGAATGCTCACTATAGAGTTGAGGTTTGTGTCAAATCTCTCTGAATATTAGCTACTGAGTATGTTTGTTCGCATATTTCTTTCGAGTTTTGTTTGAACACGATGTATCTATTTTGTAGGCCTACTGCAAAATTGATGGAATACCAGTTGATGTCTTAATCTATGGAAGTGCGTCTCAGAACAGAGCTGTAAATACTCTCTCTTTCATGCACACATACAAA
CACAAAATGGATGAGTTATCCAAGAAACTCGAAATTGGTTGTAGGGTTTCTAGAAAGCTTGAAACATTTTGGTTCATTTTTAGTAAAATGAGTTGTGTGCTAGCAAGGATGCCAAGCCTCCAAGGAGGGTGGATTGTGAGATCTCACATCAGTTGGAGAGGGGAACGAAACATTCCTTATAAAGGTGTAGAAACCTCTCCCTGGTAGATGCATTTTAAAACCTTGAGGGAAACCTTTGAAGGGAAAGCCCAAAGGGGACAATATCGGCTAGTGGTGGGCTTGAGCCGTTACAAATGGTATCAGTGCCAGACACCGGTGTGCCAATGAGGACGCTAGGCCTCCAAGGAGGGTGGATTGTGAGATCCCTCATCGGTTGGAGAAAGGAACGAAGCATTTGGAAATCTCTCCCTAGTAGACGTGTTTTAAAACCTTAAGGGAAAGCCCAAAGAGACAATATTTGCTAGCGGTGGTGGGCTTGAGCTGTTATAAATGGTATCAAAGCTAGACATCGAGCCGTGTACCAGCGAAGATGCTGGGCCTCCAAGAAGGGTGGATTGTGAGATCCCACATCAGTTGGAGAAGGGAACGAAGTATTTCTTATAAGGGTGTGGAAACCTCTTTCTAATGGATGCGTTTTAAAACCTTGAAGAAAAGCCCCAAAAAGGAGCTAGAAATGGACTTGGGATGTACACAGTGAACTAAAAAATGCTTGCATCTTGTGTTTGACTGCTGTAGAATTACGATGAGAGCAAATCTAAATGTACATATGGGGAATTTTTTTATTGATTTGGAACCATTGTCCTGATCTAGGTGGAAGGGGACATAGTTGCAATGAAGATGAATCCTTTTTCGTTATGGAGTAGGATGAAAGGCACTAGTGAGGCCCATGACAATATGCATTCACTGGAAGATGCCAATGTTATGTCCGACAGTTTTTGGAGTTCCCCTTCAGTTGATCCCATTGGCAGGATCTGTGCAGTGATTGATTTATTTCCTACAAAAAGACCAACTGGTAAGGTAGTAGCCATCCTAAAGAAGTCTCGGCAGCGAGGAACTATTGTTGGCCTTCTTAATGTCAAGAAATTCCTCTCCTTTCATGATTGTGGATATGTCCAATTGATGCCTAACGATGCGAGATTCCCGACAATGATGGTTTTTGCAGTCGATTTACCCGACTGCATCAAGAAGAGATTGGACAATGGTGATGCCACAGTTGAAAGTGAGCTGGTGGCTGCACGGATTGATGAATGGCTTGAAGAGAGTTCAGCTCCAAAAGCACTTGTCATGCATGTTCTAGGACGGGGAAGTAGAATAGAGTCTCATATTGATGCTATTTTATTTCAAAATGCAATTCTTACATGTGAATTCTCTCGTGATTCATTGTCTTGTCTCCCTCATACCCCTTGGAAGATCCCACAAGAGGAACTTCAATGCAGAAGAGATCTAAGAAACTTATGCATATTTACTATTGATCCTCCATCTGCCTTGGATCTTGATGATGCTTTCTCGGTTCAAAAGTTAGCCAATGGTATCTTTAGAGTAGGCATTCATGTTGCTGATGTATCACATTTTGTATTGCCAGACACTGCCTTAGATAAAGAGGCTCGAATCCGATCGACAAGTGTTCATCTTCTACGACGCAAGATACCAATGTTGCCACCATTACTCTCTGAGAATATCGGTTCACTTAACCCCGGAGTGGATCGACTTGCGTTTTCATTGTTTTTGGACATCAACCATTGTGGAGATGTCGAAGATTGTTGGATTGGCCGTACTGTGATATGCTCTTGCTGCAAACTCTCATATGGACATGCTCAGGACATTATTGACAGTTCAAAGGTTTTAGGACATTGTGTTCCCCAGTTGCATGGCCAGTCTACATGGCTTGATATCATTTCATCTGTTAGAACTCTAAATGAAATTTCTAAAACTCTAAAGGAGAAGAGATTTAGAGATGGGGCTTTGAGGATTGAGAATCCCAAAATAGTGTTTTTATATGA
TGAATTTGGAATTCCATATGATAGTACGTTTCATGAGCGAAAGGATTCGAATTTTCTTGTCGAGGAGTTCGTGCTTTTGGCAAACAGAACCGTGGCCGAAGTGATATCCAGAACTTTTCCTGATAGAGCATTATTGAGAAGGCATCCTAAACCTATATTCAAGAAACTTACAGAATTTGAATCATTTTGTTCTAAGCAGGGCTTTGAACTGGACACATCCTCTTCATTTCTGTTCCAACAGTCATTAGAGCAGATACGGATGAAATTTCATGATGATCCTTTGCTGTTTGATGCTGTGATATCCTATGCTACAAGGTCTACGCAGTTAGCGAGTTATTTCTGCAATGGAGAGCTAAAAGATGGTGAAAATGGGAGCTATTATTCACTGGCTGTCCCTTGGTACACACATTTCACGTCACCGTTGCGACGGTATGCTGATATCGTCGTCCACCGCACACTTGCAGCAGCTGTTGAGGCTGAGGAGTTGTATTTGAAGCACCAAAGTGATGAACGGATGAGATGTTTTACTGGCATGTATTTTGACAAAGATGCTGCTGACTCCTTAGAAGGTAGAGAAGCGTTATCATCTGCAGCATTGAGGCATGGAGTTCCATGCTCTAAATTACTTTCAGATGTTGCTGTGCAATGCAATAACAGAAAATTGGCTAGTAAGCATGCTGCGGATGCTTGTGATAAGCTCTACATGTGGGCTCTTTTGAAAGAAAAACAGGTACTCTTTTGTGATCTTTTCTGATATAATTTGAACACTATCTTGTTTTTGTTTGTTTTACAAGTATTAAGGTTCAATGGATGGTAGATTCATCTGTGGCTTCTTTGTATTTTCAGATTTTGTTCTCAGATGCAAAGGTATTGGGCCTTGGTTCAAAATTTATGACTCTGTATATACAGAAGCTGGCGGTGAGTGTTCTTTGAACCTGTAGTATTTAATTTATTTCATTCTGAGGTGTGGCTGCGTATTCAATGGTTATACATAGTATCAGTAAACGACTATTTAAAGTAAAAAGCTGATATATAGAGATAGTTATATATAAAGTTATATATAATAAACTAGGCTATATATATATCAGTGGTTTGAAAATGTACTCACGTACCAAGCTGGTAGAGTTATATATGATAGACAATTACATAGTAGATGGTTATATCGTAAAGTTATATACAATAAACTGAACTCCATATATATATATCACTGTCGTGCTTTAAAAATGTACTTTCTATCTGTATTCTTTCTTACTCTGCTTACCAAGCGGATAGATAGTTATATATAATAGATGGTTATATCTAGTAAAATTATATATAATAAACTGAACTATATATATATATATATACCTATGATGCTTTGAAAATGTACTTTCTATCTGTGTTCTCTTTTTACTCTACATACTAAGCAGAGCTATATATGGTTGACAGACAGTTATATATAATAGATGGTTATACCTAGTAAAGTCAGAGCTATATATGGTTGACAGACAGTTATATATAATAGATGGTTATACCTAGTAAAGTCAGAGCTATATATGGTTGACAGACAGTTATATATAATAGATGGTTATACCTAGTAAAGTCAGAGCTATATATGGTTGACAGACAGTTATATATAATAGATAGTTATATCTAGTAAAGTTATATATAGTAAACTGAACCATATATATGCCTCTGAAGTGCACTGAAAATGTACTTTCGGTCTCATTCTCTCTTATTTTTGTACATACATGAAGTCATCAATATAAACCAATATTCCTCCCGGCATTTTCTCTCTCTTGTTTTTTGTATTCATCTAGATGAGTGTTCGCATTATTGTGCGATCCTAGCCGTCTGAATTGGTATTCATTATTTTTCTCATGCCATACATTATCAGATTGAGCGGAGAATATACTACGAGGAAGTCAAAGATTTGGCAAATGAATGGCTCGATGCTACATCTACGTTGGTGCTTAGTTTTCCTGACACTAGGCGCTCTCATGGGAGTAGAGATTCAATTA
AGTGGAAGGCATTGGAAGATGTTGCATTGGTTATTTCCCCTTGCGACCTGACTGTTCAACAGAGTACGCTTGAAGGAGAAGCAAGCACAGAGGGTGCTGCTGCTTCAGATAGTGGAATCATCGAGCCTGCAGTTTTCCCCCTCACAGTGCAGCTCCTTTCAACGCTACCAGTAGCACTTCATGCCGTTGGTGGGGACGATGGAGCCATTGAGATAGGCGTTAGGCTGTACATGAGCTCATATTTAAGGTAA

mRNA sequence

AGAAAGAAAGAAAAGGAGATGAAGAGAAACGGAAAGAAGTGGAAAAAGGGCGCGAAGTGGGCGCCATTGTCTTCGCCGACCGTTCATCGCGTCCCTTTCTTCTTCTTCTTCCACAGTCTTGCCAACACTAGCGATTTCTTTCTCAGCTCTCTTTCATGGCTGCTGTTTGATTCATTTATCATTTTCTTCTGTTGTGTTCTTGATATCCTTTTCCGAAGTCGGAAGATTGTCTTGTTATATCTTTCAAATTCGTTTGGATTTGAGGTTTTTTTTGTGGATTGTTGCCATTTCTTGTTCCGTTTGATTCTTCTTGCGTTTTCATAATTATGAGGGGATCTGTTGAGCAATCCACTTCCGAGAGGAACGAGGATGGCGGAAAGGAGTGGAAGAAGAAGGATCGTTCTAATTGGCGGTCTGAGCAGAAGGCCTCTATTTGGAGGGCAGTGTCTTGCAGTTCAGTCAATGAAATACCGAGGGAAGCATCAGAGTGCATGGAAAATTGTAGAATTGATGCTAACTTAACAGAACCCTCATTTTATTCTTCTTTGACTCAAGATGAATGTCAATCAAATCAGCCAACCGAGCCTGGTTTCACCGGAAGAAATAAGCTTTCCTTTAGTTCTTTGTCCCCTCTGCATATTGGCCAACAAGCAGAATGGTCACAAAATTTGAGAAACCAGCATCATTCCATGGATACTGGTCGATGGGCTATAACATCATGTCCGGAACAGATTGCCAGTGGAAGTATGCCTTGGATATCTATGAACCAGCATTCGCCTCCTGCTGATGTAAACAGTCAAAGGAAATATTTTACGTCTCACTGGCCCCTGGATGATGTTAATAGAGGCTTACAGAAAGGCGACATATTTAAAGCTTTGTTTCGTATGAATGCTCACTATAGAGTTGAGGCCTACTGCAAAATTGATGGAATACCAGTTGATGTCTTAATCTATGGAAGTGCGTCTCAGAACAGAGCTGTGGAAGGGGACATAGTTGCAATGAAGATGAATCCTTTTTCGTTATGGAGTAGGATGAAAGGCACTAGTGAGGCCCATGACAATATGCATTCACTGGAAGATGCCAATGTTATGTCCGACAGTTTTTGGAGTTCCCCTTCAGTTGATCCCATTGGCAGGATCTGTGCAGTGATTGATTTATTTCCTACAAAAAGACCAACTGGTAAGGTAGTAGCCATCCTAAAGAAGTCTCGGCAGCGAGGAACTATTGTTGGCCTTCTTAATGTCAAGAAATTCCTCTCCTTTCATGATTGTGGATATGTCCAATTGATGCCTAACGATGCGAGATTCCCGACAATGATGGTTTTTGCAGTCGATTTACCCGACTGCATCAAGAAGAGATTGGACAATGGTGATGCCACAGTTGAAAGTGAGCTGGTGGCTGCACGGATTGATGAATGGCTTGAAGAGAGTTCAGCTCCAAAAGCACTTGTCATGCATGTTCTAGGACGGGGAAGTAGAATAGAGTCTCATATTGATGCTATTTTATTTCAAAATGCAATTCTTACATGTGAATTCTCTCGTGATTCATTGTCTTGTCTCCCTCATACCCCTTGGAAGATCCCACAAGAGGAACTTCAATGCAGAAGAGATCTAAGAAACTTATGCATATTTACTATTGATCCTCCATCTGCCTTGGATCTTGATGATGCTTTCTCGGTTCAAAAGTTAGCCAATGGTATCTTTAGAGTAGGCATTCATGTTGCTGATGTATCACATTTTGTATTGCCAGACACTGCCTTAGATAAAGAGGCTCGAATCCGATCGACAAGTGTTCATCTTCTACGACGCAAGATACCAATGTTGCCACCATTACTCTCTGAGAATATCGGTTCACTTAACCCCGGAGTGGATCGACTTGCGTTTTCATTGTTTTTGGACATCAACCATTGTGGAGATGTCGAAGATTGTTGGATTGGCCGTACTGTGATATGCTCTTGCTGCAAACTCTCATATGGACATGCTCAGGACATTATTGACAGT
TCAAAGGTTTTAGGACATTGTGTTCCCCAGTTGCATGGCCAGTCTACATGGCTTGATATCATTTCATCTGTTAGAACTCTAAATGAAATTTCTAAAACTCTAAAGGAGAAGAGATTTAGAGATGGGGCTTTGAGGATTGAGAATCCCAAAATAGTGTTTTTATATGATGAATTTGGAATTCCATATGATAGTACGTTTCATGAGCGAAAGGATTCGAATTTTCTTGTCGAGGAGTTCGTGCTTTTGGCAAACAGAACCGTGGCCGAAGTGATATCCAGAACTTTTCCTGATAGAGCATTATTGAGAAGGCATCCTAAACCTATATTCAAGAAACTTACAGAATTTGAATCATTTTGTTCTAAGCAGGGCTTTGAACTGGACACATCCTCTTCATTTCTGTTCCAACAGTCATTAGAGCAGATACGGATGAAATTTCATGATGATCCTTTGCTGTTTGATGCTGTGATATCCTATGCTACAAGGTCTACGCAGTTAGCGAGTTATTTCTGCAATGGAGAGCTAAAAGATGGTGAAAATGGGAGCTATTATTCACTGGCTGTCCCTTGGTACACACATTTCACGTCACCGTTGCGACGGTATGCTGATATCGTCGTCCACCGCACACTTGCAGCAGCTGTTGAGGCTGAGGAGTTGTATTTGAAGCACCAAAGTGATGAACGGATGAGATGTTTTACTGGCATGTATTTTGACAAAGATGCTGCTGACTCCTTAGAAGGTAGAGAAGCGTTATCATCTGCAGCATTGAGGCATGGAGTTCCATGCTCTAAATTACTTTCAGATGTTGCTGTGCAATGCAATAACAGAAAATTGGCTAGTAAGCATGCTGCGGATGCTTGTGATAAGCTCTACATGTGGGCTCTTTTGAAAGAAAAACAGATTTTGTTCTCAGATGCAAAGGTATTGGGCCTTGGTTCAAAATTTATGACTCTGTATATACAGAAGCTGGCGGAAGTCAAAGATTTGGCAAATGAATGGCTCGATGCTACATCTACGTTGGTGCTTAGTTTTCCTGACACTAGGCGCTCTCATGGGAGTAGAGATTCAATTAAGTGGAAGGCATTGGAAGATGTTGCATTGGTTATTTCCCCTTGCGACCTGACTGTTCAACAGAGTACGCTTGAAGGAGAAGCAAGCACAGAGGGTGCTGCTGCTTCAGATAGTGGAATCATCGAGCCTGCAGTTTTCCCCCTCACAGTGCAGCTCCTTTCAACGCTACCAGTAGCACTTCATGCCGTTGGTGGGGACGATGGAGCCATTGAGATAGGCGTTAGGCTGTACATGAGCTCATATTTAAGGTAA

Coding sequence (CDS)

ATGAGGGGATCTGTTGAGCAATCCACTTCCGAGAGGAACGAGGATGGCGGAAAGGAGTGGAAGAAGAAGGATCGTTCTAATTGGCGGTCTGAGCAGAAGGCCTCTATTTGGAGGGCAGTGTCTTGCAGTTCAGTCAATGAAATACCGAGGGAAGCATCAGAGTGCATGGAAAATTGTAGAATTGATGCTAACTTAACAGAACCCTCATTTTATTCTTCTTTGACTCAAGATGAATGTCAATCAAATCAGCCAACCGAGCCTGGTTTCACCGGAAGAAATAAGCTTTCCTTTAGTTCTTTGTCCCCTCTGCATATTGGCCAACAAGCAGAATGGTCACAAAATTTGAGAAACCAGCATCATTCCATGGATACTGGTCGATGGGCTATAACATCATGTCCGGAACAGATTGCCAGTGGAAGTATGCCTTGGATATCTATGAACCAGCATTCGCCTCCTGCTGATGTAAACAGTCAAAGGAAATATTTTACGTCTCACTGGCCCCTGGATGATGTTAATAGAGGCTTACAGAAAGGCGACATATTTAAAGCTTTGTTTCGTATGAATGCTCACTATAGAGTTGAGGCCTACTGCAAAATTGATGGAATACCAGTTGATGTCTTAATCTATGGAAGTGCGTCTCAGAACAGAGCTGTGGAAGGGGACATAGTTGCAATGAAGATGAATCCTTTTTCGTTATGGAGTAGGATGAAAGGCACTAGTGAGGCCCATGACAATATGCATTCACTGGAAGATGCCAATGTTATGTCCGACAGTTTTTGGAGTTCCCCTTCAGTTGATCCCATTGGCAGGATCTGTGCAGTGATTGATTTATTTCCTACAAAAAGACCAACTGGTAAGGTAGTAGCCATCCTAAAGAAGTCTCGGCAGCGAGGAACTATTGTTGGCCTTCTTAATGTCAAGAAATTCCTCTCCTTTCATGATTGTGGATATGTCCAATTGATGCCTAACGATGCGAGATTCCCGACAATGATGGTTTTTGCAGTCGATTTACCCGACTGCATCAAGAAGAGATTGGACAATGGTGATGCCACAGTTGAAAGTGAGCTGGTGGCTGCACGGATTGATGAATGGCTTGAAGAGAGTTCAGCTCCAAAAGCACTTGTCATGCATGTTCTAGGACGGGGAAGTAGAATAGAGTCTCATATTGATGCTATTTTATTTCAAAATGCAATTCTTACATGTGAATTCTCTCGTGATTCATTGTCTTGTCTCCCTCATACCCCTTGGAAGATCCCACAAGAGGAACTTCAATGCAGAAGAGATCTAAGAAACTTATGCATATTTACTATTGATCCTCCATCTGCCTTGGATCTTGATGATGCTTTCTCGGTTCAAAAGTTAGCCAATGGTATCTTTAGAGTAGGCATTCATGTTGCTGATGTATCACATTTTGTATTGCCAGACACTGCCTTAGATAAAGAGGCTCGAATCCGATCGACAAGTGTTCATCTTCTACGACGCAAGATACCAATGTTGCCACCATTACTCTCTGAGAATATCGGTTCACTTAACCCCGGAGTGGATCGACTTGCGTTTTCATTGTTTTTGGACATCAACCATTGTGGAGATGTCGAAGATTGTTGGATTGGCCGTACTGTGATATGCTCTTGCTGCAAACTCTCATATGGACATGCTCAGGACATTATTGACAGTTCAAAGGTTTTAGGACATTGTGTTCCCCAGTTGCATGGCCAGTCTACATGGCTTGATATCATTTCATCTGTTAGAACTCTAAATGAAATTTCTAAAACTCTAAAGGAGAAGAGATTTAGAGATGGGGCTTTGAGGATTGAGAATCCCAAAATAGTGTTTTTATATGATGAATTTGGAATTCCATATGATAGTACGTTTCATGAGCGAAAGGATTCGAATTTTCTTGTCGAGGAGTTCGTGCTTTTGGCAAACAGAACCGTGGCCGAAGTGATATCCAGAACTTTTCCTGATAGAGCATTATTGAGAAGGCATCCTAAACCTATATT
CAAGAAACTTACAGAATTTGAATCATTTTGTTCTAAGCAGGGCTTTGAACTGGACACATCCTCTTCATTTCTGTTCCAACAGTCATTAGAGCAGATACGGATGAAATTTCATGATGATCCTTTGCTGTTTGATGCTGTGATATCCTATGCTACAAGGTCTACGCAGTTAGCGAGTTATTTCTGCAATGGAGAGCTAAAAGATGGTGAAAATGGGAGCTATTATTCACTGGCTGTCCCTTGGTACACACATTTCACGTCACCGTTGCGACGGTATGCTGATATCGTCGTCCACCGCACACTTGCAGCAGCTGTTGAGGCTGAGGAGTTGTATTTGAAGCACCAAAGTGATGAACGGATGAGATGTTTTACTGGCATGTATTTTGACAAAGATGCTGCTGACTCCTTAGAAGGTAGAGAAGCGTTATCATCTGCAGCATTGAGGCATGGAGTTCCATGCTCTAAATTACTTTCAGATGTTGCTGTGCAATGCAATAACAGAAAATTGGCTAGTAAGCATGCTGCGGATGCTTGTGATAAGCTCTACATGTGGGCTCTTTTGAAAGAAAAACAGATTTTGTTCTCAGATGCAAAGGTATTGGGCCTTGGTTCAAAATTTATGACTCTGTATATACAGAAGCTGGCGGAAGTCAAAGATTTGGCAAATGAATGGCTCGATGCTACATCTACGTTGGTGCTTAGTTTTCCTGACACTAGGCGCTCTCATGGGAGTAGAGATTCAATTAAGTGGAAGGCATTGGAAGATGTTGCATTGGTTATTTCCCCTTGCGACCTGACTGTTCAACAGAGTACGCTTGAAGGAGAAGCAAGCACAGAGGGTGCTGCTGCTTCAGATAGTGGAATCATCGAGCCTGCAGTTTTCCCCCTCACAGTGCAGCTCCTTTCAACGCTACCAGTAGCACTTCATGCCGTTGGTGGGGACGATGGAGCCATTGAGATAGGCGTTAGGCTGTACATGAGCTCATATTTAAGGTAA

Protein sequence

MRGSVEQSTSERNEDGGKEWKKKDRSNWRSEQKASIWRAVSCSSVNEIPREASECMENCRIDANLTEPSFYSSLTQDECQSNQPTEPGFTGRNKLSFSSLSPLHIGQQAEWSQNLRNQHHSMDTGRWAITSCPEQIASGSMPWISMNQHSPPADVNSQRKYFTSHWPLDDVNRGLQKGDIFKALFRMNAHYRVEAYCKIDGIPVDVLIYGSASQNRAVEGDIVAMKMNPFSLWSRMKGTSEAHDNMHSLEDANVMSDSFWSSPSVDPIGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHDCGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQDIIDSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQSDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLYMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLAEVKDLANEWLDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTEGAAASDSGIIEPAVFPLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSYLR
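The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the record does not say which table was used, and the helper name `translate` is hypothetical. It is checked here only against the first nine codons of the CDS.

```python
# Standard genetic code in the conventional compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 bases of the CDS shown above.
print(translate("ATGAGGGGATCTGTTGAGCAATCCACT"))  # MRGSVEQST
```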
BLAST of Cp4.1LG01g20370 vs. Swiss-Prot
Match: DI3L2_ARATH (DIS3-like exonuclease 2 OS=Arabidopsis thaliana GN=SOV PE=1 SV=1)

HSP 1 Score: 805.8 bits (2080), Expect = 5.2e-232
Identity = 416/755 (55.10%), Positives = 536/755 (70.99%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKF--------------LSFH 327
            + ++C ++  FP KRPTG+VVA+++KS  R +IVGLL+VK +              LS  
Sbjct: 304  VDKLCGILSSFPHKRPTGQVVAVVEKSLVRDSIVGLLDVKGWIHYKESDPKRCKSPLSLS 363

Query: 328  DCGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKA 387
            D  YVQLMP D RFP ++V    LP  I+ RL+N D  +E+ELVAA+I +W E S  P A
Sbjct: 364  DDEYVQLMPADPRFPKLIVPFHVLPGSIRARLENLDPNLEAELVAAQIVDWGEGSPFPVA 423

Query: 388  LVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLC 447
             + H+ GRGS +E  I+AIL+QN++   +FS  SL+ LP  PW++P+EE+Q R+DLR+LC
Sbjct: 424  QITHLFGRGSELEPQINAILYQNSVCDSDFSPGSLTSLPRVPWEVPEEEVQRRKDLRDLC 483

Query: 448  IFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLR 507
            + TIDP +A DLDDA SVQ L  G FRVG+H+ADVS+FVLP+TALD EAR RSTSV+L++
Sbjct: 484  VLTIDPSTATDLDDALSVQSLPGGFFRVGVHIADVSYFVLPETALDTEARFRSTSVYLMQ 543

Query: 508  RKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQ 567
            RKI MLPPLLSEN+GSL+PG DRLAFS+  D+N  GDV D WIGRT+I SCCKLSY HAQ
Sbjct: 544  RKISMLPPLLSENVGSLSPGADRLAFSILWDLNREGDVIDRWIGRTIIRSCCKLSYDHAQ 603

Query: 568  DIID-SSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFL 627
            DIID  S V  +  P LHG   W D+  SV+ L+EIS TL++KRFR+GAL++EN K VFL
Sbjct: 604  DIIDGKSDVAENGWPALHGSFKWCDVTRSVKQLSEISTTLRQKRFRNGALQLENSKPVFL 663

Query: 628  YDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTE 687
            +DE G+PYD     RK SNFLVEEF+LLAN T AEVIS+ +P  +LLRRHP+P  +KL E
Sbjct: 664  FDEHGVPYDFVTCSRKGSNFLVEEFMLLANMTAAEVISQAYPASSLLRRHPEPNTRKLKE 723

Query: 688  FESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGEL 747
            FE FCSK G +LD SSS   Q SLE+I     DD +  D + +YA +  QLASYFC G L
Sbjct: 724  FEGFCSKHGMDLDISSSGQLQDSLEKITGNLKDDSVFVDILNNYAIKPMQLASYFCTGNL 783

Query: 748  KDG-ENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQS---DERMRC 807
            KD      +Y+LAVP YTHFTSPLRRY DIVVHR LAAA+EAEELY K +    DE   C
Sbjct: 784  KDSVAEWGHYALAVPLYTHFTSPLRRYPDIVVHRALAAALEAEELYSKQKQTAIDEGRSC 843

Query: 808  FTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLY 867
            FTG++F+KDAA+S+EG+EALS AAL+HGVP +++LSDVA  CN RKLA++   DACDKLY
Sbjct: 844  FTGIHFNKDAAESIEGKEALSVAALKHGVPSTEILSDVAAYCNERKLAARKVRDACDKLY 903

Query: 868  MWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVLS 927
             W +LK+K+I   +A+V+ LGS+FMT+YI KL         +++ L  +WL+ATSTL++ 
Sbjct: 904  TWFVLKQKEIFPCEARVMNLGSRFMTVYISKLGIERRIYYDQIEGLCADWLEATSTLIVD 963

Query: 928  FPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTEGAAASDSGIIEPAVF 987
               ++R  G R    +K +++   ++SPC++ V + +      TE   A     + PAVF
Sbjct: 964  KLYSKR--GGRGF--FKPMKEAVYLVSPCEVCVAKCSALSVHDTESPEAVSIDEVAPAVF 1023

Query: 988  PLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 996
            PLT+QL ST+PV LHAVGGDDG ++IG RLYMSSY
Sbjct: 1024 PLTIQLFSTIPVVLHAVGGDDGPLDIGARLYMSSY 1054
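The percentages in an HSP header are the identity and positive counts divided by the alignment length (755 columns here, including gap columns). A small sketch of recomputing them from the header line follows; the function name `hsp_percentages` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling found in some report output.

```python
import re

def hsp_percentages(line: str) -> dict:
    """Recompute identity/positives percentages from an HSP header line."""
    out = {}
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    for label, num, den in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        out[key] = round(100 * int(num) / int(den), 2)
    return out

header = "Identity = 416/755 (55.10%), Positives = 536/755 (70.99%), Query Frame = 1"
print(hsp_percentages(header))
```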

BLAST of Cp4.1LG01g20370 vs. Swiss-Prot
Match: DI32L_ARATH (Inactive exonuclease DIS3L2 OS=Arabidopsis thaliana GN=SOV PE=2 SV=1)

HSP 1 Score: 802.4 bits (2071), Expect = 5.8e-231
Identity = 415/755 (54.97%), Positives = 535/755 (70.86%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKF--------------LSFH 327
            + ++C ++  FP KRPTG+VVA+++KS  R +IVGLL+VK +              LS  
Sbjct: 304  VDKLCGILSSFPHKRPTGQVVAVVEKSLVRDSIVGLLDVKGWIHYKESDPKRCKSPLSLS 363

Query: 328  DCGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKA 387
            D  YVQLMP D RFP ++V    LP  I+ RL+N D  +E+ELVAA+I +W E S  P A
Sbjct: 364  DDEYVQLMPADPRFPKLIVPFHVLPGSIRARLENLDPNLEAELVAAQIVDWGEGSPFPVA 423

Query: 388  LVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLC 447
             + H+ GRGS +E  I+AIL+QN++   +FS  SL+ LP  PW++P+EE+Q R+DLR+LC
Sbjct: 424  QITHLFGRGSELEPQINAILYQNSVCDSDFSPGSLTSLPRVPWEVPEEEVQRRKDLRDLC 483

Query: 448  IFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLR 507
            + TIDP +A DLDDA SVQ L  G FRVG+H+ADVS+FVLP+TALD EAR RSTSV+L++
Sbjct: 484  VLTIDPSTATDLDDALSVQSLPGGFFRVGVHIADVSYFVLPETALDTEARFRSTSVYLMQ 543

Query: 508  RKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQ 567
            RKI MLPPLLSEN+GSL+PG DRLAFS+  D+N  GDV D WIGRT+I SCCKLSY HAQ
Sbjct: 544  RKISMLPPLLSENVGSLSPGADRLAFSILWDLNREGDVIDRWIGRTIIRSCCKLSYDHAQ 603

Query: 568  DIID-SSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFL 627
            DIID  S V  +  P LHG   W D+  SV+ L+EIS TL++KRFR+GAL++EN K VFL
Sbjct: 604  DIIDGKSDVAENGWPALHGSFKWCDVTRSVKQLSEISTTLRQKRFRNGALQLENSKPVFL 663

Query: 628  YDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTE 687
            +DE G+PYD     RK SNFLVEEF+LLAN T AEVIS+ +   +LLRRHP+P  +KL E
Sbjct: 664  FDEHGVPYDFVTCSRKGSNFLVEEFMLLANMTAAEVISQAYRASSLLRRHPEPNTRKLKE 723

Query: 688  FESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGEL 747
            FE FCSK G +LD SSS   Q SLE+I     DD +  D + +YA +  QLASYFC G L
Sbjct: 724  FEGFCSKHGMDLDISSSGQLQDSLEKITGNLKDDSVFVDILNNYAIKPMQLASYFCTGNL 783

Query: 748  KDG-ENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQS---DERMRC 807
            KD      +Y+LAVP YTHFTSPLRRY DIVVHR LAAA+EAEELY K +    DE   C
Sbjct: 784  KDSVAEWGHYALAVPLYTHFTSPLRRYPDIVVHRALAAALEAEELYSKQKQTAIDEGRSC 843

Query: 808  FTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLY 867
            FTG++F+KDAA+S+EG+EALS AAL+HGVP +++LSDVA  CN RKLA++   DACDKLY
Sbjct: 844  FTGIHFNKDAAESIEGKEALSVAALKHGVPSTEILSDVAAYCNERKLAARKVRDACDKLY 903

Query: 868  MWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVLS 927
             W +LK+K+I   +A+V+ LGS+FMT+YI KL         +++ L  +WL+ATSTL++ 
Sbjct: 904  TWFVLKQKEIFPCEARVMNLGSRFMTVYISKLGIERRIYYDQIEGLCADWLEATSTLIVD 963

Query: 928  FPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTEGAAASDSGIIEPAVF 987
               ++R  G R    +K +++   ++SPC++ V + +      TE   A     + PAVF
Sbjct: 964  KLYSKR--GGRGF--FKPMKEAVYLVSPCEVCVAKCSALSVHDTESPEAVSIDEVAPAVF 1023

Query: 988  PLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 996
            PLT+QL ST+PV LHAVGGDDG ++IG RLYMSSY
Sbjct: 1024 PLTIQLFSTIPVVLHAVGGDDGPLDIGARLYMSSY 1054

BLAST of Cp4.1LG01g20370 vs. Swiss-Prot
Match: DI3L2_SCHPO (DIS3-like exonuclease 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=dis32 PE=1 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 2.0e-106
Identity = 236/686 (34.40%), Positives = 375/686 (54.66%), Query Frame = 1

Query: 112 SQNLRNQHHSMDTGRWAITSCPEQIAS------GSMPWISMNQHSPPADVNSQRKYFTSH 171
           S++ +   H  D  + +++   + + S       S      N H   AD N+  +  +S+
Sbjct: 111 SKSSKQDEHKTDVHKESVSKLSKNLESRNNRDENSAKREKNNSHQVEADTNNATEMVSSN 170

Query: 172 -----WPL----DDVNRGLQKGDIFKALFRMNAHYRVEAYCKIDGIPVDVLIYGSASQNR 231
                +PL      V +GL+ G +FK   R+  ++R  A+  ++ IP D  + G  ++NR
Sbjct: 171 AKKSVYPLYYDSATVKKGLKSGTLFKGTLRILENHR-SAFACMEDIP-DFYVDGPIARNR 230

Query: 232 AVEGDIVAMKMNPFSLWSRMKGTSEAHDNMHSLEDANVMSDSFWSSPSVDPIGR--ICAV 291
           A   D+V ++     + +    T +++   + +E   +         +++ + R  I +V
Sbjct: 231 AFHNDVVIVE----PVMNNDSPTEKSNFLQNGVEKVKIKDHDDELGGAMEHLERLEIKSV 290

Query: 292 IDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHDCGYVQ-------LMPNDARF 351
                  R   +VVAI K++ +   IVG+L    + S  +  YV         +P D R 
Sbjct: 291 ASFKGDSRTRARVVAIEKRA-EISKIVGILRAPGW-SLKNVEYVSKKSSYAIFIPKDKRL 350

Query: 352 PTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGSRIES 411
           P + +   DL D   +           +L +  I  W   S  P  ++   LG  + +E+
Sbjct: 351 PFITIHKNDLSDLSGENWIENILKHHDQLFSVEITRWSIYSRYPMGVLGEKLGNITDVEA 410

Query: 412 HIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLCIFTIDPPSALDLDD 471
           + +A+L +N I +  FS + L+CLP   W I  EE++ RRDLRN  I TIDP +A DLDD
Sbjct: 411 YTNALLLENGISSSPFSDEVLNCLPPDDWIISHEEIKKRRDLRNELIITIDPETARDLDD 470

Query: 472 AFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLLSENI 531
           A S + L NG + VG+H+ADV+HFV PD+ALDKEA  R+T+V+L+++ IPMLPPLL E +
Sbjct: 471 AVSCRALDNGTYEVGVHIADVTHFVKPDSALDKEAASRATTVYLVQKAIPMLPPLLCERL 530

Query: 532 GSLNPGVDRLAFSLFLDINHCG-DVEDCWIGRTVICSCCKLSYGHAQDIIDSSKVLGHCV 591
            SLNP V+RLAFS+F  ++  G ++   W G+TVI +C +L+Y  AQ +I+         
Sbjct: 531 CSLNPNVERLAFSVFWKLDSNGKEIGKRWFGKTVIKTCARLAYSEAQGVIEGKSWDDAVG 590

Query: 592 PQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYDEFGIPYDSTFHE 651
             + G  T  D+ +S+ TL EIS+ L++ RF  GA+ I + ++ F  DE+G+P     +E
Sbjct: 591 KPIGGTHTPKDVETSILTLCEISRKLRKDRFAKGAVEINSTELKFQLDEYGMPNKCEVYE 650

Query: 652 RKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSKQGFELDT 711
           + D+N L+EEF+LLANR+VAE IS+ F + +LLRRH  P  K++ EF  F     F+ D 
Sbjct: 651 QTDANHLIEEFMLLANRSVAEHISKNFSNNSLLRRHASPKEKQINEFCHFLKSMNFDFDA 710

Query: 712 SSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGENGSYYSLAVP 771
           SSS  F  S+ ++R  F+++  L +   + A RS   A YFC G+  +  +  +Y+L+  
Sbjct: 711 SSSAAFNASMVRLRSTFNEE--LVELFENMAVRSLNRAEYFCTGDFGEKTDWHHYALSFN 770

Query: 772 WYTHFTSPLRRYADIVVHRTLAAAVE 773
            YTHFTSP+RRY DI+VHR L  +++
Sbjct: 771 HYTHFTSPIRRYPDIIVHRLLERSLK 786

BLAST of Cp4.1LG01g20370 vs. Swiss-Prot
Match: DI3L2_XENTR (DIS3-like exonuclease 2 OS=Xenopus tropicalis GN=dis3l2 PE=2 SV=2)

HSP 1 Score: 360.9 bits (925), Expect = 4.4e-98
Identity = 209/492 (42.48%), Positives = 283/492 (57.52%), Query Frame = 1

Query: 284 TGKVVAILKKSRQRGTIVGLLNVKKFLSFHDCGYVQLMPNDARFPTMMVFAVDLPDCIKK 343
           T KVV IL+K   R     +  +    S          P D R P + V   D P     
Sbjct: 203 TAKVVYILEKKHSRAATGFIKPLSDKSSDLARKRALFSPVDHRLPRIYVPLGDCPHDFAI 262

Query: 344 RLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGSRIESHIDAILFQNAILTCEF 403
             +    T  + L    I  W ++S+  +  +M  LG+   IE   + IL +  +   +F
Sbjct: 263 HPE----TYANTLFICSITAWRDDSNFAEGKLMKSLGQAGEIEPETEGILVEYGVDFSDF 322

Query: 404 SRDSLSCLPHT-PWKIPQEELQCRRDLRNLCIFTIDPPSALDLDDAFSVQKLANGIFRVG 463
               L CLP   PW IPQEE Q R+DLRN CIFTIDP +A DLDDA S + L +G F VG
Sbjct: 323 PDKVLQCLPQDLPWTIPQEEFQKRKDLRNECIFTIDPATARDLDDALSCKPLPDGNFEVG 382

Query: 464 IHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFSLF 523
           +H+ADVS+FV   +ALD  A  R+TSV+L+++ IPMLP LL E + SLNP  DRL FS+ 
Sbjct: 383 VHIADVSYFVAEGSALDIMASERATSVYLVQKVIPMLPRLLCEELCSLNPMTDRLTFSVI 442

Query: 524 LDINHCGDVEDCWIGRTVICSCCKLSYGHAQDIID--SSKVLGHCVPQLHGQSTWLDIIS 583
             I   G++ D W GR+VICSC KLSY HAQ++I+    K+  H +P +  Q T  +I  
Sbjct: 443 WKITPQGEILDEWFGRSVICSCVKLSYDHAQNMINHPDKKIEQHELPPVSPQHTINEIHQ 502

Query: 584 SVRTLNEISKTLKEKRFRDGALRIENPKIVFLYD-EFGIPYDSTFHERKDSNFLVEEFVL 643
           +V  L+ I++ L+++RF DGALR++  K+ F  D E G+P     ++ +DSN LVEEF+L
Sbjct: 503 AVLNLHLIAQNLRKQRFDDGALRLDQLKLTFTLDKESGLPQGCYIYQYRDSNKLVEEFML 562

Query: 644 LANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQI 703
           LAN  VA  I R FP+ ALLRRHP P  K L +   FC + G +LD SSS    +SL   
Sbjct: 563 LANMAVAHHIYRRFPEEALLRRHPPPQTKMLNDLIEFCDQMGLQLDFSSSGTLHKSLNDQ 622

Query: 704 RMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYA 763
                      + + +  +R  Q+A YFC G LKD     +Y+L VP YTHFTSP+RR+A
Sbjct: 623 FETDEYSAARKEVLTNMCSRPMQMAVYFCTGALKDETLFHHYALNVPLYTHFTSPIRRFA 682

Query: 764 DIVVHRTLAAAV 772
           D++VHR LAA++
Sbjct: 683 DVIVHRLLAASL 690

BLAST of Cp4.1LG01g20370 vs. Swiss-Prot
Match: DI3L2_MOUSE (DIS3-like exonuclease 2 OS=Mus musculus GN=Dis3l2 PE=1 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 2.9e-97
Identity = 208/501 (41.52%), Positives = 295/501 (58.88%), Query Frame = 1

Query: 284 TGKVVAILKKSRQRGT--IVGLLNVKKFLSFHDCGYVQLMPNDARFPTMMVFAVDLPDCI 343
           + KVV IL+K   R    I+ LL  K    F    Y    P+D R P + V   D P   
Sbjct: 230 SAKVVYILEKKHSRAATGILKLLADKNSDLFKK--YALFSPSDHRVPRIYVPLKDCPQDF 289

Query: 344 KKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGSRIESHIDAILFQNAILTC 403
             R  +   T    L   RI +W E+ +     +   LG+   IE   + IL +  +   
Sbjct: 290 MTRPKDFANT----LFICRIIDWKEDCNFALGQLAKSLGQAGEIEPETEGILTEYGVDFS 349

Query: 404 EFSRDSLSCLPHT-PWKIPQEELQCRRDLRNLCIFTIDPPSALDLDDAFSVQKLANGIFR 463
           +FS + L CLP + PW IP +E+  RRDLR  CIFTIDP +A DLDDA + ++L +G F 
Sbjct: 350 DFSSEVLECLPQSLPWTIPPDEVGKRRDLRKDCIFTIDPSTARDLDDALACRRLTDGTFE 409

Query: 464 VGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFS 523
           VG+H+ADVS+FV   ++LDK A  R+TSV+L+++ +PMLP LL E + SLNP  D+L FS
Sbjct: 410 VGVHIADVSYFVPEGSSLDKVAAERATSVYLVQKVVPMLPRLLCEELCSLNPMTDKLTFS 469

Query: 524 LFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQDIID--SSKVLGHCVPQLHGQSTWLDI 583
           +   +   G + + W GRT+I SC KLSY HAQ +I+  + K+    +P +  + +  ++
Sbjct: 470 VIWKLTPEGKILEEWFGRTIIRSCTKLSYDHAQSMIENPTEKIPEEELPPISPEHSVEEV 529

Query: 584 ISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYD-EFGIPYDSTFHERKDSNFLVEEF 643
             +V  L+ I+K L+ +RF DGALR++  K+ F  D E G+P     +E +DSN LVEEF
Sbjct: 530 HQAVLNLHSIAKQLRRQRFVDGALRLDQLKLAFTLDHETGLPQGCHIYEYRDSNKLVEEF 589

Query: 644 VLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSKQGFELDTSSSFLFQQSLE 703
           +LLAN  VA  I RTFP++ALLRRHP P  K L++   FC + G  +D SS+    +SL 
Sbjct: 590 MLLANMAVAHKIFRTFPEQALLRRHPPPQTKMLSDLVEFCDQMGLPMDVSSAGALNKSLT 649

Query: 704 QIRMKFHDDPLLF---DAVISYATRSTQLASYFCNGELKDGENGSYYSLAVPWYTHFTSP 763
           +    F DD       + + +  +R  Q+A YFC+G L+D E   +Y+L VP YTHFTSP
Sbjct: 650 K---TFGDDKYSLARKEVLTNMYSRPMQMALYFCSGMLQDQEQFRHYALNVPLYTHFTSP 709

Query: 764 LRRYADIVVHRTLAAAVEAEE 776
           +RR+AD++VHR LAAA+   E
Sbjct: 710 IRRFADVIVHRLLAAALGYSE 721

BLAST of Cp4.1LG01g20370 vs. TrEMBL
Match: A0A0A0KC80_CUCSA (DIS3-like exonuclease 2 OS=Cucumis sativus GN=Csa_6G040560 PE=3 SV=1)

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 571/782 (73.02%), Positives = 656/782 (83.89%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHD------------- 327
            IGRICA+I+L+P KRPTG+VV IL+KSR R  +VG LNVKKFLSF +             
Sbjct: 345  IGRICALINLYPAKRPTGRVVTILEKSRLRENVVGHLNVKKFLSFQEFYVKESTKSCLSP 404

Query: 328  ---CGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAP 387
               CGYVQLMPNDARFP MMV A DLP+CIKKRLDNGD TVE+ELVAARI EW++ESS+P
Sbjct: 405  SQNCGYVQLMPNDARFPIMMVLAGDLPNCIKKRLDNGDVTVENELVAARIYEWVKESSSP 464

Query: 388  KALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRN 447
            +A V+HVLGRG+ +ESHIDAILF+NAI TCEFS+DSLSC+P TPWKIP EELQCRRD+RN
Sbjct: 465  RAHVLHVLGRGNEVESHIDAILFENAIRTCEFSQDSLSCVPQTPWKIPPEELQCRRDIRN 524

Query: 448  LCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHL 507
            LCIFTIDP SA DLDDA SVQ+LANGIFRVGIH+ADVS+FVLPDTALDKEA+IRSTSV+L
Sbjct: 525  LCIFTIDPSSASDLDDALSVQRLANGIFRVGIHIADVSYFVLPDTALDKEAQIRSTSVYL 584

Query: 508  LRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGH 567
            L+RKIPMLPPLLSE+IGSLNPGVDRLAFSLFLDIN CGDV+D WI RTVIC CCKLSY H
Sbjct: 585  LQRKIPMLPPLLSESIGSLNPGVDRLAFSLFLDINSCGDVKDFWIERTVICCCCKLSYEH 644

Query: 568  AQDII------DSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIE 627
            AQDII      DSS++ G+  PQLHGQ TW D+ISSV+ L+EISKT+KEKRFR+GALR+E
Sbjct: 645  AQDIIDGLIDSDSSELFGNNCPQLHGQFTWHDVISSVKLLHEISKTVKEKRFRNGALRLE 704

Query: 628  NPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKP 687
            N K+++LYDE+GIPYDS F+E+KDSNFLVEEF+LLANRTVAEVISRTFPD ALLRRHP+P
Sbjct: 705  NSKLIYLYDEYGIPYDSMFYEQKDSNFLVEEFMLLANRTVAEVISRTFPDSALLRRHPEP 764

Query: 688  IFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLAS 747
            + +KL EFE+FCSK GFELDTSSS  FQQSLEQIR++  DDPLLFD +ISYATR  QLA+
Sbjct: 765  MLRKLREFETFCSKHGFELDTSSSVHFQQSLEQIRIELQDDPLLFDILISYATRPMQLAT 824

Query: 748  YFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQ---- 807
            YFC+GELKDGE  S+Y+LAVP YTHFTSPLRRY DIVVHRTLAAA+EAE++YLKH+    
Sbjct: 825  YFCSGELKDGETRSHYALAVPLYTHFTSPLRRYPDIVVHRTLAAAIEAEKMYLKHKGVIQ 884

Query: 808  ---SDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASK 867
               S+E  RCFTG+YFDKDAADSLEGREALSSAAL+HGVPCSKLL DVA+ CN+RKLASK
Sbjct: 885  KVNSNEETRCFTGIYFDKDAADSLEGREALSSAALKHGVPCSKLLLDVALHCNDRKLASK 944

Query: 868  HAADACDKLYMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEW 927
            H AD  +KLYMWALLK+K+ILFSDA+VLGLG +FM++YIQKLA        EV+ LA EW
Sbjct: 945  HVADGIEKLYMWALLKKKKILFSDARVLGLGPRFMSVYIQKLAIERRIYYDEVEGLAVEW 1004

Query: 928  LDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTL----EGEASTEG 987
            L+ TSTLVL F  +RRSH SR S+KWKALEDVALVISPCD  V++ TL     G AS  G
Sbjct: 1005 LETTSTLVLRFFCSRRSHRSRGSVKWKALEDVALVISPCDQNVKERTLGVSSNGGASKGG 1064

Query: 988  AA-----------ASDSGIIEPAVFPLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 998
            +A            SD+G ++PA+FPLTV+LLST+PVALHAVGGDDG I+IGVRLYMSSY
Sbjct: 1065 SAVVEQDSNLKSHVSDTG-VDPAIFPLTVRLLSTIPVALHAVGGDDGPIDIGVRLYMSSY 1124

BLAST of Cp4.1LG01g20370 vs. TrEMBL
Match: M5XPD1_PRUPE (DIS3-like exonuclease 2 OS=Prunus persica GN=PRUPE_ppa015523mg PE=3 SV=1)

HSP 1 Score: 949.9 bits (2454), Expect = 2.5e-273
Identity = 501/886 (56.55%), Positives = 636/886 (71.78%), Query Frame = 1

Query: 163  TSHWPLDDVNRGLQKGDIFKALFRMNAHYRVEAYCKIDGIPVDVLIYGSASQNRAVEGDI 222
            TS  PLDD N  L+   +        A Y  +   K+D    +V +YG+   +   E   
Sbjct: 206  TSSAPLDDFNLQLENNVV--------AGYNCKGKAKVD----EVYLYGNDRSSLLPERGS 265

Query: 223  VAMKMNPFSLWSRMKGTSEAHDNM---HSLEDANVMSDSFWSSPSVDPIGRICAVIDLFP 282
               +    S  S   G S ++D++   + L   ++ + S   +     + R+CA+I+ FP
Sbjct: 266  RPEESVGESFHSGPIGQS-SYDHVAGRYPLPSDSIQAGSPEQNEVRLSVERLCAMINSFP 325

Query: 283  TKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHDC----------------GYVQLMPN 342
            +KRPTG+VVAI+++S +R  IVG LNVK+++S+ +                  Y+Q+ P 
Sbjct: 326  SKRPTGRVVAIVERSPRRDAIVGFLNVKQWISYREFCRKDMRKNKNSSFSNHEYIQMTPI 385

Query: 343  DARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGS 402
            D RFP M+V   +LPD IKKRL++GD T+E EL AARIDEW EESSAP+A++++  GRG 
Sbjct: 386  DPRFPKMVVLVRNLPDSIKKRLEDGDETIEMELFAARIDEWDEESSAPQAVILNAFGRGC 445

Query: 403  RIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLCIFTIDPPSAL 462
             ++  I+AILFQNAI + EFS +SLSCLPH PW++PQEE Q RRDLRNLCIFTIDP +A 
Sbjct: 446  ELQPQIEAILFQNAINSSEFSPESLSCLPHLPWEVPQEEFQTRRDLRNLCIFTIDPSTAT 505

Query: 463  DLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLL 522
            DLDDA SV KL+NGI+RVGIH+ADVSHFVLP T LD+EA+ RSTSV++ RRK+PMLPPLL
Sbjct: 506  DLDDALSVDKLSNGIYRVGIHIADVSHFVLPGTPLDEEAQSRSTSVYMSRRKLPMLPPLL 565

Query: 523  SENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQDIID------ 582
            SEN+GSLNPGV+RLAFS+FLD+NH GDV D WIGRTVI SCCKLSY H QDIID      
Sbjct: 566  SENVGSLNPGVERLAFSIFLDMNHAGDVVDRWIGRTVIRSCCKLSYEHTQDIIDGKFNLE 625

Query: 583  SSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYDEFG 642
            S  +LG+  PQLHG   W D++ SV+ L+EIS+ LKE+RF DGAL++E+ K+V L+DE+G
Sbjct: 626  SVDILGNGRPQLHGHFEWFDVLRSVKDLHEISRILKERRFSDGALQLESSKVVILFDEYG 685

Query: 643  IPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFC 702
            +PYDS   E K+SNFLVEEF+LLANRT AEVISR FPD ALLRRHP+P  +KL EFE+FC
Sbjct: 686  VPYDSIHSELKESNFLVEEFMLLANRTAAEVISRAFPDSALLRRHPEPNLRKLREFEAFC 745

Query: 703  SKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGEN 762
            SK G ELDTSSS  FQ SLE+IR +  DD +LF+ +++YAT+  QLA+YFC+GELKD EN
Sbjct: 746  SKHGLELDTSSSGQFQLSLEKIREELKDDCVLFNILMNYATKPMQLAAYFCSGELKDREN 805

Query: 763  G-SYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKH--------QSDE-RMR 822
               +Y LAVP YTHFTSPLRRY DI+VHR L+AA+EAEEL LKH        + DE RM+
Sbjct: 806  DWGHYGLAVPLYTHFTSPLRRYPDILVHRMLSAAIEAEELLLKHRRMLNNFNRGDECRMK 865

Query: 823  CFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKL 882
            CFTG+YFDKDAA+S E REALS+A+++HG+PCS+LL+DVA  CN RKLAS+H  DACDKL
Sbjct: 866  CFTGIYFDKDAAESYESREALSAASMKHGIPCSELLTDVAAYCNERKLASRHVKDACDKL 925

Query: 883  YMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVL 942
            YMWALLK+K+IL S+A+V+GLG +FM++YI KLA        EV+ +  EWLDATSTLVL
Sbjct: 926  YMWALLKKKEILLSEARVMGLGPRFMSIYIYKLAVERRIYYDEVEGMMGEWLDATSTLVL 985

Query: 943  SFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTEGAAASDSGI----- 997
            +    RRS       K +ALEDVALV  P DL  +   + G ++ EGAAA D G+     
Sbjct: 986  TLCSNRRSLRRGSPGKCRALEDVALVARPYDLKAELGAV-GNSTNEGAAAQDVGVATHSS 1045

BLAST of Cp4.1LG01g20370 vs. TrEMBL
Match: A0A067K983_JATCU (DIS3-like exonuclease 2 OS=Jatropha curcas GN=JCGZ_12063 PE=3 SV=1)

HSP 1 Score: 904.0 bits (2335), Expect = 1.6e-259
Identity = 468/816 (57.35%), Positives = 594/816 (72.79%), Query Frame = 1

Query: 233  WSRMKGTSEAHDNMHSLEDANVMSDSFWSSPSVDPIGRICAVIDLFPTKRPTGKVVAILK 292
            W+   G +  + +  S  D++  S S   +  ++ +GR+CA+I+ +P+KRPTG+VVAI++
Sbjct: 302  WNGPVGYNLVNGHQPSASDSSHFS-STGQNDVMNGVGRLCAMINSYPSKRPTGRVVAIVE 361

Query: 293  KSRQRGTIVGLLNVKKFLSFHDC-----------------GYVQLMPNDARFPTMMVFAV 352
            +S +R  IVG L VK++L + +                   Y+QL P D +FP MMV   
Sbjct: 362  RSPRRDAIVGFLYVKQWLFYREACKKDGKKNKNSLSISAHEYIQLTPTDPKFPKMMVLMK 421

Query: 353  DLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGSRIESHIDAILFQ 412
             LPD IKKRL+ GD TVE ELVAA+ID W E+S  P+A V H+ G GS +E  I+AIL++
Sbjct: 422  SLPDPIKKRLEEGDPTVEMELVAAQIDNWDEDSPFPQAHVSHIFGWGSEMEPQINAILYE 481

Query: 413  NAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLCIFTIDPPSALDLDDAFSVQKLA 472
            NAI   +FS +SLSCLP   WK+P EE++ RRDLRNLCIFTIDP +A DLDDA SV++L 
Sbjct: 482  NAIRCSDFSPESLSCLPCDAWKVPAEEIKIRRDLRNLCIFTIDPSTATDLDDALSVERLQ 541

Query: 473  NGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVD 532
            NG  RVG+H+ADVS+FVLPDTALD EA+ RSTSV++LRRK+PMLPPLLSEN+GSL PGVD
Sbjct: 542  NGTLRVGVHIADVSYFVLPDTALDIEAQSRSTSVYMLRRKLPMLPPLLSENLGSLKPGVD 601

Query: 533  RLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQDII------DSSKVLGHCVPQL 592
            RLAFS+F D+N  GDV D WIGRTVI SCCKLSY HAQ ++      D+S   G+ +PQL
Sbjct: 602  RLAFSIFWDLNGAGDVIDRWIGRTVIRSCCKLSYEHAQYMVDGMIDEDASSTYGNGLPQL 661

Query: 593  HGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYDEFGIPYDSTFHERKD 652
            HG   W D+I SV++L+EISKTL+EKRF DGAL++E+ KIVFL+DE+GIPYDS F ERKD
Sbjct: 662  HGPFEWADVIRSVKSLHEISKTLREKRFDDGALQLESSKIVFLFDEYGIPYDSMFSERKD 721

Query: 653  SNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSKQGFELDTSSS 712
            SNFLVEEF+LLANRT AEVI R FPD ALLRRHP+P  +KL EFE+FC K G ELDT+SS
Sbjct: 722  SNFLVEEFMLLANRTAAEVICRAFPDSALLRRHPEPNMRKLREFEAFCCKHGLELDTTSS 781

Query: 713  FLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGENG-SYYSLAVPWY 772
              F +SLE+IR K  DD +L+  ++SYA+R  QLA+YFC+G +KD  N   +Y LAVP Y
Sbjct: 782  GHFHRSLERIREKLKDDSMLYGILMSYASRPMQLATYFCSGVMKDNTNDWGHYGLAVPLY 841

Query: 773  THFTSPLRRYADIVVHRTLAAAVEAEELYLKHQ--------SDERMRCFTGMYFDKDAAD 832
            THFTSPLRRY DIVVHRTLAAAVEAEELY++ +         +E  RCFTG+YFDK+AA+
Sbjct: 842  THFTSPLRRYPDIVVHRTLAAAVEAEELYMRSRRILHNVSMKEEVTRCFTGIYFDKNAAE 901

Query: 833  SLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLYMWALLKEKQILF 892
            SLEGREA S+AAL+H VPC++LLSDVA  CN RKLAS+H  DAC+KLYMW LLK+K++LF
Sbjct: 902  SLEGREAFSAAALKHKVPCTELLSDVAAYCNERKLASRHVKDACEKLYMWVLLKKKEVLF 961

Query: 893  SDAKVLGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVLSFPDTRRSHGSRD 952
            S+A+VLGLG +FM++Y+QKLA        EV+ L  EW +ATSTLVLS    +R+     
Sbjct: 962  SEARVLGLGPRFMSIYVQKLAIERRIYYDEVEGLTVEWFEATSTLVLSLCAFKRTVRKAG 1021

Query: 953  SIKWKALEDVALVISPCDLTVQQSTLEGEA-STEGAAASDSGI------------IEPAV 996
               +KAL +VA +++PC+L V  + +EG A         D+ +            I+P V
Sbjct: 1022 PGYYKALNEVAWLVNPCNLKVDPAAVEGSAIQCSNTQLDDNDMPSQHVDPISESEIDPLV 1081

BLAST of Cp4.1LG01g20370 vs. TrEMBL
Match: W9QVR2_9ROSA (DIS3-like exonuclease 2 OS=Morus notabilis GN=L484_020163 PE=3 SV=1)

HSP 1 Score: 897.9 bits (2319), Expect = 1.1e-257
Identity = 460/778 (59.13%), Positives = 572/778 (73.52%), Query Frame = 1

Query: 266  DPIGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKF---------------- 325
            D IGR+CA+I  FP+KRPTG+V+A+++KS +R  +VG LNVK++                
Sbjct: 336  DAIGRMCAMISSFPSKRPTGRVLAVIEKSPRRKAVVGFLNVKQWILYQEVCRKDAKKNKS 395

Query: 326  -LSFHDCGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEES 385
             L+F D  Y+QL P D R P MMV    LPDCIKKRL+NGD T+E ELVAA+ID W EES
Sbjct: 396  TLAFTDYEYIQLTPIDPRLPKMMVLVQGLPDCIKKRLENGDVTLEIELVAAKIDNWGEES 455

Query: 386  SAPKALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRD 445
              P+A V H  G+G  + S + AILF+NAI + +FS  S SCLP+ PW++P EELQ RRD
Sbjct: 456  PFPQACVSHTFGQGGELNSQLGAILFENAICSADFSPKSFSCLPNVPWEVPLEELQSRRD 515

Query: 446  LRNLCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTS 505
            LR LCIFTIDP +A +LDDA S+++L+N  FRVGIH+ADVS+FVLPDT LDKEA++RSTS
Sbjct: 516  LRKLCIFTIDPSTATELDDALSIERLSNRDFRVGIHIADVSYFVLPDTELDKEAQMRSTS 575

Query: 506  VHLLRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLS 565
            V++ R+K+ MLPPLLSENIGSLN GVDRLAFS+FLDIN  GDVED WIGRTVI SCCKLS
Sbjct: 576  VYMSRKKLSMLPPLLSENIGSLNAGVDRLAFSMFLDINLAGDVEDRWIGRTVIKSCCKLS 635

Query: 566  YGHAQDIID-----SSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALR 625
            Y HAQ+IID      S   G+  PQLHG   W+D+++SV+ L+E+SK L+ KRF +GAL 
Sbjct: 636  YEHAQEIIDGPMDTGSLFSGNNCPQLHGHFEWVDVVNSVKDLHELSKILRGKRFSNGALA 695

Query: 626  IENPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHP 685
            +E+ K+VF YDE G PYDS   ERK SNFLVEEF+LLANRT AEVISR FPD ALLRRHP
Sbjct: 696  LESLKVVFRYDECGNPYDSMLSERKASNFLVEEFMLLANRTAAEVISRAFPDCALLRRHP 755

Query: 686  KPIFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQL 745
            +P  +KL EFE+FC K G ELDTSSS  F  SL++I  K  DD  LFD +++YA R  QL
Sbjct: 756  EPNMRKLREFEAFCHKHGLELDTSSSRQFHLSLQRIGEKLKDDSTLFDIIMNYAARPMQL 815

Query: 746  ASYFCNGELKDGENG-SYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQ- 805
            A+YFC G+LKD EN   +Y+LAVP YTHFTSPLRRY DIVVHRTLAA +EAEELYLKH+ 
Sbjct: 816  ATYFCTGDLKDDENDWGHYALAVPLYTHFTSPLRRYPDIVVHRTLAAIIEAEELYLKHEK 875

Query: 806  --------SDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNR 865
                     +   +CFTG+ F+KDAA+S EGREALS+AA  H +P ++LL+ VA  CN+R
Sbjct: 876  TFNKFHRGQEATRKCFTGINFEKDAAESREGREALSAAARNHRIPGTELLAKVAAYCNDR 935

Query: 866  KLASKHAADACDKLYMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKD 925
            KLAS+H  DACDKL+MWALLK+KQ+L S+A+VLGLG +FM++YIQKLA        EV+ 
Sbjct: 936  KLASRHVKDACDKLHMWALLKKKQVLLSEARVLGLGPRFMSIYIQKLAIERRIYYDEVEG 995

Query: 926  LANEWLDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTE 985
            L  EWL+ATSTLVL+    R         KW+ +EDVAL++SPCDL  +   + G +S+E
Sbjct: 996  LMPEWLEATSTLVLNLYPNRLCTRRGSPGKWRPIEDVALIVSPCDLQAEPGVV-GSSSSE 1055

Query: 986  ----GAAASDSGI----IEPAVFPLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 996
                    S SG     ++P+VFP+TV+LLST+PVA+HA+GGDDG ++IG RLYMSSY
Sbjct: 1056 PVGSSVVTSQSGSSETELDPSVFPITVRLLSTIPVAVHAIGGDDGPVDIGARLYMSSY 1112

BLAST of Cp4.1LG01g20370 vs. TrEMBL
Match: B9GUD6_POPTR (DIS3-like exonuclease 2 OS=Populus trichocarpa GN=POPTR_0002s08690g PE=3 SV=2)

HSP 1 Score: 892.9 bits (2306), Expect = 3.6e-256
Identity = 465/800 (58.13%), Positives = 586/800 (73.25%), Query Frame = 1

Query: 243  HDNM-----HSLEDANVMSDSFWSSPSVDPIGRICAVIDLFPTKRPTGKVVAILKKSRQR 302
            HDN       S  ++++   S       + +GRICA++ L+P+KRPTG+VVAI++KS +R
Sbjct: 283  HDNYVNGYHQSASESSLAVPSTGQDEVSNSVGRICAMLSLYPSKRPTGRVVAIVEKSPRR 342

Query: 303  GTIVGLLNVKKF-----------------LSFHDCGYVQLMPNDARFPTMMVFAVDLPDC 362
              IVG LNVK++                 LS  +  Y+++MP D RFP +MV    LPDC
Sbjct: 343  DVIVGFLNVKQWFYYREGCRQNAKKNKSSLSISNREYIEMMPTDPRFPKLMVLVSVLPDC 402

Query: 363  IKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGRGSRIESHIDAILFQNAILT 422
            IKKRL+N DATVE ELVAA+ID W ++S  P+A V  + GRGS +ES I+AIL +NAI  
Sbjct: 403  IKKRLENEDATVEMELVAAQIDNWSDKSPFPEAHVSCIFGRGSEMESQINAILHENAICC 462

Query: 423  CEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLCIFTIDPPSALDLDDAFSVQKLANGIFR 482
             +FS +SLSCLP   W++P++E++ R+D+RNLCIFTIDP SA DLDDA SVQKL NG+ R
Sbjct: 463  SKFSPESLSCLPSNTWEVPKDEIENRKDIRNLCIFTIDPSSATDLDDALSVQKLPNGLVR 522

Query: 483  VGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFS 542
            VG+H+ADVS+FVLPDTALD EA+ RSTSV++LRRKIPMLPPLLSEN+GSLNPGVDRLAFS
Sbjct: 523  VGVHIADVSYFVLPDTALDMEAQFRSTSVYMLRRKIPMLPPLLSENLGSLNPGVDRLAFS 582

Query: 543  LFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQDIID------SSKVLGHCVPQLHGQST 602
            +F D N  G+V D WI RTVI SCCKLSY HAQ I+D      +    G  +PQLHG   
Sbjct: 583  IFWDFNSSGNVVDRWIDRTVIQSCCKLSYEHAQGIVDGMIDTETCNTFGDSLPQLHGHFE 642

Query: 603  WLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYDEFGIPYDSTFHERKDSNFLV 662
            W D+I SV  L+EISKTL+EKRF +GALR+E+ KIVFL+DE+GIPYDS+  ERKDSNF+V
Sbjct: 643  WADVIGSVVCLHEISKTLREKRFDNGALRLESSKIVFLFDEYGIPYDSSLCERKDSNFIV 702

Query: 663  EEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSKQGFELDTSSSFLFQQ 722
            EEF+LLAN T AE+ISR FPD ALLRRHP+P  +KL EFE+FC K G ELDTSS   FQQ
Sbjct: 703  EEFMLLANFTAAEIISRAFPDSALLRRHPEPNMRKLREFEAFCCKHGLELDTSSG-NFQQ 762

Query: 723  SLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGENG-SYYSLAVPWYTHFTS 782
            SLE+I+ K  DDP LF+ +I+YA+R  QLA+YFC+G+LKD  N   +Y+LAVP YTHFTS
Sbjct: 763  SLERIKEKLKDDPELFNILINYASRPMQLATYFCSGDLKDNMNDWGHYALAVPLYTHFTS 822

Query: 783  PLRRYADIVVHRTLAAAVEAEELYLKHQ--------SDERMRCFTGMYFDKDAADSLEGR 842
            PLRRY DIVVHRTLAAA+EAE+LY+  +         +E  RCFTG+ F KD A+S EG+
Sbjct: 823  PLRRYPDIVVHRTLAAAIEAEQLYMMDRRMSLKARPGEEGTRCFTGICFCKDVAESAEGK 882

Query: 843  EALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLYMWALLKEKQILFSDAKV 902
            EALS+AAL+H +PC +LLS VA  CN RKLAS+H  DACDKLYMW  +K K++L SDA+V
Sbjct: 883  EALSAAALKHRIPCPELLSHVAAYCNERKLASRHVKDACDKLYMWVSVKRKEVLLSDARV 942

Query: 903  LGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVLSFPDTRRSHGSRDSIKWK 962
            LGLG +FM++YI KLA        EV+ L  EWL+ATSTLVL+   ++RS     S  +K
Sbjct: 943  LGLGPRFMSIYINKLAIERRIYYDEVEGLTVEWLEATSTLVLNICASKRSVRRAGSGYYK 1002

Query: 963  ALEDVALVISPCD--LTVQQSTLEGEASTEGAAASDSGIIEPAVFPLTVQLLSTLPVALH 996
            AL +VA VI+P D  L     + +G ++++ + A     I+P+VFPLTV+LLST+PVALH
Sbjct: 1003 ALGEVAWVINPYDHNLEPDMESTKGCSASQHSDAILKSEIDPSVFPLTVRLLSTIPVALH 1062

BLAST of Cp4.1LG01g20370 vs. TAIR10
Match: AT1G77680.1 (AT1G77680.1 Ribonuclease II/R family protein)

HSP 1 Score: 802.4 bits (2071), Expect = 3.2e-232
Identity = 415/755 (54.97%), Positives = 535/755 (70.86%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKF--------------LSFH 327
            + ++C ++  FP KRPTG+VVA+++KS  R +IVGLL+VK +              LS  
Sbjct: 304  VDKLCGILSSFPHKRPTGQVVAVVEKSLVRDSIVGLLDVKGWIHYKESDPKRCKSPLSLS 363

Query: 328  DCGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKA 387
            D  YVQLMP D RFP ++V    LP  I+ RL+N D  +E+ELVAA+I +W E S  P A
Sbjct: 364  DDEYVQLMPADPRFPKLIVPFHVLPGSIRARLENLDPNLEAELVAAQIVDWGEGSPFPVA 423

Query: 388  LVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLC 447
             + H+ GRGS +E  I+AIL+QN++   +FS  SL+ LP  PW++P+EE+Q R+DLR+LC
Sbjct: 424  QITHLFGRGSELEPQINAILYQNSVCDSDFSPGSLTSLPRVPWEVPEEEVQRRKDLRDLC 483

Query: 448  IFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLR 507
            + TIDP +A DLDDA SVQ L  G FRVG+H+ADVS+FVLP+TALD EAR RSTSV+L++
Sbjct: 484  VLTIDPSTATDLDDALSVQSLPGGFFRVGVHIADVSYFVLPETALDTEARFRSTSVYLMQ 543

Query: 508  RKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQ 567
            RKI MLPPLLSEN+GSL+PG DRLAFS+  D+N  GDV D WIGRT+I SCCKLSY HAQ
Sbjct: 544  RKISMLPPLLSENVGSLSPGADRLAFSILWDLNREGDVIDRWIGRTIIRSCCKLSYDHAQ 603

Query: 568  DIID-SSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFL 627
            DIID  S V  +  P LHG   W D+  SV+ L+EIS TL++KRFR+GAL++EN K VFL
Sbjct: 604  DIIDGKSDVAENGWPALHGSFKWCDVTRSVKQLSEISTTLRQKRFRNGALQLENSKPVFL 663

Query: 628  YDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTE 687
            +DE G+PYD     RK SNFLVEEF+LLAN T AEVIS+ +   +LLRRHP+P  +KL E
Sbjct: 664  FDEHGVPYDFVTCSRKGSNFLVEEFMLLANMTAAEVISQAYRASSLLRRHPEPNTRKLKE 723

Query: 688  FESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGEL 747
            FE FCSK G +LD SSS   Q SLE+I     DD +  D + +YA +  QLASYFC G L
Sbjct: 724  FEGFCSKHGMDLDISSSGQLQDSLEKITGNLKDDSVFVDILNNYAIKPMQLASYFCTGNL 783

Query: 748  KDG-ENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQS---DERMRC 807
            KD      +Y+LAVP YTHFTSPLRRY DIVVHR LAAA+EAEELY K +    DE   C
Sbjct: 784  KDSVAEWGHYALAVPLYTHFTSPLRRYPDIVVHRALAAALEAEELYSKQKQTAIDEGRSC 843

Query: 808  FTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLY 867
            FTG++F+KDAA+S+EG+EALS AAL+HGVP +++LSDVA  CN RKLA++   DACDKLY
Sbjct: 844  FTGIHFNKDAAESIEGKEALSVAALKHGVPSTEILSDVAAYCNERKLAARKVRDACDKLY 903

Query: 868  MWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVLS 927
             W +LK+K+I   +A+V+ LGS+FMT+YI KL         +++ L  +WL+ATSTL++ 
Sbjct: 904  TWFVLKQKEIFPCEARVMNLGSRFMTVYISKLGIERRIYYDQIEGLCADWLEATSTLIVD 963

Query: 928  FPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTEGAAASDSGIIEPAVF 987
               ++R  G R    +K +++   ++SPC++ V + +      TE   A     + PAVF
Sbjct: 964  KLYSKR--GGRGF--FKPMKEAVYLVSPCEVCVAKCSALSVHDTESPEAVSIDEVAPAVF 1023

Query: 988  PLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 996
            PLT+QL ST+PV LHAVGGDDG ++IG RLYMSSY
Sbjct: 1024 PLTIQLFSTIPVVLHAVGGDDGPLDIGARLYMSSY 1054

BLAST of Cp4.1LG01g20370 vs. TAIR10
Match: AT2G17510.2 (AT2G17510.2 ribonuclease II family protein)

HSP 1 Score: 296.2 bits (757), Expect = 7.5e-80
Identity = 196/654 (29.97%), Positives = 318/654 (48.62%), Query Frame = 1

Query: 157 SQRKY-FTSHWPLDDVNRGLQKGDIFKALFRMNAHYRVEAYCKIDGIPVDVLIYGSASQN 216
           S+RK  +  H P+ ++  GL +G   +   R+N     EAY   + I  +++IYG ++ N
Sbjct: 255 SKRKLIYQEHKPMSEITAGLHRGIYHQGKLRVNRFNPYEAYVGSESIGEEIIIYGRSNMN 314

Query: 217 RAVEGDIVAMKMNPFSLWSRMKGTSEAHDNMHSLEDANVMSDSFWSSPSVDPIGRICAVI 276
           RA +GDIVA+++ P   W   K  S A ++    +  ++  D+   +P    +    +  
Sbjct: 315 RAFDGDIVAVELLPRDQWQDEKALSIAEEDDEEDDTVHLAPDNVDDAPRTSNLSHETSGD 374

Query: 277 DLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKF-LSFHDCGYVQLMPNDARFPTMMVFA 336
                 RP+G+VV +++  R   +  G L             +   +  D R P + +  
Sbjct: 375 KNAAPVRPSGRVVGVIR--RNWHSYCGSLEPMSLPAGSGGTAHALFVSKDRRIPKIRINT 434

Query: 337 VDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVLGR------GSRIESH 396
             L + +  R            +   +D W  +S  P    +  +G+       + +  H
Sbjct: 435 RQLQNLLDMR------------IVVAVDSWDRQSRYPSGHYVRPIGKIGDKETETEVRDH 494

Query: 397 ID----------------AILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQ--CRRDLR 456
           I+                 +L +N +    FS   L+CLP  PW +  E++    R+DLR
Sbjct: 495 INLFDSILVGVRWARVGKVVLIENDVDYSPFSSQVLACLPPLPWSVSSEDVSNPVRQDLR 554

Query: 457 NLCIFTIDPPSALDLDDAFSVQKLANGIFRVGI------------HVADVSHFVLPDTAL 516
           +L +F++DPP   D+DDA     L NG F +G+            ++ADV++FV P T L
Sbjct: 555 HLLVFSVDPPGCKDIDDALHCTSLPNGNFELGVRILESSDSHKYDYIADVTNFVHPGTPL 614

Query: 517 DKEARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGR 576
           D EA  R TSV+L+ R+I MLP  L+E+I SL   V+RLAFS+  +++   ++      +
Sbjct: 615 DDEASKRGTSVYLVERRIDMLPKPLTEDICSLRADVERLAFSVIWEMSPDAEIISTRFTK 674

Query: 577 TVICSCCKLSYGHAQDIIDSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFR 636
           ++I S   LSY  AQ  +D S++                + + +R +N ++K ++++R  
Sbjct: 675 SIIKSSAALSYIEAQARMDDSRLTD-------------SLTTDLRNMNTLAKIMRQRRID 734

Query: 637 DGALRIENPKIVFLYD-EFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRA 696
            GAL + + ++ F  D E   P +   ++  ++N +VEEF+L AN +VA  I + FP  +
Sbjct: 735 RGALTLASAEVKFDIDPENHDPLNIGMYQILEANQMVEEFMLAANVSVAGQILKLFPSCS 794

Query: 697 LLRRHPKPIFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYA 756
           LLRRHP P  + L       +  G  LD SSS     SL++      +DP     +   A
Sbjct: 795 LLRRHPTPTREMLEPLLRTAAAIGLTLDVSSSKALADSLDR---AVGEDPYFNKLIRILA 854

Query: 757 TRSTQLASYFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAV 772
           TR    A YFC+G+L   E   +Y LA P YTHFTSP+RRYAD+ VHR LAA++
Sbjct: 855 TRCMTQAVYFCSGDLSPPEY-HHYGLAAPLYTHFTSPIRRYADVFVHRLLAASL 877

BLAST of Cp4.1LG01g20370 vs. TAIR10
Match: AT5G02250.1 (AT5G02250.1 Ribonuclease II/R family protein)

HSP 1 Score: 77.4 bits (189), Expect = 5.5e-14
Identity = 44/131 (33.59%), Positives = 71/131 (54.20%), Query Frame = 1

Query: 426 RRDLRNLCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIR 485
           R DL +L ++ ID   A +LDDA S  +L +G  ++ IHVAD + +V P + +D+EAR R
Sbjct: 399 RIDLTHLKVYAIDVDEADELDDALSATRLQDGRIKIWIHVADPARYVTPGSKVDREARRR 458

Query: 486 STSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCC 545
            TSV L     PM P  L+    SL  G +  A S+ + +   G + +  +  ++I    
Sbjct: 459 GTSVFLPTATYPMFPEKLAMEGMSLRQGENCNAVSVSVVLRSDGCITEYSVDNSIIRPTY 518

Query: 546 KLSYGHAQDII 557
            L+Y  A +++
Sbjct: 519 MLTYESASELL 529
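
Note on the HSP summary lines: each "Identity = a/n (p%)" entry in this report pairs a count of identical residues with the alignment length, and the percentage is simply 100 * a / n. The following is a minimal Python sketch, not part of the database output, showing how such a line could be parsed and the percentages recomputed. The function name parse_hsp_stats and the dictionary keys are hypothetical, chosen only for illustration; the pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Pattern for the per-HSP summary line used in this report, e.g.
# "Identity = 44/131 (33.59%), Positives = 71/131 (54.20%), Query Frame = 1".
# "Posi?tives" also matches the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Hypothetical helper for illustration; raises ValueError if the line
    does not look like an HSP summary line.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, ident_len, positives, pos_len = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "align_len": ident_len,
        # Percentages are recomputed from the counts, rounded to 2 places.
        "identity_pct": round(100.0 * identities / ident_len, 2),
        "positive_pct": round(100.0 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 44/131 (33.59%), Positives = 71/131 (54.20%), Query Frame = 1"
    )
    print(stats)
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when hits are filtered by identity.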

BLAST of Cp4.1LG01g20370 vs. NCBI nr
Match: gi|659089736|ref|XP_008445669.1| (PREDICTED: LOW QUALITY PROTEIN: DIS3-like exonuclease 2 [Cucumis melo])

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 580/781 (74.26%), Positives = 655/781 (83.87%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHDC------------ 327
            IGRICA+I+L+P KRPTG+VV IL+KSR R  +VG LNVKKFLSF +             
Sbjct: 345  IGRICALINLYPAKRPTGRVVTILEKSRLRDNVVGHLNVKKFLSFQEFYVKENTKSCLSP 404

Query: 328  ----GYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAP 387
                GYVQLMPNDARFP MMV A DLPDCIKKRLDNGD TVE+ELVAARI +W++ESS+P
Sbjct: 405  SQNGGYVQLMPNDARFPIMMVLAGDLPDCIKKRLDNGDVTVENELVAARIYDWVKESSSP 464

Query: 388  KALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRN 447
            +A V+HVLGRGS +ESHIDAILF+NAI TCEFS DSLSC+PHTPWKIP EEL+CRRD+RN
Sbjct: 465  RAHVLHVLGRGSEVESHIDAILFENAIRTCEFSHDSLSCIPHTPWKIPHEELRCRRDIRN 524

Query: 448  LCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHL 507
            LCIFTIDP SA DLDDA SVQKLAN IFRVGIH+ADVS+FVLPDTALDKEA+IRSTSV+L
Sbjct: 525  LCIFTIDPSSASDLDDALSVQKLANDIFRVGIHIADVSYFVLPDTALDKEAQIRSTSVYL 584

Query: 508  LRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGH 567
            L+RKIPMLPPLLSENIGSLNPGVDRLAFSLFLDIN CGDV+D WI RTVIC CCKLSY +
Sbjct: 585  LQRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINGCGDVKDYWIERTVICCCCKLSYEY 644

Query: 568  AQDII------DSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIE 627
            AQDII      DS ++ G+  PQLHGQ TW D+ISSV+ L+EISKTLKEKRFRDGALR+E
Sbjct: 645  AQDIIDGLIDSDSPEIFGNNCPQLHGQFTWHDVISSVKLLHEISKTLKEKRFRDGALRLE 704

Query: 628  NPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKP 687
            N K+++LYDE+GIPYDS F+E+KDSNFLVEEF+LLANRTVAEVISRTFPD ALLRRHP+P
Sbjct: 705  NSKLIYLYDEYGIPYDSMFYEQKDSNFLVEEFMLLANRTVAEVISRTFPDSALLRRHPEP 764

Query: 688  IFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLAS 747
            + +KL EFESFCSK GFELDTSSS  FQQSLEQIR K HDDPLLFD +ISYATR  QLA+
Sbjct: 765  MLRKLREFESFCSKHGFELDTSSSVHFQQSLEQIRTKLHDDPLLFDILISYATRPMQLAT 824

Query: 748  YFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQ---- 807
            YFC+GELKDGE  ++Y+LAVP YTHFTSPLRRY DIVVHRTLAAA+EAE++YLKHQ    
Sbjct: 825  YFCSGELKDGEKRNHYALAVPLYTHFTSPLRRYPDIVVHRTLAAAIEAEKVYLKHQGIIQ 884

Query: 808  ---SDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASK 867
               SD+ MRCFTG+YFDKDAADSLEGREALS AAL+HGVPCSKLLSDVA+ CN+RKLASK
Sbjct: 885  KVNSDKEMRCFTGIYFDKDAADSLEGREALSFAALKHGVPCSKLLSDVALHCNDRKLASK 944

Query: 868  HAADACDKLYMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEW 927
            H AD C+KLYMWALLK+K+ILFSDA+VLGLG +FM++YIQKLA        EV+ LA EW
Sbjct: 945  HIADGCEKLYMWALLKKKRILFSDARVLGLGPRFMSVYIQKLAIERRIYYDEVEGLAVEW 1004

Query: 928  LDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTL----EGEASTEG 987
            LD TSTLVLSF  +RRSH SR S+KWKALEDVALVISPCD  V + TL     G AS  G
Sbjct: 1005 LDTTSTLVLSF-CSRRSHRSRGSVKWKALEDVALVISPCDQNVNKRTLGVCPNGGASKGG 1064

Query: 988  AAA--SDSGI--------IEPAVFPLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSYL 998
            +AA   DS +        ++PAVFPLTV+LLST+PVALHAVGGDDG I+IGVRLYMSSYL
Sbjct: 1065 SAAVEQDSNLKSHVSDIGVDPAVFPLTVRLLSTIPVALHAVGGDDGPIDIGVRLYMSSYL 1124

BLAST of Cp4.1LG01g20370 vs. NCBI nr
Match: gi|449446430|ref|XP_004140974.1| (PREDICTED: DIS3-like exonuclease 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 571/782 (73.02%), Positives = 656/782 (83.89%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHD------------- 327
            IGRICA+I+L+P KRPTG+VV IL+KSR R  +VG LNVKKFLSF +             
Sbjct: 345  IGRICALINLYPAKRPTGRVVTILEKSRLRENVVGHLNVKKFLSFQEFYVKESTKSCLSP 404

Query: 328  ---CGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAP 387
               CGYVQLMPNDARFP MMV A DLP+CIKKRLDNGD TVE+ELVAARI EW++ESS+P
Sbjct: 405  SQNCGYVQLMPNDARFPIMMVLAGDLPNCIKKRLDNGDVTVENELVAARIYEWVKESSSP 464

Query: 388  KALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRN 447
            +A V+HVLGRG+ +ESHIDAILF+NAI TCEFS+DSLSC+P TPWKIP EELQCRRD+RN
Sbjct: 465  RAHVLHVLGRGNEVESHIDAILFENAIRTCEFSQDSLSCVPQTPWKIPPEELQCRRDIRN 524

Query: 448  LCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHL 507
            LCIFTIDP SA DLDDA SVQ+LANGIFRVGIH+ADVS+FVLPDTALDKEA+IRSTSV+L
Sbjct: 525  LCIFTIDPSSASDLDDALSVQRLANGIFRVGIHIADVSYFVLPDTALDKEAQIRSTSVYL 584

Query: 508  LRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGH 567
            L+RKIPMLPPLLSE+IGSLNPGVDRLAFSLFLDIN CGDV+D WI RTVIC CCKLSY H
Sbjct: 585  LQRKIPMLPPLLSESIGSLNPGVDRLAFSLFLDINSCGDVKDFWIERTVICCCCKLSYEH 644

Query: 568  AQDII------DSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIE 627
            AQDII      DSS++ G+  PQLHGQ TW D+ISSV+ L+EISKT+KEKRFR+GALR+E
Sbjct: 645  AQDIIDGLIDSDSSELFGNNCPQLHGQFTWHDVISSVKLLHEISKTVKEKRFRNGALRLE 704

Query: 628  NPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKP 687
            N K+++LYDE+GIPYDS F+E+KDSNFLVEEF+LLANRTVAEVISRTFPD ALLRRHP+P
Sbjct: 705  NSKLIYLYDEYGIPYDSMFYEQKDSNFLVEEFMLLANRTVAEVISRTFPDSALLRRHPEP 764

Query: 688  IFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLAS 747
            + +KL EFE+FCSK GFELDTSSS  FQQSLEQIR++  DDPLLFD +ISYATR  QLA+
Sbjct: 765  MLRKLREFETFCSKHGFELDTSSSVHFQQSLEQIRIELQDDPLLFDILISYATRPMQLAT 824

Query: 748  YFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQ---- 807
            YFC+GELKDGE  S+Y+LAVP YTHFTSPLRRY DIVVHRTLAAA+EAE++YLKH+    
Sbjct: 825  YFCSGELKDGETRSHYALAVPLYTHFTSPLRRYPDIVVHRTLAAAIEAEKMYLKHKGVIQ 884

Query: 808  ---SDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASK 867
               S+E  RCFTG+YFDKDAADSLEGREALSSAAL+HGVPCSKLL DVA+ CN+RKLASK
Sbjct: 885  KVNSNEETRCFTGIYFDKDAADSLEGREALSSAALKHGVPCSKLLLDVALHCNDRKLASK 944

Query: 868  HAADACDKLYMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEW 927
            H AD  +KLYMWALLK+K+ILFSDA+VLGLG +FM++YIQKLA        EV+ LA EW
Sbjct: 945  HVADGIEKLYMWALLKKKKILFSDARVLGLGPRFMSVYIQKLAIERRIYYDEVEGLAVEW 1004

Query: 928  LDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTL----EGEASTEG 987
            L+ TSTLVL F  +RRSH SR S+KWKALEDVALVISPCD  V++ TL     G AS  G
Sbjct: 1005 LETTSTLVLRFFCSRRSHRSRGSVKWKALEDVALVISPCDQNVKERTLGVSSNGGASKGG 1064

Query: 988  AA-----------ASDSGIIEPAVFPLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 998
            +A            SD+G ++PA+FPLTV+LLST+PVALHAVGGDDG I+IGVRLYMSSY
Sbjct: 1065 SAVVEQDSNLKSHVSDTG-VDPAIFPLTVRLLSTIPVALHAVGGDDGPIDIGVRLYMSSY 1124

BLAST of Cp4.1LG01g20370 vs. NCBI nr
Match: gi|778710143|ref|XP_011656525.1| (PREDICTED: DIS3-like exonuclease 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 571/782 (73.02%), Positives = 656/782 (83.89%), Query Frame = 1

Query: 268  IGRICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHD------------- 327
            IGRICA+I+L+P KRPTG+VV IL+KSR R  +VG LNVKKFLSF +             
Sbjct: 293  IGRICALINLYPAKRPTGRVVTILEKSRLRENVVGHLNVKKFLSFQEFYVKESTKSCLSP 352

Query: 328  ---CGYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAP 387
               CGYVQLMPNDARFP MMV A DLP+CIKKRLDNGD TVE+ELVAARI EW++ESS+P
Sbjct: 353  SQNCGYVQLMPNDARFPIMMVLAGDLPNCIKKRLDNGDVTVENELVAARIYEWVKESSSP 412

Query: 388  KALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRN 447
            +A V+HVLGRG+ +ESHIDAILF+NAI TCEFS+DSLSC+P TPWKIP EELQCRRD+RN
Sbjct: 413  RAHVLHVLGRGNEVESHIDAILFENAIRTCEFSQDSLSCVPQTPWKIPPEELQCRRDIRN 472

Query: 448  LCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHL 507
            LCIFTIDP SA DLDDA SVQ+LANGIFRVGIH+ADVS+FVLPDTALDKEA+IRSTSV+L
Sbjct: 473  LCIFTIDPSSASDLDDALSVQRLANGIFRVGIHIADVSYFVLPDTALDKEAQIRSTSVYL 532

Query: 508  LRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGH 567
            L+RKIPMLPPLLSE+IGSLNPGVDRLAFSLFLDIN CGDV+D WI RTVIC CCKLSY H
Sbjct: 533  LQRKIPMLPPLLSESIGSLNPGVDRLAFSLFLDINSCGDVKDFWIERTVICCCCKLSYEH 592

Query: 568  AQDII------DSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIE 627
            AQDII      DSS++ G+  PQLHGQ TW D+ISSV+ L+EISKT+KEKRFR+GALR+E
Sbjct: 593  AQDIIDGLIDSDSSELFGNNCPQLHGQFTWHDVISSVKLLHEISKTVKEKRFRNGALRLE 652

Query: 628  NPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKP 687
            N K+++LYDE+GIPYDS F+E+KDSNFLVEEF+LLANRTVAEVISRTFPD ALLRRHP+P
Sbjct: 653  NSKLIYLYDEYGIPYDSMFYEQKDSNFLVEEFMLLANRTVAEVISRTFPDSALLRRHPEP 712

Query: 688  IFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLAS 747
            + +KL EFE+FCSK GFELDTSSS  FQQSLEQIR++  DDPLLFD +ISYATR  QLA+
Sbjct: 713  MLRKLREFETFCSKHGFELDTSSSVHFQQSLEQIRIELQDDPLLFDILISYATRPMQLAT 772

Query: 748  YFCNGELKDGENGSYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKHQ---- 807
            YFC+GELKDGE  S+Y+LAVP YTHFTSPLRRY DIVVHRTLAAA+EAE++YLKH+    
Sbjct: 773  YFCSGELKDGETRSHYALAVPLYTHFTSPLRRYPDIVVHRTLAAAIEAEKMYLKHKGVIQ 832

Query: 808  ---SDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASK 867
               S+E  RCFTG+YFDKDAADSLEGREALSSAAL+HGVPCSKLL DVA+ CN+RKLASK
Sbjct: 833  KVNSNEETRCFTGIYFDKDAADSLEGREALSSAALKHGVPCSKLLLDVALHCNDRKLASK 892

Query: 868  HAADACDKLYMWALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEW 927
            H AD  +KLYMWALLK+K+ILFSDA+VLGLG +FM++YIQKLA        EV+ LA EW
Sbjct: 893  HVADGIEKLYMWALLKKKKILFSDARVLGLGPRFMSVYIQKLAIERRIYYDEVEGLAVEW 952

Query: 928  LDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTL----EGEASTEG 987
            L+ TSTLVL F  +RRSH SR S+KWKALEDVALVISPCD  V++ TL     G AS  G
Sbjct: 953  LETTSTLVLRFFCSRRSHRSRGSVKWKALEDVALVISPCDQNVKERTLGVSSNGGASKGG 1012

Query: 988  AA-----------ASDSGIIEPAVFPLTVQLLSTLPVALHAVGGDDGAIEIGVRLYMSSY 998
            +A            SD+G ++PA+FPLTV+LLST+PVALHAVGGDDG I+IGVRLYMSSY
Sbjct: 1013 SAVVEQDSNLKSHVSDTG-VDPAIFPLTVRLLSTIPVALHAVGGDDGPIDIGVRLYMSSY 1072

BLAST of Cp4.1LG01g20370 vs. NCBI nr
Match: gi|694434791|ref|XP_009344602.1| (PREDICTED: DIS3-like exonuclease 2 isoform X4 [Pyrus x bretschneideri])

HSP 1 Score: 1091.3 bits (2821), Expect = 0.0e+00
Identity = 591/1086 (54.42%), Positives = 751/1086 (69.15%), Query Frame = 1

Query: 1    MRGSVEQSTSERNEDGGKEWKKKDRSNWRSEQKASIWRAVSCSSVNEIPR-EASECMENC 60
            M+ +V Q+  ER EDG KE KK  R + RS QK S     S +S    PR E SEC+ N 
Sbjct: 1    MKDAVVQAVVERVEDGDKEKKKNRRPSRRSRQKNS----TSAAS----PRTEVSECLANG 60

Query: 61   RIDANLTEPSFYSSLTQDECQSNQPTEPGFTGRNKLSFSSLSPLHIGQQAEWSQ-NLRNQ 120
            RI  ++T     +SL Q +   + P E G    + L+F+SL  +HI +Q       +   
Sbjct: 61   RISNHVT-----TSLKQPQLDMHPPDEQGTIKASNLAFNSLPTVHINEQVNHEDVQISVN 120

Query: 121  HHSMD---TGRWAITSCPEQIASGSMPWISMNQHSPP--ADVNSQRKYFTSHWPLDDVNR 180
             HS+     GR+   SCPE +A G  P   M +  PP  A+  ++RK FTSHWP++ VN 
Sbjct: 121  QHSLPCDPAGRFISNSCPEPVACGGSP--GMFKDFPPHHAESYARRKCFTSHWPMEAVND 180

Query: 181  GLQKGDIFKALFRMNAHYRVEAYCKIDGIPVDVLIYGSASQNRAVEGDIVAMKMNPFSLW 240
             L+KGD FKALFR+NAH R EAYCKIDG+P D+LI G A QNRA+EGDIVA+K++P +LW
Sbjct: 181  ALEKGDAFKALFRVNAHNRFEAYCKIDGVPTDILIDGLAEQNRAMEGDIVAIKVDPLALW 240

Query: 241  SRMKGTSEAHDNMHSLEDAN--VMSDSFWSSPSVDPIG---------------------- 300
            +RMKG++    +   +ED N    ++        +PIG                      
Sbjct: 241  TRMKGSAGTCTSSALVEDFNLPQEANEISGLNCKEPIGQTSYDHVAGRYPTASDFLQEGS 300

Query: 301  ------------RICAVIDLFPTKRPTGKVVAILKKSRQRGTIVGLLNVKKFLSFHDC-- 360
                        RICA+I  FP+KRPTG+VVAI+++S +R T+VG L+VKK++++ +   
Sbjct: 301  HGEQNEVAVAVERICAMISSFPSKRPTGRVVAIIERSPRRETVVGFLHVKKWIAYREVCR 360

Query: 361  --------------GYVQLMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARI 420
                            +Q+ P D RFP M+V    LPD IKKRL+NGD ++E EL AARI
Sbjct: 361  NNMRKNKNALFSSDECIQMTPTDPRFPKMVVLVRTLPDSIKKRLENGDESIEMELFAARI 420

Query: 421  DEWLEESSAPKALVMHVLGRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQE 480
            DEW EESSAP+A++++  GR   ++  ++AILFQN+I + +FS +SLSCLPH  W++PQE
Sbjct: 421  DEWDEESSAPQAVILNAFGRAGEVQPQLEAILFQNSINSSDFSHESLSCLPHLTWEVPQE 480

Query: 481  ELQCRRDLRNLCIFTIDPPSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKE 540
            E++ R+DLRNLCI TIDP +A DLDDA SV+KL+NGIFRVGIH+ADVS+FVLPDT LD+E
Sbjct: 481  EIKSRKDLRNLCILTIDPSTATDLDDALSVEKLSNGIFRVGIHIADVSYFVLPDTQLDEE 540

Query: 541  ARIRSTSVHLLRRKIPMLPPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVI 600
            A+ RS SV++ +RK+PMLPP+LSENIGSLNPGV+RLAFS+FLDINH GDV D WIGRTVI
Sbjct: 541  AQSRSASVYMSQRKLPMLPPVLSENIGSLNPGVERLAFSIFLDINHVGDVVDRWIGRTVI 600

Query: 601  CSCCKLSYGHAQDII------DSSKVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEK 660
             SCCKLSY HAQ+II      +SS +LG+  PQLHG   W D+I SV+ L EIS+ LKE+
Sbjct: 601  RSCCKLSYEHAQEIINGKLNLESSNILGNGCPQLHGHFEWSDVIRSVKDLLEISRVLKER 660

Query: 661  RFRDGALRIENPKIVFLYDEFGIPYDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPD 720
            RF DGAL++E+ K+V L+DE G+PYDS + ERKDSNFLVEEF+LLAN T AEVISR FP+
Sbjct: 661  RFSDGALQLESSKVVILFDECGVPYDSMYSERKDSNFLVEEFMLLANTTAAEVISRAFPE 720

Query: 721  RALLRRHPKPIFKKLTEFESFCSKQGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVIS 780
             ALLRRHP+P  +KL EFE+FCSK G ELDTSSS  FQQSL +IR +  DD +LF+ ++S
Sbjct: 721  SALLRRHPEPNMRKLREFEAFCSKHGLELDTSSSGQFQQSLLRIREELKDDAVLFNILMS 780

Query: 781  YATRSTQLASYFCNGELKDGENG-SYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAE 840
            YA +  QLASYFC+GELKD EN   +Y LAVP YTHFTSPLRRY DIVVHRTLAAAVEAE
Sbjct: 781  YAAKPMQLASYFCSGELKDRENDWGHYGLAVPLYTHFTSPLRRYPDIVVHRTLAAAVEAE 840

Query: 841  ELYLKHQ---------SDERMRCFTGMYFDKDAADSLEGREALSSAALRHGVPCSKLLSD 900
            EL+LKH+          + +MRCFTG+YFDK+ A+S E +EALS+AA++H VPCS++L+D
Sbjct: 841  ELFLKHRRLLNNFSRGDEVKMRCFTGIYFDKNVAESCEIKEALSAAAIKHRVPCSEMLTD 900

Query: 901  VAVQCNNRKLASKHAADACDKLYMWALLKEK--QILFSDAKVLGLGSKFMTLYIQKLA-- 960
            VA  CN RKLAS+H  DACDKLYMWALLK+K  QI+FS+A+VLG+G +FM++YI KLA  
Sbjct: 901  VAAYCNERKLASRHVRDACDKLYMWALLKKKELQIVFSEARVLGVGPRFMSIYIHKLAVE 960

Query: 961  ------EVKDLANEWLDATSTLVLSFPDTRRSHGSRDSIKWKALEDVALVISPCDLTVQQ 997
                  EV+ L  EWLDATSTLVL+    RRSH      KW+ALEDVALV+ P DL  + 
Sbjct: 961  RRIYYDEVEGLMVEWLDATSTLVLNLCSNRRSHRRGSPGKWRALEDVALVVRPYDLKAEL 1020

BLAST of Cp4.1LG01g20370 vs. NCBI nr
Match: gi|302141847|emb|CBI19050.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 978.4 bits (2528), Expect = 9.4e-282
Identity = 541/1053 (51.38%), Positives = 693/1053 (65.81%), Query Frame = 1

Query: 1    MRGSVEQSTSERNEDGGKEWKKKDRSNWRSEQKASIWRAVSCSSVNEIPREASECMENCR 60
            MRG VEQS  ER EDG KE KKK R   RS+Q AS   + +CSS NE+  E SEC+ N  
Sbjct: 1    MRGVVEQSVVERCEDGDKE-KKKRRRPRRSKQNASA--SATCSSANEMRGEVSECLANGS 60

Query: 61   IDANLTEPSFYSSLTQDECQSNQPTEPGFTGRNKLSFSSLSPLHIGQQAEWSQ--NLRNQ 120
            I    T    YSS  Q   +++     G    + ++F+SL  +H+ +QA  ++  ++ NQ
Sbjct: 61   ISNYDTTSMSYSSSKQGGLETDPLDNHGLHKASDVAFTSLPTMHLNEQALHAEVGSMNNQ 120

Query: 121  H--HSMDTGRWAITSCPEQI--ASGSMPWISMNQHSPPADVN-SQRKYFTSHWPLDDVNR 180
            H   S  +G     SCP  I        + + N  SP  D   +QRKYFT HW  + VN 
Sbjct: 121  HIFPSDPSGGMCSKSCPVPIDCEQSIQSFTNKNVLSPYQDEGCAQRKYFTPHWSTEVVNE 180

Query: 181  GLQKGDIFKALFRMNAHYRVEAYCKIDGIPVDVLIYGSASQNRAVEGDIVAMKMNPFSLW 240
             L+KG++F+A FR+NA+ R+EAYC I+G+  DVLI G ASQNRAVEGDIVA+K++PFSLW
Sbjct: 181  ALEKGNVFRASFRVNAYNRLEAYCTIEGVKTDVLISGLASQNRAVEGDIVAVKVDPFSLW 240

Query: 241  SRMKGTSEAHDNMHSLEDANVMSDS-----------------FWSSPSVDPIGRICAVID 300
            SRMKG++   +N         M  +                 F    ++D + +ICA I+
Sbjct: 241  SRMKGSTVFPNNAAENISQEPMGHNHVNGHHPPVFGPSHVSCFGERSNMDSLEKICAAIN 300

Query: 301  LFPTKRPTGKVVAILKKSRQRGTIVGLLNVKK-----------------FLSFHDCGYVQ 360
             FP+KRPTG VVAI+++S +R  +VG L+VK+                 +LS  D  Y+Q
Sbjct: 301  SFPSKRPTGSVVAIIERSPRRVAVVGFLSVKQWLSSRVLHRKGTKMNKTYLSLSDSEYIQ 360

Query: 361  LMPNDARFPTMMVFAVDLPDCIKKRLDNGDATVESELVAARIDEWLEESSAPKALVMHVL 420
            L P D +FP M+V    L DCIKKRL++GDA++E ELVAA+I +W EESS P A VMH+ 
Sbjct: 361  LTPTDPKFPKMVVPVKGLSDCIKKRLEDGDASMEMELVAAQISDWGEESSLPLAHVMHIF 420

Query: 421  GRGSRIESHIDAILFQNAILTCEFSRDSLSCLPHTPWKIPQEELQCRRDLRNLCIFTIDP 480
            GRG  IE  I AILF+NAI   EFS +SLSCLPH PWK+PQEE++ RRDLRNLCIFTIDP
Sbjct: 421  GRGGEIEPRIAAILFENAIRPSEFSPESLSCLPHIPWKVPQEEIERRRDLRNLCIFTIDP 480

Query: 481  PSALDLDDAFSVQKLANGIFRVGIHVADVSHFVLPDTALDKEARIRSTSVHLLRRKIPML 540
             +A DLDDA SV+KL+ G FRVG+H+AD S+FVLPD  LD+EA+ RSTSV+LL+ K+PML
Sbjct: 481  STATDLDDALSVEKLSGGNFRVGVHIADASYFVLPDGVLDREAQSRSTSVYLLQHKLPML 540

Query: 541  PPLLSENIGSLNPGVDRLAFSLFLDINHCGDVEDCWIGRTVICSCCKLSYGHAQDIIDSS 600
            PPLLSEN+GSL PGVDRLAFS+F DIN  GDV D WIGRTVI SCCKLSY HAQ IID  
Sbjct: 541  PPLLSENLGSLIPGVDRLAFSIFWDINLAGDVVDRWIGRTVIQSCCKLSYEHAQGIIDGM 600

Query: 601  KVLGHCVPQLHGQSTWLDIISSVRTLNEISKTLKEKRFRDGALRIENPKIVFLYDEFGIP 660
              +              ++I S++ L  ISKTL+  RF DGAL ++  K++ L+DE G  
Sbjct: 601  FDV--------------EVIRSIKYLYAISKTLRANRFNDGALLLDGAKVILLFDEHG-- 660

Query: 661  YDSTFHERKDSNFLVEEFVLLANRTVAEVISRTFPDRALLRRHPKPIFKKLTEFESFCSK 720
                                    T AE+ISR FPD ALLRRHP+P  +KL EFE+FCSK
Sbjct: 661  ------------------------TAAEIISRAFPDNALLRRHPEPNLRKLREFEAFCSK 720

Query: 721  QGFELDTSSSFLFQQSLEQIRMKFHDDPLLFDAVISYATRSTQLASYFCNGELKDGEN-G 780
             G ELDTSSS  F  SLEQIR K  +D +LFD ++SYA+R  QLA+YFC+G+LKD +N  
Sbjct: 721  HGLELDTSSSGQFNHSLEQIREKLKNDSVLFDILLSYASRPMQLATYFCSGDLKDNKNEW 780

Query: 781  SYYSLAVPWYTHFTSPLRRYADIVVHRTLAAAVEAEELYLKH--------QSDERMRCFT 840
            S+Y+LAVP YTHFTSPLRRY DI+VHRTLAAA+EAEELYLKH          +E  RCFT
Sbjct: 781  SHYALAVPLYTHFTSPLRRYPDIIVHRTLAAAIEAEELYLKHGAKIQKVKNGEEMRRCFT 840

Query: 841  GMYFDKDAADSLEGREALSSAALRHGVPCSKLLSDVAVQCNNRKLASKHAADACDKLYMW 900
            G++FDK+AA+S+EG++ALS AA +H +PC+++L+DV   CN RKLAS+HA D C++LYMW
Sbjct: 841  GIHFDKNAAESVEGQKALSVAASKHRLPCTEILADVVAYCNERKLASRHAKDGCERLYMW 900

Query: 901  ALLKEKQILFSDAKVLGLGSKFMTLYIQKLA--------EVKDLANEWLDATSTLVLSFP 960
             LLK+K++L S+A+VLGLG +FM++YI KL         EV+ L  EWLDATSTLV++  
Sbjct: 901  VLLKKKEVLLSEARVLGLGPRFMSIYIHKLGIERRIYYDEVEGLTVEWLDATSTLVVNLS 960

Query: 961  DTRRSHGSRDSIKWKALEDVALVISPCDLTVQQSTLEGEASTEGAAASDSGIIEPAVFPL 996
              + S    +  K++ LEDVA VI PC+L       E +A    +   D+  I+P  FPL
Sbjct: 961  TNKCSRWRGNQGKYRQLEDVAWVIRPCNL-----KQEVDACMSESGVPDANEIDPLFFPL 1005
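
The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal sketch that re-derives them for the Vitis vinifera hit above; the function names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool, and the input string is copied from the summary line of this alignment:

```python
import re

# Matches fields such as "Identity = 541/1053 (51.38%)" in an HSP summary line.
HSP_STAT = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {field: (matches, alignment_length, reported_percent)}."""
    stats = {}
    for name, hits, length, pct in HSP_STAT.findall(line):
        stats[name] = (int(hits), int(length), float(pct))
    return stats


def percent(hits, length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * hits / length, 2)


line = "Identity = 541/1053 (51.38%), Positives = 693/1053 (65.81%), Query Frame = 1"
stats = parse_hsp_stats(line)
for name, (hits, length, reported) in stats.items():
    # Compare the recomputed value with the one printed in the report.
    print(name, percent(hits, length), reported)
```

The alignment length (1053) counts gap columns as well, which is why it exceeds the length of the query protein.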

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
DI3L2_ARATH  5.2e-232  55.10     DIS3-like exonuclease 2 OS=Arabidopsis thaliana GN=SOV PE=1 SV=1
DI32L_ARATH  5.8e-231  54.97     Inactive exonuclease DIS3L2 OS=Arabidopsis thaliana GN=SOV PE=2 SV=1
DI3L2_SCHPO  2.0e-106  34.40     DIS3-like exonuclease 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) G...
DI3L2_XENTR  4.4e-98   42.48     DIS3-like exonuclease 2 OS=Xenopus tropicalis GN=dis3l2 PE=2 SV=2
DI3L2_MOUSE  2.9e-97   41.52     DIS3-like exonuclease 2 OS=Mus musculus GN=Dis3l2 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0KC80_CUCSA  0.0e+00   73.02     DIS3-like exonuclease 2 OS=Cucumis sativus GN=Csa_6G040560 PE=3 SV=1
M5XPD1_PRUPE      2.5e-273  56.55     DIS3-like exonuclease 2 OS=Prunus persica GN=PRUPE_ppa015523mg PE=3 SV=1
A0A067K983_JATCU  1.6e-259  57.35     DIS3-like exonuclease 2 OS=Jatropha curcas GN=JCGZ_12063 PE=3 SV=1
W9QVR2_9ROSA      1.1e-257  59.13     DIS3-like exonuclease 2 OS=Morus notabilis GN=L484_020163 PE=3 SV=1
B9GUD6_POPTR      3.6e-256  58.13     DIS3-like exonuclease 2 OS=Populus trichocarpa GN=POPTR_0002s08690g PE=3 SV=2

Match Name   E-value   Identity  Description
AT1G77680.1  3.2e-232  54.97     Ribonuclease II/R family protein
AT2G17510.2  7.5e-80   29.97     ribonuclease II family protein
AT5G02250.1  5.5e-14   33.59     Ribonuclease II/R family protein

Match Name                        E-value   Identity  Description
gi|659089736|ref|XP_008445669.1|  0.0e+00   74.26     PREDICTED: LOW QUALITY PROTEIN: DIS3-like exonuclease 2 [Cucumis melo]
gi|449446430|ref|XP_004140974.1|  0.0e+00   73.02     PREDICTED: DIS3-like exonuclease 2 isoform X1 [Cucumis sativus]
gi|778710143|ref|XP_011656525.1|  0.0e+00   73.02     PREDICTED: DIS3-like exonuclease 2 isoform X2 [Cucumis sativus]
gi|694434791|ref|XP_009344602.1|  0.0e+00   54.42     PREDICTED: DIS3-like exonuclease 2 isoform X4 [Pyrus x bretschneideri]
gi|302141847|emb|CBI19050.3|      9.4e-282  51.38     unnamed protein product [Vitis vinifera]
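
Hits in these summary tables are listed from lowest to highest E-value. A minimal sketch of that ordering, using the five NCBI nr rows as input; the helper `rank_hits` and the tie-break on identity are assumptions made for illustration, not a description of how the site sorts its tables:

```python
# (match name, E-value as printed, percent identity) for the NCBI nr hits.
hits = [
    ("gi|659089736|ref|XP_008445669.1|", "0.0e+00", 74.26),
    ("gi|449446430|ref|XP_004140974.1|", "0.0e+00", 73.02),
    ("gi|778710143|ref|XP_011656525.1|", "0.0e+00", 73.02),
    ("gi|694434791|ref|XP_009344602.1|", "0.0e+00", 54.42),
    ("gi|302141847|emb|CBI19050.3|", "9.4e-282", 51.38),
]


def rank_hits(rows):
    """Sort by E-value ascending; break ties by identity descending (assumed)."""
    return sorted(rows, key=lambda row: (float(row[1]), -row[2]))


ranked = rank_hits(hits)
for name, evalue, identity in ranked:
    print(name, evalue, identity)
```

An E-value printed as 0.0e+00 means the value was too small for the report to display, so several hits can tie at zero and a secondary key is needed to order them.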
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0034427  nuclear-transcribed mRNA catabolic process, exonucleolytic, 3'-5'
Vocabulary: Molecular Function
Term        Definition
GO:0000175  3'-5'-exoribonuclease activity
Vocabulary: INTERPRO
Term        Definition
IPR012340   NA-bd_OB-fold
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0034427 nuclear-transcribed mRNA catabolic process, exonucleolytic, 3'-5'
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0051252 regulation of RNA metabolic process
biological_process GO:0090503 RNA phosphodiester bond hydrolysis, exonucleolytic
cellular_component GO:0005575 cellular_component
molecular_function GO:0000175 3'-5'-exoribonuclease activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g20370.1  Cp4.1LG01g20370.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                Source   Source Term    Source Description             Alignment
IPR012340  Nucleic acid-binding, OB-fold  unknown  SSF50249       Nucleic acid-binding proteins  coord: 159..250, score: 2.14E-26; coord: 317..381, score: 2.14E-26; coord: 386..784, score: 3.51E-114; coord: 822..855, score: 3.51E
None       No IPR available               PANTHER  PTHR23355      RIBONUCLEASE                   coord: 803..997, score: 0.0; coord: 98..771, score:
None       No IPR available               PANTHER  PTHR23355:SF9  DIS3-LIKE EXONUCLEASE 2        coord: 98..771, score: 0.0; coord: 803..997, score:
None       No IPR available               PFAM     PF00773        RNB                            coord: 426..771, score: 9.6
None       No IPR available               SMART    SM00955        RNB_2                          coord: 426..773, score: 8.2E

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g20370  Cp4.1LG13g04830  Cucurbita pepo (Zucchini)  cpecpeB199