Cla021389 (gene) Watermelon (97103) v1

Name: Cla021389
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Oxoprolinase family protein (AHRD V1 ***- C1FF97_MICSR); contains InterPro domain(s) IPR003692 Hydantoinase B/oxoprolinase
Location: Chr5 : 2758457 .. 2762411 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAGTAATAATGAAGACAAACTCCGATTTTGCATCGATAGAGGAGGCACATTCACTGATGTATACGCTGAAATTCCTGGCCGTCAAGATGGTAAAGTTATAAAGCTTCTATCTGTTGATCCATCGAATTATGAGGATGCTCCAGTTGAAGGAATTCGGAGGATTCTCGAAGAATATAGCGGCAAAAAAATCCCAAGGACGTCAAAAATACCCACTCAAAACATAGAGTGGATACGGATGGGAACAACTGTGGCAACTAATGCACTTCTGGAGAGAAAAGGAGAAAGGATTGCTCTTTGTGTCACTAAAGGTTTTAGAGATTTGCTTCAGATTGGCAACCAGGCCCGCCCGGATATATTTGATCTAACTGTCTCAAAACCATCAAATCTTTATGAGGATGTCATAGAAGTAGATGAGCGAGTCGAACTTATTCGAGGCAAGGGTGATGGTAATCAAGATTTTTCTACTTCATATGTAAAAGGAGTTTCTGGTGAGCTTATTCGCATTGTGAGGACTCTCAATGAAGAAGCTTTAAAGCCGTTACTGAAGGATCTCCTGCAAAGAGGCATTAGCTGTTTGGCAGTTGTCTTAATGCATTCATACACTTACCCACAACATGAATTGGCTCTGGAGAAATTAGCTCTGAGTATGGGCTTCAAACATGTTTCTTTGTCCTCAGCTTTGACCCCTATGGTTCGAGCTGTTCCACGTGGTCTTACAGCCAGTGTAGATGCATACTTGACTCCAGTCATCAAAGAGTACTTATCTGGATTCATGTCCAAATTTGATGAGAGCAGTGGGAAGGTGAATGTGCTATTTATGCAATCAGATGGAGGACTTGCACCAGAAAATAGATTTTCAGGCCACAAGGCAGTGTTATCTGGTCCAGCTGGTGGAGTTGTCGGTTACTCGCAAACACTTTTCGAGCTTGAGACCAAGAAGCCTCTCATTGGATTCGACATGGGTGGCACATCAACTGATGCTAGCCGCTATGCTGGAAGTTATGAACAGGTCCTGGAAACCCAGATTGCCGGTGCAATAATTCAAGCTCCTCAGCTTGATATCAACACTGTGGCAGCTGGTGGTGGCTCAAAATTGAAATTTCAATTTGGAGCTTTCCGCGTGGGACCCGAATCAGTGGGTGCACACCCTGGTCCAGTTTGTTACAGAAAAGGAGGTGAGCTAGCTGTTACTGATGCAAATCTGGTTTTAGGGTTTGTTATCCCTGATTTCTTTCCATCCATCTTTGGTCCCAACGAGGATCAGCCTCTAGATATTGAAGCCACTAGAGGAGAGTTTGAGAAGCTTGCAACAGAAATTAATTCTTACAGAAAAATCCAGGATCCATCCTCAAAGCCTATGACAATCGAGGAGATTGCTTTGGGCTTTGTAAATGTTGCAAATGAAACTATGTGCCGCCCAATCCGGCAACTGACTGAGATGAAGGGCCATGAGACAAAAAACCATGCTCTTGCTTGTTTTGGAGGTGCTGGACCTCAACATGCATGTGCTATTGCCAGGTTATTAGGTATGAAAGAGATATTTATTCATAGATTTTGTGGGATTTTAAGTGCTTATGGTATGGGACTGGCGGATGTTGTTGAAGAAGAACAGGAGCCGTACTCGGCTGTGTATTGTTCTGAGTCTGTCCAGGAGGTCTCTCGAAGAGAGGCAAGTCTACTAAAGCAAGTAAATTATAAGCTGCAGGGCCAAGGATTTAGAGAGGGAAGTATTAAAACTGAAACGTATTTGAATTTGCGGTATGAAGGTACGGACACTGCCATCATGGTAAAGAGCCAACAAGTGGATAATGGAGTAGAATTTGACTTTGCAGCTGAGTTTGAGAAGCTTTTCCAACAGGAGTATGGATTTAAATTACAGAATAGGAATGTTCTTATATGTGACATAAGAGTTCGTGGTATAGGAGTAACAAATGTATTGAAGCCACGAGCTTTCGAAGGACTTGCAGGTGACCCTAAAATTGAAGGCCACTACAG
GGTCTACTTTGGAAATGGATGGCAGGATACGCCTTTATTCAAGCTTGACAATCTAGGATTTGGTCATATCATCCCAGGGCCTGCTATCATTATGAATGGGAATAGTACTGTGATTGTCGAACCCAGCTGCAAAGCTACTATAACTAAATATGGAAACATCAAAATTGAAATAGATTCCACCTTTTCCACAGAAAAAGTATCAGAAAAGGTAGCTGACGTCGTCCAACTATCGATCTTCAATCACCGGTTTATGGGTATAGCTGAACAGATGGGAAGGACACTACAGAGAACTTCTATATCAACAAACATCAAGGAACGCCTAGATTTCTCTTGTGCCCTCTTTGGTCCTGATGGAGGATTAGTTGCCAATGCTCCCCATGTACCTGTCCACTTAGGAGCAATGTCCAGTACGGTTCGTTGGCAAATCGAGTACTGGGGTGACAATTTGAATGAGGGAGATGTATTGGTCACCAATCACCCATGTGCTGGAGGTAGCCATCTTCCTGATATAACAGTCATCACACCAGTATTTGATAATGGAAAATTGATATTTTTTGTCGCAAGTAGAGGGCACCACGCAGAGATCGGGGGCATTACTCCTGGAAGCATGCCACCATTTTCAAAATCCATCTGGGAAGAAGGAGCTGCAATAAAAGCATTCAAGCTTGTTGAAAAGGGAATTTTTCAAGAAGAAGGAATTATCAAGCTCCTGCAGTTCCCTAGTTCTGATGAAGGTGTCATCCCGGGAACTCGAAGACTCCAAGATAATTTATCTGATCTTCATGCACAAGTTGCGGCAAATCACAGAGGAATTTCACTAATCAAAGAGCTTATTGCCCAATATGGTTTAAATATTGTTCAGGCTTATATGACATATGTACAGCTTAACGCAGAAGAAGCAGTAAGGGAAATGCTGAAATCTGTTGCTTCTAGAGTTTCATCTAATTCAGCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGCATCTAATTCAGCAGGAACTGGGAAGGGAAGCTCCATAACAATCGAAGAAGAGGATTACATGGATGATGGTTCTGTAATTCATTTGAAACTCACAATTGATCCTCATAAAGGCGAAGCCAATTTTGATTTTAGTGGAACGAGTCCAGAAGTATATGGCAATTGGAACGCTCCAGAAGCAGTAACAGTGGCAGCAGTTATATACTGCCTTCGCTGCATGGTCGATGTCGATATTCCTCTCAACCAAGGCTGCTTGGCTCCTGTCAAGATATATATTCCACCAGGCTCATTCCTTTCCCCGAGCGAGAAGGCTGCGATTGTAGGGGGAAACGTTCTCACTTCTCAGCGAATAACCGATGTCATACTCACAGCATTTCAGGCATGTGCCTGTTCCCAAGGTTGTATGAACAACCTCACTTTTGGTGACGATACGTTTGGATACTACGAAACAATTGGTGGCGGAAGTGGAGCTGGTCCTTCTTGGCATGGAACAAGTGGAGTTCAATGCCATATGACCAATACTCGAATGACTGATCCCGAGATATTCGAGCAGCGATACCCTGTTCTTCTACATACATTTGCACTCAGAGAGAACAGTGGAGGAAGTGGAATCTACAAAGGAGGTGATGGGCTCGTTAGGGAGATCGAGTTCAAGCAGCCAGTGGTGGTGAGTATTCTTTCGGAGAGACGCGTCCATGCGCCGAGAGGTTTGAAAGGAGGAAAAAACGGTGCTCGTGGGGCTAATTTTCTGGTGAGAAAGGATGGAAGAAGAGTTTATCTTGGAGGCAAGAATACTGTAACAGTGAAAGCTGGGGAGATTCTTCAAATTTTAACTCCTGGTGGTGGTGGATGGGGTTGTCCTTAA

mRNA sequence

ATGGGAAGTAATAATGAAGACAAACTCCGATTTTGCATCGATAGAGGAGGCACATTCACTGATGTATACGCTGAAATTCCTGGCCGTCAAGATGGTAAAGTTATAAAGCTTCTATCTGTTGATCCATCGAATTATGAGGATGCTCCAGTTGAAGGAATTCGGAGGATTCTCGAAGAATATAGCGGCAAAAAAATCCCAAGGACGTCAAAAATACCCACTCAAAACATAGAGTGGATACGGATGGGAACAACTGTGGCAACTAATGCACTTCTGGAGAGAAAAGGAGAAAGGATTGCTCTTTGTGTCACTAAAGGTTTTAGAGATTTGCTTCAGATTGGCAACCAGGCCCGCCCGGATATATTTGATCTAACTGTCTCAAAACCATCAAATCTTTATGAGGATGTCATAGAAGTAGATGAGCGAGTCGAACTTATTCGAGGCAAGGGTGATGGTAATCAAGATTTTTCTACTTCATATGTAAAAGGAGTTTCTGGTGAGCTTATTCGCATTGTGAGGACTCTCAATGAAGAAGCTTTAAAGCCGTTACTGAAGGATCTCCTGCAAAGAGGCATTAGCTGTTTGGCAGTTGTCTTAATGCATTCATACACTTACCCACAACATGAATTGGCTCTGGAGAAATTAGCTCTGAGTATGGGCTTCAAACATGTTTCTTTGTCCTCAGCTTTGACCCCTATGGTTCGAGCTGTTCCACGTGGTCTTACAGCCAGTGTAGATGCATACTTGACTCCAGTCATCAAAGAGTACTTATCTGGATTCATGTCCAAATTTGATGAGAGCAGTGGGAAGGTGAATGTGCTATTTATGCAATCAGATGGAGGACTTGCACCAGAAAATAGATTTTCAGGCCACAAGGCAGTGTTATCTGGTCCAGCTGGTGGAGTTGTCGGTTACTCGCAAACACTTTTCGAGCTTGAGACCAAGAAGCCTCTCATTGGATTCGACATGGGTGGCACATCAACTGATGCTAGCCGCTATGCTGGAAGTTATGAACAGGTCCTGGAAACCCAGATTGCCGGTGCAATAATTCAAGCTCCTCAGCTTGATATCAACACTGTGGCAGCTGGTGGTGGCTCAAAATTGAAATTTCAATTTGGAGCTTTCCGCGTGGGACCCGAATCAGTGGGTGCACACCCTGGTCCAGTTTGTTACAGAAAAGGAGGTGAGCTAGCTGTTACTGATGCAAATCTGGTTTTAGGGTTTGTTATCCCTGATTTCTTTCCATCCATCTTTGGTCCCAACGAGGATCAGCCTCTAGATATTGAAGCCACTAGAGGAGAGTTTGAGAAGCTTGCAACAGAAATTAATTCTTACAGAAAAATCCAGGATCCATCCTCAAAGCCTATGACAATCGAGGAGATTGCTTTGGGCTTTGTAAATGTTGCAAATGAAACTATGTGCCGCCCAATCCGGCAACTGACTGAGATGAAGGGCCATGAGACAAAAAACCATGCTCTTGCTTGTTTTGGAGGTGCTGGACCTCAACATGCATGTGCTATTGCCAGGTTATTAGGTATGAAAGAGATATTTATTCATAGATTTTGTGGGATTTTAAGTGCTTATGGTATGGGACTGGCGGATGTTGTTGAAGAAGAACAGGAGCCGTACTCGGCTGTGTATTGTTCTGAGTCTGTCCAGGAGGTCTCTCGAAGAGAGGCAAGTCTACTAAAGCAAGTAAATTATAAGCTGCAGGGCCAAGGATTTAGAGAGGGAAGTATTAAAACTGAAACGTATTTGAATTTGCGGTATGAAGGTACGGACACTGCCATCATGGTAAAGAGCCAACAAGTGGATAATGGAGTAGAATTTGACTTTGCAGCTGAGTTTGAGAAGCTTTTCCAACAGGAGTATGGATTTAAATTACAGAATAGGAATGTTCTTATATGTGACATAAGAGTTCGTGGTATAGGAGTAACAAATGTATTGAAGCCACGAGCTTTCGAAGGACTTGCAGGTGACCCTAAAATTGAAGGCCACTACAG
GGTCTACTTTGGAAATGGATGGCAGGATACGCCTTTATTCAAGCTTGACAATCTAGGATTTGGTCATATCATCCCAGGGCCTGCTATCATTATGAATGGGAATAGTACTGTGATTGTCGAACCCAGCTGCAAAGCTACTATAACTAAATATGGAAACATCAAAATTGAAATAGATTCCACCTTTTCCACAGAAAAAGTATCAGAAAAGGTAGCTGACGTCGTCCAACTATCGATCTTCAATCACCGGTTTATGGGTATAGCTGAACAGATGGGAAGGACACTACAGAGAACTTCTATATCAACAAACATCAAGGAACGCCTAGATTTCTCTTGTGCCCTCTTTGGTCCTGATGGAGGATTAGTTGCCAATGCTCCCCATGTACCTGTCCACTTAGGAGCAATGTCCAGTACGGTTCGTTGGCAAATCGAGTACTGGGGTGACAATTTGAATGAGGGAGATGTATTGGTCACCAATCACCCATGTGCTGGAGGTAGCCATCTTCCTGATATAACAGTCATCACACCAGTATTTGATAATGGAAAATTGATATTTTTTGTCGCAAGTAGAGGGCACCACGCAGAGATCGGGGGCATTACTCCTGGAAGCATGCCACCATTTTCAAAATCCATCTGGGAAGAAGGAGCTGCAATAAAAGCATTCAAGCTTGTTGAAAAGGGAATTTTTCAAGAAGAAGGAATTATCAAGCTCCTGCAGTTCCCTAGTTCTGATGAAGGTGTCATCCCGGGAACTCGAAGACTCCAAGATAATTTATCTGATCTTCATGCACAAGTTGCGGCAAATCACAGAGGAATTTCACTAATCAAAGAGCTTATTGCCCAATATGCAGGAACTGGGAAGGGAAGCTCCATAACAATCGAAGAAGAGGATTACATGGATGATGGTTCTGTAATTCATTTGAAACTCACAATTGATCCTCATAAAGGCGAAGCCAATTTTGATTTTAGTGGAACGAGTCCAGAAGTATATGGCAATTGGAACGCTCCAGAAGCAGTAACAGTGGCAGCAGTTATATACTGCCTTCGCTGCATGGTCGATGTCGATATTCCTCTCAACCAAGGCTGCTTGGCTCCTGTCAAGATATATATTCCACCAGGCTCATTCCTTTCCCCGAGCGAGAAGGCTGCGATTGTAGGGGGAAACGTTCTCACTTCTCAGCGAATAACCGATGTCATACTCACAGCATTTCAGGCATGTGCCTGTTCCCAAGGTTGTATGAACAACCTCACTTTTGGTGACGATACGTTTGGATACTACGAAACAATTGGTGGCGGAAGTGGAGCTGGTCCTTCTTGGCATGGAACAAGTGGAGTTCAATGCCATATGACCAATACTCGAATGACTGATCCCGAGATATTCGAGCAGCGATACCCTGTTCTTCTACATACATTTGCACTCAGAGAGAACAGTGGAGGAAGTGGAATCTACAAAGGAGGTGATGGGCTCGTTAGGGAGATCGAGTTCAAGCAGCCAGTGGTGGTGAGTATTCTTTCGGAGAGACGCGTCCATGCGCCGAGAGGTTTGAAAGGAGGAAAAAACGGTGCTCGTGGGGCTAATTTTCTGGTGAGAAAGGATGGAAGAAGAGTTTATCTTGGAGGCAAGAATACTGTAACAGTGAAAGCTGGGGAGATTCTTCAAATTTTAACTCCTGGTGGTGGTGGATGGGGTTGTCCTTAA

Coding sequence (CDS)

ATGGGAAGTAATAATGAAGACAAACTCCGATTTTGCATCGATAGAGGAGGCACATTCACTGATGTATACGCTGAAATTCCTGGCCGTCAAGATGGTAAAGTTATAAAGCTTCTATCTGTTGATCCATCGAATTATGAGGATGCTCCAGTTGAAGGAATTCGGAGGATTCTCGAAGAATATAGCGGCAAAAAAATCCCAAGGACGTCAAAAATACCCACTCAAAACATAGAGTGGATACGGATGGGAACAACTGTGGCAACTAATGCACTTCTGGAGAGAAAAGGAGAAAGGATTGCTCTTTGTGTCACTAAAGGTTTTAGAGATTTGCTTCAGATTGGCAACCAGGCCCGCCCGGATATATTTGATCTAACTGTCTCAAAACCATCAAATCTTTATGAGGATGTCATAGAAGTAGATGAGCGAGTCGAACTTATTCGAGGCAAGGGTGATGGTAATCAAGATTTTTCTACTTCATATGTAAAAGGAGTTTCTGGTGAGCTTATTCGCATTGTGAGGACTCTCAATGAAGAAGCTTTAAAGCCGTTACTGAAGGATCTCCTGCAAAGAGGCATTAGCTGTTTGGCAGTTGTCTTAATGCATTCATACACTTACCCACAACATGAATTGGCTCTGGAGAAATTAGCTCTGAGTATGGGCTTCAAACATGTTTCTTTGTCCTCAGCTTTGACCCCTATGGTTCGAGCTGTTCCACGTGGTCTTACAGCCAGTGTAGATGCATACTTGACTCCAGTCATCAAAGAGTACTTATCTGGATTCATGTCCAAATTTGATGAGAGCAGTGGGAAGGTGAATGTGCTATTTATGCAATCAGATGGAGGACTTGCACCAGAAAATAGATTTTCAGGCCACAAGGCAGTGTTATCTGGTCCAGCTGGTGGAGTTGTCGGTTACTCGCAAACACTTTTCGAGCTTGAGACCAAGAAGCCTCTCATTGGATTCGACATGGGTGGCACATCAACTGATGCTAGCCGCTATGCTGGAAGTTATGAACAGGTCCTGGAAACCCAGATTGCCGGTGCAATAATTCAAGCTCCTCAGCTTGATATCAACACTGTGGCAGCTGGTGGTGGCTCAAAATTGAAATTTCAATTTGGAGCTTTCCGCGTGGGACCCGAATCAGTGGGTGCACACCCTGGTCCAGTTTGTTACAGAAAAGGAGGTGAGCTAGCTGTTACTGATGCAAATCTGGTTTTAGGGTTTGTTATCCCTGATTTCTTTCCATCCATCTTTGGTCCCAACGAGGATCAGCCTCTAGATATTGAAGCCACTAGAGGAGAGTTTGAGAAGCTTGCAACAGAAATTAATTCTTACAGAAAAATCCAGGATCCATCCTCAAAGCCTATGACAATCGAGGAGATTGCTTTGGGCTTTGTAAATGTTGCAAATGAAACTATGTGCCGCCCAATCCGGCAACTGACTGAGATGAAGGGCCATGAGACAAAAAACCATGCTCTTGCTTGTTTTGGAGGTGCTGGACCTCAACATGCATGTGCTATTGCCAGGTTATTAGGTATGAAAGAGATATTTATTCATAGATTTTGTGGGATTTTAAGTGCTTATGGTATGGGACTGGCGGATGTTGTTGAAGAAGAACAGGAGCCGTACTCGGCTGTGTATTGTTCTGAGTCTGTCCAGGAGGTCTCTCGAAGAGAGGCAAGTCTACTAAAGCAAGTAAATTATAAGCTGCAGGGCCAAGGATTTAGAGAGGGAAGTATTAAAACTGAAACGTATTTGAATTTGCGGTATGAAGGTACGGACACTGCCATCATGGTAAAGAGCCAACAAGTGGATAATGGAGTAGAATTTGACTTTGCAGCTGAGTTTGAGAAGCTTTTCCAACAGGAGTATGGATTTAAATTACAGAATAGGAATGTTCTTATATGTGACATAAGAGTTCGTGGTATAGGAGTAACAAATGTATTGAAGCCACGAGCTTTCGAAGGACTTGCAGGTGACCCTAAAATTGAAGGCCACTACAG
GGTCTACTTTGGAAATGGATGGCAGGATACGCCTTTATTCAAGCTTGACAATCTAGGATTTGGTCATATCATCCCAGGGCCTGCTATCATTATGAATGGGAATAGTACTGTGATTGTCGAACCCAGCTGCAAAGCTACTATAACTAAATATGGAAACATCAAAATTGAAATAGATTCCACCTTTTCCACAGAAAAAGTATCAGAAAAGGTAGCTGACGTCGTCCAACTATCGATCTTCAATCACCGGTTTATGGGTATAGCTGAACAGATGGGAAGGACACTACAGAGAACTTCTATATCAACAAACATCAAGGAACGCCTAGATTTCTCTTGTGCCCTCTTTGGTCCTGATGGAGGATTAGTTGCCAATGCTCCCCATGTACCTGTCCACTTAGGAGCAATGTCCAGTACGGTTCGTTGGCAAATCGAGTACTGGGGTGACAATTTGAATGAGGGAGATGTATTGGTCACCAATCACCCATGTGCTGGAGGTAGCCATCTTCCTGATATAACAGTCATCACACCAGTATTTGATAATGGAAAATTGATATTTTTTGTCGCAAGTAGAGGGCACCACGCAGAGATCGGGGGCATTACTCCTGGAAGCATGCCACCATTTTCAAAATCCATCTGGGAAGAAGGAGCTGCAATAAAAGCATTCAAGCTTGTTGAAAAGGGAATTTTTCAAGAAGAAGGAATTATCAAGCTCCTGCAGTTCCCTAGTTCTGATGAAGGTGTCATCCCGGGAACTCGAAGACTCCAAGATAATTTATCTGATCTTCATGCACAAGTTGCGGCAAATCACAGAGGAATTTCACTAATCAAAGAGCTTATTGCCCAATATGCAGGAACTGGGAAGGGAAGCTCCATAACAATCGAAGAAGAGGATTACATGGATGATGGTTCTGTAATTCATTTGAAACTCACAATTGATCCTCATAAAGGCGAAGCCAATTTTGATTTTAGTGGAACGAGTCCAGAAGTATATGGCAATTGGAACGCTCCAGAAGCAGTAACAGTGGCAGCAGTTATATACTGCCTTCGCTGCATGGTCGATGTCGATATTCCTCTCAACCAAGGCTGCTTGGCTCCTGTCAAGATATATATTCCACCAGGCTCATTCCTTTCCCCGAGCGAGAAGGCTGCGATTGTAGGGGGAAACGTTCTCACTTCTCAGCGAATAACCGATGTCATACTCACAGCATTTCAGGCATGTGCCTGTTCCCAAGGTTGTATGAACAACCTCACTTTTGGTGACGATACGTTTGGATACTACGAAACAATTGGTGGCGGAAGTGGAGCTGGTCCTTCTTGGCATGGAACAAGTGGAGTTCAATGCCATATGACCAATACTCGAATGACTGATCCCGAGATATTCGAGCAGCGATACCCTGTTCTTCTACATACATTTGCACTCAGAGAGAACAGTGGAGGAAGTGGAATCTACAAAGGAGGTGATGGGCTCGTTAGGGAGATCGAGTTCAAGCAGCCAGTGGTGGTGAGTATTCTTTCGGAGAGACGCGTCCATGCGCCGAGAGGTTTGAAAGGAGGAAAAAACGGTGCTCGTGGGGCTAATTTTCTGGTGAGAAAGGATGGAAGAAGAGTTTATCTTGGAGGCAAGAATACTGTAACAGTGAAAGCTGGGGAGATTCTTCAAATTTTAACTCCTGGTGGTGGTGGATGGGGTTGTCCTTAA

Protein sequence

MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEYSGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDIFDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALKPLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGLTASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGGVVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVAAGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPNEDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLTEMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQEPYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKSQQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDPKIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNIKIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCALFGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVITPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGIIKLLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQYAGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGEANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQPVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGGGGWGCP
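
The protein sequence above is the conceptual translation of the coding sequence (CDS). As a minimal sketch of that relationship, the Python snippet below translates the first codons of the CDS and its final codons. It assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database record.

```python
# Minimal sketch: translate a CDS with the standard genetic code
# (assumption: NCBI translation table 1 applies to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons enumerated in TCAG order line up with the amino-acid string above.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[pos:pos + 3].upper()
        residue = CODON_TABLE.get(codon, "X")  # 'X' for codons containing N
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 27 nt of the CDS listed above, and its last 12 nt.
cds_start = "ATGGGAAGTAATAATGAAGACAAACTC"
cds_end = "GGTTGTCCTTAA"

print(translate(cds_start))  # MGSNNEDKL
print(translate(cds_end))    # GCP
```

The two fragments reproduce the start (`MGSNNEDKL`) and the end (`GCP`) of the protein sequence listed above; the terminal `TAA` is the stop codon and is not represented in the protein.
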
BLAST of Cla021389 vs. Swiss-Prot
Match: OPLA_ARATH (5-oxoprolinase OS=Arabidopsis thaliana GN=OXP1 PE=2 SV=1)

HSP 1 Score: 1905.2 bits (4934), Expect = 0.0e+00
Identity = 960/1274 (75.35%), Positives = 1070/1274 (83.99%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MG+  E KLRFCIDRGGTFTDVYAEIPG  DG V+KLLSVDPSNY+DAPVEGIRRILEEY
Sbjct: 1    MGTVIEGKLRFCIDRGGTFTDVYAEIPGHSDGHVLKLLSVDPSNYDDAPVEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +GKKIPRTSKIPT  I+WIRMGTTVATNALLERKGERIALCVTKGF+DLLQIGNQARPDI
Sbjct: 61   TGKKIPRTSKIPTDKIQWIRMGTTVATNALLERKGERIALCVTKGFKDLLQIGNQARPDI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTV+KPSNLYE+VIEVDERV L     D ++    S +KGVSGE +R+V+  + E LK
Sbjct: 121  FDLTVAKPSNLYEEVIEVDERVVLALEDDDDDEG---SLIKGVSGEFLRVVKPFDGEGLK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLLK LL +GISCLAVVLMHSYTYP+HE+ +EKLAL MGF+HVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLKGLLDKGISCLAVVLMHSYTYPKHEMDVEKLALEMGFRHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TA+VDAYLTPVIKEYLSGF+SKFD+  GKVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TATVDAYLTPVIKEYLSGFISKFDDDLGKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRY GSYEQV+ETQIAG IIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFGLETEKPLIGFDMGGTSTDVSRYDGSYEQVIETQIAGTIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGP+SVGAHPGPVCYRKGGELAVTDANLVLGFVIPD+FPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPDSVGAHPGPVCYRKGGELAVTDANLVLGFVIPDYFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLD+ ATR  FEKLA +IN YRK QDPS+K M++EEIA+GFV+VANETMCRPIRQLT
Sbjct: 421  EDQPLDVAATREAFEKLAGQINIYRKSQDPSAKDMSVEEIAMGFVSVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHETKNHALACFGGAGPQHACAIAR LGMKE+ +HR+CGILSAYGMGLADV+E+ QE
Sbjct: 481  EMKGHETKNHALACFGGAGPQHACAIARSLGMKEVLVHRYCGILSAYGMGLADVIEDAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVY  ES+ EV RRE  LL++V  KLQ QGF +G+I TETYLNLRY+GTDTAIMVK 
Sbjct: 541  PYSAVYGPESLSEVFRRETVLLREVREKLQEQGFGDGNISTETYLNLRYDGTDTAIMVKG 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            ++  +G  FD+AAEF KLF+QEYGFKLQNRN+LICD+RVRGIGVT++LKPRA E     P
Sbjct: 601  KKTGDGSAFDYAAEFLKLFEQEYGFKLQNRNLLICDVRVRGIGVTSILKPRAVEAAPVTP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            K+E HY+VYF  GW DTPLFKL+NLGFGH I GPAIIMNGNSTVIVEP CKA ITKYGNI
Sbjct: 661  KVERHYKVYFEGGWHDTPLFKLENLGFGHEILGPAIIMNGNSTVIVEPQCKAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIE++   S+ K++E VADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEVEPATSSVKLAENVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            F PDGGLVANAPHVPVHLGAMSSTVRWQ+++WG+NLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FSPDGGLVANAPHVPVHLGAMSSTVRWQLKHWGENLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASR------------------------------------GHHAEIGG 900
            TPVFD GKL+FFVASR                                    G   E G 
Sbjct: 841  TPVFDKGKLVFFVASRGHHAEVGGITPGSMPPFSKAIWEEGAAIKAFKVVEKGVFQEEGI 900

Query: 901  ITPGSMP---------PFSKSIWEEGAAIKAFKLVEKGIFQEEGIIKLLQFPSSDEGVIP 960
            +     P         P ++ I +  + ++A       I   +  I L++      G+  
Sbjct: 901  VKLLQFPSSDETTTKIPGTRRIQDNLSDLQA------QIAANQRGISLIKELIEQYGL-- 960

Query: 961  GTRRLQDNLSDLHAQVAANHRGISLIKELIAQYAGTGKGSSITIEEEDYMDDGSVIHLKL 1020
            GT +       L+A+ A      S+   + ++   +  G+S+TIEEEDYMDDGS+IHLKL
Sbjct: 961  GTVQAYMKYVQLNAEEAVREMLKSVANRVSSETPNSRVGNSVTIEEEDYMDDGSIIHLKL 1020

Query: 1021 TIDPHKGEANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIY 1080
            TID  KGEA+FDF+GTSPEVYGNWNAPEAVT AAVIYCLRC+V+VDIPLNQGCLAPV+I 
Sbjct: 1021 TIDADKGEASFDFTGTSPEVYGNWNAPEAVTSAAVIYCLRCLVNVDIPLNQGCLAPVEIR 1080

Query: 1081 IPPGSFLSPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETI 1140
            IP GSFLSPSEKAA+VGGNVLTSQR+TDV+LTAFQACACSQGCMNNLTFGDDTFGYYETI
Sbjct: 1081 IPAGSFLSPSEKAAVVGGNVLTSQRVTDVVLTAFQACACSQGCMNNLTFGDDTFGYYETI 1140

Query: 1141 GGGSGAGPSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGL 1200
            GGG GAGP+W+GTSGVQCHMTNTRMTDPEIFEQRYPVLLH F LRENSGG+G++KGGDGL
Sbjct: 1141 GGGCGAGPTWNGTSGVQCHMTNTRMTDPEIFEQRYPVLLHRFGLRENSGGNGLHKGGDGL 1200

Query: 1201 VREIEFKQPVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAG 1230
            VREIEF++PVVVSILSERRVH+PRGL GG+NG RGAN+L+ KD RR+YLGGKNTV V+AG
Sbjct: 1201 VREIEFRKPVVVSILSERRVHSPRGLNGGQNGLRGANYLITKDKRRIYLGGKNTVHVEAG 1260
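
The identity and positives percentages in the HSP header of this alignment are simple ratios of matching (or conservatively substituted) positions to the alignment length. The Python sketch below reproduces them from the counts reported above; the function name `percent` is illustrative, and the two-decimal formatting is an assumption chosen to match the displayed values.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Return matches / aligned_length as a percentage with two decimals."""
    return f"{100.0 * matches / aligned_length:.2f}"


# Counts taken from the OPLA_ARATH HSP header above.
print(percent(960, 1274))   # identity: 75.35
print(percent(1070, 1274))  # positives: 83.99
```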

BLAST of Cla021389 vs. Swiss-Prot
Match: OPLA_MOUSE (5-oxoprolinase OS=Mus musculus GN=Oplah PE=1 SV=1)

HSP 1 Score: 1382.9 bits (3578), Expect = 0.0e+00
Identity = 716/1266 (56.56%), Positives = 895/1266 (70.70%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS  E++  F IDRGGTFTDV+A+ PG    +V+KLLS DP+NY DAP EGIRRILE+ 
Sbjct: 1    MGSP-EERFHFAIDRGGTFTDVFAQCPGGHV-RVLKLLSEDPANYADAPTEGIRRILEQE 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
             G  +PR   + T +I  IRMGTTVATNALLER+GER+AL VT+GFRDLL IG QARPD+
Sbjct: 61   RGVLLPRGRPLDTSHIASIRMGTTVATNALLERQGERVALLVTRGFRDLLHIGTQARPDL 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDL V  P  LYE+V+EVDERV L RG+         S VKG +G+L+ I + ++  AL+
Sbjct: 121  FDLAVPMPEVLYEEVVEVDERVLLYRGEPGAG-----SPVKGCTGDLLEIQQPVDLAALR 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
              L+ LL RGI  LAVVLMHSYT+ QHE  +  LA  +GF HVSLSS + PMVR VPRG 
Sbjct: 181  GKLEGLLTRGIHSLAVVLMHSYTWAQHEQQVGTLARELGFTHVSLSSEVMPMVRIVPRGH 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TA  DAYLTP I+ Y+ GF   F      V VLFM+SDGGLAP + FSG +AVLSGPAGG
Sbjct: 241  TACADAYLTPTIQRYVQGFRRGFQGQLKNVQVLFMRSDGGLAPMDAFSGSRAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYS T ++LE  +P+IGFDMGGTSTD SRYAG +E V E   AG  +QAPQLDINTVA
Sbjct: 301  VVGYSTTTYQLEGGQPVIGFDMGGTSTDVSRYAGEFEHVFEASTAGVTLQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGS+L F+ G F VGPES GAHPGP CYRKGG + VTDANLVLG ++P  FP IFGP 
Sbjct: 361  AGGGSRLFFRSGLFVVGPESAGAHPGPACYRKGGPVTVTDANLVLGRLLPASFPCIFGPG 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPL  EA+R   E +A E+NS+       +  +++EE+A+GFV VANE MCRPIR LT
Sbjct: 421  EDQPLSPEASRKALEAVAMEVNSFLASGPCPASQLSLEEVAMGFVRVANEAMCRPIRALT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            + +GH+   H LACFGGAG QHACAIAR LGM  + IHR  G+LSA G+ LADVV E QE
Sbjct: 481  QARGHDPSAHVLACFGGAGGQHACAIARALGMDTVHIHRHSGLLSALGLALADVVHEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            P S  Y  E+  ++ +R + L +Q    LQ QGF    I TE++L+LRY+GTD A+MV +
Sbjct: 541  PCSLSYTPETFAQLDQRLSRLEEQCVDALQAQGFSRSQISTESFLHLRYQGTDCALMVSA 600

Query: 601  QQ----VDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGL 660
             Q      +    DF A F + + +E+GF +  R+V++ D+RVRG G + +      +  
Sbjct: 601  NQHPATTCSPRAGDFGAAFVERYMREFGFIIPERSVVVDDVRVRGTGRSGLQLEETSKIQ 660

Query: 661  AGDPKIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITK 720
            +G P +E   + YF  G+Q+TP++ L  LG+GH + GP +I++ NST++VEP C+A + +
Sbjct: 661  SGPPHVEKVTQCYFEGGYQETPVYLLGELGYGHQLQGPCLIIDNNSTILVEPGCQAEVIE 720

Query: 721  YGNIKIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDF 780
             G+I+I + +      + +   D +QLSIF+HRFM IAEQMGR LQRT+ISTNIKERLDF
Sbjct: 721  TGDIRISVGA--EAPSMIDTKLDPIQLSIFSHRFMSIAEQMGRILQRTAISTNIKERLDF 780

Query: 781  SCALFGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPD 840
            SCALFGPDGGLV+NAPH+PVHLGAM  TV++QI++ G +L+ GDVL++NHP AGGSHLPD
Sbjct: 781  SCALFGPDGGLVSNAPHIPVHLGAMQETVQFQIQHLGADLHPGDVLLSNHPSAGGSHLPD 840

Query: 841  ITVITPVFDNGKL--IFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGI 900
            +TVITPVF  G+   +F+VASRGHHA+IGGITPGSMPP S ++ +EGA   +FKLV+ G+
Sbjct: 841  LTVITPVFWPGQSRPVFYVASRGHHADIGGITPGSMPPHSTTLQQEGAVFLSFKLVQGGV 900

Query: 901  FQEEGIIKLLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQYA----- 960
            FQEE + + LQ P    G   GTR L DNLSDL AQVAAN +GI L+ ELI QY      
Sbjct: 901  FQEEAVTEALQAPGKISGC-SGTRNLHDNLSDLRAQVAANQKGIQLVGELIGQYGLDVVQ 960

Query: 961  ---------------------GTGK---GSSITIEEEDYMDDGSVIHLKLTIDPHKGEAN 1020
                                 GT +   G  + +  +D+MDDGS I L + I+ ++G A 
Sbjct: 961  AYMGHIQANAELAVRDMLRAFGTSRQARGLPLEVSAKDHMDDGSPICLHVQINLNQGSAV 1020

Query: 1021 FDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPS 1080
            FDFSG+  EV+GN NAP A+T++A+IYCLRC+V  DIPLNQGCLAPV++ IP GS L PS
Sbjct: 1021 FDFSGSGSEVFGNLNAPRAITLSALIYCLRCLVGRDIPLNQGCLAPVQVIIPKGSILDPS 1080

Query: 1081 EKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSW 1140
             +AA+VGGNVLTSQR+ DVIL AF ACA SQGCMNN+T G+   GYYET+ GG+GAGP W
Sbjct: 1081 PEAAVVGGNVLTSQRVVDVILGAFGACAASQGCMNNVTLGNARMGYYETVAGGAGAGPGW 1140

Query: 1141 HGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQPV 1200
            HG SGV  HMTNTR+TDPEI E RYPV+L  F LR  SGG G ++GGDG+VRE+ F++  
Sbjct: 1141 HGRSGVHSHMTNTRITDPEILESRYPVILRRFELRPGSGGRGRFRGGDGVVRELVFREEA 1200

Query: 1201 VVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGG 1232
            ++S+L+ERR   P GL GG+ G RG N L+RKDGR V LGGK +VTV  G+   + TPGG
Sbjct: 1201 LLSVLTERRAFQPYGLHGGEPGTRGLNLLIRKDGRTVNLGGKTSVTVYPGDAFCLHTPGG 1256

BLAST of Cla021389 vs. Swiss-Prot
Match: OPLA_HUMAN (5-oxoprolinase OS=Homo sapiens GN=OPLAH PE=1 SV=3)

HSP 1 Score: 1382.1 bits (3576), Expect = 0.0e+00
Identity = 721/1270 (56.77%), Positives = 896/1270 (70.55%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS  E +  F IDRGGTFTDV+A+ PG    +V+KLLS DP+NY DAP EGIRRILE+ 
Sbjct: 1    MGSP-EGRFHFAIDRGGTFTDVFAQCPGGHV-RVLKLLSEDPANYADAPTEGIRRILEQE 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +G  +PR   + + +I  IRMGTTVATNALLERKGER+AL VT+GFRDLL IG QAR D+
Sbjct: 61   AGMLLPRDQPLDSSHIASIRMGTTVATNALLERKGERVALLVTRGFRDLLHIGTQARGDL 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGK-GDGNQDFSTSYVKGVSGELIRIVRTLNEEAL 180
            FDL V  P  LYE+V+EVDERV L RG+ G G        VKG +G+L+ + + ++  AL
Sbjct: 121  FDLAVPMPEVLYEEVLEVDERVVLHRGEAGTGTP------VKGRTGDLLEVQQPVDLGAL 180

Query: 181  KPLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRG 240
            +  L+ LL RGI  LAVVLMHSYT+ QHE  +  LA  +GF HVSLSS   PMVR VPRG
Sbjct: 181  RGKLEGLLSRGIRSLAVVLMHSYTWAQHEQQVGVLARELGFTHVSLSSEAMPMVRIVPRG 240

Query: 241  LTASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAG 300
             TA  DAYLTP I+ Y+ GF   F      V VLFM+SDGGLAP + FSG  AVLSGPAG
Sbjct: 241  HTACADAYLTPAIQRYVQGFCRGFQGQLKDVQVLFMRSDGGLAPMDTFSGSSAVLSGPAG 300

Query: 301  GVVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTV 360
            GVVGYS T ++ E  +P+IGFDMGGTSTD SRYAG +E V E   AG  +QAPQLDINTV
Sbjct: 301  GVVGYSATTYQQEGGQPVIGFDMGGTSTDVSRYAGEFEHVFEASTAGVTLQAPQLDINTV 360

Query: 361  AAGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGP 420
            AAGGGS+L F+ G F VGPES GAHPGP CYRKGG + VTDANLVLG ++P  FP IFGP
Sbjct: 361  AAGGGSRLFFRSGLFVVGPESAGAHPGPACYRKGGPVTVTDANLVLGRLLPASFPCIFGP 420

Query: 421  NEDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQL 480
             E+QPL  EA+R   E +ATE+NS+       + P+++EE+A+GFV VANE MCRPIR L
Sbjct: 421  GENQPLSPEASRKALEAVATEVNSFLTNGPCPASPLSLEEVAMGFVRVANEAMCRPIRAL 480

Query: 481  TEMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQ 540
            T+ +GH+   H LACFGGAG QHACAIAR LGM  + IHR  G+LSA G+ LADVV E Q
Sbjct: 481  TQARGHDPSAHVLACFGGAGGQHACAIARALGMDTVHIHRHSGLLSALGLALADVVHEAQ 540

Query: 541  EPYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVK 600
            EP S +Y  E+  ++ +R + L +Q    LQ QGF    I TE++L+LRY+GTD A+MV 
Sbjct: 541  EPCSLLYAPETFVQLDQRLSRLEEQCVDALQAQGFPRSQISTESFLHLRYQGTDCALMVS 600

Query: 601  SQQVDNGVEF----DFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEG 660
            + Q           DF A F + + +E+GF +  R V++ D+RVRG G + +    A + 
Sbjct: 601  AHQHPATARSPRAGDFGAAFVERYMREFGFVIPERPVVVDDVRVRGTGRSGLRLEDAPKA 660

Query: 661  LAGDPKIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATIT 720
              G P+++   + YF  G+Q+TP++ L  LG+GH + GP +I++ NST++VEP C+A +T
Sbjct: 661  QTGPPRVDKMTQCYFEGGYQETPVYLLAELGYGHKLHGPCLIIDSNSTILVEPGCQAEVT 720

Query: 721  KYGNIKIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLD 780
            K G+I I + +        +   D +QLSIF+HRFM IAEQMGR LQRT+ISTNIKERLD
Sbjct: 721  KTGDICISVGAEVPGTVGPQ--LDPIQLSIFSHRFMSIAEQMGRILQRTAISTNIKERLD 780

Query: 781  FSCALFGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLP 840
            FSCALFGPDGGLV+NAPH+PVHLGAM  TV++QI++ G +L+ GDVL++NHP AGGSHLP
Sbjct: 781  FSCALFGPDGGLVSNAPHIPVHLGAMQETVQFQIQHLGADLHPGDVLLSNHPSAGGSHLP 840

Query: 841  DITVITPVFDNGKL--IFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKG 900
            D+TVITPVF  G+   +F+VASRGHHA+IGGITPGSMPP S  + +EGA   +FKLV+ G
Sbjct: 841  DLTVITPVFWPGQTRPVFYVASRGHHADIGGITPGSMPPHSTMLQQEGAVFLSFKLVQGG 900

Query: 901  IFQEEGIIKLLQFPSSDEGVIP---GTRRLQDNLSDLHAQVAANHRGISLIKELIAQYA- 960
            +FQEE + + L+ P    G +P   GTR L DNLSDL AQVAAN +GI L+ ELI QY  
Sbjct: 901  VFQEEAVTEALRAP----GKVPNCSGTRNLHDNLSDLRAQVAANQKGIQLVGELIGQYGL 960

Query: 961  -------------------------GTGK---GSSITIEEEDYMDDGSVIHLKLTIDPHK 1020
                                     GT +   G  + +  ED+MDDGS I L++ I   +
Sbjct: 961  DVVQAYMGHIQANAELAVRDMLRAFGTSRQARGLPLEVSSEDHMDDGSPIRLRVQISLSQ 1020

Query: 1021 GEANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSF 1080
            G A FDFSGT PEV+GN NAP AVT++A+IYCLRC+V  DIPLNQGCLAPV++ IP GS 
Sbjct: 1021 GSAVFDFSGTGPEVFGNLNAPRAVTLSALIYCLRCLVGRDIPLNQGCLAPVRVVIPRGSI 1080

Query: 1081 LSPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGA 1140
            L PS +AA+VGGNVLTSQR+ DVIL AF ACA SQGCMNN+T G+   GYYET+ GG+GA
Sbjct: 1081 LDPSPEAAVVGGNVLTSQRVVDVILGAFGACAASQGCMNNVTLGNAHMGYYETVAGGAGA 1140

Query: 1141 GPSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEF 1200
            GPSWHG SGV  HMTNTR+TDPEI E RYPV+L  F LR  SGG G ++GGDG+ RE+ F
Sbjct: 1141 GPSWHGRSGVHSHMTNTRITDPEILESRYPVILRRFELRRGSGGRGRFRGGDGVTRELLF 1200

Query: 1201 KQPVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQIL 1232
            ++  ++S+L+ERR   P GL GG+ GARG N L+RK+GR V LGGK +VTV  G++  + 
Sbjct: 1201 REEALLSVLTERRAFRPYGLHGGEPGARGLNLLIRKNGRTVNLGGKTSVTVYPGDVFCLH 1256

BLAST of Cla021389 vs. Swiss-Prot
Match: OPLA_RAT (5-oxoprolinase OS=Rattus norvegicus GN=Oplah PE=1 SV=2)

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 717/1266 (56.64%), Positives = 894/1266 (70.62%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS  E +  F IDRGGTFTDV+A+ PG    +V+KLLS DP+NY DAP EGIRRILE+ 
Sbjct: 1    MGSP-EGRFHFAIDRGGTFTDVFAQCPGGHV-RVLKLLSEDPANYADAPTEGIRRILEQE 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
             G  +PR   + T  I  IRMGTTVATNALLER+GER+AL VT+GFRDLL IG QARPD+
Sbjct: 61   EGVLLPRGRPLDTSRIASIRMGTTVATNALLERQGERVALLVTRGFRDLLHIGTQARPDL 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDL V  P  LYE+V+EVDERV L RG+         S VKG +G+L+ I + ++ EAL+
Sbjct: 121  FDLAVPMPEVLYEEVLEVDERVVLYRGEPGAG-----SPVKGRTGDLLEIQQPVDLEALR 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
              L+ LL RGI  LAVVLMHSYT+ QHE  +  LA  +GF HVSLSS + PMVR VPRG 
Sbjct: 181  GKLEGLLSRGIHSLAVVLMHSYTWAQHEQQVGTLARELGFTHVSLSSEVMPMVRIVPRGH 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TA  DAYLTP I+ Y+ GF   F      V VLFM+SDGGLAP + FSG +AVLSGPAGG
Sbjct: 241  TACADAYLTPTIQRYVQGFRRGFQGQLKNVQVLFMRSDGGLAPMDAFSGSRAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYS T + LE  +P+IGFDMGGTSTD SRYAG +E V E   AG  +QAPQLDINTVA
Sbjct: 301  VVGYSATTYHLEGGQPVIGFDMGGTSTDVSRYAGEFEHVFEASTAGVTLQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGS+L F+ G F VGPES GAHPGP CYRKGG + VTDANLVLG ++P  FP IFGP 
Sbjct: 361  AGGGSRLFFRSGLFVVGPESAGAHPGPACYRKGGPVTVTDANLVLGRLLPASFPCIFGPG 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPL  EA+R   E +A E+NS+       +  +++EE+A+GFV VANE MCRPIR LT
Sbjct: 421  EDQPLSPEASRKALEAVAMEVNSFLTNGPCPASQLSLEEVAMGFVRVANEAMCRPIRALT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            + +GH+   H LACFGGAG QHACAIAR LGM  + IHR  G+LSA G+ LADVV E QE
Sbjct: 481  QARGHDPSAHVLACFGGAGGQHACAIARALGMDTVHIHRHSGLLSALGLALADVVHEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            P S  Y  E+  ++ +R + L +Q    LQ QGF    I TE++L+LRY+GTD A+MV +
Sbjct: 541  PCSLSYTPETFAQLDQRLSRLEEQCVDALQVQGFPRSQISTESFLHLRYQGTDCALMVSA 600

Query: 601  QQ----VDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGL 660
             Q      +    DF A F + + +E+GF +  R V++ D+RVRG G + +      +  
Sbjct: 601  HQHPATACSPRAGDFGAAFVERYMREFGFIIPERPVVVDDVRVRGTGRSGLQLEDTPKIQ 660

Query: 661  AGDPKIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITK 720
             G P +E   + YF  G+Q+TP++ L  LG+GH + GP +I++ NST++VEP C+A +T 
Sbjct: 661  TGPPHVEKVTQCYFEGGYQETPVYLLGELGYGHQLQGPCLIIDNNSTILVEPGCQAEVTD 720

Query: 721  YGNIKIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDF 780
             G+I+I + +      +++   D +QLSIF+HRFM IAEQMGR LQRT+ISTNIKERLDF
Sbjct: 721  TGDIRISVGA--EGPSMADTRLDPIQLSIFSHRFMSIAEQMGRILQRTAISTNIKERLDF 780

Query: 781  SCALFGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPD 840
            SCALFGPDGGLV+NAPH+PVHLGAM  TV++QI++ G +L+ GDVL++NHP AGGSHLPD
Sbjct: 781  SCALFGPDGGLVSNAPHIPVHLGAMQETVQFQIQHLGADLHPGDVLLSNHPSAGGSHLPD 840

Query: 841  ITVITPVFDNGKL--IFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGI 900
            +TVITPVF  G+   +F+VASRGHHA+IGGITPGSMPP S ++ +EGA   +FKLV+ G+
Sbjct: 841  LTVITPVFWPGQTRPVFYVASRGHHADIGGITPGSMPPHSTTLQQEGAVFLSFKLVQGGV 900

Query: 901  FQEEGIIKLLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQYA----- 960
            FQEE + + L+ P    G   GTR L DNLSDL AQVAAN +GI L+ ELI QY      
Sbjct: 901  FQEEAVTEALRAPGKISGC-SGTRNLHDNLSDLRAQVAANQKGIQLVGELIGQYGLDVVQ 960

Query: 961  ---------------------GTGK---GSSITIEEEDYMDDGSVIHLKLTIDPHKGEAN 1020
                                 GT +   G  + +  ED+MDDGS I L++ I+  +G A 
Sbjct: 961  AYMGHIQANAELAVRDMLRAFGTSRQARGLPLEVSAEDHMDDGSPICLRVQINLSQGSAV 1020

Query: 1021 FDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPS 1080
            FDF+G+  EV+GN NAP A+T++A+IYCLRC+V  DIPLNQGCLAPV++ IP GS L PS
Sbjct: 1021 FDFTGSGSEVFGNLNAPRAITLSALIYCLRCLVGRDIPLNQGCLAPVRVIIPKGSILDPS 1080

Query: 1081 EKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSW 1140
             +AA+VGGNVLTSQR+ DVIL AF AC+ SQGCMNN+T G+   GYYET+ GG+GAGP W
Sbjct: 1081 PEAAVVGGNVLTSQRVVDVILGAFGACSASQGCMNNVTLGNARMGYYETVAGGAGAGPGW 1140

Query: 1141 HGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQPV 1200
            HG SGV  HMTNTR+TDPEI E RYPV+L  F LR  SGG G ++GGDG+VRE+ F++  
Sbjct: 1141 HGRSGVHSHMTNTRITDPEILESRYPVILRRFELRPGSGGRGRFRGGDGVVRELVFREEA 1200

Query: 1201 VVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGG 1232
            ++S+L+ERR   P GL GG+ GARG N L+RKDGR V LGGK +VTV  G++  + TPGG
Sbjct: 1201 LLSVLTERRAFQPYGLHGGEPGARGLNLLIRKDGRTVNLGGKTSVTVYPGDVFCLHTPGG 1256

BLAST of Cla021389 vs. Swiss-Prot
Match: OPLA_BOVIN (5-oxoprolinase OS=Bos taurus GN=OPLAH PE=1 SV=1)

HSP 1 Score: 1369.0 bits (3542), Expect = 0.0e+00
Identity = 706/1262 (55.94%), Positives = 881/1262 (69.81%), Query Frame = 1

Query: 6    EDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEYSGKKI 65
            E +  F IDRGGTFTDV+A+ PG    +V+KLLS DP+NY DAP EGIRRILE+  G  +
Sbjct: 5    EGRFHFAIDRGGTFTDVFAQCPGGHV-RVLKLLSEDPANYVDAPTEGIRRILEQEGGVLL 64

Query: 66   PRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDIFDLTV 125
            PR   + T  I  IRMGTTVATNALLE++GER+AL VT+GFRDLL +  QAR  +FDL V
Sbjct: 65   PRDRPLDTSRIASIRMGTTVATNALLEQQGERVALLVTRGFRDLLHVCTQARAXLFDLAV 124

Query: 126  SKPSNLYEDVIEVDERVELIRGK-GDGNQDFSTSYVKGVSGELIRIVRTLNEEALKPLLK 185
              P  LYE+V+EVDERV L RG  G G        VKG +G+L+ + + ++   L+  L+
Sbjct: 125  PMPETLYEEVLEVDERVVLYRGXPGAGTP------VKGCTGDLLEVQQPVDLGGLRWKLE 184

Query: 186  DLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGLTASV 245
             LL RGI  LAVVLMHSYT+ QHE  +  LA  +GF HVSLSS   PMVR VPRG TA  
Sbjct: 185  GLLSRGIRSLAVVLMHSYTWAQHEQQVGALARELGFTHVSLSSEAMPMVRIVPRGHTACA 244

Query: 246  DAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGGVVGY 305
            DAYLTP I+ Y+ GF   F      V VLFM+SDGGLAP + FSG +AVLSGPAGGVVGY
Sbjct: 245  DAYLTPTIQRYVQGFRRGFQGQLKDVQVLFMRSDGGLAPMDSFSGSRAVLSGPAGGVVGY 304

Query: 306  SQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVAAGGG 365
            S T + +E  +P+IGFDMGGTSTD SRYAG +E V E   AG  +QAPQLDINTVAAGGG
Sbjct: 305  SATTYRVEGGQPVIGFDMGGTSTDVSRYAGEFEHVFEASTAGVTLQAPQLDINTVAAGGG 364

Query: 366  SKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPNEDQP 425
            S+L F+ G F VGPES GAHPGP CYRKGG + VTDANLVLG ++P  FP IFGP EDQP
Sbjct: 365  SRLFFRSGLFVVGPESAGAHPGPACYRKGGPVTVTDANLVLGRLLPASFPCIFGPGEDQP 424

Query: 426  LDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLTEMKG 485
            L  EA+R   E +ATE+NS+       + P+++EE+A+GFV VANE MCRPIR LT+ +G
Sbjct: 425  LSPEASRKALEAVATEVNSFLTNGPCPASPLSLEEVAMGFVRVANEAMCRPIRALTQARG 484

Query: 486  HETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQEPYSA 545
            H+   H LACFGGAG QHACAIAR LGM  + IHR  G+LSA G+ LADVV E QEP S 
Sbjct: 485  HDPSAHVLACFGGAGGQHACAIARALGMDTVHIHRHSGLLSALGLALADVVHEAQEPCSL 544

Query: 546  VYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKSQQVD 605
             Y  E+  ++ +R   L +Q    L+ QGF    I TE++L+LRY+GTD A+MV + Q  
Sbjct: 545  PYAPETFAQLDQRLGRLEEQCVEALRAQGFPRSQISTESFLHLRYQGTDCALMVSAHQHP 604

Query: 606  NGVEF----DFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 665
                     DF A F + + +E+GF +  R V++ D+RVRG G +++      +  +G P
Sbjct: 605  ASARSPRAGDFGAAFVERYMREFGFIIPERPVVVDDVRVRGTGSSSLRLEDVPKAHSGPP 664

Query: 666  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 725
            +++   + YF  G+Q+TP++ L  LG GH + GP +I++ NST++VEP C+A +T+ G+I
Sbjct: 665  RVDKMTQCYFEGGYQETPVYLLGELGCGHKLQGPCLIIDSNSTILVEPGCQAEVTETGDI 724

Query: 726  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 785
            +I + +   T  V     D + L+IF+HRFM IAEQMGR LQRT+ISTNIKERLDFSCAL
Sbjct: 725  RISVGA--ETASVVGTQLDPIHLTIFSHRFMSIAEQMGRILQRTAISTNIKERLDFSCAL 784

Query: 786  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 845
            FGPDGGLV+N PH+PVHLGAM  TV++QI+  G +L+ GDVL++NHP AGGSHLPD+TVI
Sbjct: 785  FGPDGGLVSNVPHIPVHLGAMQETVQFQIQQLGADLHPGDVLLSNHPSAGGSHLPDLTVI 844

Query: 846  TPVFDNGKL--IFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEE 905
            TPVF  G+   +F+VASRGHHA+IGGITPGSMPP S S+ +EGA   +FKLV  G+FQEE
Sbjct: 845  TPVFWPGQTRPVFYVASRGHHADIGGITPGSMPPHSTSLQQEGAVFLSFKLVHGGVFQEE 904

Query: 906  GIIKLLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 965
             + + L+ P    G   GTR L DNLSDL AQVAAN +GI L+ ELI QY          
Sbjct: 905  AVTEALRAPGKIPGC-SGTRNLHDNLSDLRAQVAANQKGIQLVGELIGQYGLDVVQAYMG 964

Query: 966  -------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGEANFDFS 1025
                               A   +G  + +  ED+MDDGS I L++ I+  +G A FDFS
Sbjct: 965  HIQANAELAVRDMLRAFGTARQARGLPLEVSAEDHMDDGSPIRLRVQINMSQGSAVFDFS 1024

Query: 1026 GTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPSEKAA 1085
            G+ PEV+GN NAP A+T++A+IYCLRC+V  DIPLNQGCLAPV++ IP GS L PS  AA
Sbjct: 1025 GSGPEVFGNLNAPRAITLSALIYCLRCLVGRDIPLNQGCLAPVRVVIPKGSILDPSPDAA 1084

Query: 1086 IVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSWHGTS 1145
            +VGGNVLTSQR+ DVIL AF ACA SQGCMNN+T G+   GYYET+ GG+GAGP WHG S
Sbjct: 1085 VVGGNVLTSQRVVDVILGAFGACAASQGCMNNVTLGNAHMGYYETVAGGAGAGPGWHGRS 1144

Query: 1146 GVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQPVVVSI 1205
            GV  HMTNTR+TDPEI E RYPV+L  F LR  SGG G ++GGDG++RE+ F++  ++S+
Sbjct: 1145 GVHSHMTNTRITDPEILESRYPVILRRFELRLGSGGRGRFRGGDGIIRELLFREEALLSV 1204

Query: 1206 LSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGGGGWG 1232
            L+ERR   P GL GG+ GARG N L+RKDGR V LGGK +V V  G++  + TPGGGG+G
Sbjct: 1205 LTERRAFQPYGLMGGEPGARGLNLLIRKDGRTVNLGGKTSVPVYPGDVFCLHTPGGGGYG 1256

BLAST of Cla021389 vs. TrEMBL
Match: M5X2E6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000342mg PE=4 SV=1)

HSP 1 Score: 2111.6 bits (5470), Expect = 0.0e+00
Identity = 1049/1268 (82.73%), Positives = 1137/1268 (89.67%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS N++KLRFCIDRGGTFTDVYAEIPG+ DG+V+KLLSVDPSNY+DAPVEGIRRILEE+
Sbjct: 1    MGSANDNKLRFCIDRGGTFTDVYAEIPGQPDGQVLKLLSVDPSNYDDAPVEGIRRILEEF 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +GKKI R SKIPT  IEWIRMGTTVATNALLERKGERIALCVT+GFRDLLQIGNQARP I
Sbjct: 61   TGKKISRASKIPTDKIEWIRMGTTVATNALLERKGERIALCVTRGFRDLLQIGNQARPKI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYE+VIEVDERVEL     D +   S S VKGVSGE++++V+ ++ E LK
Sbjct: 121  FDLTVSKPSNLYEEVIEVDERVELANDNQDSS---SASLVKGVSGEMVKVVKPIDVETLK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLL+ LL++GISCLAVVLMHSYTYPQHE+A+E+LA S+GF+HVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLQGLLEKGISCLAVVLMHSYTYPQHEVAVERLAESLGFRHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGFMSKFDE   KVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVIKEYLSGFMSKFDEGVEKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAG+YEQVLETQIAGAIIQAPQLDI+TVA
Sbjct: 301  VVGYSQTLFGLETEKPLIGFDMGGTSTDVSRYAGTYEQVLETQIAGAIIQAPQLDISTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLG+VIPD+FPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGYVIPDYFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            ED+PLDI ATR EF+KLA++INSYRK QDPS+K MT+EEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDEPLDIRATRDEFDKLASQINSYRKSQDPSAKDMTVEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHET+NHALACFGGAGPQHACAIAR LGMKE+ IHRFCGILSAYGMGLADVVEE QE
Sbjct: 481  EMKGHETRNHALACFGGAGPQHACAIARSLGMKEVLIHRFCGILSAYGMGLADVVEEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVY  ESVQE S REA LL QV  KLQ QGFR+ ++ TETYLNLRYEGTDT+IMVK 
Sbjct: 541  PYSAVYSLESVQEASHREAILLSQVRQKLQEQGFRDENMTTETYLNLRYEGTDTSIMVKK 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            +  ++G   ++  +F +LFQQEYGFKL NRN+LICD+RVRG+GVTN+LKP A E  +  P
Sbjct: 601  RITEDGRGCNYNLDFVELFQQEYGFKLLNRNILICDVRVRGVGVTNILKPLALERTSCSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            K+EG+Y+VYFGNGWQ+TPL+KL+ LG+GHI+ GPAIIMNGNSTVIVEP+CKA ITKYGNI
Sbjct: 661  KVEGNYKVYFGNGWQETPLYKLEKLGYGHIMAGPAIIMNGNSTVIVEPNCKAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEIDST ST KV EKVA+VVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIDSTSSTMKVVEKVANVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQI YWGDNL+EGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQINYWGDNLSEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFDNGKL+FFVASRGHHAEIGGITPGSMPPFSKSIWEEGAA+KAFKLVEKGIFQEEGI
Sbjct: 841  TPVFDNGKLVFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAALKAFKLVEKGIFQEEGI 900

Query: 901  IKLLQFPSSDE--GVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 960
             KLL+FP SDE    IPGTRRLQDNLSDL AQVAAN RGI+LIKELI QY          
Sbjct: 901  TKLLRFPCSDELAQKIPGTRRLQDNLSDLRAQVAANKRGITLIKELIEQYGLDTVQAYMT 960

Query: 961  -------------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                     + +G  SS+TIEEEDYMDDGS+IHLKLTID   GE
Sbjct: 961  YVQLNAEEAVREMLKSVAARVLSQPSSSGDRSSVTIEEEDYMDDGSIIHLKLTIDSDNGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            ANFDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPVKIYIPPGSFLS
Sbjct: 1021 ANFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVKIYIPPGSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS+KAA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD+TFGYYETIGGGSGAGP
Sbjct: 1081 PSDKAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDETFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            +W GTSGVQCHMTNTRMTDPEIFEQRYPVLLH F LRENSGG G +KGGDGLVREIEFK+
Sbjct: 1141 TWDGTSGVQCHMTNTRMTDPEIFEQRYPVLLHKFGLRENSGGVGYHKGGDGLVREIEFKR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1232
            P+VVSILSERRVH PRGLKGGK+GARGANFL+ +D RRVYLGGKNTV V+ GEILQILTP
Sbjct: 1201 PIVVSILSERRVHTPRGLKGGKDGARGANFLITQDKRRVYLGGKNTVEVQPGEILQILTP 1260

BLAST of Cla021389 vs. TrEMBL
Match: B9SP24_RICCO (5-oxoprolinase, putative OS=Ricinus communis GN=RCOM_1248770 PE=4 SV=1)

HSP 1 Score: 2109.3 bits (5464), Expect = 0.0e+00
Identity = 1044/1266 (82.46%), Positives = 1141/1266 (90.13%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS  E+KLRFCIDRGGTFTDVYAE+PG  DG+V+KLLSVDPSNY+DAPVEGIRRILEEY
Sbjct: 1    MGSIKEEKLRFCIDRGGTFTDVYAEVPGNPDGRVLKLLSVDPSNYDDAPVEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +G+KIPR+SKIPT  IEWIRMGTTVATNALLERKGERIA+CVT+GF+DLLQIGNQARP+I
Sbjct: 61   TGEKIPRSSKIPTDKIEWIRMGTTVATNALLERKGERIAVCVTQGFKDLLQIGNQARPNI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYE+VIEVDERV+L+  K + +Q+ S S VKGVSGEL+RIV+ L+EEALK
Sbjct: 121  FDLTVSKPSNLYEEVIEVDERVQLVLDKEEVDQNSSASVVKGVSGELVRIVKPLDEEALK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLLK LL++GISCLAVVL+HSYT+PQHELA+E++A S+GF+HVSLSS L+PMVRAVPRGL
Sbjct: 181  PLLKGLLEKGISCLAVVLLHSYTFPQHELAVERVAASLGFRHVSLSSGLSPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGF+SKFDE  GKVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVIKEYLSGFISKFDEGLGKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAGSYEQVLETQIAGAIIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFGLETQKPLIGFDMGGTSTDVSRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANL+LGFVIPD+FPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLILGFVIPDYFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLDIEATR EF+KLA +INSYRK QDP +K MTIE+IALGFVNVANETMCRPIRQLT
Sbjct: 421  EDQPLDIEATREEFKKLAMQINSYRKSQDPLAKDMTIEDIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            E+KGHET+NHALACFGGAGPQHACAIAR LGMKE+ IH+FCGILSAYGMGLADVVEE QE
Sbjct: 481  ELKGHETRNHALACFGGAGPQHACAIARSLGMKEVLIHKFCGILSAYGMGLADVVEEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVY  ESV E S RE  LLKQV  KLQGQGFRE +I TETYLNLRYEGTDT+IMV+ 
Sbjct: 541  PYSAVYGHESVLEASSREDVLLKQVKQKLQGQGFREENITTETYLNLRYEGTDTSIMVRR 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
               ++G  +D+A EF KLFQ+EYGFKLQNRN+LICD+RVRGIGVTN+LKP+  +  +G P
Sbjct: 601  HVNEDGSRYDYAVEFVKLFQKEYGFKLQNRNILICDVRVRGIGVTNILKPQVLQPTSGSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            K+EG Y+VYFGNGW +TPLFKL+NLG G I+PGPAIIMNGNSTVIVEP+CKA +TKYGNI
Sbjct: 661  KVEGDYKVYFGNGWLNTPLFKLENLGPGDIMPGPAIIMNGNSTVIVEPNCKAFVTKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEI+S  +T +++EKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIESNVNTVQIAEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQ+ YWGDNLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQLNYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFD GKL+ FVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVE+G+FQEEGI
Sbjct: 841  TPVFDKGKLVVFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVERGVFQEEGI 900

Query: 901  IKLLQFPSSDEGV--IPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQYA--------- 960
            IKLL+FPSS+E    IPGTRRLQDNLSDLHAQVAAN RGISLIKELI QY          
Sbjct: 901  IKLLKFPSSNESAYKIPGTRRLQDNLSDLHAQVAANQRGISLIKELIEQYGLDTVQAYMT 960

Query: 961  --------------------GTGKGSSI------TIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                 + + S        TIEEEDYMDDGSVIHLKLTID  +GE
Sbjct: 961  YVQLNAEEAVREMLKSVAVRVSSESSRFAHNHSITIEEEDYMDDGSVIHLKLTIDSDRGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            A FDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPV I+IPP SFLS
Sbjct: 1021 AFFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVTIHIPPCSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS+KAA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD TFGYYETIGGGSGAGP
Sbjct: 1081 PSDKAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDHTFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            +W+GTSGVQCHMTNTRMTDPEIFEQRYPVLLH F LRENSGG G++KGGDGLVREIEF++
Sbjct: 1141 TWNGTSGVQCHMTNTRMTDPEIFEQRYPVLLHKFGLRENSGGDGLHKGGDGLVREIEFRR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1230
            PVVVSILSERRVHAPRG++GGK+GARGAN L+ KD R++YLGGKNTV V+AGEILQILTP
Sbjct: 1201 PVVVSILSERRVHAPRGIRGGKDGARGANHLITKDKRKIYLGGKNTVEVQAGEILQILTP 1260

BLAST of Cla021389 vs. TrEMBL
Match: A0A0D2RJ73_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G213100 PE=4 SV=1)

HSP 1 Score: 2105.9 bits (5455), Expect = 0.0e+00
Identity = 1040/1266 (82.15%), Positives = 1136/1266 (89.73%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS + +KLRFCIDRGGTFTDVYAEIPG  DG+V+KLLSVDPSNY+DAP+EGIRRILEEY
Sbjct: 1    MGSVSGEKLRFCIDRGGTFTDVYAEIPGHSDGRVLKLLSVDPSNYDDAPIEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +G+KIPRT KIPT  IEWIRMGTTVATNALLERKGERIALCVT+GF+DLLQIG+Q+RP I
Sbjct: 61   TGQKIPRTVKIPTDKIEWIRMGTTVATNALLERKGERIALCVTRGFKDLLQIGDQSRPHI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDL+ +KPSNLYE VIEVDERVEL+  +  GN + S S+VKGVSGEL+R+V+ L+EE+LK
Sbjct: 121  FDLSAAKPSNLYEQVIEVDERVELVLDEEKGNGEKSGSFVKGVSGELVRVVKCLDEESLK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLLK LL++GISCLAVVLMHSYTYP HE+A+EKLA+S+GF+HVS SSALTPMVRAVPRGL
Sbjct: 181  PLLKGLLEKGISCLAVVLMHSYTYPYHEMAVEKLAMSLGFRHVSSSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPV+KEYLSGF+S+FDE    VNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVVKEYLSGFISRFDEGLAMVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAGSYEQVLET+IAGAIIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFRLETEKPLIGFDMGGTSTDVSRYAGSYEQVLETKIAGAIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANL+LG+V+PD+FP+IFGP 
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLILGYVVPDYFPAIFGPK 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLD+EATR E++KLA +INSYRK QD S+K MT+EEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDQPLDVEATREEYKKLAEQINSYRKSQDSSAKDMTVEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHET+NHALACFGGAGPQHACAIAR LGM E+ IHRFCGILSAYGMGLADVVEE Q 
Sbjct: 481  EMKGHETRNHALACFGGAGPQHACAIARSLGMTEVLIHRFCGILSAYGMGLADVVEEAQL 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PY+AVY SESV E SRREA LL QV  KLQ QGFRE +IK ETYLNLRYEGTDTAIMVK 
Sbjct: 541  PYAAVYGSESVVEASRREAILLNQVKQKLQEQGFREENIKAETYLNLRYEGTDTAIMVKR 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
               ++G   D+A EFEKLFQQEYGFKLQNRN+L+CD+RVRGIGV N+LKP+  E  +G P
Sbjct: 601  CIAEDGSGSDYAEEFEKLFQQEYGFKLQNRNILVCDVRVRGIGVANILKPQTLEPASGSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            KIEGHY+V+FGNGW DTPLFKL+NLG+GH+IPGPAIIMNG+STVIVEP CKA ITKYGNI
Sbjct: 661  KIEGHYKVFFGNGWHDTPLFKLENLGYGHVIPGPAIIMNGSSTVIVEPKCKAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEI+S+ +T KV+EKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIESSVNTVKVAEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQ+EYWGD LNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQLEYWGDKLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFDNGKL+FFVASRGHHAEIGG+TPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI
Sbjct: 841  TPVFDNGKLVFFVASRGHHAEIGGVTPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900

Query: 901  IKLLQFPSSDEGV--IPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 960
            IKLL+FP + E    IPGTRRLQDNLSDL AQVAAN RGI+LIKELI QY          
Sbjct: 901  IKLLKFPDAVEHSQNIPGTRRLQDNLSDLRAQVAANQRGITLIKELIEQYGLETVQAYMT 960

Query: 961  -------------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                        G+ +SITIEEED MDDGSVIHLKLTID +KGE
Sbjct: 961  YVQLNAEEAVREMLKAVAARISSESTRLGERNSITIEEEDCMDDGSVIHLKLTIDSNKGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            A+FDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPVKI++P GSFLS
Sbjct: 1021 ASFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVKIHVPAGSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS+KAA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD+TFGYYETIGGGSGAGP
Sbjct: 1081 PSDKAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDNTFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            +W GTSGVQCHMTNTRMTDPEIFEQRYPV LH F LRENSGG+G  KGG+GLVREIEF++
Sbjct: 1141 TWDGTSGVQCHMTNTRMTDPEIFEQRYPVFLHKFGLRENSGGAGHRKGGNGLVREIEFRR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1230
            PVVVSILSERRVHAPRGLKGG NGARGAN+L+ KD RR+YLGGKNTV V+AGEILQILTP
Sbjct: 1201 PVVVSILSERRVHAPRGLKGGANGARGANYLITKDKRRIYLGGKNTVEVQAGEILQILTP 1260

BLAST of Cla021389 vs. TrEMBL
Match: A0A0B0P1C4_GOSAR (5-oxoprolinase-like protein OS=Gossypium arboreum GN=F383_10193 PE=4 SV=1)

HSP 1 Score: 2103.6 bits (5449), Expect = 0.0e+00
Identity = 1036/1266 (81.83%), Positives = 1138/1266 (89.89%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS + +KLRFCIDRGGTFTDVYAEIPG  DG+V+KLLSVDPSNY+DAP+EGIRRILEEY
Sbjct: 1    MGSVSGEKLRFCIDRGGTFTDVYAEIPGHSDGRVLKLLSVDPSNYDDAPIEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +G+KIPRT KIPT  IEWIRMGTTVATNALLERKGERIALCVT+GF+DLLQIG+Q+RP I
Sbjct: 61   TGQKIPRTVKIPTDKIEWIRMGTTVATNALLERKGERIALCVTRGFKDLLQIGDQSRPHI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDL+ +KPSNLYE VIEVDERVEL+  +  GN + S S+VKGVSGEL+R+V+ L+EE+LK
Sbjct: 121  FDLSAAKPSNLYEQVIEVDERVELVLDEEKGNGEKSGSFVKGVSGELVRVVKCLDEESLK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLLK LL++GISCLAVVLMHSYTYP HE+A+EKLA+S+GF+HVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLKGLLEKGISCLAVVLMHSYTYPYHEMAVEKLAMSLGFRHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPV+KEYLSGF+S+FDE   +VNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVVKEYLSGFISRFDEGLARVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAGSYEQVLET+IAGAIIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFRLETEKPLIGFDMGGTSTDVSRYAGSYEQVLETKIAGAIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANL+LG+V+PD+FP+IFGP 
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLILGYVVPDYFPAIFGPK 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLD+EATR E++KLA +INSYRK QD S++ MT+EEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDQPLDVEATREEYKKLAEQINSYRKSQDSSARDMTVEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHET+NHALACFGGAGPQHACAIAR LGM E+ IHRFCGILSAYGMGLADV+EE Q 
Sbjct: 481  EMKGHETRNHALACFGGAGPQHACAIARSLGMTEVLIHRFCGILSAYGMGLADVIEEAQV 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PY+AVY SESV E S REA LL QV  KLQ QGFRE +IK ETYLNLRYEGTDTAIMVK 
Sbjct: 541  PYAAVYGSESVVEASCREAILLNQVKQKLQEQGFREENIKAETYLNLRYEGTDTAIMVKR 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            +  ++G   D+A EFEKLFQQEYGFKLQNRN+L+CD+RVRGIGV N+LKP+  E  +G P
Sbjct: 601  RIAEDGSGSDYAEEFEKLFQQEYGFKLQNRNILVCDVRVRGIGVANILKPQTLEPASGSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            KIEGHY+V+FGNGW DTPLFKL+NLG+GH+IPGPAIIMNG+STVIVEP CKA ITKYGNI
Sbjct: 661  KIEGHYKVFFGNGWHDTPLFKLENLGYGHVIPGPAIIMNGSSTVIVEPKCKAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEI+S+ +T KV+EKVADVVQLSIFNHRFMGIAEQMGRTLQR SISTNIKERLDFSCAL
Sbjct: 721  KIEIESSVNTVKVAEKVADVVQLSIFNHRFMGIAEQMGRTLQRISISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQ+EYWGDNLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQLEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVF+NGKL+FFVASRGHHAEIGG+TPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI
Sbjct: 841  TPVFNNGKLVFFVASRGHHAEIGGVTPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900

Query: 901  IKLLQFPSSDEGV--IPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 960
            IKLL+FP +DE    IPGTRRLQDNLSDL AQVAAN RGI+LIKELI QY          
Sbjct: 901  IKLLKFPGADEHSQNIPGTRRLQDNLSDLRAQVAANQRGITLIKELIEQYGLETVQAYMT 960

Query: 961  -------------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                        G+ +SITIEEED MDDGSVIHLKL+ID +KGE
Sbjct: 961  YVQLNAEEAVREMLKAVAARISSESTRLGERNSITIEEEDCMDDGSVIHLKLSIDSNKGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            A+FDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPV I++P GSFLS
Sbjct: 1021 ASFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVIIHVPAGSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS+KAA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD+TFGYYETIGGGSGAGP
Sbjct: 1081 PSDKAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDNTFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            +W GTSGVQCHMTNTRMTDPEIFEQRYPV LH F LRENSGG+G  KGG+GLVREIEF++
Sbjct: 1141 TWDGTSGVQCHMTNTRMTDPEIFEQRYPVFLHKFGLRENSGGAGHRKGGNGLVREIEFRR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1230
            PVVVSILSERRVHAPRGLKGG NGARGAN+L+ KD RR+YLGGKNTV V+AGEILQILTP
Sbjct: 1201 PVVVSILSERRVHAPRGLKGGANGARGANYLITKDKRRIYLGGKNTVEVQAGEILQILTP 1260

BLAST of Cla021389 vs. TrEMBL
Match: A0A061E1X5_THECC (Oxoprolinase 1 OS=Theobroma cacao GN=TCM_007670 PE=4 SV=1)

HSP 1 Score: 2100.9 bits (5442), Expect = 0.0e+00
Identity = 1033/1266 (81.60%), Positives = 1135/1266 (89.65%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS +E+KLRFCIDRGGTFTDVYAEIP   DG+V+KLLSVDPSNY+DAP+EGIRRILEEY
Sbjct: 1    MGSVSEEKLRFCIDRGGTFTDVYAEIPDHPDGRVLKLLSVDPSNYDDAPIEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +G+KIPRT+KIPT  IEWIRMGTTVATNALLERKGERIALCVT+GF+DLLQIG+Q+RP+I
Sbjct: 61   TGEKIPRTAKIPTDKIEWIRMGTTVATNALLERKGERIALCVTRGFKDLLQIGDQSRPNI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLT +K SNLYE+V+EVDER+EL+  +  GN+D S S++KGVSGEL+R+V+ L+EEALK
Sbjct: 121  FDLTATKSSNLYEEVVEVDERIELVLEQDKGNKDNSKSFLKGVSGELVRVVKCLDEEALK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLLK LL+ GISCLAVVLMHSYTYP HE+A+EKLA+++GF+HVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLKGLLENGISCLAVVLMHSYTYPYHEMAVEKLAMNLGFRHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPV+KEYL+GF+S+FDE  GKVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVVKEYLAGFISRFDEGLGKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAGSYEQVLET+IAGAIIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFGLETEKPLIGFDMGGTSTDVSRYAGSYEQVLETKIAGAIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLG+VIPD+FP+IFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGYVIPDYFPAIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLD++AT+ EF+KLA +INSYRK QD S+K MT+EEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDQPLDVQATKEEFKKLAEKINSYRKSQDSSAKDMTVEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHET+NHALACFGGAGPQHACAI+R LGM  + IHRFCGILSAYGMGLADVVEE QE
Sbjct: 481  EMKGHETRNHALACFGGAGPQHACAISRSLGMTAVLIHRFCGILSAYGMGLADVVEEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PY+AVY  ESV E SRREA LLKQV  KL  QGFR  +IKTETY+NLRYEGTDTAIMVK 
Sbjct: 541  PYAAVYGPESVLEASRREAILLKQVKQKLLEQGFRGENIKTETYINLRYEGTDTAIMVKG 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
               ++G   D+A EF KLFQQEYGFKL NRN+L+CD+RVRGIGV N+LKPRA E  +G P
Sbjct: 601  HIAEDGSGCDYADEFVKLFQQEYGFKLHNRNILVCDVRVRGIGVANILKPRALERASGSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            KIE  Y+V+FGNGW DTPLFKLDNLG+GH+IPGPAIIMNG+STVIVEP C A ITKYGNI
Sbjct: 661  KIESRYKVFFGNGWHDTPLFKLDNLGYGHVIPGPAIIMNGSSTVIVEPKCNAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEI+S  +T KV+EKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIESILNTVKVAEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQ+EYWG NLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQLEYWGGNLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFDNGKL+FFVASRGHHAEIGG+TPGSMPPFSK IWEEGAAIKAFKLVEKGIFQEEGI
Sbjct: 841  TPVFDNGKLVFFVASRGHHAEIGGVTPGSMPPFSKCIWEEGAAIKAFKLVEKGIFQEEGI 900

Query: 901  IKLLQFPSSDEGV--IPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 960
            +KLL+FP +DE    IPGTR+LQDNLSDL AQVAAN RGI+LIKELI QY          
Sbjct: 901  VKLLEFPGADESTQKIPGTRQLQDNLSDLRAQVAANQRGITLIKELIEQYGLETVQAYMT 960

Query: 961  -------------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                        G+ + + IEEED MDDGSVIHLKLTID +KGE
Sbjct: 961  YVQLNAEEAVREMLKSVAARISSESTTLGERNFLMIEEEDCMDDGSVIHLKLTIDSNKGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            A FDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPVKI++P GSFLS
Sbjct: 1021 ARFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVKIHVPEGSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS++AA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD+TFGYYETIGGGSGAGP
Sbjct: 1081 PSDEAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDNTFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            SW GTSGVQCHMTNTRMTDPEIFEQRYPVLLH F LRENSGG+GI+KGGDGLVREIEF++
Sbjct: 1141 SWDGTSGVQCHMTNTRMTDPEIFEQRYPVLLHRFGLRENSGGAGIHKGGDGLVREIEFRR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1230
             VVVSILSERRVHAPRGLKGG NGARGAN+L+ KD RR+YLGGKNTV V+AGEIL+ILTP
Sbjct: 1201 AVVVSILSERRVHAPRGLKGGANGARGANYLITKDERRIYLGGKNTVEVQAGEILEILTP 1260

BLAST of Cla021389 vs. NCBI nr
Match: gi|659116894|ref|XP_008458316.1| (PREDICTED: 5-oxoprolinase [Cucumis melo])

HSP 1 Score: 2313.1 bits (5993), Expect = 0.0e+00
Identity = 1159/1265 (91.62%), Positives = 1188/1265 (93.91%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGSNNE+KLRFCIDRGGTFTDVYAEIPGR DGKV+KLLSVDPSNY+DAPVEGIRRILEEY
Sbjct: 1    MGSNNEEKLRFCIDRGGTFTDVYAEIPGRPDGKVLKLLSVDPSNYDDAPVEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +GKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI
Sbjct: 61   TGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYEDV+EVDERVELI GKGDGNQD ST YVKGVSGELIRIV+TLNEEALK
Sbjct: 121  FDLTVSKPSNLYEDVVEVDERVELIHGKGDGNQDSST-YVKGVSGELIRIVKTLNEEALK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLLKDLLQRGI CLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLKDLLQRGIGCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGFMSKFD+SSGKVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVIKEYLSGFMSKFDKSSGKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLFELET KPLIGFDMGGTSTD SRYAGSYEQVLETQIAGAIIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFELETTKPLIGFDMGGTSTDVSRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLDIEATRGEFEKLATEINSYRK QDPSSKPMTIE+IALGFVNVANETMCRPIRQLT
Sbjct: 421  EDQPLDIEATRGEFEKLATEINSYRKNQDPSSKPMTIEQIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE
Sbjct: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVYCSES  EVSRREASLLKQV +KLQ QGFREGSIKTETYLNLRY+GTDTAIMVK 
Sbjct: 541  PYSAVYCSESFLEVSRREASLLKQVKHKLQSQGFREGSIKTETYLNLRYDGTDTAIMVKG 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            Q+ DNGVEFDFAAEFEKLFQQEYGFKLQNRN+LICDIRVRG+GVTNVLKPRAFEGL+GDP
Sbjct: 601  QRADNGVEFDFAAEFEKLFQQEYGFKLQNRNILICDIRVRGVGVTNVLKPRAFEGLSGDP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            KIEGHYRVYFGNGWQDTPL KLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKAT+TKYGNI
Sbjct: 661  KIEGHYRVYFGNGWQDTPLLKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATVTKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEIDSTF TE VSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIDSTFCTENVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQI+YWGDNLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIDYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI
Sbjct: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900

Query: 901  IKLLQFPSSDEGVIPGT--------------------------RRLQDNLS--------- 960
            IKLLQFPSSDEGVIPGT                            +Q  L+         
Sbjct: 901  IKLLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIVQYGLNIVQAYMTYV 960

Query: 961  DLHAQVAANHRGISLIKELIAQYAGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGEAN 1020
             L+A+ A      S+   + +  A + +G SI IEEEDYMDDGS IHLKLTIDP+KGEAN
Sbjct: 961  QLNAEEAVREMLKSVASRVSSNSARSVEGGSIVIEEEDYMDDGSAIHLKLTIDPNKGEAN 1020

Query: 1021 FDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPS 1080
            FDFSGTSPEVYGNWNAPEAVT AAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPS
Sbjct: 1021 FDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLSPS 1080

Query: 1081 EKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSW 1140
            EKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSW
Sbjct: 1081 EKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGPSW 1140

Query: 1141 HGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQPV 1200
            HGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSG+YKGGDGLVREIEFKQPV
Sbjct: 1141 HGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGVYKGGDGLVREIEFKQPV 1200

Query: 1201 VVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGG 1231
            VVSILSERRVHAPRGLKGGK+GA GANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGG
Sbjct: 1201 VVSILSERRVHAPRGLKGGKDGAHGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTPGG 1260

BLAST of Cla021389 vs. NCBI nr
Match: gi|449441520|ref|XP_004138530.1| (PREDICTED: 5-oxoprolinase [Cucumis sativus])

HSP 1 Score: 2193.7 bits (5683), Expect = 0.0e+00
Identity = 1112/1268 (87.70%), Positives = 1158/1268 (91.32%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGSNNE+KLRFCIDRGGTFTDVYAEIPGR DGKV KLLSVDPSNY+DAPVEGIRRILEEY
Sbjct: 1    MGSNNEEKLRFCIDRGGTFTDVYAEIPGRPDGKVFKLLSVDPSNYDDAPVEGIRRILEEY 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +GKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI
Sbjct: 61   TGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYEDV+EVDERVELI GKGDGNQD ST YV+GVSGELIRIV+TLNEEALK
Sbjct: 121  FDLTVSKPSNLYEDVVEVDERVELIHGKGDGNQDSST-YVEGVSGELIRIVKTLNEEALK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLL DLLQRGI CLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLNDLLQRGIGCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLFELET KPLIGFDMGGTSTD SRYAGSYEQVLETQIAGAIIQAPQLDINTVA
Sbjct: 301  VVGYSQTLFELETTKPLIGFDMGGTSTDVSRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLDIEATRGEFEKLATEINSYRK QDPSSKPMTIEEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDQPLDIEATRGEFEKLATEINSYRKNQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE
Sbjct: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVYCS+SVQEVSRREASLLKQV +KL+ QGFREGSI TETYLNLRY+GTDTAIMVKS
Sbjct: 541  PYSAVYCSKSVQEVSRREASLLKQVKHKLRSQGFREGSINTETYLNLRYDGTDTAIMVKS 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            Q+VDNGVEFDFAAEFEKLFQQEYGFKLQNRN+LICDIRVRG+GVTNVLKPRAFEGL+GDP
Sbjct: 601  QRVDNGVEFDFAAEFEKLFQQEYGFKLQNRNILICDIRVRGVGVTNVLKPRAFEGLSGDP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            KIEGHYRVYFGNGWQDTPLFKLDNLGFG+IIPGPAIIMNGNSTVIVEPSCKAT+TKYGNI
Sbjct: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGYIIPGPAIIMNGNSTVIVEPSCKATVTKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEIDSTF T+KVSEKVADVVQLSIFNH+FMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIDSTFCTKKVSEKVADVVQLSIFNHQFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQI++WGDNLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIDFWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAE-----IGGITPGS-----------------MPPFSKSIW 900
            TPVFDNGKLIFFVASRGHHAE      G + P S                    F +   
Sbjct: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900

Query: 901  EEGAAIKAFKLVEKGIFQ-----EEGIIKL-LQFPSSDEGVIP--------GTRRLQDNL 960
             +   +  F   ++G+       ++ +  L  Q  ++  G+          G   +Q  +
Sbjct: 901  NK---LLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIVQYGLNIVQAYM 960

Query: 961  S--DLHAQVAANHRGISLIKELIAQYAGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKG 1020
            +   L+A+ A      S+   + +  A   +G SI IEEEDYMDDGS IHLKLTIDPHKG
Sbjct: 961  TYVQLNAEEAVREMLKSVASRVSSNSAKYVEGGSIAIEEEDYMDDGSAIHLKLTIDPHKG 1020

Query: 1021 EANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFL 1080
            EANFDFSGTSPEVYGNWNAPEAVT AAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFL
Sbjct: 1021 EANFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFL 1080

Query: 1081 SPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAG 1140
            SPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAG
Sbjct: 1081 SPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAG 1140

Query: 1141 PSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFK 1200
            PSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSG+YKGGDGLVREIEFK
Sbjct: 1141 PSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGVYKGGDGLVREIEFK 1200

Query: 1201 QPVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILT 1231
            QPVVVSILSERRVHAPRGLKGGK+GARGANFLVRKDGRRVYLGGKNT+TVKAGEILQILT
Sbjct: 1201 QPVVVSILSERRVHAPRGLKGGKDGARGANFLVRKDGRRVYLGGKNTITVKAGEILQILT 1260

BLAST of Cla021389 vs. NCBI nr
Match: gi|700190402|gb|KGN45606.1| (hypothetical protein Csa_6G000060 [Cucumis sativus])

HSP 1 Score: 2193.7 bits (5683), Expect = 0.0e+00
Identity = 1112/1268 (87.70%), Positives = 1158/1268 (91.32%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGSNNE+KLRFCIDRGGTFTDVYAEIPGR DGKV KLLSVDPSNY+DAPVEGIRRILEEY
Sbjct: 53   MGSNNEEKLRFCIDRGGTFTDVYAEIPGRPDGKVFKLLSVDPSNYDDAPVEGIRRILEEY 112

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +GKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI
Sbjct: 113  TGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 172

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYEDV+EVDERVELI GKGDGNQD ST YV+GVSGELIRIV+TLNEEALK
Sbjct: 173  FDLTVSKPSNLYEDVVEVDERVELIHGKGDGNQDSST-YVEGVSGELIRIVKTLNEEALK 232

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLL DLLQRGI CLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL
Sbjct: 233  PLLNDLLQRGIGCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 292

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 293  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 352

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLFELET KPLIGFDMGGTSTD SRYAGSYEQVLETQIAGAIIQAPQLDINTVA
Sbjct: 353  VVGYSQTLFELETTKPLIGFDMGGTSTDVSRYAGSYEQVLETQIAGAIIQAPQLDINTVA 412

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN
Sbjct: 413  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 472

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            EDQPLDIEATRGEFEKLATEINSYRK QDPSSKPMTIEEIALGFVNVANETMCRPIRQLT
Sbjct: 473  EDQPLDIEATRGEFEKLATEINSYRKNQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 532

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE
Sbjct: 533  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 592

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVYCS+SVQEVSRREASLLKQV +KL+ QGFREGSI TETYLNLRY+GTDTAIMVKS
Sbjct: 593  PYSAVYCSKSVQEVSRREASLLKQVKHKLRSQGFREGSINTETYLNLRYDGTDTAIMVKS 652

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            Q+VDNGVEFDFAAEFEKLFQQEYGFKLQNRN+LICDIRVRG+GVTNVLKPRAFEGL+GDP
Sbjct: 653  QRVDNGVEFDFAAEFEKLFQQEYGFKLQNRNILICDIRVRGVGVTNVLKPRAFEGLSGDP 712

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            KIEGHYRVYFGNGWQDTPLFKLDNLGFG+IIPGPAIIMNGNSTVIVEPSCKAT+TKYGNI
Sbjct: 713  KIEGHYRVYFGNGWQDTPLFKLDNLGFGYIIPGPAIIMNGNSTVIVEPSCKATVTKYGNI 772

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEIDSTF T+KVSEKVADVVQLSIFNH+FMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 773  KIEIDSTFCTKKVSEKVADVVQLSIFNHQFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 832

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQI++WGDNLNEGDVLVTNHPCAGGSHLPDITVI
Sbjct: 833  FGPDGGLVANAPHVPVHLGAMSSTVRWQIDFWGDNLNEGDVLVTNHPCAGGSHLPDITVI 892

Query: 841  TPVFDNGKLIFFVASRGHHAE-----IGGITPGS-----------------MPPFSKSIW 900
            TPVFDNGKLIFFVASRGHHAE      G + P S                    F +   
Sbjct: 893  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 952

Query: 901  EEGAAIKAFKLVEKGIFQ-----EEGIIKL-LQFPSSDEGVIP--------GTRRLQDNL 960
             +   +  F   ++G+       ++ +  L  Q  ++  G+          G   +Q  +
Sbjct: 953  NK---LLQFPSSDEGVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIVQYGLNIVQAYM 1012

Query: 961  S--DLHAQVAANHRGISLIKELIAQYAGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKG 1020
            +   L+A+ A      S+   + +  A   +G SI IEEEDYMDDGS IHLKLTIDPHKG
Sbjct: 1013 TYVQLNAEEAVREMLKSVASRVSSNSAKYVEGGSIAIEEEDYMDDGSAIHLKLTIDPHKG 1072

Query: 1021 EANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFL 1080
            EANFDFSGTSPEVYGNWNAPEAVT AAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFL
Sbjct: 1073 EANFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFL 1132

Query: 1081 SPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAG 1140
            SPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAG
Sbjct: 1133 SPSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAG 1192

Query: 1141 PSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFK 1200
            PSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSG+YKGGDGLVREIEFK
Sbjct: 1193 PSWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGVYKGGDGLVREIEFK 1252

Query: 1201 QPVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILT 1231
            QPVVVSILSERRVHAPRGLKGGK+GARGANFLVRKDGRRVYLGGKNT+TVKAGEILQILT
Sbjct: 1253 QPVVVSILSERRVHAPRGLKGGKDGARGANFLVRKDGRRVYLGGKNTITVKAGEILQILT 1312

BLAST of Cla021389 vs. NCBI nr
Match: gi|596017794|ref|XP_007218890.1| (hypothetical protein PRUPE_ppa000342mg [Prunus persica])

HSP 1 Score: 2111.6 bits (5470), Expect = 0.0e+00
Identity = 1049/1268 (82.73%), Positives = 1137/1268 (89.67%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS N++KLRFCIDRGGTFTDVYAEIPG+ DG+V+KLLSVDPSNY+DAPVEGIRRILEE+
Sbjct: 1    MGSANDNKLRFCIDRGGTFTDVYAEIPGQPDGQVLKLLSVDPSNYDDAPVEGIRRILEEF 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +GKKI R SKIPT  IEWIRMGTTVATNALLERKGERIALCVT+GFRDLLQIGNQARP I
Sbjct: 61   TGKKISRASKIPTDKIEWIRMGTTVATNALLERKGERIALCVTRGFRDLLQIGNQARPKI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYE+VIEVDERVEL     D +   S S VKGVSGE++++V+ ++ E LK
Sbjct: 121  FDLTVSKPSNLYEEVIEVDERVELANDNQDSS---SASLVKGVSGEMVKVVKPIDVETLK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLL+ LL++GISCLAVVLMHSYTYPQHE+A+E+LA S+GF+HVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLQGLLEKGISCLAVVLMHSYTYPQHEVAVERLAESLGFRHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGFMSKFDE   KVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVIKEYLSGFMSKFDEGVEKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAG+YEQVLETQIAGAIIQAPQLDI+TVA
Sbjct: 301  VVGYSQTLFGLETEKPLIGFDMGGTSTDVSRYAGTYEQVLETQIAGAIIQAPQLDISTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLG+VIPD+FPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGYVIPDYFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            ED+PLDI ATR EF+KLA++INSYRK QDPS+K MT+EEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDEPLDIRATRDEFDKLASQINSYRKSQDPSAKDMTVEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHET+NHALACFGGAGPQHACAIAR LGMKE+ IHRFCGILSAYGMGLADVVEE QE
Sbjct: 481  EMKGHETRNHALACFGGAGPQHACAIARSLGMKEVLIHRFCGILSAYGMGLADVVEEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVY  ESVQE S REA LL QV  KLQ QGFR+ ++ TETYLNLRYEGTDT+IMVK 
Sbjct: 541  PYSAVYSLESVQEASHREAILLSQVRQKLQEQGFRDENMTTETYLNLRYEGTDTSIMVKK 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            +  ++G   ++  +F +LFQQEYGFKL NRN+LICD+RVRG+GVTN+LKP A E  +  P
Sbjct: 601  RITEDGRGCNYNLDFVELFQQEYGFKLLNRNILICDVRVRGVGVTNILKPLALERTSCSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            K+EG+Y+VYFGNGWQ+TPL+KL+ LG+GHI+ GPAIIMNGNSTVIVEP+CKA ITKYGNI
Sbjct: 661  KVEGNYKVYFGNGWQETPLYKLEKLGYGHIMAGPAIIMNGNSTVIVEPNCKAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEIDST ST KV EKVA+VVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIDSTSSTMKVVEKVANVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQI YWGDNL+EGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQINYWGDNLSEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFDNGKL+FFVASRGHHAEIGGITPGSMPPFSKSIWEEGAA+KAFKLVEKGIFQEEGI
Sbjct: 841  TPVFDNGKLVFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAALKAFKLVEKGIFQEEGI 900

Query: 901  IKLLQFPSSDE--GVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 960
             KLL+FP SDE    IPGTRRLQDNLSDL AQVAAN RGI+LIKELI QY          
Sbjct: 901  TKLLRFPCSDELAQKIPGTRRLQDNLSDLRAQVAANKRGITLIKELIEQYGLDTVQAYMT 960

Query: 961  -------------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                     + +G  SS+TIEEEDYMDDGS+IHLKLTID   GE
Sbjct: 961  YVQLNAEEAVREMLKSVAARVLSQPSSSGDRSSVTIEEEDYMDDGSIIHLKLTIDSDNGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            ANFDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPVKIYIPPGSFLS
Sbjct: 1021 ANFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVKIYIPPGSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS+KAA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD+TFGYYETIGGGSGAGP
Sbjct: 1081 PSDKAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDETFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            +W GTSGVQCHMTNTRMTDPEIFEQRYPVLLH F LRENSGG G +KGGDGLVREIEFK+
Sbjct: 1141 TWDGTSGVQCHMTNTRMTDPEIFEQRYPVLLHKFGLRENSGGVGYHKGGDGLVREIEFKR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1232
            P+VVSILSERRVH PRGLKGGK+GARGANFL+ +D RRVYLGGKNTV V+ GEILQILTP
Sbjct: 1201 PIVVSILSERRVHTPRGLKGGKDGARGANFLITQDKRRVYLGGKNTVEVQPGEILQILTP 1260

BLAST of Cla021389 vs. NCBI nr
Match: gi|645252441|ref|XP_008232124.1| (PREDICTED: 5-oxoprolinase [Prunus mume])

HSP 1 Score: 2109.3 bits (5464), Expect = 0.0e+00
Identity = 1047/1268 (82.57%), Positives = 1137/1268 (89.67%), Query Frame = 1

Query: 1    MGSNNEDKLRFCIDRGGTFTDVYAEIPGRQDGKVIKLLSVDPSNYEDAPVEGIRRILEEY 60
            MGS N++KLRFCIDRGGTFTDVYAEIPG+ DG+V+KLLSVDPSNY+DAPVEGIRRILEE+
Sbjct: 1    MGSANDNKLRFCIDRGGTFTDVYAEIPGQPDGQVLKLLSVDPSNYDDAPVEGIRRILEEF 60

Query: 61   SGKKIPRTSKIPTQNIEWIRMGTTVATNALLERKGERIALCVTKGFRDLLQIGNQARPDI 120
            +G+KI R SKIPT  IEWIRMGTTVATNALLERKGERIALCVT+GFRDLLQIGNQARP I
Sbjct: 61   TGEKISRASKIPTDKIEWIRMGTTVATNALLERKGERIALCVTRGFRDLLQIGNQARPKI 120

Query: 121  FDLTVSKPSNLYEDVIEVDERVELIRGKGDGNQDFSTSYVKGVSGELIRIVRTLNEEALK 180
            FDLTVSKPSNLYE+V+EVDERVEL     D +   S S VKGVSGE++++V+ ++ E LK
Sbjct: 121  FDLTVSKPSNLYEEVVEVDERVELANDNQDSS---SASLVKGVSGEMVKVVKPIDVETLK 180

Query: 181  PLLKDLLQRGISCLAVVLMHSYTYPQHELALEKLALSMGFKHVSLSSALTPMVRAVPRGL 240
            PLL+ LL++GISCLAVVLMHSYTYPQHE+A+E+LA S+GF+HVSLSSALTPMVRAVPRGL
Sbjct: 181  PLLQGLLEKGISCLAVVLMHSYTYPQHEVAVERLAESLGFRHVSLSSALTPMVRAVPRGL 240

Query: 241  TASVDAYLTPVIKEYLSGFMSKFDESSGKVNVLFMQSDGGLAPENRFSGHKAVLSGPAGG 300
            TASVDAYLTPVIKEYLSGFMSKFDE   KVNVLFMQSDGGLAPE+RFSGHKAVLSGPAGG
Sbjct: 241  TASVDAYLTPVIKEYLSGFMSKFDEGVEKVNVLFMQSDGGLAPESRFSGHKAVLSGPAGG 300

Query: 301  VVGYSQTLFELETKKPLIGFDMGGTSTDASRYAGSYEQVLETQIAGAIIQAPQLDINTVA 360
            VVGYSQTLF LET+KPLIGFDMGGTSTD SRYAG+YEQVLETQIAGAIIQAPQLDI+TVA
Sbjct: 301  VVGYSQTLFGLETEKPLIGFDMGGTSTDVSRYAGTYEQVLETQIAGAIIQAPQLDISTVA 360

Query: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGFVIPDFFPSIFGPN 420
            AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLG+VIPD+FPSIFGPN
Sbjct: 361  AGGGSKLKFQFGAFRVGPESVGAHPGPVCYRKGGELAVTDANLVLGYVIPDYFPSIFGPN 420

Query: 421  EDQPLDIEATRGEFEKLATEINSYRKIQDPSSKPMTIEEIALGFVNVANETMCRPIRQLT 480
            ED+PLDI ATR EF+KLA +INSYRK QDPS+K MT+EEIALGFVNVANETMCRPIRQLT
Sbjct: 421  EDKPLDIRATRDEFDKLARQINSYRKSQDPSAKDMTVEEIALGFVNVANETMCRPIRQLT 480

Query: 481  EMKGHETKNHALACFGGAGPQHACAIARLLGMKEIFIHRFCGILSAYGMGLADVVEEEQE 540
            EMKGHET+NHALACFGGAGPQHACAIAR LGMKE+ IHRFCGILSAYGMGLADVVEE QE
Sbjct: 481  EMKGHETRNHALACFGGAGPQHACAIARSLGMKEVLIHRFCGILSAYGMGLADVVEEAQE 540

Query: 541  PYSAVYCSESVQEVSRREASLLKQVNYKLQGQGFREGSIKTETYLNLRYEGTDTAIMVKS 600
            PYSAVY  ESVQE S REA LL QV  KLQ QGFR+ ++ TETYLNLRYEGTDT+IMVK 
Sbjct: 541  PYSAVYSLESVQEASHREAILLSQVRQKLQEQGFRDENMTTETYLNLRYEGTDTSIMVKK 600

Query: 601  QQVDNGVEFDFAAEFEKLFQQEYGFKLQNRNVLICDIRVRGIGVTNVLKPRAFEGLAGDP 660
            +  ++G   ++  +F +LFQQEYGFKL NRN+LICD+RVRG+GVTN+LKP A E  +  P
Sbjct: 601  RITEDGRGCNYDLDFVELFQQEYGFKLLNRNILICDVRVRGVGVTNILKPLALERTSCSP 660

Query: 661  KIEGHYRVYFGNGWQDTPLFKLDNLGFGHIIPGPAIIMNGNSTVIVEPSCKATITKYGNI 720
            K+EG+Y+VYFGNGWQ+TPL+KL+ LG+GHI+ GPAIIMNGNSTVIVEP+CKA ITKYGNI
Sbjct: 661  KVEGNYKVYFGNGWQETPLYKLEKLGYGHIMAGPAIIMNGNSTVIVEPNCKAIITKYGNI 720

Query: 721  KIEIDSTFSTEKVSEKVADVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780
            KIEIDST ST KV EKVA+VVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL
Sbjct: 721  KIEIDSTSSTMKVVEKVANVVQLSIFNHRFMGIAEQMGRTLQRTSISTNIKERLDFSCAL 780

Query: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQIEYWGDNLNEGDVLVTNHPCAGGSHLPDITVI 840
            FGPDGGLVANAPHVPVHLGAMSSTVRWQI YWGDNL+EGDVLVTNHPCAGGSHLPDITVI
Sbjct: 781  FGPDGGLVANAPHVPVHLGAMSSTVRWQINYWGDNLSEGDVLVTNHPCAGGSHLPDITVI 840

Query: 841  TPVFDNGKLIFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAAIKAFKLVEKGIFQEEGI 900
            TPVFDNGKL+FFVASRGHHAEIGGITPGSMPPFSKSIWEEGAA+KAFKLVEK IFQEEGI
Sbjct: 841  TPVFDNGKLVFFVASRGHHAEIGGITPGSMPPFSKSIWEEGAALKAFKLVEKEIFQEEGI 900

Query: 901  IKLLQFPSSDE--GVIPGTRRLQDNLSDLHAQVAANHRGISLIKELIAQY---------- 960
             KLL+FP SDE    IPGTRRLQDNLSDL AQVAAN RGI+LIKELI QY          
Sbjct: 901  TKLLRFPCSDELAQKIPGTRRLQDNLSDLQAQVAANKRGITLIKELIEQYGLDTVQAYMT 960

Query: 961  -------------------------AGTGKGSSITIEEEDYMDDGSVIHLKLTIDPHKGE 1020
                                     + +G GSS+TIEEEDYMDDGS+IHLKLTID  KGE
Sbjct: 961  YVQLNAEEAVREMLKSVAARVLSQPSSSGDGSSVTIEEEDYMDDGSIIHLKLTIDSDKGE 1020

Query: 1021 ANFDFSGTSPEVYGNWNAPEAVTVAAVIYCLRCMVDVDIPLNQGCLAPVKIYIPPGSFLS 1080
            ANFDFSGTSPEVYGNWNAPEAVT AAVIYCLRC+VDVDIPLNQGCLAPVKIYIPPGSFLS
Sbjct: 1021 ANFDFSGTSPEVYGNWNAPEAVTAAAVIYCLRCLVDVDIPLNQGCLAPVKIYIPPGSFLS 1080

Query: 1081 PSEKAAIVGGNVLTSQRITDVILTAFQACACSQGCMNNLTFGDDTFGYYETIGGGSGAGP 1140
            PS+KAA+VGGNVLTSQRITDV+LTAFQACACSQGCMNNLTFGD+TFGYYETIGGGSGAGP
Sbjct: 1081 PSDKAAVVGGNVLTSQRITDVVLTAFQACACSQGCMNNLTFGDETFGYYETIGGGSGAGP 1140

Query: 1141 SWHGTSGVQCHMTNTRMTDPEIFEQRYPVLLHTFALRENSGGSGIYKGGDGLVREIEFKQ 1200
            +W GTSGVQCHMTNTRMTDPEIFEQRYPVLLH F LRENSGG G ++GGDGLVREIEFK+
Sbjct: 1141 TWDGTSGVQCHMTNTRMTDPEIFEQRYPVLLHKFGLRENSGGVGYHRGGDGLVREIEFKR 1200

Query: 1201 PVVVSILSERRVHAPRGLKGGKNGARGANFLVRKDGRRVYLGGKNTVTVKAGEILQILTP 1232
            P+VVSILSERRVH PRGLKGGK+GARGANFL+ +D RRVYLGGKNTV V+ GEILQILTP
Sbjct: 1201 PIVVSILSERRVHTPRGLKGGKDGARGANFLITQDKRRVYLGGKNTVEVQPGEILQILTP 1260

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
OPLA_ARATH	0.0e+00	75.35	5-oxoprolinase OS=Arabidopsis thaliana GN=OXP1 PE=2 SV=1
OPLA_MOUSE	0.0e+00	56.56	5-oxoprolinase OS=Mus musculus GN=Oplah PE=1 SV=1
OPLA_HUMAN	0.0e+00	56.77	5-oxoprolinase OS=Homo sapiens GN=OPLAH PE=1 SV=3
OPLA_RAT	0.0e+00	56.64	5-oxoprolinase OS=Rattus norvegicus GN=Oplah PE=1 SV=2
OPLA_BOVIN	0.0e+00	55.94	5-oxoprolinase OS=Bos taurus GN=OPLAH PE=1 SV=1

Match Name	E-value	Identity	Description
M5X2E6_PRUPE	0.0e+00	82.73	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000342mg PE=4 SV=1
B9SP24_RICCO	0.0e+00	82.46	5-oxoprolinase, putative OS=Ricinus communis GN=RCOM_1248770 PE=4 SV=1
A0A0D2RJ73_GOSRA	0.0e+00	82.15	Uncharacterized protein OS=Gossypium raimondii GN=B456_005G213100 PE=4 SV=1
A0A0B0P1C4_GOSAR	0.0e+00	81.83	5-oxoprolinase-like protein OS=Gossypium arboreum GN=F383_10193 PE=4 SV=1
A0A061E1X5_THECC	0.0e+00	81.60	Oxoprolinase 1 OS=Theobroma cacao GN=TCM_007670 PE=4 SV=1

Match Name	E-value	Identity	Description
gi|659116894|ref|XP_008458316.1|	0.0e+00	91.62	PREDICTED: 5-oxoprolinase [Cucumis melo]
gi|449441520|ref|XP_004138530.1|	0.0e+00	87.70	PREDICTED: 5-oxoprolinase [Cucumis sativus]
gi|700190402|gb|KGN45606.1|	0.0e+00	87.70	hypothetical protein Csa_6G000060 [Cucumis sativus]
gi|596017794|ref|XP_007218890.1|	0.0e+00	82.73	hypothetical protein PRUPE_ppa000342mg [Prunus persica]
gi|645252441|ref|XP_008232124.1|	0.0e+00	82.57	PREDICTED: 5-oxoprolinase [Prunus mume]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002821	Hydantoinase_A
IPR003692	Hydantoinase_B
IPR008040	Hydant_A_N
Vocabulary: Molecular Function
Term	Definition
GO:0016787	hydrolase activity
GO:0003824	catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006751	glutathione catabolic process
cellular_component	GO:0005829	cytosol
cellular_component	GO:0009506	plasmodesma
molecular_function	GO:0017168	5-oxoprolinase (ATP-hydrolyzing) activity
molecular_function	GO:0003824	catalytic activity
molecular_function	GO:0016787	hydrolase activity
This gene is associated with the following unigenes:
Unigene Name	Analysis Name	Sequence type in Unigene
WMU37447	watermelon unigene v2 vs TrEMBL	transcribed_cluster
WMU67981	watermelon EST collection version 2.0	transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla021389	Cla021389.1	mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name	Unique Name	Type
WMU67981	WMU67981	transcribed_cluster
WMU37447	WMU37447	transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002821	Hydantoinase/oxoprolinase	PFAM	PF01968	Hydantoinase_A	coord: 238..537, score: 1.3E
IPR003692	Hydantoinase B/oxoprolinase	PFAM	PF02538	Hydantoinase_B	coord: 739..948, score: 5.0E-93; coord: 958..1230, score: 9.2E
IPR008040	Hydantoinase/oxoprolinase, N-terminal	PFAM	PF05378	Hydant_A_N	coord: 10..218, score: 3.3
None	No IPR available	PANTHER	PTHR11365	5-OXOPROLINASE RELATED	coord: 3..1229, score:
None	No IPR available	PANTHER	PTHR11365:SF2	5-OXOPROLINASE	coord: 3..1229, score:

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cla021389	Cla97C05G083630	Watermelon (97103) v2	wmwmbB206
Cla021389	ClCG05G003000	Watermelon (Charleston Gray)	wcgwmB286
The following gene(s) are paralogous to this gene:

None