Cla022020 (gene) Watermelon (97103) v1

NameCla022020
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionPentatricopeptide repeat-containing protein (AHRD V1 ***- D7KQP9_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
LocationChr8 : 19614073 .. 19615605 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGCCATTACAAATCGACGTATTATCATCAATAACTTCTCATTTTCCTTCATCTTTCATCAACGTTTTACTCCCTTTACATCCGATTCCACGGCGGCCGCTCCTCCCCACAACAACATTCCTTCGGCAATCAACCCGACTCACCTACGCCGTGTCTGTACCGTTCTATATCAGCAACAGAACTCCCCCGATATCAAGCTTCACTCCAAACTTCTCGCTTGTAATTTCAATCTCTCACACGAATTCTTCCTCCAGGTATGCAACACTTTCCCTCTCTCTTGGCGTCCCGTTTATCGCTTCTTCCAATTCACTGAAACCGACCCTAATTTCACTCACACGGCGGTTTCTTTCAATAAGTTGATTGATGTTGTTGGGAAATCACGAAATATCGATCTCTTATGGGGTTTGGTTCAGGAAATGGGGCGGCGGCGGTTGGTTACTGATAAGACCTTTGTAGTTGCTCTTAGAACTCTCGCGGCGGCCAGAGAGTTGAAGAAGTGTGTAGAGTTTTTCCATTTGATGGATGGATATGGATTTGGTTATAGTTTAATGACTTTGAATAAGGTAGTTGAGAAATTGTGTGGCTGTAAATTAGTGGACGAGGCTAAGTTTTTGGTTATGAAATTGAAGGAATGGATCAAAGCTGATGAGGTTACTTATAAATGGTTGATTAAGGGGTTTTGTAATGTGGGGGATTTGGTTGAAGCTTCAAAGATTTGGAACTTAATGGTGGATGAAGGGTTTGAGCCTGAAATGGAAGCTGTGGAGGAGATGATGAATGTTCTTTTCAAGACCAATAAATTTGATGAAGCTTTGAAACTTTTCCAGGCAGTAAGATCAAACAGGATGAATGATTTGATTCCTTCAACTTACAGTCTTGTTATAAGATGGTTGTGTAACAAAGCTAAGGTGCAGCAAGCGTATGTCGTGTTCGACGAAATGCATAAGAGAGGACTTGAAGCTGATAATTCAGCACATTCTTCACTTATTTATGGGCTTTTAGCAAGAGGGAGGAGGGGAGAAGCTTATAATATAATGAGAAGAATTGAGAATCCTGATTTGGGTGTGTATCATGCATTGATTAAGGGACTTTTGAGGTTGAAAAGGGCAAATGAAGCAACCCAAGTTTTCAGGGAAATGGTTGAAAGAGGGTGTGAGCCTATAATGCATACATATATAATGTTGTTGCAAGGACATTTAGGGAAAAGGGGGAGGAAGGGATTGGATCCACTTGTGAATTTTGATACTATTTTTGTTGGAGGTTTGGTGAAGAATGGGAAGTCATTGGAGGCCACAAAGTATGTGGAAAGACTGATGAAAAGAGGGCTTGAAGTGCCAAGGTTTGATTACAACAAGTTTTTGCATTACTATTCAAATGAGGAGGGAGTGGTAATGTTTAGAGAGGTGGGGAATAGGTTGAGAGAAGTTGGATTGGTTGATTTGGCTGATATATTTCAGAGATATGGGGAGAAAATGACCACTAGAGATAGAAGGAGAGATCGAGCAACAACGGTGTCGTCTGGGATTTGA

mRNA sequence

ATGGCAGCCATTACAAATCGACGTATTATCATCAATAACTTCTCATTTTCCTTCATCTTTCATCAACGTTTTACTCCCTTTACATCCGATTCCACGGCGGCCGCTCCTCCCCACAACAACATTCCTTCGGCAATCAACCCGACTCACCTACGCCGTGTCTGTACCGTTCTATATCAGCAACAGAACTCCCCCGATATCAAGCTTCACTCCAAACTTCTCGCTTGTAATTTCAATCTCTCACACGAATTCTTCCTCCAGGTATGCAACACTTTCCCTCTCTCTTGGCGTCCCGTTTATCGCTTCTTCCAATTCACTGAAACCGACCCTAATTTCACTCACACGGCGGTTTCTTTCAATAAGTTGATTGATGTTGTTGGGAAATCACGAAATATCGATCTCTTATGGGGTTTGGTTCAGGAAATGGGGCGGCGGCGGTTGGTTACTGATAAGACCTTTGTAGTTGCTCTTAGAACTCTCGCGGCGGCCAGAGAGTTGAAGAAGTGTGTAGAGTTTTTCCATTTGATGGATGGATATGGATTTGGTTATAGTTTAATGACTTTGAATAAGGTAGTTGAGAAATTGTGTGGCTGTAAATTAGTGGACGAGGCTAAGTTTTTGGTTATGAAATTGAAGGAATGGATCAAAGCTGATGAGGTTACTTATAAATGGTTGATTAAGGGGTTTTGTAATGTGGGGGATTTGGTTGAAGCTTCAAAGATTTGGAACTTAATGGTGGATGAAGGGTTTGAGCCTGAAATGGAAGCTGTGGAGGAGATGATGAATGTTCTTTTCAAGACCAATAAATTTGATGAAGCTTTGAAACTTTTCCAGGCAGTAAGATCAAACAGGATGAATGATTTGATTCCTTCAACTTACAGTCTTGTTATAAGATGGTTGTGTAACAAAGCTAAGGTGCAGCAAGCGTATGTCGTGTTCGACGAAATGCATAAGAGAGGACTTGAAGCTGATAATTCAGCACATTCTTCACTTATTTATGGGCTTTTAGCAAGAGGGAGGAGGGGAGAAGCTTATAATATAATGAGAAGAATTGAGAATCCTGATTTGGGTGTGTATCATGCATTGATTAAGGGACTTTTGAGGTTGAAAAGGGCAAATGAAGCAACCCAAGTTTTCAGGGAAATGGTTGAAAGAGGGTGTGAGCCTATAATGCATACATATATAATGTTGTTGCAAGGACATTTAGGGAAAAGGGGGAGGAAGGGATTGGATCCACTTGTGAATTTTGATACTATTTTTGTTGGAGGTTTGGTGAAGAATGGGAAGTCATTGGAGGCCACAAAGTATGTGGAAAGACTGATGAAAAGAGGGCTTGAAGTGCCAAGGTTTGATTACAACAAGTTTTTGCATTACTATTCAAATGAGGAGGGAGTGGTAATGTTTAGAGAGGTGGGGAATAGGTTGAGAGAAGTTGGATTGGTTGATTTGGCTGATATATTTCAGAGATATGGGGAGAAAATGACCACTAGAGATAGAAGGAGAGATCGAGCAACAACGGTGTCGTCTGGGATTTGA

Coding sequence (CDS)

ATGGCAGCCATTACAAATCGACGTATTATCATCAATAACTTCTCATTTTCCTTCATCTTTCATCAACGTTTTACTCCCTTTACATCCGATTCCACGGCGGCCGCTCCTCCCCACAACAACATTCCTTCGGCAATCAACCCGACTCACCTACGCCGTGTCTGTACCGTTCTATATCAGCAACAGAACTCCCCCGATATCAAGCTTCACTCCAAACTTCTCGCTTGTAATTTCAATCTCTCACACGAATTCTTCCTCCAGGTATGCAACACTTTCCCTCTCTCTTGGCGTCCCGTTTATCGCTTCTTCCAATTCACTGAAACCGACCCTAATTTCACTCACACGGCGGTTTCTTTCAATAAGTTGATTGATGTTGTTGGGAAATCACGAAATATCGATCTCTTATGGGGTTTGGTTCAGGAAATGGGGCGGCGGCGGTTGGTTACTGATAAGACCTTTGTAGTTGCTCTTAGAACTCTCGCGGCGGCCAGAGAGTTGAAGAAGTGTGTAGAGTTTTTCCATTTGATGGATGGATATGGATTTGGTTATAGTTTAATGACTTTGAATAAGGTAGTTGAGAAATTGTGTGGCTGTAAATTAGTGGACGAGGCTAAGTTTTTGGTTATGAAATTGAAGGAATGGATCAAAGCTGATGAGGTTACTTATAAATGGTTGATTAAGGGGTTTTGTAATGTGGGGGATTTGGTTGAAGCTTCAAAGATTTGGAACTTAATGGTGGATGAAGGGTTTGAGCCTGAAATGGAAGCTGTGGAGGAGATGATGAATGTTCTTTTCAAGACCAATAAATTTGATGAAGCTTTGAAACTTTTCCAGGCAGTAAGATCAAACAGGATGAATGATTTGATTCCTTCAACTTACAGTCTTGTTATAAGATGGTTGTGTAACAAAGCTAAGGTGCAGCAAGCGTATGTCGTGTTCGACGAAATGCATAAGAGAGGACTTGAAGCTGATAATTCAGCACATTCTTCACTTATTTATGGGCTTTTAGCAAGAGGGAGGAGGGGAGAAGCTTATAATATAATGAGAAGAATTGAGAATCCTGATTTGGGTGTGTATCATGCATTGATTAAGGGACTTTTGAGGTTGAAAAGGGCAAATGAAGCAACCCAAGTTTTCAGGGAAATGGTTGAAAGAGGGTGTGAGCCTATAATGCATACATATATAATGTTGTTGCAAGGACATTTAGGGAAAAGGGGGAGGAAGGGATTGGATCCACTTGTGAATTTTGATACTATTTTTGTTGGAGGTTTGGTGAAGAATGGGAAGTCATTGGAGGCCACAAAGTATGTGGAAAGACTGATGAAAAGAGGGCTTGAAGTGCCAAGGTTTGATTACAACAAGTTTTTGCATTACTATTCAAATGAGGAGGGAGTGGTAATGTTTAGAGAGGTGGGGAATAGGTTGAGAGAAGTTGGATTGGTTGATTTGGCTGATATATTTCAGAGATATGGGGAGAAAATGACCACTAGAGATAGAAGGAGAGATCGAGCAACAACGGTGTCGTCTGGGATTTGA

Protein sequence

MAAITNRRIIINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRRDRATTVSSGI
BLAST of Cla022020 vs. Swiss-Prot
Match: PPR59_ARATH (Putative pentatricopeptide repeat-containing protein At1g26500 OS=Arabidopsis thaliana GN=At1g26500 PE=3 SV=1)

HSP 1 Score: 629.0 bits (1621), Expect = 4.5e-179
Identity = 304/505 (60.20%), Postives = 390/505 (77.23%), Query Frame = 1

Query: 1   MAAITNRRIII--NNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLY 60
           +A +T+RR+I   N+    FI + RF       T   P        IN  HL RVCT+LY
Sbjct: 3   VAVVTSRRMINIGNSIRRCFILNHRFFSTELTPTTITP--------INQDHLLRVCTILY 62

Query: 61  QQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETD-PNFTHTAVS 120
           QQQNSPD +L SKL +  F L+HEFFLQVCN FPLSWRPV+RFF +++T  P+FTHT+ +
Sbjct: 63  QQQNSPDSRLVSKLSSTKFQLTHEFFLQVCNNFPLSWRPVHRFFLYSQTHHPDFTHTSTT 122

Query: 121 FNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDG 180
            NK++ ++G SRN+DL W L QE+G+R LV DKTF + L+TLA+ARELKKCV +FHLM+G
Sbjct: 123 SNKMLAIIGNSRNMDLFWELAQEIGKRGLVNDKTFRIVLKTLASARELKKCVNYFHLMNG 182

Query: 181 YGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEA 240
           +G+ Y++ T+N+ VE LC  KLV+EAKF+ +KLKE+IK DE+TY+ +I+GFC+VGDL+EA
Sbjct: 183 FGYLYNVETMNRGVETLCKEKLVEEAKFVFIKLKEFIKPDEITYRTMIQGFCDVGDLIEA 242

Query: 241 SKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIR 300
           +K+WNLM+DEGF+ ++EA +++M  L K N+FDEA K+F  + S R  DL    Y ++I 
Sbjct: 243 AKLWNLMMDEGFDVDIEAGKKIMETLLKKNQFDEASKVFYVMVSKRGGDLDGGFYRVMID 302

Query: 301 WLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGV 360
           WLC   ++  A  VFDEM +RG+  DN   +SLIYGLL + R  EAY ++  +ENPD+ +
Sbjct: 303 WLCKNGRIDMARKVFDEMRERGVYVDNLTWASLIYGLLVKRRVVEAYGLVEGVENPDISI 362

Query: 361 YHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDT 420
           YH LIKGL+++KRA+EAT+VFR+M++RGCEPIMHTY+MLLQGHLG+RGRKG DPLVNFDT
Sbjct: 363 YHGLIKGLVKIKRASEATEVFRKMIQRGCEPIMHTYLMLLQGHLGRRGRKGPDPLVNFDT 422

Query: 421 IFVGGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREV 480
           IFVGG++K GK LE TKY+ER +KRGLEVPRFDY+KFLHYYSNEEGVVMF E+  +LREV
Sbjct: 423 IFVGGMIKAGKRLETTKYIERTLKRGLEVPRFDYSKFLHYYSNEEGVVMFEEMAKKLREV 482

Query: 481 GLVDLADIFQRYGEKMTTRDRRRDR 503
            L DLADIFQRYGEKMTTR+RRRDR
Sbjct: 483 SLFDLADIFQRYGEKMTTRERRRDR 499

BLAST of Cla022020 vs. Swiss-Prot
Match: PP293_ARATH (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 3.5e-67
Identity = 144/476 (30.25%), Postives = 259/476 (54.41%), Query Frame = 1

Query: 41  IPSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           + S+ NP  + RVC V+  +  + D  + + L     +LSH+  ++V   F  + +P +R
Sbjct: 122 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 181

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 182 FFCWAAERQGFAHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 241

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KLKE    + +T
Sbjct: 242 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 301

Query: 221 YKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +L+EA++IWN M+D+G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 302 YTVLLNGWCRVRNLIEAARIWNDMIDQGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMK 361

Query: 281 SNRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y+++IR  C ++ ++ A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 362 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 421

Query: 341 GEAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K    AT+++ +M++   EP +HT+ M+
Sbjct: 422 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMI 481

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+  GKS EA +Y+E ++ +G+
Sbjct: 482 MKSYFMARNYEMGRAVWEEMIKKGICPDDNSYTVLIRGLIGEGKSREACRYLEEMLDKGM 541

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 542 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRFKQR 595

BLAST of Cla022020 vs. Swiss-Prot
Match: PP294_ARATH (Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidopsis thaliana GN=At3g62540 PE=2 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 1.3e-66
Identity = 143/476 (30.04%), Postives = 257/476 (53.99%), Query Frame = 1

Query: 41  IPSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           + S+ NP  + RVC V+  +  + D  + + L     +LSH+  ++V   F  + +P +R
Sbjct: 122 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 181

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 182 FFCWAAERQGFAHASRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 241

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KLKE    + +T
Sbjct: 242 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 301

Query: 221 YKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +L+EA++IWN M+D G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 302 YTVLLNGWCRVRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMK 361

Query: 281 SNRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y+++IR  C ++ ++ A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 362 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 421

Query: 341 GEAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K     T+++ +M++   EP +HT+ M+
Sbjct: 422 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMI 481

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+  GKS EA +Y+E ++ +G+
Sbjct: 482 MKSYFVARNYEMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGM 541

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 542 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRCKQR 595

BLAST of Cla022020 vs. Swiss-Prot
Match: PP382_ARATH (Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidopsis thaliana GN=At5g14820 PE=2 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 1.8e-66
Identity = 143/476 (30.04%), Postives = 257/476 (53.99%), Query Frame = 1

Query: 41  IPSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           + S+ NP  + RVC V+  +  + D  + + L     +LSH+  ++V   F  + +P +R
Sbjct: 121 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 180

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 181 FFCWAAERQGFAHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 240

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KLKE    + +T
Sbjct: 241 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 300

Query: 221 YKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +L+EA++IWN M+D G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 301 YTVLLNGWCRVRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMK 360

Query: 281 SNRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y+++IR  C ++ ++ A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 361 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 420

Query: 341 GEAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K     T+++ +M++   EP +HT+ M+
Sbjct: 421 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMI 480

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+  GKS EA +Y+E ++ +G+
Sbjct: 481 MKSYFVARNYEMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGM 540

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 541 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRCKQR 594

BLAST of Cla022020 vs. Swiss-Prot
Match: PP275_ARATH (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 8.0e-27
Identity = 89/365 (24.38%), Postives = 164/365 (44.93%), Query Frame = 1

Query: 99  YRFFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRR--RLVTDKTFVVAL 158
           YRFF +    P + H+      ++ ++ K R    +WGL++EM +    L+  + FVV +
Sbjct: 115 YRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLM 174

Query: 159 RTLAAARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKA 218
           R  A+A  +KK VE    M  YG          +++ LC    V EA  +   ++E    
Sbjct: 175 RRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPP 234

Query: 219 DEVTYKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLF 278
           +   +  L+ G+C  G L+EA ++   M + G EP++     +++      K  +A  L 
Sbjct: 235 NLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLM 294

Query: 279 QAVRSNRMNDLIPSTYSLVIRWLCNKAK-VQQAYVVFDEMHKRGLEADNSAHSSLIYGLL 338
             +R       + + Y+++I+ LC   K + +A  VF EM + G EAD   +++LI G  
Sbjct: 295 NDMRKRGFEPNV-NCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFC 354

Query: 339 ARGRRGEAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMVERGCEP--I 398
             G   + Y+++  +      P    Y  ++    + ++  E  ++  +M  RGC P  +
Sbjct: 355 KWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLL 414

Query: 399 MHTYIMLLQGHLG----------KRGRKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERL 445
           ++  ++ L   LG          +    GL P V+   I + G    G  +EA  + + +
Sbjct: 415 IYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEM 474


HSP 2 Score: 67.0 bits (162), Expect = 6.8e-10
Identity = 59/236 (25.00%), Postives = 108/236 (45.76%), Query Frame = 1

Query: 152 FVVALRTLAAARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGC-KLVDEAKFLVMKL 211
           F   L   A A ++    +  + M   GF  ++     +++ LC   K +DEA  + +++
Sbjct: 274 FTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEM 333

Query: 212 KEW-IKADEVTYKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKF 271
           + +  +AD VTY  LI GFC  G + +   + + M  +G  P      ++M    K  +F
Sbjct: 334 ERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQF 393

Query: 272 DEALKLFQAVRSNRMN-DLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHS 331
           +E L+L + ++    + DL+   Y++VIR  C   +V++A  +++EM   GL        
Sbjct: 394 EECLELIEKMKRRGCHPDLL--IYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFV 453

Query: 332 SLIYGLLARGRRGEAYNIMRRIEN------PDLGVYHALIKGLLRLKRANEATQVF 379
            +I G  ++G   EA N  + + +      P  G   +L+  L+R  +   A  V+
Sbjct: 454 IMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVW 507


HSP 3 Score: 44.7 bits (104), Expect = 3.6e-03
Identity = 47/195 (24.10%), Postives = 82/195 (42.05%), Query Frame = 1

Query: 201 DEAKFLVMKLKE-WIKADEVTYKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEM 260
           +E   L+ K+K      D + Y  +I+  C +G++ EA ++WN M   G  P ++    M
Sbjct: 394 EECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIM 453

Query: 261 MNVLFKTNKFDEALKLFQAVRSNRMNDLIP-STYSLVIRWLCNKAKVQQAYVVFDEMHKR 320
           +N         EA   F+ + S  +       T   ++  L    K++ A  V+  +  +
Sbjct: 454 INGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSCISNK 513

Query: 321 --GLEADNSAHSSLIYGLLARGRRGEA----YNIMRRIENPDLGVYHALIKGLLRLKRAN 380
               E + SA +  I+ L A+G   EA     ++M     P    Y  L+KGL +L    
Sbjct: 514 TSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKLYNRT 573

Query: 381 EATQVFREMVERGCE 388
            A ++  ++V+   E
Sbjct: 574 IAAEITEKVVKMASE 588

BLAST of Cla022020 vs. TrEMBL
Match: A0A0A0KD74_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G366430 PE=4 SV=1)

HSP 1 Score: 931.4 bits (2406), Expect = 4.7e-268
Identity = 459/505 (90.89%), Postives = 480/505 (95.05%), Query Frame = 1

Query: 1   MAAITNRRIIINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQ 60
           MAAIT RRIIINNFSFSFIFHQRF+PFTSDS+ AA   +NIP  I+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSSTAA---DNIPQPIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSPD+KLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHTAVSFNK
Sbjct: 61  QNSPDLKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKI 240
            YSL+TLN+VVEKLCGCKLVDEAKFLVMKL EWIKAD VTYK LIKGFCNVGDL+EASK+
Sbjct: 181 CYSLVTLNRVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLC 300
           WNLMVDEGFEPEMEAVEEMMNVLFKTNK DEALKLFQA+RS+RMNDLIPSTYSLVIRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKLDEALKLFQALRSDRMNDLIPSTYSLVIRWLC 300

Query: 301 NKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHA 360
           NK KV QA++VFDEMHKRGLE DNS HSSLIYGLLARGRR EAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVGQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
            IKGLL+LKRANEATQVFREM+ERGCEPIMHTYIMLLQGHLGKRGRKG DPLVNFD+IFV
Sbjct: 361 FIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGSDPLVNFDSIFV 420

Query: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGN+LREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNKLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRDRATT 506
           DLADIFQRYGEKMTTRDRRR+RA T
Sbjct: 481 DLADIFQRYGEKMTTRDRRRNRAVT 502

BLAST of Cla022020 vs. TrEMBL
Match: M5Y2L9_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015711mg PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 8.4e-201
Identity = 338/461 (73.32%), Postives = 397/461 (86.12%), Query Frame = 1

Query: 42  PSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRF 101
           PS +NP HL RVCT+LYQQQNSP+ +LHS L + NF L+HEFFLQVCN+FPLSWRPVY F
Sbjct: 6   PSPVNPAHLLRVCTILYQQQNSPESRLHSNLNSSNFQLTHEFFLQVCNSFPLSWRPVYLF 65

Query: 102 FQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAA 161
           F +T+T PNFTHT VSFNK++DV+GK+RNI LLW ++ EMGRRRLV DKTF++AL+TLA 
Sbjct: 66  FLYTQTHPNFTHTTVSFNKMVDVIGKARNIQLLWDMLHEMGRRRLVNDKTFLIALKTLAK 125

Query: 162 ARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTY 221
           AREL KCVEFFH+M+GYGF YSL TLNKVVE LCG KLV EAKF+V KLKE I  + VTY
Sbjct: 126 ARELNKCVEFFHVMNGYGFDYSLETLNKVVESLCGSKLVVEAKFIVFKLKESIGPNGVTY 185

Query: 222 KWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRS 281
           + LI+GFC+VGDL+EASKIWNLMVDEGF+P++ A+E+MM  LFKTN++ EALK+FQ +R 
Sbjct: 186 RCLIEGFCDVGDLIEASKIWNLMVDEGFDPDIGAIEKMMETLFKTNRYGEALKVFQMMRV 245

Query: 282 NRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRG 341
           NRM+DL  STY LVI W+C   K+++A+VVF+EM KR +EADNS  +SL+YGLLARGR  
Sbjct: 246 NRMDDLGLSTYRLVIEWMCKSGKIEEAHVVFEEMQKRRIEADNSTLASLVYGLLARGRVR 305

Query: 342 EAYNIMRRIENPDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHL 401
            AY I+  IE PD+ V+H +IKGLLRL++  EAT+VFREM+++GCEP MHTYIMLLQGHL
Sbjct: 306 VAYKIVEGIEKPDINVFHGMIKGLLRLRKLREATEVFREMIKKGCEPNMHTYIMLLQGHL 365

Query: 402 GKRGRKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNE 461
           GKRGRKG DPLVNFDTIFVGGLVK GKSLEATKYVER++KRGLEVPRFDYNKFLHYYSNE
Sbjct: 366 GKRGRKGSDPLVNFDTIFVGGLVKAGKSLEATKYVERVIKRGLEVPRFDYNKFLHYYSNE 425

Query: 462 EGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRRDR 503
           EGV MF EVG +LREVGLVDLADIFQRYGEKM TRDRRR+R
Sbjct: 426 EGVGMFEEVGKKLREVGLVDLADIFQRYGEKMATRDRRRNR 466

BLAST of Cla022020 vs. TrEMBL
Match: A0A067KYM3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15698 PE=4 SV=1)

HSP 1 Score: 687.2 bits (1772), Expect = 1.5e-194
Identity = 329/496 (66.33%), Postives = 402/496 (81.05%), Query Frame = 1

Query: 10  IINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQQNSPDIKLH 69
           I+N+F     +HQ    F   +TA+  P  + P  + P +L RVCT+LYQQQNSPD KL+
Sbjct: 12  ILNSF-----YHQPRQLFRLLTTASIQPQPSPPCPVKPDYLLRVCTILYQQQNSPDSKLY 71

Query: 70  SKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPN--FTHTAVSFNKLIDVVGK 129
           SKL +CNF+L+HEFFLQVCN FP SWRPVYRFFQ+T   PN  F HT++SFNK++DV+GK
Sbjct: 72  SKLSSCNFHLTHEFFLQVCNKFPYSWRPVYRFFQYTRQTPNALFAHTSISFNKMLDVIGK 131

Query: 130 SRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFGYSLMTL 189
           SRNI+L W  +QEM +  LV DKTF++AL+TL  ARELKKCVEFFHLM+ YG+ Y +  L
Sbjct: 132 SRNINLFWDTIQEMAKIGLVNDKTFIIALKTLGLARELKKCVEFFHLMNSYGYEYRVERL 191

Query: 190 NKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKIWNLMVDE 249
           NKVVE LC  KLV+EAKF+V+KLKEWIKA+E+TY WL+ GFC++GD++EASKIWNLMVDE
Sbjct: 192 NKVVESLCKDKLVEEAKFVVLKLKEWIKANEITYGWLVIGFCDMGDMIEASKIWNLMVDE 251

Query: 250 GFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLCNKAKVQQ 309
           GFEP +   E+M+   FK N+++EA+KLFQ +R  +M+DL  STY LVI W+C + K+ Q
Sbjct: 252 GFEPGIHVYEKMIETFFKRNEYNEAVKLFQTMRVKKMDDLGLSTYRLVIDWMCKRGKISQ 311

Query: 310 AYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHALIKGLLR 369
           A ++FDEM KRG+EADN    SLIYGLLARGR  EAY ++  IE PD+ VYH +IKGLLR
Sbjct: 312 AKMMFDEMSKRGIEADNLTLGSLIYGLLARGRVNEAYKVVETIEKPDISVYHGMIKGLLR 371

Query: 370 LKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFVGGLVKNG 429
           L++A+EATQVFREM++RGCEP MHTY+MLLQGHLGKRGRKG DPLVNFDTIFVGGLVK G
Sbjct: 372 LRKASEATQVFREMIKRGCEPTMHTYVMLLQGHLGKRGRKGKDPLVNFDTIFVGGLVKAG 431

Query: 430 KSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQ 489
           KSLEAT+YVER +  GLEVPRFDYNKFLHYYS+EEG V+F E+G +LRE GLVDLADI +
Sbjct: 432 KSLEATQYVERTINGGLEVPRFDYNKFLHYYSSEEGGVIFEEMGKKLREAGLVDLADILE 491

Query: 490 RYGEKMTTRDRRRDRA 504
           RYGEKMTTR+RRR+RA
Sbjct: 492 RYGEKMTTRERRRNRA 502

BLAST of Cla022020 vs. TrEMBL
Match: F6HFU4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00520 PE=4 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 1.0e-190
Identity = 321/460 (69.78%), Postives = 383/460 (83.26%), Query Frame = 1

Query: 43  SAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFF 102
           S +NP HL RVCTVLYQQQNSP+++L + L AC F+L+HEFFLQVCN FPLSWRPVY+FF
Sbjct: 34  SIVNPNHLLRVCTVLYQQQNSPEVRLQTHLRACEFHLTHEFFLQVCNKFPLSWRPVYKFF 93

Query: 103 QFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAA 162
           +FTET P F H +V+FNK++DV+G+SRNI L W ++QEMGRRRL  DKTFV+AL+TLA+ 
Sbjct: 94  EFTETQPCFHHNSVTFNKMVDVIGRSRNIKLFWEVLQEMGRRRLANDKTFVIALKTLASI 153

Query: 163 RELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYK 222
           RE+KKCVEFFHLM+ + +GYSL TLNKVVE LC  KL  EAK +V+KLK WI    VTY 
Sbjct: 154 REMKKCVEFFHLMNAHEYGYSLETLNKVVEVLCRSKLAVEAKEIVLKLKTWIPPSGVTYG 213

Query: 223 WLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSN 282
           +LIKGFC VGDL+EA+ +W+LMVDEGF P ++AVE+MM  LF  N+FDEA+KLFQAVR+ 
Sbjct: 214 YLIKGFCEVGDLIEAANVWDLMVDEGFRPGIDAVEKMMETLFNINRFDEAMKLFQAVRTT 273

Query: 283 RMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGE 342
           R ++L+ STYSLVI W+C + KV QAY+VF+EM KRG++ DN   SSLIYGLLA+GR  E
Sbjct: 274 RFDELVLSTYSLVIDWMCKRGKVSQAYMVFEEMLKRGIQPDNKTMSSLIYGLLAKGRVRE 333

Query: 343 AYNIMRRIENPDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLG 402
           A  I   IE PD+ VYH +IKGLLRL++A EATQVFREM+ RGCEP MHTYIMLLQGHLG
Sbjct: 334 ANKITEGIERPDIAVYHGVIKGLLRLRKAGEATQVFREMIRRGCEPTMHTYIMLLQGHLG 393

Query: 403 KRGRKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEE 462
           K+GRKG  PLVNFD+IFVGGL+K GKSL+A+KYVER+M  G+EVPRFDYN+FLH YSNEE
Sbjct: 394 KKGRKGPHPLVNFDSIFVGGLIKVGKSLDASKYVERMMDGGMEVPRFDYNRFLHCYSNEE 453

Query: 463 GVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRRDR 503
           GV MF EVG +LREVGLVDLADIF RYGEKM TRDRRR+R
Sbjct: 454 GVFMFEEVGKKLREVGLVDLADIFLRYGEKMATRDRRRNR 493

BLAST of Cla022020 vs. TrEMBL
Match: B9IJ07_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0016s08740g PE=4 SV=1)

HSP 1 Score: 674.1 bits (1738), Expect = 1.3e-190
Identity = 328/500 (65.60%), Postives = 399/500 (79.80%), Query Frame = 1

Query: 10  IINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQQNSPDIKLH 69
           ++N+FS     H+     T++S+   P   + PS +N  HL RVCTVL+QQQ+S D KL 
Sbjct: 4   LLNSFSHRQ-HHRCLCLLTTESSQTYP---SSPSPVNEDHLLRVCTVLFQQQDSSDFKLR 63

Query: 70  SKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDP--NFTHTAVSFNKLIDVVGK 129
           +KL + +FNL+HEFFLQVCN FP SWRPV+RFFQ+T+  P   FTHT+VS NK++D+ G+
Sbjct: 64  NKLSSIDFNLTHEFFLQVCNKFPASWRPVHRFFQYTQQMPCSRFTHTSVSLNKMLDIFGR 123

Query: 130 SRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFGYSLMTL 189
           SRN+DLLWG VQEM +R LV DKTF++ L+ LA+ARELKKC EFFH M+ +G  Y +  L
Sbjct: 124 SRNLDLLWGAVQEMAKRGLVNDKTFIIVLKALASARELKKCAEFFHFMNEHGCEYRVERL 183

Query: 190 NKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKIWNLMVDE 249
           NKVVE LC  KLV+EAKF+V+KLK+WI+ D VTY WL+KGFC+VG+L+EASKIWNLMVDE
Sbjct: 184 NKVVENLCKGKLVEEAKFVVLKLKDWIRPDGVTYGWLVKGFCDVGELIEASKIWNLMVDE 243

Query: 250 GFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLCNKAKVQQ 309
             EPE+E  E+MM  LFK N++DEALK+FQ +R NRM+DL  STY LVI W+C K KV Q
Sbjct: 244 SIEPEIEVFEKMMETLFKRNEYDEALKVFQTMRVNRMDDLALSTYRLVIDWMCRKGKVVQ 303

Query: 310 AYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHALIKGLLR 369
           A +VFDEM +RG++ADNS   SL+YGLL RGR  EA+ ++ RIE  D+ VYH LIKGL+R
Sbjct: 304 AQMVFDEMRQRGIQADNSTLGSLVYGLLTRGRHAEAHKVVERIEKTDISVYHGLIKGLVR 363

Query: 370 LKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFVGGLVKNG 429
            +RA+EATQVFREM+ RGCEP MHTYIMLLQGHLGKRGRKG DPLVNF++IFVGGL+K G
Sbjct: 364 SRRASEATQVFREMINRGCEPTMHTYIMLLQGHLGKRGRKGPDPLVNFESIFVGGLIKAG 423

Query: 430 KSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQ 489
           KSLEATKYVER MK  LEVPRFDYNKFLHYYS+EEGVVMF+EVG +LRE G VDLADI Q
Sbjct: 424 KSLEATKYVERTMKGSLEVPRFDYNKFLHYYSSEEGVVMFKEVGKKLREAGFVDLADILQ 483

Query: 490 RYGEKMTTRDRRRDRATTVS 508
           RYGEKM TR+RRR R+  V+
Sbjct: 484 RYGEKMATRERRRRRSELVN 499

BLAST of Cla022020 vs. NCBI nr
Match: gi|659089350|ref|XP_008445460.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis melo])

HSP 1 Score: 936.8 bits (2420), Expect = 1.6e-269
Identity = 462/505 (91.49%), Postives = 481/505 (95.25%), Query Frame = 1

Query: 1   MAAITNRRIIINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQ 60
           MAAIT RRIIINNFSFSFIFHQRF+PFTSDSTAA    +NIPSAI+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSTAA----DNIPSAIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSPDIKLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHT VSFNK
Sbjct: 61  QNSPDIKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTVVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKI 240
            YSL+TLNKVVE LCGCKLVDEAKFLVMKL EWIKAD VTYK LIKGFCNVGDL+EASK+
Sbjct: 181 CYSLVTLNKVVENLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLC 300
           WNLMVDEGFEPEMEAVEEM+NVLFKTNK DEALKLFQAVRSNRMNDLIPSTY LVIRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMVNVLFKTNKLDEALKLFQAVRSNRMNDLIPSTYRLVIRWLC 300

Query: 301 NKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHA 360
           NK KV+QA++VFDEMHKRGLE DNS HSSLIYGLLARGRR EAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVRQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLL+LKRANEATQVFREM+ERGCEPIMHTYIMLLQGHLGKRGRKGLDP VNFD+IFV
Sbjct: 361 LIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGLDPFVNFDSIFV 420

Query: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKNGKSLEATKYVER+MKRGLEVPRFDYNKFLHYYSN+EGVVMFREVGNRLREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERIMKRGLEVPRFDYNKFLHYYSNDEGVVMFREVGNRLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRDRATT 506
           DLADIFQRYGEKMTTRDRRR+RA T
Sbjct: 481 DLADIFQRYGEKMTTRDRRRNRAAT 501

BLAST of Cla022020 vs. NCBI nr
Match: gi|449453031|ref|XP_004144262.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis sativus])

HSP 1 Score: 931.4 bits (2406), Expect = 6.7e-268
Identity = 459/505 (90.89%), Postives = 480/505 (95.05%), Query Frame = 1

Query: 1   MAAITNRRIIINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQ 60
           MAAIT RRIIINNFSFSFIFHQRF+PFTSDS+ AA   +NIP  I+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSSTAA---DNIPQPIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSPD+KLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHTAVSFNK
Sbjct: 61  QNSPDLKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKI 240
            YSL+TLN+VVEKLCGCKLVDEAKFLVMKL EWIKAD VTYK LIKGFCNVGDL+EASK+
Sbjct: 181 CYSLVTLNRVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLC 300
           WNLMVDEGFEPEMEAVEEMMNVLFKTNK DEALKLFQA+RS+RMNDLIPSTYSLVIRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKLDEALKLFQALRSDRMNDLIPSTYSLVIRWLC 300

Query: 301 NKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHA 360
           NK KV QA++VFDEMHKRGLE DNS HSSLIYGLLARGRR EAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVGQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
            IKGLL+LKRANEATQVFREM+ERGCEPIMHTYIMLLQGHLGKRGRKG DPLVNFD+IFV
Sbjct: 361 FIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGSDPLVNFDSIFV 420

Query: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGN+LREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNKLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRDRATT 506
           DLADIFQRYGEKMTTRDRRR+RA T
Sbjct: 481 DLADIFQRYGEKMTTRDRRRNRAVT 502

BLAST of Cla022020 vs. NCBI nr
Match: gi|596297449|ref|XP_007227334.1| (hypothetical protein PRUPE_ppa015711mg, partial [Prunus persica])

HSP 1 Score: 708.0 bits (1826), Expect = 1.2e-200
Identity = 338/461 (73.32%), Postives = 397/461 (86.12%), Query Frame = 1

Query: 42  PSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRF 101
           PS +NP HL RVCT+LYQQQNSP+ +LHS L + NF L+HEFFLQVCN+FPLSWRPVY F
Sbjct: 6   PSPVNPAHLLRVCTILYQQQNSPESRLHSNLNSSNFQLTHEFFLQVCNSFPLSWRPVYLF 65

Query: 102 FQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAA 161
           F +T+T PNFTHT VSFNK++DV+GK+RNI LLW ++ EMGRRRLV DKTF++AL+TLA 
Sbjct: 66  FLYTQTHPNFTHTTVSFNKMVDVIGKARNIQLLWDMLHEMGRRRLVNDKTFLIALKTLAK 125

Query: 162 ARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTY 221
           AREL KCVEFFH+M+GYGF YSL TLNKVVE LCG KLV EAKF+V KLKE I  + VTY
Sbjct: 126 ARELNKCVEFFHVMNGYGFDYSLETLNKVVESLCGSKLVVEAKFIVFKLKESIGPNGVTY 185

Query: 222 KWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRS 281
           + LI+GFC+VGDL+EASKIWNLMVDEGF+P++ A+E+MM  LFKTN++ EALK+FQ +R 
Sbjct: 186 RCLIEGFCDVGDLIEASKIWNLMVDEGFDPDIGAIEKMMETLFKTNRYGEALKVFQMMRV 245

Query: 282 NRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRG 341
           NRM+DL  STY LVI W+C   K+++A+VVF+EM KR +EADNS  +SL+YGLLARGR  
Sbjct: 246 NRMDDLGLSTYRLVIEWMCKSGKIEEAHVVFEEMQKRRIEADNSTLASLVYGLLARGRVR 305

Query: 342 EAYNIMRRIENPDLGVYHALIKGLLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHL 401
            AY I+  IE PD+ V+H +IKGLLRL++  EAT+VFREM+++GCEP MHTYIMLLQGHL
Sbjct: 306 VAYKIVEGIEKPDINVFHGMIKGLLRLRKLREATEVFREMIKKGCEPNMHTYIMLLQGHL 365

Query: 402 GKRGRKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNE 461
           GKRGRKG DPLVNFDTIFVGGLVK GKSLEATKYVER++KRGLEVPRFDYNKFLHYYSNE
Sbjct: 366 GKRGRKGSDPLVNFDTIFVGGLVKAGKSLEATKYVERVIKRGLEVPRFDYNKFLHYYSNE 425

Query: 462 EGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRRDR 503
           EGV MF EVG +LREVGLVDLADIFQRYGEKM TRDRRR+R
Sbjct: 426 EGVGMFEEVGKKLREVGLVDLADIFQRYGEKMATRDRRRNR 466

BLAST of Cla022020 vs. NCBI nr
Match: gi|645229514|ref|XP_008221500.1| (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g26500 [Prunus mume])

HSP 1 Score: 693.7 bits (1789), Expect = 2.4e-196
Identity = 340/481 (70.69%), Postives = 400/481 (83.16%), Query Frame = 1

Query: 23  RFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQQNSPDIKLHSKLLACNFNLSHE 82
           RF    S  T  A P  +  S +NP HL RVCT+LYQQQNSP  +    L + NF L+HE
Sbjct: 18  RFITTESTQTQLATPSPS-SSPVNPAHLLRVCTILYQQQNSPASRX-PHLNSSNFQLTHE 77

Query: 83  FFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMG 142
           FFLQVCN+FPLSWRPVY FF +T+T PNFTHT VSFNK++DV+GK+RNI LLW ++ EMG
Sbjct: 78  FFLQVCNSFPLSWRPVYLFFLYTQTHPNFTHTTVSFNKMVDVIGKARNIQLLWDMLHEMG 137

Query: 143 RRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCGCKLVDE 202
           RRRLV DKTF++AL+TLA AREL KCVEFFH+M+GYGF YSL TLNKVVE LCG KLV E
Sbjct: 138 RRRLVNDKTFLIALKTLAKARELNKCVEFFHVMNGYGFDYSLETLNKVVESLCGSKLVAE 197

Query: 203 AKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKIWNLMVDEGFEPEMEAVEEMMNV 262
           AKF+V KLKE I  + VTY+ LI+GFC+VGDL+EASKIWNLMVD+GF+P++ A+E+MM  
Sbjct: 198 AKFIVFKLKESIGPNGVTYRCLIEGFCDVGDLIEASKIWNLMVDQGFDPDIGAIEKMMEP 257

Query: 263 LFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLCNKAKVQQAYVVFDEMHKRGLEA 322
           LFKTN++ EALK+FQ +R NRM+DL  STY LVI W+C + K+++A+VVF+EM KR +EA
Sbjct: 258 LFKTNRYGEALKVFQMMRVNRMDDLGLSTYRLVIEWMCKRGKIEEAHVVFEEMQKRRIEA 317

Query: 323 DNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHALIKGLLRLKRANEATQVFREMV 382
           DNS  +SL YGLLARGR   AY I+  IE PD+ V+H +IKGLLRL++  EAT+VFREM+
Sbjct: 318 DNSRLASLAYGLLARGRVRVAYKIVEGIEKPDINVFHGMIKGLLRLRKLREATEVFREMI 377

Query: 383 ERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFVGGLVKNGKSLEATKYVERLMKR 442
           ++GCEP MHTYIMLLQGHLGKRGRKG DPLVNFDTIFVGGLVK GKSLEATKYVER++KR
Sbjct: 378 KKGCEPNMHTYIMLLQGHLGKRGRKGSDPLVNFDTIFVGGLVKAGKSLEATKYVERVIKR 437

Query: 443 GLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRRDR 502
           GLEVPRFDYNKFLHYYSNEEGV MF EVG +LREVGLVDLADIFQRYGEKM TRDRRR+R
Sbjct: 438 GLEVPRFDYNKFLHYYSNEEGVEMFEEVGKKLREVGLVDLADIFQRYGEKMATRDRRRNR 496

Query: 503 A 504
           A
Sbjct: 498 A 496

BLAST of Cla022020 vs. NCBI nr
Match: gi|694388801|ref|XP_009370072.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 690.6 bits (1781), Expect = 2.0e-195
Identity = 336/499 (67.33%), Postives = 405/499 (81.16%), Query Frame = 1

Query: 6   NRRIIINNFSFSFIFHQRFTPFTSDSTAAAPPHNNIPSAINPTHLRRVCTVLYQQQNSPD 65
           N R++I   + S +    F  FT+  +         PS ++P  L RVCT+LYQQQNSP+
Sbjct: 12  NSRMLIRQGTRSLLIPNFFRRFTTTESTQPQLAAAAPSPVDPARLLRVCTILYQQQNSPE 71

Query: 66  IKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETD-PNFTHTAVSFNKLIDV 125
            +LHS L +  F L+HEFFLQVCN+FPLSWRPVY FF++T++  P+FTHTAVSFNK++DV
Sbjct: 72  SRLHSNLNSSAFQLTHEFFLQVCNSFPLSWRPVYLFFRYTQSHHPDFTHTAVSFNKMLDV 131

Query: 126 VGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFGYSL 185
           +GKSRNI L W  + +MGRRRLV  KTF++ALRTLA AREL KCV+FFH M+G+GFGY L
Sbjct: 132 IGKSRNIQLFWDTLHKMGRRRLVNRKTFLIALRTLAKARELNKCVDFFHAMNGFGFGYDL 191

Query: 186 MTLNKVVEKLCGCKLVDEAKFLVMKLKEWIKADEVTYKWLIKGFCNVGDLVEASKIWNLM 245
             LN VVE LCG KLV EA+FLV+KLKE I+ D  TY+ LI+GFC+VGDL+EASKIWNLM
Sbjct: 192 GNLNMVVEDLCGSKLVVEARFLVLKLKESIRPDGFTYRCLIQGFCDVGDLIEASKIWNLM 251

Query: 246 VDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLVIRWLCNKAK 305
           VDEGF+P+++AVE+MM  LFKTN++ EALK+FQ +R NRM DL  STY LVI W+C + K
Sbjct: 252 VDEGFDPDIDAVEKMMETLFKTNRYGEALKVFQMMRVNRMEDLGLSTYRLVIEWMCKRGK 311

Query: 306 VQQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRGEAYNIMRRIENPDLGVYHALIKG 365
           +++A+ VF+EMHKR +EADNS  +SL+YGLLARGR   A+ ++  IE PD+ +YH LIKG
Sbjct: 312 IEEAHEVFEEMHKRRIEADNSTLASLVYGLLARGRVTMAFKVVEGIEKPDISLYHGLIKG 371

Query: 366 LLRLKRANEATQVFREMVERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFVGGLV 425
           LLRL++A +AT+VFREM+ R CEP MHTYIMLLQGHLGKRGRKG DPLVNFDTIFVGGLV
Sbjct: 372 LLRLRKARDATEVFREMIRRRCEPNMHTYIMLLQGHLGKRGRKGSDPLVNFDTIFVGGLV 431

Query: 426 KNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLAD 485
           K GKSLEATKYVER++KRGLEVPRFDYNKFLHYYSNEEGV MF EVG + REVGLVDLAD
Sbjct: 432 KAGKSLEATKYVERVIKRGLEVPRFDYNKFLHYYSNEEGVEMFEEVGKKFREVGLVDLAD 491

Query: 486 IFQRYGEKMTTRDRRRDRA 504
           IFQRYGEKM TRDRRR+RA
Sbjct: 492 IFQRYGEKMATRDRRRNRA 510

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR59_ARATH4.5e-17960.20Putative pentatricopeptide repeat-containing protein At1g26500 OS=Arabidopsis th... [more]
PP293_ARATH3.5e-6730.25Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop... [more]
PP294_ARATH1.3e-6630.04Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidop... [more]
PP382_ARATH1.8e-6630.04Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidop... [more]
PP275_ARATH8.0e-2724.38Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0KD74_CUCSA4.7e-26890.89Uncharacterized protein OS=Cucumis sativus GN=Csa_6G366430 PE=4 SV=1[more]
M5Y2L9_PRUPE8.4e-20173.32Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015711mg PE=4 S... [more]
A0A067KYM3_JATCU1.5e-19466.33Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15698 PE=4 SV=1[more]
F6HFU4_VITVI1.0e-19069.78Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00520 PE=4 SV=... [more]
B9IJ07_POPTR1.3e-19065.60Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
gi|659089350|ref|XP_008445460.1|1.6e-26991.49PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucum... [more]
gi|449453031|ref|XP_004144262.1|6.7e-26890.89PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucum... [more]
gi|596297449|ref|XP_007227334.1|1.2e-20073.32hypothetical protein PRUPE_ppa015711mg, partial [Prunus persica][more]
gi|645229514|ref|XP_008221500.1|2.4e-19670.69PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro... [more]
gi|694388801|ref|XP_009370072.1|2.0e-19567.33PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 isofor... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla022020Cla022020.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 418..444
score: 1.0coord: 259..281
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 214..244
score: 6.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 291..334
score: 4.5E-10coord: 353..399
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 291..323
score: 5.7E-6coord: 357..388
score: 1.1E-7coord: 220..251
score: 4.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 354..388
score: 12.386coord: 412..446
score: 5.788coord: 217..251
score: 12.09coord: 288..322
score: 9.624coord: 183..213
score: 5.875coord: 114..148
score: 7.476coord: 323..353
score: 6.007coord: 252..286
score:
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 253..383
score: 5.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 60..488
score: 8.1E-235coord: 8..42
score: 8.1E
NoneNo IPR availablePANTHERPTHR24015:SF584SUBFAMILY NOT NAMEDcoord: 8..42
score: 8.1E-235coord: 60..488
score: 8.1E

The following gene(s) are paralogous to this gene:

None