CmaCh20G006320 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G006320
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: APO RNA-binding protein
Location: Cma_Chr20 : 2943975 .. 2946458 (+)
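The length of the feature can be derived from the location string. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page), and `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its parts (hypothetical helper)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    length = end - start + 1
    return seqid, start, end, strand, length

print(parse_location("Cma_Chr20 : 2943975 .. 2946458 (+)"))
```

Under that assumption the gene spans 2,484 bp on the forward strand of Cma_Chr20.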
The following sequences are available for this feature:

Gene sequence (with intron)

TTCATTTTGATCCATTGCCCTCGTCTCTGTGGCGCCCTCGTTTTCCCTCTGGACATCTGGAACTGCAAATCTTCATTTGGTCTCACAATTCTGCAAATTAAGTCTATGGAAATGCCTGTTCTTCCTCCTGTCCGCGAATCTGGCTTCAATTCTCCGTCGGCGTCGCTGTTAGTTGTTTCATCTTGTCTCTGAAAGCTCTCTGTCTCTCTGCGGTCTTCTCTCGAAATCTCATCCTTTGTTCTTTAAATCATCTATCTTACTCGCAACCCGAGCACTTTCGCCGTTGAATTCGAAGGCTCATGTGATTGGATTAAGATCTTAATTTATGCCTTCAAGCTTGTTGCTTGTCGTTTTCCTAAGATTTGATAAATGTTAGCGCTCGCATCATGAAAACTATGCCTTTTATGGCTCTTAGAAGGAAGTTCTGCGAGAATATTGTGCAGGAATTCATGCTACATCGATGCTACAGTTCTAAAGTCGATTTGAAGAAGCTTCGTCCAATGATTTTGAAGAGAATTCAAAATCGGGCTAATAACTGTCCTATAAAGGGTCTAATTCCAGTGGCACAGCAAGTCTTTGAAGCTAGGGCAATACTTATTCATGGAGTTTCCACTCTTCTCAAAGAATTTCCGGTTGTCTCTTGCAAGTCAGTGTCCCGGGAACAAACTATCTTTAATTTCTCATTTTTGTGTTATGATTTTACTCTGTTTCGTTTTCTTTTACTTAGTTTTTATCTGAAGTTGCATGTCCCTTGAGCTAGTATGTGTAATAGGAACGTACCATTTTGAAATGCTGATTGGCCTTGCTTATCATCTAGCTATTACCTGACCATTCCTGCGCAGGTAGTGAGGTCAAAAAATTCATCAATGCATATTATCTAGTTCTTTATGCACACTTGGGCCGTCTGTGAAGTCTGTTCTTTCTGAAATTTAGATTGAACAATCCTGAGGTATTAATATGAATGCATGTTTGAGAATTCTGCTTCTAAAATTCGTAGATTACTGGAGGTGCCCGGAATTCATGAATTCTCTGGTAAAGGGGAGTCGAGGTTGAATTCTTTGGGCCCTTTGTATACACTGCCCGTCGGTCATACCGATTGAATGGTCCGGTGAAGTGTTCGGATCGCGATGACGTGGGCGGTTTGCTGCCTGCGACATCATGAGAAGTTCTCATTAAACCTTATCATTTAGAGGAAGAAAAAGTTGTAACAAGGTTTCCGTAGGTGAACCTGCAGAAGGATCATTGTCGATGCCTAAACATCAAACGACCCATGAACACGTTTAAGTCTGTTCTTTTTGAAGATTATATTAAAAACTCTCCGCTGTTAACGTGAATACATGTTTGTGAAGGCTGTTTCTGATACTTGTAGGATCATTTTGTTGTTTTTATAGATTTTGTCCAGAAGTATATGTTGGTGAGAAAGGTCATCTGATACGAAGCTGTGGCGGTTACAAACGCGGGCCTAAGAATCAGGTTCATGAGTGGATCAGAGGTGGTTTGAATGATGTCCTTGTCCCAGTAGAAGCATTTCACCGCCATCACATTTTCGAAAAGGTTAATGAGCATGATGAGAGGTTCAAGTTTGAGCGTGTTCCAGCAGTTGTGGAACTATGCTGGCAAGCTGGAGCCAATACAAATGATGAAAATCTGTCTTCAAGCACTTGGAATTCAGTTGGTGGTGGTGGTGGTGGTGGCTCAGGCAGGGATGAACCTTTATCAGGTAATGAGATGAGGCTTTTGGCAACCGAAACTCTCAGGGCGTGGGAAACAGTTCGAACGGGCGTGCAGAAACTGTTAATGGTATATCCTGCTAAGATCTGTAATTACTGCTCAGACGTTCATGTTGGGCCATCAGTACACAAAGCTAGACCATGTGGAGTTTTTAAATGTGAGAGTTGGCGAGGATCCCACTTCTGGGAGAAGGCGGATGTAGATGATTTAGTTCCTCCGAAGATCGTATGGCATCGGCGACCTCAAGATCCTCCCGTGCTCGTCGACGA
AGGGAGGGATTATTACGGGCACTCGCCAGCAGTCCTGGCTCTTTGCACACATGCTGGTGCCATTGCACCACCTAAGTATCATTGTATGATGAAAGTTCAAGGCTTACCACCTCAGAGTCACCTCAAGCTATGAAATCTTCTTTTGTTCGACTTCACATGGAGCTTGACAAGAACACTCGACTCGATTACAGTTCATGACCTTCAGGTGACCGTGCCTATTTATTTTCTTTCTTCCTGCTTTATCTGTTTGGTTGATCTAGTAAACATATAGAAAAAGTAACATGATGAACATCAAGCATAGCAACTTGGGAAAAGTTATCAAAAAGTTGAAGAATTTTGAACTTGAGAATGTCTTGTAGATTCGTCTTTCATTCTTTTCAGTAGAATATTCATGTCACGGGAGGCACTTTATAGAAAAGTTGGATGTAAAATCTAGATTTATATCCCTGCTTAGAAATTGTTGTTTTTTTTTACCCAACTTTAT

mRNA sequence

TTCATTTTGATCCATTGCCCTCGTCTCTGTGGCGCCCTCGTTTTCCCTCTGGACATCTGGAACTGCAAATCTTCATTTGGTCTCACAATTCTGCAAATTAAGTCTATGGAAATGCCTGTTCTTCCTCCTGTCCGCGAATCTGGCTTCAATTCTCCGTCGGCGTCGCTGTTAGTTGTTTCATCTTGTCTCTGAAAGCTCTCTGTCTCTCTGCGGTCTTCTCTCGAAATCTCATCCTTTGTTCTTTAAATCATCTATCTTACTCGCAACCCGAGCACTTTCGCCGTTGAATTCGAAGGCTCATGTGATTGGATTAAGATCTTAATTTATGCCTTCAAGCTTGTTGCTTGTCGTTTTCCTAAGATTTGATAAATGTTAGCGCTCGCATCATGAAAACTATGCCTTTTATGGCTCTTAGAAGGAAGTTCTGCGAGAATATTGTGCAGGAATTCATGCTACATCGATGCTACAGTTCTAAAGTCGATTTGAAGAAGCTTCGTCCAATGATTTTGAAGAGAATTCAAAATCGGGCTAATAACTGTCCTATAAAGGGTCTAATTCCAGTGGCACAGCAAGTCTTTGAAGCTAGGGCAATACTTATTCATGGAGTTTCCACTCTTCTCAAAGAATTTCCGGTTGTCTCTTGCAAATTTTGTCCAGAAGTATATGTTGGTGAGAAAGGTCATCTGATACGAAGCTGTGGCGGTTACAAACGCGGGCCTAAGAATCAGGTTCATGAGTGGATCAGAGGTGGTTTGAATGATGTCCTTGTCCCAGTAGAAGCATTTCACCGCCATCACATTTTCGAAAAGGTTAATGAGCATGATGAGAGGTTCAAGTTTGAGCGTGTTCCAGCAGTTGTGGAACTATGCTGGCAAGCTGGAGCCAATACAAATGATGAAAATCTGTCTTCAAGCACTTGGAATTCAGTTGGTGGTGGTGGTGGTGGTGGCTCAGGCAGGGATGAACCTTTATCAGGTAATGAGATGAGGCTTTTGGCAACCGAAACTCTCAGGGCGTGGGAAACAGTTCGAACGGGCGTGCAGAAACTGTTAATGGTATATCCTGCTAAGATCTGTAATTACTGCTCAGACGTTCATGTTGGGCCATCAGTACACAAAGCTAGACCATGTGGAGTTTTTAAATGTGAGAGTTGGCGAGGATCCCACTTCTGGGAGAAGGCGGATGTAGATGATTTAGTTCCTCCGAAGATCGTATGGCATCGGCGACCTCAAGATCCTCCCGTGCTCGTCGACGAAGGGAGGGATTATTACGGGCACTCGCCAGCAGTCCTGGCTCTTTGCACACATGCTGGTGCCATTGCACCACCTAAGTATCATTGTATGATGAAAGTTCAAGGCTTACCACCTCAGAGTCACCTCAAGCTATGAAATCTTCTTTTGTTCGACTTCACATGGAGCTTGACAAGAACACTCGACTCGATTACAGTTCATGACCTTCAGGTGACCGTGCCTATTTATTTTCTTTCTTCCTGCTTTATCTGTTTGGTTGATCTAGTAAACATATAGAAAAAGTAACATGATGAACATCAAGCATAGCAACTTGGGAAAAGTTATCAAAAAGTTGAAGAATTTTGAACTTGAGAATGTCTTGTAGATTCGTCTTTCATTCTTTTCAGTAGAATATTCATGTCACGGGAGGCACTTTATAGAAAAGTTGGATGTAAAATCTAGATTTATATCCCTGCTTAGAAATTGTTGTTTTTTTTTACCCAACTTTAT

Coding sequence (CDS)

ATGAAAACTATGCCTTTTATGGCTCTTAGAAGGAAGTTCTGCGAGAATATTGTGCAGGAATTCATGCTACATCGATGCTACAGTTCTAAAGTCGATTTGAAGAAGCTTCGTCCAATGATTTTGAAGAGAATTCAAAATCGGGCTAATAACTGTCCTATAAAGGGTCTAATTCCAGTGGCACAGCAAGTCTTTGAAGCTAGGGCAATACTTATTCATGGAGTTTCCACTCTTCTCAAAGAATTTCCGGTTGTCTCTTGCAAATTTTGTCCAGAAGTATATGTTGGTGAGAAAGGTCATCTGATACGAAGCTGTGGCGGTTACAAACGCGGGCCTAAGAATCAGGTTCATGAGTGGATCAGAGGTGGTTTGAATGATGTCCTTGTCCCAGTAGAAGCATTTCACCGCCATCACATTTTCGAAAAGGTTAATGAGCATGATGAGAGGTTCAAGTTTGAGCGTGTTCCAGCAGTTGTGGAACTATGCTGGCAAGCTGGAGCCAATACAAATGATGAAAATCTGTCTTCAAGCACTTGGAATTCAGTTGGTGGTGGTGGTGGTGGTGGCTCAGGCAGGGATGAACCTTTATCAGGTAATGAGATGAGGCTTTTGGCAACCGAAACTCTCAGGGCGTGGGAAACAGTTCGAACGGGCGTGCAGAAACTGTTAATGGTATATCCTGCTAAGATCTGTAATTACTGCTCAGACGTTCATGTTGGGCCATCAGTACACAAAGCTAGACCATGTGGAGTTTTTAAATGTGAGAGTTGGCGAGGATCCCACTTCTGGGAGAAGGCGGATGTAGATGATTTAGTTCCTCCGAAGATCGTATGGCATCGGCGACCTCAAGATCCTCCCGTGCTCGTCGACGAAGGGAGGGATTATTACGGGCACTCGCCAGCAGTCCTGGCTCTTTGCACACATGCTGGTGCCATTGCACCACCTAAGTATCATTGTATGATGAAAGTTCAAGGCTTACCACCTCAGAGTCACCTCAAGCTATGA

Protein sequence

MKTMPFMALRRKFCENIVQEFMLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSVGGGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKVQGLPPQSHLKL
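The protein sequence above is consistent with a standard-genetic-code translation of the CDS. As a minimal sketch (assuming the standard nuclear code, NCBI translation table 1; `translate` is a hypothetical helper written for this illustration), the first and last codons of the CDS can be checked like this:

```python
# Standard genetic code (NCBI table 1), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAAAACTATGCCTTTTATGGCTCTTAGA"))  # MKTMPFMALR
print(translate("CTCAAGCTATGA"))                    # LKL
```

The two outputs match the first ten and last three residues of the protein sequence, with the final TGA codon read as the stop.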
BLAST of CmaCh20G006320 vs. Swiss-Prot
Match: APO4_ARATH (APO protein 4, mitochondrial OS=Arabidopsis thaliana GN=APO4 PE=2 SV=2)

HSP 1 Score: 356.7 bits (914), Expect = 2.8e-97
Identity = 164/294 (55.78%), Positives = 216/294 (73.47%), Query Frame = 1

Query: 32  DLKKLRPMILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPE 91
           DL+KLRPMILKRI+NRA + P+K ++PVA+++  AR  LI  ++ LLK FPV++CKFC E
Sbjct: 34  DLRKLRPMILKRIENRAKDYPVKEIVPVAEEILIARKNLISNIAALLKVFPVLTCKFCSE 93

Query: 92  VYVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKF 151
           V+VG++GHLI +C  Y R   N++HEW+ G +ND+LVPVE++H H+I + V  H ERF +
Sbjct: 94  VFVGKEGHLIETCRSYIRRGNNRLHEWVPGSINDILVPVESYHLHNISQGVIRHQERFDY 153

Query: 152 ERVPAVVELCWQAGANTNDENLSSSTWNSVGGGGGGGSGRDEPLSGNEMRLLATETLRAW 211
           +RVPA++ELC QAGA   +E L    ++ +             L   +++ +    L AW
Sbjct: 154 DRVPAILELCCQAGAIHPEEILQ---YSEIHDNPQISEEDIRSLPAGDLKYVGANALMAW 213

Query: 212 ETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFWEKADVDDLV 271
           E VR GV+KLL+VYP+K+C  C +VHVGPS HKAR CGVFK ESWRG+H+WEKA V+DLV
Sbjct: 214 EKVRAGVKKLLLVYPSKVCKRCKEVHVGPSGHKARLCGVFKYESWRGTHYWEKAGVNDLV 273

Query: 272 PPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKVQGL 326
           P K+VWHRRPQDP VLVDEGR YYGH+PA+++LC+H GAI P KY C MK QGL
Sbjct: 274 PEKMVWHRRPQDPVVLVDEGRSYYGHAPAIVSLCSHTGAIVPVKYACKMKPQGL 324
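The percentages on each HSP statistics line are simply the match counts divided by the alignment length. The following sketch recomputes them from the fractions; `hsp_percentages` is a hypothetical helper, and the regular expression only assumes the `matches/length (xx.xx%)` layout used in these reports.

```python
import re

def hsp_percentages(stats_line):
    """Recompute the percentage for every 'matches/length (xx.xx%)' field."""
    results = []
    for matches, length, reported in re.findall(
        r"(\d+)/(\d+) \((\d+\.\d+)%\)", stats_line
    ):
        computed = round(100 * int(matches) / int(length), 2)
        results.append((int(matches), int(length), computed, float(reported)))
    return results

line = "Identity = 164/294 (55.78%), Positives = 216/294 (73.47%), Query Frame = 1"
for matches, length, computed, reported in hsp_percentages(line):
    print(f"{matches}/{length}: computed {computed:.2f}%, reported {reported:.2f}%")
```

For the APO4_ARATH hit this gives 55.78% identity and 73.47% positives over the 294 aligned columns, in agreement with the reported values.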

BLAST of CmaCh20G006320 vs. Swiss-Prot
Match: APO2_ARATH (APO protein 2, chloroplastic OS=Arabidopsis thaliana GN=APO2 PE=2 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.3e-54
Identity = 113/305 (37.05%), Positives = 160/305 (52.46%), Query Frame = 1

Query: 49  NNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYK 108
           N   +K L+P+A +V+ AR  LI+ +  L+K   V +C +C E++VG  GH  +SC G  
Sbjct: 126 NGMVVKSLVPLAYKVYNARIRLINNLHRLMKVVRVNACGWCNEIHVGPYGHPFKSCKGPN 185

Query: 109 RGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANT 168
              +  +HEW    + DV+VP+EA+H      K   HDERF   RVPAVVELC Q G   
Sbjct: 186 TSQRKGLHEWTNSVIEDVIVPLEAYHLFDRLGKRIRHDERFSIPRVPAVVELCIQGGVEI 245

Query: 169 NDENLSSSTWNSVGGGGGGGSGRDE--------------------------PLSGNEMRL 228
            +          +  G       DE                          P S  E   
Sbjct: 246 PEFPAKRRRKPIIRIGKSEFVDADETELPDPEPQPPPVPLLTELPVSEITPPSSEEETVS 305

Query: 229 LATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFW 288
           LA ETL+AWE +R G +KL+ +Y  ++C YC +VHVGP+ HKA+ CG FK +   G H W
Sbjct: 306 LAEETLQAWEEMRAGAKKLMRMYRVRVCGYCPEVHVGPTGHKAQNCGAFKHQQRNGQHGW 365

Query: 289 EKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKV 327
           + A +DDL+PP+ VWH    + P +  E R +YG +PAV+ +C  AGA+ P  Y   M++
Sbjct: 366 QSAVLDDLIPPRYVWHVPDVNGPPMQRELRSFYGQAPAVVEICAQAGAVVPEHYRATMRL 425

BLAST of CmaCh20G006320 vs. Swiss-Prot
Match: APO1_ARATH (APO protein 1, chloroplastic OS=Arabidopsis thaliana GN=APO1 PE=2 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.4e-51
Identity = 119/328 (36.28%), Positives = 168/328 (51.22%), Query Frame = 1

Query: 34  KKLRPM-ILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPEV 93
           KKL  M I K++    N   +  L+PVA QV +   +LI G++ LL   PV +C  C  V
Sbjct: 103 KKLAQMGIEKQLDPPKNGLLVPNLVPVADQVIDNWKLLIKGLAQLLHVVPVFACSECGAV 162

Query: 94  YVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKFE 153
           +V   GH IR C G     +   H W++G +NDVL+PVE++H +  F +  +H+ RF++E
Sbjct: 163 HVANVGHNIRDCNGPTNSQRRGSHSWVKGTINDVLIPVESYHMYDPFGRRIKHETRFEYE 222

Query: 154 RVPAVVELCWQAGANTNDENLSSSTW------NSVGGGGG-------------------- 213
           R+PA+VELC QAG    +      T         V   GG                    
Sbjct: 223 RIPALVELCIQAGVEIPEYPCRRRTQPIRMMGKRVIDRGGYHKEPEKPQTSSSLSSPLAE 282

Query: 214 ----GGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSV 273
               G   R  P +  ++  +A ET+ A+E VR GV KL+  +  K C YCS+VHVGP  
Sbjct: 283 LDTLGVFERYPPPTPEDIPKIAQETMDAYEKVRLGVTKLMRKFTVKACGYCSEVHVGPWG 342

Query: 274 HKARPCGVFKCESWR-GSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAV 330
           H  + CG FK   WR G H W+ A VD++ PP  VWH R      L    R +YG +PA+
Sbjct: 343 HSVKLCGEFK-HQWRDGKHGWQDALVDEVFPPNYVWHVRDLKGNPLTGNLRRFYGKAPAL 402

BLAST of CmaCh20G006320 vs. Swiss-Prot
Match: APO3_ARATH (APO protein 3, mitochondrial OS=Arabidopsis thaliana GN=APO3 PE=2 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 1.3e-22
Identity = 50/130 (38.46%), Positives = 76/130 (58.46%), Query Frame = 1

Query: 200 MRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGS 259
           ++ L+ ET+ +W  +  GV+KL+  Y    C YC ++ VGP  HK R C   K +   G 
Sbjct: 265 LKELSFETMESWFEMVLGVRKLMERYRVWTCGYCPEIQVGPKGHKVRMCKATKHQMRDGM 324

Query: 260 HFWEKADVDDLVPPKIVWH-RRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHC 319
           H W++A +DD+V P  VWH R P D  VL +  + +YG +PAV+ +C   GA  P +Y+ 
Sbjct: 325 HAWQEATIDDVVGPTYVWHVRDPTDGSVLDNSLKRFYGKAPAVIEMCVQGGAPVPDQYNS 384

Query: 320 MMKVQGLPPQ 329
           MM++  + PQ
Sbjct: 385 MMRLDVVYPQ 394

BLAST of CmaCh20G006320 vs. TrEMBL
Match: A0A0A0LQG5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G421010 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 3.5e-155
Identity = 265/333 (79.58%), Positives = 294/333 (88.29%), Query Frame = 1

Query: 1   MKTMPFMALRRKFCENIVQEFMLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVA 60
           MKTM FMA+RRKF +N+VQEFML RCYSSKV+LKKLRPMILKRIQ+RA N PIKG+ PVA
Sbjct: 1   MKTMAFMAIRRKFRDNVVQEFMLQRCYSSKVNLKKLRPMILKRIQDRAKNYPIKGMTPVA 60

Query: 61  QQVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIR 120
           QQV EARA+LIHGVSTLLK FPV+SCKFCPEVYVGE+GHLIRSCGGYKRG KNQVH+WIR
Sbjct: 61  QQVLEARAMLIHGVSTLLKSFPVLSCKFCPEVYVGEEGHLIRSCGGYKRGAKNQVHQWIR 120

Query: 121 GGLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNS 180
           G L D++VPVEAFH HH+F+ V +HDERF FERVPAVVELC QAGAN +D+NL+SST NS
Sbjct: 121 GDLKDIIVPVEAFHLHHMFQDVIKHDERFNFERVPAVVELCSQAGANPDDKNLASSTQNS 180

Query: 181 VGGGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGP 240
                GGGSG DEPLS +EM LLATET+RAWET+RTGVQKLLMVYP K+C YCS+VHVGP
Sbjct: 181 ---AEGGGSGMDEPLSDHEMMLLATETIRAWETLRTGVQKLLMVYPTKVCKYCSEVHVGP 240

Query: 241 SVHKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPA 300
           S HKAR CGVF  ESWRGSHFWEKADVDDLVPPKIVWHRR QDPPVLVD+G+DYYGH+PA
Sbjct: 241 SGHKARLCGVFTYESWRGSHFWEKADVDDLVPPKIVWHRRQQDPPVLVDKGKDYYGHAPA 300

Query: 301 VLALCTHAGAIAPPKYHCMMKVQGLPPQSHLKL 334
           V+ALCT AG IAP KYHCMMKVQGL P+  L+L
Sbjct: 301 VVALCTQAGVIAPFKYHCMMKVQGLSPRVKLEL 330

BLAST of CmaCh20G006320 vs. TrEMBL
Match: A0A061EHU0_THECC (APO protein 4 isoform 1 OS=Theobroma cacao GN=TCM_019219 PE=4 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 1.1e-121
Identity = 210/302 (69.54%), Positives = 250/302 (82.78%), Query Frame = 1

Query: 25  RCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVV 84
           R YSSKVDLKKLRPMILKRI+NRA + P+ G+IPVAQ+V  ARA+L  GVS LLK FPV+
Sbjct: 10  RSYSSKVDLKKLRPMILKRIENRAKDYPVPGMIPVAQEVLMARALLFQGVSILLKLFPVL 69

Query: 85  SCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNE 144
           +CKFCPEVY+GEKGHLI++C GYKR  KN+VHEW+ GGLND+LVPVEAFH H++F+ V +
Sbjct: 70  ACKFCPEVYIGEKGHLIKTCCGYKRIGKNRVHEWVNGGLNDILVPVEAFHLHNMFQGVIK 129

Query: 145 HDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSVGGGGGGGSGRDEPLSGNEMRLLA 204
           H +RF FERVPAVVELCWQAGA+ NDENL+S   + V     GG    E LS +++ ++A
Sbjct: 130 HQQRFDFERVPAVVELCWQAGADLNDENLNSG--SLVADEFYGGVRGIESLSHDDLTVIA 189

Query: 205 TETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFWEK 264
             TLRAWET+R+GV KLL+VYPAK+C YCS+VHVGPS H+AR CGVF+ ESWRG+HFW+K
Sbjct: 190 NGTLRAWETLRSGVMKLLLVYPAKVCKYCSEVHVGPSGHRARLCGVFRYESWRGAHFWKK 249

Query: 265 ADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKVQG 324
           A VDDLVPPKIVW RRPQDP VL+DEGRDYYGH+PAV+ LC+ AGAI P KY CMMKV G
Sbjct: 250 AGVDDLVPPKIVWRRRPQDPLVLLDEGRDYYGHAPAVVDLCSGAGAIVPTKYSCMMKVSG 309

Query: 325 LP 327
           LP
Sbjct: 310 LP 309

BLAST of CmaCh20G006320 vs. TrEMBL
Match: F6HIK2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0059g02380 PE=4 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 1.6e-120
Identity = 213/330 (64.55%), Positives = 259/330 (78.48%), Query Frame = 1

Query: 7   MALRRKFCENIVQEF----MLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQQ 66
           MALR K     + E     M  R YS KVDLKKLRPMILKRI+NRA   PI  +IPVAQ 
Sbjct: 1   MALRGKLWHCFLDEASGFAMYARFYSVKVDLKKLRPMILKRIENRAKEYPISSMIPVAQD 60

Query: 67  VFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRGG 126
           V +AR++LI GVSTL+  FPV++CKFCPEVY+GE+GHLI++C GYKR  KNQVHEWI G 
Sbjct: 61  VLKARSLLIQGVSTLMNVFPVMACKFCPEVYIGEQGHLIQTCYGYKRRSKNQVHEWISGS 120

Query: 127 LNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSVG 186
           LND+LVPVE FH   +F+ V +H +RF F+RVPAV ELC QAGA+ ++ENLSSS+W S  
Sbjct: 121 LNDILVPVETFHLQKMFQDVIKHHQRFDFDRVPAVFELCLQAGADLDEENLSSSSWKSES 180

Query: 187 GGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSV 246
              G    +   LS +E++ +AT TLRAWE +R+G+++LL+VYPAK+C YCS+VHVGPS 
Sbjct: 181 TFSGVHGTKS--LSPDELKFVATGTLRAWEVLRSGIRRLLLVYPAKVCKYCSEVHVGPSG 240

Query: 247 HKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVL 306
           HKAR CGVFK ESWRG+HFW+KADVDDLVPPKIVW +RPQDPPVLV+EGRD+YGH+PAV+
Sbjct: 241 HKARLCGVFKYESWRGAHFWKKADVDDLVPPKIVWRQRPQDPPVLVNEGRDFYGHAPAVV 300

Query: 307 ALCTHAGAIAPPKYHCMMKVQGLPPQSHLK 333
            LCT AGAIAP +YH MMKVQGLP  + +K
Sbjct: 301 DLCTKAGAIAPARYHSMMKVQGLPGPTGVK 328

BLAST of CmaCh20G006320 vs. TrEMBL
Match: U5FMZ8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s06490g PE=4 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.0e-118
Identity = 208/324 (64.20%), Positives = 254/324 (78.40%), Query Frame = 1

Query: 7   MALRRKFCENIVQEF-----MLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQ 66
           MALR+K  ENIV+EF     M  R YSS+VD KKLRPMILKRIQNRA + P+KG++PVA+
Sbjct: 1   MALRKKLWENIVEEFSKTYFMHSRFYSSRVDFKKLRPMILKRIQNRAKDYPVKGMVPVAR 60

Query: 67  QVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRG 126
           +V E R +LI GVSTL++ FPV++CKFCPEVY+GEKGHLI++C GYKR  + +VHEWI G
Sbjct: 61  EVLEKRKLLIQGVSTLMEVFPVLACKFCPEVYIGEKGHLIQTCYGYKRCGRKRVHEWIPG 120

Query: 127 GLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSV 186
           GLND+LVPVE F   ++F+ V EHD+RF F+RVPAVVELC QAGAN +DENL     +  
Sbjct: 121 GLNDILVPVETFRLDNMFQDVIEHDQRFDFDRVPAVVELCRQAGANIDDENLHPGMLDLD 180

Query: 187 GGGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPS 246
           GG G    G  EP S + +  +A E L AWE +R+GVQ+LL+VYP+K+C +CS+VH+GPS
Sbjct: 181 GGIGHIDGG--EPFSPSHLMYIAKEILDAWEKLRSGVQRLLLVYPSKVCKHCSEVHIGPS 240

Query: 247 VHKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAV 306
            HKAR CGVFK ESW G HFW+KA+VDDLVPPKIVW RRPQDP VLV+EGRD+YGH+PAV
Sbjct: 241 GHKARLCGVFKFESWHGKHFWKKAEVDDLVPPKIVWRRRPQDPLVLVNEGRDFYGHAPAV 300

Query: 307 LALCTHAGAIAPPKYHCMMKVQGL 326
           + LCT  G I P KY CMMK+QGL
Sbjct: 301 VDLCTKTGIIVPTKYSCMMKIQGL 322

BLAST of CmaCh20G006320 vs. TrEMBL
Match: A0A151R7A7_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_040522 PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 3.8e-117
Identity = 192/301 (63.79%), Positives = 248/301 (82.39%), Query Frame = 1

Query: 25  RCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVV 84
           R Y++KVDL+KLRPMILKRI+ RA   P++ ++P+A++V +AR +LIHGVSTLLK  P++
Sbjct: 14  RLYATKVDLRKLRPMILKRIEKRAQAYPVRAMVPIAKEVLQARNVLIHGVSTLLKFLPLM 73

Query: 85  SCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNE 144
           +CKFCPE+Y+GE+GHLI++C GYK   KN+VHEW++GGLNDVLVPVE+FH +++++ V  
Sbjct: 74  ACKFCPEIYIGEQGHLIQTCWGYKHRAKNRVHEWVKGGLNDVLVPVESFHLNNMYQNVIR 133

Query: 145 HDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSVGGGGGGGSGRDEPLSGNEMRLLA 204
           H+ERF F+R+PAVVELCWQAGA+ +DENL+S  WN     G G     E LS N++  +A
Sbjct: 134 HNERFDFDRIPAVVELCWQAGADAHDENLNSGNWNL--DTGNGSILETESLSPNDLVSIA 193

Query: 205 TETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFWEK 264
            +TL AWET+R+GV+KLL+VYP K+C YCS+VHVGPS HKAR CGVFK ESW+G+HFW K
Sbjct: 194 NKTLTAWETLRSGVEKLLLVYPVKVCKYCSEVHVGPSGHKARLCGVFKYESWKGAHFWIK 253

Query: 265 ADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKVQG 324
           A+VD+LVPPK+VW RRP DPPVLV+EG+D+YG  PAVL LC+ AGAI P KY+CMMKVQG
Sbjct: 254 ANVDNLVPPKVVWRRRPHDPPVLVNEGKDFYGRVPAVLDLCSKAGAIVPAKYNCMMKVQG 312

Query: 325 L 326
           L
Sbjct: 314 L 312

BLAST of CmaCh20G006320 vs. TAIR10
Match: AT3G21740.1 (AT3G21740.1 Arabidopsis thaliana protein of unknown function (DUF794))

HSP 1 Score: 356.7 bits (914), Expect = 1.6e-98
Identity = 164/294 (55.78%), Positives = 216/294 (73.47%), Query Frame = 1

Query: 32  DLKKLRPMILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPE 91
           DL+KLRPMILKRI+NRA + P+K ++PVA+++  AR  LI  ++ LLK FPV++CKFC E
Sbjct: 34  DLRKLRPMILKRIENRAKDYPVKEIVPVAEEILIARKNLISNIAALLKVFPVLTCKFCSE 93

Query: 92  VYVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKF 151
           V+VG++GHLI +C  Y R   N++HEW+ G +ND+LVPVE++H H+I + V  H ERF +
Sbjct: 94  VFVGKEGHLIETCRSYIRRGNNRLHEWVPGSINDILVPVESYHLHNISQGVIRHQERFDY 153

Query: 152 ERVPAVVELCWQAGANTNDENLSSSTWNSVGGGGGGGSGRDEPLSGNEMRLLATETLRAW 211
           +RVPA++ELC QAGA   +E L    ++ +             L   +++ +    L AW
Sbjct: 154 DRVPAILELCCQAGAIHPEEILQ---YSEIHDNPQISEEDIRSLPAGDLKYVGANALMAW 213

Query: 212 ETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFWEKADVDDLV 271
           E VR GV+KLL+VYP+K+C  C +VHVGPS HKAR CGVFK ESWRG+H+WEKA V+DLV
Sbjct: 214 EKVRAGVKKLLLVYPSKVCKRCKEVHVGPSGHKARLCGVFKYESWRGTHYWEKAGVNDLV 273

Query: 272 PPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKVQGL 326
           P K+VWHRRPQDP VLVDEGR YYGH+PA+++LC+H GAI P KY C MK QGL
Sbjct: 274 PEKMVWHRRPQDPVVLVDEGRSYYGHAPAIVSLCSHTGAIVPVKYACKMKPQGL 324

BLAST of CmaCh20G006320 vs. TAIR10
Match: AT5G57930.2 (AT5G57930.2 Arabidopsis thaliana protein of unknown function (DUF794))

HSP 1 Score: 214.9 bits (546), Expect = 7.4e-56
Identity = 113/305 (37.05%), Positives = 160/305 (52.46%), Query Frame = 1

Query: 49  NNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYK 108
           N   +K L+P+A +V+ AR  LI+ +  L+K   V +C +C E++VG  GH  +SC G  
Sbjct: 129 NGMVVKSLVPLAYKVYNARIRLINNLHRLMKVVRVNACGWCNEIHVGPYGHPFKSCKGPN 188

Query: 109 RGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANT 168
              +  +HEW    + DV+VP+EA+H      K   HDERF   RVPAVVELC Q G   
Sbjct: 189 TSQRKGLHEWTNSVIEDVIVPLEAYHLFDRLGKRIRHDERFSIPRVPAVVELCIQGGVEI 248

Query: 169 NDENLSSSTWNSVGGGGGGGSGRDE--------------------------PLSGNEMRL 228
            +          +  G       DE                          P S  E   
Sbjct: 249 PEFPAKRRRKPIIRIGKSEFVDADETELPDPEPQPPPVPLLTELPVSEITPPSSEEETVS 308

Query: 229 LATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFW 288
           LA ETL+AWE +R G +KL+ +Y  ++C YC +VHVGP+ HKA+ CG FK +   G H W
Sbjct: 309 LAEETLQAWEEMRAGAKKLMRMYRVRVCGYCPEVHVGPTGHKAQNCGAFKHQQRNGQHGW 368

Query: 289 EKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKV 327
           + A +DDL+PP+ VWH    + P +  E R +YG +PAV+ +C  AGA+ P  Y   M++
Sbjct: 369 QSAVLDDLIPPRYVWHVPDVNGPPMQRELRSFYGQAPAVVEICAQAGAVVPEHYRATMRL 428

BLAST of CmaCh20G006320 vs. TAIR10
Match: AT1G64810.2 (AT1G64810.2 Arabidopsis thaliana protein of unknown function (DUF794))

HSP 1 Score: 204.9 bits (520), Expect = 7.6e-53
Identity = 119/328 (36.28%), Positives = 168/328 (51.22%), Query Frame = 1

Query: 34  KKLRPM-ILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVVSCKFCPEV 93
           KKL  M I K++    N   +  L+PVA QV +   +LI G++ LL   PV +C  C  V
Sbjct: 127 KKLAQMGIEKQLDPPKNGLLVPNLVPVADQVIDNWKLLIKGLAQLLHVVPVFACSECGAV 186

Query: 94  YVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNEHDERFKFE 153
           +V   GH IR C G     +   H W++G +NDVL+PVE++H +  F +  +H+ RF++E
Sbjct: 187 HVANVGHNIRDCNGPTNSQRRGSHSWVKGTINDVLIPVESYHMYDPFGRRIKHETRFEYE 246

Query: 154 RVPAVVELCWQAGANTNDENLSSSTW------NSVGGGGG-------------------- 213
           R+PA+VELC QAG    +      T         V   GG                    
Sbjct: 247 RIPALVELCIQAGVEIPEYPCRRRTQPIRMMGKRVIDRGGYHKEPEKPQTSSSLSSPLAE 306

Query: 214 ----GGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSV 273
               G   R  P +  ++  +A ET+ A+E VR GV KL+  +  K C YCS+VHVGP  
Sbjct: 307 LDTLGVFERYPPPTPEDIPKIAQETMDAYEKVRLGVTKLMRKFTVKACGYCSEVHVGPWG 366

Query: 274 HKARPCGVFKCESWR-GSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAV 330
           H  + CG FK   WR G H W+ A VD++ PP  VWH R      L    R +YG +PA+
Sbjct: 367 HSVKLCGEFK-HQWRDGKHGWQDALVDEVFPPNYVWHVRDLKGNPLTGNLRRFYGKAPAL 426

BLAST of CmaCh20G006320 vs. TAIR10
Match: AT5G61930.1 (AT5G61930.1 Arabidopsis thaliana protein of unknown function (DUF794))

HSP 1 Score: 108.6 bits (270), Expect = 7.5e-24
Identity = 50/130 (38.46%), Positives = 76/130 (58.46%), Query Frame = 1

Query: 200 MRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGS 259
           ++ L+ ET+ +W  +  GV+KL+  Y    C YC ++ VGP  HK R C   K +   G 
Sbjct: 265 LKELSFETMESWFEMVLGVRKLMERYRVWTCGYCPEIQVGPKGHKVRMCKATKHQMRDGM 324

Query: 260 HFWEKADVDDLVPPKIVWH-RRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHC 319
           H W++A +DD+V P  VWH R P D  VL +  + +YG +PAV+ +C   GA  P +Y+ 
Sbjct: 325 HAWQEATIDDVVGPTYVWHVRDPTDGSVLDNSLKRFYGKAPAVIEMCVQGGAPVPDQYNS 384

Query: 320 MMKVQGLPPQ 329
           MM++  + PQ
Sbjct: 385 MMRLDVVYPQ 394

BLAST of CmaCh20G006320 vs. NCBI nr
Match: gi|449468339|ref|XP_004151879.1| (PREDICTED: APO protein 4, mitochondrial [Cucumis sativus])

HSP 1 Score: 555.8 bits (1431), Expect = 5.0e-155
Identity = 265/333 (79.58%), Positives = 294/333 (88.29%), Query Frame = 1

Query: 1   MKTMPFMALRRKFCENIVQEFMLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVA 60
           MKTM FMA+RRKF +N+VQEFML RCYSSKV+LKKLRPMILKRIQ+RA N PIKG+ PVA
Sbjct: 1   MKTMAFMAIRRKFRDNVVQEFMLQRCYSSKVNLKKLRPMILKRIQDRAKNYPIKGMTPVA 60

Query: 61  QQVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIR 120
           QQV EARA+LIHGVSTLLK FPV+SCKFCPEVYVGE+GHLIRSCGGYKRG KNQVH+WIR
Sbjct: 61  QQVLEARAMLIHGVSTLLKSFPVLSCKFCPEVYVGEEGHLIRSCGGYKRGAKNQVHQWIR 120

Query: 121 GGLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNS 180
           G L D++VPVEAFH HH+F+ V +HDERF FERVPAVVELC QAGAN +D+NL+SST NS
Sbjct: 121 GDLKDIIVPVEAFHLHHMFQDVIKHDERFNFERVPAVVELCSQAGANPDDKNLASSTQNS 180

Query: 181 VGGGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGP 240
                GGGSG DEPLS +EM LLATET+RAWET+RTGVQKLLMVYP K+C YCS+VHVGP
Sbjct: 181 ---AEGGGSGMDEPLSDHEMMLLATETIRAWETLRTGVQKLLMVYPTKVCKYCSEVHVGP 240

Query: 241 SVHKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPA 300
           S HKAR CGVF  ESWRGSHFWEKADVDDLVPPKIVWHRR QDPPVLVD+G+DYYGH+PA
Sbjct: 241 SGHKARLCGVFTYESWRGSHFWEKADVDDLVPPKIVWHRRQQDPPVLVDKGKDYYGHAPA 300

Query: 301 VLALCTHAGAIAPPKYHCMMKVQGLPPQSHLKL 334
           V+ALCT AG IAP KYHCMMKVQGL P+  L+L
Sbjct: 301 VVALCTQAGVIAPFKYHCMMKVQGLSPRVKLEL 330

BLAST of CmaCh20G006320 vs. NCBI nr
Match: gi|659111656|ref|XP_008455841.1| (PREDICTED: APO protein 4, mitochondrial [Cucumis melo])

HSP 1 Score: 549.7 bits (1415), Expect = 3.6e-153
Identity = 261/331 (78.85%), Positives = 292/331 (88.22%), Query Frame = 1

Query: 1   MKTMPFMALRRKFCENIVQEFMLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVA 60
           MKTM FMA+RRKF +N+VQEFML RCYSSKV+LKKLRPMILKRIQ+RA + PIKG+ PVA
Sbjct: 1   MKTMAFMAIRRKFRDNVVQEFMLQRCYSSKVNLKKLRPMILKRIQDRAKSYPIKGMTPVA 60

Query: 61  QQVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIR 120
           QQV EARA+LIHGVSTLLK FPV+SCK+CPEVYVG +GHLIRSCGGYKRG KNQVH+WIR
Sbjct: 61  QQVLEARAMLIHGVSTLLKSFPVLSCKYCPEVYVGAEGHLIRSCGGYKRGAKNQVHQWIR 120

Query: 121 GGLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNS 180
           G LND++VPVEAFH HH+F+ V +HDERF FERVPAVVELC QAG N +D++L+SST NS
Sbjct: 121 GDLNDIIVPVEAFHLHHMFQDVIKHDERFNFERVPAVVELCCQAGVNPDDKDLASSTQNS 180

Query: 181 VGGGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGP 240
                GGGSG DEP S +EM LLATET+RAWET+RTGVQKLLMVYP K+C YCS+VHVGP
Sbjct: 181 ---AEGGGSGMDEPFSDHEMMLLATETIRAWETLRTGVQKLLMVYPTKVCKYCSEVHVGP 240

Query: 241 SVHKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPA 300
           S HKAR CGVFK ESWRGSHFWEKADVDDLVPPKIVWHRR QDPPVLVD+GRDYYGH+PA
Sbjct: 241 SGHKARLCGVFKYESWRGSHFWEKADVDDLVPPKIVWHRRQQDPPVLVDKGRDYYGHAPA 300

Query: 301 VLALCTHAGAIAPPKYHCMMKVQGLPPQSHL 332
           V+ALC  AGAIAP KYHCMMKVQGL P+ +L
Sbjct: 301 VVALCMQAGAIAPFKYHCMMKVQGLSPRVNL 328

BLAST of CmaCh20G006320 vs. NCBI nr
Match: gi|590652086|ref|XP_007033060.1| (APO protein 4 isoform 1 [Theobroma cacao])

HSP 1 Score: 444.5 bits (1142), Expect = 1.6e-121
Identity = 210/302 (69.54%), Positives = 250/302 (82.78%), Query Frame = 1

Query: 25  RCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQQVFEARAILIHGVSTLLKEFPVV 84
           R YSSKVDLKKLRPMILKRI+NRA + P+ G+IPVAQ+V  ARA+L  GVS LLK FPV+
Sbjct: 10  RSYSSKVDLKKLRPMILKRIENRAKDYPVPGMIPVAQEVLMARALLFQGVSILLKLFPVL 69

Query: 85  SCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRGGLNDVLVPVEAFHRHHIFEKVNE 144
           +CKFCPEVY+GEKGHLI++C GYKR  KN+VHEW+ GGLND+LVPVEAFH H++F+ V +
Sbjct: 70  ACKFCPEVYIGEKGHLIKTCCGYKRIGKNRVHEWVNGGLNDILVPVEAFHLHNMFQGVIK 129

Query: 145 HDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSVGGGGGGGSGRDEPLSGNEMRLLA 204
           H +RF FERVPAVVELCWQAGA+ NDENL+S   + V     GG    E LS +++ ++A
Sbjct: 130 HQQRFDFERVPAVVELCWQAGADLNDENLNSG--SLVADEFYGGVRGIESLSHDDLTVIA 189

Query: 205 TETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSVHKARPCGVFKCESWRGSHFWEK 264
             TLRAWET+R+GV KLL+VYPAK+C YCS+VHVGPS H+AR CGVF+ ESWRG+HFW+K
Sbjct: 190 NGTLRAWETLRSGVMKLLLVYPAKVCKYCSEVHVGPSGHRARLCGVFRYESWRGAHFWKK 249

Query: 265 ADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVLALCTHAGAIAPPKYHCMMKVQG 324
           A VDDLVPPKIVW RRPQDP VL+DEGRDYYGH+PAV+ LC+ AGAI P KY CMMKV G
Sbjct: 250 AGVDDLVPPKIVWRRRPQDPLVLLDEGRDYYGHAPAVVDLCSGAGAIVPTKYSCMMKVSG 309

Query: 325 LP 327
           LP
Sbjct: 310 LP 309

BLAST of CmaCh20G006320 vs. NCBI nr
Match: gi|359485666|ref|XP_002273999.2| (PREDICTED: APO protein 4, mitochondrial [Vitis vinifera])

HSP 1 Score: 440.7 bits (1132), Expect = 2.3e-120
Identity = 213/330 (64.55%), Positives = 259/330 (78.48%), Query Frame = 1

Query: 7   MALRRKFCENIVQEF----MLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQQ 66
           MALR K     + E     M  R YS KVDLKKLRPMILKRI+NRA   PI  +IPVAQ 
Sbjct: 1   MALRGKLWHCFLDEASGFAMYARFYSVKVDLKKLRPMILKRIENRAKEYPISSMIPVAQD 60

Query: 67  VFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRGG 126
           V +AR++LI GVSTL+  FPV++CKFCPEVY+GE+GHLI++C GYKR  KNQVHEWI G 
Sbjct: 61  VLKARSLLIQGVSTLMNVFPVMACKFCPEVYIGEQGHLIQTCYGYKRRSKNQVHEWISGS 120

Query: 127 LNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSVG 186
           LND+LVPVE FH   +F+ V +H +RF F+RVPAV ELC QAGA+ ++ENLSSS+W S  
Sbjct: 121 LNDILVPVETFHLQKMFQDVIKHHQRFDFDRVPAVFELCLQAGADLDEENLSSSSWKSES 180

Query: 187 GGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPSV 246
              G    +   LS +E++ +AT TLRAWE +R+G+++LL+VYPAK+C YCS+VHVGPS 
Sbjct: 181 TFSGVHGTKS--LSPDELKFVATGTLRAWEVLRSGIRRLLLVYPAKVCKYCSEVHVGPSG 240

Query: 247 HKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAVL 306
           HKAR CGVFK ESWRG+HFW+KADVDDLVPPKIVW +RPQDPPVLV+EGRD+YGH+PAV+
Sbjct: 241 HKARLCGVFKYESWRGAHFWKKADVDDLVPPKIVWRQRPQDPPVLVNEGRDFYGHAPAVV 300

Query: 307 ALCTHAGAIAPPKYHCMMKVQGLPPQSHLK 333
            LCT AGAIAP +YH MMKVQGLP  + +K
Sbjct: 301 DLCTKAGAIAPARYHSMMKVQGLPGPTGVK 328

BLAST of CmaCh20G006320 vs. NCBI nr
Match: gi|743883363|ref|XP_011037010.1| (PREDICTED: APO protein 4, mitochondrial [Populus euphratica])

HSP 1 Score: 434.9 bits (1117), Expect = 1.3e-118
Identity = 207/324 (63.89%), Positives = 254/324 (78.40%), Query Frame = 1

Query: 7   MALRRKFCENIVQEF-----MLHRCYSSKVDLKKLRPMILKRIQNRANNCPIKGLIPVAQ 66
           MA  +K  EN+V+EF     M  R YSS+VD KKLRPMILKRIQNRA + P+KG++PVA+
Sbjct: 1   MAFTKKLWENLVEEFSKTYFMHSRFYSSRVDFKKLRPMILKRIQNRAKDYPVKGMVPVAR 60

Query: 67  QVFEARAILIHGVSTLLKEFPVVSCKFCPEVYVGEKGHLIRSCGGYKRGPKNQVHEWIRG 126
           +V E R +LI GVSTL++ FPV++CKFCPEVY+GEKGHLI++C GYKR  + +VHEWI G
Sbjct: 61  EVLEKRKLLIQGVSTLMEVFPVLACKFCPEVYIGEKGHLIQTCYGYKRCGRKRVHEWIPG 120

Query: 127 GLNDVLVPVEAFHRHHIFEKVNEHDERFKFERVPAVVELCWQAGANTNDENLSSSTWNSV 186
           GLND+LVPVE F  H++F+ V EH++RF F+RVPAVVELC QAGAN +DENL     +  
Sbjct: 121 GLNDILVPVETFRLHNMFQDVIEHNQRFDFDRVPAVVELCRQAGANIDDENLHPGMLDLD 180

Query: 187 GGGGGGGSGRDEPLSGNEMRLLATETLRAWETVRTGVQKLLMVYPAKICNYCSDVHVGPS 246
           GG G    G  EP S + +   A E L AWE +R+GVQ+LL+VYP+K+C +CS+VH+GPS
Sbjct: 181 GGIGHIDGG--EPFSPSHLMHTAKEILDAWEKLRSGVQRLLLVYPSKVCKHCSEVHIGPS 240

Query: 247 VHKARPCGVFKCESWRGSHFWEKADVDDLVPPKIVWHRRPQDPPVLVDEGRDYYGHSPAV 306
            HKAR CGVFK ESW G HFW+KA+VDDLVPPKIVW RRPQDPPVLV+EGRD+YGH+PAV
Sbjct: 241 GHKARLCGVFKFESWHGKHFWKKAEVDDLVPPKIVWWRRPQDPPVLVNEGRDFYGHAPAV 300

Query: 307 LALCTHAGAIAPPKYHCMMKVQGL 326
           + LCT  G I PPKY CMMK+QGL
Sbjct: 301 VDLCTKTGIIVPPKYSCMMKIQGL 322

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value    Identity (%)    Description
APO4_ARATH    2.8e-97    55.78           APO protein 4, mitochondrial OS=Arabidopsis thaliana GN=APO4 PE=2 SV=2
APO2_ARATH    1.3e-54    37.05           APO protein 2, chloroplastic OS=Arabidopsis thaliana GN=APO2 PE=2 SV=1
APO1_ARATH    1.4e-51    36.28           APO protein 1, chloroplastic OS=Arabidopsis thaliana GN=APO1 PE=2 SV=1
APO3_ARATH    1.3e-22    38.46           APO protein 3, mitochondrial OS=Arabidopsis thaliana GN=APO3 PE=2 SV=1

vs. TrEMBL
Match Name          E-value     Identity (%)    Description
A0A0A0LQG5_CUCSA    3.5e-155    79.58           Uncharacterized protein OS=Cucumis sativus GN=Csa_2G421010 PE=4 SV=1
A0A061EHU0_THECC    1.1e-121    69.54           APO protein 4 isoform 1 OS=Theobroma cacao GN=TCM_019219 PE=4 SV=1
F6HIK2_VITVI        1.6e-120    64.55           Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0059g02380 PE=4 SV=...
U5FMZ8_POPTR        2.0e-118    64.20           Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s06490g PE=4 SV=1
A0A151R7A7_CAJCA    3.8e-117    63.79           Uncharacterized protein OS=Cajanus cajan GN=KK1_040522 PE=4 SV=1

vs. TAIR10
Match Name     E-value    Identity (%)    Description
AT3G21740.1    1.6e-98    55.78           Arabidopsis thaliana protein of unknown function (DUF794)
AT5G57930.2    7.4e-56    37.05           Arabidopsis thaliana protein of unknown function (DUF794)
AT1G64810.2    7.6e-53    36.28           Arabidopsis thaliana protein of unknown function (DUF794)
AT5G61930.1    7.5e-24    38.46           Arabidopsis thaliana protein of unknown function (DUF794)

vs. NCBI nr
Match Name                          E-value     Identity (%)    Description
gi|449468339|ref|XP_004151879.1|    5.0e-155    79.58           PREDICTED: APO protein 4, mitochondrial [Cucumis sativus]
gi|659111656|ref|XP_008455841.1|    3.6e-153    78.85           PREDICTED: APO protein 4, mitochondrial [Cucumis melo]
gi|590652086|ref|XP_007033060.1|    1.6e-121    69.54           APO protein 4 isoform 1 [Theobroma cacao]
gi|359485666|ref|XP_002273999.2|    2.3e-120    64.55           PREDICTED: APO protein 4, mitochondrial [Vitis vinifera]
gi|743883363|ref|XP_011037010.1|    1.3e-118    63.89           PREDICTED: APO protein 4, mitochondrial [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR023342    APO_dom
Vocabulary: Molecular Function
Term          Definition
GO:0003723    RNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0055114        oxidation-reduction process
biological_process    GO:0006979        response to oxidative stress
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003723        RNA binding
molecular_function    GO:0020037        heme binding
molecular_function    GO:0004601        peroxidase activity
molecular_function GO:0004601 peroxidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh20G006320.1    CmaCh20G006320.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term     IPR Description     Source     Source Term       Source Description                               Alignment
IPR023342    APO domain          PFAM       PF05634           APO_RNA-bind                                     coord: 203..310, score: 6.1E-16; coord: 29..168, score: 1.9
IPR023342    APO domain          PROFILE    PS51499           APO                                              coord: 85..170, score: 20.111; coord: 229..314, score: 19
None         No IPR available    PANTHER    PTHR10388         EUKARYOTIC TRANSLATION INITIATION FACTOR SUI1    coord: 1..305, score: 9.4E
None         No IPR available    PANTHER    PTHR10388:SF20    APO PROTEIN 4, MITOCHONDRIAL                     coord: 1..305, score: 9.4E

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                   Block
CmaCh20G006320    CmaCh02G001560    Cucurbita maxima (Rimu)    cmacmaB477