Tan0017001 (gene) Snake gourd v1

Overview
NameTan0017001
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein CHLOROPLAST IMPORT APPARATUS 2
LocationLG08: 11608875 .. 11613056 (-)
RNA-Seq ExpressionTan0017001
SyntenyTan0017001
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGAAACTGAGCCTTGTCTCGTCTTGTGTTCTCTTCCTTTCTAGTAATCTCCATCGCCCAACTGCAAAAACACTCACACAGAAGAGAGAGAAAGAAACAAGAGAGAGAGAGAGAGAGAGAGAAATTCTTTGTTTCTCTCTGCAGAATTGCTATTATATCTACAGCACACCCAACGGATCTTTCCCTTTTATCTTCTCTTCTTCTTCTTCTTCTTCCTTCTCTCTCTGTTTTTTGAGGAAATTCAGACTGTGTGTATATATAGAAATCAGAGTGTTTGTTTTCTTAATTGGGTTAGTGAGTAGTTTTTGTTTGGTGGGAGATTTCCTATTTCCATGGAAGTTCCAAATTCCTCTGGTTTTCTTGGGGCAAGAGAGAGGGAAATTGGGAAATAACAAGAGAGACAGAGAGATAGAAGGAAAAGGAGAAGAGGGTTTTTGTTTTGGTTTAAGAACTTTGAAGTTTTGGGGTTTTGTTGGATTTTTCTTTTCTTTTTTTGGCGAGTTCTTCCGTACTGTTTCCCCTTTTTTGTGGGGATTTTTTCATGCCTAGGGTTTAGGGCTTTAGAAAGTCAACACTCATTTCCACTAATTTTCCAGGTTTTTGATGGAATTGAGCTGTTCTAGAGGGATTTCTTTGAATTCTTGAGGAAAAATTTTCTTGGTAACCAAACAGAATTTACAGTGTCTTTTTCTTTTTCTTGTTCTTGTTGTTCTGAATATATCCAATGTCTTCACCATGTATAAGTGGTGGTGGAAGAGCTTACAATTTCGACCTCGAAATTGTGAAATCTCCATCTTCTTCATGGACGAGAACTTCTCAAACTTCATCGCCTTCTTCAACTCTCTCCGAATCAAGCAATACACAGCTAGCAATTTCAACTCGAAAATCCCGAACTCCTCGAAAACGCCCAAATCAAACCTACAATGAGGCCACGGTTTTGCTATCTACGGCATATCCAAATGTTTTCTCTACCAAACACCTCACGAATCCGCGAAAATTCACCGAATCACACGACTCTCTGTTCTGTGAATCGGCCGAATTGCTCTTGCCTTTCCGAGTAATCGATAGCTCTGGATTTCTCCTCCACCAACCGCTTCTACAAGAAAAACCTAATTCCCAAATTCAATCGAAATTGGCGAATCTGTGGGAGAGCCGGCCATGTTCGAGCCCAGGGGAGATCGATTTCCAACCGAATTCGATGGAGATGGAAGATTTCGATGCCGAATCGATTCTCGACGAGGAAATCGAGGAAGGAATCGATAGTATCATGGGGAATCTGTGTGTGGATAACCTAGAAACTGCTACTTCAGCGCAAGATTATTCTTGTGCAAACCCTAAGAATTGGAATTGTTACTGGAATCCAATCGGTTTGGGATTCAACCAGAAATTCGAATTCGGATTCGGAATGCGGAAGGCAATGGAACGAGCAGCAATCCGACGAGTCGACGACGGAAACTGGTGGCGGTTTCCGACAGTCGACGTAGTCGAAATCTCTCCTAAACTAAATCCAAAGCCACCGGCTCCGGCACCGACACCGGCCGCCGTCGCAACAAAGAAGAAGAAGAAGAAAGTGGAGAAACTTACAGTAATCGAATCGAAGAAAGCAGCAACAACGCCACCGCAAAAGGAGAAATCAGAGAAGCCGACGATTCCGAAATCGAAACCTCCTGGATTGCTTCTGAAACTGAACTACGAAGCCGTCGCCGACGCTTGGTCCGACAGAGGATCCCCATTTTCCGACGAGGTTCCGGGATCCGATACGGCGGGAAGTGATGTAAATGTACGTCTCTCATCCCGCCCCCGAAAATCAAAACCCAAGAAACTCTGAAAAATAATAATAATAATAATAATAAATTTCCTCCCACCCCCGAAGTTTTAGGAATTCAAATCTTCTTAGTACTGTTTCGTTCTCCCATAAAATTAAAAATTGTAGTGTTAAATCTGTATTTTTAGTCCTTATGCTGTCGTCTCCTTTTCAATTAAGTTCCTATAGTTTTAGAACTTTCAATTTAAACTCTATCAAAAAAATTTTGCCAGTTATTAACAATTTTTTAACTTAGGTAGATTAAAAATTTGTAGGGATTAAATTAAAAATTTGAAACTATAGATTAAAATAAAAACGAAATTTATCTTCCTAAAAGAAAACCTATAATTATTTAATGTAATGTAGCATGGAGGTCTGGGAGACAAGGTGAAAGAATTAAAAATATTTTGACTTTTTCAAAAATAATGTAATTATTATTGTTTTTAATTATTTTATTTGGACCATTGATAGATTTCCTTGTCAGTGAATCACCTTTCTCATGAAAAAGATCTAAAGGGAAATTAAAAAGGGAAAAAGAAAAAAAAAAAGAAGTTAACTATGGTGTTTAATCATGAGAGCTTCATTCAAAAGGAGGGTAGTGTTGTAAATTAAAAACGAATTTTATCAAAAACCATAATGTGATAAAATAATTCAAAAACATAAAAACGAAAAATGGGTTGATTTCCAGTGAAATGACGAAACTGTCCTTGTTTATGGAGTAATATTAACTAAAGAGAGAGGGATATTTTTGGAAATTGGCAGGCTAGGCTGGCACAAATTGATTTATTTTCAGACGGAGGAGGATTATTGAGAGAAGCCAGTGTGTTACGATACAAGGAGAAAAGACGGACCCGCTTGTTCTCGAAGAAAATTAGATATCAAGTCAGGAAAGTCAATGCTGATGGACGGCCCAGAATGAAGGTATGTATGTGAATCTGGTACATCCACCCTTATGTGATTATAGCTTTTTCTTCTTTTTATTTTTCAAAGTGATTATAGTTTTAGTATTGTTTTTAAGGTTTACAAAATTATTATACTACTACTAGTACTAGATTCTATTCTTATTATTAAAATTTCAGTAATGAGTTCACCTAGAAAATGACATTATTTGGTTTATTTTTTCTTTGTTGAGTAGCATCTTCGTGAAATTGATTTTTTTAATAATAGAAATTTATTTAATTAATGACATGGAAATTATTTTATAAACTAAATATATGTTTTTTTTTATATATAAATTTAGTTTAGTAAAGTTGGAGGTGGGATAACCCTCCACCCCGCCATTGACCCACTTAAAAAATAAAACAATTTTGGCGGTATAAATTTGAATGTCTAACTTTTAAGACAAATGTATAATTTCTTAACCAGTTTATGTTAAGTTGTTATCTAAGTAATATATGTTTTAATTTTTATTATTTTGTTGTTTTCTCGATTTAATTATTATTATTTAAAAAAGTTTTAATTTTTTTTTTGTATATCCTCACCGTACTAAAATGTAGAGAATTGAATTAAAATTTTTAAAGCAATAAAAACAAAGTCGAAAAAACACAGTAACATTAGGGATAAAAACACATTTTTTAGCTTGTATTATATGTCACTTATTTAGAAATTGAAAAAAAAAAAACATAAAGTTTGTGTTTAGCAATTAGAGTAAAAAGTTAAAACCAAAAGATTAAAAGAATCTTCGTTAAAAAATCACAATGCTCGGATTCATAAATGACTAAAATTCAATCGACCAACCCAATAACTTTTAAAACCTTAGAAATCAAATTAGAAATTTCATATAAACTTTATAAACAACATATTATATTCTAGTTTAATCATAGTAAGTATCATCACATTATTTTCACAAATCATACACATATTTTTTTTAATTAAACTTTCAATTCGAGCATAATTCAAAGTAACCAAATTGTGTTATTAAATTAAATGTGGTAAGATTAAAAGTAATAGAAACTGAAAAGACATTTCCCTTTTGGCAGGGGCGATTTGTGAGGAGACCTAATTCAAGCGCCAGCTTACAGAGATAGACAATAGCCTTTAGGATTTAGAAACTCTCCTTTTGAGTTGTCAGCTTCACTTTTTGAGAGGACCTACTTTTTTTTTTTTTCCCTCAAATTTTTCTTGTATTAATTATTATATATCTCTCTTTTTCTTTTTTTTTTTCCTCTTCCTCTTCTGCATTTCCCCTATTTTGTTTATCTTTTATTTTCTTTTTTGAGATTATGAGAGAATGGCCACTAAGAGTAGATGGTCAGAGATTTAATAATTCAAAGGCTCTTTTTCAAATATATGTGTTTTTCTTTTGGGGGGCTCTGTTTTGTAGTGTTTGGTTTATAGTATCTAATGCTTTAAGAAGATGCTCACATGTTTGAATGGTC

mRNA sequence

CAGAAACTGAGCCTTGTCTCGTCTTGTGTTCTCTTCCTTTCTAGTAATCTCCATCGCCCAACTGCAAAAACACTCACACAGAAGAGAGAGAAAGAAACAAGAGAGAGAGAGAGAGAGAGAGAAATTCTTTGTTTCTCTCTGCAGAATTGCTATTATATCTACAGCACACCCAACGGATCTTTCCCTTTTATCTTCTCTTCTTCTTCTTCTTCTTCCTTCTCTCTCTGTTTTTTGAGGAAATTCAGACTGTGTGTATATATAGAAATCAGAGTGTTTGTTTTCTTAATTGGGTTAGTGAGTAGTTTTTGTTTGGTGGGAGATTTCCTATTTCCATGGAAGTTCCAAATTCCTCTGGTTTTCTTGGGGCAAGAGAGAGGGAAATTGGGAAATAACAAGAGAGACAGAGAGATAGAAGGAAAAGGAGAAGAGGGTTTTTGTTTTGGTTTAAGAACTTTGAAGTTTTGGGGTTTTGTTGGATTTTTCTTTTCTTTTTTTGGCGAGTTCTTCCGTACTGTTTCCCCTTTTTTGTGGGGATTTTTTCATGCCTAGGGTTTAGGGCTTTAGAAAGTCAACACTCATTTCCACTAATTTTCCAGGTTTTTGATGGAATTGAGCTGTTCTAGAGGGATTTCTTTGAATTCTTGAGGAAAAATTTTCTTGGTAACCAAACAGAATTTACAGTGTCTTTTTCTTTTTCTTGTTCTTGTTGTTCTGAATATATCCAATGTCTTCACCATGTATAAGTGGTGGTGGAAGAGCTTACAATTTCGACCTCGAAATTGTGAAATCTCCATCTTCTTCATGGACGAGAACTTCTCAAACTTCATCGCCTTCTTCAACTCTCTCCGAATCAAGCAATACACAGCTAGCAATTTCAACTCGAAAATCCCGAACTCCTCGAAAACGCCCAAATCAAACCTACAATGAGGCCACGGTTTTGCTATCTACGGCATATCCAAATGTTTTCTCTACCAAACACCTCACGAATCCGCGAAAATTCACCGAATCACACGACTCTCTGTTCTGTGAATCGGCCGAATTGCTCTTGCCTTTCCGAGTAATCGATAGCTCTGGATTTCTCCTCCACCAACCGCTTCTACAAGAAAAACCTAATTCCCAAATTCAATCGAAATTGGCGAATCTGTGGGAGAGCCGGCCATGTTCGAGCCCAGGGGAGATCGATTTCCAACCGAATTCGATGGAGATGGAAGATTTCGATGCCGAATCGATTCTCGACGAGGAAATCGAGGAAGGAATCGATAGTATCATGGGGAATCTGTGTGTGGATAACCTAGAAACTGCTACTTCAGCGCAAGATTATTCTTGTGCAAACCCTAAGAATTGGAATTGTTACTGGAATCCAATCGGTTTGGGATTCAACCAGAAATTCGAATTCGGATTCGGAATGCGGAAGGCAATGGAACGAGCAGCAATCCGACGAGTCGACGACGGAAACTGGTGGCGGTTTCCGACAGTCGACGTAGTCGAAATCTCTCCTAAACTAAATCCAAAGCCACCGGCTCCGGCACCGACACCGGCCGCCGTCGCAACAAAGAAGAAGAAGAAGAAAGTGGAGAAACTTACAGTAATCGAATCGAAGAAAGCAGCAACAACGCCACCGCAAAAGGAGAAATCAGAGAAGCCGACGATTCCGAAATCGAAACCTCCTGGATTGCTTCTGAAACTGAACTACGAAGCCGTCGCCGACGCTTGGTCCGACAGAGGATCCCCATTTTCCGACGAGGTTCCGGGATCCGATACGGCGGGAAGTGATGTAAATGCTAGGCTGGCACAAATTGATTTATTTTCAGACGGAGGAGGATTATTGAGAGAAGCCAGTGTGTTACGATACAAGGAGAAAAGACGGACCCGCTTGTTCTCGAAGAAAATTAGATATCAAGTCAGGAAAGTCAATGCTGATGGACGGCCCAGAATGAAGGGGCGATTTGTGAGGAGACCTAATTCAAGCGCCAGCTTACAGAGATAGACAATAGCCTTTAGGATTTAGAAACTCTCCTTTTGAGTTGTCAGCTTCACTTTTTGAGAGGACCTACTTTTTTTTTTTTTCCCTCAAATTTTTCTTGTATTAATTATTATATATCTCTCTTTTTCTTTTTTTTTTTCCTCTTCCTCTTCTGCATTTCCCCTATTTTGTTTATCTTTTATTTTCTTTTTTGAGATTATGAGAGAATGGCCACTAAGAGTAGATGGTCAGAGATTTAATAATTCAAAGGCTCTTTTTCAAATATATGTGTTTTTCTTTTGGGGGGCTCTGTTTTGTAGTGTTTGGTTTATAGTATCTAATGCTTTAAGAAGATGCTCACATGTTTGAATGGTC

Coding sequence (CDS)

ATGTCTTCACCATGTATAAGTGGTGGTGGAAGAGCTTACAATTTCGACCTCGAAATTGTGAAATCTCCATCTTCTTCATGGACGAGAACTTCTCAAACTTCATCGCCTTCTTCAACTCTCTCCGAATCAAGCAATACACAGCTAGCAATTTCAACTCGAAAATCCCGAACTCCTCGAAAACGCCCAAATCAAACCTACAATGAGGCCACGGTTTTGCTATCTACGGCATATCCAAATGTTTTCTCTACCAAACACCTCACGAATCCGCGAAAATTCACCGAATCACACGACTCTCTGTTCTGTGAATCGGCCGAATTGCTCTTGCCTTTCCGAGTAATCGATAGCTCTGGATTTCTCCTCCACCAACCGCTTCTACAAGAAAAACCTAATTCCCAAATTCAATCGAAATTGGCGAATCTGTGGGAGAGCCGGCCATGTTCGAGCCCAGGGGAGATCGATTTCCAACCGAATTCGATGGAGATGGAAGATTTCGATGCCGAATCGATTCTCGACGAGGAAATCGAGGAAGGAATCGATAGTATCATGGGGAATCTGTGTGTGGATAACCTAGAAACTGCTACTTCAGCGCAAGATTATTCTTGTGCAAACCCTAAGAATTGGAATTGTTACTGGAATCCAATCGGTTTGGGATTCAACCAGAAATTCGAATTCGGATTCGGAATGCGGAAGGCAATGGAACGAGCAGCAATCCGACGAGTCGACGACGGAAACTGGTGGCGGTTTCCGACAGTCGACGTAGTCGAAATCTCTCCTAAACTAAATCCAAAGCCACCGGCTCCGGCACCGACACCGGCCGCCGTCGCAACAAAGAAGAAGAAGAAGAAAGTGGAGAAACTTACAGTAATCGAATCGAAGAAAGCAGCAACAACGCCACCGCAAAAGGAGAAATCAGAGAAGCCGACGATTCCGAAATCGAAACCTCCTGGATTGCTTCTGAAACTGAACTACGAAGCCGTCGCCGACGCTTGGTCCGACAGAGGATCCCCATTTTCCGACGAGGTTCCGGGATCCGATACGGCGGGAAGTGATGTAAATGCTAGGCTGGCACAAATTGATTTATTTTCAGACGGAGGAGGATTATTGAGAGAAGCCAGTGTGTTACGATACAAGGAGAAAAGACGGACCCGCTTGTTCTCGAAGAAAATTAGATATCAAGTCAGGAAAGTCAATGCTGATGGACGGCCCAGAATGAAGGGGCGATTTGTGAGGAGACCTAATTCAAGCGCCAGCTTACAGAGATAG

Protein sequence

MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEIEEGIDSIMGNLCVDNLETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIESKKAATTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNSSASLQR
Homology
BLAST of Tan0017001 vs. ExPASy Swiss-Prot
Match: Q9LU68 (Protein CHLOROPLAST IMPORT APPARATUS 2 OS=Arabidopsis thaliana OX=3702 GN=CIA2 PE=1 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 2.3e-75
Identity = 213/452 (47.12%), Postives = 266/452 (58.85%), Query Frame = 0

Query: 3   SPCIS---GGGRAYNFDLEIVKS-PSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTP 62
           S C+S   GG  AY+F+LE VKS P SS T T++ +SPSST+SESSN+ LAISTRK RT 
Sbjct: 2   SACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRTQ 61

Query: 63  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFC--------ESAELLLPF 122
           RKRPNQTYNEA  LLSTAYPN+FS+ +L++ +K   S +S F         ++++LLLP+
Sbjct: 62  RKRPNQTYNEAATLLSTAYPNIFSS-NLSSKQKTHSSSNSHFYGPLLSDNDDASDLLLPY 121

Query: 123 RVIDSSGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESIL 182
             I+   FL H P +Q K       K  N      C   GEI+    S   ++FDAESIL
Sbjct: 122 ESIEEPDFLFH-PTIQTKTEFFSDQKEVN--SGGDCYG-GEIEKFDFS---DEFDAESIL 181

Query: 183 DEEIEEGIDSIMGNLCVDNLETA---------------TSAQDYSCANPKNWNCYWNPIG 242
           DE+IEEGIDSIMG +   N  +                +S+              WN   
Sbjct: 182 DEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRGGRSSSNRIGKLEQMMMINSWNRSS 241

Query: 243 LGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVA 302
            GFN  F  G G+     R+A+R  DD   W+  TVD  +ISP++         T  A++
Sbjct: 242 NGFN--FPLGLGL-----RSALRENDDTKLWKIHTVDFEQISPRIQ-----TVKTETAIS 301

Query: 303 TKKKKKKVEKLTVIESKKA-----------ATTPPQKEKSEKPTIPKS-KPPGLLLKLNY 362
           T  ++K   K  VI  +K+            TT   + KS + T   S K  G LLKL+Y
Sbjct: 302 TVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLKLDY 361

Query: 363 EAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTR 416
           + V +AWSD+ SPF DE+ GS+    DVNARLAQIDLF D G  +REASVLRYKEKRRTR
Sbjct: 362 DGVLEAWSDKTSPFPDEIQGSEAV--DVNARLAQIDLFGDSG--MREASVLRYKEKRRTR 421

BLAST of Tan0017001 vs. ExPASy Swiss-Prot
Match: Q8RWD0 (Zinc finger protein CONSTANS-LIKE 16 OS=Arabidopsis thaliana OX=3702 GN=COL16 PE=1 SV=2)

HSP 1 Score: 86.3 bits (212), Expect = 9.2e-16
Identity = 53/120 (44.17%), Postives = 67/120 (55.83%), Query Frame = 0

Query: 317 LLLKLNYEAVADAWSDRGSPFSDEVP-------------------GSDTAGSDVNARLAQ 376
           L+L+LNY++V   W  +G P+S   P                   G  T           
Sbjct: 293 LMLRLNYDSVISTWGGQGPPWSSGEPPERDMDISGWPAFSMVENGGESTHQKQYVGGCLP 352

Query: 377 IDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNSSAS 418
              F DGG   REA V RY+EKRRTRLFSKKIRY+VRK+NA+ RPRMKGRFV+R + +A+
Sbjct: 353 SSGFGDGG---REARVSRYREKRRTRLFSKKIRYEVRKLNAEKRPRMKGRFVKRASLAAA 409

BLAST of Tan0017001 vs. ExPASy Swiss-Prot
Match: Q8LG76 (Zinc finger protein CONSTANS-LIKE 6 OS=Arabidopsis thaliana OX=3702 GN=COL6 PE=2 SV=2)

HSP 1 Score: 84.0 bits (206), Expect = 4.6e-15
Identity = 52/117 (44.44%), Postives = 68/117 (58.12%), Query Frame = 0

Query: 311 KSKPPGLLLKLNYEAVADAWSDRGSPFSDEVP---------------GSDTAGSDVNARL 370
           + K   L+L+L+YE+V   W  +G P++  VP               G   A +  +   
Sbjct: 283 EKKEKALMLRLDYESVISTWGGQGIPWTARVPSEIDLDMVCFPTHTMGESGAEAHHHNHF 342

Query: 371 AQIDL-FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRR 412
             + L   D G   REA V RY+EKRRTRLFSKKIRY+VRK+NA+ RPRMKGRFV+R
Sbjct: 343 RGLGLHLGDAGDGGREARVSRYREKRRTRLFSKKIRYEVRKLNAEKRPRMKGRFVKR 399

BLAST of Tan0017001 vs. ExPASy Swiss-Prot
Match: Q9C9A9 (Zinc finger protein CONSTANS-LIKE 7 OS=Arabidopsis thaliana OX=3702 GN=COL7 PE=2 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 8.6e-14
Identity = 60/150 (40.00%), Postives = 80/150 (53.33%), Query Frame = 0

Query: 278 KKKKKVEKLTVIESKKAATTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPF 337
           K  K+V+     E +        K+   + +  K +   L L+L+Y AV  AW + GSP+
Sbjct: 238 KDLKRVKDEDEEEEEAKCENGGSKDSDREASNDKDRKTSLFLRLDYGAVISAWDNHGSPW 297

Query: 338 SDEVP-----GSDTAGSDVNARLAQI---------DLFSDGGGL--LREASVLRYKEKRR 397
              +      G +T    V     ++             DGGG    REA VLRYKEKRR
Sbjct: 298 KTGIKPECMLGGNTCLPHVVGGYEKLMSSDGSVTRQQGRDGGGSDGEREARVLRYKEKRR 357

Query: 398 TRLFSKKIRYQVRKVNADGRPRMKGRFVRR 412
           TRLFSKKIRY+VRK+NA+ RPR+KGRFV+R
Sbjct: 358 TRLFSKKIRYEVRKLNAEQRPRIKGRFVKR 387

BLAST of Tan0017001 vs. ExPASy Swiss-Prot
Match: Q9M9B3 (Zinc finger protein CONSTANS-LIKE 8 OS=Arabidopsis thaliana OX=3702 GN=COL8 PE=1 SV=2)

HSP 1 Score: 74.3 bits (181), Expect = 3.6e-12
Identity = 46/92 (50.00%), Postives = 60/92 (65.22%), Query Frame = 0

Query: 320 KLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEK 379
           +LNYE V  AW  + SP   +V  + ++   V   + +  + S+     REA V RY++K
Sbjct: 229 RLNYENVIAAWDKQESP--RDVKNNTSSFQLVPPGIEEKRVRSE-----REARVWRYRDK 288

Query: 380 RRTRLFSKKIRYQVRKVNADGRPRMKGRFVRR 412
           R+ RLF KKIRY+VRKVNAD RPRMKGRFVRR
Sbjct: 289 RKNRLFEKKIRYEVRKVNADKRPRMKGRFVRR 313

BLAST of Tan0017001 vs. NCBI nr
Match: XP_038890141.1 (protein CHLOROPLAST IMPORT APPARATUS 2 isoform X2 [Benincasa hispida])

HSP 1 Score: 711.8 bits (1836), Expect = 3.4e-201
Identity = 385/435 (88.51%), Postives = 397/435 (91.26%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESS-NTQLAISTRKSRTPR 60
           MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESS NTQLAISTRKSRTPR
Sbjct: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNNTQLAISTRKSRTPR 60

Query: 61  KRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFL 120
           KRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHDSLFCESAELLLPFRVIDSSGFL
Sbjct: 61  KRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDSLFCESAELLLPFRVIDSSGFL 120

Query: 121 LHQ-PLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSM---EMEDFDAESILDEEIE 180
           LHQ PL+ E+PNSQI SKL NLWE+RPCSSPGEIDFQPNSM   E+EDFDAESILDEEIE
Sbjct: 121 LHQPPLVHERPNSQIHSKLTNLWENRPCSSPGEIDFQPNSMEIEEIEDFDAESILDEEIE 180

Query: 181 EGIDSIMGNLCVDNLETATSAQDYSCAN----PKNWNCYWNPIGLGFNQKFEFGFGMRKA 240
           EGIDSIMGNL VDNLET  S QD SC N    P+NWNCYW+PIGLGFNQKFE GFGMRK 
Sbjct: 181 EGIDSIMGNLSVDNLETGNSTQD-SCVNVNNHPRNWNCYWSPIGLGFNQKFESGFGMRKG 240

Query: 241 MERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIES 300
           +ER AIR VD+GNWWRFPTVDVVEISPKLNPKPP PAP P+AVATKKKKKKVEKLTVIES
Sbjct: 241 IERTAIRGVDNGNWWRFPTVDVVEISPKLNPKPPVPAPAPSAVATKKKKKKVEKLTVIES 300

Query: 301 KKAATTPPQK------EKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSD 360
           KKAA  P QK      EKSEKP IPK KP GLLLKLNYEAVADAWS RGSPFSDE+PGSD
Sbjct: 301 KKAA-MPLQKEKLEKSEKSEKP-IPKLKPTGLLLKLNYEAVADAWSSRGSPFSDEIPGSD 360

Query: 361 TAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMK 420
           TAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMK
Sbjct: 361 TAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMK 420

BLAST of Tan0017001 vs. NCBI nr
Match: XP_022149812.1 (protein CHLOROPLAST IMPORT APPARATUS 2 [Momordica charantia])

HSP 1 Score: 709.9 bits (1831), Expect = 1.3e-200
Identity = 378/429 (88.11%), Postives = 394/429 (91.84%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK
Sbjct: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLST YPNVFSTKHLTNPRKFT+ HDSLFCESAELLLPFRVIDSSGFLL
Sbjct: 61  RPNQTYNEATVLLSTVYPNVFSTKHLTNPRKFTKPHDSLFCESAELLLPFRVIDSSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEM-------EDFDAESILDEE 180
           HQPLLQEKPNSQIQ+KLANLWE RPCSSPGEIDFQPNSME+       +DFDAESILDEE
Sbjct: 121 HQPLLQEKPNSQIQTKLANLWEGRPCSSPGEIDFQPNSMEIDGGCGQDDDFDAESILDEE 180

Query: 181 IEEGIDSIMGNLCVDNLETATSAQDYSCANPKNWNCYWN--PIGLGFNQKFEFGFGMRKA 240
           IEEGIDSIMGNL VDNLETA S QDYSC NP+NWNCYWN  PIGLG NQKFEFGFGM+KA
Sbjct: 181 IEEGIDSIMGNLSVDNLETANSPQDYSCTNPRNWNCYWNNPPIGLGLNQKFEFGFGMQKA 240

Query: 241 MERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIES 300
            ERAAIR+V+DGNWWRFPTVDVVEISPKLN KPPA    PAAV+TKKKKKK+EKLTVIE+
Sbjct: 241 TERAAIRQVEDGNWWRFPTVDVVEISPKLN-KPPA----PAAVSTKKKKKKMEKLTVIET 300

Query: 301 KKAATTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDV 360
           KKAAT P   +K +KPT PKSK PGLLLKLNYEAVADAWSDRGSPFSDE+PGSDTAG+DV
Sbjct: 301 KKAATIP---QKDQKPT-PKSK-PGLLLKLNYEAVADAWSDRGSPFSDEIPGSDTAGNDV 360

Query: 361 NARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRR 420
           N RLAQ+DLF D GGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNAD RPRMKGRFVRR
Sbjct: 361 NTRLAQVDLFLDSGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADERPRMKGRFVRR 419

BLAST of Tan0017001 vs. NCBI nr
Match: XP_023537773.1 (protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 708.4 bits (1827), Expect = 3.8e-200
Identity = 373/417 (89.45%), Postives = 388/417 (93.05%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSP ISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE      AISTRKSRTPRK
Sbjct: 1   MSSPFISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE------AISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHDSLFCESAELLLPFRVIDSSGFLL
Sbjct: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDSLFCESAELLLPFRVIDSSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEIEEGIDS 180
           +QPLL EKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEE+EEGIDS
Sbjct: 121 NQPLLHEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEVEEGIDS 180

Query: 181 IMGNLCVDNLETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRV 240
           IMGNL VDNLETA SAQD S  NPKN NCYWNPIGLGFNQKFEFGFGMRKA+ERAAIRRV
Sbjct: 181 IMGNLSVDNLETANSAQDCSSDNPKNRNCYWNPIGLGFNQKFEFGFGMRKAIERAAIRRV 240

Query: 241 DDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIESKKAATTPPQ 300
           DDGNWWRFPTVDVV+ISPKLNPKPPAP PT A  +TKKKKKK+EKLTVIESKK  T PP+
Sbjct: 241 DDGNWWRFPTVDVVDISPKLNPKPPAPTPTVA--STKKKKKKMEKLTVIESKK--TAPPK 300

Query: 301 KEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDL 360
           ++ SE PTIPKSKPPGLLLKLNYEAVADAWS RGSPFSD  PGS +AG+DVNARLAQIDL
Sbjct: 301 EKSSENPTIPKSKPPGLLLKLNYEAVADAWSARGSPFSDNNPGSKSAGNDVNARLAQIDL 360

Query: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNSSAS 418
           FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQV+KVN D RPR+KGRFVRRPNSSA+
Sbjct: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVKKVNDDLRPRLKGRFVRRPNSSAT 407

BLAST of Tan0017001 vs. NCBI nr
Match: XP_022937796.1 (protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucurbita moschata])

HSP 1 Score: 707.6 bits (1825), Expect = 6.4e-200
Identity = 374/417 (89.69%), Postives = 387/417 (92.81%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSP ISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE      AISTRKSRTPRK
Sbjct: 1   MSSPFISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE------AISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHDSLFCESAELLLPFRVIDSSGFLL
Sbjct: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDSLFCESAELLLPFRVIDSSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEIEEGIDS 180
           +QPLL EKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEE+EEGIDS
Sbjct: 121 NQPLLHEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEVEEGIDS 180

Query: 181 IMGNLCVDNLETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRV 240
           IMGNL VDNLETA SAQD S  NPKN NCYWNPIGLGFNQKFEFGFGMRKA+ERAAIRRV
Sbjct: 181 IMGNLSVDNLETANSAQDCSTDNPKNRNCYWNPIGLGFNQKFEFGFGMRKAIERAAIRRV 240

Query: 241 DDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIESKKAATTPPQ 300
           DDGNWWRFPTVDVV ISPKLNPKPPAP PT A  +TKKKKKKVEKLTVIESKK  T PP+
Sbjct: 241 DDGNWWRFPTVDVVAISPKLNPKPPAPTPTVA--STKKKKKKVEKLTVIESKK--TAPPK 300

Query: 301 KEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDL 360
           ++ SEK TIPKSKPPGLLLKLNYEAVADAWS RGSPFSD  PGS +AG+DVNARLAQIDL
Sbjct: 301 EKSSEKQTIPKSKPPGLLLKLNYEAVADAWSARGSPFSDNNPGSKSAGNDVNARLAQIDL 360

Query: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNSSAS 418
           FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQV+KVN D RPR+KGRFVRRPNSSA+
Sbjct: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVKKVNDDVRPRLKGRFVRRPNSSAT 407

BLAST of Tan0017001 vs. NCBI nr
Match: KAG6586108.1 (Protein CHLOROPLAST IMPORT APPARATUS 2, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020932.1 Protein CHLOROPLAST IMPORT APPARATUS 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 707.2 bits (1824), Expect = 8.4e-200
Identity = 373/414 (90.10%), Postives = 385/414 (93.00%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSP ISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE      AISTRKSRTPRK
Sbjct: 1   MSSPFISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE------AISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHDSLFCESAELLLPFRVIDSSGFLL
Sbjct: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDSLFCESAELLLPFRVIDSSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEIEEGIDS 180
           +QPLL EKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEE+EEGIDS
Sbjct: 121 NQPLLHEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEVEEGIDS 180

Query: 181 IMGNLCVDNLETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRV 240
           IMGNL VDNLETA SAQD S  NPKN NCYWNPIGLGFNQKFEFGFGMRKA+ERAAIRRV
Sbjct: 181 IMGNLSVDNLETANSAQDCSTDNPKNRNCYWNPIGLGFNQKFEFGFGMRKAIERAAIRRV 240

Query: 241 DDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIESKKAATTPPQ 300
           DDGNWWRFPTVDVV ISPKLNPKPPAP PT A  +TKKKKKKVEKLTVIESKK  T PP+
Sbjct: 241 DDGNWWRFPTVDVVAISPKLNPKPPAPTPTVA--STKKKKKKVEKLTVIESKK--TAPPK 300

Query: 301 KEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDL 360
           ++ SEKPTIPKSKPPGLLLKLNYEAVADAWS RGSPFSD  PGS +AG+DVNARLAQIDL
Sbjct: 301 EKSSEKPTIPKSKPPGLLLKLNYEAVADAWSARGSPFSDNNPGSKSAGNDVNARLAQIDL 360

Query: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNS 415
           FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQV+KVN D RPR+KGRFVRRPNS
Sbjct: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVKKVNDDVRPRLKGRFVRRPNS 404

BLAST of Tan0017001 vs. ExPASy TrEMBL
Match: A0A6J1D908 (protein CHLOROPLAST IMPORT APPARATUS 2 OS=Momordica charantia OX=3673 GN=LOC111018157 PE=4 SV=1)

HSP 1 Score: 709.9 bits (1831), Expect = 6.3e-201
Identity = 378/429 (88.11%), Postives = 394/429 (91.84%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK
Sbjct: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLST YPNVFSTKHLTNPRKFT+ HDSLFCESAELLLPFRVIDSSGFLL
Sbjct: 61  RPNQTYNEATVLLSTVYPNVFSTKHLTNPRKFTKPHDSLFCESAELLLPFRVIDSSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEM-------EDFDAESILDEE 180
           HQPLLQEKPNSQIQ+KLANLWE RPCSSPGEIDFQPNSME+       +DFDAESILDEE
Sbjct: 121 HQPLLQEKPNSQIQTKLANLWEGRPCSSPGEIDFQPNSMEIDGGCGQDDDFDAESILDEE 180

Query: 181 IEEGIDSIMGNLCVDNLETATSAQDYSCANPKNWNCYWN--PIGLGFNQKFEFGFGMRKA 240
           IEEGIDSIMGNL VDNLETA S QDYSC NP+NWNCYWN  PIGLG NQKFEFGFGM+KA
Sbjct: 181 IEEGIDSIMGNLSVDNLETANSPQDYSCTNPRNWNCYWNNPPIGLGLNQKFEFGFGMQKA 240

Query: 241 MERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIES 300
            ERAAIR+V+DGNWWRFPTVDVVEISPKLN KPPA    PAAV+TKKKKKK+EKLTVIE+
Sbjct: 241 TERAAIRQVEDGNWWRFPTVDVVEISPKLN-KPPA----PAAVSTKKKKKKMEKLTVIET 300

Query: 301 KKAATTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDV 360
           KKAAT P   +K +KPT PKSK PGLLLKLNYEAVADAWSDRGSPFSDE+PGSDTAG+DV
Sbjct: 301 KKAATIP---QKDQKPT-PKSK-PGLLLKLNYEAVADAWSDRGSPFSDEIPGSDTAGNDV 360

Query: 361 NARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRR 420
           N RLAQ+DLF D GGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNAD RPRMKGRFVRR
Sbjct: 361 NTRLAQVDLFLDSGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADERPRMKGRFVRR 419

BLAST of Tan0017001 vs. ExPASy TrEMBL
Match: A0A6J1FBD7 (protein CHLOROPLAST IMPORT APPARATUS 2-like OS=Cucurbita moschata OX=3662 GN=LOC111444083 PE=4 SV=1)

HSP 1 Score: 707.6 bits (1825), Expect = 3.1e-200
Identity = 374/417 (89.69%), Postives = 387/417 (92.81%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSP ISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE      AISTRKSRTPRK
Sbjct: 1   MSSPFISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE------AISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHDSLFCESAELLLPFRVIDSSGFLL
Sbjct: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDSLFCESAELLLPFRVIDSSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEIEEGIDS 180
           +QPLL EKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEE+EEGIDS
Sbjct: 121 NQPLLHEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEVEEGIDS 180

Query: 181 IMGNLCVDNLETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRV 240
           IMGNL VDNLETA SAQD S  NPKN NCYWNPIGLGFNQKFEFGFGMRKA+ERAAIRRV
Sbjct: 181 IMGNLSVDNLETANSAQDCSTDNPKNRNCYWNPIGLGFNQKFEFGFGMRKAIERAAIRRV 240

Query: 241 DDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIESKKAATTPPQ 300
           DDGNWWRFPTVDVV ISPKLNPKPPAP PT A  +TKKKKKKVEKLTVIESKK  T PP+
Sbjct: 241 DDGNWWRFPTVDVVAISPKLNPKPPAPTPTVA--STKKKKKKVEKLTVIESKK--TAPPK 300

Query: 301 KEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDL 360
           ++ SEK TIPKSKPPGLLLKLNYEAVADAWS RGSPFSD  PGS +AG+DVNARLAQIDL
Sbjct: 301 EKSSEKQTIPKSKPPGLLLKLNYEAVADAWSARGSPFSDNNPGSKSAGNDVNARLAQIDL 360

Query: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNSSAS 418
           FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQV+KVN D RPR+KGRFVRRPNSSA+
Sbjct: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVKKVNDDVRPRLKGRFVRRPNSSAT 407

BLAST of Tan0017001 vs. ExPASy TrEMBL
Match: A0A6J1HLT7 (protein CHLOROPLAST IMPORT APPARATUS 2-like OS=Cucurbita maxima OX=3661 GN=LOC111465375 PE=4 SV=1)

HSP 1 Score: 700.7 bits (1807), Expect = 3.8e-198
Identity = 369/417 (88.49%), Postives = 385/417 (92.33%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRK 60
           MSSP ISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE      AISTRKSRTPRK
Sbjct: 1   MSSPFISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSE------AISTRKSRTPRK 60

Query: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFCESAELLLPFRVIDSSGFLL 120
           RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+S DSLFCESAELLLPFRVI+SSGFLL
Sbjct: 61  RPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSPDSLFCESAELLLPFRVIESSGFLL 120

Query: 121 HQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEIEEGIDS 180
           +QPLL EKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEE+EEGIDS
Sbjct: 121 NQPLLHEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESILDEEVEEGIDS 180

Query: 181 IMGNLCVDNLETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRV 240
           IMGNL VDNLET  SAQD S  NPKNWNCYWNPIGLGFNQKFEFGFGMRKA+ERAAIRRV
Sbjct: 181 IMGNLSVDNLETPNSAQDCSSDNPKNWNCYWNPIGLGFNQKFEFGFGMRKAIERAAIRRV 240

Query: 241 DDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVATKKKKKKVEKLTVIESKKAATTPPQ 300
           DDGNWWRFPTVDVV+ISPKLNPKPPAP PT A+   KKKKKKVEKLTVIESKK  T PP+
Sbjct: 241 DDGNWWRFPTVDVVDISPKLNPKPPAPTPTVAS-TKKKKKKKVEKLTVIESKK--TAPPK 300

Query: 301 KEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDL 360
           ++ SEKPTIPKSKPPGLLLKLNYEAVADAWS RGSPF+D   GS +AG+DVNARLAQIDL
Sbjct: 301 EKSSEKPTIPKSKPPGLLLKLNYEAVADAWSARGSPFTDNHMGSKSAGNDVNARLAQIDL 360

Query: 361 FSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNSSAS 418
           FSDGGGLLREASVLRYKEKRRTRL SKKIRYQV+KVN D RPR+KGRFVRRPNSSA+
Sbjct: 361 FSDGGGLLREASVLRYKEKRRTRLLSKKIRYQVKKVNDDLRPRLKGRFVRRPNSSAT 408

BLAST of Tan0017001 vs. ExPASy TrEMBL
Match: A0A0A0LGV4 (CCT domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G061540 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 2.5e-194
Identity = 375/433 (86.61%), Postives = 389/433 (89.84%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSN--TQLAISTRKSRTP 60
           MSSPCISGGGRAYNFDLEI+KSPSSSWTRTSQTSSPSSTLSESSN  TQLAISTRK RTP
Sbjct: 1   MSSPCISGGGRAYNFDLEILKSPSSSWTRTSQTSSPSSTLSESSNNTTQLAISTRKLRTP 60

Query: 61  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHD---SLFCESAELLLPFRVIDS 120
           RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHD   SLFCESAELLLPFRVIDS
Sbjct: 61  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDDSSSLFCESAELLLPFRVIDS 120

Query: 121 SGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSM---EMEDFDAESILDE 180
           SGFLLHQPLL+EKPNSQI SKL NLWE+RPCSSPGEIDFQPNSM   E+EDFDAESILDE
Sbjct: 121 SGFLLHQPLLEEKPNSQIHSKLTNLWENRPCSSPGEIDFQPNSMEIEEIEDFDAESILDE 180

Query: 181 EIEEGIDSIMGNLCVDNLETATSAQDYSCAN----PKNWNCYWNPIGLGFNQKFEFGFGM 240
           EIEEGIDSIMGNL VDNLE   S QD SC N    P+NWN  WNPIGLGFNQKFE GFG 
Sbjct: 181 EIEEGIDSIMGNLSVDNLEKGNSTQD-SCVNANNHPRNWN--WNPIGLGFNQKFESGFGF 240

Query: 241 RKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPA------PTPAAVATKKKKKK 300
           RK +ER AIR VD+GNWWRFPTVDV+EISPKLNPKPPAPA      PTPAAV+TKKKKKK
Sbjct: 241 RKGIERTAIRGVDNGNWWRFPTVDVIEISPKLNPKPPAPAPTPTPTPTPAAVSTKKKKKK 300

Query: 301 VEKLTVIESKKAATTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVP 360
           VEKLTVIESKKAA  P QKEKSEKP IPK KP GLLLKLNYEAVADAWS RGSPFSDE+P
Sbjct: 301 VEKLTVIESKKAA-IPLQKEKSEKP-IPKLKPTGLLLKLNYEAVADAWSSRGSPFSDEIP 360

Query: 361 GSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRP 416
            SDTAGSDVNAR+A IDLF++GGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRP
Sbjct: 361 SSDTAGSDVNARVANIDLFTEGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRP 420

BLAST of Tan0017001 vs. ExPASy TrEMBL
Match: A0A1S3BQX8 (protein CHLOROPLAST IMPORT APPARATUS 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492569 PE=4 SV=1)

HSP 1 Score: 683.7 bits (1763), Expect = 4.8e-193
Identity = 375/435 (86.21%), Postives = 387/435 (88.97%), Query Frame = 0

Query: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSN--TQLAISTRKSRTP 60
           MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSN  TQLAISTRK RTP
Sbjct: 1   MSSPCISGGGRAYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNNTTQLAISTRKLRTP 60

Query: 61  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHD---SLFCESAELLLPFRVIDS 120
           RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFT+SHD   SLFCESAELLLPFRVIDS
Sbjct: 61  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTKSHDDSSSLFCESAELLLPFRVIDS 120

Query: 121 SGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSM---EMEDFDAESILDE 180
           SGFLLHQPLL+EKPNSQI SKL NLWE+RPCSSPGEIDFQPNSM   E+EDFDAESILDE
Sbjct: 121 SGFLLHQPLLEEKPNSQIHSKLTNLWENRPCSSPGEIDFQPNSMEIEEIEDFDAESILDE 180

Query: 181 EIEEGIDSIMGNLCVDNLETATSAQDYSCAN----PKNWNCYWNPIGLGFNQKFEFGFGM 240
           EIEEGIDSIMGNL VDNLE   S QD SC N     +NWN  WNPIGLGFNQKFE GFG 
Sbjct: 181 EIEEGIDSIMGNLSVDNLENGNSTQD-SCVNANNHQRNWN--WNPIGLGFNQKFESGFGF 240

Query: 241 RKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPA------PTPAAVATKKKKKK 300
           RK +ER AIR VD+GNWWRFPTVDV+EISPKLNPKPPAPA      PTPAAV+TKKKKKK
Sbjct: 241 RKGIERTAIRGVDNGNWWRFPTVDVIEISPKLNPKPPAPAPAPTPTPTPAAVSTKKKKKK 300

Query: 301 VEKLTVIESKKAATTPPQKEKSEK--PTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDE 360
           VEKLTVIESKKAA  P QKEKSEK    IPK KP GLLLKLNYEAVADAWS RGSPFSDE
Sbjct: 301 VEKLTVIESKKAA-IPLQKEKSEKSEKPIPKLKPAGLLLKLNYEAVADAWSSRGSPFSDE 360

Query: 361 VPGSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADG 416
           +P SDTAGSDVNARLA IDLF++GGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADG
Sbjct: 361 IPSSDTAGSDVNARLANIDLFTEGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADG 420

BLAST of Tan0017001 vs. TAIR 10
Match: AT5G57180.2 (chloroplast import apparatus 2 )

HSP 1 Score: 284.3 bits (726), Expect = 1.6e-76
Identity = 213/452 (47.12%), Postives = 266/452 (58.85%), Query Frame = 0

Query: 3   SPCIS---GGGRAYNFDLEIVKS-PSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTP 62
           S C+S   GG  AY+F+LE VKS P SS T T++ +SPSST+SESSN+ LAISTRK RT 
Sbjct: 2   SACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRTQ 61

Query: 63  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFC--------ESAELLLPF 122
           RKRPNQTYNEA  LLSTAYPN+FS+ +L++ +K   S +S F         ++++LLLP+
Sbjct: 62  RKRPNQTYNEAATLLSTAYPNIFSS-NLSSKQKTHSSSNSHFYGPLLSDNDDASDLLLPY 121

Query: 123 RVIDSSGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESIL 182
             I+   FL H P +Q K       K  N      C   GEI+    S   ++FDAESIL
Sbjct: 122 ESIEEPDFLFH-PTIQTKTEFFSDQKEVN--SGGDCYG-GEIEKFDFS---DEFDAESIL 181

Query: 183 DEEIEEGIDSIMGNLCVDNLETA---------------TSAQDYSCANPKNWNCYWNPIG 242
           DE+IEEGIDSIMG +   N  +                +S+              WN   
Sbjct: 182 DEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRGGRSSSNRIGKLEQMMMINSWNRSS 241

Query: 243 LGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVA 302
            GFN  F  G G+     R+A+R  DD   W+  TVD  +ISP++         T  A++
Sbjct: 242 NGFN--FPLGLGL-----RSALRENDDTKLWKIHTVDFEQISPRIQ-----TVKTETAIS 301

Query: 303 TKKKKKKVEKLTVIESKKA-----------ATTPPQKEKSEKPTIPKS-KPPGLLLKLNY 362
           T  ++K   K  VI  +K+            TT   + KS + T   S K  G LLKL+Y
Sbjct: 302 TVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLKLDY 361

Query: 363 EAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTR 416
           + V +AWSD+ SPF DE+ GS+    DVNARLAQIDLF D G  +REASVLRYKEKRRTR
Sbjct: 362 DGVLEAWSDKTSPFPDEIQGSEAV--DVNARLAQIDLFGDSG--MREASVLRYKEKRRTR 421

BLAST of Tan0017001 vs. TAIR 10
Match: AT5G57180.1 (chloroplast import apparatus 2 )

HSP 1 Score: 266.5 bits (680), Expect = 3.5e-71
Identity = 204/442 (46.15%), Postives = 257/442 (58.14%), Query Frame = 0

Query: 3   SPCIS---GGGRAYNFDLEIVKS-PSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTP 62
           S C+S   GG  AY+F+LE VKS P SS T T++ +SPSST+SESSN+ LAISTRK RT 
Sbjct: 2   SACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRTQ 61

Query: 63  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFC--------ESAELLLPF 122
           RKRPNQTYNEA  LLSTAYPN+FS+ +L++ +K   S +S F         ++++LLLP+
Sbjct: 62  RKRPNQTYNEAATLLSTAYPNIFSS-NLSSKQKTHSSSNSHFYGPLLSDNDDASDLLLPY 121

Query: 123 RVIDSSGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESIL 182
             I+   FL H P +Q K       K  N      C   GEI+    S   ++FDAESIL
Sbjct: 122 ESIEEPDFLFH-PTIQTKTEFFSDQKEVN--SGGDCYG-GEIEKFDFS---DEFDAESIL 181

Query: 183 DEEIEEGIDSIMGNLCVDNLETA---------------TSAQDYSCANPKNWNCYWNPIG 242
           DE+IEEGIDSIMG +   N  +                +S+              WN   
Sbjct: 182 DEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRGGRSSSNRIGKLEQMMMINSWNRSS 241

Query: 243 LGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVA 302
            GFN  F  G G+     R+A+R  DD   W+  TVD  +ISP++         T  A++
Sbjct: 242 NGFN--FPLGLGL-----RSALRENDDTKLWKIHTVDFEQISPRIQ-----TVKTETAIS 301

Query: 303 TKKKKKKVEKLTVIESKKA-----------ATTPPQKEKSEKPTIPKS-KPPGLLLKLNY 362
           T  ++K   K  VI  +K+            TT   + KS + T   S K  G LLKL+Y
Sbjct: 302 TVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLKLDY 361

Query: 363 EAVADAWSDRGSPFSDEVPGSDTAGSDVNARLAQIDLFSDGGGLLREASVLRYKEKRRTR 406
           + V +AWSD+ SPF DE+ GS+    DVNARLAQIDLF D G  +REASVLRYKEKRRTR
Sbjct: 362 DGVLEAWSDKTSPFPDEIQGSEAV--DVNARLAQIDLFGDSG--MREASVLRYKEKRRTR 419

BLAST of Tan0017001 vs. TAIR 10
Match: AT4G25990.1 (CCT motif family protein )

HSP 1 Score: 262.7 bits (670), Expect = 5.1e-70
Identity = 186/419 (44.39%), Postives = 247/419 (58.95%), Query Frame = 0

Query: 12  AYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRKRPNQTYNEATV 71
           AY+F+LE++KSP S     + T SPSST+SE+++   +ISTR+ RTPRKRPNQTY+EA  
Sbjct: 5   AYSFELEMMKSPPS-----NNTPSPSSTISETNSPPFSISTRRPRTPRKRPNQTYDEAAA 64

Query: 72  LLSTAYPNVFSTKHL-TNPRKFTESHDSLFCESAELLLPFRVIDSSGFLLHQPLLQEKPN 131
           LLSTAYP +FS+K   T      +S  S + E+++LLLP+  I+ + FL          N
Sbjct: 65  LLSTAYPKIFSSKKAKTQIFGTNKSPLSDYDEASQLLLPYVSIEENEFLF---------N 124

Query: 132 SQIQSKLANLWESRPCSSPGEIDFQPNSM-EMEDFDAESILDEEIEEGIDSIMGNLCVDN 191
             I +K  +  E +  S     D + N    ++DFDAESILDEEIEEGIDS MGN+    
Sbjct: 125 PTIPTKTEHFLEQKEVSFD---DLEVNGFGVLDDFDAESILDEEIEEGIDSFMGNI---- 184

Query: 192 LETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFP 251
                 + D    N          +   +N +F  G G+     R+++R+ +D NWW+FP
Sbjct: 185 -----ESNDGDRENCYRVGRLEEIMKNAWNGRFRLGLGL-----RSSLRQNNDENWWKFP 244

Query: 252 TVDVVEISPKLNPKPPAPA--------------PTPAAVATKKKKKKVEKLTVIESKKAA 311
           TV+  +ISP++     A A                  A   KKKKKK +K  V  +  AA
Sbjct: 245 TVEFDQISPRIQTTAAAAADDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKV--APAAA 304

Query: 312 TTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARL 371
            +   +     P + +   P  LLKL+Y+ V +AWS + SPFSDE+ GSD  G D + RL
Sbjct: 305 ESKSSEVTDSNPKLEQRVSP--LLKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRL 364

Query: 372 AQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMKGRFVRRPNS 415
            +IDLF + G  +REASVLRYKEKRR RLFSKKIRYQVRK+NAD RPRMKGRFVRRPN+
Sbjct: 365 GEIDLFGESG--MREASVLRYKEKRRNRLFSKKIRYQVRKLNADQRPRMKGRFVRRPNA 386

BLAST of Tan0017001 vs. TAIR 10
Match: AT4G25990.2 (CCT motif family protein )

HSP 1 Score: 252.7 bits (644), Expect = 5.3e-67
Identity = 186/434 (42.86%), Postives = 247/434 (56.91%), Query Frame = 0

Query: 12  AYNFDLEIVKSPSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTPRKRPNQTYNEATV 71
           AY+F+LE++KSP S     + T SPSST+SE+++   +ISTR+ RTPRKRPNQTY+EA  
Sbjct: 5   AYSFELEMMKSPPS-----NNTPSPSSTISETNSPPFSISTRRPRTPRKRPNQTYDEAAA 64

Query: 72  LLSTAYPNVFSTKHL-TNPRKFTESHDSLFCESAELLLPFRVIDSSGFLLHQPLLQEKPN 131
           LLSTAYP +FS+K   T      +S  S + E+++LLLP+  I+ + FL          N
Sbjct: 65  LLSTAYPKIFSSKKAKTQIFGTNKSPLSDYDEASQLLLPYVSIEENEFLF---------N 124

Query: 132 SQIQSKLANLWESRPCSSPGEIDFQPNSM-EMEDFDAESILDEEIEEGIDSIMGNLCVDN 191
             I +K  +  E +  S     D + N    ++DFDAESILDEEIEEGIDS MGN+    
Sbjct: 125 PTIPTKTEHFLEQKEVSFD---DLEVNGFGVLDDFDAESILDEEIEEGIDSFMGNI---- 184

Query: 192 LETATSAQDYSCANPKNWNCYWNPIGLGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFP 251
                 + D    N          +   +N +F  G G+     R+++R+ +D NWW+FP
Sbjct: 185 -----ESNDGDRENCYRVGRLEEIMKNAWNGRFRLGLGL-----RSSLRQNNDENWWKFP 244

Query: 252 TVDVVEISPKLNPKPPAPA--------------PTPAAVATKKKKKKVEKLTVIESKKAA 311
           TV+  +ISP++     A A                  A   KKKKKK +K  V  +  AA
Sbjct: 245 TVEFDQISPRIQTTAAAAADDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKV--APAAA 304

Query: 312 TTPPQKEKSEKPTIPKSKPPGLLLKLNYEAVADAWSDRGSPFSDEVPGSDTAGSDVNARL 371
            +   +     P + +   P  LLKL+Y+ V +AWS + SPFSDE+ GSD  G D + RL
Sbjct: 305 ESKSSEVTDSNPKLEQRVSP--LLKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRL 364

Query: 372 AQIDLFSDGGGLLREASVLRYKEKRRTRLFSKKIRYQVRKVNADGRPRMK---------- 415
            +IDLF + G  +REASVLRYKEKRR RLFSKKIRYQVRK+NAD RPRMK          
Sbjct: 365 GEIDLFGESG--MREASVLRYKEKRRNRLFSKKIRYQVRKLNADQRPRMKVKDWHCNIVV 401

BLAST of Tan0017001 vs. TAIR 10
Match: AT5G57180.3 (chloroplast import apparatus 2 )

HSP 1 Score: 188.0 bits (476), Expect = 1.6e-47
Identity = 160/395 (40.51%), Postives = 211/395 (53.42%), Query Frame = 0

Query: 3   SPCIS---GGGRAYNFDLEIVKS-PSSSWTRTSQTSSPSSTLSESSNTQLAISTRKSRTP 62
           S C+S   GG  AY+F+LE VKS P SS T T++ +SPSST+SESSN+ LAISTRK RT 
Sbjct: 2   SACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRTQ 61

Query: 63  RKRPNQTYNEATVLLSTAYPNVFSTKHLTNPRKFTESHDSLFC--------ESAELLLPF 122
           RKRPNQTYNEA  LLSTAYPN+FS+ +L++ +K   S +S F         ++++LLLP+
Sbjct: 62  RKRPNQTYNEAATLLSTAYPNIFSS-NLSSKQKTHSSSNSHFYGPLLSDNDDASDLLLPY 121

Query: 123 RVIDSSGFLLHQPLLQEKPNSQIQSKLANLWESRPCSSPGEIDFQPNSMEMEDFDAESIL 182
             I+   FL H P +Q K       K  N      C   GEI+    S   ++FDAESIL
Sbjct: 122 ESIEEPDFLFH-PTIQTKTEFFSDQKEVN--SGGDCYG-GEIEKFDFS---DEFDAESIL 181

Query: 183 DEEIEEGIDSIMGNLCVDNLETA---------------TSAQDYSCANPKNWNCYWNPIG 242
           DE+IEEGIDSIMG +   N  +                +S+              WN   
Sbjct: 182 DEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRGGRSSSNRIGKLEQMMMINSWNRSS 241

Query: 243 LGFNQKFEFGFGMRKAMERAAIRRVDDGNWWRFPTVDVVEISPKLNPKPPAPAPTPAAVA 302
            GFN  F  G G+     R+A+R  DD   W+  TVD  +ISP++         T  A++
Sbjct: 242 NGFN--FPLGLGL-----RSALRENDDTKLWKIHTVDFEQISPRIQ-----TVKTETAIS 301

Query: 303 TKKKKKKVEKLTVIESKKA-----------ATTPPQKEKSEKPTIPKS-KPPGLLLKLNY 359
           T  ++K   K  VI  +K+            TT   + KS + T   S K  G LLKL+Y
Sbjct: 302 TVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLKLDY 361

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LU682.3e-7547.12Protein CHLOROPLAST IMPORT APPARATUS 2 OS=Arabidopsis thaliana OX=3702 GN=CIA2 P... [more]
Q8RWD09.2e-1644.17Zinc finger protein CONSTANS-LIKE 16 OS=Arabidopsis thaliana OX=3702 GN=COL16 PE... [more]
Q8LG764.6e-1544.44Zinc finger protein CONSTANS-LIKE 6 OS=Arabidopsis thaliana OX=3702 GN=COL6 PE=2... [more]
Q9C9A98.6e-1440.00Zinc finger protein CONSTANS-LIKE 7 OS=Arabidopsis thaliana OX=3702 GN=COL7 PE=2... [more]
Q9M9B33.6e-1250.00Zinc finger protein CONSTANS-LIKE 8 OS=Arabidopsis thaliana OX=3702 GN=COL8 PE=1... [more]
Match NameE-valueIdentityDescription
XP_038890141.13.4e-20188.51protein CHLOROPLAST IMPORT APPARATUS 2 isoform X2 [Benincasa hispida][more]
XP_022149812.11.3e-20088.11protein CHLOROPLAST IMPORT APPARATUS 2 [Momordica charantia][more]
XP_023537773.13.8e-20089.45protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucurbita pepo subsp. pepo][more]
XP_022937796.16.4e-20089.69protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucurbita moschata][more]
KAG6586108.18.4e-20090.10Protein CHLOROPLAST IMPORT APPARATUS 2, partial [Cucurbita argyrosperma subsp. s... [more]
Match NameE-valueIdentityDescription
A0A6J1D9086.3e-20188.11protein CHLOROPLAST IMPORT APPARATUS 2 OS=Momordica charantia OX=3673 GN=LOC1110... [more]
A0A6J1FBD73.1e-20089.69protein CHLOROPLAST IMPORT APPARATUS 2-like OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A6J1HLT73.8e-19888.49protein CHLOROPLAST IMPORT APPARATUS 2-like OS=Cucurbita maxima OX=3661 GN=LOC11... [more]
A0A0A0LGV42.5e-19486.61CCT domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G061540 PE=4 SV... [more]
A0A1S3BQX84.8e-19386.21protein CHLOROPLAST IMPORT APPARATUS 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC... [more]
Match NameE-valueIdentityDescription
AT5G57180.21.6e-7647.12chloroplast import apparatus 2 [more]
AT5G57180.13.5e-7146.15chloroplast import apparatus 2 [more]
AT4G25990.15.1e-7044.39CCT motif family protein [more]
AT4G25990.25.3e-6742.86CCT motif family protein [more]
AT5G57180.31.6e-4740.51chloroplast import apparatus 2 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR010402CCT domainPFAMPF06203CCTcoord: 369..411
e-value: 9.6E-18
score: 64.0
IPR010402CCT domainPROSITEPS51017CCTcoord: 369..411
score: 15.784945
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 291..314
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 24..64
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 24..53
NoneNo IPR availablePANTHERPTHR31874CCT MOTIF FAMILY PROTEIN, EXPRESSEDcoord: 1..415
NoneNo IPR availablePANTHERPTHR31874:SF10OS02G0148000 PROTEINcoord: 1..415

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0017001.1Tan0017001.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding