Cp4.1LG01g18810 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g18810
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Proline--tRNA ligase
Location: Cp4.1LG01 : 16155022 .. 16161293 (+)
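
The span of the feature follows directly from the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive at both ends, as in GFF3; the record does not state this, and the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given start/end coordinates.

    Assumption: coordinates are 1-based and inclusive at both ends
    (the GFF3 convention); the record itself does not say so.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = feature_length(16155022, 16161293)
print(gene_length)  # 6272
```

If the coordinates were instead 0-based and half-open (as in BED files), the length would be `end - start` without the `+ 1`.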
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AAAAAGCTAATTTGAGAAATCAATCCATCCCCGTCCTGTTCTGTGCTGAACCCTCAAACTTCTCCTCGTAGTTCATCACTATCCGCGCCTCCTCGCATTTGCGCATCGCCGCCGCGAGGTCGTTCGGTGTTCGTCGAAATCGCAGAGCTCGATTCACTGTAAACTTCGTTGTACCTTTCTTGATACTTCAATTGCTTAGTCCTTCTTTTCTCTGTCTCTGTAGACTCCACGAATTCTGCCATGTTAATTAGCTACTCTTTGTTCTTTTATTCCTGGCCGAGGTGGGTTTGTATGACTTGATGATTTATGTTTCCTTGTGTATTCCGGGGCATATGTGGGGAAATTTGACAACCAGACTCAATCCAATTCAATTATCGTTGGGTTGACTTTAGATTCCTTTTTGTGTTCTTCTCGTTTTTACATGAAATATTGTTAACAATTCTCCCAATGCCTACTTGATTATTTTTTTACTGCATTTCTTTGTCCTCAATTGAATTTCTTACGATGATACTCCTGCAGAAGTTAAANATTGTTAACAATTCTCCCAATGCCTACTTGATTATTTTTTTATTGCATTTCTTTGTCCTCAATTGAATTTCTTACGATGATACTCCTGCAGAAGTTAAAAACTCTAAGCATGGCTGGTCCTAAGCCTGGTAGCTCTGCTAAAACTCCAAAAGCTGGTAAATATTGTTCCTTTAGTACGGTAGTTGTTTGTGCAATTGCTTCGTATTCCCCTCTTTGGCATGTGTTTAATAGTTCTTCCTTGCTAACTGGACAGGCGGTAAAAAGAAGGAGGTGAAGAAAGAGACTGGTTTAGGTCTCACTAATAAGAAGGATGATAACTTTGGAGAGTGGTATTCTGAGGTGCTGTTTCTAGCTTGATTTTTGTTTTCTTCATTACAAAGTGACTTTGTGTTGGGTAAAGTTATTTGTCTGAATTGAGTTTTTGTTTTCATCACACTTTAACTTGACAATACTTGAACATTTTTCCTTCAACTTAGGAGAGAAATGTTTTATTTTGGTTCATAAGGTCTTGTTTCTAGTTCTTCTATGCCACTTTTTTTTATTTTTTTTTCTGTCGTAAAATGTTGTCGATTAGTGGAATCATATATTATGTTAAATGAACAAAAAAGGAAATGAAGATTTCAAAATTTTTGTTTTGTAGCAAGCTCCCCAGTAATATTACGAACAGGCTGTTTTCAGTGGCTTGAATAATTCGTTTTCTAGTTTGTCTAGGTGGTTGTCAGTGGAGAAATGATTGAATATTATGATATCTCTGGCTGCTATATTCTGAGGCCGTGGGCTATGTCTATCTGGGAGACTATGCAAGTAAGGATGCTTTTCTTTCTATAGTCAATTTTACAATCTATTCTCGCTTTGAAGTTTGAGGATTTTATTTGTCACCGATGCAGGTATTTTTTGATGCAGAAATTAAGAAAATGAAAATCAAGAACTGCTATTTTCCACTTTTTGTGTCTCCCGGCGTCTTACAAAGAGAGAAAGATCATATAGAAGGTTTTGCTCCTGAGGTGACGTTTTAATATTAATATATGCAATAATAGTGAAACTTCTCCGTCTGAATTTGAAATGCTTATTACCTTTCCATTCCTCATGTTCTGTTTGTTCATTTTCAATTTTTAAGCTGATAAAGTGAACTTGGTTTAGTGCTGAAAAATTGGTATTTGGCATCCATTTGAAGGTTGCATGGGTGACAAAATCTGGGGAGTCTGACTTGGAAGTGCCTATTGCAATTCGACCTACAAGTGAGACTGTGATGTATCCATACTATTCTAAGTGGATCAGGGGTCACCGTGACTTGCCTTTGAAACTTAACCAGTGGTGCAATGTTGTGAGATGGGAGTTCAGCCATCCCACACCATTTATTAGGCAAGTACTCTTCCGTGTGTTTTTTTTTTCGTTTCTTTACCTTTCACTATGTTGATGGATTACTTTGGACCTGTTAACAGTTAGATGTTTTATCATTTTGTTTAATGATC
TTGATTAAATAATTTTGTTCAATGTTCTTGATCAAATAATTTTGTTTAATGTTCTTGATTATATAATTTATTCAATGTTCTTGATTAAATAATTTTGTTCAATGTTCTTGATTAAATAGTTCTATTCAATGTTCTTAACTAAATGTGTCATATTTGCATAAGTCACTATAGTATTCTTGAACCTTTTAAGGATTTTTCAAGAACTTCATTGTTTTAAAATTATTTGTAACTATTTTGAGAAAATGACCAATTTTTATGAACTACTGACTGCAGGAGTCGTGAGTTCCTTTGGCAGGAAGGACATACTGCTTTTGCTACCAAGGATGAAGCAGACACAGAGGTATCAAACAGATACAGAATTTCAAATTACGAAATGGAAAATTGCTTTTGTCATGGTTGAATGTAGAGATTGAAACTAACTTGTCTTTTCTCAGTTTCTAGTGATGGAAATGAGTCAAATATTTTTGGGATAATGGTTTGAGTGGAAACTTAGGACCTTTAAATGGGAAGAGTAGAGATTTGATACATTGAATCTTTCTTTATTTGTCTACTATCGCCTTTGGGTTGACTGTTTGTGTGAAAATAAAGAAAGTAAATTTAGCATTCTTTTCATCTTCAGCTTCTACTTCAATACTAATGGATTTTTAAGAACATAATAAACTCGTTTGATTTGTTTGATGTCACAAGAGAGTGCAGTGGAGTCTTAAGATAAGGTAACTGTGAAGGTGTTGATCTTTTAGAATTTTTGAGATGTACAATTGTACTAATGGCATAATTATAAACTTTTCTTTCTTGATCAAACTTGGAGATTCAAGCATGCAAACTCCCTTTAATTTGAAGCTGAGTGGGGCCTAAATTGGCTGAGTTCTTCAAGTGATCATCCTCAAGTTCTTAGAACTCAAGGGATGAGTTTTCTTTCGTTGGAAAGCTTGATGGTTCAATTTAGATATCAATTGTGTCATAAATTTTATATTTTATATAATGCTTGGATAACTTTTTAATGGTCGAAACTGATTTTTTTTTTTTTTTAAAACAAATTCAAGTTTGGAAATTATTAGTGATTTATATTTGGAATTTTAAGTCATGTGCTCATATTTTTCCTTGGCATTACATTCTATCAATTGATTTGTTGACGATGAATCATGCCTATTGTTCATTGATAAACTATCCATAGCTGTTACGCTTCTGAATTGTCTTGTCTTTTCTTTGCGAAGAAAACAAAGGTGTATAAATAATTTTAAGAGTAGAGAGGGTAGAGGGAGTGTTAAATCTCCATCTTCATAATCAATCCAAGGGAGATCAAAAAAAAAAAAACCTCGATCTGTCCCTGAAAATATGCAACTCTTTTGTGTAACTTTGGTTATTCTTTAAAGGTTACTTTCATTTCAGGTTCTTGAGATATTGGAACTGTATAGGCGCATATATGAAGAATACCTTGCGATTCCTGTTATTAAGGGCAAAAAGAGTGAGATGGAGAAGTTCGCTGGTGGTCTTTACACTACAAGCGTTGAGGTATAAGTTGACTAAATTTTATGACCTATTTTCAAAATGGTGAATCAATATCCGTTGCTGGTTCAAGTTGTATAAAACCATTTCTTCATGTGTGGTATGTGTGTGCAGGCATTTATCCCAAACACTGGTCGTGGAATTCAGGGTGCAACTTCACATTGTTTGGGTCAAAATTTTGCGAAAATGTTTGAAATAAACTTTGAAAATGAAAAGGGGGAGAAAGCTATGGTCTGGCAAAATTCATGGGCCTACAGTACTAGAACGGTAATATTTGAGTACATTTGGCTTTGGTAACTTGTCAGAGTGAATTTCTGGGTTTGTGAAACAACAAACAGTGATGAAGGCATATATTACCTTTTATGAATGGTCATGGTCATTTCTTTTAGTCGCTTATCTAGTCAAATTGTTTACAGATTGGTGTGATGGTTATGGTTCACGGGGATGACAAGGGATTGGTGCTGCCTCCCAAAGTTGCATCAGTTCAAGTTGTT
ATCGTTCCTGTTCCTTACAAAGATGCAGATACTCAGGGGATTTTTGATGCTTGTTCTGTCACTTTGGATACGTTGTCTGAAGCAGGAATTCGTGCAGAGGTAGACGCTAGGGATAATTATTCTCCTGGATGGAAGTATTCTCACTGGGAAATGAAAGGTGTTCCACTCAGGATTGAAATAGGGCCTAAGGACTTGGCAAACAATCAGGTTACATATTCTGTCTATTAATATATTTTTCGGTCACTACTGAAATACTGTTTTCCCTTAGCTAATACTTTTTAATGCCTTGGACTATGTTGATACATTGAGCTAATATATAATTATGTNAAATACTGTTTTCCCTTAGCTAATACTTTTTAATGCCTTGGATTATGTTGATACATTGAGCTAATATATAATTATATACTTCTAGGTACGTGCTGTTCGCCGTGATAATTCAGCAAAGAAGGACATACCTAGGGCGTCCTTGGTTGAGCAAGTGAAAGAATTGCTAGAAAGTATTCAACAAAGCCTGTTTGACGCAGCAAAAGTAAAACGGGACACATGCATTCAGGTTATTAATACTTGGGAAGAGTTTACTGAAGCACTCAGTCAGAAGAAAATGATATTAGCTCCATGGTGCGATGAAGAGGTATAGTTTCTTGGTGTTCTTATTGCTATTTTGATTCTGATTGGGAAGGATGACTGCTTTTTGGTTCTGACGTTTCATGAGGCCACCATAATTTTATTTGATTAGTTAAAGGGATTCTTATGGTCAGTTTTTTTTCAATATATTACTCGATATGCGCTCTCATCTGCTGTTTTTTTTTTTTTTGTGTGTGTGTGTGTGTATGTCTGTGTGTGGAGCTCGTGTATGTATCTTTGAAGAGCTTCTTGCACGTTTTGGCTGCTGTAGCTTTAAAGTAGTGTCTCACTTGGATGATTTCTGTGAAATTTCGGGGGTAGTTATCAAATGATATTTTGAACTCCATTGACCACTGGCTTGAGTACATGCATTTTCAAGTGTTTGTTCTATGCCAACCTCTATCATCTACTGGTGCACTTTTCCTTAAGGGGTTTCATTGTTTTGTTCCTCCAGGAGGTTGAGAAAGATGTGAAAACAAGGACCAAGGGTGAGATGGGAGCAGCTAAAACTTTATGTTCTCCATTCGAGCAGCCCGAACTTCCAGAAGGTACTAAATTTCTGTCCAAATCTGGTTTCTTCTNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTGATAATAGGACTATATATTATCTTTTGACATCTTGGGGGAATTTATGAGCTAAACTCTTTTTGGTTCCTTGAATTGATGTTTGTGCTCAGGTACTAAATGTTTTGCATCTGGGAAGCCTGCCAAAAAATGGAGCTATTGGGGCCGCAGCTACTAAACTACACAAGCTCATGTTTATCACCCACGCTATACTTGGATACTTGTCAAGTAATCAGCTTTAGAGTTCTTGAGTACGAGTGCTTTCAATTCAGTTAATTCTGAGTTCATTAGAATTTCCAATTGTTTAATCATTCACCAAGCAAGTTAACTATAGAATCTTGTTTTACACTTTCTTTTTTTGATAATTTATGGGTTTTGCCTTACAATCTTGTTTTCAGTTCTGTTAAATTATTAGTCCAACGCTTAGAAGCTTCTAAATTTCATATTTTCTTTCTTTCATATTCTCGGGAACTTGTCATGATTTAACTCAGTTATCATCACCCAATAGCGACTTTTAAATGATTCCTTTTTGTAGTCTCATTCAAAGTATCAGTATTGATAATGGCCAAGCTCGGAAGATTGGTACTGAATGAATATCAATGGTACCTCTACCTTCATTGGAGATGGCTAAGCCACTGGGATATTTTAGTGAAAAACTAAGCCTCAACAGCTATCGACTAAGGAATTAAGGTCTCTGATGGTTGATGCTAATGGGGATGTGTCGGAAGTAGTACCAAGCGAAGGCAGTGGTGG
GATTAACTTCGACTGACACCATCGAAGTGAAAGGATTGATCGATCTGCTAAGAGATCGTCTTGATCCCATGTTTTCTCTAGTTTCTGTAAAGGGTTTAGGGTTGATGGAACAGACCTAATTGAAACTTTCTTGGTTGGAAATGTAATTGAAATTTGTAAGGCACTGAATGGAAAAACATTGCAGATATTAGGAACAAAATTAGATATCTGCCCTTGTGGGTAAAACTAAAAATAATCTCTAATGGATCTTGCTAGCATGATGGTGCCTATGT

mRNA sequence

AAAAAGCTAATTTGAGAAATCAATCCATCCCCGTCCTGTTCTGTGCTGAACCCTCAAACTTCTCCTCGTAGTTCATCACTATCCGCGCCTCCTCGCATTTGCGCATCGCCGCCGCGAGGTCGTTCGGTGTTCGTCGAAATCGCAGAGCTCGATTCACTAAGTTAAAAACTCTAAGCATGGCTGGTCCTAAGCCTGGTAGCTCTGCTAAAACTCCAAAAGCTGGCGGTAAAAAGAAGGAGGTGAAGAAAGAGACTGGTTTAGGTCTCACTAATAAGAAGGATGATAACTTTGGAGAGTGGTATTCTGAGGTGGTTGTCAGTGGAGAAATGATTGAATATTATGATATCTCTGGCTGCTATATTCTGAGGCCGTGGGCTATGTCTATCTGGGAGACTATGCAAGTATTTTTTGATGCAGAAATTAAGAAAATGAAAATCAAGAACTGCTATTTTCCACTTTTTGTGTCTCCCGGCGTCTTACAAAGAGAGAAAGATCATATAGAAGGTTTTGCTCCTGAGGTTGCATGGGTGACAAAATCTGGGGAGTCTGACTTGGAAGTGCCTATTGCAATTCGACCTACAAGTGAGACTGTGATGTATCCATACTATTCTAAGTGGATCAGGGGTCACCGTGACTTGCCTTTGAAACTTAACCAGTGGTGCAATGTTGTGAGATGGGAGAGTCGTGAGTTCCTTTGGCAGGAAGGACATACTGCTTTTGCTACCAAGGATGAAGCAGACACAGAGGTTCTTGAGATATTGGAACTGTATAGGCGCATATATGAAGAATACCTTGCGATTCCTGTTATTAAGGGCAAAAAGAGTGAGATGGAGAAGTTCGCTGGTGGTCTTTACACTACAAGCGTTGAGGCATTTATCCCAAACACTGGTCGTGGAATTCAGGGTGCAACTTCACATTGTTTGGGTCAAAATTTTGCGAAAATGTTTGAAATAAACTTTGAAAATGAAAAGGGGGAGAAAGCTATGGTCTGGCAAAATTCATGGGCCTACAGTACTAGAACGATTGGTGTGATGGTTATGGTTCACGGGGATGACAAGGGATTGGTGCTGCCTCCCAAAGTTGCATCAGTTCAAGTTGTTATCGTTCCTGTTCCTTACAAAGATGCAGATACTCAGGGGATTTTTGATGCTTGTTCTGTCACTTTGGATACGTTGTCTGAAGCAGGAATTCGTGCAGAGGTAGACGCTAGGGATAATTATTCTCCTGGATGGAAGTATTCTCACTGGGAAATGAAAGGTGTTCCACTCAGGATTGAAATAGGGCCTAAGGACTTGGCAAACAATCAGGTACGTGCTGTTCGCCGTGATAATTCAGCAAAGAAGGACATACCTAGGGCGTCCTTGGTTGAGCAAGTGAAAGAATTGCTAGAAAGTATTCAACAAAGCCTGTTTGACGCAGCAAAAGTAAAACGGGACACATGCATTCAGGTTATTAATACTTGGGAAGAGTTTACTGAAGCACTCAGTCAGAAGAAAATGATATTAGCTCCATGGTGCGATGAAGAGGTACTAAATGTTTTGCATCTGGGAAGCCTGCCAAAAAATGGAGCTATTGGGGCCGCAGCTACTAAACTACACAAGCTCATGTTTATCACCCACGCTATACTTGGATACTTGTCAAGTAATCAGCTTTAGAGTTCTTGAGTACGAGTGCTTTCAATTCAGTTAATTCTGAGTTCATTAGAATTTCCAATTGTTTAATCATTCACCAAGCAAGTTAACTATAGAATCTTGTTTTACACTTTCTTTTTTTGATAATTTATGGGTTTTGCCTTACAATCTTGTTTTCAGTTCTGTTAAATTATTAGTCCAACGCTTAGAAGCTTCTAAATTTCATATTTTCTTTCTTTCATATTCTCGGGAACTTGTCATGATTTAACTCAGTTATCATCACCCAATAGCGACTTTTAAATGATTCCTTTTTGTAGTCTCATTCAAAGTATCAGTATTGATAATGGCCAAGCTCGGAAGATTGGTACT
GAATGAATATCAATGGTACCTCTACCTTCATTGGAGATGGCTAAGCCACTGGGATATTTTAGTGAAAAACTAAGCCTCAACAGCTATCGACTAAGGAATTAAGGTCTCTGATGGTTGATGCTAATGGGGATGTGTCGGAAGTAGTACCAAGCGAAGGCAGTGGTGGGATTAACTTCGACTGACACCATCGAAGTGAAAGGATTGATCGATCTGCTAAGAGATCGTCTTGATCCCATGTTTTCTCTAGTTTCTGTAAAGGGTTTAGGGTTGATGGAACAGACCTAATTGAAACTTTCTTGGTTGGAAATGTAATTGAAATTTGTAAGGCACTGAATGGAAAAACATTGCAGATATTAGGAACAAAATTAGATATCTGCCCTTGTGGGTAAAACTAAAAATAATCTCTAATGGATCTTGCTAGCATGATGGTGCCTATGT

Coding sequence (CDS)

ATGGCTGGTCCTAAGCCTGGTAGCTCTGCTAAAACTCCAAAAGCTGGCGGTAAAAAGAAGGAGGTGAAGAAAGAGACTGGTTTAGGTCTCACTAATAAGAAGGATGATAACTTTGGAGAGTGGTATTCTGAGGTGGTTGTCAGTGGAGAAATGATTGAATATTATGATATCTCTGGCTGCTATATTCTGAGGCCGTGGGCTATGTCTATCTGGGAGACTATGCAAGTATTTTTTGATGCAGAAATTAAGAAAATGAAAATCAAGAACTGCTATTTTCCACTTTTTGTGTCTCCCGGCGTCTTACAAAGAGAGAAAGATCATATAGAAGGTTTTGCTCCTGAGGTTGCATGGGTGACAAAATCTGGGGAGTCTGACTTGGAAGTGCCTATTGCAATTCGACCTACAAGTGAGACTGTGATGTATCCATACTATTCTAAGTGGATCAGGGGTCACCGTGACTTGCCTTTGAAACTTAACCAGTGGTGCAATGTTGTGAGATGGGAGAGTCGTGAGTTCCTTTGGCAGGAAGGACATACTGCTTTTGCTACCAAGGATGAAGCAGACACAGAGGTTCTTGAGATATTGGAACTGTATAGGCGCATATATGAAGAATACCTTGCGATTCCTGTTATTAAGGGCAAAAAGAGTGAGATGGAGAAGTTCGCTGGTGGTCTTTACACTACAAGCGTTGAGGCATTTATCCCAAACACTGGTCGTGGAATTCAGGGTGCAACTTCACATTGTTTGGGTCAAAATTTTGCGAAAATGTTTGAAATAAACTTTGAAAATGAAAAGGGGGAGAAAGCTATGGTCTGGCAAAATTCATGGGCCTACAGTACTAGAACGATTGGTGTGATGGTTATGGTTCACGGGGATGACAAGGGATTGGTGCTGCCTCCCAAAGTTGCATCAGTTCAAGTTGTTATCGTTCCTGTTCCTTACAAAGATGCAGATACTCAGGGGATTTTTGATGCTTGTTCTGTCACTTTGGATACGTTGTCTGAAGCAGGAATTCGTGCAGAGGTAGACGCTAGGGATAATTATTCTCCTGGATGGAAGTATTCTCACTGGGAAATGAAAGGTGTTCCACTCAGGATTGAAATAGGGCCTAAGGACTTGGCAAACAATCAGGTACGTGCTGTTCGCCGTGATAATTCAGCAAAGAAGGACATACCTAGGGCGTCCTTGGTTGAGCAAGTGAAAGAATTGCTAGAAAGTATTCAACAAAGCCTGTTTGACGCAGCAAAAGTAAAACGGGACACATGCATTCAGGTTATTAATACTTGGGAAGAGTTTACTGAAGCACTCAGTCAGAAGAAAATGATATTAGCTCCATGGTGCGATGAAGAGGTACTAAATGTTTTGCATCTGGGAAGCCTGCCAAAAAATGGAGCTATTGGGGCCGCAGCTACTAAACTACACAAGCTCATGTTTATCACCCACGCTATACTTGGATACTTGTCAAGTAATCAGCTTTAG

Protein sequence

MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWESREFLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEEVLNVLHLGSLPKNGAIGAAATKLHKLMFITHAILGYLSSNQL
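
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the record. Only the first 51 nucleotides of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 17 codons of the CDS listed in this record.
cds_prefix = "ATGGCTGGTCCTAAGCCTGGTAGCTCTGCTAAAACTCCAAAAGCTGGCGGT"
print(translate(cds_prefix))  # MAGPKPGSSAKTPKAGG
```

The printed peptide matches the first 17 residues of the protein sequence above.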
BLAST of Cp4.1LG01g18810 vs. Swiss-Prot
Match: SYPC_ARATH (Proline--tRNA ligase, cytoplasmic OS=Arabidopsis thaliana GN=At3g62120 PE=2 SV=1)

HSP 1 Score: 771.2 bits (1990), Expect = 7.0e-222
Identity = 365/450 (81.11%), Positives = 402/450 (89.33%), Query Frame = 1

Query: 10  AKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPWAMS 69
           AK   +G KKK+VKKETGLGL+ KKD+NFGEWYSEV    +MIEYYDISGCYILRPW+M+
Sbjct: 30  AKASSSGQKKKDVKKETGLGLSVKKDENFGEWYSEVCKQ-DMIEYYDISGCYILRPWSMA 89

Query: 70  IWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDLEVP 129
           IWE MQ+FFDAEIKKMK+KNCYFPLFVSPGVL++EKDHIEGFAPEVAWVTKSG+SDLEVP
Sbjct: 90  IWEIMQIFFDAEIKKMKVKNCYFPLFVSPGVLEKEKDHIEGFAPEVAWVTKSGKSDLEVP 149

Query: 130 IAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFLWQEGHTA 189
           IAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SREFLWQEGHTA
Sbjct: 150 IAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSNPTPFIRSREFLWQEGHTA 209

Query: 190 FATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNTGRG 249
           FATK EAD EVL+ILELYRRIYEEYLA+PV+KG KSE EKFAGGLYTTSVEAFIPNTGRG
Sbjct: 210 FATKAEADEEVLQILELYRRIYEEYLAVPVVKGMKSENEKFAGGLYTTSVEAFIPNTGRG 269

Query: 250 IQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLVLPP 309
           +QGATSHCLGQNFAKMFEINFENEK E  MVWQNSWAYSTRTIGVM+M HGDDKGLVLPP
Sbjct: 270 VQGATSHCLGQNFAKMFEINFENEKAETEMVWQNSWAYSTRTIGVMIMTHGDDKGLVLPP 329

Query: 310 KVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHWEMK 369
           KVASVQVV++PVPYKDA+TQGI+DAC+ T   L EAGIRAE D RDNYSPGWKYS WEMK
Sbjct: 330 KVASVQVVVIPVPYKDANTQGIYDACTATASALCEAGIRAEEDLRDNYSPGWKYSDWEMK 389

Query: 370 GVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFDAAKVKRD 429
           GVPLRIEIGP+DL N+QVR VRRDN  K+DIPR SLVE VKELLE IQQ++++ AK KR+
Sbjct: 390 GVPLRIEIGPRDLENDQVRTVRRDNGVKEDIPRGSLVEHVKELLEKIQQNMYEVAKQKRE 449

Query: 430 TCIQVINTWEEFTEALSQKKMILAPWCDEE 451
            C+Q + TW+EF +AL++KK+ILAPWCDEE
Sbjct: 450 ACVQEVKTWDEFIKALNEKKLILAPWCDEE 478
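
The percentages in the HSP summary are the match counts divided by the alignment length (gaps included). A short sketch reproducing the figures for this hit, with a hypothetical helper `percent`:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the SYPC_ARATH HSP summary line.
identity = percent(365, 450)
positives = percent(402, 450)
print(identity, positives)  # 81.11 89.33
```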

BLAST of Cp4.1LG01g18810 vs. Swiss-Prot
Match: SYEP_DROME (Bifunctional glutamate/proline--tRNA ligase OS=Drosophila melanogaster GN=Aats-glupro PE=1 SV=2)

HSP 1 Score: 531.9 bits (1369), Expect = 7.2e-150
Identity = 260/462 (56.28%), Positives = 332/462 (71.86%), Query Frame = 1

Query: 2    AGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCY 61
            A PKP    K   A      VKK+T LGL   K+DN  +WYS+V+  GEMIEYYD+SGCY
Sbjct: 1189 AQPKPAKPVKKEPAADASGAVKKQTRLGLEATKEDNLPDWYSQVITKGEMIEYYDVSGCY 1248

Query: 62   ILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKS 121
            ILR W+ +IW+ ++ +FDAEI +M +K CYFP+FVS  VL++EK HI  FAPEVAWVTKS
Sbjct: 1249 ILRQWSFAIWKAIKTWFDAEITRMGVKECYFPIFVSKAVLEKEKTHIADFAPEVAWVTKS 1308

Query: 122  GESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREF 181
            G+SDL  PIA+RPTSETVMYP Y+KW++ +RDLP++LNQW NVVRWE         +REF
Sbjct: 1309 GDSDLAEPIAVRPTSETVMYPAYAKWVQSYRDLPIRLNQWNNVVRWEFKQPTPFLRTREF 1368

Query: 182  LWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEA 241
            LWQEGHTAFA K+EA  EVL+IL+LY  +Y   LAIPV+KG+K+E EKFAGG YTT+VEA
Sbjct: 1369 LWQEGHTAFADKEEAAKEVLDILDLYALVYTHLLAIPVVKGRKTEKEKFAGGDYTTTVEA 1428

Query: 242  FIPNTGRGIQGATSHCLGQNFAKMFEINFEN-EKGEKAMVWQNSWAYSTRTIGVMVMVHG 301
            FI  +GR IQGATSH LGQNF+KMFEI +E+ E  +K  V+QNSW  +TRTIGVM+MVH 
Sbjct: 1429 FISASGRAIQGATSHHLGQNFSKMFEIVYEDPETQQKKYVYQNSWGITTRTIGVMIMVHA 1488

Query: 302  DDKGLVLPPKVASVQVVIVP----VPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDN 361
            D++GLVLPP VA +Q ++VP    V  KD +   + DAC      L   G+R E D RDN
Sbjct: 1489 DNQGLVLPPHVACIQAIVVPCGITVNTKDDERAQLLDACKALEKRLVGGGVRCEGDYRDN 1548

Query: 362  YSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESI 421
            YSPGWK++HWE+KGVPLR+E+GPKDL   Q+ AVRRD   K  IP A + +++  LLE+I
Sbjct: 1549 YSPGWKFNHWELKGVPLRLEVGPKDLKAQQLVAVRRDTVEKITIPLADVEKKIPALLETI 1608

Query: 422  QQSLFDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDE 450
             +S+ + A+    +  + +  W +F   L QK ++LAP+C E
Sbjct: 1609 HESMLNKAQEDMTSHTKKVTNWTDFCGFLEQKNILLAPFCGE 1650

BLAST of Cp4.1LG01g18810 vs. Swiss-Prot
Match: SYEP_MOUSE (Bifunctional glutamate/proline--tRNA ligase OS=Mus musculus GN=Eprs PE=1 SV=4)

HSP 1 Score: 530.4 bits (1365), Expect = 2.1e-149
Identity = 262/460 (56.96%), Positives = 335/460 (72.83%), Query Frame = 1

Query: 5    KPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILR 64
            K   S  +    G+ +  KK+T LGL  KK++N  EWYS+V+   EMIEYYD+SGCYILR
Sbjct: 991  KSQGSGLSSGGAGEGQGPKKQTRLGLEAKKEENLAEWYSQVITKSEMIEYYDVSGCYILR 1050

Query: 65   PWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGES 124
            PW+ SIWE+++ FFDAEIKK+ ++NCYFP+FVS   L++EK+HIE FAPEVAWVT+SG++
Sbjct: 1051 PWSYSIWESIKDFFDAEIKKLGVENCYFPIFVSQAALEKEKNHIEDFAPEVAWVTRSGKT 1110

Query: 125  DLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFLWQ 184
            +L  PIAIRPTSETVMYP Y+KW++ HRDLP++LNQWCNVVRWE         +REFLWQ
Sbjct: 1111 ELAEPIAIRPTSETVMYPAYAKWVQSHRDLPVRLNQWCNVVRWEFKHPQPFLRTREFLWQ 1170

Query: 185  EGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIP 244
            EGH+AFAT +EA  EVL+ILELY R+YEE LAIPV++G+K+E EKFAGG YTT++EAFI 
Sbjct: 1171 EGHSAFATFEEAADEVLQILELYARVYEELLAIPVVRGRKTEKEKFAGGDYTTTIEAFIS 1230

Query: 245  NTGRGIQGATSHCLGQNFAKMFEINFENEK--GEKAMVWQNSWAYSTRTIGVMVMVHGDD 304
             +GR IQGATSH LGQNF+KM EI FE+ K  GEK   +Q SW  +TRTIGVMVMVHGD+
Sbjct: 1231 ASGRAIQGATSHHLGQNFSKMCEIVFEDPKTPGEKQFAYQCSWGLTTRTIGVMVMVHGDN 1290

Query: 305  KGLVLPPKVASVQVVIVPVPYKDA----DTQGIFDACSVTLDTLSEAGIRAEVDARDNYS 364
             GLVLPP+VASVQVV++P    +A    D + +   C+     L  A IR  VD RDNYS
Sbjct: 1291 MGLVLPPRVASVQVVVIPCGITNALSEEDREALMAKCNEYRRRLLGANIRVRVDLRDNYS 1350

Query: 365  PGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQ 424
            PGWK++HWE+KGVP+R+E+GP+D+ + Q  AVRRD   K  I       +++++LE IQ 
Sbjct: 1351 PGWKFNHWELKGVPVRLEVGPRDMKSCQFVAVRRDTGEKLTIAEKEAEAKLEKVLEDIQL 1410

Query: 425  SLFDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDE 450
            +LF  A     T + V NT E+F + L   K+   P+C E
Sbjct: 1411 NLFTRASEDLKTHMVVSNTLEDFQKVLDAGKVAQIPFCGE 1450

BLAST of Cp4.1LG01g18810 vs. Swiss-Prot
Match: SYEP_HUMAN (Bifunctional glutamate/proline--tRNA ligase OS=Homo sapiens GN=EPRS PE=1 SV=5)

HSP 1 Score: 527.3 bits (1357), Expect = 1.8e-148
Identity = 254/448 (56.70%), Positives = 326/448 (72.77%), Query Frame = 1

Query: 17   GKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPWAMSIWETMQV 76
            G+ +  KK+T LGL  KK++N  +WYS+V+   EMIEY+DISGCYILRPWA +IWE ++ 
Sbjct: 1003 GEGQGPKKQTRLGLEAKKEENLADWYSQVITKSEMIEYHDISGCYILRPWAYAIWEAIKD 1062

Query: 77   FFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDLEVPIAIRPTS 136
            FFDAEIKK+ ++NCYFP+FVS   L++EK H+  FAPEVAWVT+SG+++L  PIAIRPTS
Sbjct: 1063 FFDAEIKKLGVENCYFPMFVSQSALEKEKTHVADFAPEVAWVTRSGKTELAEPIAIRPTS 1122

Query: 137  ETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFLWQEGHTAFATKDEA 196
            ETVMYP Y+KW++ HRDLP+KLNQWCNVVRWE         +REFLWQEGH+AFAT +EA
Sbjct: 1123 ETVMYPAYAKWVQSHRDLPIKLNQWCNVVRWEFKHPQPFLRTREFLWQEGHSAFATMEEA 1182

Query: 197  DTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNTGRGIQGATSH 256
              EVL+IL+LY ++YEE LAIPV+KG+K+E EKFAGG YTT++EAFI  +GR IQG TSH
Sbjct: 1183 AEEVLQILDLYAQVYEELLAIPVVKGRKTEKEKFAGGDYTTTIEAFISASGRAIQGGTSH 1242

Query: 257  CLGQNFAKMFEINFENEK--GEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLVLPPKVASV 316
             LGQNF+KMFEI FE+ K  GEK   +QNSW  +TRTIGVM MVHGD+ GLVLPP+VA V
Sbjct: 1243 HLGQNFSKMFEIVFEDPKIPGEKQFAYQNSWGLTTRTIGVMTMVHGDNMGLVLPPRVACV 1302

Query: 317  QVVIVPVPYKDA----DTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHWEMKG 376
            QVVI+P    +A    D + +   C+     L    IR   D RDNYSPGWK++HWE+KG
Sbjct: 1303 QVVIIPCGITNALSEEDKEALIAKCNDYRRRLLSVNIRVRADLRDNYSPGWKFNHWELKG 1362

Query: 377  VPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFDAAKVKRDT 436
            VP+R+E+GP+D+ + Q  AVRRD   K  +       +++ +LE IQ +LF  A     T
Sbjct: 1363 VPIRLEVGPRDMKSCQFVAVRRDTGEKLTVAENEAETKLQAILEDIQVTLFTRASEDLKT 1422

Query: 437  CIQVINTWEEFTEALSQKKMILAPWCDE 450
             + V NT E+F + L   K++  P+C E
Sbjct: 1423 HMVVANTMEDFQKILDSGKIVQIPFCGE 1450

BLAST of Cp4.1LG01g18810 vs. Swiss-Prot
Match: PRS1_SCHPO (Putative proline--tRNA ligase C19C7.06 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=prs1 PE=3 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 1.1e-147
Identity = 261/468 (55.77%), Positives = 326/468 (69.66%), Query Frame = 1

Query: 2   AGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCY 61
           A  KP +  K  +       ++    +G+T +KD +F  WY +V+   +MIEYYDISGCY
Sbjct: 179 APSKPAAQKKKAEPSKNDAAIENAALIGITVRKDADFPNWYQQVLTKSDMIEYYDISGCY 238

Query: 62  ILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKS 121
           IL+PW+ SIWE +Q +FD EIKK+ ++N YFPLFVS  VL++EKDH+EGFAPEVAWVT++
Sbjct: 239 ILKPWSYSIWEAIQGWFDKEIKKLGVRNGYFPLFVSSKVLEKEKDHVEGFAPEVAWVTRA 298

Query: 122 GESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREF 181
           G S+L+ PIAIRPTSETVMYPYY+KWIR HRDLPLKLNQW +VVRWE         +REF
Sbjct: 299 GTSELDEPIAIRPTSETVMYPYYAKWIRSHRDLPLKLNQWNSVVRWEFKNPQPFLRTREF 358

Query: 182 LWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEA 241
           LWQEGHTA  T + A  EV +IL+LY RIY + LA+PVIKG KSE EKFAGG++TT+VE 
Sbjct: 359 LWQEGHTAHMTLEGATEEVHQILDLYARIYTDLLAVPVIKGVKSENEKFAGGMFTTTVEG 418

Query: 242 FIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGE--------KAMVWQNSWAYSTRTIG 301
           +IP TGRGIQGATSHCLGQNF+KMF I  E+   E        K  VWQNSW  STRTIG
Sbjct: 419 YIPTTGRGIQGATSHCLGQNFSKMFNIVVEDPNAEIGPTGERPKLFVWQNSWGLSTRTIG 478

Query: 302 VMVMVHGDDKGLVLPPKVASVQVVIVPV----PYKDADTQGIFDACSVTLDTLSEAGIRA 361
           V VMVHGDDKGL LPP +A VQ V+VP        D +   I   CS   D L+ A IR 
Sbjct: 479 VAVMVHGDDKGLKLPPAIALVQSVVVPCGITNKTTDQERNEIEGFCSKLADRLNAADIRT 538

Query: 362 EVDARDNYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQV 421
           E D R  Y+PG+K+SHWEMKGVPLR+E GP D   NQV AVRRD   K  +P  +L + V
Sbjct: 539 EADLR-AYTPGYKFSHWEMKGVPLRLEYGPNDAKKNQVTAVRRDTFEKIPVPLNNLEKGV 598

Query: 422 KELLESIQQSLFDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCD 449
            +LL  IQ ++++ AK +RD  +  +  W +F  AL++K +++ PWC+
Sbjct: 599 SDLLAKIQTNMYETAKAERDAHVVKVKEWADFVPALNKKNIVMIPWCN 645

BLAST of Cp4.1LG01g18810 vs. TrEMBL
Match: A0A0A0KTZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G015750 PE=3 SV=1)

HSP 1 Score: 892.9 bits (2306), Expect = 1.8e-256
Identity = 437/459 (95.21%), Positives = 442/459 (96.30%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MAGPKPGSSA   KAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MAGPKPGSSATNKKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWA+SIWETMQVFFDAEIK+MKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK
Sbjct: 61  YILRPWAISIWETMQVFFDAEIKQMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SRE
Sbjct: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSHPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE
Sbjct: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG
Sbjct: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLVLPPKVASVQV+IVPVPYKDADTQGIFDACS TLDTL+ AGIRAEVD+RDNYSPG
Sbjct: 301 DDKGLVLPPKVASVQVIIVPVPYKDADTQGIFDACSATLDTLTAAGIRAEVDSRDNYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNS KKDIPR SLVEQVKELLESIQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSGKKDIPRDSLVEQVKELLESIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           FDAAKVKRDTCIQVINTWEEFTEAL QKKMILAPWCDEE
Sbjct: 421 FDAAKVKRDTCIQVINTWEEFTEALGQKKMILAPWCDEE 459

BLAST of Cp4.1LG01g18810 vs. TrEMBL
Match: A0A059AB74_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J00690 PE=3 SV=1)

HSP 1 Score: 827.4 bits (2136), Expect = 9.2e-237
Identity = 399/459 (86.93%), Positives = 423/459 (92.16%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MA  +P  S K     GKKKEVKKETGLGL+NKKD+NFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MASGEPKKSNKPNAGAGKKKEVKKETGLGLSNKKDENFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWAM+IWE MQ FFDAEIKKMKIKNCYFPLFVSPGVLQREK+HIEGFAPEVAWVTK
Sbjct: 61  YILRPWAMAIWEIMQEFFDAEIKKMKIKNCYFPLFVSPGVLQREKEHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SG+SDLEVPIAIRPTSETVMYPY+SKWIRGHRDLPL+LNQWCNVVRWE         SRE
Sbjct: 121 SGQSDLEVPIAIRPTSETVMYPYFSKWIRGHRDLPLRLNQWCNVVRWEFSNPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATK+EAD EVL+ILELYRRIYEEYLA+PVIKGKKSE+EKFAGG YTT+VE
Sbjct: 181 FLWQEGHTAFATKEEADAEVLDILELYRRIYEEYLAVPVIKGKKSELEKFAGGFYTTTVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AF+PNTGRGIQGATSHCLGQNFAKMFEI FENEK EKAMVWQNSWAY+TRTIGVMVMVHG
Sbjct: 241 AFVPNTGRGIQGATSHCLGQNFAKMFEIFFENEKREKAMVWQNSWAYTTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLVLPPKVASVQV+IVPVPYKDA+TQGIFDAC+ T++TL EAGIRAE D RDNYSPG
Sbjct: 301 DDKGLVLPPKVASVQVIIVPVPYKDANTQGIFDACTATVNTLCEAGIRAEADLRDNYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRA LVEQVKELL +IQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRADLVEQVKELLANIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           FDAAK KRD CIQV+ TW+EF EAL QKKM+LAPWCDEE
Sbjct: 421 FDAAKQKRDACIQVVKTWDEFVEALGQKKMVLAPWCDEE 459

BLAST of Cp4.1LG01g18810 vs. TrEMBL
Match: W9T0E9_9ROSA (Putative proline--tRNA ligase OS=Morus notabilis GN=L484_022724 PE=3 SV=1)

HSP 1 Score: 825.5 bits (2131), Expect = 3.5e-236
Identity = 393/453 (86.75%), Positives = 423/453 (93.38%), Query Frame = 1

Query: 7   GSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPW 66
           G  AK P AGGKKKEVKKETGLGLTNKKD+NFGEWYSEVVV+GEMIEYYDISGCYILRPW
Sbjct: 3   GGEAKKPNAGGKKKEVKKETGLGLTNKKDENFGEWYSEVVVNGEMIEYYDISGCYILRPW 62

Query: 67  AMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDL 126
            MSIWETMQ FFDAEIKKMK+KNCYFPLFVS  VL++EKDHIEGFAPEVAWVT+SG+S+L
Sbjct: 63  TMSIWETMQEFFDAEIKKMKVKNCYFPLFVSSTVLEKEKDHIEGFAPEVAWVTRSGKSEL 122

Query: 127 EVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFLWQEG 186
           EVP+AIRPTSETVMYPYYSKWIRGHRDLPL+LNQWCNVVRWE         SREFLWQEG
Sbjct: 123 EVPVAIRPTSETVMYPYYSKWIRGHRDLPLRLNQWCNVVRWEFSNPTPFIRSREFLWQEG 182

Query: 187 HTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNT 246
           HTAFATKDEAD EVL+ILELYRRIYEE+LAIPVIKGKKSE+EKFAGGLYTTSVEA+IPNT
Sbjct: 183 HTAFATKDEADEEVLQILELYRRIYEEFLAIPVIKGKKSELEKFAGGLYTTSVEAYIPNT 242

Query: 247 GRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLV 306
           GRG+QGATSHCLGQNFAKMFEI+FENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLV
Sbjct: 243 GRGVQGATSHCLGQNFAKMFEISFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLV 302

Query: 307 LPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHW 366
           LPPKVASVQV++VPVPYKDA+TQGIFDAC+ T++TLSEAGIRAE D RDNYSPGWKYSHW
Sbjct: 303 LPPKVASVQVIVVPVPYKDANTQGIFDACTETVNTLSEAGIRAEADFRDNYSPGWKYSHW 362

Query: 367 EMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFDAAKV 426
           EMKGVPLRIEIGPKDLANNQVRAVRRDNSAK DIPRASLVEQVKELL +IQQ+LFD AK 
Sbjct: 363 EMKGVPLRIEIGPKDLANNQVRAVRRDNSAKVDIPRASLVEQVKELLGNIQQNLFDVAKQ 422

Query: 427 KRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           KRD C++++ TWEEF  AL +KK+ILAPWCDEE
Sbjct: 423 KRDACVEIVKTWEEFIAALGKKKLILAPWCDEE 455

BLAST of Cp4.1LG01g18810 vs. TrEMBL
Match: A0A059A097_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K00563 PE=3 SV=1)

HSP 1 Score: 824.3 bits (2128), Expect = 7.8e-236
Identity = 394/457 (86.21%), Positives = 423/457 (92.56%), Query Frame = 1

Query: 7   GSSAKTPKA----GGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYI 66
           G  +K PKA    GGKKKEVKKETGLGL+NKKD+NFGEWYSEVVVSGEMIEYYDISGCYI
Sbjct: 3   GGESKKPKANAGAGGKKKEVKKETGLGLSNKKDENFGEWYSEVVVSGEMIEYYDISGCYI 62

Query: 67  LRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSG 126
           LRPWAM+IWE MQ FFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSG
Sbjct: 63  LRPWAMAIWEIMQEFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSG 122

Query: 127 ESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFL 186
           ESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPL+LNQWCNVVRWE         SREFL
Sbjct: 123 ESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLRLNQWCNVVRWEFSHPTPFIRSREFL 182

Query: 187 WQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAF 246
           WQEGHTAFATK+EAD EVL+ILELYRRIYEEYLA+PVIKGKKSE+EKFAGG YTT+VEAF
Sbjct: 183 WQEGHTAFATKEEADAEVLDILELYRRIYEEYLAVPVIKGKKSELEKFAGGYYTTTVEAF 242

Query: 247 IPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDD 306
           IP+TGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAY+TRTIGVM+MVHGDD
Sbjct: 243 IPDTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYTTRTIGVMIMVHGDD 302

Query: 307 KGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWK 366
           KGLVLPPKV+SVQV++VPVPYKDADTQGIFDAC+ T +TL EAGIRAEVD RDNYSPGWK
Sbjct: 303 KGLVLPPKVSSVQVIVVPVPYKDADTQGIFDACTATANTLCEAGIRAEVDLRDNYSPGWK 362

Query: 367 YSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFD 426
           YSHWEMKGVPLRIEIGPKD A NQVRAVRRDNS K DIP+ASLVEQV+++L+ IQQSLFD
Sbjct: 363 YSHWEMKGVPLRIEIGPKDFAKNQVRAVRRDNSTKSDIPQASLVEQVEKILDDIQQSLFD 422

Query: 427 AAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           AAK KR+ CIQ++ TW+EF +AL +KKMILAPWCDEE
Sbjct: 423 AAKQKREACIQIVKTWDEFIDALGKKKMILAPWCDEE 459

BLAST of Cp4.1LG01g18810 vs. TrEMBL
Match: A0A022R6H7_ERYGU (Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a004863mg PE=3 SV=1)

HSP 1 Score: 812.8 bits (2098), Expect = 2.3e-232
Identity = 384/453 (84.77%), Positives = 421/453 (92.94%), Query Frame = 1

Query: 7   GSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPW 66
           G  AK+  A GKKKEVKKETGLGL+ KKD+NFGEWYSEVVV+GEMIEYYDISGCYILRPW
Sbjct: 3   GKDAKSNAAKGKKKEVKKETGLGLSYKKDENFGEWYSEVVVNGEMIEYYDISGCYILRPW 62

Query: 67  AMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDL 126
           AMSIWE MQ FFDAEIKKMKIKNCYFPLFVS GVLQ+EKDHIEGFAPEVAWVTKSGES+L
Sbjct: 63  AMSIWEIMQTFFDAEIKKMKIKNCYFPLFVSSGVLQKEKDHIEGFAPEVAWVTKSGESEL 122

Query: 127 EVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFLWQEG 186
           E+PIAIRPTSETVMYPY+SKWIRGHRDLPL+LNQWCNVVRWE         SREFLWQEG
Sbjct: 123 EMPIAIRPTSETVMYPYFSKWIRGHRDLPLRLNQWCNVVRWEFSNPTPFIRSREFLWQEG 182

Query: 187 HTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNT 246
           HTAFATK+EADTEVL+ILELYRRIYEE+LA+PVIKGKKSE EKFAGGLYTT+VEAF+PNT
Sbjct: 183 HTAFATKEEADTEVLDILELYRRIYEEFLAVPVIKGKKSEHEKFAGGLYTTTVEAFVPNT 242

Query: 247 GRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLV 306
           GRG+QGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAY+TRTIGVM+MVHGDDKGLV
Sbjct: 243 GRGVQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYTTRTIGVMIMVHGDDKGLV 302

Query: 307 LPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHW 366
           LPPKVASVQV+++PVPYKDADT+GIFDAC+ T+ +L+E+GIRAE D RDNYSPGWKYSHW
Sbjct: 303 LPPKVASVQVIVIPVPYKDADTKGIFDACAATVKSLNESGIRAEADFRDNYSPGWKYSHW 362

Query: 367 EMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFDAAKV 426
           EMKGVPLRIEIGPKD ANNQVRAVRRDN+ K DIP A +VE+VK++L++IQQSLFDAAK 
Sbjct: 363 EMKGVPLRIEIGPKDYANNQVRAVRRDNATKHDIPMADVVERVKDMLDNIQQSLFDAAKE 422

Query: 427 KRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           KRD CI+V++TWEEF EAL QKKMILAPWCDEE
Sbjct: 423 KRDVCIEVVHTWEEFAEALGQKKMILAPWCDEE 455

BLAST of Cp4.1LG01g18810 vs. TAIR10
Match: AT3G62120.1 (AT3G62120.1 Class II aaRS and biotin synthetases superfamily protein)

HSP 1 Score: 771.2 bits (1990), Expect = 4.0e-223
Identity = 365/450 (81.11%), Positives = 402/450 (89.33%), Query Frame = 1

Query: 10  AKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPWAMS 69
           AK   +G KKK+VKKETGLGL+ KKD+NFGEWYSEV    +MIEYYDISGCYILRPW+M+
Sbjct: 30  AKASSSGQKKKDVKKETGLGLSVKKDENFGEWYSEVCKQ-DMIEYYDISGCYILRPWSMA 89

Query: 70  IWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDLEVP 129
           IWE MQ+FFDAEIKKMK+KNCYFPLFVSPGVL++EKDHIEGFAPEVAWVTKSG+SDLEVP
Sbjct: 90  IWEIMQIFFDAEIKKMKVKNCYFPLFVSPGVLEKEKDHIEGFAPEVAWVTKSGKSDLEVP 149

Query: 130 IAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SREFLWQEGHTA 189
           IAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SREFLWQEGHTA
Sbjct: 150 IAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSNPTPFIRSREFLWQEGHTA 209

Query: 190 FATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNTGRG 249
           FATK EAD EVL+ILELYRRIYEEYLA+PV+KG KSE EKFAGGLYTTSVEAFIPNTGRG
Sbjct: 210 FATKAEADEEVLQILELYRRIYEEYLAVPVVKGMKSENEKFAGGLYTTSVEAFIPNTGRG 269

Query: 250 IQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLVLPP 309
           +QGATSHCLGQNFAKMFEINFENEK E  MVWQNSWAYSTRTIGVM+M HGDDKGLVLPP
Sbjct: 270 VQGATSHCLGQNFAKMFEINFENEKAETEMVWQNSWAYSTRTIGVMIMTHGDDKGLVLPP 329

Query: 310 KVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHWEMK 369
           KVASVQVV++PVPYKDA+TQGI+DAC+ T   L EAGIRAE D RDNYSPGWKYS WEMK
Sbjct: 330 KVASVQVVVIPVPYKDANTQGIYDACTATASALCEAGIRAEEDLRDNYSPGWKYSDWEMK 389

Query: 370 GVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSLFDAAKVKRD 429
           GVPLRIEIGP+DL N+QVR VRRDN  K+DIPR SLVE VKELLE IQQ++++ AK KR+
Sbjct: 390 GVPLRIEIGPRDLENDQVRTVRRDNGVKEDIPRGSLVEHVKELLEKIQQNMYEVAKQKRE 449

Query: 430 TCIQVINTWEEFTEALSQKKMILAPWCDEE 451
            C+Q + TW+EF +AL++KK+ILAPWCDEE
Sbjct: 450 ACVQEVKTWDEFIKALNEKKLILAPWCDEE 478
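
The Identity and Positives percentages in an HSP header can be rechecked from their raw counts. The following is a minimal Python sketch and is not part of the BLAST output. `parse_hsp_stats` and `percent` are hypothetical helper names, and the regular expression assumes the header layout shown for the hits on this page.

```python
import re

# Matches the statistics line of an HSP header as laid out on this page.
# The optional "i" tolerates a label written without it.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the raw counts and reported percentages from one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * count / total, 2)


stats = parse_hsp_stats(
    "Identity = 365/450 (81.11%), Positives = 402/450 (89.33%), Query Frame = 1"
)
recomputed_identity = percent(stats["identities"], stats["length"])
recomputed_positives = percent(stats["positives"], stats["positives_length"])
```

For the AT3G62120.1 hit, 365/450 and 402/450 recompute to 81.11% and 89.33%, which agrees with the reported values.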

BLAST of Cp4.1LG01g18810 vs. TAIR10
Match: AT5G52520.1 (AT5G52520.1 Class II aaRS and biotin synthetases superfamily protein)

HSP 1 Score: 388.7 bits (997), Expect = 5.5e-108
Identity = 190/443 (42.89%), Positives = 275/443 (62.08%), Query Frame = 1

Query: 18  KKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGCYILRPWAMSIWETMQVF 77
           K  EV +         +  +F  WY +V+ S E+ +Y  + G  ++RP+  +IWE +Q +
Sbjct: 53  KSSEVDRLRSDRAVTPRSQDFNAWYLDVIASAELADYGPVRGTMVIRPYGYAIWEAIQDY 112

Query: 78  FDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTKSGESDLEVPIAIRPTSE 137
            + + K+    N YFP F+    +++E  H+EGF+PE+A VT  G  +LE  + +RPTSE
Sbjct: 113 LNVKFKETGHSNMYFPQFIPYSFIEKEASHVEGFSPELALVTVGGGKELEEKLVVRPTSE 172

Query: 138 TVMYPYYSKWIRGHRDLPLKLNQWCNVVRWESR--------EFLWQEGHTAFATKDEADT 197
           T++   +++WI  +RDLPL +NQW NV RWE R        EFLWQEGHTA AT +EA+ 
Sbjct: 173 TIVNHMFTQWIHSYRDLPLMINQWANVTRWEMRTKPFIRTLEFLWQEGHTAHATPEEAEK 232

Query: 198 EVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVEAFIPNTGRGIQGATSHCL 257
           E  +++E+Y R   E  AIPVI G+KS++E FAG   T ++EA + +  + +Q  TSH L
Sbjct: 233 EAKQMIEIYTRFAFEQTAIPVIPGRKSKLETFAGADITYTIEAMMGDR-KALQAGTSHNL 292

Query: 258 GQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHGDDKGLVLPPKVASVQVVI 317
           GQNF++ F   F +E GE+  VWQ SWA STR +G ++M HGDD GL+LPPK+A +QVVI
Sbjct: 293 GQNFSRAFGTQFADENGERQHVWQTSWAVSTRFVGGIIMTHGDDTGLMLPPKIAPIQVVI 352

Query: 318 VPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPGWKYSHWEMKGVPLRIEIG 377
           VP+  KD +  G+  A S   + L  AG+R ++D  D  +PGWK++ WEMKG+PLRIEIG
Sbjct: 353 VPIWKKDTEKTGVLSAASSVKEALQTAGVRVKLDDTDQRTPGWKFNFWEMKGIPLRIEIG 412

Query: 378 PKDLANNQVRAVRRDNSAKK------DIPRASLVEQVKELLESIQQSLFDAAKVKRDTCI 437
           P+D+++N V   RRD   K        +  ++LV  VKE L+ IQ SL + A   RD+ I
Sbjct: 413 PRDVSSNSVVVSRRDVPGKAGKVFGISMEPSTLVAYVKEKLDEIQTSLLEKALSFRDSNI 472

Query: 438 QVINTWEEFTEALSQKKMILAPW 447
             +N++ E  +A+S  K    PW
Sbjct: 473 VDVNSYAELKDAISSGKWARGPW 494

BLAST of Cp4.1LG01g18810 vs. TAIR10
Match: AT5G10880.1 (AT5G10880.1 tRNA synthetase-related / tRNA ligase-related)

HSP 1 Score: 213.4 bits (542), Expect = 3.2e-55
Identity = 105/163 (64.42%), Positives = 122/163 (74.85%), Query Frame = 1

Query: 288 MVHGDDKGLVLPPKVASVQVVIVPVPYKDA-DTQGIFDACSVTLDTLSEAGIRAEVDARD 347
           M HGDDKGLV PPKVA VQVV++ VP K A D Q + DAC     TL  AGIRAE D RD
Sbjct: 89  MTHGDDKGLVFPPKVAPVQVVVIHVPIKGAADYQELCDACEAVESTLLGAGIRAEADIRD 148

Query: 348 NYSPGWKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLES 407
           NYS GWKY+  E+ GVPLRIE GP+DLAN+QVR V RDN AK D+ R  L+EQVK+LLE 
Sbjct: 149 NYSCGWKYADQELTGVPLRIETGPRDLANDQVRIVTRDNGAKMDVKRGDLIEQVKDLLEK 208

Query: 408 IQQSLFDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDE 450
           IQ +L+D AK K + C Q + TW+EF EALSQKK+ILAPWCD+
Sbjct: 209 IQSNLYDVAKRKVEECTQKVETWDEFVEALSQKKLILAPWCDK 251

BLAST of Cp4.1LG01g18810 vs. NCBI nr
Match: gi|659108022|ref|XP_008453976.1| (PREDICTED: putative proline--tRNA ligase C19C7.06 [Cucumis melo])

HSP 1 Score: 896.0 bits (2314), Expect = 3.0e-257
Identity = 438/459 (95.42%), Positives = 445/459 (96.95%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MAGPKPGSSA   KAGGKKKEVKKETGLGLTNKKD+NFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MAGPKPGSSATNQKAGGKKKEVKKETGLGLTNKKDENFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWA+SIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK
Sbjct: 61  YILRPWAISIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SRE
Sbjct: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSHPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE
Sbjct: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AFIPNTGRGIQGATSHCLGQNFAKMFEINFEN+KGEKAMVWQNSWAYSTRTIGVMVMVHG
Sbjct: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENDKGEKAMVWQNSWAYSTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLVLPPKVASVQV+IVPVPYKDADTQGIFDACS TLDTL++AGIRAEVD+RDNYSPG
Sbjct: 301 DDKGLVLPPKVASVQVIIVPVPYKDADTQGIFDACSATLDTLTDAGIRAEVDSRDNYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPR SLVEQVKELLESIQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRDSLVEQVKELLESIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE
Sbjct: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 459

BLAST of Cp4.1LG01g18810 vs. NCBI nr
Match: gi|778690030|ref|XP_004152099.2| (PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X1 [Cucumis sativus])

HSP 1 Score: 892.9 bits (2306), Expect = 2.6e-256
Identity = 437/459 (95.21%), Positives = 442/459 (96.30%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MAGPKPGSSA   KAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MAGPKPGSSATNKKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWA+SIWETMQVFFDAEIK+MKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK
Sbjct: 61  YILRPWAISIWETMQVFFDAEIKQMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SRE
Sbjct: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSHPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE
Sbjct: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG
Sbjct: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLVLPPKVASVQV+IVPVPYKDADTQGIFDACS TLDTL+ AGIRAEVD+RDNYSPG
Sbjct: 301 DDKGLVLPPKVASVQVIIVPVPYKDADTQGIFDACSATLDTLTAAGIRAEVDSRDNYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNS KKDIPR SLVEQVKELLESIQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSGKKDIPRDSLVEQVKELLESIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           FDAAKVKRDTCIQVINTWEEFTEAL QKKMILAPWCDEE
Sbjct: 421 FDAAKVKRDTCIQVINTWEEFTEALGQKKMILAPWCDEE 459

BLAST of Cp4.1LG01g18810 vs. NCBI nr
Match: gi|659072589|ref|XP_008466319.1| (PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X2 [Cucumis melo])

HSP 1 Score: 886.7 bits (2290), Expect = 1.8e-254
Identity = 438/482 (90.87%), Positives = 450/482 (93.36%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MAGPKPGSSA  PKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MAGPKPGSSATNPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWAMS+WETMQ FFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK
Sbjct: 61  YILRPWAMSVWETMQEFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SRE
Sbjct: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSHPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE
Sbjct: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG
Sbjct: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLV+PPKVASVQV+IVPVPYKDADT+GIFDACS T D LS+AGIRAEVD R+NYSPG
Sbjct: 301 DDKGLVMPPKVASVQVIIVPVPYKDADTRGIFDACSATSDALSKAGIRAEVDIRENYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRA LVEQVKELLESIQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRALLVEQVKELLESIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEEVLNVLHLGSLPKNGAIGAAAT 474
           FDAAK KRD CIQV+NTWEEFTEAL QKKMILAPWC+EE++ V   G      AI AAA 
Sbjct: 421 FDAAKEKRDACIQVVNTWEEFTEALGQKKMILAPWCNEELVFVGRKGDFV---AILAAAA 479

BLAST of Cp4.1LG01g18810 vs. NCBI nr
Match: gi|659072593|ref|XP_008466335.1| (PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X3 [Cucumis melo])

HSP 1 Score: 885.9 bits (2288), Expect = 3.1e-254
Identity = 432/460 (93.91%), Positives = 440/460 (95.65%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MAGPKPGSSA  PKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MAGPKPGSSATNPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWAMS+WETMQ FFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK
Sbjct: 61  YILRPWAMSVWETMQEFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SRE
Sbjct: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSHPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE
Sbjct: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG
Sbjct: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLV+PPKVASVQV+IVPVPYKDADT+GIFDACS T D LS+AGIRAEVD R+NYSPG
Sbjct: 301 DDKGLVMPPKVASVQVIIVPVPYKDADTRGIFDACSATSDALSKAGIRAEVDIRENYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRA LVEQVKELLESIQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRALLVEQVKELLESIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEEV 452
           FDAAK KRD CIQV+NTWEEFTEAL QKKMILAPWC+EEV
Sbjct: 421 FDAAKEKRDACIQVVNTWEEFTEALGQKKMILAPWCNEEV 460

BLAST of Cp4.1LG01g18810 vs. NCBI nr
Match: gi|659072587|ref|XP_008466310.1| (PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X1 [Cucumis melo])

HSP 1 Score: 884.4 bits (2284), Expect = 9.1e-254
Identity = 431/459 (93.90%), Positives = 439/459 (95.64%), Query Frame = 1

Query: 1   MAGPKPGSSAKTPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60
           MAGPKPGSSA  PKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC
Sbjct: 1   MAGPKPGSSATNPKAGGKKKEVKKETGLGLTNKKDDNFGEWYSEVVVSGEMIEYYDISGC 60

Query: 61  YILRPWAMSIWETMQVFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120
           YILRPWAMS+WETMQ FFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK
Sbjct: 61  YILRPWAMSVWETMQEFFDAEIKKMKIKNCYFPLFVSPGVLQREKDHIEGFAPEVAWVTK 120

Query: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE---------SRE 180
           SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWE         SRE
Sbjct: 121 SGESDLEVPIAIRPTSETVMYPYYSKWIRGHRDLPLKLNQWCNVVRWEFSHPTPFIRSRE 180

Query: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240
           FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE
Sbjct: 181 FLWQEGHTAFATKDEADTEVLEILELYRRIYEEYLAIPVIKGKKSEMEKFAGGLYTTSVE 240

Query: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300
           AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG
Sbjct: 241 AFIPNTGRGIQGATSHCLGQNFAKMFEINFENEKGEKAMVWQNSWAYSTRTIGVMVMVHG 300

Query: 301 DDKGLVLPPKVASVQVVIVPVPYKDADTQGIFDACSVTLDTLSEAGIRAEVDARDNYSPG 360
           DDKGLV+PPKVASVQV+IVPVPYKDADT+GIFDACS T D LS+AGIRAEVD R+NYSPG
Sbjct: 301 DDKGLVMPPKVASVQVIIVPVPYKDADTRGIFDACSATSDALSKAGIRAEVDIRENYSPG 360

Query: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRASLVEQVKELLESIQQSL 420
           WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRA LVEQVKELLESIQQSL
Sbjct: 361 WKYSHWEMKGVPLRIEIGPKDLANNQVRAVRRDNSAKKDIPRALLVEQVKELLESIQQSL 420

Query: 421 FDAAKVKRDTCIQVINTWEEFTEALSQKKMILAPWCDEE 451
           FDAAK KRD CIQV+NTWEEFTEAL QKKMILAPWC+EE
Sbjct: 421 FDAAKEKRDACIQVVNTWEEFTEALGQKKMILAPWCNEE 459

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
SYPC_ARATH  7.0e-222  81.11     Proline--tRNA ligase, cytoplasmic OS=Arabidopsis thaliana GN=At3g62120 PE=2 SV=1
SYEP_DROME  7.2e-150  56.28     Bifunctional glutamate/proline--tRNA ligase OS=Drosophila melanogaster GN=Aats-g...
SYEP_MOUSE  2.1e-149  56.96     Bifunctional glutamate/proline--tRNA ligase OS=Mus musculus GN=Eprs PE=1 SV=4
SYEP_HUMAN  1.8e-148  56.70     Bifunctional glutamate/proline--tRNA ligase OS=Homo sapiens GN=EPRS PE=1 SV=5
PRS1_SCHPO  1.1e-147  55.77     Putative proline--tRNA ligase C19C7.06 OS=Schizosaccharomyces pombe (strain 972 ...
Match Name        E-value   Identity  Description
A0A0A0KTZ1_CUCSA  1.8e-256  95.21     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G015750 PE=3 SV=1
A0A059AB74_EUCGR  9.2e-237  86.93     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J00690 PE=3 SV=1
W9T0E9_9ROSA      3.5e-236  86.75     Putative proline--tRNA ligase OS=Morus notabilis GN=L484_022724 PE=3 SV=1
A0A059A097_EUCGR  7.8e-236  86.21     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K00563 PE=3 SV=1
A0A022R6H7_ERYGU  2.3e-232  84.77     Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a004863mg PE=3 SV=1
Match Name   E-value   Identity  Description
AT3G62120.1  4.0e-223  81.11     Class II aaRS and biotin synthetases superfamily protein
AT5G52520.1  5.5e-108  42.89     Class II aaRS and biotin synthetases superfamily protein
AT5G10880.1  3.2e-55   64.42     tRNA synthetase-related / tRNA ligase-related
Match Name                        E-value   Identity  Description
gi|659108022|ref|XP_008453976.1|  3.0e-257  95.42     PREDICTED: putative proline--tRNA ligase C19C7.06 [Cucumis melo]
gi|778690030|ref|XP_004152099.2|  2.6e-256  95.21     PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X1 [Cucumis sativus]
gi|659072589|ref|XP_008466319.1|  1.8e-254  90.87     PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X2 [Cucumis melo]
gi|659072593|ref|XP_008466335.1|  3.1e-254  93.91     PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X3 [Cucumis melo]
gi|659072587|ref|XP_008466310.1|  9.1e-254  93.90     PREDICTED: putative proline--tRNA ligase C19C7.06 isoform X1 [Cucumis melo]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006433  prolyl-tRNA aminoacylation
GO:0006418  tRNA aminoacylation for protein translation
Vocabulary: Cellular Component
Term        Definition
GO:0005737  cytoplasm
Vocabulary: Molecular Function
Term        Definition
GO:0004827  proline-tRNA ligase activity
GO:0005524  ATP binding
GO:0004812  aminoacyl-tRNA ligase activity
GO:0000166  nucleotide binding
Vocabulary: INTERPRO
Term       Definition
IPR016061  Pro-tRNA_ligase_II_C
IPR006195  aa-tRNA-synth_II
IPR004499  Pro-tRNA-ligase_IIa_arc-type
IPR004154  Anticodon-bd
IPR002316  Pro-tRNA-ligase_IIa
IPR002314  aa-tRNA-synt_IIb
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006525 arginine metabolic process
biological_process GO:0006560 proline metabolic process
biological_process GO:0006433 prolyl-tRNA aminoacylation
biological_process GO:0006418 tRNA aminoacylation for protein translation
cellular_component GO:0005737 cytoplasm
molecular_function GO:0005524 ATP binding
molecular_function GO:0004827 proline-tRNA ligase activity
molecular_function GO:0004812 aminoacyl-tRNA ligase activity
molecular_function GO:0000166 nucleotide binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g18810.1  Cp4.1LG01g18810.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                  Source    Source Term        Source Description                           Alignment
IPR002314  Aminoacyl-tRNA synthetase, class II (G/ P/ S/T)  PFAM      PF00587            tRNA-synt_2b                                 coord: 130..287, score: 6.2
IPR002316  Proline-tRNA ligase, class IIa                   PRINTS    PR01046            TRNASYNTHPRO                                 coord: 89..107, score: 9.2E-5; coord: 130..141, score: 9.2E-5; coord: 160..168, score: 9.
IPR004154  Anticodon-binding                                GENE3D    G3DSA:3.40.50.800                                               coord: 300..420, score: 3.2
IPR004154  Anticodon-binding                                PFAM      PF03129            HGTP_anticodon                               coord: 306..402, score: 9.9
IPR004154  Anticodon-binding                                unknown   SSF52954           Class II aaRS ABD-related                    coord: 294..419, score: 3.93
IPR004499  Proline-tRNA ligase, class IIa, archaeal-type    HAMAP     MF_01571           Pro_tRNA_synth_type3                         coord: 31..483, score: 35
IPR004499  Proline-tRNA ligase, class IIa, archaeal-type    TIGRFAMs  TIGR00408          TIGR00408                                    coord: 32..452, score: 4.6E
IPR006195  Aminoacyl-tRNA synthetase, class II              PROFILE   PS50862            AA_TRNA_LIGASE_II                            coord: 70..299, score: 1
IPR016061  Proline-tRNA ligase, class II, C-terminal        GENE3D    G3DSA:3.30.110.30                                               coord: 424..450, score: 5.
None       No IPR available                                 GENE3D    G3DSA:3.30.930.10                                               coord: 28..299, score: 3.0E
None       No IPR available                                 PANTHER   PTHR11451          TRNA SYNTHETASE-RELATED                      coord: 31..450, score: 5.4E
None       No IPR available                                 PANTHER   PTHR11451:SF21     BIFUNCTIONAL GLUTAMATE/PROLINE--TRNA LIGASE  coord: 31..450, score: 5.4E
None       No IPR available                                 unknown   SSF55681           Class II aaRS and biotin synthetases         coord: 36..293, score: 7.77
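
The `coord:` entries in the Alignment column can be converted into domain spans on the protein. The following is a minimal Python sketch under one stated assumption: the coordinates are treated as 1-based, inclusive residue positions, which this page does not state explicitly. `parse_coords` and `span_length` are hypothetical helper names.

```python
import re

# A coordinate entry looks like "coord: 130..287".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Residue count of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Pfam matches listed above: PF00587 (tRNA-synt_2b) and PF03129 (HGTP_anticodon).
pfam_spans = parse_coords("coord: 130..287") + parse_coords("coord: 306..402")
pfam_lengths = [span_length(start, end) for start, end in pfam_spans]
```

Under that assumption, the PF00587 match at 130..287 spans 158 residues and the PF03129 match at 306..402 spans 97 residues.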