HG10004577 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10004577
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionla protein 1
LocationChr08: 18494254 .. 18499035 (-)
RNA-Seq ExpressionHG10004577
SyntenyHG10004577
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGAATTCTTCTCTCGATCAAGAAATTGCCAAGAAAGTGCTTCGTCAGGTCAAATGCGTTTAAAATCTATCTTCATTTCATGTTCTAATCGATCATTTCTAATTGTTCATCGGTTAAATGTTTCTTCTGCAGGTCGAGTTCTATTTCAGTGACAGTAATCTTCCTAGAGATTGTTTTCTGAGGAAACATATTAGTGAAAGCCCTGATGGAAGTATCCTTCACCTTCTGATCATACGTGTTTTTAACGCTTTTCATCTTTAGTTCTGTTTAATTTGCCTTCAAGATTGATGGCCTCTTTAAGTATTACCTATTCCCATTTTGGGTTTTGTTTTGATTTAATATCTAAGTTGGATTAGTATTATTGTGTTTGGATGCTCATTTCGAAAGTCTTTCGTTGATCTTAACTGTGCAATTACAGTGGTCGACTTGTCTTTAATCTGTTCGTTTTCTCGGATGAGAGGCCATCTTGAATTGAAACACGAGATGAAATCAGATGAGATTTCAGAAGATACTATAAAGGCCGTCGCTGAAACTCTAAGAAGCTCTTCAACTGTTAAAGTTTCTGAAGATGGTTAGTTCTTTCGCATACAAGTGATGTAATTCGTTATTTATCTGAATGCTGAGTAATCGTGGAATTTCATGAAATGCAGAGTTTGGAGTTATGGAAAAACCAATTCATTTTCTACCCATTAATGTATCTTTATGACTATATCTATTGCATTACACAATTTCATGAAAGAAGAAGAAAACTACTTAGCAGGAGTGAAGGTTAAGGTTTCAATTTTGTTGAAAGGTTGGTCTAAATGTTGAAATCATCTGTCCCCAAAAAAATCATGAAAATAAAGTAAATTAATAAATAAAAATGTTAAATGTCATTAATTCTCATGTTAACTGCTTATATCGAAGATGCTGAATGATATAATCCTTCAATTTATCATAATGCTTATGAAACTATCAAAATACTTCATTGTTCAGATTTTTAGATTAGCTCTTGTTCTACAATTAACTTTTGTATCTTATTCCAATTTACGGATTGTTCTGGTCTATATTATTATTTTGAATTTGGGTTTGCACGGTCCAGGGAAGAAGGTTGGTAGAGCTACTGAACTCCCAAAGCCTGAGGAGTTAATAGAACAATTGGATGTTAGAACCGTAGCTGCATCACCATTTGAATATGATGTCAAGCTTGAAGATGTAGAAGCTTTCTTTGATCGAGTCACAAAGGTAGTTTTTTTTTCATTGATTTTAATTTCTATATTTTATTCCCGATGTCCTTGTTGGCATAATTTTACATGCGAATGTTATTGTTTCATTATTGGCTTCAGAATGCCCTTAGTCTTAGCTAAGATTACCATGAAGCAGCTCAAGCAGCAGATTCTATTCATGGGCATGAATGTGCAACCTAAAGACTCCCAAGTTTTTTTCCTCTACTTCTCAGTGGCTATAAAGCTTTTTTGAATAAATTTGTAGTAGTTTTCTCTTCTGAACAATTTCGTAAACAATTGTGGAAATTTGAACTTGTGAAATGCAATTCTTTGATTCCTTCCAAATTGTGGAAATTTTAAATTGTGAAATGCAGTACTTTGATTCTTCCCATTTTCATTGATCTTATGACTGTAAACTGATTCTTTAGTTATTTTATTTTGCATTACATATCAAGTACTCAAAATCCTGTTCTGTCAGTCTTAATAAGGTTATTATCAGTATTGTTGAGCTAAAACGAGTTAGCCCATTCATTTGAAATGGAAATGATTTGTTTGAAATGGAAATGAGTGCGATTCAAAAGGTTGCATTGCAAAAACATAGTAAATCACAATTTCCAGTTCAAGAAATTGAAAAAGAAGTTTAAAATAAAAAATACCATTGTCTGCCATTAGTTGCCGCCTCTGTGTTGGCGATGGCCACAACTGCTGGCAAAGAAACAAAAGAAAAAAGAGAAGAAAATAAAAGAACTATGGGGAAATGGAAAAGACAACCTGACAATTTGATAATTGTAATTTAAAATTAAAATTAATTAGGTGGCTCTTCTTTCGCGACTGTATATTTAAATTCTTTTATTTTTCTTTTTATTCTTCTTTTTCTTTTTCAAATGTTTATTATCACGAGTTATAACTAACCCTATGTATATTCCATTTCATGAGTTAACATCCTTGGTTATTATCACTACCCTTTGTCAAAGTATGTATTTTTTCCCCCATTTTATATTGTTAAGTTAAATATTAGAACTTTCCTACTGGTTTTCAGGTCAGTAGTGTGAGGCTTCCCCGTCATGTTGCAGATAAACGGGTATTCTGTGGAACTGCCTTGGTAGAATTTTCTACTGAGGAAGGTGCTGAAAAGGTTTTGAAGGAAAGCTTGGTTTATGCTGGTGCCAAGTTAGAATTCAAACGAAAGTATGTGTGTTTTGAAATACATCTACATTTTCTTCCAACTCTTTTGCTTCATGATGAACCTCCAGGTGTATCAGTCAAACTCAATTTAAATAAGAAGCAATCGTCAAAACTTTAAAACCATGCATTGCATTGTGAATTGTAAGAATTATAATTACAATAGCATGAATTGTTGCAATATTGAATTTAGATTAGTTTGAAGATTCTGAAGACCGTTTTAAATGCTGAAAACAATCATCAATAGGATTTCCATATTCTTTGACTTCTTCTTCTGACCTTTTTGAAAGGCTTATTGGAAAATGATTAACTGTCAGATTTCATACTATGTTTTGGATATATCCGTTAGTTTACTTCTGTATTATGTAAAATAGTGGGTTATTCAATATTTTATTCTTCCAGCCATCGTTAAATTGTGTCTAATCCTTAGGTGCAAGCCTTGAGGCTAAAAATTTGTGAATTGTGATGATATTTAAAAAAAAAAGTTGTTATTCTAATTGGTATCTATGTGAAGGCGTATTGATTACCCAAGCTTCTTCGACAACTTTAAGCCATTGATAATATATTCCCTTTGACATATCAGTCAACTGCGTTTGTAAAGATCATTTCCATTTCAGGAGGGATTTTGATGAGGAGAGAGCAAAAGAGACGGAGAAGTTTGAAAGTTCACGCTCAATCTCGGGTGCAAACCGCAACAATAATAATTCCCCAGAATCGAGGTATGTAGTACAAGCTTGTAGTAATATCTAGTATTCTTAGCCATACACTTCTCTTTCCACATCACATGATCTTACACATCATCATTGCAGCTACCCTAAAGGCTTAATTGTCGCCTTTACATTGAAAAGCATATCATCTGAAAGTTCATCTGCTGAGGAAAATGGATCTCATGGTGTAGCCGCTGATAAGACTGAATGCAAAACAGATGAAGGATTAGATTCCTCAAAGAACGACTCTGAGAAAACTGAGCAAATAGAGGTAACAAATATGAGTAAAGATGAAGAAATTAAGGAAAGTGCTGATGATAAGAATGGAGAGGCTGAACGGAAGGGTGATTCCGTCAATGAAGAAAGTCCTGAAGTGGAAAAAGAACAGTCTATGGATGATCCCATTGATGAACATGAAGAGGCTGAAGAAAAGCCCACAGTTGCTAAATCCAAGAACAATATGAATGTGGTTTCACGCGAGGATTTGAAGGCTGTTTTTCGGAAGTTTGGAAGTGTCAAGGTATCTCATAGCTTCTTCCTCATATGCTACGCTCAACAAAATATTTCAAGAAAATTTTTTGTTCTCTACTTTATCTAGTAGTTTATCTTCTCTTCTCTGTTTGTGATTGTCATTTTGTCATGGGAACCTAGTTTTTCTTGCATCATTATTGTCTTATTTCCTTCATTCTACCGTGGAATTTTACGCTAATGATATAGGTATGGAACTGTGAGTTTTGGGGGGTGGGTTCCATTGTGACATATAAAGCTTATTCATTCTGACACATCTGAGTGTTCTTTTCCCCTCTCTTCTTTGGTTCGAATTGTCAAGTTCATTGATTTTAAGATTGGAGACGAGTCGGGGTATATCAGGTTTGAGGAGCCTGAAGCTGCCCAGAAAGCTCGTGCAGCTGCAGTACTAGCTGACCAAGGGGGACTGGCTGTGAAGAATTTTATAGCCACTTTAGAACCAGTGTCAGGTATCATTGAAAATTTCACTGTTCTTTGGCTGCTGGGAATGGTGTCTTTCTAGCAATATCAATCGGCATAGTTTCTTTAAAAATTATTTAATATTGCACATGGTGATCTGCCAGTAACTTTTGCTTCTTTAAAACACAACTAAAATCATCAAATTCTGGTAAATGACTTTTCGTCACTTAATGTTCCTATGATCAATTCATTGTTCTTTCCCATATCCGATTGAGCTTTTCTAACTTGATACAATTGACTTACTTCATCTTTACTCGTGCGTTCTCCTTGAGTGGATCTTATCCTTTTAACAATTTTGGTTAAATTCATTTGATTTCAGGTGAGGCTGAGAAGGAGTATTGGAGTCTACTCCGCAGCAACCAAGAGAAGCATCATCGCGACTTTAAGGGAAACCGTGGAAGGTTTGCTTTCTAAATAACTAAGTATTGGAGTCTATTGACAACCATTTTGTTTTAGTTTTCTGCTGTTGCTCCATTTGATAACCATTAGGTTTTGAGTTTCGAAAATTAAACTTAAACACTACTTTCACCTTTTGATTTATTTGTTTTACTATAATATTTTCTATGAGTGCTTTTAAAATCTTTTCTGTTGCTTGCTACTTCTATAGGGGTGGTAAATTCAATCGAGGTGGAAAGCACGGTCGGTCGAGGGGGTATGACAACCATCGAGGTCGCCCGAACAAAGCTCAAAAGGTTTGA

mRNA sequence

ATGGAGAATTCTTCTCTCGATCAAGAAATTGCCAAGAAAGTGCTTCGTCAGGTCGAGTTCTATTTCAGTGACAGTAATCTTCCTAGAGATTGTTTTCTGAGGAAACATATTAGTGAAAGCCCTGATGGAATGGTCGACTTGTCTTTAATCTGTTCGTTTTCTCGGATGAGAGGCCATCTTGAATTGAAACACGAGATGAAATCAGATGAGATTTCAGAAGATACTATAAAGGCCGTCGCTGAAACTCTAAGAAGCTCTTCAACTGTTAAAGTTTCTGAAGATGGGAAGAAGGTTGGTAGAGCTACTGAACTCCCAAAGCCTGAGGAGTTAATAGAACAATTGGATGTTAGAACCGTAGCTGCATCACCATTTGAATATGATGTCAAGCTTGAAGATGTAGAAGCTTTCTTTGATCGAGTCACAAAGGTCAGTAGTGTGAGGCTTCCCCGTCATGTTGCAGATAAACGGGTATTCTGTGGAACTGCCTTGGTAGAATTTTCTACTGAGGAAGGTGCTGAAAAGGTTTTGAAGGAAAGCTTGGTTTATGCTGGTGCCAAGTTAGAATTCAAACGAAAGAGGGATTTTGATGAGGAGAGAGCAAAAGAGACGGAGAAGTTTGAAAGTTCACGCTCAATCTCGGGTGCAAACCGCAACAATAATAATTCCCCAGAATCGAGCTACCCTAAAGGCTTAATTGTCGCCTTTACATTGAAAAGCATATCATCTGAAAGTTCATCTGCTGAGGAAAATGGATCTCATGGTGTAGCCGCTGATAAGACTGAATGCAAAACAGATGAAGGATTAGATTCCTCAAAGAACGACTCTGAGAAAACTGAGCAAATAGAGGTAACAAATATGAGTAAAGATGAAGAAATTAAGGAAAGTGCTGATGATAAGAATGGAGAGGCTGAACGGAAGGGTGATTCCGTCAATGAAGAAAGTCCTGAAGTGGAAAAAGAACAGTCTATGGATGATCCCATTGATGAACATGAAGAGGCTGAAGAAAAGCCCACAGTTGCTAAATCCAAGAACAATATGAATGTGGTTTCACGCGAGGATTTGAAGGCTGTTTTTCGGAAGTTTGGAAGTGTCAAGTTCATTGATTTTAAGATTGGAGACGAGTCGGGGTATATCAGGTTTGAGGAGCCTGAAGCTGCCCAGAAAGCTCGTGCAGCTGCAGTACTAGCTGACCAAGGGGGACTGGCTGTGAAGAATTTTATAGCCACTTTAGAACCAGTGTCAGGTGAGGCTGAGAAGGAGTATTGGAGTCTACTCCGCAGCAACCAAGAGAAGCATCATCGCGACTTTAAGGGAAACCGTGGAAGGGGTGGTAAATTCAATCGAGGTGGAAAGCACGGTCGGTCGAGGGGGTATGACAACCATCGAGGTCGCCCGAACAAAGCTCAAAAGGTTTGA

Coding sequence (CDS)

ATGGAGAATTCTTCTCTCGATCAAGAAATTGCCAAGAAAGTGCTTCGTCAGGTCGAGTTCTATTTCAGTGACAGTAATCTTCCTAGAGATTGTTTTCTGAGGAAACATATTAGTGAAAGCCCTGATGGAATGGTCGACTTGTCTTTAATCTGTTCGTTTTCTCGGATGAGAGGCCATCTTGAATTGAAACACGAGATGAAATCAGATGAGATTTCAGAAGATACTATAAAGGCCGTCGCTGAAACTCTAAGAAGCTCTTCAACTGTTAAAGTTTCTGAAGATGGGAAGAAGGTTGGTAGAGCTACTGAACTCCCAAAGCCTGAGGAGTTAATAGAACAATTGGATGTTAGAACCGTAGCTGCATCACCATTTGAATATGATGTCAAGCTTGAAGATGTAGAAGCTTTCTTTGATCGAGTCACAAAGGTCAGTAGTGTGAGGCTTCCCCGTCATGTTGCAGATAAACGGGTATTCTGTGGAACTGCCTTGGTAGAATTTTCTACTGAGGAAGGTGCTGAAAAGGTTTTGAAGGAAAGCTTGGTTTATGCTGGTGCCAAGTTAGAATTCAAACGAAAGAGGGATTTTGATGAGGAGAGAGCAAAAGAGACGGAGAAGTTTGAAAGTTCACGCTCAATCTCGGGTGCAAACCGCAACAATAATAATTCCCCAGAATCGAGCTACCCTAAAGGCTTAATTGTCGCCTTTACATTGAAAAGCATATCATCTGAAAGTTCATCTGCTGAGGAAAATGGATCTCATGGTGTAGCCGCTGATAAGACTGAATGCAAAACAGATGAAGGATTAGATTCCTCAAAGAACGACTCTGAGAAAACTGAGCAAATAGAGGTAACAAATATGAGTAAAGATGAAGAAATTAAGGAAAGTGCTGATGATAAGAATGGAGAGGCTGAACGGAAGGGTGATTCCGTCAATGAAGAAAGTCCTGAAGTGGAAAAAGAACAGTCTATGGATGATCCCATTGATGAACATGAAGAGGCTGAAGAAAAGCCCACAGTTGCTAAATCCAAGAACAATATGAATGTGGTTTCACGCGAGGATTTGAAGGCTGTTTTTCGGAAGTTTGGAAGTGTCAAGTTCATTGATTTTAAGATTGGAGACGAGTCGGGGTATATCAGGTTTGAGGAGCCTGAAGCTGCCCAGAAAGCTCGTGCAGCTGCAGTACTAGCTGACCAAGGGGGACTGGCTGTGAAGAATTTTATAGCCACTTTAGAACCAGTGTCAGGTGAGGCTGAGAAGGAGTATTGGAGTCTACTCCGCAGCAACCAAGAGAAGCATCATCGCGACTTTAAGGGAAACCGTGGAAGGGGTGGTAAATTCAATCGAGGTGGAAAGCACGGTCGGTCGAGGGGGTATGACAACCATCGAGGTCGCCCGAACAAAGCTCAAAAGGTTTGA

Protein sequence

MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDKNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFRKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEKEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV
Homology
BLAST of HG10004577 vs. NCBI nr
Match: XP_008445036.1 (PREDICTED: la protein 1 [Cucumis melo] >KAA0065026.1 la protein 1 [Cucumis melo var. makuwa])

HSP 1 Score: 746.9 bits (1927), Expect = 1.1e-211
Identity = 410/473 (86.68%), Postives = 437/473 (92.39%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MEN+SLDQE AKKVLRQVEFYFSDSNLPRD FLRK ISESPDGMVDLSLIC+F+RM+GHL
Sbjct: 1   MENTSLDQEKAKKVLRQVEFYFSDSNLPRDGFLRKSISESPDGMVDLSLICTFTRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           ELK ++  ++  EDT+KAVAETLR+SS++KVSEDGKKVGRATELPKPEELIEQLD RTVA
Sbjct: 61  ELKQDVTPEDFPEDTLKAVAETLRTSSSLKVSEDGKKVGRATELPKPEELIEQLDDRTVA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYD+KLEDVEAFFDRVTKV+SVRLPRHVADKRVFCGTAL+EFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDIKLEDVEAFFDRVTKVNSVRLPRHVADKRVFCGTALIEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANR-NNNNSPESSYPKGLIVAFTLKS 240
           VYAGAKLE K KR+FDEERAKE EKFESSRS  GANR NNN+SPESSYPKGLIVAFTLKS
Sbjct: 181 VYAGAKLELKPKREFDEERAKEMEKFESSRSTLGANRSNNNSSPESSYPKGLIVAFTLKS 240

Query: 241 ISSESSSAEENGSHGVAADKTECKTDEGL-DSSKNDSEKTEQIEVTNMSKDEEIKESADD 300
            SS  +SAEEN SHGVAADKTECKTDEGL DSSKND EKTEQIE TN+SKDEEIK+SADD
Sbjct: 241 TSS-GNSAEENESHGVAADKTECKTDEGLEDSSKNDPEKTEQIEETNLSKDEEIKKSADD 300

Query: 301 KNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVF 360
           KNGEAE K D+ NE S EVE EQ MD  +DE EEAEEKPT  KS+NNMNVVSREDLKAVF
Sbjct: 301 KNGEAEEKNDTGNERSLEVE-EQCMDGTVDEQEEAEEKPTALKSRNNMNVVSREDLKAVF 360

Query: 361 RKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAE 420
           +KFGSVKFIDFKIGDESGYIRFEEPEAAQKARA+AVL +QGGLAVKNFIATLEPVSGEAE
Sbjct: 361 QKFGSVKFIDFKIGDESGYIRFEEPEAAQKARASAVLTEQGGLAVKNFIATLEPVSGEAE 420

Query: 421 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRG+DNHRGRPNKAQKV
Sbjct: 421 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGHDNHRGRPNKAQKV 471

BLAST of HG10004577 vs. NCBI nr
Match: XP_004150280.1 (la protein 1 [Cucumis sativus] >KGN62834.1 hypothetical protein Csa_022558 [Cucumis sativus])

HSP 1 Score: 746.9 bits (1927), Expect = 1.1e-211
Identity = 410/472 (86.86%), Postives = 439/472 (93.01%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           M N+SLDQE+AKKVLRQVEFYFSDSNLPRD FLRK ISESPDG+VDLSLIC+FSRM+GHL
Sbjct: 1   MGNTSLDQEMAKKVLRQVEFYFSDSNLPRDTFLRKTISESPDGLVDLSLICTFSRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           ELK ++  +   EDT+KAVAETLR+SS++KVSEDGKKVGRATELPKPEELIEQLD RTVA
Sbjct: 61  ELKQDVTPENFPEDTMKAVAETLRTSSSLKVSEDGKKVGRATELPKPEELIEQLDDRTVA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYD+KLEDVEAFF++VTKV+SVRLPRHVADKRVFCGTAL+EFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDIKLEDVEAFFNQVTKVNSVRLPRHVADKRVFCGTALIEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANR-NNNNSPESSYPKGLIVAFTLKS 240
           VYAGAKLE K KR+FDEERAKE EKFESSRS SGANR NNN+SPE+SYPKGLIVAFTLKS
Sbjct: 181 VYAGAKLELKPKREFDEERAKEMEKFESSRSTSGANRSNNNSSPEASYPKGLIVAFTLKS 240

Query: 241 ISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDK 300
            SS  S+AE N SHGV ADKTECKTDEGLDSSKNDSEKT QIE TN+SKDEEIKESADDK
Sbjct: 241 TSS-GSTAEGNESHGV-ADKTECKTDEGLDSSKNDSEKTVQIEETNLSKDEEIKESADDK 300

Query: 301 NGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFR 360
           NGEA  K DS NE+S EVE EQSMDD +DEHEEAEEKPT  +S+NNMNVVSREDLKAVFR
Sbjct: 301 NGEAVEKNDSGNEKSLEVE-EQSMDDTVDEHEEAEEKPTAFQSRNNMNVVSREDLKAVFR 360

Query: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEK 420
           KFGSVKFIDFKIGDESGYIRFEEPEAAQKARA+AVLA+QGGLAVKNFIATLEPVSGEAEK
Sbjct: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARASAVLAEQGGLAVKNFIATLEPVSGEAEK 420

Query: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           EYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRG+DNHRGRPNKAQKV
Sbjct: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGHDNHRGRPNKAQKV 469

BLAST of HG10004577 vs. NCBI nr
Match: XP_038886452.1 (la protein 1 [Benincasa hispida])

HSP 1 Score: 735.7 bits (1898), Expect = 2.5e-208
Identity = 412/496 (83.06%), Postives = 434/496 (87.50%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           ME  SLDQE+AKKV+RQVEFYFSDSNLPRDCFL K+I+ESPDGMV+LSLIC+FSRMR HL
Sbjct: 1   METPSLDQEMAKKVIRQVEFYFSDSNLPRDCFLMKNINESPDGMVNLSLICTFSRMRSHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           ELKHE + DEI ED IKAV ETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLD RTVA
Sbjct: 61  ELKHETRPDEIPEDIIKAVGETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDDRTVA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPF YDVKLEDV AFFDRV KV+SVRLPRHVADKRVFCGTALVEFSTEE AEKVLKESL
Sbjct: 121 ASPFPYDVKLEDVHAFFDRVAKVNSVRLPRHVADKRVFCGTALVEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSI 240
           VYAGAKLE K KRDFDEER KETEKFESSRS SGANR NN+SPESSYPKGLIVAFTLKS+
Sbjct: 181 VYAGAKLELKPKRDFDEERTKETEKFESSRSTSGANR-NNSSPESSYPKGLIVAFTLKSM 240

Query: 241 SSES-----------------SSAEENGSHGV--AADKTECKTDEGLDSSKNDSEKTEQI 300
           SS S                 SSAEENGSHGV  AAD TECKTDE  + S+NDSEK+++I
Sbjct: 241 SSVSSAEENGSHGVDAADNTVSSAEENGSHGVDAAADNTECKTDEANNDSENDSEKSDKI 300

Query: 301 EVT------NMSKDEEIKESADDKNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEE 360
            +T      NMSKDEEI +SADDKNGEAE K DS NEE PEV  EQ  DDPIDEHEEAEE
Sbjct: 301 VLTIRVEEANMSKDEEIMKSADDKNGEAEEKNDSGNEERPEV--EQCKDDPIDEHEEAEE 360

Query: 361 KPTVAKSKNNMNVVSREDLKAVFRKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVL 420
           KP  AKSKNNMNVVSREDLKAVF+KFGSVKFIDFKIGDESGYIRFEEPEAAQKARA A+L
Sbjct: 361 KPPAAKSKNNMNVVSREDLKAVFQKFGSVKFIDFKIGDESGYIRFEEPEAAQKARATALL 420

Query: 421 ADQGGLAVKNFIATLEPVSGEAEKEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRS 472
           ADQGGLAVKNFIATLEPVSGEAEKEYWSLLRSNQEKHHRDFKG+RGRGGKFNRGGKHGRS
Sbjct: 421 ADQGGLAVKNFIATLEPVSGEAEKEYWSLLRSNQEKHHRDFKGHRGRGGKFNRGGKHGRS 480

BLAST of HG10004577 vs. NCBI nr
Match: XP_022997306.1 (la protein 1 [Cucurbita maxima])

HSP 1 Score: 715.7 bits (1846), Expect = 2.6e-202
Identity = 394/473 (83.30%), Postives = 429/473 (90.70%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MENSSLDQE AKKVLRQVEFYFSDSNLPRD FL+K IS  PDGMVDLSLIC+FSRM+GHL
Sbjct: 1   MENSSLDQETAKKVLRQVEFYFSDSNLPRDGFLKKTISGIPDGMVDLSLICTFSRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           +LK ++K +EI EDT+KAVAETLRSSST+KVSEDGKK+GR TELPKPEELIEQLD +T+A
Sbjct: 61  KLKQDVKPEEIPEDTLKAVAETLRSSSTIKVSEDGKKIGRTTELPKPEELIEQLDDKTIA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYDVKLEDVEAFF+++ KV+SVRLPRHVADKRVFCGTALVEFS EE AEKVLKESL
Sbjct: 121 ASPFEYDVKLEDVEAFFNQIAKVNSVRLPRHVADKRVFCGTALVEFSNEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSI 240
           +YAGAKLE K KRDFD+ERAKETE+FESSR+ S ANR NNN PES+YPKGLIVAFTLKS+
Sbjct: 181 LYAGAKLELKPKRDFDKERAKETEEFESSRANSSANRKNNNPPESNYPKGLIVAFTLKSV 240

Query: 241 SSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQI-EVTNMSKDEEIKESADDK 300
           SS  SSAEENGSH VAADKTECKTDE LDSSKNDSEKT+QI E  NMSKDEEIK SADD 
Sbjct: 241 SS-GSSAEENGSHCVAADKTECKTDERLDSSKNDSEKTKQIWEEANMSKDEEIK-SADDN 300

Query: 301 NGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFR 360
           NGEA++K D  NEE PEVE E++ +D +DEHEE EEKPT AK KNNMNVVSREDLK +F+
Sbjct: 301 NGEADKKNDLGNEEKPEVE-ERTTNDTVDEHEEDEEKPTAAKFKNNMNVVSREDLKVIFQ 360

Query: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEK 420
           KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLA+QGGLAVK+FIATLEPVSGEAEK
Sbjct: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLAEQGGLAVKSFIATLEPVSGEAEK 420

Query: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNR-GGKHGRSRGYDNHRGRPNKAQKV 472
           EYW LLRSNQEKHHRDFKGNRGRGGKFNR GGKH RSRG+ N +GRPNKAQKV
Sbjct: 421 EYWRLLRSNQEKHHRDFKGNRGRGGKFNRGGGKHARSRGHGNDQGRPNKAQKV 470

BLAST of HG10004577 vs. NCBI nr
Match: XP_022962519.1 (la protein 1 [Cucurbita moschata])

HSP 1 Score: 707.2 bits (1824), Expect = 9.4e-200
Identity = 394/477 (82.60%), Postives = 426/477 (89.31%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MENSSLDQE AKKVLRQVEFYFSDSNLPRD FL+K IS   DGMVDLSLIC+FSRM+GHL
Sbjct: 1   MENSSLDQETAKKVLRQVEFYFSDSNLPRDGFLKKTISGIADGMVDLSLICTFSRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           +LK ++K +EI EDT+KAVAETLRSSST+KVSEDGKK+GR TELPKPEELIEQLD +T+A
Sbjct: 61  KLKQDVKPEEIPEDTLKAVAETLRSSSTIKVSEDGKKIGRTTELPKPEELIEQLDDKTIA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYDVKLEDVEAFF+++ KV+SVRLPRHVADKRVFCGTALVEFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDVKLEDVEAFFNQIAKVNSVRLPRHVADKRVFCGTALVEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSI 240
           +YAGAKLE K KRDFD ERAKETE+FESSR+ S ANR NNN PES+YPKGLIVAFTLKS+
Sbjct: 181 LYAGAKLELKPKRDFDAERAKETEEFESSRANSSANRKNNNPPESNYPKGLIVAFTLKSV 240

Query: 241 SSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQI-EVTNMSKDEEIKESADDK 300
           SS  SSAEENGSHGVAADKTECK DE LDSSKNDSEKTE I E  NMSKDEEIK SADD 
Sbjct: 241 SS-GSSAEENGSHGVAADKTECKPDERLDSSKNDSEKTEIIGEEANMSKDEEIK-SADDN 300

Query: 301 NGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFR 360
           NGEA++K D  NEE PEVE E++ +D +DEHEE EEKPT AKSKNNMNVVSREDLK +F+
Sbjct: 301 NGEADKKNDLGNEERPEVE-ERTTNDTVDEHEEDEEKPTAAKSKNNMNVVSREDLKVIFQ 360

Query: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEK 420
           KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLA+QGGL VK+FIATLEPVSGEAEK
Sbjct: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLAEQGGLVVKSFIATLEPVSGEAEK 420

Query: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNR-GGKHGRSRGY----DNHRGRPNKAQKV 472
           EYW LLRSNQEKHHRDFK NRGRGGKFNR GGKH RSRG+     N RGRPNKAQKV
Sbjct: 421 EYWRLLRSNQEKHHRDFKSNRGRGGKFNRGGGKHARSRGHGHCNGNDRGRPNKAQKV 474

BLAST of HG10004577 vs. ExPASy Swiss-Prot
Match: Q93ZV7 (La protein 1 OS=Arabidopsis thaliana OX=3702 GN=LA1 PE=1 SV=1)

HSP 1 Score: 417.2 bits (1071), Expect = 2.5e-115
Identity = 261/473 (55.18%), Postives = 328/473 (69.34%), Query Frame = 0

Query: 6   LDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHE 65
           L +E AK VLRQVEFYFSDSNLP D FL+K ++ES DG+V L+LICSFS+MRG+L+L  +
Sbjct: 6   LTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYLKL-GD 65

Query: 66  MKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFE 125
            K D+I EDTIKAVA+TLR+SS +K+S+DGKKVGR+TEL K E+LIEQL+ RTVAASPF 
Sbjct: 66  SKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAASPFS 125

Query: 126 YDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYAGA 185
           YDVK EDVE+FF +  KV+SVR+PRHVA+ R+F G ALVEF TEE A+ V+K++LV+AG 
Sbjct: 126 YDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLVFAGQ 185

Query: 186 KLEFKRKRDFDEERAKETEKFE-------SSRSISGANRNNNNSPESSYPKGLIVAFTLK 245
           +LE K K++FD ER K+  KF        S+   +G++  NN++ E  YPKGLI++FTLK
Sbjct: 186 ELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIISFTLK 245

Query: 246 SISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADD 305
                  SAEE          TE K+ E               E T+ + +E   + AD 
Sbjct: 246 ------RSAEEG--------TTEQKSSE---------------EPTDKTMEESETKPADT 305

Query: 306 KNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVF 365
            + + E  G+ V  E  E           DE +E EEK  +A  K+N +VV REDLKAVF
Sbjct: 306 PDADKENTGE-VQAEGAE-----------DEDDEKEEKGALATHKDNKDVVLREDLKAVF 365

Query: 366 RKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAE 425
            KFG VKF+DFK+G E+GY+RF+EPEA+QKARAAAVLA++GGLAVKNFIA LEPV GEAE
Sbjct: 366 GKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVLANEGGLAVKNFIAVLEPVIGEAE 425

Query: 426 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           KEYW+LLRS   K   D  G  GRGG+  RGG+ GR RG D+  GR NK+QKV
Sbjct: 426 KEYWTLLRS---KDRFDKGGRGGRGGR--RGGRFGRKRGSDSPGGRWNKSQKV 431

BLAST of HG10004577 vs. ExPASy Swiss-Prot
Match: Q0V7U7 (La protein 2 OS=Arabidopsis thaliana OX=3702 GN=LA2 PE=1 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 1.7e-66
Identity = 177/440 (40.23%), Postives = 251/440 (57.05%), Query Frame = 0

Query: 4   SSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELK 63
           SS ++E AKK+L QVEFYFSDSNLP D FL + +++S DG+V L L+CSFSRMR  L L 
Sbjct: 3   SSFNEETAKKLLTQVEFYFSDSNLPTDGFLNREVTKSKDGLVSLPLVCSFSRMRNLLGLG 62

Query: 64  HEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASP 123
           + +  ++I    ++ VA  LR+S  +KVS +G+++GR T+L KPEE++EQ+  RT+AASP
Sbjct: 63  N-INREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAASP 122

Query: 124 FEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYA 183
           FEY +K+EDV +FF +  KV+SVRLP ++ADKR FCGTALVEFS+E+  + +L++SLVYA
Sbjct: 123 FEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVYA 182

Query: 184 GAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSISSE 243
           GA L    K DFD +R    ++   S S             + + +G IV F LK I+SE
Sbjct: 183 GADLVLIPKSDFDCQRENMIKQLGKSES------------HNEFRRGQIVKFALKWIASE 242

Query: 244 SSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDKNGEA 303
                                           EK    E  +  K+ +IKE  D + G A
Sbjct: 243 --------------------------------EKVTNKEKPSALKN-KIKEKEDKETGIA 302

Query: 304 ERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFRKFGS 363
           +R+ ++ +     + K+ +                V    NN N VS E LK +F++FGS
Sbjct: 303 DREKENGDNSCASLCKDNT-------------DQLVVPPWNNSNSVSSEVLKDLFQRFGS 362

Query: 364 VKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVK-NFIATLEPVSGEAEKEYW 423
           V+ I++  G +SGY+ F + E A KARAA      GGL VK NF   LE ++GE E+E W
Sbjct: 363 VEHIEYSGGLDSGYVWFTDSETAMKARAAVEFV--GGLVVKNNFSVALEAINGEMERELW 381

Query: 424 SLLRSNQ----EKHHRDFKG 439
             L S +    ++ H+  KG
Sbjct: 423 KRLSSAELEGGKEGHKKEKG 381

BLAST of HG10004577 vs. ExPASy Swiss-Prot
Match: P33399 (La protein homolog OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=LHP1 PE=1 SV=2)

HSP 1 Score: 90.9 bits (224), Expect = 4.2e-17
Identity = 90/265 (33.96%), Postives = 126/265 (47.55%), Query Frame = 0

Query: 9   EIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHEMKS 68
           E+  + L+QVEFYFS+ N P D FLR   +E  DG V +S I +F+RM+ +         
Sbjct: 28  EVLDRCLKQVEFYFSEFNFPYDRFLRT-TAEKNDGWVPISTIATFNRMKKY--------- 87

Query: 69  DEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATEL---PKPEELIEQLDVRTVAASPFE 128
                  +  V E LRSS  ++VS DG+ V R   L         IEQ + RT+A   F 
Sbjct: 88  -----RPVDKVIEALRSSEILEVSADGENVKRRVPLDLTAARNARIEQ-NQRTLAVMNFP 147

Query: 129 Y-DVKL-------EDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLK 188
           + DV+        E++EAFF ++ +++ VRL R   +K+ F GT LVEF T    E  LK
Sbjct: 148 HEDVEASQIPELQENLEAFFKKLGEINQVRLRRDHRNKK-FNGTVLVEFKTIPECEAFLK 207

Query: 189 ---------ESLVYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYP 248
                    E L Y G KL    K+ FD +R     K  S RS S       N P+  +P
Sbjct: 208 SYSNDDESNEILSYEGKKLSVLTKKQFDLQREASKSKNFSGRSRSFNGHKKKNLPK--FP 267

Query: 249 KGLIVAFTLKSISSESSSAEENGSH 254
           K        +S    S+ A+++  H
Sbjct: 268 KNKKKNGKEESKEDSSAIADDDEEH 273

BLAST of HG10004577 vs. ExPASy Swiss-Prot
Match: Q7ZWE3 (La-related protein 7 OS=Danio rerio OX=7955 GN=larp7 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.9e-09
Identity = 121/512 (23.63%), Postives = 215/512 (41.99%), Query Frame = 0

Query: 8   QEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHEMK 67
           +++   V +QVEF+F D NL +D F++  I +S DG +D++++ +F+RM+          
Sbjct: 40  KQLLSDVKKQVEFWFGDVNLHKDRFMKSIIEQSRDGYIDIAVLTTFNRMK---------- 99

Query: 68  SDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFEYD 127
              ++ D +K +A  L++S+ V+V+++G ++ R   L    E  + +D RTV        
Sbjct: 100 --NLTAD-VKLIARALKNSTIVEVNDEGTRIRRKEPL---GETPKDVDSRTVYVELLPKT 159

Query: 128 VKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVL-------KESL 187
           V    +E  F +   V  + +PR+    R   G A VEF T+E A+K +       +++ 
Sbjct: 160 VTHIWLERVFSKCGHVVYISIPRY-KSTRHSKGFAFVEFETQEQAQKAVEMLNNPPEDAP 219

Query: 188 VYAGAKLEFKRKR---------DFDEE-RAKETEKFESSRSISGANRNNNNSPESSYPKG 247
              G   +  RK+         D DE+ + K+TE   S+   +G+N  + +    S    
Sbjct: 220 RKPGIFPKTCRKKAVPFDAVTQDNDEDGKKKKTELKNSTSEETGSNNMDQDGMLESTVTS 279

Query: 248 LIVAFTLKSISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDE 307
                TL S  S+ +  +   S    A   E + +      K + EK+E  ++++ +KDE
Sbjct: 280 EPNLATLTSTVSKKAKKKRLRSQSFEASSGEDQFEMSSKMRKVEEEKSELKDLSSENKDE 339

Query: 308 EIK--ESADDKNGEAERKGDSVNEESPEVEKE------QSMDDPIDEHEE---------- 367
           E+   +  DD   +A+RK     +E  +V +E       S  + +D  +E          
Sbjct: 340 ELNSLKKKDDSVLKAKRKRKKKLKERLKVGEEVIPLRVLSKKEWLDLKQEYLTLQKRCMA 399

Query: 368 --------AEEKPT----------------------------------VAKSKNNMNVVS 427
                     +KPT                                  + K   N  + S
Sbjct: 400 HLKQSVFQINQKPTNYHIVKLKEDDTNAFYKDTPKKELTSGPEFLSGVIVKISYNQPLPS 459

Query: 428 REDLKAVFRKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATL 442
           +  +K +  +   V ++D   GD  G++RF+  E AQK   A           K +   L
Sbjct: 460 KRCIKDMLSELSPVAYVDLLDGDTEGHVRFKSSEDAQKVIKARFEFQ------KKYNWNL 519

BLAST of HG10004577 vs. ExPASy Swiss-Prot
Match: Q5XI01 (La-related protein 7 OS=Rattus norvegicus OX=10116 GN=Larp7 PE=1 SV=2)

HSP 1 Score: 59.3 bits (142), Expect = 1.3e-07
Identity = 78/305 (25.57%), Postives = 136/305 (44.59%), Query Frame = 0

Query: 8   QEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHEMK 67
           +++   + +QV+F+F D+NL +D FLR+ I +S DG VD+SL+ SF++M+          
Sbjct: 27  KQVLADIAKQVDFWFGDANLHKDRFLREQIEKSRDGYVDISLLVSFNKMK---------- 86

Query: 68  SDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRAT---ELPKPEELIEQLDVRTVAASPF 127
             +++ D  K +A  L+SSS V++  +G ++ R     E PK EE       RTV     
Sbjct: 87  --KLTTDG-KLIARALKSSSVVELDLEGTRIRRKKPLGERPKDEE------ERTVYVELL 146

Query: 128 EYDVKLEDVEAFFDRVTKVSSVRLPRH--VADKRVFCGTALVEFSTEEGAEKVL------ 187
             +V    +E  F +   V  + +P +    D +   G A VEF T+E A K +      
Sbjct: 147 PKNVTHSWIERVFGKCGNVVYISIPHYKSTGDPK---GFAFVEFETKEQAAKAIEFLNNP 206

Query: 188 -KESLVYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAF 247
            +E+    G   +  + +     R  E +K +  +      +      ES   K L+V  
Sbjct: 207 PEEAPRKPGIFPKTVKNKPIPSLRVAEEKKKKKKK------KGRIKKEESVQAKELVVDS 266

Query: 248 TLKSISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKES 301
           +   + S+++      S G  A+  E          K D  +T  +  +   K E  + S
Sbjct: 267 SSSGV-SKATKRPRTASEGSEAETPEAPKQPAKKKKKRDKVETGGLPESKAGKRE--RSS 300

BLAST of HG10004577 vs. ExPASy TrEMBL
Match: A0A5A7VFC4 (La protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003520 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 5.2e-212
Identity = 410/473 (86.68%), Postives = 437/473 (92.39%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MEN+SLDQE AKKVLRQVEFYFSDSNLPRD FLRK ISESPDGMVDLSLIC+F+RM+GHL
Sbjct: 1   MENTSLDQEKAKKVLRQVEFYFSDSNLPRDGFLRKSISESPDGMVDLSLICTFTRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           ELK ++  ++  EDT+KAVAETLR+SS++KVSEDGKKVGRATELPKPEELIEQLD RTVA
Sbjct: 61  ELKQDVTPEDFPEDTLKAVAETLRTSSSLKVSEDGKKVGRATELPKPEELIEQLDDRTVA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYD+KLEDVEAFFDRVTKV+SVRLPRHVADKRVFCGTAL+EFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDIKLEDVEAFFDRVTKVNSVRLPRHVADKRVFCGTALIEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANR-NNNNSPESSYPKGLIVAFTLKS 240
           VYAGAKLE K KR+FDEERAKE EKFESSRS  GANR NNN+SPESSYPKGLIVAFTLKS
Sbjct: 181 VYAGAKLELKPKREFDEERAKEMEKFESSRSTLGANRSNNNSSPESSYPKGLIVAFTLKS 240

Query: 241 ISSESSSAEENGSHGVAADKTECKTDEGL-DSSKNDSEKTEQIEVTNMSKDEEIKESADD 300
            SS  +SAEEN SHGVAADKTECKTDEGL DSSKND EKTEQIE TN+SKDEEIK+SADD
Sbjct: 241 TSS-GNSAEENESHGVAADKTECKTDEGLEDSSKNDPEKTEQIEETNLSKDEEIKKSADD 300

Query: 301 KNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVF 360
           KNGEAE K D+ NE S EVE EQ MD  +DE EEAEEKPT  KS+NNMNVVSREDLKAVF
Sbjct: 301 KNGEAEEKNDTGNERSLEVE-EQCMDGTVDEQEEAEEKPTALKSRNNMNVVSREDLKAVF 360

Query: 361 RKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAE 420
           +KFGSVKFIDFKIGDESGYIRFEEPEAAQKARA+AVL +QGGLAVKNFIATLEPVSGEAE
Sbjct: 361 QKFGSVKFIDFKIGDESGYIRFEEPEAAQKARASAVLTEQGGLAVKNFIATLEPVSGEAE 420

Query: 421 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRG+DNHRGRPNKAQKV
Sbjct: 421 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGHDNHRGRPNKAQKV 471

BLAST of HG10004577 vs. ExPASy TrEMBL
Match: A0A0A0LP79 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G375250 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 5.2e-212
Identity = 410/472 (86.86%), Postives = 439/472 (93.01%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           M N+SLDQE+AKKVLRQVEFYFSDSNLPRD FLRK ISESPDG+VDLSLIC+FSRM+GHL
Sbjct: 1   MGNTSLDQEMAKKVLRQVEFYFSDSNLPRDTFLRKTISESPDGLVDLSLICTFSRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           ELK ++  +   EDT+KAVAETLR+SS++KVSEDGKKVGRATELPKPEELIEQLD RTVA
Sbjct: 61  ELKQDVTPENFPEDTMKAVAETLRTSSSLKVSEDGKKVGRATELPKPEELIEQLDDRTVA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYD+KLEDVEAFF++VTKV+SVRLPRHVADKRVFCGTAL+EFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDIKLEDVEAFFNQVTKVNSVRLPRHVADKRVFCGTALIEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANR-NNNNSPESSYPKGLIVAFTLKS 240
           VYAGAKLE K KR+FDEERAKE EKFESSRS SGANR NNN+SPE+SYPKGLIVAFTLKS
Sbjct: 181 VYAGAKLELKPKREFDEERAKEMEKFESSRSTSGANRSNNNSSPEASYPKGLIVAFTLKS 240

Query: 241 ISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDK 300
            SS  S+AE N SHGV ADKTECKTDEGLDSSKNDSEKT QIE TN+SKDEEIKESADDK
Sbjct: 241 TSS-GSTAEGNESHGV-ADKTECKTDEGLDSSKNDSEKTVQIEETNLSKDEEIKESADDK 300

Query: 301 NGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFR 360
           NGEA  K DS NE+S EVE EQSMDD +DEHEEAEEKPT  +S+NNMNVVSREDLKAVFR
Sbjct: 301 NGEAVEKNDSGNEKSLEVE-EQSMDDTVDEHEEAEEKPTAFQSRNNMNVVSREDLKAVFR 360

Query: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEK 420
           KFGSVKFIDFKIGDESGYIRFEEPEAAQKARA+AVLA+QGGLAVKNFIATLEPVSGEAEK
Sbjct: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARASAVLAEQGGLAVKNFIATLEPVSGEAEK 420

Query: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           EYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRG+DNHRGRPNKAQKV
Sbjct: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGHDNHRGRPNKAQKV 469

BLAST of HG10004577 vs. ExPASy TrEMBL
Match: A0A1S3BCL6 (la protein 1 OS=Cucumis melo OX=3656 GN=LOC103488199 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 5.2e-212
Identity = 410/473 (86.68%), Postives = 437/473 (92.39%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MEN+SLDQE AKKVLRQVEFYFSDSNLPRD FLRK ISESPDGMVDLSLIC+F+RM+GHL
Sbjct: 1   MENTSLDQEKAKKVLRQVEFYFSDSNLPRDGFLRKSISESPDGMVDLSLICTFTRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           ELK ++  ++  EDT+KAVAETLR+SS++KVSEDGKKVGRATELPKPEELIEQLD RTVA
Sbjct: 61  ELKQDVTPEDFPEDTLKAVAETLRTSSSLKVSEDGKKVGRATELPKPEELIEQLDDRTVA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYD+KLEDVEAFFDRVTKV+SVRLPRHVADKRVFCGTAL+EFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDIKLEDVEAFFDRVTKVNSVRLPRHVADKRVFCGTALIEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANR-NNNNSPESSYPKGLIVAFTLKS 240
           VYAGAKLE K KR+FDEERAKE EKFESSRS  GANR NNN+SPESSYPKGLIVAFTLKS
Sbjct: 181 VYAGAKLELKPKREFDEERAKEMEKFESSRSTLGANRSNNNSSPESSYPKGLIVAFTLKS 240

Query: 241 ISSESSSAEENGSHGVAADKTECKTDEGL-DSSKNDSEKTEQIEVTNMSKDEEIKESADD 300
            SS  +SAEEN SHGVAADKTECKTDEGL DSSKND EKTEQIE TN+SKDEEIK+SADD
Sbjct: 241 TSS-GNSAEENESHGVAADKTECKTDEGLEDSSKNDPEKTEQIEETNLSKDEEIKKSADD 300

Query: 301 KNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVF 360
           KNGEAE K D+ NE S EVE EQ MD  +DE EEAEEKPT  KS+NNMNVVSREDLKAVF
Sbjct: 301 KNGEAEEKNDTGNERSLEVE-EQCMDGTVDEQEEAEEKPTALKSRNNMNVVSREDLKAVF 360

Query: 361 RKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAE 420
           +KFGSVKFIDFKIGDESGYIRFEEPEAAQKARA+AVL +QGGLAVKNFIATLEPVSGEAE
Sbjct: 361 QKFGSVKFIDFKIGDESGYIRFEEPEAAQKARASAVLTEQGGLAVKNFIATLEPVSGEAE 420

Query: 421 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRG+DNHRGRPNKAQKV
Sbjct: 421 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGHDNHRGRPNKAQKV 471

BLAST of HG10004577 vs. ExPASy TrEMBL
Match: A0A6J1K750 (la protein 1 OS=Cucurbita maxima OX=3661 GN=LOC111492255 PE=4 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 1.3e-202
Identity = 394/473 (83.30%), Postives = 429/473 (90.70%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MENSSLDQE AKKVLRQVEFYFSDSNLPRD FL+K IS  PDGMVDLSLIC+FSRM+GHL
Sbjct: 1   MENSSLDQETAKKVLRQVEFYFSDSNLPRDGFLKKTISGIPDGMVDLSLICTFSRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           +LK ++K +EI EDT+KAVAETLRSSST+KVSEDGKK+GR TELPKPEELIEQLD +T+A
Sbjct: 61  KLKQDVKPEEIPEDTLKAVAETLRSSSTIKVSEDGKKIGRTTELPKPEELIEQLDDKTIA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYDVKLEDVEAFF+++ KV+SVRLPRHVADKRVFCGTALVEFS EE AEKVLKESL
Sbjct: 121 ASPFEYDVKLEDVEAFFNQIAKVNSVRLPRHVADKRVFCGTALVEFSNEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSI 240
           +YAGAKLE K KRDFD+ERAKETE+FESSR+ S ANR NNN PES+YPKGLIVAFTLKS+
Sbjct: 181 LYAGAKLELKPKRDFDKERAKETEEFESSRANSSANRKNNNPPESNYPKGLIVAFTLKSV 240

Query: 241 SSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQI-EVTNMSKDEEIKESADDK 300
           SS  SSAEENGSH VAADKTECKTDE LDSSKNDSEKT+QI E  NMSKDEEIK SADD 
Sbjct: 241 SS-GSSAEENGSHCVAADKTECKTDERLDSSKNDSEKTKQIWEEANMSKDEEIK-SADDN 300

Query: 301 NGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFR 360
           NGEA++K D  NEE PEVE E++ +D +DEHEE EEKPT AK KNNMNVVSREDLK +F+
Sbjct: 301 NGEADKKNDLGNEEKPEVE-ERTTNDTVDEHEEDEEKPTAAKFKNNMNVVSREDLKVIFQ 360

Query: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEK 420
           KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLA+QGGLAVK+FIATLEPVSGEAEK
Sbjct: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLAEQGGLAVKSFIATLEPVSGEAEK 420

Query: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNR-GGKHGRSRGYDNHRGRPNKAQKV 472
           EYW LLRSNQEKHHRDFKGNRGRGGKFNR GGKH RSRG+ N +GRPNKAQKV
Sbjct: 421 EYWRLLRSNQEKHHRDFKGNRGRGGKFNRGGGKHARSRGHGNDQGRPNKAQKV 470

BLAST of HG10004577 vs. ExPASy TrEMBL
Match: A0A6J1HHB2 (la protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111462923 PE=4 SV=1)

HSP 1 Score: 707.2 bits (1824), Expect = 4.5e-200
Identity = 394/477 (82.60%), Postives = 426/477 (89.31%), Query Frame = 0

Query: 1   MENSSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHL 60
           MENSSLDQE AKKVLRQVEFYFSDSNLPRD FL+K IS   DGMVDLSLIC+FSRM+GHL
Sbjct: 1   MENSSLDQETAKKVLRQVEFYFSDSNLPRDGFLKKTISGIADGMVDLSLICTFSRMKGHL 60

Query: 61  ELKHEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVA 120
           +LK ++K +EI EDT+KAVAETLRSSST+KVSEDGKK+GR TELPKPEELIEQLD +T+A
Sbjct: 61  KLKQDVKPEEIPEDTLKAVAETLRSSSTIKVSEDGKKIGRTTELPKPEELIEQLDDKTIA 120

Query: 121 ASPFEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESL 180
           ASPFEYDVKLEDVEAFF+++ KV+SVRLPRHVADKRVFCGTALVEFSTEE AEKVLKESL
Sbjct: 121 ASPFEYDVKLEDVEAFFNQIAKVNSVRLPRHVADKRVFCGTALVEFSTEEDAEKVLKESL 180

Query: 181 VYAGAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSI 240
           +YAGAKLE K KRDFD ERAKETE+FESSR+ S ANR NNN PES+YPKGLIVAFTLKS+
Sbjct: 181 LYAGAKLELKPKRDFDAERAKETEEFESSRANSSANRKNNNPPESNYPKGLIVAFTLKSV 240

Query: 241 SSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQI-EVTNMSKDEEIKESADDK 300
           SS  SSAEENGSHGVAADKTECK DE LDSSKNDSEKTE I E  NMSKDEEIK SADD 
Sbjct: 241 SS-GSSAEENGSHGVAADKTECKPDERLDSSKNDSEKTEIIGEEANMSKDEEIK-SADDN 300

Query: 301 NGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFR 360
           NGEA++K D  NEE PEVE E++ +D +DEHEE EEKPT AKSKNNMNVVSREDLK +F+
Sbjct: 301 NGEADKKNDLGNEERPEVE-ERTTNDTVDEHEEDEEKPTAAKSKNNMNVVSREDLKVIFQ 360

Query: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAEK 420
           KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLA+QGGL VK+FIATLEPVSGEAEK
Sbjct: 361 KFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLAEQGGLVVKSFIATLEPVSGEAEK 420

Query: 421 EYWSLLRSNQEKHHRDFKGNRGRGGKFNR-GGKHGRSRGY----DNHRGRPNKAQKV 472
           EYW LLRSNQEKHHRDFK NRGRGGKFNR GGKH RSRG+     N RGRPNKAQKV
Sbjct: 421 EYWRLLRSNQEKHHRDFKSNRGRGGKFNRGGGKHARSRGHGHCNGNDRGRPNKAQKV 474

BLAST of HG10004577 vs. TAIR 10
Match: AT4G32720.1 (La protein 1 )

HSP 1 Score: 417.2 bits (1071), Expect = 1.8e-116
Identity = 261/473 (55.18%), Postives = 328/473 (69.34%), Query Frame = 0

Query: 6   LDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHE 65
           L +E AK VLRQVEFYFSDSNLP D FL+K ++ES DG+V L+LICSFS+MRG+L+L  +
Sbjct: 6   LTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYLKL-GD 65

Query: 66  MKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFE 125
            K D+I EDTIKAVA+TLR+SS +K+S+DGKKVGR+TEL K E+LIEQL+ RTVAASPF 
Sbjct: 66  SKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAASPFS 125

Query: 126 YDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYAGA 185
           YDVK EDVE+FF +  KV+SVR+PRHVA+ R+F G ALVEF TEE A+ V+K++LV+AG 
Sbjct: 126 YDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLVFAGQ 185

Query: 186 KLEFKRKRDFDEERAKETEKFE-------SSRSISGANRNNNNSPESSYPKGLIVAFTLK 245
           +LE K K++FD ER K+  KF        S+   +G++  NN++ E  YPKGLI++FTLK
Sbjct: 186 ELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIISFTLK 245

Query: 246 SISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADD 305
                  SAEE          TE K+ E               E T+ + +E   + AD 
Sbjct: 246 ------RSAEEG--------TTEQKSSE---------------EPTDKTMEESETKPADT 305

Query: 306 KNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVF 365
            + + E  G+ V  E  E           DE +E EEK  +A  K+N +VV REDLKAVF
Sbjct: 306 PDADKENTGE-VQAEGAE-----------DEDDEKEEKGALATHKDNKDVVLREDLKAVF 365

Query: 366 RKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAE 425
            KFG VKF+DFK+G E+GY+RF+EPEA+QKARAAAVLA++GGLAVKNFIA LEPV GEAE
Sbjct: 366 GKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVLANEGGLAVKNFIAVLEPVIGEAE 425

Query: 426 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGRSRGYDNHRGRPNKAQKV 472
           KEYW+LLRS   K   D  G  GRGG+  RGG+ GR RG D+  GR NK+QKV
Sbjct: 426 KEYWTLLRS---KDRFDKGGRGGRGGR--RGGRFGRKRGSDSPGGRWNKSQKV 431

BLAST of HG10004577 vs. TAIR 10
Match: AT4G32720.2 (La protein 1 )

HSP 1 Score: 396.7 bits (1018), Expect = 2.5e-110
Identity = 244/456 (53.51%), Postives = 311/456 (68.20%), Query Frame = 0

Query: 6   LDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELKHE 65
           L +E AK VLRQVEFYFSDSNLP D FL+K ++ES DG+V L+LICSFS+MRG+L+L  +
Sbjct: 6   LTEETAKTVLRQVEFYFSDSNLPIDDFLKKTVTESEDGLVSLALICSFSKMRGYLKL-GD 65

Query: 66  MKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFE 125
            K D+I EDTIKAVA+TLR+SS +K+S+DGKKVGR+TEL K E+LIEQL+ RTVAASPF 
Sbjct: 66  SKGDDIPEDTIKAVADTLRTSSALKISDDGKKVGRSTELLKLEDLIEQLNARTVAASPFS 125

Query: 126 YDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYAGA 185
           YDVK EDVE+FF +  KV+SVR+PRHVA+ R+F G ALVEF TEE A+ V+K++LV+AG 
Sbjct: 126 YDVKREDVESFFSQYGKVNSVRMPRHVAESRIFSGVALVEFPTEEDAQNVMKQNLVFAGQ 185

Query: 186 KLEFKRKRDFDEERAKETEKFE-------SSRSISGANRNNNNSPESSYPKGLIVAFTLK 245
           +LE K K++FD ER K+  KF        S+   +G++  NN++ E  YPKGLI++FTLK
Sbjct: 186 ELELKPKKEFDNEREKDEVKFANYQPQKGSANQKNGSDHKNNSAYEPDYPKGLIISFTLK 245

Query: 246 SISSESSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADD 305
                  SAEE          TE K+ E               E T+ + +E   + AD 
Sbjct: 246 ------RSAEEG--------TTEQKSSE---------------EPTDKTMEESETKPADT 305

Query: 306 KNGEAERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVF 365
            + + E  G+ V  E  E           DE +E EEK  +A  K+N +VV REDLKAVF
Sbjct: 306 PDADKENTGE-VQAEGAE-----------DEDDEKEEKGALATHKDNKDVVLREDLKAVF 365

Query: 366 RKFGSVKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVKNFIATLEPVSGEAE 425
            KFG VKF+DFK+G E+GY+RF+EPEA+QKARAAAVLA++GGLAVKNFIA LEPV GEAE
Sbjct: 366 GKFGDVKFVDFKMGSETGYLRFDEPEASQKARAAAVLANEGGLAVKNFIAVLEPVIGEAE 404

Query: 426 KEYWSLLRSNQEKHHRDFKGNRGRGGKFNRGGKHGR 455
           KEYW+LLRS                 +F++GG+ GR
Sbjct: 426 KEYWTLLRSKD---------------RFDKGGRGGR 404

BLAST of HG10004577 vs. TAIR 10
Match: AT1G79880.1 (RNA recognition motif (RRM)-containing protein )

HSP 1 Score: 255.0 bits (650), Expect = 1.2e-67
Identity = 177/440 (40.23%), Postives = 251/440 (57.05%), Query Frame = 0

Query: 4   SSLDQEIAKKVLRQVEFYFSDSNLPRDCFLRKHISESPDGMVDLSLICSFSRMRGHLELK 63
           SS ++E AKK+L QVEFYFSDSNLP D FL + +++S DG+V L L+CSFSRMR  L L 
Sbjct: 3   SSFNEETAKKLLTQVEFYFSDSNLPTDGFLNREVTKSKDGLVSLPLVCSFSRMRNLLGLG 62

Query: 64  HEMKSDEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASP 123
           + +  ++I    ++ VA  LR+S  +KVS +G+++GR T+L KPEE++EQ+  RT+AASP
Sbjct: 63  N-INREDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAASP 122

Query: 124 FEYDVKLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYA 183
           FEY +K+EDV +FF +  KV+SVRLP ++ADKR FCGTALVEFS+E+  + +L++SLVYA
Sbjct: 123 FEYSIKMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVYA 182

Query: 184 GAKLEFKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSISSE 243
           GA L    K DFD +R    ++   S S             + + +G IV F LK I+SE
Sbjct: 183 GADLVLIPKSDFDCQRENMIKQLGKSES------------HNEFRRGQIVKFALKWIASE 242

Query: 244 SSSAEENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDKNGEA 303
                                           EK    E  +  K+ +IKE  D + G A
Sbjct: 243 --------------------------------EKVTNKEKPSALKN-KIKEKEDKETGIA 302

Query: 304 ERKGDSVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFRKFGS 363
           +R+ ++ +     + K+ +                V    NN N VS E LK +F++FGS
Sbjct: 303 DREKENGDNSCASLCKDNT-------------DQLVVPPWNNSNSVSSEVLKDLFQRFGS 362

Query: 364 VKFIDFKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVK-NFIATLEPVSGEAEKEYW 423
           V+ I++  G +SGY+ F + E A KARAA      GGL VK NF   LE ++GE E+E W
Sbjct: 363 VEHIEYSGGLDSGYVWFTDSETAMKARAAVEFV--GGLVVKNNFSVALEAINGEMERELW 381

Query: 424 SLLRSNQ----EKHHRDFKG 439
             L S +    ++ H+  KG
Sbjct: 423 KRLSSAELEGGKEGHKKEKG 381

BLAST of HG10004577 vs. TAIR 10
Match: AT1G79880.3 (RNA recognition motif (RRM)-containing protein )

HSP 1 Score: 188.0 bits (476), Expect = 1.8e-47
Identity = 140/372 (37.63%), Postives = 201/372 (54.03%), Query Frame = 0

Query: 69  DEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFEYDV 128
           ++I    ++ VA  LR+S  +KVS +G+++GR T+L KPEE++EQ+  RT+AASPFEY +
Sbjct: 13  EDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAASPFEYSI 72

Query: 129 KLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYAGAKLE 188
           K+EDV +FF +  KV+SVRLP ++ADKR FCGTALVEFS+E+  + +L++SLVYAGA L 
Sbjct: 73  KMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVYAGADLV 132

Query: 189 FKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSISSESSSAE 248
              K DFD +R    ++   S S             + + +G IV F LK I+SE     
Sbjct: 133 LIPKSDFDCQRENMIKQLGKSES------------HNEFRRGQIVKFALKWIASE----- 192

Query: 249 ENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDKNGEAERKGD 308
                                      EK    E  +  K+ +IKE  D + G A+R+ +
Sbjct: 193 ---------------------------EKVTNKEKPSALKN-KIKEKEDKETGIADREKE 252

Query: 309 SVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFRKFGSVKFID 368
           + +     + K+ +                V    NN N VS E LK +F++FGSV+ I+
Sbjct: 253 NGDNSCASLCKDNT-------------DQLVVPPWNNSNSVSSEVLKDLFQRFGSVEHIE 312

Query: 369 FKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVK-NFIATLEPVSGEAEKEYWSLLRS 428
           +  G +SGY+ F + E A KARAA      GGL VK NF   LE ++GE E+E W  L S
Sbjct: 313 YSGGLDSGYVWFTDSETAMKARAAVEFV--GGLVVKNNFSVALEAINGEMERELWKRLSS 324

Query: 429 NQEKHHRDFKGN 440
            + +   D K N
Sbjct: 373 AELEGGYDIKPN 324

BLAST of HG10004577 vs. TAIR 10
Match: AT1G79880.2 (RNA recognition motif (RRM)-containing protein )

HSP 1 Score: 186.4 bits (472), Expect = 5.2e-47
Identity = 140/375 (37.33%), Postives = 203/375 (54.13%), Query Frame = 0

Query: 69  DEISEDTIKAVAETLRSSSTVKVSEDGKKVGRATELPKPEELIEQLDVRTVAASPFEYDV 128
           ++I    ++ VA  LR+S  +KVS +G+++GR T+L KPEE++EQ+  RT+AASPFEY +
Sbjct: 13  EDIPPRIVEEVANLLRTSDFLKVSNNGQRIGRGTKLSKPEEVLEQVHRRTLAASPFEYSI 72

Query: 129 KLEDVEAFFDRVTKVSSVRLPRHVADKRVFCGTALVEFSTEEGAEKVLKESLVYAGAKLE 188
           K+EDV +FF +  KV+SVRLP ++ADKR FCGTALVEFS+E+  + +L++SLVYAGA L 
Sbjct: 73  KMEDVSSFFSQYAKVNSVRLPPNIADKRRFCGTALVEFSSEQDTQDILRQSLVYAGADLV 132

Query: 189 FKRKRDFDEERAKETEKFESSRSISGANRNNNNSPESSYPKGLIVAFTLKSISSESSSAE 248
              K DFD +R    ++   S S             + + +G IV F LK I+SE     
Sbjct: 133 LIPKSDFDCQRENMIKQLGKSES------------HNEFRRGQIVKFALKWIASE----- 192

Query: 249 ENGSHGVAADKTECKTDEGLDSSKNDSEKTEQIEVTNMSKDEEIKESADDKNGEAERKGD 308
                                      EK    E  +  K+ +IKE  D + G A+R+ +
Sbjct: 193 ---------------------------EKVTNKEKPSALKN-KIKEKEDKETGIADREKE 252

Query: 309 SVNEESPEVEKEQSMDDPIDEHEEAEEKPTVAKSKNNMNVVSREDLKAVFRKFGSVKFID 368
           + +     + K+ +                V    NN N VS E LK +F++FGSV+ I+
Sbjct: 253 NGDNSCASLCKDNT-------------DQLVVPPWNNSNSVSSEVLKDLFQRFGSVEHIE 312

Query: 369 FKIGDESGYIRFEEPEAAQKARAAAVLADQGGLAVK-NFIATLEPVSGEAEKEYWSLLRS 428
           +  G +SGY+ F + E A KARAA      GGL VK NF   LE ++GE E+E W  L S
Sbjct: 313 YSGGLDSGYVWFTDSETAMKARAAVEFV--GGLVVKNNFSVALEAINGEMERELWKRLSS 327

Query: 429 NQ----EKHHRDFKG 439
            +    ++ H+  KG
Sbjct: 373 AELEGGKEGHKKEKG 327

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008445036.11.1e-21186.68PREDICTED: la protein 1 [Cucumis melo] >KAA0065026.1 la protein 1 [Cucumis melo ... [more]
XP_004150280.11.1e-21186.86la protein 1 [Cucumis sativus] >KGN62834.1 hypothetical protein Csa_022558 [Cucu... [more]
XP_038886452.12.5e-20883.06la protein 1 [Benincasa hispida][more]
XP_022997306.12.6e-20283.30la protein 1 [Cucurbita maxima][more]
XP_022962519.19.4e-20082.60la protein 1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q93ZV72.5e-11555.18La protein 1 OS=Arabidopsis thaliana OX=3702 GN=LA1 PE=1 SV=1[more]
Q0V7U71.7e-6640.23La protein 2 OS=Arabidopsis thaliana OX=3702 GN=LA2 PE=1 SV=1[more]
P333994.2e-1733.96La protein homolog OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=5... [more]
Q7ZWE31.9e-0923.63La-related protein 7 OS=Danio rerio OX=7955 GN=larp7 PE=2 SV=1[more]
Q5XI011.3e-0725.57La-related protein 7 OS=Rattus norvegicus OX=10116 GN=Larp7 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A5A7VFC45.2e-21286.68La protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003520 P... [more]
A0A0A0LP795.2e-21286.86Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G375250 PE=4 SV=1[more]
A0A1S3BCL65.2e-21286.68la protein 1 OS=Cucumis melo OX=3656 GN=LOC103488199 PE=4 SV=1[more]
A0A6J1K7501.3e-20283.30la protein 1 OS=Cucurbita maxima OX=3661 GN=LOC111492255 PE=4 SV=1[more]
A0A6J1HHB24.5e-20082.60la protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111462923 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G32720.11.8e-11655.18La protein 1 [more]
AT4G32720.22.5e-11053.51La protein 1 [more]
AT1G79880.11.2e-6740.23RNA recognition motif (RRM)-containing protein [more]
AT1G79880.31.8e-4737.63RNA recognition motif (RRM)-containing protein [more]
AT1G79880.25.2e-4737.33RNA recognition motif (RRM)-containing protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002344Lupus La proteinPRINTSPR00302LUPUSLAcoord: 158..176
score: 35.89
coord: 17..34
score: 67.68
coord: 43..58
score: 32.95
IPR000504RNA recognition motif domainSMARTSM00360rrm1_1coord: 118..190
e-value: 0.029
score: 20.3
coord: 344..404
e-value: 1.0
score: 6.4
IPR000504RNA recognition motif domainPFAMPF00076RRM_1coord: 125..177
e-value: 7.4E-5
score: 22.5
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 117..202
score: 9.831471
IPR006630La-type HTH domainSMARTSM00715lacoord: 8..101
e-value: 2.1E-30
score: 117.0
IPR006630La-type HTH domainPFAMPF05383Lacoord: 14..84
e-value: 2.2E-22
score: 78.8
IPR006630La-type HTH domainPROSITEPS50961HTH_LAcoord: 4..108
score: 24.418575
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 324..440
e-value: 1.3E-24
score: 88.2
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 111..207
e-value: 3.5E-12
score: 48.2
IPR036388Winged helix-like DNA-binding domain superfamilyGENE3D1.10.10.10coord: 2..105
e-value: 1.5E-27
score: 97.4
IPR014886La protein, xRRM domainPFAMPF08777RRM_3coord: 349..438
e-value: 2.2E-21
score: 75.8
IPR014886La protein, xRRM domainPROSITEPS51939XRRMcoord: 333..454
score: 23.157887
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 452..471
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 428..442
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 209..227
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 254..339
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 428..471
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 198..227
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 244..346
NoneNo IPR availablePANTHERPTHR22792LUPUS LA PROTEIN-RELATEDcoord: 298..471
coord: 2..254
NoneNo IPR availablePANTHERPTHR22792:SF138BNAANNG34180D PROTEINcoord: 298..471
coord: 2..254
NoneNo IPR availableCDDcd08030LA_like_plantcoord: 11..100
e-value: 2.33962E-39
score: 135.209
IPR036390Winged helix DNA-binding domain superfamilySUPERFAMILY46785"Winged helix" DNA-binding domaincoord: 8..104
IPR035979RNA-binding domain superfamilySUPERFAMILY54928RNA-binding domain, RBDcoord: 117..391

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10004577.1HG10004577.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008033 tRNA processing
biological_process GO:0006396 RNA processing
cellular_component GO:0005634 nucleus
cellular_component GO:1990904 ribonucleoprotein complex
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0003729 mRNA binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003723 RNA binding