Lag0019244 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0019244
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Rpr2/Rpp21 subunit-like protein
Location: chr5: 40172271 .. 40176088 (-)
RNA-Seq Expression: Lag0019244
Synteny: Lag0019244
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGAAGAAGAAGGGAAATAAGAAGAAAGGAGCAACTAACCCCACATCCGGTCCTCAAGATTCGATCACCCTCAGGCAGGAAATTACTGGGAAAATCCAACCCAAAGTCTCTAGCAATGTCAAATCTTATTTGAACCACTTGGAAAACTTGGCCACTTGGGCCAGTGGGCAAGCCTCTATACCTTCATTGGCTGCTTTCTATGGGCAGCGCCACGCCGCTGTAGCGGAGTCTTCAGCGGTCCCTCCCAACCCTTCTCTAATTATCTGCCAGAGGTCCGTATGTCTTCCGCCCTTTACATTCGTCTGGTTCTTCTTGTCTAGTTTTGTTGTTTTTGTAGTGTTCATAGATGGGCAAGTTCGTGATGGTTGATTATAGTTGAGGATGGGAAGTTTGTGTTTAACTTCTCTGATTTGTTTGAAGTGACAGTAATCTGATGTAAACGGAATTTGAATGTAGAGAATGGGCGCTCTTTATTGGGCGAAATATTGTTTTGATTCAAACTATTGACGTTGGAGGGATAGAGTAATATGCTTGGAATTATGGCAGTATTGTGAATGGTTTTAGGGGTACTGTTGTACAATGTACAAGCTGTTTGTGGATGCTTTGATCCTTCTGGTTTGTTACCAGAATTGTGTGATTGAGATGTATATTAAAAAAGCAACCTTGGTAATATGCAGGAGATATCATCTAACCTAGCAACAATTTGACAGAAGGAATCATGTGAGGGTGTGAACAGAAGGATGAATGTACTGAGTTTCATTGTTTAACTATACATTTTCTCTACTTGAGACAGTATTAGATTTTAGATTTTCATGTTTAGTTCAGGAGTTTGTATTGTAGCAGGGTTGGGGTGTAAGCTTGTGTGTTGTACAGTACTTATCTTATTTGGGCATACCCTTGGGTGATAATCAGATAACCAAAGAGGCTAAAGTTTTGAAACCTGTGAAGGGCAATTTTACACAAACTTTATAAATGGAGGAAATTATCCAGGGGTGAAAGCTTGACTTTGTGTAGGCAGTCCTTGCAAAATTACCTACCCACCGTGCATCCTTGTGATACTGAGGAAACTGTAGCTTTAATAAAGAGGAAAATGAGGAACTTTCTTTGGAATGGTTCTTTAGTGGGAAAGCCTAGCCATCTAGTGAATTAAAAGGCAGTATCCTTGCCCTTATGAAATGGAAGGTTGGGAGTTGGGAACTTGAGCCGAGGGAATGAAGCCCTTATGGCAAAGTGGTTTTGGCATTGCCTGGGGAGCCTGGCTTGATCTGGCACAAAATAGTGGCATAGTGCATATTATCTTTGAGTAAAAGGAGAGGGATTCAAAAAAGGAAAATAGAGGTAGTGGTTGGTTACTAAGTTGAAGGGTATTAGATTTCAATCACTCTAAAGATTTCAATCACTCTATGGTCAATGTCGGAGATGGAAAGAACTCAAGGACACAGATGTACAGCCTATAGAACATATCTTGTTCAAGTAAGGTTGTATGTCAGAGATAGAAGAAATAATAAAATTTTGGAGGATTGTGGATGGATAGTCAAACCTTAAGCTTAACTATGAATTTTCCTAATTTAATGCAATGGGGGAATCCAAGTAAGGTTGTATGTCAGAGATAGAAGAAATAATAAATTTTTGGAGGATTGTGGATGGATAGTCAAACCTTAAGCTTAACTATGAATTTTCCTAATTTAATGCAATGGGGGAATCCAAGCTGGAGTATAATGACAAGGAGGAATCTTTCAGATGAAGAATTTAGTTCAATGCTATTATTTCGAGTTGCATTCATCATTATTCGGTATTCTCTTGTCTGTCTGAGTTTTCTTGCCACATTCACTGGAAAACAGACTAACTGGTGCAGGGGTGGATGAAAGAAGCTGGAATTTGTGTAGCTCTAGTGATCTCAATGAAATCTTTGGTTTTTTTGCCTGACTGATTTCAGGAGAGCTTTAGAAAGAGAGTTTTCAACTTTAGGGTGGAAAAAAACGTGCCTAAAAAGATCA
AACGTTTGAGTGAATAGCCTGCCTTACTGGTCTTAACACAGTAGAGAGAGATGACATAAAGATGCCCTACGGTGGTCTTGGATCCTTCTGGGTGCTCATTTTCCTGGGAAGAATATGAGAGAACCTGCCACCCTTTCTTCCATTGCCCATTTGGCGAGAGATGTTGGTACCCATTATCATTACTATAAATTTTCCATCACTTTTGGGTTTTTTATTGATCTTTGAAGGCGTGCAGCATATCCCTTCTACCATGTGGTTACTCTTTCAATGGCAAAGCCAGGAAGCTATGGCTAATGCAATCAAACATGTTCTTTGGAGCCTGTGGGTTGGGGAGAAAAAATAGTGTTTAACTGCAAGGCTAAATACTAAATACTGGGTTGAAGCATATGAGTCTGTTCTTAAAGATTTCTTTAGATAATAGTAGGTTTAACAACCATATTCCCCTTGTAATTTTGGTTTATCAAATAAGAGGGTGAGTTCTCGAATTGAGTGGGGGTTTGGAATTACGACCTCTAAAGAGAGGTAATAGATATCTTAACCACCTTATATTCCCCTTTAGTTAATACTCATTGGGGTGGGTTTTTGTAACAACTTCAACTTAGAATATAGCTATGGGTTATGACTGAATTCTCTTCTTTGTACTATGAGTTCCATGAAACTCATAATCACAAAGAGACAGTGTTTGCATGGATCTCATAGGACAAACCTGAGCATTTATAATATAATATTAGCTAAAGATCAGTTGCTCTTTTTATGCTTATTGATAACATTCTTATTGTTAAAAAATTTTCAAATACTTGGAAGAAACCTACTGATGTTTTAATTACATACATGAACGGAGTGGGCACTTGATTCTAACCATATAGCTGTTCCATGTCAAACATTGATCAGGCCATCAGATTTTATGTTTATCTGCTTCGCTTTTTTTCATCCTTCTTGGATCAAAATGTGCTTTCTTGTCACATGTGATAATGAACATTACATTAGTTTAGTTTCTTGTTTTAAGGACTTTTATCATGTTCCATTTGCGTTTTGTCTTCGCATCTTGCTTGAATCTTCTTTGGCCATAAAACTAGCTAATCTTACTTGCTCCATCATATTGCTTTCAGGTGTGAAACAATTCTCCAACCTGGCTCTAACTGCTCTATACGAATAGAGAAGAATAATGCCAAGAGACGTCGGAGAAGCAACAAATCTAGTAATTTGACACAGAACAATGTGGCGTATTACTGCCACTACTGCTCTTTTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCTTTATGAAACAGGGTTCGAAAGCAAGATGGCAACTGAGATTCTTACCGTTGATGCTCCTACAATTCCTTCTACAACTGGAGACATTCTTACTATTGATACTCCTGCAACTCCGCCCACCCTGATCGCAACGACTCTGTCGAAAAGACAGAAGAGGAAAATGAGGAAATTAGCTATGAAACTAACTGGACCTGACACTAGCTGTGCTCCAACAGATGCGGAGGAGAAAACTGGGGATGCTCCTACCGTCGATGCTCCCGCAACTCCTCCCACTACGCTCGGGACGACTCTGTTGAATTTGAAGAAGAGAAAGAGGAAGAAACTGTCATCAAAGAATCAAACTGAGCCCGAAATTAGCTCTGCTCCAACAGCTGATGGGGATAAAACCGAAGGCACATCTAAAAGAAAGCGAAAGCGAAAGAAAAAGTCATGGACAAGTTTGAAGGAAATCGCTCAGATAAATGAACAGAGTGGTAAACAAAACGTGGCTGAATTGGCAATTCCATTCTCCTTACAAGGCACTTTCTGA

mRNA sequence

ATGGCGAAGAAGAAGGGAAATAAGAAGAAAGGAGCAACTAACCCCACATCCGGTCCTCAAGATTCGATCACCCTCAGGCAGGAAATTACTGGGAAAATCCAACCCAAAGTCTCTAGCAATGTCAAATCTTATTTGAACCACTTGGAAAACTTGGCCACTTGGGCCAGTGGGCAAGCCTCTATACCTTCATTGGCTGCTTTCTATGGGCAGCGCCACGCCGCTGTAGCGGAGTCTTCAGCGGTCCCTCCCAACCCTTCTCTAATTATCTGCCAGAGGTGTGAAACAATTCTCCAACCTGGCTCTAACTGCTCTATACGAATAGAGAAGAATAATGCCAAGAGACGTCGGAGAAGCAACAAATCTAGTAATTTGACACAGAACAATGTGGCGTATTACTGCCACTACTGCTCTTTTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCTTTATGAAACAGGGTTCGAAAGCAAGATGGCAACTGAGATTCTTACCGTTGATGCTCCTACAATTCCTTCTACAACTGGAGACATTCTTACTATTGATACTCCTGCAACTCCGCCCACCCTGATCGCAACGACTCTGTCGAAAAGACAGAAGAGGAAAATGAGGAAATTAGCTATGAAACTAACTGGACCTGACACTAGCTGTGCTCCAACAGATGCGGAGGAGAAAACTGGGGATGCTCCTACCGTCGATGCTCCCGCAACTCCTCCCACTACGCTCGGGACGACTCTGTTGAATTTGAAGAAGAGAAAGAGGAAGAAACTGTCATCAAAGAATCAAACTGAGCCCGAAATTAGCTCTGCTCCAACAGCTGATGGGGATAAAACCGAAGGCACATCTAAAAGAAAGCGAAAGCGAAAGAAAAAGTCATGGACAAGTTTGAAGGAAATCGCTCAGATAAATGAACAGAGTGGTAAACAAAACGTGGCTGAATTGGCAATTCCATTCTCCTTACAAGGCACTTTCTGA

Coding sequence (CDS)

ATGGCGAAGAAGAAGGGAAATAAGAAGAAAGGAGCAACTAACCCCACATCCGGTCCTCAAGATTCGATCACCCTCAGGCAGGAAATTACTGGGAAAATCCAACCCAAAGTCTCTAGCAATGTCAAATCTTATTTGAACCACTTGGAAAACTTGGCCACTTGGGCCAGTGGGCAAGCCTCTATACCTTCATTGGCTGCTTTCTATGGGCAGCGCCACGCCGCTGTAGCGGAGTCTTCAGCGGTCCCTCCCAACCCTTCTCTAATTATCTGCCAGAGGTGTGAAACAATTCTCCAACCTGGCTCTAACTGCTCTATACGAATAGAGAAGAATAATGCCAAGAGACGTCGGAGAAGCAACAAATCTAGTAATTTGACACAGAACAATGTGGCGTATTACTGCCACTACTGCTCTTTTAGGAACATAAAGAGAGGGACTCCCAAAGGCCATATGAAAGTGCTTTATGAAACAGGGTTCGAAAGCAAGATGGCAACTGAGATTCTTACCGTTGATGCTCCTACAATTCCTTCTACAACTGGAGACATTCTTACTATTGATACTCCTGCAACTCCGCCCACCCTGATCGCAACGACTCTGTCGAAAAGACAGAAGAGGAAAATGAGGAAATTAGCTATGAAACTAACTGGACCTGACACTAGCTGTGCTCCAACAGATGCGGAGGAGAAAACTGGGGATGCTCCTACCGTCGATGCTCCCGCAACTCCTCCCACTACGCTCGGGACGACTCTGTTGAATTTGAAGAAGAGAAAGAGGAAGAAACTGTCATCAAAGAATCAAACTGAGCCCGAAATTAGCTCTGCTCCAACAGCTGATGGGGATAAAACCGAAGGCACATCTAAAAGAAAGCGAAAGCGAAAGAAAAAGTCATGGACAAGTTTGAAGGAAATCGCTCAGATAAATGAACAGAGTGGTAAACAAAACGTGGCTGAATTGGCAATTCCATTCTCCTTACAAGGCACTTTCTGA

Protein sequence

MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQASIPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNKSSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKMATEILTVDAPTIPSTTGDILTIDTPATPPTLIATTLSKRQKRKMRKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKKLSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQNVAELAIPFSLQGTF
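
The protein sequence above is expected to be the in-frame translation of the CDS. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the table constants are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last six codons of the CDS listed above.
print(translate("ATGGCGAAGAAGAAGGGAAAT"))  # MAKKKGN
print(translate("TTACAAGGCACTTTCTGA"))     # LQGTF
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would confirm the two records agree.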
Homology
BLAST of Lag0019244 vs. NCBI nr
Match: XP_022136585.1 (uncharacterized protein LOC111008256 isoform X1 [Momordica charantia])

HSP 1 Score: 429.1 bits (1102), Expect = 3.4e-116
Identity = 257/391 (65.73%), Positives = 277/391 (70.84%), Query Frame = 0

Query: 1   MAKK-KGNKKKGATNPTSG-PQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQ 60
           MAKK +G KK GA+N T G PQDSITLRQE TGKIQPK  +NVK YL+HLENLATWASGQ
Sbjct: 1   MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60

Query: 61  ASIPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRS 120
           ASIPSLAAF+G+R AA A+SS V P+ SL +CQRCETILQPGSNCSIRIEKN AKRRR+ 
Sbjct: 61  ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120

Query: 121 NKSSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLY---------ETGFESK-------- 180
           NK SNLTQNN+ YYCHYCS RNIKRGTPKGHMKV Y             ESK        
Sbjct: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180

Query: 181 ------------MATEILTVDAPTIPS----------------TTG-------------- 240
                       M TEILT+DAP IPS                TTG              
Sbjct: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR 240

Query: 241 --DILTIDTPATPPTLIATTLSKRQKRKMRKLAMK-LTGPDTSCAPTDAEEKTGDAPTVD 300
             DIL I+ PATPPT+  TTLSK QKRK RKLA K  TGP+ SCAPTD+E+KTGD PTVD
Sbjct: 241 MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD 300

Query: 301 APATPPTTLGTTLLNLKKRKRKKLSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSW 328
           APATPP  +G TLL  KKRKRKK SSKNQTEPE S APTA+GDKTEGTSKRKRKR  KSW
Sbjct: 301 APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKR--KSW 360

BLAST of Lag0019244 vs. NCBI nr
Match: XP_022984531.1 (uncharacterized protein LOC111482797 [Cucurbita maxima])

HSP 1 Score: 414.1 bits (1063), Expect = 1.1e-111
Identity = 241/362 (66.57%), Positives = 267/362 (73.76%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MAK+KGN KKGA+NPTSGPQDSIT+RQEITGK +PKVS+NVK+YLNHLENLATWASG+AS
Sbjct: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           IPSLAAF+GQR A  AES AV P+ SL  CQRCETILQPGSNCSIRIEKNNAKRRRR  K
Sbjct: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKM------------------ 180
            SN  QNNVAYYCH+CS RNIKRGTPKGHMKVLY+  FE ++                  
Sbjct: 121 CSNSRQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGKECETSAVER 180

Query: 181 ATEILTVDAPTIPST-----TGD---------------ILTIDTPATPPTLIATTLSKRQ 240
            TEILT+DAP IP       TGD               IL I++PATP TL  TTL K Q
Sbjct: 181 PTEILTIDAPKIPDASAIPPTGDITALDNPAIQLQTKGILNINSPATPSTLSVTTLLKSQ 240

Query: 241 KRKMRKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKKLSS 300
           KR+M  L+ K  G D     TD E+KTG  PTVDAPATP T+ G TLL+ KKRKR K SS
Sbjct: 241 KREMTTLSEKHIGHDIR---TDEEKKTGAVPTVDAPATPSTSTGVTLLDSKKRKRNKPSS 300

Query: 301 KNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQ-NVAELAIPF 324
           KNQTEP   SAPTADGD++EGTSKR RKR  KSWTSLKE+A+ NEQSGKQ N+AELAIPF
Sbjct: 301 KNQTEPRSCSAPTADGDRSEGTSKRNRKR--KSWTSLKEVARTNEQSGKQKNMAELAIPF 357

BLAST of Lag0019244 vs. NCBI nr
Match: XP_023553160.1 (uncharacterized protein LOC111810649 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 410.2 bits (1053), Expect = 1.6e-110
Identity = 238/363 (65.56%), Positives = 267/363 (73.55%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MAK+KGN KKGA+NPTSGPQDSIT+RQEITGK +PKVS+NVK+YLNHLENLATWASG+AS
Sbjct: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           IPSLAAF+GQR A  AES AV P+ SL  CQRCETILQPGSNCSIRIEKNN KRRRR  K
Sbjct: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNTKRRRRQKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKM------------------ 180
            SN TQNNVAYYCH+CS RNIKRGTPKGHMKVLY+  FE ++                  
Sbjct: 121 CSNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGQECETSAVEK 180

Query: 181 ATEILTVDAPTIPST------TGD---------------ILTIDTPATPPTLIATTLSKR 240
            TEILT+DAP IP        TGD               IL I++PATP TL  TTLSK 
Sbjct: 181 PTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLRTKAILNINSPATPSTLSVTTLSKS 240

Query: 241 QKRKMRKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKKLS 300
           QK++M  L+ K  G +     TD E+KTG  PTVD PATP T+ G TLL+ KKRKR K S
Sbjct: 241 QKQEMTTLSEKHIGHEIR---TDKEKKTGAVPTVDTPATPSTSTGVTLLDSKKRKRNKPS 300

Query: 301 SKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQ-NVAELAIP 324
           SKNQT+P   SAPTADGD++EGTSKR RKR  KSWTSLKE+A+ NEQSGKQ N+AELAIP
Sbjct: 301 SKNQTDPGSCSAPTADGDRSEGTSKRNRKR--KSWTSLKEVARTNEQSGKQKNMAELAIP 358

BLAST of Lag0019244 vs. NCBI nr
Match: KAG6577189.1 (hypothetical protein SDJN03_24763, partial [Cucurbita argyrosperma subsp. sororia] >KAG7015188.1 hypothetical protein SDJN02_22821, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 404.4 bits (1038), Expect = 9.0e-109
Identity = 239/365 (65.48%), Positives = 266/365 (72.88%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MAK+KGN KKGA+NPTSGPQDSIT+RQEITGK +PKVS+NVK+YLNHLENLATWASG+AS
Sbjct: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           IPSLAAF+GQR A  AES AV P+ SL  CQRCETILQPGSNCSIRIEKNNAKRRRR  K
Sbjct: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKM------------------ 180
             N TQNNVAYYCH+CS RNIKRGTPKGHMKVLY+  FE ++                  
Sbjct: 121 CCNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGKECETSAVER 180

Query: 181 ATEILTVDAPTIPST------TGD----------------ILTIDTPATPPTLIATTLSK 240
            TEILT+DAP IP        TGD                IL I++PATP  +  TTLSK
Sbjct: 181 PTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLPKTEGILNINSPATPSAVSITTLSK 240

Query: 241 RQKRKM-RKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKK 300
            QK KM   L+ K  G +T    TD E+KTG  PTVD PATP T+ G TLL+ KKRKR K
Sbjct: 241 PQKWKMTTTLSEKHIGHETR---TDREKKTGAVPTVDTPATPSTSTGVTLLDSKKRKRNK 300

Query: 301 LSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQ-NVAELA 324
            SSKNQTEP   SAPTADGD++EGTSKR RKR  KSWTSLKE+A+ NEQSGKQ N+AELA
Sbjct: 301 PSSKNQTEPGSCSAPTADGDRSEGTSKRNRKR--KSWTSLKEVARTNEQSGKQKNMAELA 360

BLAST of Lag0019244 vs. NCBI nr
Match: XP_022931505.1 (uncharacterized protein LOC111437660 [Cucurbita moschata])

HSP 1 Score: 400.6 bits (1028), Expect = 1.3e-107
Identity = 238/365 (65.21%), Positives = 265/365 (72.60%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MAK+KGN KKGA+NPTSGPQDSIT+RQEITGK +PKVS+NVK+YLNHLENLATWASG+AS
Sbjct: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           IPSLAAF+GQR A  AES AV P+ SL  CQRCETILQPGSNCSIRIEKNNAKRRRR  K
Sbjct: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKM------------------ 180
             N TQNNVAYYCH+CS RNIKRGTPKGHMKVLY+  FE ++                  
Sbjct: 121 CCNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPLDVKDGKECETSAVER 180

Query: 181 ATEILTVDAPTIPST------TGD----------------ILTIDTPATPPTLIATTLSK 240
            TEILT+DAP IP        TGD                IL I++PA P T+  TTLSK
Sbjct: 181 PTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLPETEGILNINSPAAPSTVSITTLSK 240

Query: 241 RQKRKM-RKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKK 300
            QK KM   L+ K  G +T    TD E+KTG  PTVD PATP T+ G TLL+ KKRKR K
Sbjct: 241 PQKWKMTTTLSEKHIGHETR---TDKEKKTGAVPTVDTPATPSTSTGVTLLDSKKRKRNK 300

Query: 301 LSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQ-NVAELA 324
            SSKNQTE    SAPTADGD++EGTSKR RKR  KSWTSLKE+A+ NEQSGKQ N+AELA
Sbjct: 301 PSSKNQTELGSCSAPTADGDRSEGTSKRNRKR--KSWTSLKEVARTNEQSGKQKNMAELA 360

BLAST of Lag0019244 vs. ExPASy TrEMBL
Match: A0A6J1C4C1 (uncharacterized protein LOC111008256 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008256 PE=4 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 1.7e-116
Identity = 257/391 (65.73%), Positives = 277/391 (70.84%), Query Frame = 0

Query: 1   MAKK-KGNKKKGATNPTSG-PQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQ 60
           MAKK +G KK GA+N T G PQDSITLRQE TGKIQPK  +NVK YL+HLENLATWASGQ
Sbjct: 1   MAKKNRGKKKIGASNSTPGRPQDSITLRQEKTGKIQPKPYNNVKIYLSHLENLATWASGQ 60

Query: 61  ASIPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRS 120
           ASIPSLAAF+G+R AA A+SS V P+ SL +CQRCETILQPGSNCSIRIEKN AKRRR+ 
Sbjct: 61  ASIPSLAAFFGRRFAAAADSSGVVPDASLFLCQRCETILQPGSNCSIRIEKNKAKRRRKH 120

Query: 121 NKSSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLY---------ETGFESK-------- 180
           NK SNLTQNN+ YYCHYCS RNIKRGTPKGHMKV Y             ESK        
Sbjct: 121 NKCSNLTQNNLVYYCHYCSCRNIKRGTPKGHMKVRYAQKSKAVEESEPIESKSKVKVLNV 180

Query: 181 ------------MATEILTVDAPTIPS----------------TTG-------------- 240
                       M TEILT+DAP IPS                TTG              
Sbjct: 181 RRGKECEATAVQMTTEILTIDAPMIPSPTTREIGTIDAPIAPPTTGDTLVVGASAILPPR 240

Query: 241 --DILTIDTPATPPTLIATTLSKRQKRKMRKLAMK-LTGPDTSCAPTDAEEKTGDAPTVD 300
             DIL I+ PATPPT+  TTLSK QKRK RKLA K  TGP+ SCAPTD+E+KTGD PTVD
Sbjct: 241 MEDILIINAPATPPTVSGTTLSKSQKRKKRKLAAKNQTGPENSCAPTDSEKKTGDIPTVD 300

Query: 301 APATPPTTLGTTLLNLKKRKRKKLSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSW 328
           APATPP  +G TLL  KKRKRKK SSKNQTEPE S APTA+GDKTEGTSKRKRKR  KSW
Sbjct: 301 APATPPAMIGMTLLESKKRKRKKPSSKNQTEPESSCAPTAEGDKTEGTSKRKRKR--KSW 360

BLAST of Lag0019244 vs. ExPASy TrEMBL
Match: A0A6J1J5I6 (uncharacterized protein LOC111482797 OS=Cucurbita maxima OX=3661 GN=LOC111482797 PE=4 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 5.5e-112
Identity = 241/362 (66.57%), Positives = 267/362 (73.76%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MAK+KGN KKGA+NPTSGPQDSIT+RQEITGK +PKVS+NVK+YLNHLENLATWASG+AS
Sbjct: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           IPSLAAF+GQR A  AES AV P+ SL  CQRCETILQPGSNCSIRIEKNNAKRRRR  K
Sbjct: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKM------------------ 180
            SN  QNNVAYYCH+CS RNIKRGTPKGHMKVLY+  FE ++                  
Sbjct: 121 CSNSRQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPVDVKDGKECETSAVER 180

Query: 181 ATEILTVDAPTIPST-----TGD---------------ILTIDTPATPPTLIATTLSKRQ 240
            TEILT+DAP IP       TGD               IL I++PATP TL  TTL K Q
Sbjct: 181 PTEILTIDAPKIPDASAIPPTGDITALDNPAIQLQTKGILNINSPATPSTLSVTTLLKSQ 240

Query: 241 KRKMRKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKKLSS 300
           KR+M  L+ K  G D     TD E+KTG  PTVDAPATP T+ G TLL+ KKRKR K SS
Sbjct: 241 KREMTTLSEKHIGHDIR---TDEEKKTGAVPTVDAPATPSTSTGVTLLDSKKRKRNKPSS 300

Query: 301 KNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQ-NVAELAIPF 324
           KNQTEP   SAPTADGD++EGTSKR RKR  KSWTSLKE+A+ NEQSGKQ N+AELAIPF
Sbjct: 301 KNQTEPRSCSAPTADGDRSEGTSKRNRKR--KSWTSLKEVARTNEQSGKQKNMAELAIPF 357

BLAST of Lag0019244 vs. ExPASy TrEMBL
Match: A0A6J1EZL5 (uncharacterized protein LOC111437660 OS=Cucurbita moschata OX=3662 GN=LOC111437660 PE=4 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 6.3e-108
Identity = 238/365 (65.21%), Positives = 265/365 (72.60%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MAK+KGN KKGA+NPTSGPQDSIT+RQEITGK +PKVS+NVK+YLNHLENLATWASG+AS
Sbjct: 1   MAKRKGNTKKGASNPTSGPQDSITIRQEITGKFKPKVSNNVKTYLNHLENLATWASGKAS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           IPSLAAF+GQR A  AES AV P+ SL  CQRCETILQPGSNCSIRIEKNNAKRRRR  K
Sbjct: 61  IPSLAAFFGQRLATAAESLAVAPDASLFTCQRCETILQPGSNCSIRIEKNNAKRRRRQKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKM------------------ 180
             N TQNNVAYYCH+CS RNIKRGTPKGHMKVLY+  FE ++                  
Sbjct: 121 CCNSTQNNVAYYCHHCSCRNIKRGTPKGHMKVLYDAAFERRVKPLDVKDGKECETSAVER 180

Query: 181 ATEILTVDAPTIPST------TGD----------------ILTIDTPATPPTLIATTLSK 240
            TEILT+DAP IP        TGD                IL I++PA P T+  TTLSK
Sbjct: 181 PTEILTIDAPKIPDASAIPPPTGDITALDNPAIQLPETEGILNINSPAAPSTVSITTLSK 240

Query: 241 RQKRKM-RKLAMKLTGPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKK 300
            QK KM   L+ K  G +T    TD E+KTG  PTVD PATP T+ G TLL+ KKRKR K
Sbjct: 241 PQKWKMTTTLSEKHIGHETR---TDKEKKTGAVPTVDTPATPSTSTGVTLLDSKKRKRNK 300

Query: 301 LSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQ-NVAELA 324
            SSKNQTE    SAPTADGD++EGTSKR RKR  KSWTSLKE+A+ NEQSGKQ N+AELA
Sbjct: 301 PSSKNQTELGSCSAPTADGDRSEGTSKRNRKR--KSWTSLKEVARTNEQSGKQKNMAELA 360

BLAST of Lag0019244 vs. ExPASy TrEMBL
Match: A0A5D3CYJ3 (Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004600 PE=4 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 5.9e-98
Identity = 218/349 (62.46%), Positives = 250/349 (71.63%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MA+KKGN K+G++NPTSGPQ+SITLRQE TGKI+PKVS+N K YLNHLENLATWASGQ S
Sbjct: 1   MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           +PSLAAF+GQR AA AES AV P+PSL +C RCET+LQPGSNC IRIEKNNAK+RRR  K
Sbjct: 61  LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRHKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKMAT-----------EILTV 180
           +SN+TQN VAYYCHYCS RNIKRGTPKGHMKVLY T   SK+ +           +ILTV
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTV 180

Query: 181 DAPTIPS-TTGDILTIDTPATP------------PTLIATT--LSKRQKRKMRKLAMKLT 240
           DAPT P  TT D LTIDTPA P             T +  T  +S      +        
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240

Query: 241 GPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKKLSSKNQTEPEISSAP 300
            P TS   + +  +  D PT+DAPATP T    TLL+ K+RKRKK SSKN+TEPE  SAP
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300

Query: 301 TADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQNVAELAIPFSL 324
           T+ G+K+E TSKRKR R  KSWTSLKEIAQ  E+ GKQNVA LAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNR--KSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347

BLAST of Lag0019244 vs. ExPASy TrEMBL
Match: A0A1S3BU13 (uncharacterized protein LOC103493157 OS=Cucumis melo OX=3656 GN=LOC103493157 PE=4 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 5.9e-98
Identity = 218/349 (62.46%), Positives = 250/349 (71.63%), Query Frame = 0

Query: 1   MAKKKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWASGQAS 60
           MA+KKGN K+G++NPTSGPQ+SITLRQE TGKI+PKVS+N K YLNHLENLATWASGQ S
Sbjct: 1   MARKKGNTKRGSSNPTSGPQNSITLRQEATGKIKPKVSNNAKVYLNHLENLATWASGQPS 60

Query: 61  IPSLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEKNNAKRRRRSNK 120
           +PSLAAF+GQR AA AES AV P+PSL +C RCET+LQPGSNC IRIEKNNAK+RRR  K
Sbjct: 61  LPSLAAFFGQRLAAAAESLAVAPDPSLFLCARCETVLQPGSNCYIRIEKNNAKKRRRHKK 120

Query: 121 SSNLTQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKMAT-----------EILTV 180
           +SN+TQN VAYYCHYCS RNIKRGTPKGHMKVLY T   SK+ +           +ILTV
Sbjct: 121 ASNVTQNVVAYYCHYCSCRNIKRGTPKGHMKVLYGTECVSKVKSVVVKDGKECENKILTV 180

Query: 181 DAPTIPS-TTGDILTIDTPATP------------PTLIATT--LSKRQKRKMRKLAMKLT 240
           DAPT P  TT D LTIDTPA P             T +  T  +S      +        
Sbjct: 181 DAPTTPPLTTVDCLTIDTPAIPSLSTTRDDVAVDTTAVPPTEDISVDDGPAISSPRTTPA 240

Query: 241 GPDTSCAPTDAEEKTGDAPTVDAPATPPTTLGTTLLNLKKRKRKKLSSKNQTEPEISSAP 300
            P TS   + +  +  D PT+DAPATP T    TLL+ K+RKRKK SSKN+TEPE  SAP
Sbjct: 241 IPSTSSVTSMSRSQVRDIPTLDAPATPLTLTAMTLLDSKRRKRKKPSSKNRTEPESCSAP 300

Query: 301 TADGDKTEGTSKRKRKRKKKSWTSLKEIAQINEQSGKQNVAELAIPFSL 324
           T+ G+K+E TSKRKR R  KSWTSLKEIAQ  E+ GKQNVA LAIPFSL
Sbjct: 301 TSHGEKSEDTSKRKRNR--KSWTSLKEIAQREEEKGKQNVAGLAIPFSL 347

BLAST of Lag0019244 vs. TAIR 10
Match: AT5G41270.1 (CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 127.5 bits (319), Expect = 2.0e-29
Identity = 120/327 (36.70%), Positives = 157/327 (48.01%), Query Frame = 0

Query: 4   KKGNKKKGATNPTSGPQDSITLRQEITGKIQPKVSSNVKSYLNHLENLATWAS-GQASIP 63
           K+G KK   T    GP     LR E                  HL+NLA W+S G   IP
Sbjct: 3   KRGPKKLTNTQGGGGPNLKSVLRHE------------------HLKNLALWSSTGDTPIP 62

Query: 64  SLAAFYGQRHAAVAESSAVPPNPSLIICQRCETILQPGSNCSIRIEK---NNAKRRRRSN 123
           SLA+  G+R AA  ES+ +  +P L+ CQRCETIL+PG NC++RIEK   N  K+R R  
Sbjct: 63  SLASLLGRRLAADTESTGITTDPDLVSCQRCETILKPGFNCNVRIEKVSANVKKKRNRCK 122

Query: 124 KSSNL--TQNNVAYYCHYCSFRNIKRGTPKGHMKVLYETGFESKMATEILTVDAPTIPST 183
           KS+N+   QNNV Y+C++CS RN+KRGT KG MK LY                 P  P T
Sbjct: 123 KSNNICFPQNNVVYHCNFCSHRNLKRGTAKGQMKELY-----------------PFKPKT 182

Query: 184 TGDILTIDTPATPPTLIATTLSKRQKRKMRKLAMKLTGPDTSCAPTDAEEKTGDAPTVDA 243
                     + P      T+ +  +  M      L+ P+ S      E+  GD P    
Sbjct: 183 A-------RSSRPKIKKEMTMPQEIQSNM------LSSPERSVKDQVEEKSVGDTPK--- 242

Query: 244 PATPPTTLGTTLLNLKKRKR-KKLSSKNQTEPEISSAPTADGDKTEGTSKRKRKRKKKSW 303
                      +L L++ +R +K  SK  +EP+  S P    +KT G S  KRKR K  W
Sbjct: 243 ---------PMMLTLERDRRIRKPKSKKPSEPQ--SVP----EKTVGGS-NKRKR-KSPW 258

Query: 304 TSLKEIAQINEQSGKQNVAELAIPFSL 324
           TS+KEIA+ N+ S   N     IPF L
Sbjct: 303 TSMKEIAETNKSSKAGN---FKIPFLL 258

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022136585.1  3.4e-116  65.73     uncharacterized protein LOC111008256 isoform X1 [Momordica charantia]
XP_022984531.1  1.1e-111  66.57     uncharacterized protein LOC111482797 [Cucurbita maxima]
XP_023553160.1  1.6e-110  65.56     uncharacterized protein LOC111810649 [Cucurbita pepo subsp. pepo]
KAG6577189.1    9.0e-109  65.48     hypothetical protein SDJN03_24763, partial [Cucurbita argyrosperma subsp. sorori...
XP_022931505.1  1.3e-107  65.21     uncharacterized protein LOC111437660 [Cucurbita moschata]

Match Name      E-value   Identity  Description
A0A6J1C4C1      1.7e-116  65.73     uncharacterized protein LOC111008256 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1J5I6      5.5e-112  66.57     uncharacterized protein LOC111482797 OS=Cucurbita maxima OX=3661 GN=LOC111482797...
A0A6J1EZL5      6.3e-108  65.21     uncharacterized protein LOC111437660 OS=Cucurbita moschata OX=3662 GN=LOC1114376...
A0A5D3CYJ3      5.9e-98   62.46     Rpr2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S3BU13      5.9e-98   62.46     uncharacterized protein LOC103493157 OS=Cucumis melo OX=3656 GN=LOC103493157 PE=...

Match Name      E-value   Identity  Description
AT5G41270.1     2.0e-29   36.70     CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Ha...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                          Source       Source Term  Source Description    Alignment
IPR007175  Ribonuclease P subunit, Rpr2/Snm1/Rpp21  PFAM         PF04032      Rpr2                  coord: 45..136; e-value: 1.2E-12; score: 47.9
None       No IPR available                         GENE3D       6.20.50.20   -                     coord: 88..143; e-value: 5.4E-6; score: 27.9
None       No IPR available                         MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 210..297
None       No IPR available                         MOBIDB_LITE  mobidb-lite  disorder_prediction   coord: 1..23
None       No IPR available                         PANTHER      PTHR36072    OS01G0541600 PROTEIN  coord: 1..160; coord: 195..323

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0019244.1  Lag0019244.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006396      RNA processing