Cla97C01G000040 (gene) Watermelon (97103) v2

NameCla97C01G000040
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionNPL4-like protein 1
LocationCla97Chr01 : 25471 .. 33326 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGCTCAGAATTCGCAGTAGAGATGGCCTGGAGCGAGTCGCTGTAGACAACCCACACATCACAATCGCTCAACTCAAAGCCATAATTCAATCCCAGCTCAATATCCCAATCCACAACCAAACCCTCTCAACTAACCAAAATATTCTATTGGCGAAGACTCATGACGATCTTTCCAAATTCACTGACATGTCTAATCCTAATACCCATCTCTCGTCGCTTAATTTGTCTCATGGGTCTATTGTCTTTCTCGCCTACGAGGGCGAGCGCACTGTTGCTGGCCCTACTTTCCATCCCGCCGGATCTTTTGGCCGTAAAATGACTATGGATGATCTAATTGCTAAGCAGATGCGGATCACTCGTCAAGAAAACCCCCATTGTGAATTGGTTTCTTTCGACCGGGATTGCGCCAATGCTTTCCAGCATTATGTTAATGAAACGCTAGCCTTCGCTGTCAAACGTGGGGGAATGATGTACGGAACCGTGTCACCAGAAGGCAAGGTCGAGGTAGATTTCATATATGAGCCGCCACAGCAAGGGACTGAGGACAATTTACTGTTTTTCCGAGATCCCGATGAGGAAAGATTGGTAGAAGCAATTGCAGTTGGGTTGGGGATGAGGAAAGTTGGGTTTATATTCACGCAGACGATTAGTCAGGACAAAAAGGACTACACCTTATCCAACAGGGAAGTACTCCAGGCGGCTCAGTTTCACTCCGAGAGCGAGTTGAAGGAGTGGGTGACAGCAGTTGTGAAGTTGGAGGTGAACGAGGATGGGGGTGCTGATGTTCATTTTGAGGCTTTTCAAATGAGTGACATGTGCATTAGATTGTTCAAGGAAGGTTGGTTTGAGACGGATATTGGAGAGGATTTTGATCCCAAGCTCTCGAAGATGAGGAGGGACGTTGTTGTTGGTGTCAAAGACACTAGAGATGTTGACAATGACTTTTTCCTGGTCGTAGTTAAGATTTTCGACCATCAGGTTCGTGGACCCTTTAATTTTGCTAATCATATTACCTAATTCCTCTGCTGCGTCGAATTGTTTGCGTTTTCTCCTCAATCAATGACTCATAATTGCAATCATTGAACTTGTTTGCACAGAATTCTCGGATTTTTGTTAATTTTCCTTGTATTTTGCTTCCTTGACTGAAAGTGGATAGTCTATTAGCTTTGTAAATTGAATGTGTTAGCCCAGTATTGGCCGGAACGTAGGAGTTGAAGGTTTGGGTGTGGTGTATTTTCCATTTTCTTAGAGCCAGAATATTAGCGCTACTTGGTAATAGTTTTTGGAAATATCAATCTGGTATTTCGTTCATTAAATATTTGTTATTTATTCTACCATGACAGGTTTTAACAGATTACTAGTGTCAAGTAAGAAGACTGTGATGGCTGTATAAAATTTTACTTCTGTTCTCTTTCTCAGGCAGATGCTTACAAATAATTTATCCTTCTTGCAGAAAGTTTTCATGATACAATTCTTTGAAACAGATGAGTGCAAAATATCTTATTCTTTTTGTGAACCAAAGCCTTAGACTTCATTCTGTTTTTCTTTTTTTCCCACATCTACATTGGTCAAGAATGAGAATTACCTTTTTCAAATTCACTAATTGCTGCCTAGCTCTCCTTTTTCTTTTCCATTCTTGCTCACTCTTAAAATTGAATCCCTAAATTCCTTAATGATGACATTTACAATGGGAAAAAGCTTTCTTGATTTCTTCAAGTATAAAATGTTATCTCTTGTGATCTATTATGTTTTTCTGGCTTATAGTTTTTTTTTCCATTCAAGAATTAAGTGTACTTTTGTGCTTTATATTTCTTCACCTTTTTGTCTTCTTAGGAGCTGCTTGATTGGGCATGTTCAAAATTTTCTTATAGCATTCTTATCAGATCTAAGAAGCAATTACTTACTCTGAAAGGGTTGTATACAACATTTCAGGAAATTAGGTTTAAGAAAGTGGTCACCCTACAGTCACCTGCATTGTATTAAAGTCCCCCAAGTTTCTAGAATCTCAAATATTGTATGTTCCATGATAATCTTAATGTTTAGCTTAGGTTGAGGCCTTGGAAGTTTTTTTATGCTTAAAAAATGAAATGAGGACCAACGGTTTCCTTATTGATTTACTGATATAGTTTCTGGGGTTTACTAAGTAAATTTTGTTGCAATATTTAGTTCCGGGACAATGTTCTTATTGCTAATAAGACTGAGGTGTTGTTTACAAGTTATTATATCAGTCTTGTCACTCAAGGTCCCTTTTCCCTTTGTAGGAAGACAAATTTTGGAGTTTGGACCATGTTGTTTTTTTAATTATTAATGAGTATATCAAGGACTATGGGTTTCATAGATTGGAAGGAGTTAATTTTCTATTGATCTTGAAGAAGTCCATGTCCATGTTGATTGGAGCTTTTGTGATAAGGCTTATAAAAGAAGGGTTTTGGGTTCAAGTGGAGAATGTGGGTTTGTTGCTATCCTAAGTTGGTTAATCAATTTTGGTTAGGAGGAATTTAGTGGAAGAACTTTAACTTATCAGGAGAGTTTGGGGGGGGGGGGGGGGGGGGGGGGGGGTGGTTATCCCTATTCTTTTCCCAGTTGTTGTTGATGTCCAAAGCAAATTAGTGTCAACAAGAACAGATAAGGATCTTATTTCAGGGTTCTATGTGGGGACGTAATTAATTTTTCTATCTCATTTTTAGTTTGTTGATAATACAAACTATTTTTCTCGGTTAGGGAGGGCTCTCTTGTTAGTCTTAACATAGTTCTATCGGTTTGTGAAATCATTTTGGTCATAATTAGAGACAAGAGTTTTGTGATTAGTATTAGTCATGATTCCTTCTAAGCTGCCTGGATGAATGTCTAATTGGGGTTTGCATGAAAGTGGCTTCCTACCATTTATATACAGTGCTTCCCCTTGGCCATACTTCCAATCATTTGTTGTTTTGGAATCTTTAATCTCACAGGACAATCTGCCTGACTCTACAACATTTGGGTGTCTAAGGAAACTTGTAGGATATTAAATCCTAGGTAGGTGGCCACCATGGATTGGACCCATGACCTCTTAGCCTTTTATTGAGATTGTCTCCTTTTTTACCACTAAGCCAACCCATGGTGGTTTATTTCGGAATCTAATTGTGGAATATGTCGAAAGATATCTCTTTTATTTGAAAATTATGTTACTTTCTTAGGGTGATAGGACAATACTTTTATAGTCTGGAGATGTTTTGAGTGATTTTCTAAAAGTCACATCTTGTACTGAGGATGTCAGTATTTCTTGAGGTCCAAGTTCATATCAATGCAGAGGGAATTCCAATAGATATTCTACTTAGACGTCAATATTTCTTGAGGTCCAAGTTCATATCAACGCAGAGGGAATTCCAATAGATATTCTACTTAGACTCTTCCTCTCTTTTATAGCCAAAGCACATATAAATTGGCTTCATGGTTGTGGGTTTATGAAATCAGTGGGTCAAAATTGAAAGAATCTGAATGAAGCATTTATAAGGAAGTAGGTGAGAAGCCTATGATAAATCAATCTACTGATTTGAAGCTTGTTCTAAACCACGGAAAAATTGGAGTTTTAAGTCTCTATCATTTGGTTGTTCAATCAACAGATGATGCCATTATTACAAGACCTTAGGAGCATGAATTTTCTGAGGAGGATATTGACGTCAGCATTAGTAGTTTGGATTCTCCCAGTGGTTTTCTGGTTTTAGAATTGTTAAATAATGTAATGGGTAACCAATCATTTCATGATGGGCTCCTTAATCTCTTAAGGGAAGATAAAATGTCACATGATGGTGAAAATATAAAAACTTTTGCTGTCCCTTCAGTAAAGGGGAAATCTATGTTAGCCTCGTGGATGGGGGTCTTTCTAAATACATCAAATAGGCTGGTATATCTCTTCTCCCATTCATCCCATCTCAATATTGATAATTATAGAGGGCTTATTTCTCTGACTCTCAATAGGTCTTTTTTGTTATGCACTCATTATGAAAATCATTCTTGGTTTCCTATTAAGCCCAAAGACCATTTTGGTGGACTCAAGCTCCAAAAGGACTTGAAACTCTAAGAAGACAAAGTTTCATCTTTCTAGATTTCTTTTTCTTTTTTCCTTTTTCTTTTTCTTTTTCTTTTTTTTTCTTTTTTCCCCCTCCTTCTGGTATGTTGTTCTGTTTGGATAGTTTCAGCTTTCTGAGAGAGAGTGAATGGGTTAATCAGGATTTTGCTTAATGGTAATTTAAATTCAGCTGATATCAAAAAAGCCTCCACGTTCTATTCTGAAGCCATCAGTATGTGTTTTATGTTATGCAGACAGTGAATTCCAAAGTCTTTTGTTCTCCCACTGCCCATACGCTTTGGCTTCTTGGTATTCAATGTTTCACCTTTCTAATATGCATTGGATTTTTTCTCAAGATTTAAGAAGTAATTTATTCCAGTTTCTCATCGGTCCAGTTCTCCACTCTCAAGCTCATCTTTTATGGGTTAATGCCGTTAAATCCATCTGGTCAGAATTATGGCTCTAACAGAATCAAAGAACTTAAGTTTGGTCTGAATGTTTTGAGAGTTGAGTTCACAGATTCTCAATGGTGCTCACCCAGCTTATTTTATCATGGAATGCTTTTACTCATTCTTCCTTTTTTGCTTTGCCATACTTTTTATTTTCCCTCTTTGTCATGAGGAAGTGGGCCTCAACCCAATTATTTAAAATGAGGCCACACCTACGAAGCCCCCATGGTTTGACGTTCCTCTAGATCCATATGATTTCACGCCATTAGGGTTAGAGGTTATCACCAAACCAACTTGTTGTCTTTTCTCCAGCACACGGTGCCTTCCTCGAGGCTTCATCTCCATTTACACATGCTCATGTCGCTGGTAAGAGCCTCCATTAGCTTTTCTCTTCTTTTCTTTCGAACTTTATAGCTACTAAAGTGTTTTCCTCTTAATGACCGATGATCCCACTCTAAGAATCCAACCTTCGACATGCTAGATATCTTTTCAGCCCATTTACCCCAGATTACGAGTCATTCTCCCCTCTCTTGCAACTGAATCTTCTGTGCTGATATTTTTTGTTGGCCTTCCCATTTTTCTGTTTTTCTACTCGTATTGTGAACAAACACAACCTTGGGCTCTGTGTTCTTTCCATCTTGTTTCTTATATGGGAATTCTTCCTTTCTTGCAGTTGTTGCGCCATCCTGATCATCGACCTTCCTAGTATCTTCTTTCTTGCCATATGAGATACTTGATTCTGTTATGGAACTCTGCCTCTTTGTCTTTGCCAACCTAATCTCTTTTTTTATTGTGTAATCTCCACTTTTATTTCAACTCTATGCTGTTGTGTCTCACATAGATTGAAAATCGAGTTCATTGACTCCAATGGTTTCTTCAGTTGATCCATTTATTTTGATAAAATTTTCAATATCTCTTCATTTAAATTAACATCTAACTCCGTGACCAGAATTGAACTGCACTATACTAATTGATTGAACCAAAATATACTCAATTATAAAAGGTAACGAAACCATAAGATAACCAAGTTTCCAAGTAATTCAAGGTATTTGGACCCTCCTTTTAAGCACACTCACTCAAATCTTAGCCAACTTAATTAGACCATGCCTTCATCCTTTCTAACCCCCTCTATTTATAACAAACTCTCCTAACTAACTTTCTTATCTAATTACCAATGTACTCTTGTGGCCTCTCGTTCCCTTGGTGGGATTGGGATTGGGGGAAAAACTCTCTTTTTCTCAAATGTTAGTACTTCGAGAAAGAAAATAAAAAAAGCTCTTTGGAGAAGGGTGGTAAAGAGCACATATGGGGTGGATTCTTTAAAAGACCTGAAAAAACAAAAAACAAAAAACAAAATAGAGAGGAAGGTTATAACAAAAAACAAAATAGCACATATGTTTAGTCCGTGGAAGGTTATAACAAAAAAATACGATTTCTTTGGAAAAACGTTAGTTTTAAACTAGGTGATGACAATGACCTTTTTTGGGAAGATAGTTGGCATGTTAATCAGGCCCCAGAAGATGTTGTTACTTCCTTTTATAGCTTTTTATTTATTTATTTATTTTCAGAAACAATTTTATCGATGAATAAAATAATCCAAAAGGATATACAAAACTGTCCTATCAAAGGGAATACAAAAGTCTGTTCCAATTTGCGACAAGATAATTTAGGCTATGATCTTTAAGGGGATGTAGGTGTTTATACCAAAAAAGAGCATTAAAAATAGAAGAATCAAGGATGTCTATTTGGCCTTGGAAGAAAATAGAAGGCTTTTGGAAAAAGAGATTTGCTCTTAGATTTGGGAAGGTTTAGTACTTAAAAAGGTTAAATTCTTCCTTTCAACGGTTGCTCATGGTAGCATTAATACTTGTGATAAGGTGCAAAGAAGATATCCCCACCTTAACATTTTTCCTAATACGTGCACCATCTGCAGGAGTGAGGGATCACTTCCTCACTTTTTTCTTCACTTGCAGCTTCAGCAAAAAAGTGTGGAACTATTTTGGTAACTTATTCGGTCTGAAATAGTGTTACCCTCTGGTGTTAGATCAGGGGCTGGTGGAGTTTTAATGTTGTGGACGGCAAAATTATGTAATATGTATTAGTTCCTTTACTTATTGTGCCCTGTTTCACGGTTATACTCTGTTTTGGCAGTTTATTTTTCGTTCCAGTTTTTGTTCTTTTATTTAGGAGGCTCATTTGTTTTTAACGTGAGGCCCACCTCCTCCGTATAACTAGTAGCCTTTTTCTATGAAGGATCGAGTAAATTTATTTTCTTGGTTTACACCACAAAGGCTATTTTGTGGCTTATATGGGAGGAAAGAAACGTTAGAATTTTCTTCCATTATTTTTGTGGCTTTGTACGATTCACGGCTCCCAATTGAATTTTGATGTTTCTCGTTGGATGTGGGTGGTGAGTCCTTTTGTAATTATTCACTAGGTCACATTTTATTAGATTGGGGTCCCTTTCTCTAGTGTGCTCCCTTTTTGTGGGCTGTCTTTTTTCATTTTTTTAAATTCTTTTGGTTATTGTTTTTTAAAATTTTTTATTTTTATGGCCTTGTATTCTTTCCTTTTTTTCTCTATGAAAGTTGGTCTTCGAATAGAAAAACAAAAACGAAGAACACAGAGTTCTTTTGTATTAACTACACCTTGCTCATTAATTGTCTCTTTTTCAGAAGACTGATATACCCTTAATATCCCAATAGCAATCCTAATTCTATTCCAGTAGCACTCCTCGCATGTATCTAGTGTATTGGTTGGATGCTTCATTTTTTGTGTTGTATGTGCCAAATATCTGCAATGTGTTTGAGGGTTTAATGCAAGAGTTTGATGTGATTATGAAGTTGGATGTGAATCGGCCAACAATTTTAACATTTTCCAGTAGATGTGGAGAGTAATGTGTTTCCTAGTGAGAATTTTTTTTTGGGACTTGGTCAAAATTTAAATATATGGTTGTGCTTGATGAATAGTATGAATGGAGGGGGAAAAACTACACAAAACTGAAGTATTTTTCACTTTGCTAACTATATGAATACGGTAGAGATTTTAAATTTGTTTCTGGTGGCCGCTTAACTGATGTTAATGAGAACTTTGTAATGCAGGGCCCACTTTCAACAACATTTCCGATTGAAAACCGAAATGCTCCAGTGACCATGAAGGCATTGAAGAATCACCTGGACCGCTCAAAAGGGCTTCCGTTTGTGAAGCGAATTTCTGACTTTCATTTGCTGCTGTTACTAGCCAGGGTGTTGGATGTGAGCTCTGACGTTCCTGCACTGGCCGAGTGCGTTCAGACCCAGACAGCCATACCTGAAGGTTACAAAATATTAATCGAGTCTATGGCAAGTGCTGCTTGA

mRNA sequence

ATGATGCTCAGAATTCGCAGTAGAGATGGCCTGGAGCGAGTCGCTGTAGACAACCCACACATCACAATCGCTCAACTCAAAGCCATAATTCAATCCCAGCTCAATATCCCAATCCACAACCAAACCCTCTCAACTAACCAAAATATTCTATTGGCGAAGACTCATGACGATCTTTCCAAATTCACTGACATGTCTAATCCTAATACCCATCTCTCGTCGCTTAATTTGTCTCATGGGTCTATTGTCTTTCTCGCCTACGAGGGCGAGCGCACTGTTGCTGGCCCTACTTTCCATCCCGCCGGATCTTTTGGCCGTAAAATGACTATGGATGATCTAATTGCTAAGCAGATGCGGATCACTCGTCAAGAAAACCCCCATTGTGAATTGGTTTCTTTCGACCGGGATTGCGCCAATGCTTTCCAGCATTATGTTAATGAAACGCTAGCCTTCGCTGTCAAACGTGGGGGAATGATGTACGGAACCGTGTCACCAGAAGGCAAGGTCGAGGTAGATTTCATATATGAGCCGCCACAGCAAGGGACTGAGGACAATTTACTGTTTTTCCGAGATCCCGATGAGGAAAGATTGGTAGAAGCAATTGCAGTTGGGTTGGGGATGAGGAAAGTTGGGTTTATATTCACGCAGACGATTAGTCAGGACAAAAAGGACTACACCTTATCCAACAGGGAAGTACTCCAGGCGGCTCAGTTTCACTCCGAGAGCGAGTTGAAGGAGTGGGTGACAGCAGTTGTGAAGTTGGAGGTGAACGAGGATGGGGGTGCTGATGTTCATTTTGAGGCTTTTCAAATGAGTGACATGTGCATTAGATTGTTCAAGGAAGGTTGGTTTGAGACGGATATTGGAGAGGATTTTGATCCCAAGCTCTCGAAGATGAGGAGGGACGTTGTTGTTGGTGTCAAAGACACTAGAGATGTTGACAATGACTTTTTCCTGGTCGTAGTTAAGATTTTCGACCATCAGGGCCCACTTTCAACAACATTTCCGATTGAAAACCGAAATGCTCCAGTGACCATGAAGGCATTGAAGAATCACCTGGACCGCTCAAAAGGGCTTCCGTTTGTGAAGCGAATTTCTGACTTTCATTTGCTGCTGTTACTAGCCAGGGTGTTGGATGTGAGCTCTGACGTTCCTGCACTGGCCGAGTGCGTTCAGACCCAGACAGCCATACCTGAAGGTTACAAAATATTAATCGAGTCTATGGCAAGTGCTGCTTGA

Coding sequence (CDS)

ATGATGCTCAGAATTCGCAGTAGAGATGGCCTGGAGCGAGTCGCTGTAGACAACCCACACATCACAATCGCTCAACTCAAAGCCATAATTCAATCCCAGCTCAATATCCCAATCCACAACCAAACCCTCTCAACTAACCAAAATATTCTATTGGCGAAGACTCATGACGATCTTTCCAAATTCACTGACATGTCTAATCCTAATACCCATCTCTCGTCGCTTAATTTGTCTCATGGGTCTATTGTCTTTCTCGCCTACGAGGGCGAGCGCACTGTTGCTGGCCCTACTTTCCATCCCGCCGGATCTTTTGGCCGTAAAATGACTATGGATGATCTAATTGCTAAGCAGATGCGGATCACTCGTCAAGAAAACCCCCATTGTGAATTGGTTTCTTTCGACCGGGATTGCGCCAATGCTTTCCAGCATTATGTTAATGAAACGCTAGCCTTCGCTGTCAAACGTGGGGGAATGATGTACGGAACCGTGTCACCAGAAGGCAAGGTCGAGGTAGATTTCATATATGAGCCGCCACAGCAAGGGACTGAGGACAATTTACTGTTTTTCCGAGATCCCGATGAGGAAAGATTGGTAGAAGCAATTGCAGTTGGGTTGGGGATGAGGAAAGTTGGGTTTATATTCACGCAGACGATTAGTCAGGACAAAAAGGACTACACCTTATCCAACAGGGAAGTACTCCAGGCGGCTCAGTTTCACTCCGAGAGCGAGTTGAAGGAGTGGGTGACAGCAGTTGTGAAGTTGGAGGTGAACGAGGATGGGGGTGCTGATGTTCATTTTGAGGCTTTTCAAATGAGTGACATGTGCATTAGATTGTTCAAGGAAGGTTGGTTTGAGACGGATATTGGAGAGGATTTTGATCCCAAGCTCTCGAAGATGAGGAGGGACGTTGTTGTTGGTGTCAAAGACACTAGAGATGTTGACAATGACTTTTTCCTGGTCGTAGTTAAGATTTTCGACCATCAGGGCCCACTTTCAACAACATTTCCGATTGAAAACCGAAATGCTCCAGTGACCATGAAGGCATTGAAGAATCACCTGGACCGCTCAAAAGGGCTTCCGTTTGTGAAGCGAATTTCTGACTTTCATTTGCTGCTGTTACTAGCCAGGGTGTTGGATGTGAGCTCTGACGTTCCTGCACTGGCCGAGTGCGTTCAGACCCAGACAGCCATACCTGAAGGTTACAAAATATTAATCGAGTCTATGGCAAGTGCTGCTTGA

Protein sequence

MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSKFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRITRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQGTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRRDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPFVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA
BLAST of Cla97C01G000040 vs. NCBI nr
Match: XP_008437660.1 (PREDICTED: NPL4-like protein 1 [Cucumis melo])

HSP 1 Score: 798.9 bits (2062), Expect = 8.1e-228
Identity = 399/411 (97.08%), Postives = 405/411 (98.54%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQL IPIHNQTLS NQNILLAKT DDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSANQNILLAKTQDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPNT+LSSLNLSHGSIVFLAYEGERTVAGPT HPAGSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKM++
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRN PVTMKALKNHLDRSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLL+RVLDVSSDVPALAECVQTQTA+PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLSRVLDVSSDVPALAECVQTQTAVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. NCBI nr
Match: XP_004145952.1 (PREDICTED: NPL4-like protein 1 isoform X1 [Cucumis sativus] >KGN49825.1 hypothetical protein Csa_5G139050 [Cucumis sativus])

HSP 1 Score: 794.7 bits (2051), Expect = 1.5e-226
Identity = 398/411 (96.84%), Postives = 403/411 (98.05%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQL IPIHNQTLSTNQNILLAKT DDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPNT+LSSLNLSHGSIVFLAYEGERTVAGPT HPAGSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFETDIGEDFDPKLSKM++
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKI DHQGPLSTTFPIENRN PVTMKALKNHLDRSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLARVLDVSSDVPALAECVQTQT +PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. NCBI nr
Match: XP_022155640.1 (NPL4-like protein 1 [Momordica charantia])

HSP 1 Score: 779.6 bits (2012), Expect = 5.1e-222
Identity = 387/411 (94.16%), Postives = 403/411 (98.05%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQ+ IP+HNQTLST+QNILLAK+HDDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQMKIPVHNQTLSTHQNILLAKSHDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPN+ LSSLNLSHGSIVFLAYEGERTVAGP FHPAGSFGRKMTMDDLIA+QMRIT
Sbjct: 61  FTDMSNPNSLLSSLNLSHGSIVFLAYEGERTVAGPAFHPAGSFGRKMTMDDLIARQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQ+YVNETLAFA+KRGG MYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQNYVNETLAFAIKRGGFMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAA+FHS+
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAEFHSD 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           S LKEWVTAVVKLEVNEDGGADVHFEAFQMSD CIRLFKEGWFETDIGEDFDPKLSKM++
Sbjct: 241 SGLKEWVTAVVKLEVNEDGGADVHFEAFQMSDTCIRLFKEGWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLST+F IENRNAPVTMKALKNHL+RSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTSFAIENRNAPVTMKALKNHLERSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTA+PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. NCBI nr
Match: XP_022958337.1 (NPL4-like protein 1 [Cucurbita moschata] >XP_023534714.1 NPL4-like protein 1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 763.1 bits (1969), Expect = 5.0e-217
Identity = 381/411 (92.70%), Postives = 395/411 (96.11%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV++PHITIAQLKAIIQSQL IPIHNQTLSTNQNILLA  H D SK
Sbjct: 1   MMLRIRSRDGLERVAVESPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLANAHGDRSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           F DMSNPNT LSSLNLSHGSIV+LAYEGERTVAGP+ HPAGSFGRKMTMDDLIA+QMRIT
Sbjct: 61  FMDMSNPNTLLSSLNLSHGSIVYLAYEGERTVAGPSIHPAGSFGRKMTMDDLIARQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGFMYGTVSTEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTIS+++KDYTLSNREVLQAA+FHSE
Sbjct: 181 TEDNLLFFRDQDEERLVEAIAVGLGMRKVGFIFTQTISRERKDYTLSNREVLQAAEFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           S LKEWVTAVVKLEVNEDGGADVHFEAFQMSD+CI+LFKEGWFETDI EDFDPKLSKM++
Sbjct: 241 SGLKEWVTAVVKLEVNEDGGADVHFEAFQMSDVCIKLFKEGWFETDIQEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRN PVTMKALKNHLDRSKGL F
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLTF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLARVLDVSSDVPALAECVQTQT +PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTPVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. NCBI nr
Match: XP_022995358.1 (NPL4-like protein 1 [Cucurbita maxima])

HSP 1 Score: 759.2 bits (1959), Expect = 7.2e-216
Identity = 378/411 (91.97%), Postives = 394/411 (95.86%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV++PHITIAQLKAIIQSQL IPIHNQTLSTNQNILLA  H D SK
Sbjct: 1   MMLRIRSRDGLERVAVESPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLANAHGDRSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           F DMSNPNT LSSLNLSHGSIV+LAYEGERTVAGP+ HPAGSFGRKMTMDDL+A+QMRIT
Sbjct: 61  FMDMSNPNTLLSSLNLSHGSIVYLAYEGERTVAGPSIHPAGSFGRKMTMDDLVARQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGFMYGTVSTEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTIS+++KDYTLSNREVLQAA+FHSE
Sbjct: 181 TEDNLLFFRDQDEERLVEAIAVGLGMRKVGFIFTQTISRERKDYTLSNREVLQAAEFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           + LKEWVTAVVKLEVNEDGGADVHFEAFQMSD+CI+LFKEGWFETDI EDFDPKLSKM++
Sbjct: 241 TGLKEWVTAVVKLEVNEDGGADVHFEAFQMSDVCIKLFKEGWFETDIQEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRN PVTMKALKNHLDR KGL F
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNVPVTMKALKNHLDRLKGLTF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLARVLDVSSDVPALAECVQTQT +PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTPVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. TrEMBL
Match: tr|A0A1S3AU81|A0A1S3AU81_CUCME (NPL4-like protein 1 OS=Cucumis melo OX=3656 GN=LOC103483000 PE=4 SV=1)

HSP 1 Score: 798.9 bits (2062), Expect = 5.4e-228
Identity = 399/411 (97.08%), Postives = 405/411 (98.54%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQL IPIHNQTLS NQNILLAKT DDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSANQNILLAKTQDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPNT+LSSLNLSHGSIVFLAYEGERTVAGPT HPAGSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKM++
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRN PVTMKALKNHLDRSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLL+RVLDVSSDVPALAECVQTQTA+PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLSRVLDVSSDVPALAECVQTQTAVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. TrEMBL
Match: tr|A0A0A0KJC3|A0A0A0KJC3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G139050 PE=4 SV=1)

HSP 1 Score: 794.7 bits (2051), Expect = 1.0e-226
Identity = 398/411 (96.84%), Postives = 403/411 (98.05%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERVAV+NPHITIAQLKAIIQSQL IPIHNQTLSTNQNILLAKT DDLSK
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           FTDMSNPNT+LSSLNLSHGSIVFLAYEGERTVAGPT HPAGSFGRKMTMDDLIAKQMRIT
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TEDNLLFFRD DEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFETDIGEDFDPKLSKM++
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTRDVDNDFFLVVVKI DHQGPLSTTFPIENRN PVTMKALKNHLDRSKGLPF
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLARVLDVSSDVPALAECVQTQT +PEGYKILIESMASAA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMASAA 411

BLAST of Cla97C01G000040 vs. TrEMBL
Match: tr|A0A2P5FFU9|A0A2P5FFU9_9ROSA (Nuclear pore localization protein Npl OS=Trema orientalis OX=63057 GN=TorRG33x02_076610 PE=4 SV=1)

HSP 1 Score: 734.6 bits (1895), Expect = 1.2e-208
Identity = 357/411 (86.86%), Postives = 389/411 (94.65%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLER+ +DNPHIT++QLKA+IQSQL IP HNQTLSTNQN+LLAKTHDDLS+
Sbjct: 1   MMLRIRSRDGLERITIDNPHITVSQLKALIQSQLQIPFHNQTLSTNQNLLLAKTHDDLSR 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           F DM++PN  LSSLNLSHGSIVFLAYEGERTVAGP FHPAGSFGRKMTMDDLIAKQMR++
Sbjct: 61  FVDMADPNAPLSSLNLSHGSIVFLAYEGERTVAGPVFHPAGSFGRKMTMDDLIAKQMRVS 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVN+ LAFAVKRGG MYGTVS EGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNDALAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TE+NL FFRDPDEE+LVEAIAVGLGMR+VGFIFTQTI+QDKKDYTLSNREVLQAA+FH+E
Sbjct: 181 TEENLTFFRDPDEEKLVEAIAVGLGMRRVGFIFTQTITQDKKDYTLSNREVLQAAEFHAE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           S LKEWVTAVVKLEVNEDG ADVHFEAFQMSDMC+RLFKEGWF T+IGE+ DPKLSKM++
Sbjct: 241 SGLKEWVTAVVKLEVNEDGAADVHFEAFQMSDMCVRLFKEGWFVTEIGEEDDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTR+VDNDFFLVVVKI DHQGPLS+TFPIENRN  VTM+ALK+HLDRSK LPF
Sbjct: 301 DVVVGVKDTREVDNDFFLVVVKIADHQGPLSSTFPIENRNTQVTMRALKSHLDRSKNLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLAR LD+S+DVPALAECV TQ+ IPEGYK+LIES+ASAA
Sbjct: 361 VKRISDFHLLLLLARFLDLSADVPALAECVHTQSPIPEGYKLLIESLASAA 411

BLAST of Cla97C01G000040 vs. TrEMBL
Match: tr|A0A2P5C807|A0A2P5C807_PARAD (Nuclear pore localization protein Npl OS=Parasponia andersonii OX=3476 GN=PanWU01x14_176240 PE=4 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 1.2e-206
Identity = 353/411 (85.89%), Postives = 387/411 (94.16%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLER+ +DNPHIT++QLK +IQSQL IP HNQTLSTNQN+LLAKTHDDLS+
Sbjct: 1   MMLRIRSRDGLERITIDNPHITVSQLKVLIQSQLQIPFHNQTLSTNQNLLLAKTHDDLSR 60

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRIT 120
           F DM++PNT LSSLNLSHGSIVFLAYEGERTVAGP FHPAGSFGRKMTMDDLIAKQMR++
Sbjct: 61  FVDMADPNTPLSSLNLSHGSIVFLAYEGERTVAGPVFHPAGSFGRKMTMDDLIAKQMRVS 120

Query: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
           RQENPHCELVSFDRDCANAFQHYVN+ LAFAVKRGG MYGTVS EGKVEVDFIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNDALAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQG 180

Query: 181 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
           TE+NL FFRDPDEE+ VEAIAVGL MR+VGFIFTQTI+QDKKDYTLSNREVLQAA+FH+E
Sbjct: 181 TEENLTFFRDPDEEKFVEAIAVGLEMRRVGFIFTQTITQDKKDYTLSNREVLQAAEFHAE 240

Query: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 300
           S LKEWVTAVVKLEVNEDG ADVHFEAFQMSD+C+RLFKEGWF T+IGE+ DPKLSKM++
Sbjct: 241 SGLKEWVTAVVKLEVNEDGAADVHFEAFQMSDICVRLFKEGWFVTEIGEEDDPKLSKMKK 300

Query: 301 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 360
           DVVVGVKDTR+VDNDFFLVVVKI DHQGPLS+TFPIENRN  VTM+ALK+HLDRSK LPF
Sbjct: 301 DVVVGVKDTREVDNDFFLVVVKIADHQGPLSSTFPIENRNIQVTMRALKSHLDRSKNLPF 360

Query: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLLLLAR LD+S+DVPALAECV TQ+ +PEGYK+LIES+ASAA
Sbjct: 361 VKRISDFHLLLLLARFLDLSADVPALAECVHTQSPVPEGYKLLIESLASAA 411

BLAST of Cla97C01G000040 vs. TrEMBL
Match: tr|M5VVX8|M5VVX8_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G251000 PE=4 SV=1)

HSP 1 Score: 719.2 bits (1855), Expect = 5.4e-204
Identity = 347/412 (84.22%), Postives = 390/412 (94.66%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNP-HITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           MMLR+RSRDGLERV VDNP H T++QLKA+IQ+QL IP  NQT+STNQN+LLAKTHDD+S
Sbjct: 1   MMLRVRSRDGLERVTVDNPQHTTVSQLKALIQTQLRIPFQNQTISTNQNLLLAKTHDDIS 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGPTFHPAGSFGRKMTMDDLIAKQMRI 120
           +FTDM+NPNT LS+LNLSHGSIV+LAY+GERTVAGPTFHPAGSFGRKMTMDDLIAKQM++
Sbjct: 61  RFTDMANPNTPLSALNLSHGSIVYLAYDGERTVAGPTFHPAGSFGRKMTMDDLIAKQMKV 120

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQENPH ELVSFDRDCANAFQHYVN+TLAFAVKRGG MYGTVS EGKVEVDFIYEPPQQ
Sbjct: 121 TRQENPHSELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+FFRDPDEE+ VEAIA+GLGMR+VGFIFTQT+SQDKKDYTLSNREVLQA++FH+
Sbjct: 181 GTEANLVFFRDPDEEKSVEAIAMGLGMRRVGFIFTQTVSQDKKDYTLSNREVLQASEFHA 240

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ES LKEWVTA+VKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFET+I E  DPKLSKM+
Sbjct: 241 ESGLKEWVTAMVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETEIEEGHDPKLSKMK 300

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           +DVVVGVKDTR+VDNDFFLVVVKIFDHQGPLS++FPIENRN PVT+KALKNHLDR+K LP
Sbjct: 301 KDVVVGVKDTREVDNDFFLVVVKIFDHQGPLSSSFPIENRNTPVTLKALKNHLDRAKSLP 360

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           FVKRISDFHL+LLLAR LDV++D+PALA CV T++ IPEGY++LIESMA+A+
Sbjct: 361 FVKRISDFHLMLLLARFLDVAADIPALAVCVHTESPIPEGYQLLIESMANAS 412

BLAST of Cla97C01G000040 vs. Swiss-Prot
Match: sp|Q9LYC2|NPL41_ARATH (NPL4-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At3g63000 PE=1 SV=1)

HSP 1 Score: 634.8 bits (1636), Expect = 6.7e-181
Identity = 302/411 (73.48%), Postives = 363/411 (88.32%), Query Frame = 0

Query: 2   MLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSKF 61
           MLR+RSRDGLERV+VD PHIT++QLK +IQ QL IPIHNQTLSTN+N+LLAK+  D   F
Sbjct: 3   MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 62

Query: 62  TDMSNPNTHLSSLNLSHGSIVFLAYEGERTV-AGPTFHPAGSFGRKMTMDDLIAKQMRIT 121
           TDM++PN  +SSLNL+HGS+V+LAYEGERT+  GP   PAGSFGRKMT++DLIA+QMR+ 
Sbjct: 63  TDMADPNLRISSLNLAHGSMVYLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122

Query: 122 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 181
           RQE  HC+ VSFDRDCANAFQH+VNE+LAFAVKRGG MYG VS +G+VEV+FIYEPPQQG
Sbjct: 123 RQEKAHCDSVSFDRDCANAFQHFVNESLAFAVKRGGFMYGNVSEDGQVEVNFIYEPPQQG 182

Query: 182 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 241
            EDNL+  RD +EE+ V+AIA+GLGMR+VGFIF QT++QDKK+YTLSN EVL AAQ H+E
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242

Query: 242 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 301
           SELKEWVTAVVKLE+NEDGGADVHFE FQMSDMC+RLFKEGWFET+IG + DPKLSK+++
Sbjct: 243 SELKEWVTAVVKLEINEDGGADVHFEPFQMSDMCVRLFKEGWFETEIGPEDDPKLSKLKK 302

Query: 302 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 361
           +VVVGVKD ++VDNDFFLV+VKI DHQGPLS TFPIENRN   TM+ALK H++R++ LPF
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQGPLSCTFPIENRNTQTTMRALKTHMERARSLPF 362

Query: 362 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY++LI+SMA+ +
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANTS 413

BLAST of Cla97C01G000040 vs. Swiss-Prot
Match: sp|O82264|NPL42_ARATH (NPL4-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=At2g47970 PE=1 SV=1)

HSP 1 Score: 620.9 bits (1600), Expect = 9.9e-177
Identity = 302/410 (73.66%), Postives = 357/410 (87.07%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERV  +  HIT++QLK +I  QL IP+H QTLSTN+++LLAKT  DL  
Sbjct: 2   MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAG-PTFHPAGSFGRKMTMDDLIAKQMRI 120
           FTD+++PN  LSSLNL HGS+++LAY+GER++ G P   PAGSFGRKMT+DDLIA+QMR+
Sbjct: 62  FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQE  HC+ VSFDRD ANAFQHYVNE+LAFAVKRGG MYGTV+ EG+VEVDFIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+  RD DEE+ V+AIA+GLGMR+VGFIF QT+ QDK +YTLSN EVLQAA+ H+
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFET+I  D DPKLSKM+
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           ++VVVGVKD ++VDNDFFLV+V+I DHQGPLS+TFPIENR++  TM+ALK HLDR+K LP
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRALKTHLDRAKSLP 361

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMAS 410
            VK++SDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY +LIESMA+
Sbjct: 362 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMAN 411

BLAST of Cla97C01G000040 vs. Swiss-Prot
Match: sp|Q9AS33|NPL4_ORYSJ (NPL4-like protein OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g0377700 PE=2 SV=1)

HSP 1 Score: 459.5 bits (1181), Expect = 3.8e-128
Identity = 239/419 (57.04%), Postives = 306/419 (73.03%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNP-HITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLS 60
           M+LRIRSRDG +R+ V +P   T+  L+ +I +++ +P+  Q LS +  +LL  +    +
Sbjct: 1   MILRIRSRDGTDRITVPDPAAATVGDLQRLIAARVTVPVPLQRLSLDPALLLPSS----A 60

Query: 61  KFTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAGP----TFHPAGSFGRKMTMDDLIAK 120
               +++P   LSSL LS+GS V+L+Y  +   + P        AGSFG+KMTMDDLIA+
Sbjct: 61  SAALLADPAAPLSSLRLSNGSFVYLSYPPDARSSQPPPPKALSAAGSFGKKMTMDDLIAR 120

Query: 121 QMRITRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGK-VEVDFIY 180
           Q+R+TRQE P C   SFDRD ANAFQ +V E+LAFA KR G +YG V  + K V VDFIY
Sbjct: 121 QIRVTRQEAPLCAAASFDRDSANAFQLHVAESLAFATKRAGFLYGRVDADTKEVFVDFIY 180

Query: 181 EPPQQGTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTI---SQDKKDYTLSNREV 240
           EPPQ GTED +   RD  EE  V+AIA GLGMR+VG +FTQ +   + D  +YT+SNREV
Sbjct: 181 EPPQVGTEDVVQLMRDAQEEARVDAIAHGLGMRRVGLVFTQAVGRKTSDTGEYTMSNREV 240

Query: 241 LQAAQFHSESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDF 300
           LQA +  +E  + EWVTA+VKLEV +DG  DVHFEAFQMS++C++LFK+G  ET+IG+  
Sbjct: 241 LQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLETEIGDKD 300

Query: 301 DPKLSKMRRDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNH 360
           DP+LSKMR++VV G KDT +VDNDFFLV VKI DHQGPLST FPIENR  PV M ALK+H
Sbjct: 301 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPIENRGNPVAMSALKSH 360

Query: 361 LDRSKGLPFVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASA 411
           LDR+K LPFVKRISDFHLLLL+A  LD+ +DVPAL  CV+ Q+ +PEGY++LIES+A A
Sbjct: 361 LDRAKHLPFVKRISDFHLLLLVAAFLDIKADVPALTACVKNQSVVPEGYQLLIESLAGA 415

BLAST of Cla97C01G000040 vs. Swiss-Prot
Match: sp|Q54GD3|NPL4_DICDI (Nuclear protein localization protein 4 homolog OS=Dictyostelium discoideum OX=44689 GN=nploc4 PE=3 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 1.8e-29
Identity = 92/285 (32.28%), Postives = 153/285 (53.68%), Query Frame = 0

Query: 116 QMRITRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYE 175
           +MR+  Q+NPH      D   AN FQ Y+  +  +  +R G ++G    +G V VD IYE
Sbjct: 282 KMRLKSQDNPHAPGALVDFQSANIFQQYIANS-KYEQQRIGFLFGNFLSDGSVVVDSIYE 341

Query: 176 PPQQGTEDNL-LFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQA 235
           PPQ+  +        DP  ++ +E++A  LG+ +VG+IF    S   + YT+S+ E++QA
Sbjct: 342 PPQECKDKQTPTLLPDPLADK-IESMASMLGLTRVGWIF----SHPSRKYTMSSTEIIQA 401

Query: 236 AQFHSESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPK 295
           A + ++     +VT +  L VN DG +++  EAFQ+SD  ++L K G F   +    DP 
Sbjct: 402 ASYQNKYG-PSFVTLI--LSVNSDGQSNM--EAFQVSDQALKLEKTGEF---LPTQPDPT 461

Query: 296 LSKMRRDVVVGVKDTRDVDNDFFLVVV--KIFDHQGPLSTTFPIENRNAPVTMKALKNHL 355
             K++  V     +T + D  FF+V V  K  + +   + +FP+ENR    T+  L ++ 
Sbjct: 462 KCKLKSPVFEEGTETINADTHFFIVTVPLKAREDKSIFNISFPVENRIPVNTLSDLASYK 521

Query: 356 DRSKGLPFVKRISDFHLLLLLA--RVLDVSSDVPALAECVQTQTA 396
              K +  +K  SDFH L+ L   + LD  SD P + E ++++++
Sbjct: 522 LEHKDVSPLKFFSDFHFLIFLLENQFLDFQSDFPIICENIRSRSS 552

BLAST of Cla97C01G000040 vs. Swiss-Prot
Match: sp|P60670|NPL4_MOUSE (Nuclear protein localization protein 4 homolog OS=Mus musculus OX=10090 GN=Nploc4 PE=1 SV=3)

HSP 1 Score: 97.1 bits (240), Expect = 5.0e-19
Identity = 92/334 (27.54%), Postives = 151/334 (45.21%), Query Frame = 0

Query: 119 ITRQENPHCELVSFD-RDCANAFQHYVNETLAFAVKRGGMMYGTVS-----PEG-KVEVD 178
           + RQ+  H + + F+    A+ F  +  +T     +  G +YG  +     P G + EV 
Sbjct: 215 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 274

Query: 179 FIYEPPQQGTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQD----------- 238
            IYEPPQ GT+++L    DP  E +V+ IA  LG+RKVG+IFT  +S+D           
Sbjct: 275 AIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 334

Query: 239 KKDYTLSNREVLQAAQFHSESEL-----------KEWVTAVVKLEVNEDGGAD--VHFEA 298
           K  Y LS+ E + A  F ++               ++VTAV        GG D  VHFE 
Sbjct: 335 KDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVA------TGGPDNQVHFEG 394

Query: 299 FQMSDMCIRLFKE------------GWFETDIGEDFDP-----KLSKMRRDVVVGVKDTR 358
           +Q+S+ C+ L ++            G+ +    E + P      + K   ++    +  R
Sbjct: 395 YQVSNQCMALVRDECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEI---TQLAR 454

Query: 359 DVDNDFFLVVVKIFDHQGPLST------TFPIENRNA---PVTMKALKNHLDRSKGLPFV 394
            +  ++ ++ +     + P+ T       FPIENR+         +L  +L ++    F+
Sbjct: 455 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQDFHSLATYLSQNTSSVFL 514

BLAST of Cla97C01G000040 vs. TAIR10
Match: AT3G63000.1 (NPL4-like protein 1)

HSP 1 Score: 634.8 bits (1636), Expect = 3.7e-182
Identity = 302/411 (73.48%), Postives = 363/411 (88.32%), Query Frame = 0

Query: 2   MLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSKF 61
           MLR+RSRDGLERV+VD PHIT++QLK +IQ QL IPIHNQTLSTN+N+LLAK+  D   F
Sbjct: 3   MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 62

Query: 62  TDMSNPNTHLSSLNLSHGSIVFLAYEGERTV-AGPTFHPAGSFGRKMTMDDLIAKQMRIT 121
           TDM++PN  +SSLNL+HGS+V+LAYEGERT+  GP   PAGSFGRKMT++DLIA+QMR+ 
Sbjct: 63  TDMADPNLRISSLNLAHGSMVYLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122

Query: 122 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 181
           RQE  HC+ VSFDRDCANAFQH+VNE+LAFAVKRGG MYG VS +G+VEV+FIYEPPQQG
Sbjct: 123 RQEKAHCDSVSFDRDCANAFQHFVNESLAFAVKRGGFMYGNVSEDGQVEVNFIYEPPQQG 182

Query: 182 TEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 241
            EDNL+  RD +EE+ V+AIA+GLGMR+VGFIF QT++QDKK+YTLSN EVL AAQ H+E
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242

Query: 242 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMRR 301
           SELKEWVTAVVKLE+NEDGGADVHFE FQMSDMC+RLFKEGWFET+IG + DPKLSK+++
Sbjct: 243 SELKEWVTAVVKLEINEDGGADVHFEPFQMSDMCVRLFKEGWFETEIGPEDDPKLSKLKK 302

Query: 302 DVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLPF 361
           +VVVGVKD ++VDNDFFLV+VKI DHQGPLS TFPIENRN   TM+ALK H++R++ LPF
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQGPLSCTFPIENRNTQTTMRALKTHMERARSLPF 362

Query: 362 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMASAA 412
           VKRISDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY++LI+SMA+ +
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANTS 413

BLAST of Cla97C01G000040 vs. TAIR10
Match: AT2G47970.1 (Nuclear pore localisation protein NPL4)

HSP 1 Score: 620.9 bits (1600), Expect = 5.5e-178
Identity = 302/410 (73.66%), Postives = 357/410 (87.07%), Query Frame = 0

Query: 1   MMLRIRSRDGLERVAVDNPHITIAQLKAIIQSQLNIPIHNQTLSTNQNILLAKTHDDLSK 60
           MMLRIRSRDGLERV  +  HIT++QLK +I  QL IP+H QTLSTN+++LLAKT  DL  
Sbjct: 2   MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61

Query: 61  FTDMSNPNTHLSSLNLSHGSIVFLAYEGERTVAG-PTFHPAGSFGRKMTMDDLIAKQMRI 120
           FTD+++PN  LSSLNL HGS+++LAY+GER++ G P   PAGSFGRKMT+DDLIA+QMR+
Sbjct: 62  FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121

Query: 121 TRQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQ 180
           TRQE  HC+ VSFDRD ANAFQHYVNE+LAFAVKRGG MYGTV+ EG+VEVDFIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181

Query: 181 GTEDNLLFFRDPDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHS 240
           GTE NL+  RD DEE+ V+AIA+GLGMR+VGFIF QT+ QDK +YTLSN EVLQAA+ H+
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241

Query: 241 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETDIGEDFDPKLSKMR 300
           ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKE WFET+I  D DPKLSKM+
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301

Query: 301 RDVVVGVKDTRDVDNDFFLVVVKIFDHQGPLSTTFPIENRNAPVTMKALKNHLDRSKGLP 360
           ++VVVGVKD ++VDNDFFLV+V+I DHQGPLS+TFPIENR++  TM+ALK HLDR+K LP
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRALKTHLDRAKSLP 361

Query: 361 FVKRISDFHLLLLLARVLDVSSDVPALAECVQTQTAIPEGYKILIESMAS 410
            VK++SDFHLLL +A+ LDVSSDVPALAECV+ Q+ +PEGY +LIESMA+
Sbjct: 362 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMAN 411

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008437660.18.1e-22897.08PREDICTED: NPL4-like protein 1 [Cucumis melo][more]
XP_004145952.11.5e-22696.84PREDICTED: NPL4-like protein 1 isoform X1 [Cucumis sativus] >KGN49825.1 hypothet... [more]
XP_022155640.15.1e-22294.16NPL4-like protein 1 [Momordica charantia][more]
XP_022958337.15.0e-21792.70NPL4-like protein 1 [Cucurbita moschata] >XP_023534714.1 NPL4-like protein 1 [Cu... [more]
XP_022995358.17.2e-21691.97NPL4-like protein 1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3AU81|A0A1S3AU81_CUCME5.4e-22897.08NPL4-like protein 1 OS=Cucumis melo OX=3656 GN=LOC103483000 PE=4 SV=1[more]
tr|A0A0A0KJC3|A0A0A0KJC3_CUCSA1.0e-22696.84Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G139050 PE=4 SV=1[more]
tr|A0A2P5FFU9|A0A2P5FFU9_9ROSA1.2e-20886.86Nuclear pore localization protein Npl OS=Trema orientalis OX=63057 GN=TorRG33x02... [more]
tr|A0A2P5C807|A0A2P5C807_PARAD1.2e-20685.89Nuclear pore localization protein Npl OS=Parasponia andersonii OX=3476 GN=PanWU0... [more]
tr|M5VVX8|M5VVX8_PRUPE5.4e-20484.22Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G251000 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
sp|Q9LYC2|NPL41_ARATH6.7e-18173.48NPL4-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At3g63000 PE=1 SV=1[more]
sp|O82264|NPL42_ARATH9.9e-17773.66NPL4-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=At2g47970 PE=1 SV=1[more]
sp|Q9AS33|NPL4_ORYSJ3.8e-12857.04NPL4-like protein OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g0377700 PE=2 ... [more]
sp|Q54GD3|NPL4_DICDI1.8e-2932.28Nuclear protein localization protein 4 homolog OS=Dictyostelium discoideum OX=44... [more]
sp|P60670|NPL4_MOUSE5.0e-1927.54Nuclear protein localization protein 4 homolog OS=Mus musculus OX=10090 GN=Nploc... [more]
Match NameE-valueIdentityDescription
AT3G63000.13.7e-18273.48NPL4-like protein 1[more]
AT2G47970.15.5e-17873.66Nuclear pore localisation protein NPL4[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR029071Ubiquitin-like_domsf
IPR037518MPN
IPR007717NPL4_C
IPR024682Npl4_Ub-like_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006511 ubiquitin-dependent protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function
molecular_function GO:0043130 ubiquitin binding
molecular_function GO:0031625 ubiquitin protein ligase binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G000040.1Cla97C01G000040.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024682Nuclear pore localisation protein Npl4, ubiquitin-like domainPFAMPF11543UN_NPL4coord: 1..85
e-value: 1.6E-5
score: 25.2
IPR007717Nuclear pore localisation protein NPL4, C-terminalPFAMPF05021NPL4coord: 167..280
e-value: 3.0E-12
score: 46.4
IPR007717Nuclear pore localisation protein NPL4, C-terminalCDDcd08061MPN_NPL4coord: 118..368
e-value: 6.76959E-72
score: 228.397
NoneNo IPR availableGENE3DG3DSA:3.10.20.90coord: 1..99
e-value: 1.9E-36
score: 126.2
NoneNo IPR availablePANTHERPTHR12710NUCLEAR PROTEIN LOCALIZATION 4coord: 111..410
coord: 1..93
NoneNo IPR availableCDDcd01769UBLcoord: 4..86
e-value: 0.00685492
score: 33.3946
IPR037518MPN domainPROSITEPS50249MPNcoord: 129..270
score: 17.545
IPR029071Ubiquitin-like domain superfamilySUPERFAMILYSSF54236Ubiquitin-likecoord: 1..88

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C01G000040Silver-seed gourdcarwmbB0529
Cla97C01G000040Cucurbita maxima (Rimu)cmawmbB284
Cla97C01G000040Cucurbita maxima (Rimu)cmawmbB604
Cla97C01G000040Cucurbita moschata (Rifu)cmowmbB266
Cla97C01G000040Cucurbita moschata (Rifu)cmowmbB580