Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTTTTATACCTTTCGACCATCGTTGTCAGATTCGAAGCTAGAGGCGAGCAGACAAGAGATGAGGCTGAGGAAGACAAGATTGAAGATCAGAGAAGAAGACACAACGTTTCTTGTTATTGTCATGAGTAGAAACATAAATGGCAAGCTTAAGATCAGGGAATTTGAAGAAAGTGAATAGGAACAACGGCCTATTATATACAAGCTTGAAGAAAGTCATATCTTCAACCATTGAACATTCTGCGAATCATCTTCCAAAAATCTATATCCATTACAAAAATGATATGTCTCAATCATAAACATAAAATATTTAGAGTCTAACGGCTGACCCTTTCAAGCCTTTTTGCTTCAATCAATCATACGTGGAACAACTTTAGTTTCGGCTTAAAAATGTTTATATTTCATGTTTCTGGTTTCTATTGTTTTTTAAGATCGATCTACAAACCGGAATATGTTTGATAATCATTTATGTTTACTATTTCTTGTTTTTTTAGTTGAAATAAAATTTAGTAGATATATATACAATGAAAATGAAAAAAGAACCTAATTAATATGATTTAAAACAAACCCAATACAATTATAAAAAGTTTATAATATGTAATACAATTCAAGTCCCAAAGAAAAATTGAAAATAACAATAAAAAGATGCATGTATTATTCGTTGGCCAATAATAATATTTGAAGTATTAATTTTTTTCATCAAAATGTTGTAAAAAAATTAATTGAGACATACTATTAAATAAGAATTTTTACGGTTTTGTTGTTTCTCAAATTTTATTTTATTTTTGAAAAACGTTTTCCAAAAATTAGAAACAAAAATGAGATGGCAGAAGTTTAATTTGTTTCTTACAAAATTAGAAACAAAAAAAAAAACCTAGAAATGAGAATATTATCAAATGCACCTTCAAATCTCTAGACTTAGAGAGAGAAAAGCTTGACAAATTAAAATAATAATAAAATAATGAGCATGAAACTTGACTAATTTAACAATTCCTTTCACACTACAAGTTTTGTTTTTATATATATTTTTTGTTTTGTTTTGTTTTGTTTTTTTTCCTTTTTCTTTAACTTTCACACTAAAAGTTGAAAATATGTTGATATAATCTAAAATATGTGGAATATTCTCACATAGAACTATAATCTAAAATATATTGAATGATCTCACATAGAACTAAATCCTAGACCACCCACTCAAATGCTACTTATACATTGAATTCACCCGTCTATCAACAAAATTCAACCCACCCATAAAAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAAAAAACCCACCCATAAAAAGTTTATTCTTTCATATGAGATGTCTAAAAGTCATGTTTTTGCGTCCAAATATATTATTAAAATTAAATAGTAATTTTTTTTTTTGAAAAGAAATTAAATAGTAATTTAAATGATATATTCATTTTATTTATAATCTATCCTATTATCTATTTATATCTATACTCTATTAAAAGATAATCATGATTTCTAAAAAAAACATTTTTAGACTAGAATACCCCTTTAAATTTTAGATAATTACATAATTAGTTTTGAAATTTCATAATCAAAATTCGTAATTTAATAAAATTCAAAAGAATATTTGATTTAAGTGTAAATTCCGTCCATAAATATTTCATTCTTCAACTGATTTTCACATCTACAACTATTTGTTTCCTCCCATTATTCGTTTTAGGTATCTCCATTAAGTTATTATTATAATAATAATGTTATACCCTTTAAATTTCAGATAATTACATAATTAGTTTTAAAGTTACATAATCCAAATTCGTAATTTAAAAAAAATTAAAATAATATGTGATTTTATCCTTTAGTTTTCAAAATCCATAATCAAAATTCGCAGTTTAATAAAATTCAAAATAATATTTGATTTAAGTCAAAATTTCGTCTATAAATATTTCATTTTCTCCAATAAATTTTCACATTTACAACTATTTGGTTTTGTCAAGTTTAATTTTTTTTTTTTTTCTTTCATAGTCCT
GTCTATTGTACAAGTGGTCTCTCCTATGTGAGAGTGGGTTTTTAGGGTTTTACATAGGGGTTCACAGCGGCGGCGTGGTCTGCTTGACGGTGAGGGCTTCTTGCCGGCGAGGTCTCTTTGAGGTGTGGGTCTTCTGCAGCGGTTTTCTTCACGGGGTTGAGGTATTCTCCGACGGTTTCAGAGTCTCGAAGGGGTCGGCGGCTTTCGATCTCCTTCACGGATTTTGGGTTGTCGTCAGCGGCGGCGGAGTTTCATGGGTTCGACGGTTGATCTCCGGCGTTGGGCGGTGTCGTCTCCGGTTCCATTCGGGGTCGAGGTTGACTTCGGTTGGTAATTTTTTCTTCTTTCTTCCGGCTGTGAGGTTTGGGTGTTTCGACTGTTTTGGTCCTCGGTGGTTTTGGCGTTTGGTTTGATTTCGGATCTTCCCTTGTCGTGTTGGGGTTGGTTTGTAGGCTTTGTGTGCTTTGCGATTTTGGTCTTGGGGCTGGTCTCAGTCTGTAGCTGTCGGTTTCTTGCGTGTTTGTTCTGTTTTAGGTGGTCGCGGGTTTAGTTCTGTTTGTGTTTTATTTTGCTTGCAAGTTTCGGTTGTATTTTGGTGTTTGGTTGCTGGTTTGGGTCTCCACAGGTTGAGGTTGTCTAAGGATTGCGGGGTTTTGGTGTCTCTTGAAGCGATTTGAGAAGTGGTTTCCTTTGGTTGGTGTTGGTTTGGTGCAGCTCTGCGTTTCTGGTCCTAATTGGTTCCTGCGATTTGGGCTGGGTTGTGGTATTGTGTAGGCTGCGGTGGTGTGGGTTTGGGGTTGGTGGGTCGTGTAGTGGTTTGGGCGATTGCCGTTAGAGTTCCCTCGGTGCACGTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGTTATCAGAGGCAGAGTCGACGACGTTCTCTGTGCCAGCAGATATTCCTCTCTTGGATGAATCAACAGTTCAACTGTGTGTAGTTGGTAAGGTTCTTTCTTCCAAACCGGTGAATCCTGATGCCTTTCGTCGAGTAATGTTATCGGTTTGGAGTGTTCATCGTTCTACCCGGATTGAACCTTGGGGGGATAATGTCTTCGTAATTCGGTTTCATTCCGTCACTGAAAAGCGGAGAATTATGAGTTTGGGCCCTTGGACCTTCGATAAGTCTTTACTGGTTCTGGTGTCTCCTCGAGGGTCGGATGACCCATCTCTTCTGGACTTTTCTCGTTGTGAGTTTTGGGTTCATATCACGAAAGTTCCGTTGAACTATCATACAGCGGCTATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCACGTGGTTGAGGTGCCAGGGGAAGGCCACAGTGACTGGCTTGGCACAGTAATGAGAGTTCGTATTGTTCTTAACATGGCTCAACCCCTTCGTCGTGTTGTCCGGCTTGTAAAAGGGAATGGGTCCGTTCTTTGGTGTCCGTTGCAGTATGAACGATTGCCGGATTTCTGTTTTAGGTGTGGGCGTATTGGCCATTCACATAGGGAGTGCTCGGAAGAGGGGGAAGGCGTGGGTGCTGACAATCAGTTTCTGTTTGGTGACTGGTTGCGGGCTGTTCCATTTCGGCATGGGGTTGCTAATGCGACAGAAGAGGGTGGTGGGCGTCCGGATATTCAGGGGGGCGGGGATCAGGTGTGGCTGGTAGAGGGAGAGGGCGGGGCCAGGTGGATCAGTCGGAATGGGTGGGGGTAGGAAGGTCTGAGGTGTCTATGCCAGCTGACCGGGTGGTAGATCTGGTTGATTCAGGAGTTGTTCTTGAGGGGACTACGGCTTCCCCGGTTCCTTCGGGTACTCCCCCGTCAGTCGATCCTGATATTGGGGTGGCTTCTGCGGACAAGGGTAAGGAGGTGGCCGATCCGGGTGTTGCTCCAGAAGCTAGCTCTAAGGTGGGTACGGTGCCTTTGGCTCCGTTAGTGGCAACACATACAGTCTCTTCTGGGGCAGGGTCGGTTTCTGCAGGTAAAGGCAAGGCTGTGGC
TAATGAAAATTCTGAGATTACTATGACTGATGTGCATGATGGTCCGGTGAAGAAGAGTTGGAAGCGGCTGGCCAGAAGCTCTTTGAAGGACATTTCCAATGTCTTATCTTCCTCAGTTGTTAGTGGGCACAAGCGATCAGCCCAGGGGGACCCGCCTGATGAGGATGGGTTAGTCTCCAAGCGACTGAAGGAGGTGGAGTCTGGTGTGGATCTGTGTGTGACTGATGGAATGGTTGTGGCGGTGGCTGGGTCCCAACCCCGCCCGGGATTATGAGTTTGATGTTTTGGAATGTTCGGGGTTTAGGGTCTCCCCGGGCTTTCCAGCGCTTGGCCAAGGTGGTTCAAGAGAAAAGACCCCTGGTGCTCTTCCTGTCTGAAACAAAGCTGTCGTCAAACAGGATGGCATCAGCGAAGCGAGTTCTGGGTTTCGAGTACTGTTTTTGTGTTGATAGCAAAGGTAGGAGTGGTGGTTTGGCTCTGTTGTGGAGTTCGTCTGTCTCCTTCAGCCTCTTGTCATTTTCGAATAACCACATTGATGGGTGGATCTCGTGGGACGTTTATCATTGGCGACTCACGGGTTTCTATGGTTTCCCTGCCGCCGATAAGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGGGGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTACCAGCATGAGAAGGAAGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAATGTGATTGACTCATGTGGGCTTCTTGATTTGGGATTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTAGCTCAGTTGCTTGGCATGATATCTACCCCAACTGTGTAGTTAACCATCTTGATTATCACCAGTCCGATCACCGACCGATTGAGCTGGTTCTCTCTCCGCAGCCTGGTTGTTGGAGACGCTCGAGCCAGCGAATCTTACGGTTTGATGAGACTTGGCTGAAGCAAGCAGAGCTGCAGCAGCTGGTCAGGGACTCATGGGGGTCGAGTGGGGAGGGTCCTGGTTTGTCAGCTCCCGAAAGGTTGGCTCAAGTTTCCAGAAGGTGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAAAATGGGGAACTTCCCTCAGAAGGTACAGCTGGCCATTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAGGCCCAGTTGGAAGATGTGTTACAGGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAGGTGTGGTTGAAGGAAGGGGATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCAAAGGCTCAATCGTATTGGGGGCCTCATGGACGATCAGGGGGAATGGCGCCAGGACAGAGCTATGGTTCTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCCGGGACCTTCAGCGATCTGTGGATAGTGAAATGAATGTGGATCTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGATAGCGAGCATAGCTAG
mRNA sequence
ATGTTTTATACCTTTCGACCATCGTTGTCAGATTCGAAGCTAGAGGCGAGCAGACAAGAGATGAGGCTGAGGAAGACAAGATTGAAGATCAGAGAAGAAGACACAACGTTTCTTGTTATTGTCATGAGTAGAAACATAAATGGCAAGCTTAAGATCAGGGAATTTGAAGAAAGGGTTCACAGCGGCGGCGTGGTCTGCTTGACGGTGAGGGCTTCTTGCCGGCGAGGTCTCTTTGAGGTGTGGGTCTTCTGCAGCGGTTTTCTTCACGGGGTTGAGGTATTCTCCGACGGTTTCAGAGTCTCGAAGGGGTCGGCGGCTTTCGATCTCCTTCACGGATTTTGGGTTGTCGTCAGCGGCGGCGGAGTTTCATGGGTTCGACGGTTGATCTCCGGCGTTGGGCGGTGTCGTCTCCGGTTCCATTCGGGGTCGAGGTTGACTTCGGCTTTGTGTGCTTTGCGATTTTGGTCTTGGGGCTGGTCTCAGTCTGTAGCTGTCGGTTTCTTGCGTGTTTGTTCTGTTTTAGGTGGTCGCGGGTTTAGTTCTGTTTGTGTTTTATTTTGCTTGCAAGTTTCGGTTGTATTTTGGTGTTTGGTTGCTGGTTTGGGTCTCCACAGCTCTGCGTTTCTGGTCCTAATTGGTTCCTGCGATTTGGGCTGGGTTGTGAGTTCCCTCGGTGCACGTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGTTATCAGAGGCAGAGTCGACGACGTTCTCTGTGCCAGCAGATATTCCTCTCTTGGATGAATCAACAGTTCAACTGTGTGTAGTTGGTAAGGTTCTTTCTTCCAAACCGGTGAATCCTGATGCCTTTCGTCGAGTAATGTTATCGGTTTGGAGTGTTCATCGTTCTACCCGGATTGAACCTTGGGGGGATAATGTCTTCGTAATTCGGTTTCATTCCGTCACTGAAAAGCGGAGAATTATGAGTTTGGGCCCTTGGACCTTCGATAAGTCTTTACTGGTTCTGGTGTCTCCTCGAGGGTCGGATGACCCATCTCTTCTGGACTTTTCTCGTTGTGAGTTTTGGGTTCATATCACGAAAGTTCCGTTGAACTATCATACAGCGGCTATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCACGTGGTTGAGGTGCCAGGGGAAGGCCACAGTGACTGGCTTGGCACAGTAATGAGAGTTCGTATTGTTCTTAACATGGCTCAACCCCTTCGTCGTGTTGTCCGGCTTGTAAAAGGGAATGGGTCCGTTCTTTGGTGTCCGTTGCAGTATGAACGATTGCCGGATTTCTGTTTTAGGTGTGGGCGTATTGGCCATTCACATAGGGAGTGCTCGGAAGAGGGGGAAGGCGTGGGTGCTGACAATCAGTTTCTGTTTGGTGACTGGTTGCGGGCTGTTCCATTTCGGCATGGGGTTGCTAATGCGACAGAAGAGGGTGGTGGGCGTCCGGATATTCAGGGGGGCGGGGATCAGGTGTCTGAGGTGTCTATGCCAGCTGACCGGGTGGTAGATCTGGTTGATTCAGGAGTTGTTCTTGAGGGGACTACGGCTTCCCCGGTTCCTTCGGGTACTCCCCCGTCAGTCGATCCTGATATTGGGGTGGCTTCTGCGGACAAGGGTAAGGAGGTGGCCGATCCGGGTGTTGCTCCAGAAGCTAGCTCTAAGGTGGGTACGGTGCCTTTGGCTCCGTTAGTGGCAACACATACAGTCTCTTCTGGGGCAGGGTCGGTTTCTGCAGGTAAAGGCAAGGCTGTGGCTAATGAAAATTCTGAGATTACTATGACTGATGTGCATGATGGTCCGGTGAAGAAGAGTTGGAAGCGGCTGGCCAGAAGCTCTTTGAAGGACATTTCCAATGTCTTATCTTCCTCAGTTGTTAGTGGGCACAAGCGATCAGCCCAGGGGGACCCGCCTGATGAGGATGGGTTAGTCTCCAAGCGACTGAAGGAGGTGGAGTCTGGGTCTCCCCGGGCTTTCCAGCGCTTGGCCAA
GGTGGTTCAAGAGAAAAGACCCCTGGTGCTCTTCCTGTCTGAAACAAAGCTGTCGTCAAACAGGATGGCATCAGCGAAGCGAGTTCTGGGTTTCGAGTACTGTTTTTGTGTTGATAGCAAAGGTAGGAGTGGTGGTTTGGCTCTGTTGTGGAGTTCGTCTGTCTCCTTCAGCCTCTTGTCATTTTCGAATAACCACATTGATGGGTGGATCTCGTGGGACGTTTATCATTGGCGACTCACGGGTTTCTATGGTTTCCCTGCCGCCGATAAGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGGGGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTACCAGCATGAGAAGGAAGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAATGTGATTGACTCATGTGGGCTTCTTGATTTGGGATTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTAGCTCAGTTGCTTGGCATGATATCTACCCCAACTGTGTAGTTAACCATCTTGATTATCACCAGTCCGATCACCGACCGATTGAGCTGGTTCTCTCTCCGCAGCCTGGTTGTTGGAGACGCTCGAGCCAGCGAATCTTACGGTTTGATGAGACTTGGCTGAAGCAAGCAGAGCTGCAGCAGCTGGTCAGGGACTCATGGGGGTCGAGTGGGGAGGGTCCTGGTTTGTCAGCTCCCGAAAGGTTGGCTCAAGTTTCCAGAAGGTGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAAAATGGGGAACTTCCCTCAGAAGGTACAGCTGGCCATTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAGGCCCAGTTGGAAGATGTGTTACAGGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAGGTGTGGTTGAAGGAAGGGGATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCAAAGGCTCAATCGTATTGGGGGCCTCATGGACGATCAGGGGGAATGGCGCCAGGACAGAGCTATGGTTCTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCCGGGACCTTCAGCGATCTGTGGATAGTGAAATGAATGTGGATCTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGATAGCGAGCATAGCTAG
Coding sequence (CDS)
ATGTTTTATACCTTTCGACCATCGTTGTCAGATTCGAAGCTAGAGGCGAGCAGACAAGAGATGAGGCTGAGGAAGACAAGATTGAAGATCAGAGAAGAAGACACAACGTTTCTTGTTATTGTCATGAGTAGAAACATAAATGGCAAGCTTAAGATCAGGGAATTTGAAGAAAGGGTTCACAGCGGCGGCGTGGTCTGCTTGACGGTGAGGGCTTCTTGCCGGCGAGGTCTCTTTGAGGTGTGGGTCTTCTGCAGCGGTTTTCTTCACGGGGTTGAGGTATTCTCCGACGGTTTCAGAGTCTCGAAGGGGTCGGCGGCTTTCGATCTCCTTCACGGATTTTGGGTTGTCGTCAGCGGCGGCGGAGTTTCATGGGTTCGACGGTTGATCTCCGGCGTTGGGCGGTGTCGTCTCCGGTTCCATTCGGGGTCGAGGTTGACTTCGGCTTTGTGTGCTTTGCGATTTTGGTCTTGGGGCTGGTCTCAGTCTGTAGCTGTCGGTTTCTTGCGTGTTTGTTCTGTTTTAGGTGGTCGCGGGTTTAGTTCTGTTTGTGTTTTATTTTGCTTGCAAGTTTCGGTTGTATTTTGGTGTTTGGTTGCTGGTTTGGGTCTCCACAGCTCTGCGTTTCTGGTCCTAATTGGTTCCTGCGATTTGGGCTGGGTTGTGAGTTCCCTCGGTGCACGTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGTTATCAGAGGCAGAGTCGACGACGTTCTCTGTGCCAGCAGATATTCCTCTCTTGGATGAATCAACAGTTCAACTGTGTGTAGTTGGTAAGGTTCTTTCTTCCAAACCGGTGAATCCTGATGCCTTTCGTCGAGTAATGTTATCGGTTTGGAGTGTTCATCGTTCTACCCGGATTGAACCTTGGGGGGATAATGTCTTCGTAATTCGGTTTCATTCCGTCACTGAAAAGCGGAGAATTATGAGTTTGGGCCCTTGGACCTTCGATAAGTCTTTACTGGTTCTGGTGTCTCCTCGAGGGTCGGATGACCCATCTCTTCTGGACTTTTCTCGTTGTGAGTTTTGGGTTCATATCACGAAAGTTCCGTTGAACTATCATACAGCGGCTATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCACGTGGTTGAGGTGCCAGGGGAAGGCCACAGTGACTGGCTTGGCACAGTAATGAGAGTTCGTATTGTTCTTAACATGGCTCAACCCCTTCGTCGTGTTGTCCGGCTTGTAAAAGGGAATGGGTCCGTTCTTTGGTGTCCGTTGCAGTATGAACGATTGCCGGATTTCTGTTTTAGGTGTGGGCGTATTGGCCATTCACATAGGGAGTGCTCGGAAGAGGGGGAAGGCGTGGGTGCTGACAATCAGTTTCTGTTTGGTGACTGGTTGCGGGCTGTTCCATTTCGGCATGGGGTTGCTAATGCGACAGAAGAGGGTGGTGGGCGTCCGGATATTCAGGGGGGCGGGGATCAGGTGTCTGAGGTGTCTATGCCAGCTGACCGGGTGGTAGATCTGGTTGATTCAGGAGTTGTTCTTGAGGGGACTACGGCTTCCCCGGTTCCTTCGGGTACTCCCCCGTCAGTCGATCCTGATATTGGGGTGGCTTCTGCGGACAAGGGTAAGGAGGTGGCCGATCCGGGTGTTGCTCCAGAAGCTAGCTCTAAGGTGGGTACGGTGCCTTTGGCTCCGTTAGTGGCAACACATACAGTCTCTTCTGGGGCAGGGTCGGTTTCTGCAGGTAAAGGCAAGGCTGTGGCTAATGAAAATTCTGAGATTACTATGACTGATGTGCATGATGGTCCGGTGAAGAAGAGTTGGAAGCGGCTGGCCAGAAGCTCTTTGAAGGACATTTCCAATGTCTTATCTTCCTCAGTTGTTAGTGGGCACAAGCGATCAGCCCAGGGGGACCCGCCTGATGAGGATGGGTTAGTCTCCAAGCGACTGAAGGAGGTGGAGTCTGGGTCTCCCCGGGCTTTCCAGCGCTTGGCCAA
GGTGGTTCAAGAGAAAAGACCCCTGGTGCTCTTCCTGTCTGAAACAAAGCTGTCGTCAAACAGGATGGCATCAGCGAAGCGAGTTCTGGGTTTCGAGTACTGTTTTTGTGTTGATAGCAAAGGTAGGAGTGGTGGTTTGGCTCTGTTGTGGAGTTCGTCTGTCTCCTTCAGCCTCTTGTCATTTTCGAATAACCACATTGATGGGTGGATCTCGTGGGACGTTTATCATTGGCGACTCACGGGTTTCTATGGTTTCCCTGCCGCCGATAAGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGGGGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTACCAGCATGAGAAGGAAGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAATGTGATTGACTCATGTGGGCTTCTTGATTTGGGATTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTAGCTCAGTTGCTTGGCATGATATCTACCCCAACTGTGTAGTTAACCATCTTGATTATCACCAGTCCGATCACCGACCGATTGAGCTGGTTCTCTCTCCGCAGCCTGGTTGTTGGAGACGCTCGAGCCAGCGAATCTTACGGTTTGATGAGACTTGGCTGAAGCAAGCAGAGCTGCAGCAGCTGGTCAGGGACTCATGGGGGTCGAGTGGGGAGGGTCCTGGTTTGTCAGCTCCCGAAAGGTTGGCTCAAGTTTCCAGAAGGTGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAAAATGGGGAACTTCCCTCAGAAGGTACAGCTGGCCATTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAGGCCCAGTTGGAAGATGTGTTACAGGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAGGTGTGGTTGAAGGAAGGGGATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCAAAGGCTCAATCGTATTGGGGGCCTCATGGACGATCAGGGGGAATGGCGCCAGGACAGAGCTATGGTTCTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCCGGGACCTTCAGCGATCTGTGGATAGTGAAATGAATGTGGATCTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGATAGCGAGCATAGCTAG
Protein sequence
MFYTFRPSLSDSKLEASRQEMRLRKTRLKIREEDTTFLVIVMSRNINGKLKIREFEERVHSGGVVCLTVRASCRRGLFEVWVFCSGFLHGVEVFSDGFRVSKGSAAFDLLHGFWVVVSGGGVSWVRRLISGVGRCRLRFHSGSRLTSALCALRFWSWGWSQSVAVGFLRVCSVLGGRGFSSVCVLFCLQVSVVFWCLVAGLGLHSSAFLVLIGSCDLGWVVSSLGARMDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLSDSEHS
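The protein sequence above should be the translation of the coding sequence. A minimal sketch of how one might check this, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the short test fragments are illustrative choices, with the fragments copied from the start and end of the CDS listed above:

```python
# Standard genetic code (assumed: NCBI translation table 1), packed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 36 and last 18 nucleotides of the CDS shown above.
cds_start = "ATGTTTTATACCTTTCGACCATCGTTGTCAGATTCG"
cds_end = "GATAGCGAGCATAGCTAG"

print(translate(cds_start))  # MFYTFRPSLSDS
print(translate(cds_end))    # DSEHS*
```

The two fragments reproduce the first twelve and last five residues of the listed protein, followed by the stop codon, which is not part of the protein sequence itself.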
Homology
BLAST of Lag0040831 vs. NCBI nr
Match:
XP_042990668.1 (uncharacterized protein LOC122317666 [Carya illinoinensis])
HSP 1 Score: 402.1 bits (1032), Expect = 1.5e-107
Identity = 283/927 (30.53%), Positives = 455/927 (49.08%), Query Frame = 0
Query: 229 DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSV 288
+D+ +W+++ L+E E + + L + +LC++ K+ + + N +AFR M +
Sbjct: 3 EDITARWKSLKLTEEEQQEIVLSEEAVLSSNAKGELCLLAKIFNDRTANREAFRTTMSKI 62
Query: 289 WSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFS 348
W+ + NV++I F V++K +++ PW+FD+ L+ + G S + F
Sbjct: 63 WNTEGWLTFKDLETNVYLIEFQLVSDKEKVLLGRPWSFDRHLVCMKEFEGLLSLSEVRFD 122
Query: 349 RCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLR 408
FW+ + +P + M +GSV+G V+EV G +R++ +N+ + L
Sbjct: 123 SEPFWIQVHNLPFAGMSKEMGILVGSVIGRVLEVETNTEGYGWGGYLRIKAEVNVTKALV 182
Query: 409 RVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFL--FGDWL 468
R R +K W +YERLP FCF+CGR H C E GADN +G WL
Sbjct: 183 R-GRFLKSGSKQSWLSFKYERLPMFCFKCGRFVHEQGSCQER----GADNHTFDQYGQWL 242
Query: 469 RAV----PFRH-----GVANATEEGGGRPDIQGGGDQVS-----EVSMPADRVVDLVDSG 528
RA F+H G+ G Q GD S S P +LV+S
Sbjct: 243 RATHLHSKFQHNRRYGGLKEKEPSGSTWSGSQEEGDDYSGFCGVSTSKPGSEPPNLVEST 302
Query: 529 VVLE---GTTASPVP---SGTPPSVDPDIGVASADKGKEV----ADPGVAPEASSKVGTV 588
LE G T +P GT + +P + DKG + P E S+ T
Sbjct: 303 EALEVNLGHTEGNLPKEDQGTSVTDNP-----AHDKGHKTFPTHERPASWQERLSQDPTT 362
Query: 589 PLAPLVATHTVSSGAGSV----SAGKGKAVANENSEITMTDV--------HDGPVK-KSW 648
++ H+++S V A ++ EN + T+ + PVK K+W
Sbjct: 363 LHPQVLFPHSLASSYDVVMLATEAPGPTSIRKENGGVPFTETCFENLENSSNSPVKGKTW 422
Query: 649 KRLAR---SSLKDISNVLSS-SVVSGHKRS-AQGDPPDEDGLVSKRLKEVESGSP-RAFQ 708
KR AR + L D++N++ S + KRS Q DG + K+ ++ +P + Q
Sbjct: 423 KRKARALPTPLSDVTNIIQQPSSYNNSKRSLRQRFSTRADG----KCKKQKTQAPFESVQ 482
Query: 709 RLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSF 768
L +V+ K+P ++FL+ETK ++ R+ K LG+E CF V+SKG+SG LALLW SV
Sbjct: 483 ELHLLVKAKQPHLVFLTETKCNNVRLDRIKLALGYENCFSVNSKGKSGELALLWKDSVKV 542
Query: 769 SLLSFSNNHIDGWIS--WDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGD 828
+++++ HI I+ D W+LTGFYG P + KR ++W LL L+ + PWL GD
Sbjct: 543 EVINYTTWHISALITSPIDNTQWQLTGFYGHPNSAKRPESWHLLRGLKPTGNLPWLCLGD 602
Query: 829 FNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIY--ERL 888
FN + +Q EK G ++P ++ F+N + C L DLGF G++FTW N R EG + ERL
Sbjct: 603 FNEITHQSEKVGAANRPYRQMVQFRNSLSFCKLYDLGFHGDKFTWSNNR-EGDQFTKERL 662
Query: 889 DRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQA 948
DR + W ++ N V+HLD QSDH+ + + + ++ R+ RF+ W K+
Sbjct: 663 DRACGNNFWIKLFANHTVSHLDCTQSDHKALLVQTADLNSMGKKG--RVFRFESAWTKEM 722
Query: 949 ELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRG 1008
E +++++ W S G S R Q +C + W R+K + + ++ E L+
Sbjct: 723 ECEEIIKKVWRLSS---GPSILHRTLQDLNQCQGKLKIWSRNKQRDQRKALKNKTELLKI 782
Query: 1009 AGSR------EPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRL 1068
R E + + + ++ E L +QR+++ WLK GD+NT++FH+ +S R+R
Sbjct: 783 LQERNQGELSEEIKKVNQSINCIMDAENLKRQQRAKQAWLKNGDRNTKFFHQCSSQRKRT 842
Query: 1069 NRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVD 1101
N I + G QD + Q + ++F LF++S PS D L LQ+ + +M
Sbjct: 843 NSILRIQTKSGLSTQDPQTICQTLLEFFTDLFTSSHPS--GIDNCLSPLQKIITDDMFTF 902
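Each match in this report carries a score line and an identity line in the same fixed layout. A rough sketch of how those two lines could be parsed and the reported percentages re-derived from the raw counts; the function name `parse_hsp` and the dictionary keys are hypothetical, and the regular expressions assume only the layout visible in this report:

```python
import re

# Patterns assume the layout seen in this report, e.g.
#   "HSP 1 Score: 402.1 bits (1032), Expect = 1.5e-107"
#   "Identity = 283/927 (30.53%), ... = 455/927 (49.08%), Query Frame = 0"
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
HSP_RATIO = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(score_line: str, stats_line: str) -> dict:
    """Return bit score, raw score, E-value and every 'name = n/m (p%)' ratio."""
    match = HSP_SCORE.search(score_line)
    if match is None:
        raise ValueError("unrecognised score line")
    result = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    # Any label before '=' is accepted, so the key is whatever the report prints.
    for name, num, den, pct in HSP_RATIO.findall(stats_line):
        result[name.lower()] = (int(num), int(den), float(pct))
    return result


hsp = parse_hsp(
    "HSP 1 Score: 402.1 bits (1032), Expect = 1.5e-107",
    "Identity = 283/927 (30.53%), Positives = 455/927 (49.08%), Query Frame = 0",
)
matches, aligned, reported = hsp["identity"]
print(round(100 * matches / aligned, 2), reported)  # 30.53 30.53
```

Recomputing the percentage from the counts is a cheap consistency check when the report has been through text extraction.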
BLAST of Lag0040831 vs. NCBI nr
Match:
KAG2663507.1 (hypothetical protein I3760_16G033000 [Carya illinoinensis])
HSP 1 Score: 400.2 bits (1027), Expect = 5.8e-107
Identity = 295/922 (32.00%), Positives = 430/922 (46.64%), Query Frame = 0
Query: 228 MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLS 287
MD L W N+ L+E E + ++ + + ++GK+ S + V +
Sbjct: 1 MDSLHSMWGNLHLNEEEEDAIVIDENVCSEVQRKGERSLIGKIWSDRQVGKNVVESTFAK 60
Query: 288 VWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDF 347
+W + +S + N F++ F + +K R+ S PW FD L V+ GS S + F
Sbjct: 61 IWRLSKSVVLREVAPNTFILIFATHADKDRVESGRPWFFDGHLFVINPFDGSIPVSEMKF 120
Query: 348 SRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPL 407
FWV +PL LGS +G V EV + G +RV+I+L++ +PL
Sbjct: 121 DHASFWVQFHNLPLLGMNKECGAKLGSTIGEVEEVEVDEDDVAWGRSLRVKIMLDLRKPL 180
Query: 408 RRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFLFGDWLR 467
R R + +G +W P++YE++P FCF CGRI H C V D FG WLR
Sbjct: 181 AR-GRTIVLHGVKVWSPVKYEKIPRFCFTCGRILHGSVGCQ-----VQKDVSIQFGSWLR 240
Query: 468 A---VPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASP-- 527
A + R + G G AD + SG VL + S
Sbjct: 241 AESPLKKRWEIKKELSTDKGSSSEAGMEQNTVADGGAADHGGPMKCSGNVLSSSMLSDDG 300
Query: 528 -VPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSKV--GTVPLAPLVATHTVSSGAGS 587
G +D A D KE V E + G + + A+ T AG
Sbjct: 301 INGDGLGVDIDSINDKAVTDCYKEQGLNQVVTETCDLLSDGNLGGKEIHASETSFGPAGG 360
Query: 588 VSAGKG----KAVANENSEI-----TMTDVHDGPV-------KKSWKRLARSSLKDISNV 647
S G +A NE + VH P+ + SWK+ AR S +
Sbjct: 361 PSGLGGPTDSQAGLNETGLVPEGPEVQPAVHQLPLTQQGRHTRSSWKKKARGSFLPAEPL 420
Query: 648 LSSSVVSGHKRSAQGDPPDEDGLVS--KRLKEVES---------GSPRAFQRLAKVVQEK 707
S+V KRS D+ + + K L S G+P Q L+ +V+ K
Sbjct: 421 EKSTV---RKRSGDFLSLDQSLIPASKKALPGTMSLISWNSRGLGNPVGVQALSDLVRSK 480
Query: 708 RPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNH 767
P +LFL ETKL++ M K LGF+ C V +G+SGG+AL W++ + +FS H
Sbjct: 481 APDILFLQETKLNARVMERMKYQLGFKNCLAVSCEGKSGGIALFWNNRFKVEIQTFSKFH 540
Query: 768 IDGWISWD---VYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQH 827
I ++ + V W LTGFYG KR ++W+LL L SD WLI GDFN +L
Sbjct: 541 IHAKVTEEEENVEPWWLTGFYGNSDVSKRHESWNLLRTLLVPSDKGWLILGDFNEILSNA 600
Query: 828 EKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEG-TIYERLDRCFSSVAW 887
EK GGRDK ++ AF++VID C L DLGF GN FTWCNRR I ERLDR S++ W
Sbjct: 601 EKSGGRDKSERQMKAFRDVIDECHLHDLGFNGNPFTWCNRRERAHCISERLDRFLSNLKW 660
Query: 888 HDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDS 947
H YP V H SDH PI L L+ G + +++ RF+ W+ + +Q+++D+
Sbjct: 661 HSFYPMASVIHGVIAYSDHVPIMLKLT--AGSVQGPRKKLFRFEAMWVDATDCKQVIQDA 720
Query: 948 WGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQ 1007
W LS R Q C + W ++ G+ + ++ A + L +PLS
Sbjct: 721 WRGVEGRKDLSIVMRKIQ---HCGEKLTVWNKTNFGHVQRNLKKAKDRLCLVHQADPLSN 780
Query: 1008 AEAQLEDV-------LQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMD 1067
+L++ L E+ WKQRS+ +WL EGD+N+R+FH +A+ R++ N I + D
Sbjct: 781 NRQKLQEARNEVQKWLTRNEIMWKQRSKALWLAEGDKNSRYFHHKATQRRKKNWIKEVKD 840
Query: 1068 DQGEWRQDRAMVLQLVTDYFQQLF-STSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTE 1103
G W Q+ +++ DYF LF + E F L+ L + +M L PF+E
Sbjct: 841 SNGVW-QNNERRDEIILDYFNSLFKAADEVGSMGF---LQGLAGRITPDMVEQLDSPFSE 900
BLAST of Lag0040831 vs. NCBI nr
Match:
XP_022841874.1 (uncharacterized protein LOC111365549 [Olea europaea var. sylvestris])
HSP 1 Score: 398.3 bits (1022), Expect = 2.2e-106
Identity = 272/896 (30.36%), Positives = 434/896 (48.44%), Query Frame = 0
Query: 253 DIPLLDESTVQL------CVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFV 312
D+ L++E V + C++ K+LSSK N +AF+ M +W +R+ I N+ +
Sbjct: 20 DVLLVEEDEVLIPHRSDKCLLFKLLSSKHFNKEAFKGTMRRLWHPNRTLSIHDLFPNLHI 79
Query: 313 IRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTA 372
F +K R+ GPW F K L++ G + + FS FWV I + +
Sbjct: 80 AEFDDNRDKDRVKCEGPWVFGKQLVLTKDVDGLEQIHQILFSEANFWVRIHDLHVMVRNW 139
Query: 373 AMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQ 432
M +G +G V+EV + G + VR+ L++++PL R ++ G+ W
Sbjct: 140 KMGDFIGRQLGKVIEVDIDKIEIARGEFLHVRVCLDISKPLLRGKKIYVGSYKPFWTRFS 199
Query: 433 YERLPDFCFRCGRIGHSHRECSEEGEGVGA--DNQFLFGDWLRAVPFRHGVANATEEGGG 492
YERLP+FC+ CG +GH H++ + A + + +G WLRA N+++ G
Sbjct: 200 YERLPNFCYLCGVLGHGHKDYILWKPAMEAYTTSDYSYGPWLRA-------GNSSDYSGP 259
Query: 493 RPDIQGGG---DQVSEVSMPAD----------RVVDLVDSGVVLEGTTASPVPSGTPPSV 552
D Q D + ++ P+ +V+L + +LE P G P +
Sbjct: 260 IVDRQINSRTPDSPNSLANPSSDPSSANITHVNLVNLTNKDQLLE-----PYGKGDPAML 319
Query: 553 DPDIGVASADKGKEVADPGVAPEASSKVGTVPLAP---------LVATHTVSSGAGSVSA 612
+ + ++ + ++ G++ AS + L P L A +T SG
Sbjct: 320 ---LNLTNSQELQQTNFEGIS-TASRTATNLELTPLQTDAIMDNLTAENTRHSGLADTQT 379
Query: 613 GKGKAV-----ANENSEITMTDVHDGPVKKSWKRL-ARSSLKDISNVLSSSVVSGHKRSA 672
+ + NS +T + WKRL ++ I +S+ +
Sbjct: 380 KGTHGLLQTDPVSSNSAEVLTGAKGANFTRRWKRLNTNHAIHSIVPPPQTSLKRSLDFPS 439
Query: 673 QGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVL 732
+ PP L+S + +PR L+ +++++ P VLFL ETKL++ M + L
Sbjct: 440 ENAPPIPLKLLSWNAQ--GHRNPRGIHALSNLIRKEDPDVLFLQETKLNAANMELCRIKL 499
Query: 733 GFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYHWRLTGFYGFPAAD 792
F C V + GRSGG+ALLW S+V+ S+L +S HID I ++ W LTG YG P
Sbjct: 500 KFYGCLNVQAVGRSGGIALLWKSNVNLSILGYSTKHIDAKIESPLHCWFLTGIYGHPETT 559
Query: 793 KRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLD 852
KR +TW+LL +L+ S WL+ GDFN +L EK GGRD+P +L FQ++ C L D
Sbjct: 560 KRMETWNLLKRLKRNSSEAWLVFGDFNEILSNEEKWGGRDRPRQQLENFQHMFLECELRD 619
Query: 853 LGFVGNRFTWCN-RRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLS 912
LGF G +TW N R I+ERLDR ++ +W ++ +V H SDHRP+ + L
Sbjct: 620 LGFKGPYYTWYNGRHDTNQIFERLDRFIANDSWCRLFTQAMVTHGSTTYSDHRPLWIQL- 679
Query: 913 PQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRR---CM 972
Q + RF+ W+ + +V + W + + L + R C
Sbjct: 680 -QGASQTLKGPKPFRFESMWIGEKACSDIVLNQWSD-------TKSQNLESIMRNISCCS 739
Query: 973 RSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREP-------LSQAEAQLEDVLQEEELYWK 1032
+ W RSK G ++ A L ++P L + +++ + EEL WK
Sbjct: 740 SQLQLWNRSKFGRVQAELNKARTKLSQIQKKDPATINTEALRTSAEKVQTWMDREELMWK 799
Query: 1033 QRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLF 1092
Q+ R WL++GDQNTR+FH +AS R++ N I L+DDQG W QD +L+T+YF LF
Sbjct: 800 QQPRAAWLEQGDQNTRYFHAKASQRRKTNSITKLLDDQGCW-QDGGACNRLITNYFSDLF 859
Query: 1093 STSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGL 1102
S++ ++ L +L+ + +MN+DL++ FTE E+ +ALK+ HP KAPGPD +
Sbjct: 860 SST--GSRNHAAVLDNLECKLTYDMNIDLIQTFTEIEVSQALKEMHPTKAPGPDSM 885
BLAST of Lag0040831 vs. NCBI nr
Match:
KAG2711776.1 (hypothetical protein I3760_04G092800 [Carya illinoinensis])
HSP 1 Score: 396.4 bits (1017), Expect = 8.3e-106
Identity = 298/942 (31.63%), Positives = 462/942 (49.04%), Query Frame = 0
Query: 229 DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQ--LCVVGKVLSSKPVNPDAFRRVML 288
D L + ++ L+E E+ SV +I L + + C++ K+L K N +AF++ M
Sbjct: 3 DALDELYASLSLTEKENEAVSV--EISRLGDVLERGANCLIMKLLIKKHYNHEAFKQTMR 62
Query: 289 SVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLD 348
+W + + ++ F + +K ++M GPW+FDK L++L G ++
Sbjct: 63 KIWRPVKGVKFRDLNSEFTLVEFDDMRDKLKVMREGPWSFDKHLVLLKEFDGRLQIGKIE 122
Query: 349 FSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVP-GEGHSDWLGTVMRVRIVLNMAQ 408
FWV + +PL + R +G +G V+EV G +W G MRVR+++N+++
Sbjct: 123 LVHAPFWVRLHDIPLMARNEYIGRLVGGALGEVLEVDLDNGEMEW-GEYMRVRVLINISK 182
Query: 409 PLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFL-FGD 468
PL R R++ G W YERLPD CF CG +GHS +EC E + D+Q L +G
Sbjct: 183 PLLRRKRMIVEEGVSCWVRFSYERLPDLCFNCGILGHSLKEC-EGFDANQVDSQRLPYGP 242
Query: 469 WLRAVPFRHGVANATEEGG----------------------GRPDIQGGGDQVSEVSMP- 528
WLRA G N+ GG + D G G V+ P
Sbjct: 243 WLRA-----GFLNSRRSGGRANSMATTATAAAATPDPVETVSQKDSNGKGSGAESVTEPV 302
Query: 529 ADRVVDLVDSGVVLEGTT-ASPVPSGTPPSVDPDIGV--ASADKG---KEVADP---GVA 588
++ DL GV + G +S VP+ TP +++ ++ + +S D G KEV++ V
Sbjct: 303 GEKNPDL--EGVDIPGNKGSSSVPTVTPSALNEEVAILESSMDVGGLNKEVSETVVLDVV 362
Query: 589 PEASSKVGTVPLAPLVATHTV------------------SSGAGSVSAGKGKAVANENSE 648
+ G + + L A+H SS SV G+ V +
Sbjct: 363 QRNHLEDGLMDVPVLEASHMFMGLDSGPSDNIVPKPVGGSSSRNSVQRSTGQRVGSRALR 422
Query: 649 ITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEV 708
T+ ++G + +A +S+ +L S+ H R +G +S E
Sbjct: 423 KIQTEAYEGRLVTG---VAETSVLQSEGLLGSN----HGRKRKG--------ISIVEAEA 482
Query: 709 ESGS--PRAFQRLAKVVQEKRPLVLFLSETKLSSNR---MASAKRVLGFEYCFCVDSKGR 768
+ G+ P R + +++E+R L++ + SN + S EY F + GR
Sbjct: 483 KKGNFLPLPLFRWSWLMRERRRLMI----SPAGSNEDFILESPWAWEPTEYSFF--TVGR 542
Query: 769 SGGLALLWSSSVSFSLLSFSNNHIDGWI-SWDVYHWRLTGFYGFPAADKRDQTWSLLSKL 828
SGGLAL W ++ ++S+S NHI I + D W LTG YG P + +R + W LL L
Sbjct: 543 SGGLALFWKDNIHLKIVSYSRNHIHAAIKNCDGVEWLLTGVYGHPESGQRSEFWRLLKFL 602
Query: 829 RGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCN 888
G + PWL+ GDFN +L EK GG + S++ F+ V+ C L DLG+ G FTW N
Sbjct: 603 GSGVNLPWLVFGDFNEILDHSEKLGGNIRSESQMKEFRAVLSDCHLRDLGYEGAPFTWSN 662
Query: 889 RR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQR 948
RR EG + ERLDR ++ W +IY N V+H SDH P L L + +RR ++R
Sbjct: 663 RRGEEGLVKERLDRFLANSWWCEIYLNLRVSHGVAAYSDHIP--LWLDTEGALFRRRNKR 722
Query: 949 ILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMG--- 1008
+ RF+ W+ + E ++ +W +S + + ++S RC + W ++ G
Sbjct: 723 LFRFEAMWVGEKECSSIIERAW--CQRNGSISLDQIMGRIS-RCAIELGRWNKTSFGHVQ 782
Query: 1009 ----NFPQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNT 1068
N +K+Q G+ S E QA +++ L+ +EL WKQRSR WL+EGD N+
Sbjct: 783 KNLANAKRKLQCLEANDSGSLSLEEHKQACLEVQKWLERDELMWKQRSRVKWLREGDCNS 842
Query: 1069 RWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLR 1103
R+FH +AS R+R N I L D+ G W++ M L+T+YF LF+ ++ D D+ L
Sbjct: 843 RYFHSKASTRRRKNSIMQLQDESGCWQKGDQMD-ALITEYFHNLFTAADRVDMG-DI-LS 902
BLAST of Lag0040831 vs. NCBI nr
Match:
XP_030939698.1 (uncharacterized protein LOC115964550 [Quercus lobata])
HSP 1 Score: 391.7 bits (1005), Expect = 2.1e-104
Identity = 257/860 (29.88%), Positives = 416/860 (48.37%), Query Frame = 0
Query: 259 ESTVQLC---VVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEK 318
E T++ C ++G+ L+++P N A + ++ SVW + RI G+ +F RF ++
Sbjct: 30 EKTIEECSLTLLGRFLTNRPYNQRAAKSLLRSVWKLGNDLRIVDVGEGLFQFRFKLESQL 89
Query: 319 RRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSV 378
++ GPW+FD +LLVL + + F WV + +P + +G
Sbjct: 90 TWVLENGPWSFDNNLLVLRRWERGMTANSVTFPTLPIWVQVWGLPFDLINEEAGWEIGKG 149
Query: 379 VGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCF 438
+G V EV + +R+R+ + + +P+RR + G + +YERL C+
Sbjct: 150 LGQVYEVDNKTFLSDQALFIRIRVGIPLEKPIRRGGWVANPEGDQVQVGFKYERLVGLCY 209
Query: 439 RCGRIGHSHRECSEEGEGVGADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQV 498
+CG++GH ++CS +G A+ +GDWL+A G R D
Sbjct: 210 QCGKLGHEMKDCSVQGSSQQAEKP--YGDWLKA-------------GFRRKD-------- 269
Query: 499 SEVSMPADRVVDLVDSGVVLEGTTASPVPSGTP-PSVDPDIGVASADK---GKEVADPGV 558
M ADR T +P P+ P PS + + S D+ + D
Sbjct: 270 ----MGADR------------AKTNAPPPAPAPEPSQSHTVAINSHDEVAGSMGINDNHE 329
Query: 559 APEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKR 618
+ S++ V+ S + G + +S +++ G
Sbjct: 330 RTDNGSEINCPKSHVTVSQKIQVSEEVNTEHLWGAKFSEPDSINGVSNTQMGIRGMEITG 389
Query: 619 LARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQE 678
L ++L +++ L + ++ KR D + ++ +E S L+ +V+
Sbjct: 390 LETAALHALNSTLINVPINYEKRQTHAD----QSICIQQPREKSSVLETHVDVLSHLVRV 449
Query: 679 KRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNN 738
K P +LFL ETK S M + L + F V S RSGGLALLW + + +F+ N
Sbjct: 450 KAPKILFLMETKRSLEEMRWIQNDLPYRCMFVVPSVRRSGGLALLWMEEIDLHIQTFTLN 509
Query: 739 HIDGWISWD-VYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHE 798
HID I D HWRLTGFYG+P ++ ++W LL L PWL GDFN +L E
Sbjct: 510 HIDALIMDDPANHWRLTGFYGWPEEQRKQESWQLLKHLHSRHSVPWLCFGDFNEILQSEE 569
Query: 799 KEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHD 858
K+GG KPL+ + F+ + CGL+DLG+ GN FTW N R + + ERLDR +++ W D
Sbjct: 570 KQGGLPKPLAPMLNFREALLYCGLVDLGYQGNMFTWTNGR-DDLVQERLDRACATIEWRD 629
Query: 859 IYPNCVVNHLDYHQSDHRPIELV--LSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDS 918
+ V HL+ SDH PI + + P P ++ RF+E W + + +++ +
Sbjct: 630 KFAQVQVTHLEASYSDHNPILVTTHIRPHPTLKKKIPH---RFEERWATHPDCENIIQMA 689
Query: 919 WGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMG-NFP--QKVQLAIEGL---RGAGS 978
W S P S +L + +RC ++ W R G + P Q+ Q +E L A +
Sbjct: 690 WDSI--VPNGSPMAKLFEKIKRCRFALVDWSRITFGLSKPQLQEKQKILEELCIQNRAEN 749
Query: 979 REPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDD 1038
+ +A++ +++ ++EL+W+QRSR +WL GD+NT++FH +AS R+R N I G+ D
Sbjct: 750 VRTIKSLKAEITNIIHQDELFWRQRSRSIWLPAGDKNTKYFHNRASQRRRKNHISGVFDS 809
Query: 1039 QGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEE 1098
W + ++ YFQ+LFST+ P Q+ + L+ +QR V MN L +P+T +E
Sbjct: 810 DERWCTSDEQIAKVAEFYFQELFSTAHP--QNMESVLQSVQRKVTPHMNESLTRPYTADE 838
Query: 1099 ILRALKQSHPHKAPGPDGLS 1103
+ AL Q HP K+PGPDG+S
Sbjct: 870 VRLALFQMHPSKSPGPDGMS 838
BLAST of Lag0040831 vs. ExPASy TrEMBL
Match:
A0A2N9GF83 (CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS25970 PE=4 SV=1)
HSP 1 Score: 433.0 bits (1112), Expect = 3.9e-117
Identity = 301/949 (31.72%), Positives = 455/949 (47.95%), Query Frame = 0
Query: 228 MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLS 287
+++LV W L+E E+ F++ + ++ C++GK+L SK N A + ML
Sbjct: 2 VEELVEDWRRFSLTEDEAPGFTIDEEAMGNAKAFGSHCLLGKLLISKSFNKAALKATMLR 61
Query: 288 VWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDF 347
+W V + G N+F+ +F + ++ +R+ PW FD +LLVL GS + + F
Sbjct: 62 IWGVACGIVAKDMGMNLFLFQFQNESDCKRVFKGSPWLFDNNLLVLNEFDGSCPANQISF 121
Query: 348 SRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPL 407
+ C FWV + VPL Y T +G +G V +V G +RVRI +++ +P+
Sbjct: 122 NSCCFWVQLHGVPLFYMTKQTGERVGGAIGIVEKVDVSEDGVGWGPFLRVRISVDITKPI 181
Query: 408 RRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEE--GEGVGADNQFLFGDW 467
+R + G+ +W +YERLP FCF CG++GH REC + G + + +G W
Sbjct: 182 QRGRLVTFGSTGQMWIAFKYERLPWFCFHCGKLGHGERECGVKLRGGNMNTSDFKQYGVW 241
Query: 468 LRAV--PFRHGVANATEEGGGRPD--IQGGGDQVSE-----VSMPAD------------- 527
LRA FRH + + G P + GG QV++ S PA
Sbjct: 242 LRAPEHSFRHRNMHGDDRRRGAPSSLVLPGGGQVADNAAFKFSTPASISGDSNRGYSLAR 301
Query: 528 --RVVDLV---------------DSGVVLEGTTASPVPS----------GTPPSVDPDIG 587
R VD V D L+ TA VP+ P S D G
Sbjct: 302 ERRGVDKVISSSGFSQPCGDQPSDIQASLKAPTALRVPNIGQQNDYIAVAYPGSCDLPGG 361
Query: 588 VASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEIT 647
V D +A G V T + H +S S+ + T
Sbjct: 362 V--GDSVGSIAQSGACSTPQEFVTTTLHGEDIVDHVTASTCPSLPTTSKFCEGS-----T 421
Query: 648 MTDVHDGPV--KKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEV 707
+T H+G V K WKRLAR+ K ++S G KR PPD L+S + +
Sbjct: 422 LTAKHNGLVKGKSPWKRLARAKGK-----VTSVAGEGQKRGCYPAPPDTMSLLSWNCQGL 481
Query: 708 ESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLA 767
G+P + L +++EK P +LFLSET+L + + L F FCV G GGLA
Sbjct: 482 --GNPCTVRELLLLLKEKAPSILFLSETRLDCVGVEKLRVRLKFGNAFCVPRLGTGGGLA 541
Query: 768 LLWSSSVSFSLLSFSNNHIDGWISWDVYH--WRLTGFYGFPAADKRDQTWSLLSKLRGGS 827
LLW++ V + S+S NHID + + H +R+TGFYG KR ++W+LL L +
Sbjct: 542 LLWTAKVEIQIQSYSRNHIDAGVKDLLGHRRFRVTGFYGNLETSKRKESWALLKHLSQIA 601
Query: 828 DTP-WLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRP 887
+P WL GDFN +L E+ G +P ++ F+ I CGL D+GFVG+ FTW +R
Sbjct: 602 GSPLWLCIGDFNEVLDNSERVGRGSRPAWQIQDFRTSIVHCGLHDIGFVGHPFTWRKQRR 661
Query: 888 E---------GTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWR 947
G RLDR S +W + V+HL SDH P+ + L G
Sbjct: 662 SSGLLDGGIGGCAAARLDRALVSDSWILDFQGLSVSHLPVQNSDHCPLFVHL--PVGLQA 721
Query: 948 RSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRR---CMRSMAGWG 1007
+++ RF+ W K + ++ +W S + A ++ QV + C ++ W
Sbjct: 722 SRPKKVFRFEAMWAKDEQCYDVINQAWDSE-----VLAGSKMFQVMEKLKGCRGALIAWS 781
Query: 1008 RSKMGNFPQKVQLAIEGLRGAGSREPLS------QAEAQLEDVLQEEELYWKQRSREVWL 1067
+ + G+ ++ L+ + PL + + L +L++EE+YW+QRSR W+
Sbjct: 782 KLRFGSLAFSIKGKRMQLQSLLADHPLGDSPRILELQDDLNALLEKEEVYWQQRSRISWM 841
Query: 1068 KEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQ 1103
KEGD+NT++FH Q S R+ N++ GL D+ G W D+A V + DYF+ +FS+S P+ +
Sbjct: 842 KEGDKNTKFFHAQCSQRRESNKVKGLRDEVGVWHTDKAKVADMAVDYFKNIFSSSNPTGE 901
BLAST of Lag0040831 vs. ExPASy TrEMBL
Match:
A0A7N2R0C3 (Reverse transcriptase domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 432.2 bits (1110), Expect = 6.6e-117
Identity = 284/943 (30.12%), Positives = 451/943 (47.83%), Query Frame = 0
Query: 229 DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSV 288
++L V W+ + ++E E + + + + CV KV+S K + +A R+ + +
Sbjct: 3 EELEVLWQRLKVTEEEEESILLGDECMRAAVERGKKCVFMKVMSRKGLMVEALRKNVRML 62
Query: 289 WSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFS 348
W ++S ++ G+ +F++ F +KRR+M + PW ++K L++ G + P +
Sbjct: 63 WKPNKSIQLSVIGEELFLVEFEDERDKRRVMDMRPWHYEKQLVLFKEFEGDESPKDILLK 122
Query: 349 RCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLR 408
FWV I +PL T +A+G +G +EV E GT +RVR+ +++ + L
Sbjct: 123 WSPFWVQIYNLPLKSRTKETGKAIGESIGKFIEVDVEETGVQWGTCLRVRVEIDVTRKLI 182
Query: 409 RVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEE--GEGVGADNQFLFGDWL 468
R ++ G W +YERLP+FC+RCG + H ++C EE + G ++ +G WL
Sbjct: 183 RGRKINMEKGETRWVHFKYERLPNFCYRCGLLDHDLKDCLEEPGKDKTGEESDLQYGAWL 242
Query: 469 RAVPFRHG------------------VANATEEGGGRPDIQGGG-------DQVSEVSMP 528
R P R G E GR ++Q G + +S
Sbjct: 243 RGEPIRKGGWDFGFAKKKVIGEMKNKENTKAAERKGRDEVQEGVARETQELEAISLGDSR 302
Query: 529 ADRVVDLVDSGVVLEGTTASPVPSGTPPSV------------DPDIGVASADKGKEVADP 588
+R DL+ G E TT +G S+ + + V + + GKE
Sbjct: 303 QERHGDLMGGG---EVTTVKKGENGLAESIGRSHSGELVEVGEENREVGNGNGGKEACQK 362
Query: 589 GVAPEASSKVGTVPLA----PLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPV 648
G A + + + + V G G G + E GP
Sbjct: 363 GRAENLNKPIPNFEFGLGNETVKTDNVVGLGLGPDKNKDGPMAMQYDPEEGWVANKLGPS 422
Query: 649 KKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGD----PPDEDGLVSKRLKEVES----- 708
WKR+ R+ + VS +R GD D++ SKR K S
Sbjct: 423 SGHWKRIIRAG----PDAEMKESVSPVQRKRDGDLTLREIDQNVKASKRRKTPPSQMTAL 482
Query: 709 -------GSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGR 768
GS A + L V+ P+++FL+ETK S R+ +R LG V S GR
Sbjct: 483 AWNCRGMGSAPAVRALTDEVKNGDPVLVFLAETKASQRRIKGLQRKLGLTQGIAVPSDGR 542
Query: 769 SGGLALLWSSSVSFSLLSFSNNHIDGWI--SWDVYHWRLTGFYGFPAADKRDQTWSLLSK 828
SGGLA+LW V SL S SN+HID + S WR TGFYG P A R +W LL
Sbjct: 543 SGGLAMLWREGVDVSLKSCSNSHIDVVVGGSNGAVPWRATGFYGHPDAGMRPISWKLLEV 602
Query: 829 LRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWC 888
L + PW++ GDFN +L EK G ++ ++ F+ + +CGLLDLGFVG RFTWC
Sbjct: 603 LSRQCNMPWVVFGDFNEILNSDEKLGWLERDARQMECFRECLSNCGLLDLGFVGQRFTWC 662
Query: 889 NRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQ 948
N R E RLDR ++ W +++P V H SDH + L + + R+ ++
Sbjct: 663 NGRIGEQRTLVRLDRMVANEEWMNLFPEAKVVHRSMAASDHCLLSLSIRRRE--TRKVAR 722
Query: 949 RILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGN- 1008
R F+E W ++ ++++ +W G P L+ RL + C + W R GN
Sbjct: 723 RRFMFEEMWTREEGCREVIERAWDPLGCNPELTIQNRL----KCCQCQLQNWNRRVFGNV 782
Query: 1009 --FPQKVQLAIEGLRGAG----SREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQN 1068
++ Q ++ L S E + + + ++ +V+ EE+ W QRSR +W+K GD+N
Sbjct: 783 NKILKQKQCRLQQLEELNLLHESAEEVQKLKKEINEVMLREEIMWNQRSRALWIKYGDRN 842
Query: 1069 TRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSL 1103
TR+FH A+ R+R N+I G++D +G WR++ V +++ +YF++++S++ P+ +F L
Sbjct: 843 TRFFHATANNRRRKNKIEGILDSEGRWRENNEEVEEIILEYFKEIYSSNFPT--EFGACL 902
BLAST of Lag0040831 vs. ExPASy TrEMBL
Match:
A0A7N2LIH6 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1)
HSP 1 Score: 426.0 bits (1094), Expect = 4.8e-115
Identity = 298/991 (30.07%), Positives = 469/991 (47.33%), Query Frame = 0
Query: 204 HSSAFLVLIGS--CDLGWVVSSLGARM--------------DDLVVQWENMGLSEAESTT 263
H +FL ++ CDL +SLG + ++L W+ + ++EAE
Sbjct: 177 HVQSFLCILSQEFCDLPCGEASLGTLLSVGKRVGAVEILMAEELEELWKKLTVTEAEDED 236
Query: 264 FSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVI 323
+ ++ + + CVV K+L+ + V +A ++ M +W + +I G+++F++
Sbjct: 237 IKLGSNSTRAAKELGKNCVVMKILTQRTVILEALKKNMRMLWKPSKGMQISEIGEDLFLV 296
Query: 324 RFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAA 383
F +K+++M + PW+++K L+++ G P + FWV I +PL T
Sbjct: 297 EFGDGRDKKKVMEMCPWSYEKQLILMQEFEGELVPKEIKLKWTPFWVQIFNLPLKCMTRE 356
Query: 384 MARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQY 443
+G+ +G V+EV G +RVRI + L R ++ G W +Y
Sbjct: 357 SGYEIGAKIGKVLEVDVPEKGVQWGKFLRVRIRFDATTQLIRGKKVSIEGGEGRWVFFKY 416
Query: 444 ERLPDFCFRCGRIGHSHRECSE--EGEGVGADNQFLFGDWLRAVPFRH------------ 503
ERLP+FC++CGR+ H ++C E +GE G + + +G WLR P R
Sbjct: 417 ERLPNFCYQCGRLDHGEKDCPERKDGENHGDEERKQYGAWLRGEPGRSSGRDYGRMGEET 476
Query: 504 -----------------------------GVANATEEGGGRPD------IQGGGDQVSEV 563
G + +E+ G+ D ++ GG V
Sbjct: 477 MPERRDDHQETRTETQTRVKTRLKESAAVGRQHVSEQAIGQKDGTRKSLVEHGGVGQKGV 536
Query: 564 SMPADRVVDLVDSGVVLEGTTASPVPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSK 623
S + V+LV + T D G+ +A K++ D E
Sbjct: 537 SY---QKVELVHENGKFDSPKEKFEDKITISGKDSLTGMVNA---KDMEDKMQWEEVLDD 596
Query: 624 VGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHD--------GPVKKSWK 683
V V G G + NE+S + MT + GP WK
Sbjct: 597 VANVKSEEGSKQKQGVKGTGCANKENQCWAENESSPLAMTYDQEKGWTSEILGPKSGHWK 656
Query: 684 RLARSSLKDISNVLSSSVVSGHKRSAQG---------DPPDEDGLVSKRLKEVESGSPRA 743
RLAR + KD S S VSG ++ G PP +++ + + G+ A
Sbjct: 657 RLARQA-KDSSPTAGS--VSGSQKRKDGWRCGGGCGAAPPSSMNILAWNCRGL--GTSPA 716
Query: 744 FQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSV 803
+ L V++K P+++FL ETK S +M + LGF V S GRSGGLALLW
Sbjct: 717 VRTLTDEVKKKNPVLVFLVETKASVEKMKGFQNKLGFTQGIIVPSDGRSGGLALLWKEGT 776
Query: 804 SFSLLSFSNNHIDGWI--SWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIG 863
S S++HID + + WR TGFYG P KR +W LL L + PWL+
Sbjct: 777 DIRFKSCSHSHIDVVVHGAGSGGPWRATGFYGHPDTGKRYTSWKLLEILNTQCEMPWLVC 836
Query: 864 GDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRR-PEGTIYER 923
GDFN +++ EK G +D+ +++ AF+ V+ CGL+DLGFVG RFTWCN R + R
Sbjct: 837 GDFNEIVHPDEKMGWKDRDAAQMDAFREVLSKCGLIDLGFVGPRFTWCNGRFGDQRTLIR 896
Query: 924 LDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQ 983
LDR ++ AW ++P V+H+ SDH + L L+ + RR +R F+E W +
Sbjct: 897 LDRMVANEAWSLMFPEAKVHHVSMSASDHCLLALFLN-KVNNQRRGKKRFF-FEEMWTRV 956
Query: 984 AELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLR 1043
E +++V +W E + ERL RC + + W ++ GN + ++ L+
Sbjct: 957 EECKEIVELAWDPYREDSAMPVQERL----ERCQKMLQQWNQNSFGNVYKGIKQKKNRLQ 1016
Query: 1044 GAGSREPLSQAEAQLEDVLQE-------EELYWKQRSREVWLKEGDQNTRWFHRQASYRQ 1103
S L + +++ + +E EE+ WKQRSR WL+ GD+N+++FH AS R+
Sbjct: 1017 QLESLNLLHETAEEIQTLKKEINELHTREEVMWKQRSRVSWLQYGDKNSKFFHATASQRR 1076
BLAST of Lag0040831 vs. ExPASy TrEMBL
Match:
A0A2N9GJ35 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1)
HSP 1 Score: 419.9 bits (1078), Expect = 3.4e-113
Identity = 278/886 (31.38%), Positives = 430/886 (48.53%), Query Frame = 0
Query: 238 MGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRI 297
M LSE E S+ D L + Q ++ K+L++KP + +AF+ + ++WS I
Sbjct: 12 MKLSEKEMLRISLRKDPILKSKKEAQHSILFKLLTTKPFHSEAFKGSIRALWSGLGGVTI 71
Query: 298 EPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHIT 357
N+F+ F + RI PWTFDK L+ +V G P+ + FS FW+ +
Sbjct: 72 RSIEGNLFMAVFTRRDDMERIFVRSPWTFDKKLIPIVRFEGDLQPTEVRFSHTAFWIRVF 131
Query: 358 KVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRL---V 417
+P+ + +G +G ++EV + G +R+R+ +++AQPL R L
Sbjct: 132 NLPIKSMIREVGEDIGQEIGRLLEVDVPENGFGWGEYLRIRVEIDIAQPLLRGCILQSDE 191
Query: 418 KGNGSVLWCPLQYERLPDFCFRCGRIGHSHREC-----SEEGEGVGADNQFLFGDWLRAV 477
G + W +YE LP FC+RCGR+GH EC EGV + +G WLRA+
Sbjct: 192 SDGGGLFWVDFKYEHLPIFCYRCGRLGHGSHECVVGRGGRISEGVSGEK---WGAWLRAL 251
Query: 478 PFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPP 537
R + EG +PD +G E +MP DR E T +
Sbjct: 252 AARPAQPRRSREGVFQPDEEG------ESNMPFDR-----------EAATEN-------- 311
Query: 538 SVDPDIGVASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVA 597
DP +P S G G +
Sbjct: 312 ------------------DP--SPPVS--------------------GGGCKLWDGHWLH 371
Query: 598 NENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSK 657
EI ++H +R A SS KD PP +S
Sbjct: 372 ELLEEIMQLEMH-------VERPACSSGKDA-------------------PPVTMRALSL 431
Query: 658 RLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGR 717
+ + G+P+ L +V+++ P ++FL ET+L+ + + LG + C V+ G+
Sbjct: 432 NCRGL--GNPQTVNELHNLVKKEGPNIVFLMETRLNVRNLEWLRVRLGMKGCLGVERHGQ 491
Query: 718 SGGLALLWSSSVSFSLLSFSNNHIDG-WISWDVYHWRLTGFYGFPAADKRDQTWSLLSKL 777
GGLALLW SSV ++ S+S +HIDG + D WRLTGFYG+P A R ++WSLL L
Sbjct: 492 GGGLALLWDSSVMINIQSYSEHHIDGEVVQNDGLRWRLTGFYGYPEAHLRHRSWSLLRHL 551
Query: 778 RGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCN 837
R SD PW+I GDFN + EK G D+ +++AAF+ + C L D+GF G FTW N
Sbjct: 552 RSISDVPWMIFGDFNEITRLEEKAGREDRNANQMAAFREALLDCSLQDMGFTGTEFTWSN 611
Query: 838 RRPEGTIYE-RLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWR----R 897
R G + RLDR + AW ++P+ +NHL SDH + L+L + R +
Sbjct: 612 NRENGDLVRVRLDRGVADAAWVQLFPHASINHLIVASSDH--VGLLLDSRTDQPRNHVPQ 671
Query: 898 SSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKM 957
+R+ RF+++WLK++ +++++ +W G +A ++AQ ++C + W +S +
Sbjct: 672 RKRRMFRFEKSWLKESGCEEVIQMAWEVQPIG---TAMYKVAQKIKQCRIKLIQWSQSHV 731
Query: 958 GNFPQKVQLAIEGLRGAGSRE-------PLSQAEAQLEDVLQEEELYWKQRSREVWLKEG 1017
P+ + ++ L+ +E ++ + L + ++ E+ W+QRSR VWL EG
Sbjct: 732 RVTPKLIDSKMKQLQELELKEKEDYDSRQINLIKRDLNGLHEKAEIVWRQRSRIVWLTEG 791
Query: 1018 DQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFD 1077
D+NT++FH AS R+++N I GL D Q WR + V Q+ DYF LF++S P + D
Sbjct: 792 DRNTKFFHENASQRKKINTILGLRDQQSNWRTEPLEVEQIAVDYFSSLFASSNP--RAID 794
Query: 1078 VSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS 1103
L +++ V MN L++PFT+EEI RAL Q HP K+PGPDG+S
Sbjct: 852 EVLHEVEGVVTPGMNNVLMRPFTQEEIKRALFQMHPSKSPGPDGMS 794
BLAST of Lag0040831 vs. ExPASy TrEMBL
Match:
A0A2N9H1U1 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS36358 PE=4 SV=1)
HSP 1 Score: 415.2 bits (1066), Expect = 8.4e-112
Identity = 287/906 (31.68%), Positives = 438/906 (48.34%), Query Frame = 0
Query: 228 MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLS 287
MD L WEN+ LSE E F + D ES + + L+ +P+N +A R
Sbjct: 1 MDSLTSMWENLSLSELEGKKFDLEPDTISEQES----ILAARFLTRRPINLEAVARTFRP 60
Query: 288 VWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDF 347
+W + R++ GDNV I F + R+++ GPW++DK ++ + L F
Sbjct: 61 LWRTEKGFRLKDMGDNVVTIYFADEADLERVIANGPWSYDKYFIIFQRTEEEIPITALTF 120
Query: 348 SRCEFWVHITKVPLNYHTAAMARALGSVVGHVV-EVPGEGHSDWLGTVMRVRIVLNMAQP 407
+ WV I +P + A + R +GS +G ++ V E + W G +RV + +N+++P
Sbjct: 121 DTIDLWVQIHGLPPRHLNAGIGRQIGSTLGKIIPTVDSEDEASW-GDFVRVHVSVNISRP 180
Query: 408 LRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSE--EGEGVGADNQFLFGD 467
L R ++ G G + QYE+L +FC+ CG I HS ++CS +G + FG
Sbjct: 181 LCRGRKVGLGGGKEVLVSFQYEKLANFCYWCGFITHSDKDCSVWLRSKGSLTSEKQQFGA 240
Query: 468 WLRAVP---------------FRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLV 527
W+RA P FR A +T GR + G D + M + + +
Sbjct: 241 WMRAQPRASHRRKTVSVEGTLFRSQHATSTPPEQGR-KTETGEDLRAVHQMESLSIDPDI 300
Query: 528 DSGVVLEGTTASPVPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSK--VGTVPLAPL 587
+ +E S +G ++P + A + KE + P S+ T+P +P
Sbjct: 301 PPIIPVEERNYS---NGDCIFMNP-MNSAQNESRKENTNTENFPHTDSQRDFVTIPTSP- 360
Query: 588 VATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLK-DISNVLSS 647
T G V + + V GP +KSWKR SS+K + +S
Sbjct: 361 --TSPPREALGDV------------TNMERGPVTKGPHQKSWKRKPSSSIKAPFEHTVSL 420
Query: 648 SVV-SGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLS 707
+ +G + + PP ++ + + G+P Q L ++V+E+ PLVLF+ ET L
Sbjct: 421 PLKRNGAEENWPSAPPTPMSYIAWNCRGL--GNPCTVQELFRLVREQDPLVLFVVETGLD 480
Query: 708 SNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYH-W 767
R+ + L F V + + GGL L W + ++ S+S +HID I+ + W
Sbjct: 481 EARLEVLRCKLHFSSKLVVSRREQGGGLTLFWKQEANVTIKSYSLHHIDTVINEGMDDAW 540
Query: 768 RLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAA 827
R TGFYG P +R +W+LL L PWL GDFN LL EK+GG + ++
Sbjct: 541 RFTGFYGAPETHRRHLSWALLQSLHQQFSLPWLCMGDFNELLSMDEKQGGPVRSSRQMQD 600
Query: 828 FQNVIDSCGLLDLGFVGNRFTWCNRRPE-GTIYERLDRCFSSVAWHDIYPNCVVNHLDYH 887
F++ ID CG +DLG+ G FTWCN R + GT++ERLDR +++ W + +P + HL
Sbjct: 601 FRDAIDVCGFMDLGYQGPPFTWCNNRVDSGTVWERLDRGLATIPWFNNFPEARIFHLHAT 660
Query: 888 QSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPER 947
SDH PI LV P R+ R RF+E WL ++ V +W + G S R
Sbjct: 661 NSDHYPICLVTKP-AYTPPRAKPRPFRFEEVWLSNPGCRETVMAAWATQKNG---SHMFR 720
Query: 948 LAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGA--GSREPLSQAEA-----QLEDV 1007
+ R C + W R K GN Q++++ LR A S +S + A +++ +
Sbjct: 721 VQDKIRNCRMELRKWSRCKFGNISQQLKIKTAQLRAAEENSMRGMSHSTAFELKKEVQYL 780
Query: 1008 LQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQL 1067
L +EE W QR+R WLK GD+NTR+FH+ AS R+R N I L D+QG + +L
Sbjct: 781 LSQEERMWSQRARTGWLKGGDRNTRFFHQSASQRRRRNLITELHDNQGVTHTGDEAIGRL 840
Query: 1068 VTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAP 1103
+YF LF TS P DF+ L + V +MN L +PF +E+ A+KQ P KAP
Sbjct: 841 FEEYFDSLFKTSNP--VDFNSVLEGISPVVTVDMNTRLSQPFQRQEVDHAIKQMGPLKAP 873
BLAST of Lag0040831 vs. TAIR 10
Match:
AT1G43760.1 (DNAse I-like superfamily protein)
HSP 1 Score: 85.9 bits (211), Expect = 2.2e-16
Identity = 78/346 (22.54%), Positives = 150/346 (43.35%), Query Frame = 0
Query: 772 SDTPWLIGGDFN--ALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNR 831
+D ++ GDF+ A H P+ L FQN + L+D+ G +TW N
Sbjct: 217 TDQLMILVGDFDQIAATSDHYSVLQTSIPMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNH 276
Query: 832 RPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRIL 891
+ + I +LDR ++ W +P+ + SDH P ++L P + S++
Sbjct: 277 QDDNPIIRKLDRAIANGDWFSSFPSAIAVFELSGVSDHSPCIIILENLP----KRSKKCF 336
Query: 892 RFDETWLKQAELQQLVRDSWGSS-GEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQ 951
R+ + +W G + + + +++C + + R GN
Sbjct: 337 RYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKAAKKCCKLL---NRQGFGNIQH 396
Query: 952 KVQLAIEGLRGAGSREPLSQAEA--QLEDVLQEE--------ELYWKQRSREVWLKEGDQ 1011
K + A++ L S+ + +++ ++E V +++ E +++Q+SR WL++GD
Sbjct: 397 KTKEALDSLESIQSQLLTNPSDSLFRVEHVARKKWNFFAAALESFYRQKSRIKWLQDGDA 456
Query: 1012 NTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLF-STSEPSDQDFDV 1071
NTR+FH+ Q N I L D ++ V +++ Y+ L S S+ D
Sbjct: 457 NTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLGSDSDILTPDSVQ 516
Query: 1072 SLRDLQRSVDSEMNVDLLKPF-TEEEILRALKQSHPHKAPGPDGLS 1103
++D+ ++ L +++EI A+ +KAPGPD +
Sbjct: 517 RIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFT 555
BLAST of Lag0040831 vs. TAIR 10
Match:
AT3G42140.1 (zinc ion binding; nucleic acid binding)
HSP 1 Score: 57.0 bits (136), Expect = 1.1e-07
Identity = 41/151 (27.15%), Positives = 63/151 (41.72%), Query Frame = 0
Query: 309 FHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAM 368
F S I+ GPW+F+ + V+ R + S +F R FW+ I +PL + TA +
Sbjct: 66 FQSEESMFSILRRGPWSFNDWMCVI--QRWTKLHSDAEFKRIPFWIQIRGIPLRFLTARI 125
Query: 369 ARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYE 428
++G +G +E L R V ++K QYE
Sbjct: 126 ITSIGERMGLFLET-----------------------NLGRDVSVLK---------FQYE 182
Query: 429 RLPDFCFRCGRIGHSHRECSEEG-EGVGADN 459
+L +FC CG + H EC G +G AD+
Sbjct: 186 KLKNFCTTCGMLSHDASECPTSGNQGPHADD 182
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_042990668.1 | 1.5e-107 | 30.53 | uncharacterized protein LOC122317666 [Carya illinoinensis]
KAG2663507.1 | 5.8e-107 | 32.00 | hypothetical protein I3760_16G033000 [Carya illinoinensis]
XP_022841874.1 | 2.2e-106 | 30.36 | uncharacterized protein LOC111365549 [Olea europaea var. sylvestris]
KAG2711776.1 | 8.3e-106 | 31.63 | hypothetical protein I3760_04G092800 [Carya illinoinensis]
XP_030939698.1 | 2.1e-104 | 29.88 | uncharacterized protein LOC115964550 [Quercus lobata]
Match Name | E-value | Identity (%) | Description
A0A2N9GF83 | 3.9e-117 | 31.72 | CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2597...
A0A7N2R0C3 | 6.6e-117 | 30.12 | Reverse transcriptase domain-containing protein OS=Quercus lobata OX=97700 PE=4 ...
A0A7N2LIH6 | 4.8e-115 | 30.07 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1
A0A2N9GJ35 | 3.4e-113 | 31.38 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1
A0A2N9H1U1 | 8.4e-112 | 31.68 | Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
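The summary rows above are pipe-delimited, so they can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: the `BlastHit` type and the `parse_hit_row` function are hypothetical names introduced here for illustration. It assumes the first four fields are always match name, E-value, identity and description, and it ignores any trailing fields.

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    """One row of the hit summary table (hypothetical helper type)."""
    name: str
    evalue: float
    identity: float  # percent identity as listed in the table
    description: str


def parse_hit_row(row: str) -> BlastHit:
    """Parse one pipe-delimited summary row; extra trailing fields are ignored."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return BlastHit(name, float(evalue), float(identity), description)


# Two rows copied from the TrEMBL table above.
rows = [
    "A0A2N9GF83 | 3.9e-117 | 31.72 | CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2597...",
    "A0A7N2LIH6 | 4.8e-115 | 30.07 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1",
]
hits = [parse_hit_row(row) for row in rows]

# The hit with the smallest E-value is the most significant match.
best = min(hits, key=lambda hit: hit.evalue)
print(best.name, best.evalue, best.identity)
```

Sorting by E-value rather than by identity follows how the tables themselves are ordered: the lowest E-value comes first even when a later hit has a slightly higher identity.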