Lag0041280 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0041280
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionCCHC-type domain-containing protein
Locationchr13: 14892520 .. 14894296 (-)
RNA-Seq ExpressionLag0041280
SyntenyLag0041280
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGGATGAGCCTATTGAGGAAGTTCTGGCGAGGTTTAACATATCGGAAAAGGAATGCGAGACAGTAACAATTTCAAGCGAGGCCAAATCGGTTGATCTAGAGGACAAGAAGTGGTGCCTCATTGGTGAGGTAGTGACAAACCGGAAGTTCAACAGAGAAGCCTTCCGGAAGACGATGATGAACAGTTGGCAATGTAAGTCGGTTAAATTCGTAGAAGTTGATGAAAATGTGCTTTTGTTTTGCTTTGCGGATGCTGCGTCCATGTTATATGTACAAAACCAGGGGCCATGGCTCTTTGAGGAATCACTGTTGGTCCTTGCCAAGTGGAGCCCAAATATAAAAACAAAAGCAGAGTTGCCAAAGGTGTGTGACTTTTGGGTACAGATACATGGGCTTCCTTTTGACTGTAAAGGGCAGGAGGCAGCAAAAGTAGTGGCGCAAAAGATAGGTCGGGTTACAGATGAAGAGTGGGAGGTGGACTCTCGGAGCGTGCAACAACGAAAATTTATCAGAATTAAAGTGGAGCTAGATGTAACGAAATTGCTCATGAAAGGCTTTATAGTTATGACTGGGGGTTCCAAGAAGTGGGTATGGTTCAAGTATGAACGTCTGCCTAGGTTTTGCTCAAAGTGTGGAGTGATGGGGCATACAGCTCACTGGTGCAGTGCGAAACATCTCCAAACTTCCCCTGCGCCGCCAGTGTTTGGAGATTGGCTAAGGGCGGGTCCGGTACCGACAATGGCGAAACAAGGGGAAAGATCAGGTCATTGGCGTAGGAAGGAACAAGGTCAGGGGAAGGCACCGGAGAGCGGTGTGCCGTCGGTGGGAACAGAATCGGAGAAGGCCCCGGGCATAAATCTCGGAGGAGACGTATCTTCGGAACCACCTACGGGAATCGTGGAACCGAGGGTGGAAAATGAGCAGAATATCTCGGAGAATTTGGAACTGAATGAGGTACTACAGGTAATGGGCCAAGTAATGGGGCTAACCTGGCCGACGGGCAAACATTAGTGAGTGGGCCGAGCATTGGGCCTGAATTGAATGGGAGGCAAGGTAGTGGGAAAGCTGTTGATCAAGGCCCAGGGGGCACGCTACATAGGACCAATGCAGATGGGGCTAAGGTCAGAGGGTCATCAGTCTCACTGATTGGGTTTAAAAGGAAGTGTAGAGCTGGCGGTGTAGGTACGAGTGAACAAGCTGAGATGGGTTCGTCGAAAAGAGCTAAAGGGAGTGATAAGGGTGAGAGTTCTGATAGCATGGCTATGGAGACCGGCTTCATAGGGTCTCCAGGGGGATCATGAGTGTGCTGTGTTGGAATTGTCAGGGTGTAGGGAACCCCCTGACAGTTCGATCTCTTAAGGAGCAAGTGAAGCTCCATTCCCCAAATGTTATATTCTTGTCCGAAACAAAAAATAAGGCAAATAGGTTGGAGGGAATAAGGAGGCAATTGGGCTACGAGGGGTGTTTTGTGGTCGAGCCCCGTGGGCTCAAGGCAGGCCTTTGTCTGATGTGGAAGATTGTTGATGAGGTTGAGATCCATCAATATGCAGATTTCTTCATTGAGGCTGTGATTCGGCCTAAGACTGGCAACCCAAAATGGCATTTCTTTGGAGTCTATGCAAGCACGGATGAGAAGGAAAGAGAGCAACAACTCAGTAACCTGACTTCTAGAATTGGACTCTCACAGGATAATTACGTGGTAGGCGGGGACTTTAATGATATTGTTTGCAATGGGGAGAAGGAGGGGGCCTATACCGATCTCAAAGAAGTTTAG

mRNA sequence

ATGATGGATGAGCCTATTGAGGAAGTTCTGGCGAGGTTTAACATATCGGAAAAGGAATGCGAGACAGTAACAATTTCAAGCGAGGCCAAATCGGTTGATCTAGAGGACAAGAAGTGGTGCCTCATTGGTGAGGTAGTGACAAACCGGAAGTTCAACAGAGAAGCCTTCCGGAAGACGATGATGAACAGTTGGCAATGTAAGTCGGTTAAATTCGTAGAAGTTGATGAAAATGTGCTTTTGTTTTGCTTTGCGGATGCTGCGTCCATGTTATATGTACAAAACCAGGGGCCATGGCTCTTTGAGGAATCACTGTTGGTCCTTGCCAAGTGGAGCCCAAATATAAAAACAAAAGCAGAGTTGCCAAAGGTGTGTGACTTTTGGGTACAGATACATGGGCTTCCTTTTGACTGTAAAGGGCAGGAGGCAGCAAAAGTAGTGGCGCAAAAGATAGGTCGGGTTACAGATGAAGAGTGGGAGGTGGACTCTCGGAGCGTGCAACAACGAAAATTTATCAGAATTAAAGTGGAGCTAGATGTAACGAAATTGCTCATGAAAGGCTTTATAGTTATGACTGGGGGTTCCAAGAAGTGGGTATGGTTCAAGTATGAACGTCTGCCTAGGTTTTGCTCAAAGTGTGGAGTGATGGGGCATACAGCTCACTGGTGCAGTGCGAAACATCTCCAAACTTCCCCTGCGCCGCCAGTGTTTGGAGATTGGCTAAGGGCGGGTCCGGTACCGACAATGGCGAAACAAGGGGAAAGATCAGGTCATTGGCGTAGGAAGGAACAAGGTCAGGGGAAGGCACCGGAGAGCGGTGTGCCGTCGGTGGGAACAGAATCGGAGAAGGCCCCGGGCATAAATCTCGGAGGAGACGTATCTTCGGAACCACCTACGGGAATCGTGGAACCGAGGGTGGAAAATGAGCAGAATATCTCGGAGAATTTGGAACTGAATGAGGTACTACAGGTAATGGGCCAAGTAATGGGGCTAACCTGGCCGACGGGCAAACATTATGGGAAAGCTGTTGATCAAGGCCCAGGGGGCACGCTACATAGGACCAATGCAGATGGGGCTAAGGTCAGAGGGTCATCAGTCTCACTGATTGGGTTTAAAAGGAAGTGTAGAGCTGGCGGTGTAGGTACGAGTGAACAAGCTGAGATGGGTTCGTCGAAAAGAGCTAAAGGGAGTGATAAGGGGAACCCCCTGACAGTTCGATCTCTTAAGGAGCAAGTGAAGCTCCATTCCCCAAATGTTATATTCTTGTCCGAAACAAAAAATAAGGCAAATAGGTTGGAGGGAATAAGGAGGCAATTGGGCTACGAGGGGTGTTTTGTGGTCGAGCCCCGTGGGCTCAAGGCAGGCCTTTGTCTGATGTGGAAGATTGTTGATGAGGTTGAGATCCATCAATATGCAGATTTCTTCATTGAGGCTGTGATTCGGCCTAAGACTGGCAACCCAAAATGGCATTTCTTTGGAGTCTATGCAAGCACGGATGAGAAGGAAAGAGAGCAACAACTCAGTAACCTGACTTCTAGAATTGGACTCTCACAGGATAATTACGTGGTAGGCGGGGACTTTAATGATATTGTTTGCAATGGGGAGAAGGAGGGGGCCTATACCGATCTCAAAGAAGTTTAG

Coding sequence (CDS)

ATGATGGATGAGCCTATTGAGGAAGTTCTGGCGAGGTTTAACATATCGGAAAAGGAATGCGAGACAGTAACAATTTCAAGCGAGGCCAAATCGGTTGATCTAGAGGACAAGAAGTGGTGCCTCATTGGTGAGGTAGTGACAAACCGGAAGTTCAACAGAGAAGCCTTCCGGAAGACGATGATGAACAGTTGGCAATGTAAGTCGGTTAAATTCGTAGAAGTTGATGAAAATGTGCTTTTGTTTTGCTTTGCGGATGCTGCGTCCATGTTATATGTACAAAACCAGGGGCCATGGCTCTTTGAGGAATCACTGTTGGTCCTTGCCAAGTGGAGCCCAAATATAAAAACAAAAGCAGAGTTGCCAAAGGTGTGTGACTTTTGGGTACAGATACATGGGCTTCCTTTTGACTGTAAAGGGCAGGAGGCAGCAAAAGTAGTGGCGCAAAAGATAGGTCGGGTTACAGATGAAGAGTGGGAGGTGGACTCTCGGAGCGTGCAACAACGAAAATTTATCAGAATTAAAGTGGAGCTAGATGTAACGAAATTGCTCATGAAAGGCTTTATAGTTATGACTGGGGGTTCCAAGAAGTGGGTATGGTTCAAGTATGAACGTCTGCCTAGGTTTTGCTCAAAGTGTGGAGTGATGGGGCATACAGCTCACTGGTGCAGTGCGAAACATCTCCAAACTTCCCCTGCGCCGCCAGTGTTTGGAGATTGGCTAAGGGCGGGTCCGGTACCGACAATGGCGAAACAAGGGGAAAGATCAGGTCATTGGCGTAGGAAGGAACAAGGTCAGGGGAAGGCACCGGAGAGCGGTGTGCCGTCGGTGGGAACAGAATCGGAGAAGGCCCCGGGCATAAATCTCGGAGGAGACGTATCTTCGGAACCACCTACGGGAATCGTGGAACCGAGGGTGGAAAATGAGCAGAATATCTCGGAGAATTTGGAACTGAATGAGGTACTACAGGTAATGGGCCAAGTAATGGGGCTAACCTGGCCGACGGGCAAACATTATGGGAAAGCTGTTGATCAAGGCCCAGGGGGCACGCTACATAGGACCAATGCAGATGGGGCTAAGGTCAGAGGGTCATCAGTCTCACTGATTGGGTTTAAAAGGAAGTGTAGAGCTGGCGGTGTAGGTACGAGTGAACAAGCTGAGATGGGTTCGTCGAAAAGAGCTAAAGGGAGTGATAAGGGGAACCCCCTGACAGTTCGATCTCTTAAGGAGCAAGTGAAGCTCCATTCCCCAAATGTTATATTCTTGTCCGAAACAAAAAATAAGGCAAATAGGTTGGAGGGAATAAGGAGGCAATTGGGCTACGAGGGGTGTTTTGTGGTCGAGCCCCGTGGGCTCAAGGCAGGCCTTTGTCTGATGTGGAAGATTGTTGATGAGGTTGAGATCCATCAATATGCAGATTTCTTCATTGAGGCTGTGATTCGGCCTAAGACTGGCAACCCAAAATGGCATTTCTTTGGAGTCTATGCAAGCACGGATGAGAAGGAAAGAGAGCAACAACTCAGTAACCTGACTTCTAGAATTGGACTCTCACAGGATAATTACGTGGTAGGCGGGGACTTTAATGATATTGTTTGCAATGGGGAGAAGGAGGGGGCCTATACCGATCTCAAAGAAGTTTAG

Protein sequence

MMDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGIVEPRVENEQNISENLELNEVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLTSRIGLSQDNYVVGGDFNDIVCNGEKEGAYTDLKEV
Homology
BLAST of Lag0041280 vs. NCBI nr
Match: KAF7133372.1 (hypothetical protein RHSIM_Rhsim09G0106200 [Rhododendron simsii])

HSP 1 Score: 198.0 bits (502), Expect = 2.1e-46
Identity = 181/658 (27.51%), Postives = 284/658 (43.16%), Query Frame = 0

Query: 2   MDEPIEEVLARFNIS-EKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTM 61
           M + + +++  F++S E+E E + I  E     +E   + L+G+++T +K+N    ++++
Sbjct: 1   MADEVVDLIGNFHLSEEEEEEVIPIDDEICKQVVEACSFSLVGKLLTTKKYNVAMMKESL 60

Query: 62  MNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAE 121
             +W   +++  VEV +N+  F F +  S+  V N GPW F   LL L  W P +K    
Sbjct: 61  RRAWGSPENLTIVEVGDNLFHFRFDNENSLCKVLNGGPWNFNNHLLALKAWEPGMKVDQV 120

Query: 122 LPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSV--QQRKFIRIKVEL 181
                 FW+Q+ GLPF+    +  +++ +KIG        VD R+V   + +FIR++V +
Sbjct: 121 TFSSIYFWIQLWGLPFEFVNPKIGEIIGKKIGTFC----SVDERAVVGDKGRFIRVRVGI 180

Query: 182 DVTKLLMK-GFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---P 241
            + K L + GFI +  GSK WV +K+ERL  FC  CG + H    C  K           
Sbjct: 181 AMDKPLKRGGFIALGNGSKFWVDYKFERLNSFCFYCGSLLHDQGECGVKISDVENGVVKD 240

Query: 242 PVFGDWLRA---------------------GPV------PTMAKQGERSGHWRRKEQGQG 301
             FG W++A                     GP       PT      + G+++  E  +G
Sbjct: 241 GKFGPWMKARGGVAAGGRSQSDSRRIVGGNGPTGESFHRPTSRSGPAKGGNFKILEDMEG 300

Query: 302 K-APESGVPSVGTESEKAPGINLGGD---------------------------------V 361
               +SG  S   E + +  I + G                                   
Sbjct: 301 ALISDSGNKSGLIEIKDSARITINGKESRLGDLIPFPDKSPNKAKDPAYAPNLDAMEVAA 360

Query: 362 SSEPPTGI--------VEPRVENEQN------------------------ISENLELNEV 421
           S+ P  G+          P +E E +                        I  NL+  EV
Sbjct: 361 STGPCKGVSGLGPAIQQSPFIEKELHQPIEANLVGLNNSFADGDMSSVVGIDINLKEVEV 420

Query: 422 LQVMGQV-MGLTWPT--GKHYGKAVDQGPGGTLHRTNADGA---------KVRGSSVSLI 481
            Q  G+  +GL  P   G    K ++   G T  R     A         K    ++S++
Sbjct: 421 SQSYGEADLGLAIPLSFGVDNSKMLNSSSGRTSRRKKGKRALTQIGPTIKKSGKENISVL 480

Query: 482 GFKRKCR----AGGVGTSEQAEMGSSKRAKGSD-----KGNPLTVRSLKEQVKLHSPNVI 538
           G +R+      A G  +      G  KR   SD      GNPLTVR LKE    +SP+V+
Sbjct: 481 GKRREQSGIHFAKGAQSEAMESHGGEKRLALSDINSSGVGNPLTVRQLKEVCNSYSPDVV 540

BLAST of Lag0041280 vs. NCBI nr
Match: KAF7141361.1 (hypothetical protein RHSIM_Rhsim06G0043400 [Rhododendron simsii])

HSP 1 Score: 196.8 bits (499), Expect = 4.8e-46
Identity = 189/675 (28.00%), Postives = 287/675 (42.52%), Query Frame = 0

Query: 16  SEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEV 75
           SE+E E + +S    S  +    + L+G ++T++KFN+ AF+  +  +W    +++ VEV
Sbjct: 16  SEEEEEEIVLSEAVVSKVVSSCLFSLVGRLLTSKKFNKIAFKDCLRKAWGMTANLRIVEV 75

Query: 76  DENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLP 135
            +N+  F FA+   M  V   GPW F++ +++L +W   +       +  + W+Q+HGLP
Sbjct: 76  GDNLFHFRFAEEEGMRRVLAGGPWNFDDHIVLLKQWQEGMVESDVTFESFNVWIQLHGLP 135

Query: 136 FDCKGQEAAKVVAQKIGRVTDEEWEVDSR--SVQQRKFIRIKVELDV-TKLLMKGFIVMT 195
           F+  GQE  +V+  KIG    E  EVD R    +Q +FIR++V L V T L   G IV  
Sbjct: 136 FEYIGQEVGRVIGSKIG----ELLEVDDRLEGGEQGRFIRVRVRLKVNTPLKRGGNIVCG 195

Query: 196 GGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---PPVFGDWLRAG----- 255
           GG K WV +KYER+P FC  CG + H  + C  K             +G+W++A      
Sbjct: 196 GGRKVWVDYKYERIPAFCFYCGRVDHEENVCQIKDNDGQEGRLREGRYGEWMKAASAFRR 255

Query: 256 --------------------------------PVPTMAKQGER-------SGHWRRKEQG 315
                                           P     K GER        G WR  +  
Sbjct: 256 RREQAMEYDRNRNRGSGGVSHRTTVIASGGARPSDGYGKSGERGILTENKGGDWRDNQLV 315

Query: 316 Q----------GKAPESG--VPS---------------------VGTESEKAPGINLGG- 375
           +          GK+   G   PS                     + +  +  PG  L G 
Sbjct: 316 ELGDNSVISLNGKSHRVGDLFPSESDVNKGNSYDLINHGKENQGMQSSIDNGPGRELAGL 375

Query: 376 DVSSEPPTGIVEPRV------------ENEQNISENLELNEVL--QVMGQVMGL------ 435
            V +   TG+ + R             +N +    NL L E +  + MG   G+      
Sbjct: 376 RVLNPISTGLGKRRTTFLSSFFSDNGDQNGEYGKVNLGLPEGVNKKTMGLETGVQIEEGI 435

Query: 436 -----TWPTGKHYG------------KAVDQG-PGGTLH-----RTNADGAKVRGSSVSL 495
                  P G   G             +++ G  GGT       +    G    G + SL
Sbjct: 436 MTQTKDGPVGDFSGLEGVTLMDIPIRSSINIGTSGGTSQLIFSSKAEEVGKTRSGKAKSL 495

Query: 496 IGFKRKCRAGGVGTSEQAEMGSSKRAKGSDK---------------------GNPLTVRS 538
            G       GG G  ++  +  S    G  K                     GNPLTV +
Sbjct: 496 TG--NATSGGGRGRGQKKGIEKSTLVLGKRKTKPSSREIERLGDVNEADDGVGNPLTVHT 555

BLAST of Lag0041280 vs. NCBI nr
Match: XP_035540109.1 (uncharacterized protein LOC118344190 [Juglans regia])

HSP 1 Score: 194.9 bits (494), Expect = 1.8e-45
Identity = 154/569 (27.07%), Postives = 256/569 (44.99%), Query Frame = 0

Query: 15  ISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQ-CKSVKFVE 74
           ++E+E E + I  +A+ V  E     +IG++  +R    +    TM   W+  K   F E
Sbjct: 21  LTEEESEVLEIIDDAEEV-WEQGDMSIIGKIWLDRSIGLDVISATMGKIWRVSKPAVFRE 80

Query: 75  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGL 134
           V  N+ +  F +    + V +  PWLF+  L  L  +    +          FWVQIH L
Sbjct: 81  VGVNLFVITFRNQVDKMRVMDGRPWLFDNHLFALQMFDGYAQPNQWRFDKELFWVQIHNL 140

Query: 135 PFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGG 194
           P  C  +E  +++ + +G++   E +V +  +   KF+R++VE+ + K + +G ++   G
Sbjct: 141 PMVCMTKEKGRLIGESLGKLV--EVDVPNDEMGWGKFLRVRVEIPLMKAITRGRLIKVNG 200

Query: 195 SKKWVWFKYERLPRFCSKCGVMGHTAHWCS---AKHLQTSPAPPVFGDWLRAGP---VPT 254
            + W+  +YE+LPR C KCG + H    C       ++       FG WLRA P      
Sbjct: 201 QEVWINLRYEKLPRVCFKCGRIVHGYKGCEMGLEGTVKERGMNEQFGVWLRADPGFRRRF 260

Query: 255 MAKQGERSG-HWRRKEQG------QGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGI 314
              QG   G  W RK +        G    SG   + +E     G   G DV        
Sbjct: 261 NKGQGTEGGDRWNRKGEDVESTNTGGGRKGSGEGELYSEKVVEGGRGEGYDVGEGQNVVC 320

Query: 315 VEPRVENEQNISENLELN-----EVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNA 374
           V  R+   +   E  ++      +++Q+     G       +  + + +G    +    +
Sbjct: 321 VNERILGWERSEEETDMGGKNKMKIVQINENERGEI-----NENEKIGEGEIEKVLSAES 380

Query: 375 DGAKVRGSSVSLIGFKRKCRAGGVGTSE---QAEMGSSKRAKGSDKGNPLTVRSLKEQ-- 434
           +G     ++V L G K K RA G+G SE    + +G  +   G D+ + +  R++K+   
Sbjct: 381 EGVGKDNNAVVLKG-KWKRRARGIGRSEGLDGSSIGEKRGLAGEDEVSNIKCRAVKKSKR 440

Query: 435 ----------------------VKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEP 494
                                 VK   P+++FL ETK ++N++E I+R+ G++GC VVEP
Sbjct: 441 EQEGKEQNVTISMAGAASQPRIVKEKQPDIVFLMETKLRSNKVEAIQRKGGFKGCIVVEP 500

Query: 495 RGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSN 538
            GL  GL +MWK VDEVE+  Y+ + I   +R +  N +W   G Y   D  +R+     
Sbjct: 501 IGLGGGLLMMWKDVDEVELCNYSQWHISVWVRNEKNNERWLLTGFYGDPDVSKRDMSWDL 560

BLAST of Lag0041280 vs. NCBI nr
Match: KAF7150653.1 (hypothetical protein RHSIM_Rhsim02G0038900 [Rhododendron simsii])

HSP 1 Score: 193.7 bits (491), Expect = 4.0e-45
Identity = 168/636 (26.42%), Postives = 283/636 (44.50%), Query Frame = 0

Query: 2   MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMM 61
           M + + E++   ++S++E + + I+ E     +E   + L+G+++T++KFN    + ++ 
Sbjct: 1   MADEVVELIGNCHLSDEEDDVIPIADEVCKQAVEACSFSLVGKLLTSKKFNVTVMKDSLR 60

Query: 62  NSW-QCKSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAEL 121
            +W   +++  VEV +N+  F F    ++  V N GPW F+  LLVL +W   +K +   
Sbjct: 61  RAWGSPENLHIVEVGDNLFHFRFDSETNLRKVLNGGPWNFDNYLLVLQEWESGMKAEQVS 120

Query: 122 PKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSV--QQRKFIRIKVELD 181
            ++  FWVQ+ GLPF+       +++ ++IG      + VD+R+V  ++ +FIR++V + 
Sbjct: 121 FQLVPFWVQLWGLPFEFVNPVIGEIIGKRIGSF----FSVDNRAVMGERGRFIRVRVGVP 180

Query: 182 VTKLLMK-GFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA---PP 241
           V K L + GFI +  G+K WV +K+ERL RFC  CG + H    C  +            
Sbjct: 181 VDKPLKRGGFIALGNGTKFWVDYKFERLNRFCYYCGSLLHEHGNCGVRSSDEGIGILKEG 240

Query: 242 VFGDWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS- 301
            FG W++AG     A + + +  W       G A +  + +VG+   K  G      +S 
Sbjct: 241 KFGAWMKAGGGGAGAGRQQPNSKW-----FSGGARDGVLRTVGSGKFKLGGDKDSAQISD 300

Query: 302 SEPPTGIVEPRVENEQNIS-ENLELNEVL-----QVMGQVMGLTWPTGKHYGKAVDQGP- 361
            E  +G++E +  +   I+ +   + ++L      V+G   G+  P+G         GP 
Sbjct: 301 KENNSGLIEIKDNDWVAINGKETRMGDLLPKSKDSVLGGNKGVLGPSGSDVDGLGGNGPE 360

Query: 362 -----GGTLHRTNADGA----KVRGSSVSLIGFKRKCRAGGVGTSEQ------------- 421
                G  L + N +GA     +  S +   G +   +  GVG  E              
Sbjct: 361 LCGGLGPVLSQGNPNGAGSSFLIEASPLGHYG-QAASKITGVGPIESSPDLNLQDVVVSQ 420

Query: 422 ----------------------AEMGSSKRAKGSDK------------------------ 481
                                 +  G +KR+KG  +                        
Sbjct: 421 SFDEATLGPAFSFGAAHSKIVASSSGGAKRSKGKKRNGVQSDSKLRRSEKENSGVTGKKR 480

Query: 482 -----------------GNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYE 538
                            GNPLT+R LK    L+SP+++FL+ETKNK  ++E I + +G  
Sbjct: 481 LLNDEQNVKGTNLEGGVGNPLTIRQLKGVCNLYSPDLVFLAETKNKKEKIEKIAKIVGLG 540

BLAST of Lag0041280 vs. NCBI nr
Match: XP_035544642.1 (uncharacterized protein LOC109020982 [Juglans regia])

HSP 1 Score: 186.4 bits (472), Expect = 6.4e-43
Identity = 155/567 (27.34%), Postives = 251/567 (44.27%), Query Frame = 0

Query: 2   MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMM 61
           M E +      F ++E+E   + + S+   + +   K+CL+G ++  +  N+EAFR TM+
Sbjct: 1   MTEELTRQWKNFKLTEQESTEMVLPSDTMEMAMHQGKFCLLGMIIAEKPINKEAFRNTMI 60

Query: 62  NSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAEL 121
             W+ +  V+F EV EN  L  F        V    PW F+  LL L  +  N+      
Sbjct: 61  KVWRSEGWVQFTEVGENRFLIEFNKDEDRQRVIKGRPWSFDRWLLCLHAFEGNMPVNDIQ 120

Query: 122 PKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT 181
               +FW+Q+H +P     +E  K + + +G+V   +   D R +   KF+RI+VE+ +T
Sbjct: 121 FTREEFWLQVHNMPLGTMTEEVGKQIGKNVGKVL--KVHSDDRGIGWGKFMRIRVEMYIT 180

Query: 182 KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSP-APP--VFG 241
           K LM+G  +   G K WV FKYERLP FC KCGV+ H    C     Q+SP A P   +G
Sbjct: 181 KALMRGMFLTFDGRKTWVQFKYERLPTFCLKCGVIRHNGKSC-----QSSPKASPQNQYG 240

Query: 242 DWLRAGPVPTMAKQGE----RSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS 301
            WLRA      AK+G+    R GH   ++             VG E      +NLG  VS
Sbjct: 241 VWLRA----PAAKEGDINQKRYGHDSSQQTSGADHSWRNDGKVGDE------VNLGKAVS 300

Query: 302 SE---PPTG----------IVEPRVENEQNISENLEL--NEVLQVMGQVMGLTWPTGKHY 361
            E    P             ++P  E   ++ E+L     ++ +V   +M L   TG   
Sbjct: 301 KEGFKDPINTSDPIQENCVFLQPTKETSADLVESLMATDTQLPKVNVTLMDLEGDTGLPT 360

Query: 362 GKAVDQGPGGTLHRTN--ADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMG------SS 421
            K  D  P   + + +    G   R SS+         R   +  +  + +         
Sbjct: 361 LKE-DPTPTREVQQKSPPKPGPSKRDSSLKNYEPNSDLRPRLILLTNSSRIKPRFWQLPK 420

Query: 422 KRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRG 481
           +   G ++     +      +K   P+ +FL ETK K +++E ++R + ++  FV++ +G
Sbjct: 421 REPLGKEEPERSAMLICFLSLKSKLPSFVFLMETKCKRHKVECVKRLIKFDNSFVIDCKG 480

Query: 482 LKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLT 538
              GL  +WK   E E+H Y+   I  +++       W   G Y S     R+     L 
Sbjct: 481 FSGGLAFLWKNDVEAEVHSYSQNHISLMVKGAKEREDWLLTGFYGSPVTARRQSSWKLLQ 540

BLAST of Lag0041280 vs. ExPASy TrEMBL
Match: A0A7N2M6Y1 (CCHC-type domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 1.5e-50
Identity = 167/595 (28.07%), Postives = 274/595 (46.05%), Query Frame = 0

Query: 2   MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMM 61
           MD+ +   L    ++ +E E + +++ + S +LE+    L G ++++R  N  A + T+ 
Sbjct: 1   MDQEVVNSLGNLKLTREEEEDIVVANSSSSGNLEECSLSLFGRLLSDRHQNLRALKNTLR 60

Query: 62  NSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAEL 121
            +W+  S ++ VEV  ++L F F     + +V+  GPW FE +LL+L +W   + +K   
Sbjct: 61  AAWKMGSDLRIVEVGNSILQFKFGSKCQLEWVEKSGPWNFENNLLLLCRWRKGLTSKNIC 120

Query: 122 PKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQ--QRKFIRIKVELD 181
                FWVQI GLPF+   ++  + +  KIG+V     EVD R++Q  Q KF+R++VE+ 
Sbjct: 121 FSHSPFWVQIWGLPFENMVEDFGREIGSKIGKVL----EVDKRALQADQAKFLRVRVEVQ 180

Query: 182 VTKLLMK-GFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPVFG 241
           + K L + GF+      + WV F+YERLP FC KCG++GH    C    ++ +PA   +G
Sbjct: 181 LDKPLRRGGFVKNDENDRIWVDFRYERLPIFCYKCGILGHDDKHCLVNPME-NPAGNQYG 240

Query: 242 DWLRAGPVPTMAKQGERSGHWRRKEQGQGKAPESGVPSVGTES----------EKAPGIN 301
           +WL+AG      K G   G+++ K+  Q  A   G  S+G  S             P + 
Sbjct: 241 EWLKAGGA---LKDG--GGNFKLKQ--QANAEMRGADSMGLNSNIKERNGGSGHSCPALA 300

Query: 302 LGGDVSSEPPTGIVEPRVENEQNI-------------SENLELNEVLQVMGQVMGLTWPT 361
           +GG      P   V    E  +               S   E  E ++    V      +
Sbjct: 301 VGGGSGHSCPALAVGGGTEGRKGTVSLMDTDRMECVKSNPREHGEKVRTEDDVAD-ALRS 360

Query: 362 GKHYGKAVDQGPGGTLHRTNADGAKV------RGSSVSL-IGFKRKCRAGGVG----TSE 421
           G+      D  PGG     N  G +       RG + ++   FKR  R  G       +E
Sbjct: 361 GEVQKSPQDMRPGGN-GELNCHGEEAVPLCQERGLAQTIRRKFKRMARDKGKAQESVNAE 420

Query: 422 QAEMGSSKRAKGSD---------------------KGNPLTVRSLKEQVKLHSPNVIFLS 481
           +A+  S+KR   +D                      GNP +VR L+E V+   P ++FLS
Sbjct: 421 KAQEVSNKRKALTDVLFASEETAQKRLCFGWNCRGLGNPRSVRVLRELVQRWKPGIVFLS 480

Query: 482 ETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPK 538
           ETK K  ++  ++ ++G     +V   G   GL ++W     +E+  Y+ +FI+AV+   
Sbjct: 481 ETKMKNYQMNKVKFKIGLLNGLIVPSTGRSGGLAMLWNRDIHLEVQSYSRYFIDAVVTEV 540

BLAST of Lag0041280 vs. ExPASy TrEMBL
Match: A0A6P9DXY5 (uncharacterized protein LOC118344190 OS=Juglans regia OX=51240 GN=LOC118344190 PE=4 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 8.8e-46
Identity = 154/569 (27.07%), Postives = 256/569 (44.99%), Query Frame = 0

Query: 15  ISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQ-CKSVKFVE 74
           ++E+E E + I  +A+ V  E     +IG++  +R    +    TM   W+  K   F E
Sbjct: 21  LTEEESEVLEIIDDAEEV-WEQGDMSIIGKIWLDRSIGLDVISATMGKIWRVSKPAVFRE 80

Query: 75  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGL 134
           V  N+ +  F +    + V +  PWLF+  L  L  +    +          FWVQIH L
Sbjct: 81  VGVNLFVITFRNQVDKMRVMDGRPWLFDNHLFALQMFDGYAQPNQWRFDKELFWVQIHNL 140

Query: 135 PFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVTKLLMKGFIVMTGG 194
           P  C  +E  +++ + +G++   E +V +  +   KF+R++VE+ + K + +G ++   G
Sbjct: 141 PMVCMTKEKGRLIGESLGKLV--EVDVPNDEMGWGKFLRVRVEIPLMKAITRGRLIKVNG 200

Query: 195 SKKWVWFKYERLPRFCSKCGVMGHTAHWCS---AKHLQTSPAPPVFGDWLRAGP---VPT 254
            + W+  +YE+LPR C KCG + H    C       ++       FG WLRA P      
Sbjct: 201 QEVWINLRYEKLPRVCFKCGRIVHGYKGCEMGLEGTVKERGMNEQFGVWLRADPGFRRRF 260

Query: 255 MAKQGERSG-HWRRKEQG------QGKAPESGVPSVGTESEKAPGINLGGDVSSEPPTGI 314
              QG   G  W RK +        G    SG   + +E     G   G DV        
Sbjct: 261 NKGQGTEGGDRWNRKGEDVESTNTGGGRKGSGEGELYSEKVVEGGRGEGYDVGEGQNVVC 320

Query: 315 VEPRVENEQNISENLELN-----EVLQVMGQVMGLTWPTGKHYGKAVDQGPGGTLHRTNA 374
           V  R+   +   E  ++      +++Q+     G       +  + + +G    +    +
Sbjct: 321 VNERILGWERSEEETDMGGKNKMKIVQINENERGEI-----NENEKIGEGEIEKVLSAES 380

Query: 375 DGAKVRGSSVSLIGFKRKCRAGGVGTSE---QAEMGSSKRAKGSDKGNPLTVRSLKEQ-- 434
           +G     ++V L G K K RA G+G SE    + +G  +   G D+ + +  R++K+   
Sbjct: 381 EGVGKDNNAVVLKG-KWKRRARGIGRSEGLDGSSIGEKRGLAGEDEVSNIKCRAVKKSKR 440

Query: 435 ----------------------VKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEP 494
                                 VK   P+++FL ETK ++N++E I+R+ G++GC VVEP
Sbjct: 441 EQEGKEQNVTISMAGAASQPRIVKEKQPDIVFLMETKLRSNKVEAIQRKGGFKGCIVVEP 500

Query: 495 RGLKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSN 538
            GL  GL +MWK VDEVE+  Y+ + I   +R +  N +W   G Y   D  +R+     
Sbjct: 501 IGLGGGLLMMWKDVDEVELCNYSQWHISVWVRNEKNNERWLLTGFYGDPDVSKRDMSWDL 560

BLAST of Lag0041280 vs. ExPASy TrEMBL
Match: A0A2N9HE28 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS37812 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 3.1e-43
Identity = 155/605 (25.62%), Postives = 258/605 (42.64%), Query Frame = 0

Query: 2   MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMM 61
           M + + + L +  ++++E +++ I+   +++ LE+ +  L+G  +T+R  N  A + T+ 
Sbjct: 1   MADHLADSLLKVKLTDEEEDSIVITGANRAIALEECEHSLLGRFLTHRPPNLRAAKTTLR 60

Query: 62  NSWQC-KSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAEL 121
            +W+    V+ V+V   ++ F F +A  M +V    PW F+  LL+L +W   + T A L
Sbjct: 61  TAWRMGDEVRMVDVGPGLIQFKFRNAFQMRWVLEHSPWNFDNHLLLLRRWESGM-TPANL 120

Query: 122 PKV-CDFWVQIHGLPFDCKGQEAAKVVAQKIGR---VTDEEWEVDSRSVQQRKFIRIKVE 181
                 FW+Q+ G+PFD   +E  + + +KIGR   V    W  D     Q   +RI+VE
Sbjct: 121 TFTHALFWIQVWGVPFDLMAEEVGEAIGRKIGRFIKVDRSPWFGD-----QASNLRIRVE 180

Query: 182 LDVTK-LLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPV 241
           + + K LL  GF++   G + WV +KYERL  FC +CG+MGH    C+    Q   +   
Sbjct: 181 VPIDKPLLRGGFVLSPDGQRVWVQYKYERLASFCHRCGLMGHATTQCNTTPQQEDASTLP 240

Query: 242 FGDWLRAG---------------------------PVPTMAKQGERSGHWRRKE------ 301
           +G+WL+AG                           P P      E   H    +      
Sbjct: 241 YGEWLKAGHTARGPQPQPRPTNKPANQTNPVASNPPPPPTPNSPESHPHPPNPDTTIKAN 300

Query: 302 ----QGQGKAP-----------------ESGVPSVGTESEKAPGI-----NLGGDVSSEP 361
                  G  P                 + G P + T +   P I      LGGD  ++ 
Sbjct: 301 ISPSDNHGDLPNPSSSEDHGLPKNKTLTKHGQPDMATITGPTPIIAVSEMTLGGDPRADT 360

Query: 362 PTGIVE-PRVENEQNISENLELNEVLQVMGQVMGL-TWPTGKHYGKAVDQGPGGTLHRTN 421
           P  +   P++E     S N   N+  ++  +   L +W       K  ++G   TL  +N
Sbjct: 361 PVDLPSTPKIE-----SANHYANQFEEIKTKTKALKSW-------KRTEKGE--TLSVSN 420

Query: 422 ADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLH 481
                        +G KR   A     +E +      +   +  GNP  +R L+   K  
Sbjct: 421 P------RMEAPTVGQKRISYAISDADTEDSPTPKKTKRPLTGLGNPKAIRILRNLAKEK 480

Query: 482 SPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFF 540
            P ++FL ETK    R+E +R  +G++  F V  +G   GL LMWK   EV +  ++   
Sbjct: 481 DPVMMFLIETKLDVKRMEKVRASVGFQYVFTVPSKGRSGGLALMWKDSIEVRVQTFSQHH 540

BLAST of Lag0041280 vs. ExPASy TrEMBL
Match: A0A2N9FJM4 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS15003 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 3.1e-43
Identity = 155/605 (25.62%), Postives = 258/605 (42.64%), Query Frame = 0

Query: 2   MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMM 61
           M + + + L +  ++++E +++ I+   +++ LE+ +  L+G  +T+R  N  A + T+ 
Sbjct: 1   MADHLADSLLKVKLTDEEEDSIVITGANRAIALEECEHSLLGRFLTHRPPNLRAAKTTLR 60

Query: 62  NSWQC-KSVKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAEL 121
            +W+    V+ V+V   ++ F F +A  M +V    PW F+  LL+L +W   + T A L
Sbjct: 61  TAWRMGDEVRMVDVGPGLIQFKFRNAFQMRWVLEHSPWNFDNHLLLLRRWESGM-TPANL 120

Query: 122 PKV-CDFWVQIHGLPFDCKGQEAAKVVAQKIGR---VTDEEWEVDSRSVQQRKFIRIKVE 181
                 FW+Q+ G+PFD   +E  + + +KIGR   V    W  D     Q   +RI+VE
Sbjct: 121 TFTHALFWIQVWGVPFDLMAEEVGEAIGRKIGRFIKVDRSPWFGD-----QASNLRIRVE 180

Query: 182 LDVTK-LLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPAPPV 241
           + + K LL  GF++   G + WV +KYERL  FC +CG+MGH    C+    Q   +   
Sbjct: 181 VPIDKPLLRGGFVLSPDGQRVWVQYKYERLASFCHRCGLMGHATTQCNTTPQQEDASTLP 240

Query: 242 FGDWLRAG---------------------------PVPTMAKQGERSGHWRRKE------ 301
           +G+WL+AG                           P P      E   H    +      
Sbjct: 241 YGEWLKAGHTARGPQPQPRPTNKPANQTNPVASNPPPPPTPNSPESHPHPPNPDTTIKAN 300

Query: 302 ----QGQGKAP-----------------ESGVPSVGTESEKAPGI-----NLGGDVSSEP 361
                  G  P                 + G P + T +   P I      LGGD  ++ 
Sbjct: 301 ISPSDNHGDLPNPSSSEDHGLPKNKTLTKHGQPDMATITGPTPIIAVSEMTLGGDPRADT 360

Query: 362 PTGIVE-PRVENEQNISENLELNEVLQVMGQVMGL-TWPTGKHYGKAVDQGPGGTLHRTN 421
           P  +   P++E     S N   N+  ++  +   L +W       K  ++G   TL  +N
Sbjct: 361 PVDLPSTPKIE-----SANHYANQFEEIKTKTKALKSW-------KRTEKGE--TLSVSN 420

Query: 422 ADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMGSSKRAKGSDKGNPLTVRSLKEQVKLH 481
                        +G KR   A     +E +      +   +  GNP  +R L+   K  
Sbjct: 421 P------RMEAPTVGQKRISYAISDADTEDSPTPKKTKRPLTGLGNPKAIRILRNLAKEK 480

Query: 482 SPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRGLKAGLCLMWKIVDEVEIHQYADFF 540
            P ++FL ETK    R+E +R  +G++  F V  +G   GL LMWK   EV +  ++   
Sbjct: 481 DPVMMFLIETKLDVKRMEKVRASVGFQYVFTVPSKGRSGGLALMWKDSIEVRVQTFSQHH 540

BLAST of Lag0041280 vs. ExPASy TrEMBL
Match: A0A6P9EQ08 (uncharacterized protein LOC109020982 OS=Juglans regia OX=51240 GN=LOC109020982 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 3.1e-43
Identity = 155/567 (27.34%), Postives = 251/567 (44.27%), Query Frame = 0

Query: 2   MDEPIEEVLARFNISEKECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMM 61
           M E +      F ++E+E   + + S+   + +   K+CL+G ++  +  N+EAFR TM+
Sbjct: 1   MTEELTRQWKNFKLTEQESTEMVLPSDTMEMAMHQGKFCLLGMIIAEKPINKEAFRNTMI 60

Query: 62  NSWQCKS-VKFVEVDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAEL 121
             W+ +  V+F EV EN  L  F        V    PW F+  LL L  +  N+      
Sbjct: 61  KVWRSEGWVQFTEVGENRFLIEFNKDEDRQRVIKGRPWSFDRWLLCLHAFEGNMPVNDIQ 120

Query: 122 PKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT 181
               +FW+Q+H +P     +E  K + + +G+V   +   D R +   KF+RI+VE+ +T
Sbjct: 121 FTREEFWLQVHNMPLGTMTEEVGKQIGKNVGKVL--KVHSDDRGIGWGKFMRIRVEMYIT 180

Query: 182 KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSP-APP--VFG 241
           K LM+G  +   G K WV FKYERLP FC KCGV+ H    C     Q+SP A P   +G
Sbjct: 181 KALMRGMFLTFDGRKTWVQFKYERLPTFCLKCGVIRHNGKSC-----QSSPKASPQNQYG 240

Query: 242 DWLRAGPVPTMAKQGE----RSGHWRRKEQGQGKAPESGVPSVGTESEKAPGINLGGDVS 301
            WLRA      AK+G+    R GH   ++             VG E      +NLG  VS
Sbjct: 241 VWLRA----PAAKEGDINQKRYGHDSSQQTSGADHSWRNDGKVGDE------VNLGKAVS 300

Query: 302 SE---PPTG----------IVEPRVENEQNISENLEL--NEVLQVMGQVMGLTWPTGKHY 361
            E    P             ++P  E   ++ E+L     ++ +V   +M L   TG   
Sbjct: 301 KEGFKDPINTSDPIQENCVFLQPTKETSADLVESLMATDTQLPKVNVTLMDLEGDTGLPT 360

Query: 362 GKAVDQGPGGTLHRTN--ADGAKVRGSSVSLIGFKRKCRAGGVGTSEQAEMG------SS 421
            K  D  P   + + +    G   R SS+         R   +  +  + +         
Sbjct: 361 LKE-DPTPTREVQQKSPPKPGPSKRDSSLKNYEPNSDLRPRLILLTNSSRIKPRFWQLPK 420

Query: 422 KRAKGSDKGNPLTVRSLKEQVKLHSPNVIFLSETKNKANRLEGIRRQLGYEGCFVVEPRG 481
           +   G ++     +      +K   P+ +FL ETK K +++E ++R + ++  FV++ +G
Sbjct: 421 REPLGKEEPERSAMLICFLSLKSKLPSFVFLMETKCKRHKVECVKRLIKFDNSFVIDCKG 480

Query: 482 LKAGLCLMWKIVDEVEIHQYADFFIEAVIRPKTGNPKWHFFGVYASTDEKEREQQLSNLT 538
              GL  +WK   E E+H Y+   I  +++       W   G Y S     R+     L 
Sbjct: 481 FSGGLAFLWKNDVEAEVHSYSQNHISLMVKGAKEREDWLLTGFYGSPVTARRQSSWKLLQ 540

BLAST of Lag0041280 vs. TAIR 10
Match: AT3G31430.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G18636.1); Has 295 Blast hits to 291 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 295; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 82.8 bits (203), Expect = 9.4e-16
Identity = 53/189 (28.04%), Postives = 88/189 (46.56%), Query Frame = 0

Query: 36  DKKWCLIGEVVTNRKFNREAFRKTMMNSW-QCKSVKFVEVDENVLLFCFADAASMLYVQN 95
           + ++ L G  V  R+ N  +   +M   W Q   V    ++     F F    S+  V  
Sbjct: 129 ENRFILFGRPVMPRRQNLRSIVASMPRIWGQSGLVHGRIMEGRQFHFIFTLEESLETVLR 188

Query: 96  QGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFDCKGQEAAKVVAQKIGRVT 155
           +GPW F + +++L +W P I     +P    FWVQI G+PF    +   + + + +G+V 
Sbjct: 189 RGPWAFNDWMILLQRWEPQIPLFPFIP----FWVQIRGIPFQFLNRGVVEHIGRALGQVL 248

Query: 156 DEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTGGSKKWVWFKYERLPRFCSKCG 215
           D ++ V+   V +  F R+ +  D+T  L  +     T G    + F+YERL  FC  CG
Sbjct: 249 DTDFNVE--VVARMDFARVLLHWDITHPLRFQRHFQFTAGVNTLLRFRYERLRGFCEVCG 308

Query: 216 VMGHTAHWC 223
           ++ H    C
Sbjct: 309 MLTHDFGAC 311

BLAST of Lag0041280 vs. TAIR 10
Match: AT2G17920.1 (nucleic acid binding;zinc ion binding )

HSP 1 Score: 79.7 bits (195), Expect = 7.9e-15
Identity = 56/207 (27.05%), Postives = 93/207 (44.93%), Query Frame = 0

Query: 18  KECETVTISSEAKSVDLEDKKWCLIGEVVTNRKFNREAFRKTMMNSWQCKS-VKFVEVDE 77
           +E  ++ I +EA  +     +  +I   +  R  N +A    +  +W   + V    +D+
Sbjct: 16  QEGPSLFIPNEAYIMVAGRNRLSIIARPLNPRVQNLQAIITALPRAWGLTAHVHGRIIDD 75

Query: 78  NVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGLPFD 137
             + F F     +L VQ + PWLF    +   +W P            D WVQ+ G+PF 
Sbjct: 76  TYVQFLFQSEMDLLSVQRREPWLFNNWFVASQRWQP--APALNFVTTIDLWVQMRGIPFL 135

Query: 138 CKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTGGSK 197
              +E A  +AQ+IG +   ++  D+ S  Q  +IR++V + +T  L     I    G  
Sbjct: 136 YVSEETALEIAQEIGAIISLDFH-DTTST-QIAYIRVRVRVGITDSLRFFQRITFESGES 195

Query: 198 KWVWFKYERLPRFCSKCGVMGHTAHWC 223
             + F+YERL R CS C    H  ++C
Sbjct: 196 ALIRFQYERLRRICSNCFRFTHNRNYC 218

BLAST of Lag0041280 vs. TAIR 10
Match: AT5G18636.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25200.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 79.0 bits (193), Expect = 1.4e-14
Identity = 45/160 (28.12%), Postives = 71/160 (44.38%), Query Frame = 0

Query: 74  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGL 133
           +D   + F FA+   ++ VQ + PWLF    +   +W   +     L    D WVQI G+
Sbjct: 73  LDATYVQFVFANEIDLMMVQRREPWLFNNWFVAATRW--QVAPAHNLVTTIDLWVQIRGI 132

Query: 134 PFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTG 193
           P     +E    +AQ +G +   ++     +  Q  FIR++V   +T +L     I+   
Sbjct: 133 PLPYVSEETVLEIAQDLGEIISLDFH--EATSPQIAFIRVRVRFGITDRLRFFQRIIFDS 192

Query: 194 GSKKWVWFKYERLPRFCSKCGVMGHTAHWCSAKHLQTSPA 233
           G    + F+YERL R CS C    H   +C  +    S A
Sbjct: 193 GETATIRFQYERLRRLCSSCFRFTHNRAYCPYRQRSLSIA 228

BLAST of Lag0041280 vs. TAIR 10
Match: AT5G25200.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G18636.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 77.8 bits (190), Expect = 3.0e-14
Identity = 43/150 (28.67%), Postives = 68/150 (45.33%), Query Frame = 0

Query: 74  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGL 133
           +D   + F FA+   ++ VQ + PWLF    +   +W   +     L    D WVQI G+
Sbjct: 73  LDATYVQFLFANEIDLMMVQRREPWLFNNWFVAATRW--QVAPAHNLVTTIDLWVQIRGI 132

Query: 134 PFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTG 193
           P     +E    +AQ +G +   ++     +  Q  FIR++V   +T +L     I+   
Sbjct: 133 PLPYVSEETVLEIAQDLGEIISLDFH--EATSPQIAFIRVRVRFGITDRLRFFQRIIFDS 192

Query: 194 GSKKWVWFKYERLPRFCSKCGVMGHTAHWC 223
           G    + F+YERL R CS C    H   +C
Sbjct: 193 GETATIRFQYERLRRLCSSCFRFTHNRAYC 218

BLAST of Lag0041280 vs. TAIR 10
Match: AT2G02103.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25200.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 76.6 bits (187), Expect = 6.7e-14
Identity = 42/150 (28.00%), Postives = 67/150 (44.67%), Query Frame = 0

Query: 74  VDENVLLFCFADAASMLYVQNQGPWLFEESLLVLAKWSPNIKTKAELPKVCDFWVQIHGL 133
           +D   + F FA+   +L VQ + PWLF    +   +W   +     L    D WVQI G+
Sbjct: 73  LDATYVQFLFANEIDLLMVQRREPWLFNNWFVAATRW--QVAPAHNLVTTIDLWVQIRGI 132

Query: 134 PFDCKGQEAAKVVAQKIGRVTDEEWEVDSRSVQQRKFIRIKVELDVT-KLLMKGFIVMTG 193
           P     +E    +A  +G +   ++     +  Q  FIR++V   +T +L     ++   
Sbjct: 133 PLPYVSEETVMEIAHDLGEIISLDFH--EATSPQIAFIRVRVRFGITDRLRFFQRVIFDS 192

Query: 194 GSKKWVWFKYERLPRFCSKCGVMGHTAHWC 223
           G    + F+YERL R CS C    H   +C
Sbjct: 193 GETATIRFQYERLRRLCSSCFRFTHNRAYC 218

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAF7133372.12.1e-4627.51hypothetical protein RHSIM_Rhsim09G0106200 [Rhododendron simsii][more]
KAF7141361.14.8e-4628.00hypothetical protein RHSIM_Rhsim06G0043400 [Rhododendron simsii][more]
XP_035540109.11.8e-4527.07uncharacterized protein LOC118344190 [Juglans regia][more]
KAF7150653.14.0e-4526.42hypothetical protein RHSIM_Rhsim02G0038900 [Rhododendron simsii][more]
XP_035544642.16.4e-4327.34uncharacterized protein LOC109020982 [Juglans regia][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A7N2M6Y11.5e-5028.07CCHC-type domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
A0A6P9DXY58.8e-4627.07uncharacterized protein LOC118344190 OS=Juglans regia OX=51240 GN=LOC118344190 P... [more]
A0A2N9HE283.1e-4325.62Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9FJM43.1e-4325.62Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A6P9EQ083.1e-4327.34uncharacterized protein LOC109020982 OS=Juglans regia OX=51240 GN=LOC109020982 P... [more]
Match NameE-valueIdentityDescription
AT3G31430.19.4e-1628.04unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G17920.17.9e-1527.05nucleic acid binding;zinc ion binding [more]
AT5G18636.11.4e-1428.13unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G25200.13.0e-1428.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G02103.16.7e-1428.00unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025558Domain of unknown function DUF4283PFAMPF14111DUF4283coord: 35..175
e-value: 6.1E-25
score: 87.5
IPR025836Zinc knuckle CX2CX4HX4CPFAMPF14392zf-CCHC_4coord: 177..223
e-value: 3.0E-12
score: 46.0
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 393..544
e-value: 1.5E-12
score: 49.8
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 404..533
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 248..298
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 379..401
NoneNo IPR availablePANTHERPTHR31286:SF84SUBFAMILY NOT NAMEDcoord: 11..224
IPR040256Uncharacterized protein At4g02000-likePANTHERPTHR31286GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8-LIKEcoord: 11..224

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0041280.1Lag0041280.1mRNA