Cp4.1LG01g21450 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g21450
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: SAD1/UNC-84 domain protein 1
Location: Cp4.1LG01 : 18142746 .. 18146619 (-)
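The location string packs the linkage group, the start and end coordinates, and the strand into one field. A minimal sketch of pulling it apart is shown here; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01 : 18142746 .. 18146619 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG01 - 3874
```

Under that assumption the feature spans 3874 bases on the minus strand of Cp4.1LG01.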
The following sequences are available for this feature:

Gene sequence (with intron)

CAAACGCCGGTTGCTTTCATCCGATGCGTCGGTTAATTTAATTCAGGTCATGCACTCAATTATTCTCGATTCGGAAATCAAATTCTGCCACCGTATCCTCCGTTTTCCCTCGAAATTTCTCTGTTTTTTTCTTAGAAATTTCAAAATCTCAGTGTATACAAAATTCCTTTTCAATCTCTTTGCGTTTTACATTCTTGTATGTTCGTATGTTCTTCAATTTCGTTCTTCACTGTTAGGGCTTTTGTTTTGCATTTTGGGGAATTTATTTCTGCTGTCGGAATCGTGAAATTTTCTCCACAGAAATGTCGGCTTCGACCGTGTCGATAACGGCGAATCCGGCGGCGCGGAGGAGGCCGGTGTTGGTTTCGGAGAAGAAGGGGGCTAGTATTGAATTGTTGGCAACGGATGGCGCGAATCCTCTATCGAATACTGCTACTGCGGGGACTGGTGGTGCGGCCGATGATAAGGTCGCCGGTGGAATTGGTAGGGATATGAGCCATCACTCGATTAGAGGTGAGGTTGTTCTCGAGCGGTCATCTAGGGATCCACTTCAGATCAAGAAAACTGTTGCGAATTCCACCATTTCGCCGCGGCGGAGTAGAAAGGCGATTACGAAGCCGGAGAAGCCTCGATGGGCGACTATTCTTAGCGTCGTTACGAAGAATATTGTGCTTTTGATGGTTCTTCTAGGGTTGGTTCAAATGATTAGGAAATTAGCCCTAAAATCAGGGGATGGAGCGGTGGGAAATCAAATGGGGTTTTCGGAAGTTGAGGGACGGATTGCAGAAGTTGAAGCATTCTTGAAGACTACCACCAAGATGATTCAGGTTCAGGTGGAGGTCGTCGATCGGAAGATCGAGAACGAAGTTGGGGGATTGAGAAGGGAAGTTGACCATAAGATTGAGGCAAAAACTGCTGATCTTGAGAGCGGTTTGAAGGTGCTGCAGGATAAAGGGGAGGATCTGGAAAGGTCTTTGAGCGAGTTAACGGCCGTGGATTGGTTGTCGAAGCAAGAATTCGATAGGATTTACGACGAATTGAAGAAGGCGAAGAGCGGTGAAATTGATGAGCGGTTTGCAAATTTGGATGATATAAGAGCCTATGCAAGGGAGGTGGTTGAGATGGAGATTGAGAAGCATGCGGCTGATGGACTTGGTAGGGTGGATTATGCTCTTGCTTCTGGTGGGGCTATGGTTGTAAAGCATTCCGAGCCGTTTACAGGAAGAACAAGTAACTGGTTTTCGAGAAATGCAAGGAATGGAGTTCATAGAAATGCGAATAAGATGCTGAAACCGAGTTTCGGGGAGCCCGGGCAGTGCTTCCCCTTGAAAGGAAGTAGCGGGTTCGTTCAAATTAGGCTACGGACGGCGATCGTTCCAGAAGCTATTACTCTAGAGCATGTTGCCAAGGTGATGTTCTTATCCAAATATGTGTTTCTATATGGTAAATACTTATATCTTAGGATTGCCTGAAAATGAATGAGCATTTGTTAGTGTATACTGCCATGGCTTTGATTTAAAAGAAAAAAGATTGCACTGTATTCCATGAGTTGTGAAGAGAACCATAGGCATAGTTGAAAATGCAATGCTGGCACCTTATCTGCTTAGTTATTGTATAACAGCCGAAGCCTACCGCTAGTAGATATTGTCGTCTTTGGACTTTCCCTCGAGGTTTTTAAAACGCGTCTTTTAGAGTTACGCAACATAATCTGAATAAGTTTGGGTTGTTACAAATGGTATTAGAACCCGATACTGAGCAGTGTATCGGGGAGGTGCTGGGCCACCAAAGGGGGTGGATTGTGAGTTTCCACATTGGTTGGAGAGGGGAACGAAGCATTCCTTATAAGGATGTGGAAACCTCACCCTGATGGACCCGTTTTAAAACCTTCTGTTACGTTTTTCTTTACTGTAACAAGTTCTATCCTACAGACCAGCATAGTTAAGTGGGACACCATGCCCTCCAAAAGTTTTAAGTGGTGCTGTGATCTTAGTTTTAAATGG
TGCTTACCCACTAGAGAGGGACACCATGCCCTCCGAAAGCGTTTCCTAATTGTGCTTCGGACAAGTCAGGTTGCACGTTTTTATACTAATCTTTGCGGGATATACACCGGAAAGGAAGACCTAGGTGGTAGGTGATTCAAAATCTCAAAAGGTATTGAGTTGGAATCCCATTCTAACTAAGGATTCTTGTGGTTCCGGAGAATCCAGCTACAGGAGAACCAGGAACGGAGAGCTTTCCCCCCTTTTCCGCCCGCCTCTTTGGTCTTAAGAATGCTGGTTTTAAGAATGAGTGATTGTCCTTCTCGACCCTTACTATAATGTCCCACATTGGTTGGGGAGAAGAACAAACCACCATTTATAAGGGTGTGGAAACCTTCTCCTAGAAAACGCATTTTAAAGCCTTGAGGGGAAGCCCGTAAGGGAAAGTCCAAAGAGGACAATATCTGCTAGCGGTGGGCTTGGGTCATTACACTTACTGCCCAACCGGATAGCAGACAGCTAATGCGTTCCACTTATTGAACAGGGTTCTATGGTCGGTTCGATATGTCGGTTCGATATTTTGAGTTGGGAGGGTTGTGGTTATGGATAATGCATATGTTATTTTAGAATGATGTACAAAAGTACCTTCTTTTCTATGGTGTGTATATGAGACATCCTTTGAGAAACCTGTAGTTGAAAAGGGTATGATGAAGATTGAAGATAGGGCATCTTCTTTCTACTAATCCAATATAGAGCTTAAAATATTCTTCAATGGCGTGTCTATGAAAAGCAAAACGCCCGTAGCCCAAGCCCATTGTTAGCAGATACTGTCCACTTTGGTCTGTTATATAGTCTGTTAGGGAGAGGTTTCTACCCCTTATAATGAATGCTTCGTTCCCCTCTCCAACCGATGTAGGATCTCACAATCCCCCTTGAGAGAGTTCCCCTCTCCAACCGATATGGGATCTCACACAAACAGTTTCTCGAAGTTAAATGGATAAATGCTTTAGCTTTAAAGAGGATACATTACGCTTTCGATAGATGTTTATATGTATTATTAGATTATACCATCTGCCAGCTTCCTCCTTTTTTACTCACTAGGCTTCACAATCTGGCTTCTCTGGACAGAGCGTAGCATACGATAGATCAAGCGCGCCTAAGGCGTGCAGGGTATCTGGATGGTTTCAAGGGGACGACGTTGCTGCCTCAGCTGCGAATGCTGAGAAGATGTTTCTCTTGGCCGAGTTCACGTACGATCTCGAAAAGAGCAATGCACAGACATTCAACGTGGTGGAGACAACAGGCTCCGGACTCGTTGACATGATCCGATTAGATTTCTCATCCAACCATGGAAGCCCATCGCACACTTGCATCTACCGCGTGAGAGTTCACGGTCACGAACCGTATTCTGTTTCTATGATGGCAACGCAGTCGTGATCGGACACAGCAGCCATGTATTGTGTTCCAAGTTTGTATTATCCTTGTAACTGTTACATTCCCTGTTCACTCTTAGAATCTTTTGGCTAGTAAGCACAGATGTTGCCTTTTACTCCTTTGTTTTCCTTCGGCATTGTAATGTGGGGTTATGTTTTGTTTTTTTTCAGCCTACGCTTCTCGGAGTCTTACACGTACCGAGCTTTAAATTCTAATCTTTCTCTCTCGACTCTCTAAACGAGTCTATGTAGATATCAGGTTGTTGTTTTTGTAAGTCTAACATGTGGGAGTCGTCGGGATTCAAACCGAGGATGATTGACTTAATCGATCGAGTTATATCTCGGGCGAATATTTTCTTACTTACCGAACTTTATTTTTGAAGTTTGTTTATAATACATTTTCTTTTAGTCAACAGTATTCATGGTCAAACCCGCTCCAAAACCAAACTAAAAGTAGAGACA

mRNA sequence

CAAACGCCGGTTGCTTTCATCCGATGCGTCGGTTAATTTAATTCAGGTCATGCACTCAATTATTCTCGATTCGGAAATCAAATTCTGCCACCGTATCCTCCGTTTTCCCTCGAAATTTCTCTGTTTTTTTCTTAGAAATTTCAAAATCTCAGTGTATACAAAATTCCTTTTCAATCTCTTTGCGTTTTACATTCTTGTATGTTCGTATGTTCTTCAATTTCGTTCTTCACTGTTAGGGCTTTTGTTTTGCATTTTGGGGAATTTATTTCTGCTGTCGGAATCGTGAAATTTTCTCCACAGAAATGTCGGCTTCGACCGTGTCGATAACGGCGAATCCGGCGGCGCGGAGGAGGCCGGTGTTGGTTTCGGAGAAGAAGGGGGCTAGTATTGAATTGTTGGCAACGGATGGCGCGAATCCTCTATCGAATACTGCTACTGCGGGGACTGGTGGTGCGGCCGATGATAAGGTCGCCGGTGGAATTGGTAGGGATATGAGCCATCACTCGATTAGAGGTGAGGTTGTTCTCGAGCGGTCATCTAGGGATCCACTTCAGATCAAGAAAACTGTTGCGAATTCCACCATTTCGCCGCGGCGGAGTAGAAAGGCGATTACGAAGCCGGAGAAGCCTCGATGGGCGACTATTCTTAGCGTCGTTACGAAGAATATTGTGCTTTTGATGGTTCTTCTAGGGTTGGTTCAAATGATTAGGAAATTAGCCCTAAAATCAGGGGATGGAGCGGTGGGAAATCAAATGGGGTTTTCGGAAGTTGAGGGACGGATTGCAGAAGTTGAAGCATTCTTGAAGACTACCACCAAGATGATTCAGGTTCAGGTGGAGGTCGTCGATCGGAAGATCGAGAACGAAGTTGGGGGATTGAGAAGGGAAGTTGACCATAAGATTGAGGCAAAAACTGCTGATCTTGAGAGCGGTTTGAAGGTGCTGCAGGATAAAGGGGAGGATCTGGAAAGGTCTTTGAGCGAGTTAACGGCCGTGGATTGGTTGTCGAAGCAAGAATTCGATAGGATTTACGACGAATTGAAGAAGGCGAAGAGCGGTGAAATTGATGAGCGGTTTGCAAATTTGGATGATATAAGAGCCTATGCAAGGGAGGTGGTTGAGATGGAGATTGAGAAGCATGCGGCTGATGGACTTGGTAGGGTGGATTATGCTCTTGCTTCTGGTGGGGCTATGGTTGTAAAGCATTCCGAGCCGTTTACAGGAAGAACAAGTAACTGGTTTTCGAGAAATGCAAGGAATGGAGTTCATAGAAATGCGAATAAGATGCTGAAACCGAGTTTCGGGGAGCCCGGGCAGTGCTTCCCCTTGAAAGGAAGTAGCGGGTTCGTTCAAATTAGGCTACGGACGGCGATCGTTCCAGAAGCTATTACTCTAGAGCATGTTGCCAAGAGCGTAGCATACGATAGATCAAGCGCGCCTAAGGCGTGCAGGGTATCTGGATGGTTTCAAGGGGACGACGTTGCTGCCTCAGCTGCGAATGCTGAGAAGATGTTTCTCTTGGCCGAGTTCACGTACGATCTCGAAAAGAGCAATGCACAGACATTCAACGTGGTGGAGACAACAGGCTCCGGACTCGTTGACATGATCCGATTAGATTTCTCATCCAACCATGGAAGCCCATCGCACACTTGCATCTACCGCGTGAGAGTTCACGGTCACGAACCGTATTCTGTTTCTATGATGGCAACGCAGTCGTGATCGGACACAGCAGCCATGTATTGTGTTCCAAGTTTGTATTATCCTTGTAACTGTTACATTCCCTGTTCACTCTTAGAATCTTTTGGCTAGTAAGCACAGATGTTGCCTTTTACTCCTTTGTTTTCCTTCGGCATTGTAATGTGGGGTTATGTTTTGTTTTTTTTCAGCCTACGCTTCTCGGAGTCTTACACGTACCGAGCTTTAAATTCTAATCTTTCTCTCTCGACTCTCTAAACGAGTCTATGTAGATATCAGGTTGTTGTTTTTGTAAGTCTAACATGT
GGGAGTCGTCGGGATTCAAACCGAGGATGATTGACTTAATCGATCGAGTTATATCTCGGGCGAATATTTTCTTACTTACCGAACTTTATTTTTGAAGTTTGTTTATAATACATTTTCTTTTAGTCAACAGTATTCATGGTCAAACCCGCTCCAAAACCAAACTAAAAGTAGAGACA

Coding sequence (CDS)

ATGTCGGCTTCGACCGTGTCGATAACGGCGAATCCGGCGGCGCGGAGGAGGCCGGTGTTGGTTTCGGAGAAGAAGGGGGCTAGTATTGAATTGTTGGCAACGGATGGCGCGAATCCTCTATCGAATACTGCTACTGCGGGGACTGGTGGTGCGGCCGATGATAAGGTCGCCGGTGGAATTGGTAGGGATATGAGCCATCACTCGATTAGAGGTGAGGTTGTTCTCGAGCGGTCATCTAGGGATCCACTTCAGATCAAGAAAACTGTTGCGAATTCCACCATTTCGCCGCGGCGGAGTAGAAAGGCGATTACGAAGCCGGAGAAGCCTCGATGGGCGACTATTCTTAGCGTCGTTACGAAGAATATTGTGCTTTTGATGGTTCTTCTAGGGTTGGTTCAAATGATTAGGAAATTAGCCCTAAAATCAGGGGATGGAGCGGTGGGAAATCAAATGGGGTTTTCGGAAGTTGAGGGACGGATTGCAGAAGTTGAAGCATTCTTGAAGACTACCACCAAGATGATTCAGGTTCAGGTGGAGGTCGTCGATCGGAAGATCGAGAACGAAGTTGGGGGATTGAGAAGGGAAGTTGACCATAAGATTGAGGCAAAAACTGCTGATCTTGAGAGCGGTTTGAAGGTGCTGCAGGATAAAGGGGAGGATCTGGAAAGGTCTTTGAGCGAGTTAACGGCCGTGGATTGGTTGTCGAAGCAAGAATTCGATAGGATTTACGACGAATTGAAGAAGGCGAAGAGCGGTGAAATTGATGAGCGGTTTGCAAATTTGGATGATATAAGAGCCTATGCAAGGGAGGTGGTTGAGATGGAGATTGAGAAGCATGCGGCTGATGGACTTGGTAGGGTGGATTATGCTCTTGCTTCTGGTGGGGCTATGGTTGTAAAGCATTCCGAGCCGTTTACAGGAAGAACAAGTAACTGGTTTTCGAGAAATGCAAGGAATGGAGTTCATAGAAATGCGAATAAGATGCTGAAACCGAGTTTCGGGGAGCCCGGGCAGTGCTTCCCCTTGAAAGGAAGTAGCGGGTTCGTTCAAATTAGGCTACGGACGGCGATCGTTCCAGAAGCTATTACTCTAGAGCATGTTGCCAAGAGCGTAGCATACGATAGATCAAGCGCGCCTAAGGCGTGCAGGGTATCTGGATGGTTTCAAGGGGACGACGTTGCTGCCTCAGCTGCGAATGCTGAGAAGATGTTTCTCTTGGCCGAGTTCACGTACGATCTCGAAAAGAGCAATGCACAGACATTCAACGTGGTGGAGACAACAGGCTCCGGACTCGTTGACATGATCCGATTAGATTTCTCATCCAACCATGGAAGCCCATCGCACACTTGCATCTACCGCGTGAGAGTTCACGGTCACGAACCGTATTCTGTTTCTATGATGGCAACGCAGTCGTGA

Protein sequence

MSASTVSITANPAARRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGGIGRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISPRRSRKAITKPEKPRWATILSVVTKNIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAMVVKHSEPFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS
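The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code; the function name `translate` is hypothetical, and only the first ten codons and the last three codons of the CDS are used as input so that the example stays short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGTCGGCTTCGACCGTGTCGATAACGGCG"))  # MSASTVSITA
# Last 9 nucleotides of the CDS, ending in the TGA stop codon.
print(translate("CAGTCGTGA"))  # QS
```

Both results agree with the start (MSASTVSITA) and the end (QS) of the listed protein sequence.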
BLAST of Cp4.1LG01g21450 vs. Swiss-Prot
Match: SUN1_ARATH (Protein SAD1/UNC-84 domain protein 1 OS=Arabidopsis thaliana GN=SUN1 PE=1 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 1.3e-116
Identity = 247/485 (50.93%), Positives = 326/485 (67.22%), Query Frame = 1

Query: 1   MSASTVSITANPAA--RRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAG 60
           MSASTVSITAN AA  RR P+L  EKK        ++   P S +   G  G A     G
Sbjct: 1   MSASTVSITANTAAATRRTPILAGEKK--------SNFDYPQSESLANGGVGEA-----G 60

Query: 61  GIGRDMSHHSIRGEVVLERSSRDPLQ--IKKTVA-----NSTISPRRSRKAIT-KPEKPR 120
           G  RD+S    RGE  L+RS    L    +++V+     N+T + RR+RK  T K EK R
Sbjct: 61  GTSRDLS----RGEATLDRSQGQDLGPVTRRSVSAATGTNTTATQRRTRKVATPKSEKAR 120

Query: 121 WATILSVVTKNIVLLMVLLGLVQMIRKLALKSGD-----GAVGNQMGFSEVEGRIAEVEA 180
           W T++ +  K +  L++++GL+Q+ RK+ LK+        +   +M FS +E RIAEV+ 
Sbjct: 121 WKTVVRIFAKQLGALLIIVGLIQLTRKMILKASSPSSPISSYETEMAFSGLESRIAEVDG 180

Query: 181 FLKTTTKMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSL 240
            +K TT  +QVQVE++D+K+E E   LR+E++ K  A     +S LK ++ + E LE+S+
Sbjct: 181 LVKATTNSMQVQVELLDKKMEREAKVLRQEIERKASA----FQSELKKIESRTESLEKSV 240

Query: 241 SELTAVDWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLG 300
            E+ A  W++K E +RIY+ELKK    +      ++D++RAYAR+++E EIEKHAADGLG
Sbjct: 241 DEVNAKPWVTKDELERIYEELKKGNVDDSAFSEISIDELRAYARDIMEKEIEKHAADGLG 300

Query: 301 RVDYALASGGAMVVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKG 360
           RVDYALASGGA V++HS+P+  G+ S+WF+   R   H NA KML PSFGEPGQCFPLKG
Sbjct: 301 RVDYALASGGAFVMEHSDPYLVGKGSSWFATTMRR-AHTNAVKMLSPSFGEPGQCFPLKG 360

Query: 361 SSGFVQIRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMF 420
           S G+VQIRLR  I+PEA TLEHVAKSVAYDRSSAPK CRVSG  QG +   S+A  E M 
Sbjct: 361 SEGYVQIRLRGPIIPEAFTLEHVAKSVAYDRSSAPKDCRVSGSLQGPE---SSAETENMQ 420

Query: 421 LLAEFTYDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSV 470
           LL EFTYDL++SNAQTFN++E++ SGL+D +RLDF+SNHGS SHTCIYR RVHG  P  V
Sbjct: 421 LLTEFTYDLDRSNAQTFNILESSSSGLIDTVRLDFTSNHGSDSHTCIYRFRVHGRAPDPV 460
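The percentages in an HSP header are the match count divided by the alignment length. A small sketch of recomputing them from the header line of the hit above follows; the helper name `check_hsp_stats` is hypothetical, and the regular expression reads whatever label precedes each fraction.

```python
import re

# Matches fragments such as "Identity = 247/485 (50.93%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_stats(line):
    """Return {label: (recomputed_percent, reported_percent)} for one HSP line."""
    results = {}
    for label, matches, length, reported in HSP_STAT.findall(line):
        computed = round(100 * int(matches) / int(length), 2)
        results[label] = (computed, float(reported))
    return results

stats = check_hsp_stats(
    "Identity = 247/485 (50.93%), Positives = 326/485 (67.22%), Query Frame = 1"
)
for label, (computed, reported) in stats.items():
    print(label, computed, reported)
```

For this hit the recomputed values, 50.93 and 67.22, match the reported ones.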

BLAST of Cp4.1LG01g21450 vs. Swiss-Prot
Match: SUN2_ARATH (Protein SAD1/UNC-84 domain protein 2 OS=Arabidopsis thaliana GN=SUN2 PE=1 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 9.4e-107
Identity = 238/481 (49.48%), Positives = 315/481 (65.49%), Query Frame = 1

Query: 1   MSASTVSITANPAA-RRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGG 60
           MSASTVSITA+P   RR PVL  EKK       +   AN     ++AGT           
Sbjct: 1   MSASTVSITASPRTIRRTPVLSGEKKSNFDFPPSESHANAAIGESSAGTN---------- 60

Query: 61  IGRDMSHHSIRGEVVLERSSR---DPLQIKK----TVANSTISPRRSRKAI-TKPEKPRW 120
             +D+    IR E   ERS+     P+  K     T  N+T + RR+RK+   K ++ +W
Sbjct: 61  --KDL----IRAEAAGERSNTYDVGPVTRKSGSTATGTNTTTTQRRTRKSQGNKIDRGKW 120

Query: 121 ATILSVVTKNIVLLMVLLGLVQMIRKLALK-----SGDGAVGNQMGFSEVEGRIAEVEAF 180
            T++ V  K    L++L+GL+Q+IRKL LK     S +  +  +M  SE+E RI+ V+  
Sbjct: 121 KTVVRVFAKQFGALLLLVGLIQLIRKLTLKDSSLSSSNFPIETEMVLSELESRISAVDGL 180

Query: 181 LKTTTKMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLS 240
           +KTTTKM+QVQVE +D+K+++E   LR+ +D    + ++ L S LK ++ K E L+ S+ 
Sbjct: 181 VKTTTKMMQVQVEFLDKKMDSESRALRQTID----STSSVLHSELKKVESKTERLQVSVD 240

Query: 241 ELTAVDWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGR 300
           EL A   +S++E +R+Y+ELKK K G+ D    N+D +RAYAR++VE EI KH ADGLGR
Sbjct: 241 ELNAKPLVSREELERVYEELKKGKVGDSD---VNIDKLRAYARDIVEKEIGKHVADGLGR 300

Query: 301 VDYALASGGAMVVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGS 360
           VDYALASGGA V+ HS+PF  G   NWF   +R  VH  A KML PSFGEPGQCFPLKGS
Sbjct: 301 VDYALASGGAFVMGHSDPFLVGNGRNWFG-TSRRRVHSKAVKMLTPSFGEPGQCFPLKGS 360

Query: 361 SGFVQIRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFL 420
           +G+V +RLR  I+PEA+TLEHV+K+VAYDRSSAPK CRVSGW    D+       E M L
Sbjct: 361 NGYVLVRLRAPIIPEAVTLEHVSKAVAYDRSSAPKDCRVSGWLGDIDM-----ETETMPL 420

Query: 421 LAEFTYDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVS 467
           L EF+YDL++SNAQTF++ ++  SGLV+ +RLDF+SNHGS SHTCIYR RVHG E  SVS
Sbjct: 421 LTEFSYDLDRSNAQTFDIADSAHSGLVNTVRLDFNSNHGSSSHTCIYRFRVHGRELDSVS 452

BLAST of Cp4.1LG01g21450 vs. Swiss-Prot
Match: SUN2_MOUSE (SUN domain-containing protein 2 OS=Mus musculus GN=Sun2 PE=1 SV=3)

HSP 1 Score: 95.1 bits (235), Expect = 2.1e-18
Identity = 89/340 (26.18%), Positives = 150/340 (44.12%), Query Frame = 1

Query: 155 EVEGRIAEVEAFLKTTTKMIQVQVEVVDR-KIENEVGGLRREV----------------- 214
           E E R+ +++   K+ T+    +  V +  ++E ++  LR+E+                 
Sbjct: 397 ESEARVQQLKTEWKSMTQEAFQESSVKELGRLEAQLASLRQELAALTLKQNSVADEVGLL 456

Query: 215 DHKIEAKTADLESGLK------VLQDKG--------EDLERSLSELTAVDWLSKQEFDRI 274
             KI+A  AD+ES         +L D+G        +++   L EL         E    
Sbjct: 457 PQKIQAARADVESQFPDWIRQFLLGDRGARSGLLQRDEMHAQLQELENKILTKMAEMQGK 516

Query: 275 YDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAMVV--K 334
                 A  G+I ++   +         +V+  +++++ D +G VDYAL SGGA V+  +
Sbjct: 517 SAREAAASLGQILQKEGIVGVTEEQVHRIVKQALQRYSEDRIGMVDYALESGGASVISTR 576

Query: 335 HSEPFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVPE 394
            SE +  +T+           H  + +++      PG C+  +G  GF  +RL   I P 
Sbjct: 577 CSETYETKTALLSLFGIPLWYHSQSPRVILQPDVHPGNCWAFQGPQGFAVVRLSARIRPT 636

Query: 395 AITLEHVAKSVAYDR--SSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNA 454
           A+TLEHV K+++ +   SSAPK   + G+   +D+           LL  F YD +    
Sbjct: 637 AVTLEHVPKALSPNSTISSAPKDFAIFGF--DEDLQQEGT------LLGTFAYDQDGEPI 696

Query: 455 QTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHG 459
           QTF   + +      ++ L   +N G P +TCIYR RVHG
Sbjct: 697 QTF-YFQASKMATYQVVELRILTNWGHPEYTCIYRFRVHG 727

BLAST of Cp4.1LG01g21450 vs. Swiss-Prot
Match: SUN2_HUMAN (SUN domain-containing protein 2 OS=Homo sapiens GN=SUN2 PE=1 SV=3)

HSP 1 Score: 94.4 bits (233), Expect = 3.7e-18
Identity = 96/337 (28.49%), Positives = 155/337 (45.99%), Query Frame = 1

Query: 128 LLGLVQMIRKLALKSGDGAVGNQMGF--SEVEGRIAEVEAFLKTTTKMIQVQVEVVDRKI 187
           L GL Q +  LALK    +V  ++G    +++    +VE+             + + R  
Sbjct: 418 LAGLQQELAALALKQS--SVAEEVGLLPQQIQAVRDDVESQFPAWIS------QFLARGG 477

Query: 188 ENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQEFDRIYDE 247
              VG L+RE   +++A+  +LES  K+L    E   +S  E  A   L+ Q        
Sbjct: 478 GGRVGLLQRE---EMQAQLRELES--KILTHVAEMQGKSAREAAASLSLTLQ-------- 537

Query: 248 LKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAMVV--KHSE 307
            K+   G  +E+             +V+  +++++ D +G  DYAL SGGA V+  + SE
Sbjct: 538 -KEGVIGVTEEQ----------VHHIVKQALQRYSEDRIGLADYALESGGASVISTRCSE 597

Query: 308 PFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVPEAIT 367
            +  +T+           H  + +++      PG C+  +G  GF  +RL   I P A+T
Sbjct: 598 TYETKTALLSLFGIPLWYHSQSPRVILQPDVHPGNCWAFQGPQGFAVVRLSARIRPTAVT 657

Query: 368 LEHVAKSVAYDR--SSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNAQTF 427
           LEHV K+++ +   SSAPK   + G+   +D+           LL +FTYD +    QTF
Sbjct: 658 LEHVPKALSPNSTISSAPKDFAIFGF--DEDLQQEGT------LLGKFTYDQDGEPIQTF 713

Query: 428 NVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHG 459
           +    T      ++ L   +N G P +TCIYR RVHG
Sbjct: 718 HFQAPT-MATYQVVELRILTNWGHPEYTCIYRFRVHG 713

BLAST of Cp4.1LG01g21450 vs. Swiss-Prot
Match: SUN1_HUMAN (SUN domain-containing protein 1 OS=Homo sapiens GN=SUN1 PE=1 SV=3)

HSP 1 Score: 94.0 bits (232), Expect = 4.8e-18
Identity = 66/206 (32.04%), Positives = 103/206 (50.00%), Query Frame = 1

Query: 266 AYAREVVEMEIEKHAADGLGRVDYALASGGAMVV--KHSEPFTGRTSN---------WFS 325
           A AR +V   ++ ++ D  G VD+AL SGG  ++  + SE +  +T+          +FS
Sbjct: 620 AQARAIVNSALKLYSQDKTGMVDFALESGGGSILSTRCSETYETKTALMSLFGIPLWYFS 679

Query: 326 RNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVPEAITLEHVAKSVA-- 385
           ++ R         +++P    PG C+  KGS G++ +RL   I P A TLEH+ K+++  
Sbjct: 680 QSPR--------VVIQPDI-YPGNCWAFKGSQGYLVVRLSMMIHPAAFTLEHIPKTLSPT 739

Query: 386 YDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNAQTFNVVETTGSGLV 445
            + SSAPK   V G              E+  LL +FTYD +  + Q F  ++       
Sbjct: 740 GNISSAPKDFAVYG--------LENEYQEEGQLLGQFTYDQDGESLQMFQALKRPDDTAF 799

Query: 446 DMIRLDFSSNHGSPSHTCIYRVRVHG 459
            ++ L   SN G P +TC+YR RVHG
Sbjct: 800 QIVELRIFSNWGHPEYTCLYRFRVHG 808

BLAST of Cp4.1LG01g21450 vs. TrEMBL
Match: A0A0A0KCH3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G051490 PE=4 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 1.9e-215
Identity = 394/472 (83.47%), Positives = 431/472 (91.31%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGGI 60
           MSASTVSITANPA RRRPVL SEKKGAS ELLATDG NPLSNTAT GT GAADDK+AG  
Sbjct: 1   MSASTVSITANPATRRRPVLASEKKGASFELLATDGLNPLSNTATLGTVGAADDKLAGAN 60

Query: 61  GRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISPRRSRKAITKPEKPRWATILSVVTK 120
           GRDMSHHSIRGEVVLERSSRDP+QIKK VANSTISPRRSRK ITKPEKPRW TI+SV+TK
Sbjct: 61  GRDMSHHSIRGEVVLERSSRDPIQIKKAVANSTISPRRSRKVITKPEKPRWVTIVSVLTK 120

Query: 121 NIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQVEV 180
           N VLL+VLLGL QM+RKLALKSG+G VGNQMGFSEVEGRIAEVEA LKTT+KM+QVQVEV
Sbjct: 121 NGVLLLVLLGLAQMVRKLALKSGEGEVGNQMGFSEVEGRIAEVEALLKTTSKMLQVQVEV 180

Query: 181 VDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQEFD 240
           VDRKIENEVGGLRREV+ KI+ KTADL+SGLK L++KGE+LERSLSEL   DWLSKQEFD
Sbjct: 181 VDRKIENEVGGLRREVNKKIDEKTADLDSGLKKLENKGEELERSLSELKTGDWLSKQEFD 240

Query: 241 RIYDELKKAKSGEIDE-RFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAMVV 300
           +IY+ELKK K+GE DE RFANLD+IRA ARE++E EI+KHAADGLGRVDYA+ASGGAMVV
Sbjct: 241 KIYEELKKTKNGEFDEQRFANLDEIRASAREMIEREIQKHAADGLGRVDYAVASGGAMVV 300

Query: 301 KHSEPFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVP 360
           KHS+P+ GRTSNWF +N RNGVH +A+K+LKPSFGEPGQCF LKGSSGFVQIRLR AIVP
Sbjct: 301 KHSDPYRGRTSNWFLKNVRNGVHSDADKLLKPSFGEPGQCFALKGSSGFVQIRLRAAIVP 360

Query: 361 EAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNAQ 420
           EAITLEHVAKSVA+DR+SAPK CRVSGWFQG +   SA N EKMF LA+FTYDLEKSNAQ
Sbjct: 361 EAITLEHVAKSVAFDRTSAPKDCRVSGWFQGKN-PNSAINGEKMFPLAKFTYDLEKSNAQ 420

Query: 421 TFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           TF+VV+TTGSGLVDMIRLDFSSNHG+PSHTCIYR+RVHGHEPYSVSMMA QS
Sbjct: 421 TFDVVDTTGSGLVDMIRLDFSSNHGNPSHTCIYRMRVHGHEPYSVSMMAIQS 471

BLAST of Cp4.1LG01g21450 vs. TrEMBL
Match: A0A067J9Z6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04204 PE=4 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 6.4e-163
Identity = 315/481 (65.49%), Positives = 376/481 (78.17%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATD-----GANPLSNTATAGTGGAADDK 60
           MSASTVSITAN A RRRPV+  EKK  +IELL  +     G N + N         A+DK
Sbjct: 1   MSASTVSITANHAGRRRPVVAGEKKSTNIELLPNEAQINGGDNAIKN---------ANDK 60

Query: 61  VAGGIGRDMSHHSIRGEVVLERSSRDPLQIKKT-VANSTISPRRSRKAITKPEKPRWATI 120
           +     +D+SHHSIRGE VLERS++D  Q+KK  + NSTISPRRSRK + KPEKP W T+
Sbjct: 61  LVASHSKDLSHHSIRGEAVLERSTKDTTQVKKNAMVNSTISPRRSRKMVAKPEKPMWQTV 120

Query: 121 LSVVTKNIVLLMVLLGLVQMIRKLALKSGD--GAVGN-QMGFSEVEGRIAEVEAFLKTTT 180
           +SV TKN  LL+VL+GLVQMIR+LA+KSGD     G  Q+G SE E RIAEVE+F KTT 
Sbjct: 121 VSVFTKNFFLLLVLIGLVQMIRRLAMKSGDYHSISGTAQIGASEFESRIAEVESFFKTTA 180

Query: 181 KMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAV 240
           KMIQ+QVEVVD K+ NEV GLR+E+D KIE K   L+SGLK +  + E LE+S+SELTAV
Sbjct: 181 KMIQLQVEVVDAKVGNEVEGLRKEMDKKIEEKAELLDSGLKQIVARNEQLEKSISELTAV 240

Query: 241 DWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYAL 300
           DWLSK++F  IY+ELKK K  E  E   +LDDIRAYAR++VE EIEKHAADGLGRVDYAL
Sbjct: 241 DWLSKEDFKMIYEELKKGKGNEFGESDISLDDIRAYARDIVEKEIEKHAADGLGRVDYAL 300

Query: 301 ASGGAMVVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQ 360
           ASGGA VVKHSEP+ TG+ SNWF  ++R G H +A KMLKPSFGEPGQCFPLKGSSGFVQ
Sbjct: 301 ASGGASVVKHSEPYITGKGSNWFLMSSRGGAHPDAVKMLKPSFGEPGQCFPLKGSSGFVQ 360

Query: 361 IRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFT 420
           I+LRTAIVPEA+TLEHVAK+VAYDRSSAPK CRVSGW QG D+  +  ++EKMFLL+EF+
Sbjct: 361 IKLRTAIVPEAVTLEHVAKNVAYDRSSAPKDCRVSGWLQGHDMDLT-IDSEKMFLLSEFS 420

Query: 421 YDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQ 472
           YDLEKSNAQT+ V+++  S +VD +RLDF SNHGS SHTCIYR+RVHG+EP SVS++  +
Sbjct: 421 YDLEKSNAQTYAVLDSAASSVVDTVRLDFISNHGSSSHTCIYRLRVHGYEPDSVSVVTVE 471

BLAST of Cp4.1LG01g21450 vs. TrEMBL
Match: W9R9S3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012197 PE=4 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 6.7e-160
Identity = 314/476 (65.97%), Positives = 376/476 (78.99%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGGI 60
           MSASTVSITANPA RRR V+  EKK +S+EL+A   A P           A DDK     
Sbjct: 1   MSASTVSITANPATRRRTVVAVEKK-SSVELVA---AEPQFK--------AGDDKSVAAN 60

Query: 61  GRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISP---RRSRKAITKPEKPRWATILSV 120
           GRD+S+HSIRG+ VLERSSRD + +KKT ANST+SP   RRSRK +  P KPRW T+LSV
Sbjct: 61  GRDLSNHSIRGDAVLERSSRDAVPVKKTAANSTVSPPSNRRSRKPVAAP-KPRWLTVLSV 120

Query: 121 VTKNIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQ 180
            TKN VLL++L+GLVQ++R+LAL+S DG  G     S+ EGRIAEVE F+KTT KMIQVQ
Sbjct: 121 FTKNFVLLVLLVGLVQIVRRLALRSSDG--GGPFALSDFEGRIAEVEKFVKTTAKMIQVQ 180

Query: 181 VEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQ 240
           VEVVDRK+++EVGGLR EV  KIE K+  LES LK L+ K E LERSL E   ++W+SK+
Sbjct: 181 VEVVDRKVDSEVGGLR-EVGKKIEEKSVLLESQLKELEAKSEGLERSLGEFKDINWISKE 240

Query: 241 EFDRIYDELKKAKSGEID-ERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGA 300
           EFD+IY+ELKKA+S E   +   NL+DIRAYAR+VV  EIE+HAADGL R DYALA+GGA
Sbjct: 241 EFDKIYEELKKARSDEFGVDDGTNLNDIRAYARDVVLKEIERHAADGLARADYALATGGA 300

Query: 301 MVVKHSEPFT-GRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRT 360
           MVVKHSEPF  G+ +NWF + A NGVH +A KMLKPSFGEPGQCFPLKGSSGFV+I+LRT
Sbjct: 301 MVVKHSEPFLKGKGNNWFGKGATNGVHNDAEKMLKPSFGEPGQCFPLKGSSGFVEIKLRT 360

Query: 361 AIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEK 420
           AI+PEAITLEHVAKSVA+DRSSAPK CR+SGW QG +   S ++A ++FLLAEFTYDLEK
Sbjct: 361 AIIPEAITLEHVAKSVAFDRSSAPKNCRISGWLQGQN-TESTSDALRIFLLAEFTYDLEK 420

Query: 421 SNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           SNAQT+NV+++  S +VD +R DF+SNHGSPSHTCIYR+RVHGHEP SV+M+A QS
Sbjct: 421 SNAQTYNVLDSASSSIVDTVRFDFTSNHGSPSHTCIYRLRVHGHEPESVAMIAMQS 459

BLAST of Cp4.1LG01g21450 vs. TrEMBL
Match: A0A151QPL8_CAJCA (Protein unc-84 isogeny B OS=Cajanus cajan GN=KK1_047108 PE=4 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 6.4e-155
Identity = 304/475 (64.00%), Positives = 363/475 (76.42%), Query Frame = 1

Query: 1   MSASTVSITA-NPAARRRPVLVSEKKGAS-IELLATDGANPLSNTATAGTGGAADDKVAG 60
           MSASTVSITA NP  RRRPV+ +EKK  S IELLA D A   +  AT+G GG A      
Sbjct: 1   MSASTVSITAANPGTRRRPVIAAEKKTPSNIELLANDVAVSPAAVATSGDGGGA------ 60

Query: 61  GIGRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTIS--PRRSRKAITKPEKPRWATILS 120
           G GRD+SHHSIRGE +LER+SRD   +KK    ++ S  PRR RK   K EKPRW T++ 
Sbjct: 61  GSGRDLSHHSIRGEALLERASRDLAPVKKVAGGNSASVPPRRLRKPAAKAEKPRWVTVVR 120

Query: 121 VVTKNIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQV 180
           +  KN+VLL+VL+GLVQ+IR+LALKSGD A G   G SE EGRI++VE  LK T KMIQV
Sbjct: 121 IFGKNLVLLVVLMGLVQLIRRLALKSGDAADGGYPGLSEFEGRISDVEGLLKRTAKMIQV 180

Query: 181 QVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSK 240
           QV+VVD+KIENEV GLR+E+  KI+ K   LESGL+ L+ + E+LE  +SEL   DWL+K
Sbjct: 181 QVDVVDKKIENEVRGLRKELSGKIDEKGVILESGLRKLEARSEELESRMSELKREDWLTK 240

Query: 241 QEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGA 300
           +EFD+  +EL+  K  E       LD+IR +AR V+E EIEKHAADGLGRVDYALASGGA
Sbjct: 241 EEFDKFVEELRNVKGYE----GGGLDEIREFARGVIEKEIEKHAADGLGRVDYALASGGA 300

Query: 301 MVVKHSEPFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTA 360
            VVKHSE +     NWF+  A+NGVH +A KMLKPSFGEPGQCFPLK S GFVQI+LRTA
Sbjct: 301 AVVKHSEVYNTGKGNWFTMAAKNGVHPSAEKMLKPSFGEPGQCFPLKDSKGFVQIKLRTA 360

Query: 361 IVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKS 420
           I+PEA+TLEHVAKSVAYDRSSAPK CRVSGW QG++ A S  +++KM+LLAEF+YDLEKS
Sbjct: 361 IIPEAVTLEHVAKSVAYDRSSAPKDCRVSGWLQGNN-ADSVLDSKKMYLLAEFSYDLEKS 420

Query: 421 NAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           NAQTFNV+ +  SG+++ IRLDF+SNHGSPSHTCIYR+RVHGHEP  VSMMA  S
Sbjct: 421 NAQTFNVLSSAASGVINTIRLDFTSNHGSPSHTCIYRLRVHGHEPDLVSMMALGS 464

BLAST of Cp4.1LG01g21450 vs. TrEMBL
Match: M5X263_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003932mg PE=4 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 7.9e-153
Identity = 304/478 (63.60%), Positives = 367/478 (76.78%), Query Frame = 1

Query: 1   MSASTVSITANPAA--RRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAG 60
           MSASTVSITANPA   RRR V+  EKK  +IEL++ + A+  + T              G
Sbjct: 83  MSASTVSITANPATTTRRRTVVAVEKKSTNIELVSAEKADGKAET--------------G 142

Query: 61  GIGRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISP---RRSRKAITKPEKPRWATIL 120
           G  +D+SHHSIRGE  L+RS+      KKT  NSTISP   RRSR+++     PRW T+L
Sbjct: 143 GNSKDLSHHSIRGEPGLDRSAHG----KKTGPNSTISPPSNRRSRRSVAVDPNPRWVTVL 202

Query: 121 SVVTKNIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQ 180
            +  KN +LL++++GL Q++R+LAL+SGDG     M FS++EGRIAEVE+F+K TTKM+Q
Sbjct: 203 RIFAKNFILLLLIVGLFQIVRRLALRSGDGV---PMAFSDLEGRIAEVESFVKKTTKMVQ 262

Query: 181 VQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLS 240
           VQVEVVDRKIE+EVGGL+REV+ KIE K   LE  L+ L+ + E LERS+ +L +V+WLS
Sbjct: 263 VQVEVVDRKIESEVGGLKREVEKKIEDKGVALERDLRKLEARNEGLERSVDDLRSVEWLS 322

Query: 241 KQEFDRIYDELKK-AKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASG 300
           KQEF+R+Y+ELKK AKSGE  E  A LDDIRAYAR VVE EIEKHAADGLGRVDYALAS 
Sbjct: 323 KQEFERVYEELKKAAKSGEDGEFGARLDDIRAYARNVVEKEIEKHAADGLGRVDYALASS 382

Query: 301 GAMVVKHSEPF-TGRTSNW-FSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIR 360
           GA VVKHSEP+  G+ SNW F ++ +NGVH +A+KML+PSFGEPGQCFPLKGSSGFVQI+
Sbjct: 383 GAFVVKHSEPYLVGKASNWVFLKSTKNGVHGDADKMLRPSFGEPGQCFPLKGSSGFVQIK 442

Query: 361 LRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYD 420
           LRT I+PEAITLEHVAKSVAYDR SAPK CRVSGW +  D      + EKMF LAEFTYD
Sbjct: 443 LRTPIIPEAITLEHVAKSVAYDRRSAPKDCRVSGWLRAHD--DLEVDTEKMFSLAEFTYD 502

Query: 421 LEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQ 471
           LEKSNAQTF+V+++  SGLVD +RLDF+SNHGS SHTCIYR+RVHGHEP +VSMM  Q
Sbjct: 503 LEKSNAQTFDVLDSAVSGLVDTVRLDFTSNHGSASHTCIYRLRVHGHEPDAVSMMTMQ 537

BLAST of Cp4.1LG01g21450 vs. TAIR10
Match: AT5G04990.1 (AT5G04990.1 SAD1/UNC-84 domain protein 1)

HSP 1 Score: 421.4 bits (1082), Expect = 7.4e-118
Identity = 247/485 (50.93%), Positives = 326/485 (67.22%), Query Frame = 1

Query: 1   MSASTVSITANPAA--RRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAG 60
           MSASTVSITAN AA  RR P+L  EKK        ++   P S +   G  G A     G
Sbjct: 1   MSASTVSITANTAAATRRTPILAGEKK--------SNFDYPQSESLANGGVGEA-----G 60

Query: 61  GIGRDMSHHSIRGEVVLERSSRDPLQ--IKKTVA-----NSTISPRRSRKAIT-KPEKPR 120
           G  RD+S    RGE  L+RS    L    +++V+     N+T + RR+RK  T K EK R
Sbjct: 61  GTSRDLS----RGEATLDRSQGQDLGPVTRRSVSAATGTNTTATQRRTRKVATPKSEKAR 120

Query: 121 WATILSVVTKNIVLLMVLLGLVQMIRKLALKSGD-----GAVGNQMGFSEVEGRIAEVEA 180
           W T++ +  K +  L++++GL+Q+ RK+ LK+        +   +M FS +E RIAEV+ 
Sbjct: 121 WKTVVRIFAKQLGALLIIVGLIQLTRKMILKASSPSSPISSYETEMAFSGLESRIAEVDG 180

Query: 181 FLKTTTKMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSL 240
            +K TT  +QVQVE++D+K+E E   LR+E++ K  A     +S LK ++ + E LE+S+
Sbjct: 181 LVKATTNSMQVQVELLDKKMEREAKVLRQEIERKASA----FQSELKKIESRTESLEKSV 240

Query: 241 SELTAVDWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLG 300
            E+ A  W++K E +RIY+ELKK    +      ++D++RAYAR+++E EIEKHAADGLG
Sbjct: 241 DEVNAKPWVTKDELERIYEELKKGNVDDSAFSEISIDELRAYARDIMEKEIEKHAADGLG 300

Query: 301 RVDYALASGGAMVVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKG 360
           RVDYALASGGA V++HS+P+  G+ S+WF+   R   H NA KML PSFGEPGQCFPLKG
Sbjct: 301 RVDYALASGGAFVMEHSDPYLVGKGSSWFATTMRR-AHTNAVKMLSPSFGEPGQCFPLKG 360

Query: 361 SSGFVQIRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMF 420
           S G+VQIRLR  I+PEA TLEHVAKSVAYDRSSAPK CRVSG  QG +   S+A  E M 
Sbjct: 361 SEGYVQIRLRGPIIPEAFTLEHVAKSVAYDRSSAPKDCRVSGSLQGPE---SSAETENMQ 420

Query: 421 LLAEFTYDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSV 470
           LL EFTYDL++SNAQTFN++E++ SGL+D +RLDF+SNHGS SHTCIYR RVHG  P  V
Sbjct: 421 LLTEFTYDLDRSNAQTFNILESSSSGLIDTVRLDFTSNHGSDSHTCIYRFRVHGRAPDPV 460

BLAST of Cp4.1LG01g21450 vs. TAIR10
Match: AT3G10730.1 (AT3G10730.1 SAD1/UNC-84 domain protein 2)

HSP 1 Score: 388.7 bits (997), Expect = 5.3e-108
Identity = 238/481 (49.48%), Positives = 315/481 (65.49%), Query Frame = 1

Query: 1   MSASTVSITANPAA-RRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGG 60
           MSASTVSITA+P   RR PVL  EKK       +   AN     ++AGT           
Sbjct: 1   MSASTVSITASPRTIRRTPVLSGEKKSNFDFPPSESHANAAIGESSAGTN---------- 60

Query: 61  IGRDMSHHSIRGEVVLERSSR---DPLQIKK----TVANSTISPRRSRKAI-TKPEKPRW 120
             +D+    IR E   ERS+     P+  K     T  N+T + RR+RK+   K ++ +W
Sbjct: 61  --KDL----IRAEAAGERSNTYDVGPVTRKSGSTATGTNTTTTQRRTRKSQGNKIDRGKW 120

Query: 121 ATILSVVTKNIVLLMVLLGLVQMIRKLALK-----SGDGAVGNQMGFSEVEGRIAEVEAF 180
            T++ V  K    L++L+GL+Q+IRKL LK     S +  +  +M  SE+E RI+ V+  
Sbjct: 121 KTVVRVFAKQFGALLLLVGLIQLIRKLTLKDSSLSSSNFPIETEMVLSELESRISAVDGL 180

Query: 181 LKTTTKMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLS 240
           +KTTTKM+QVQVE +D+K+++E   LR+ +D    + ++ L S LK ++ K E L+ S+ 
Sbjct: 181 VKTTTKMMQVQVEFLDKKMDSESRALRQTID----STSSVLHSELKKVESKTERLQVSVD 240

Query: 241 ELTAVDWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGR 300
           EL A   +S++E +R+Y+ELKK K G+ D    N+D +RAYAR++VE EI KH ADGLGR
Sbjct: 241 ELNAKPLVSREELERVYEELKKGKVGDSD---VNIDKLRAYARDIVEKEIGKHVADGLGR 300

Query: 301 VDYALASGGAMVVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGS 360
           VDYALASGGA V+ HS+PF  G   NWF   +R  VH  A KML PSFGEPGQCFPLKGS
Sbjct: 301 VDYALASGGAFVMGHSDPFLVGNGRNWFG-TSRRRVHSKAVKMLTPSFGEPGQCFPLKGS 360

Query: 361 SGFVQIRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFL 420
           +G+V +RLR  I+PEA+TLEHV+K+VAYDRSSAPK CRVSGW    D+       E M L
Sbjct: 361 NGYVLVRLRAPIIPEAVTLEHVSKAVAYDRSSAPKDCRVSGWLGDIDM-----ETETMPL 420

Query: 421 LAEFTYDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVS 467
           L EF+YDL++SNAQTF++ ++  SGLV+ +RLDF+SNHGS SHTCIYR RVHG E  SVS
Sbjct: 421 LTEFSYDLDRSNAQTFDIADSAHSGLVNTVRLDFNSNHGSSSHTCIYRFRVHGRELDSVS 452

BLAST of Cp4.1LG01g21450 vs. NCBI nr
Match: gi|659113576|ref|XP_008456648.1| (PREDICTED: SUN domain-containing protein 2 [Cucumis melo])

HSP 1 Score: 779.6 bits (2012), Expect = 3.0e-222
Identity = 403/471 (85.56%), Positives = 437/471 (92.78%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGGI 60
           MSASTVSITANPA RRRPVL SEKKGAS ELLATDG NPLSNTAT GTGGAADDKVAGG 
Sbjct: 1   MSASTVSITANPATRRRPVLASEKKGASFELLATDGVNPLSNTATLGTGGAADDKVAGGN 60

Query: 61  GRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISPRRSRKAITKPEKPRWATILSVVTK 120
           GRDMSHHSIRGEVVLERSSRDP+QIKK VANSTISPRRSRK +TKPEKPRW TI+SV+TK
Sbjct: 61  GRDMSHHSIRGEVVLERSSRDPIQIKKAVANSTISPRRSRKVVTKPEKPRWVTIVSVLTK 120

Query: 121 NIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQVEV 180
           N VLL+VLLGL QM+RKLALKSG+G VGNQMGFSEVEGRIAEVEA LKTT+KM+QVQVEV
Sbjct: 121 NGVLLLVLLGLAQMVRKLALKSGEGEVGNQMGFSEVEGRIAEVEALLKTTSKMLQVQVEV 180

Query: 181 VDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQEFD 240
           VDRKIENEVGGLRREV+ KIE KTADL+SGLK L++KGE+LERSLSEL A DWLSKQEFD
Sbjct: 181 VDRKIENEVGGLRREVNKKIEEKTADLDSGLKKLENKGEELERSLSELKAGDWLSKQEFD 240

Query: 241 RIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAMVVK 300
           +IY+ELKKAK+GE DERFANLD+IRA ARE++E EI+KHAADGLGRVDYA+ASGGAMVVK
Sbjct: 241 KIYEELKKAKNGEFDERFANLDEIRASAREMIEREIQKHAADGLGRVDYAVASGGAMVVK 300

Query: 301 HSEPFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVPE 360
           HSEP+ GRTSNWFS+N RNGVH +A+K+LKPSFGEPGQCF LKGSSGFVQI+LR AIVPE
Sbjct: 301 HSEPYRGRTSNWFSKNVRNGVHSDADKLLKPSFGEPGQCFALKGSSGFVQIKLRAAIVPE 360

Query: 361 AITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNAQT 420
           AITLEHVAKSVA+DRSSAPK CRVSGWFQGDD   SA NAEKMFLLA+FTYDLEKSNAQT
Sbjct: 361 AITLEHVAKSVAFDRSSAPKDCRVSGWFQGDD-PNSAINAEKMFLLAKFTYDLEKSNAQT 420

Query: 421 FNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           F+V +TTGSGLVDMIRLDFSSNHG PS+TCIYRVRVHGHEP+SVSMMA QS
Sbjct: 421 FDVADTTGSGLVDMIRLDFSSNHGGPSYTCIYRVRVHGHEPHSVSMMAIQS 470

BLAST of Cp4.1LG01g21450 vs. NCBI nr
Match: gi|449446333|ref|XP_004140926.1| (PREDICTED: SUN domain-containing protein 2 [Cucumis sativus])

HSP 1 Score: 756.5 bits (1952), Expect = 2.7e-215
Identity = 394/472 (83.47%), Positives = 431/472 (91.31%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGGI 60
           MSASTVSITANPA RRRPVL SEKKGAS ELLATDG NPLSNTAT GT GAADDK+AG  
Sbjct: 1   MSASTVSITANPATRRRPVLASEKKGASFELLATDGLNPLSNTATLGTVGAADDKLAGAN 60

Query: 61  GRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISPRRSRKAITKPEKPRWATILSVVTK 120
           GRDMSHHSIRGEVVLERSSRDP+QIKK VANSTISPRRSRK ITKPEKPRW TI+SV+TK
Sbjct: 61  GRDMSHHSIRGEVVLERSSRDPIQIKKAVANSTISPRRSRKVITKPEKPRWVTIVSVLTK 120

Query: 121 NIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQVEV 180
           N VLL+VLLGL QM+RKLALKSG+G VGNQMGFSEVEGRIAEVEA LKTT+KM+QVQVEV
Sbjct: 121 NGVLLLVLLGLAQMVRKLALKSGEGEVGNQMGFSEVEGRIAEVEALLKTTSKMLQVQVEV 180

Query: 181 VDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQEFD 240
           VDRKIENEVGGLRREV+ KI+ KTADL+SGLK L++KGE+LERSLSEL   DWLSKQEFD
Sbjct: 181 VDRKIENEVGGLRREVNKKIDEKTADLDSGLKKLENKGEELERSLSELKTGDWLSKQEFD 240

Query: 241 RIYDELKKAKSGEIDE-RFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAMVV 300
           +IY+ELKK K+GE DE RFANLD+IRA ARE++E EI+KHAADGLGRVDYA+ASGGAMVV
Sbjct: 241 KIYEELKKTKNGEFDEQRFANLDEIRASAREMIEREIQKHAADGLGRVDYAVASGGAMVV 300

Query: 301 KHSEPFTGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTAIVP 360
           KHS+P+ GRTSNWF +N RNGVH +A+K+LKPSFGEPGQCF LKGSSGFVQIRLR AIVP
Sbjct: 301 KHSDPYRGRTSNWFLKNVRNGVHSDADKLLKPSFGEPGQCFALKGSSGFVQIRLRAAIVP 360

Query: 361 EAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKSNAQ 420
           EAITLEHVAKSVA+DR+SAPK CRVSGWFQG +   SA N EKMF LA+FTYDLEKSNAQ
Sbjct: 361 EAITLEHVAKSVAFDRTSAPKDCRVSGWFQGKN-PNSAINGEKMFPLAKFTYDLEKSNAQ 420

Query: 421 TFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           TF+VV+TTGSGLVDMIRLDFSSNHG+PSHTCIYR+RVHGHEPYSVSMMA QS
Sbjct: 421 TFDVVDTTGSGLVDMIRLDFSSNHGNPSHTCIYRMRVHGHEPYSVSMMAIQS 471

BLAST of Cp4.1LG01g21450 vs. NCBI nr
Match: gi|568834331|ref|XP_006471288.1| (PREDICTED: protein SAD1/UNC-84 domain protein 1 [Citrus sinensis])

HSP 1 Score: 585.5 bits (1508), Expect = 8.3e-164
Identity = 314/475 (66.11%), Positives = 382/475 (80.42%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGAS--IELLATDGANPLSNTATAGTGGAADDKVAG 60
           MSAS VSITAN AARRRPV++++KK  +  IEL++ D   P  N       G  DDK   
Sbjct: 1   MSASAVSITANTAARRRPVVINDKKSNNNNIELVSVD---PQLN-------GVGDDKPTA 60

Query: 61  GIGRDMSHHSIRGEVVLERSSRDPLQIKKT-VANSTISPRRSRKAITKPEKPRWATILSV 120
              +D+SHHSIRGE V+++ +   +Q+KK+ +ANST+SPRRSRK+  KPEKPRWAT++S+
Sbjct: 61  AQSKDLSHHSIRGEAVVDKDTL--VQVKKSGLANSTVSPRRSRKSSPKPEKPRWATVVSI 120

Query: 121 VTKNIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQ 180
            TKN +LL+ +LGL QMIR++ LKSGD A G ++GFSE E RI EVE FLKTTTKM+Q+Q
Sbjct: 121 FTKNFLLLVAVLGLGQMIRRVYLKSGDSA-GAELGFSEFERRITEVENFLKTTTKMMQLQ 180

Query: 181 VEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQ 240
           VEV+DRK+E+E+GGLRREV   I+ K+  LE  LK L++K E LER+LSEL AVDWLSK+
Sbjct: 181 VEVLDRKVESEMGGLRREVSKSIDDKSVILERELKKLEEKSEGLERTLSELKAVDWLSKE 240

Query: 241 EFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGAM 300
           EF++ ++E KK KSGE+ E   +LDDIR YARE+VE EIEKHAADGLGRVDYALA+ GA 
Sbjct: 241 EFEKFFEEFKKQKSGELSENDVSLDDIRVYAREIVEKEIEKHAADGLGRVDYALATSGAF 300

Query: 301 VVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRTA 360
           V+KHS+ +  G+ SNW S ++RNGVH  A+KMLKPSFGEPGQCFPLKGSSGFVQI+LRTA
Sbjct: 301 VIKHSDAYLAGKGSNWLSLSSRNGVHSYADKMLKPSFGEPGQCFPLKGSSGFVQIKLRTA 360

Query: 361 IVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEKS 420
           I+PEAITLEHVAKSVAYDRSSAPK CRVSGW QGDD +  A  AEKMFLL EFTYDL+KS
Sbjct: 361 IIPEAITLEHVAKSVAYDRSSAPKDCRVSGWLQGDD-SDLAVGAEKMFLLTEFTYDLDKS 420

Query: 421 NAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           NAQTFNV++  GSG+VD +RLDF+SNHGS SHTCIYR+RVHG EP SVS++A QS
Sbjct: 421 NAQTFNVLDLPGSGVVDTVRLDFTSNHGSSSHTCIYRLRVHGREPDSVSVLAMQS 461

BLAST of Cp4.1LG01g21450 vs. NCBI nr
Match: gi|802795470|ref|XP_012092514.1| (PREDICTED: SUN domain-containing protein 3-like [Jatropha curcas])

HSP 1 Score: 582.0 bits (1499), Expect = 9.2e-163
Identity = 315/481 (65.49%), Positives = 376/481 (78.17%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATD-----GANPLSNTATAGTGGAADDK 60
           MSASTVSITAN A RRRPV+  EKK  +IELL  +     G N + N         A+DK
Sbjct: 1   MSASTVSITANHAGRRRPVVAGEKKSTNIELLPNEAQINGGDNAIKN---------ANDK 60

Query: 61  VAGGIGRDMSHHSIRGEVVLERSSRDPLQIKKT-VANSTISPRRSRKAITKPEKPRWATI 120
           +     +D+SHHSIRGE VLERS++D  Q+KK  + NSTISPRRSRK + KPEKP W T+
Sbjct: 61  LVASHSKDLSHHSIRGEAVLERSTKDTTQVKKNAMVNSTISPRRSRKMVAKPEKPMWQTV 120

Query: 121 LSVVTKNIVLLMVLLGLVQMIRKLALKSGD--GAVGN-QMGFSEVEGRIAEVEAFLKTTT 180
           +SV TKN  LL+VL+GLVQMIR+LA+KSGD     G  Q+G SE E RIAEVE+F KTT 
Sbjct: 121 VSVFTKNFFLLLVLIGLVQMIRRLAMKSGDYHSISGTAQIGASEFESRIAEVESFFKTTA 180

Query: 181 KMIQVQVEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAV 240
           KMIQ+QVEVVD K+ NEV GLR+E+D KIE K   L+SGLK +  + E LE+S+SELTAV
Sbjct: 181 KMIQLQVEVVDAKVGNEVEGLRKEMDKKIEEKAELLDSGLKQIVARNEQLEKSISELTAV 240

Query: 241 DWLSKQEFDRIYDELKKAKSGEIDERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYAL 300
           DWLSK++F  IY+ELKK K  E  E   +LDDIRAYAR++VE EIEKHAADGLGRVDYAL
Sbjct: 241 DWLSKEDFKMIYEELKKGKGNEFGESDISLDDIRAYARDIVEKEIEKHAADGLGRVDYAL 300

Query: 301 ASGGAMVVKHSEPF-TGRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQ 360
           ASGGA VVKHSEP+ TG+ SNWF  ++R G H +A KMLKPSFGEPGQCFPLKGSSGFVQ
Sbjct: 301 ASGGASVVKHSEPYITGKGSNWFLMSSRGGAHPDAVKMLKPSFGEPGQCFPLKGSSGFVQ 360

Query: 361 IRLRTAIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFT 420
           I+LRTAIVPEA+TLEHVAK+VAYDRSSAPK CRVSGW QG D+  +  ++EKMFLL+EF+
Sbjct: 361 IKLRTAIVPEAVTLEHVAKNVAYDRSSAPKDCRVSGWLQGHDMDLT-IDSEKMFLLSEFS 420

Query: 421 YDLEKSNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQ 472
           YDLEKSNAQT+ V+++  S +VD +RLDF SNHGS SHTCIYR+RVHG+EP SVS++  +
Sbjct: 421 YDLEKSNAQTYAVLDSAASSVVDTVRLDFISNHGSSSHTCIYRLRVHGYEPDSVSVVTVE 471

BLAST of Cp4.1LG01g21450 vs. NCBI nr
Match: gi|703079999|ref|XP_010091304.1| (hypothetical protein L484_012197 [Morus notabilis])

HSP 1 Score: 572.0 bits (1473), Expect = 9.5e-160
Identity = 314/476 (65.97%), Positives = 376/476 (78.99%), Query Frame = 1

Query: 1   MSASTVSITANPAARRRPVLVSEKKGASIELLATDGANPLSNTATAGTGGAADDKVAGGI 60
           MSASTVSITANPA RRR V+  EKK +S+EL+A   A P           A DDK     
Sbjct: 1   MSASTVSITANPATRRRTVVAVEKK-SSVELVA---AEPQFK--------AGDDKSVAAN 60

Query: 61  GRDMSHHSIRGEVVLERSSRDPLQIKKTVANSTISP---RRSRKAITKPEKPRWATILSV 120
           GRD+S+HSIRG+ VLERSSRD + +KKT ANST+SP   RRSRK +  P KPRW T+LSV
Sbjct: 61  GRDLSNHSIRGDAVLERSSRDAVPVKKTAANSTVSPPSNRRSRKPVAAP-KPRWLTVLSV 120

Query: 121 VTKNIVLLMVLLGLVQMIRKLALKSGDGAVGNQMGFSEVEGRIAEVEAFLKTTTKMIQVQ 180
            TKN VLL++L+GLVQ++R+LAL+S DG  G     S+ EGRIAEVE F+KTT KMIQVQ
Sbjct: 121 FTKNFVLLVLLVGLVQIVRRLALRSSDG--GGPFALSDFEGRIAEVEKFVKTTAKMIQVQ 180

Query: 181 VEVVDRKIENEVGGLRREVDHKIEAKTADLESGLKVLQDKGEDLERSLSELTAVDWLSKQ 240
           VEVVDRK+++EVGGLR EV  KIE K+  LES LK L+ K E LERSL E   ++W+SK+
Sbjct: 181 VEVVDRKVDSEVGGLR-EVGKKIEEKSVLLESQLKELEAKSEGLERSLGEFKDINWISKE 240

Query: 241 EFDRIYDELKKAKSGEID-ERFANLDDIRAYAREVVEMEIEKHAADGLGRVDYALASGGA 300
           EFD+IY+ELKKA+S E   +   NL+DIRAYAR+VV  EIE+HAADGL R DYALA+GGA
Sbjct: 241 EFDKIYEELKKARSDEFGVDDGTNLNDIRAYARDVVLKEIERHAADGLARADYALATGGA 300

Query: 301 MVVKHSEPFT-GRTSNWFSRNARNGVHRNANKMLKPSFGEPGQCFPLKGSSGFVQIRLRT 360
           MVVKHSEPF  G+ +NWF + A NGVH +A KMLKPSFGEPGQCFPLKGSSGFV+I+LRT
Sbjct: 301 MVVKHSEPFLKGKGNNWFGKGATNGVHNDAEKMLKPSFGEPGQCFPLKGSSGFVEIKLRT 360

Query: 361 AIVPEAITLEHVAKSVAYDRSSAPKACRVSGWFQGDDVAASAANAEKMFLLAEFTYDLEK 420
           AI+PEAITLEHVAKSVA+DRSSAPK CR+SGW QG +   S ++A ++FLLAEFTYDLEK
Sbjct: 361 AIIPEAITLEHVAKSVAFDRSSAPKNCRISGWLQGQN-TESTSDALRIFLLAEFTYDLEK 420

Query: 421 SNAQTFNVVETTGSGLVDMIRLDFSSNHGSPSHTCIYRVRVHGHEPYSVSMMATQS 472
           SNAQT+NV+++  S +VD +R DF+SNHGSPSHTCIYR+RVHGHEP SV+M+A QS
Sbjct: 421 SNAQTYNVLDSASSSIVDTVRFDFTSNHGSPSHTCIYRLRVHGHEPESVAMIAMQS 459
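
The HSP summary lines above follow a fixed pattern, so the reported percentages can be checked mechanically. The following is a minimal Python sketch, not part of the database's own tooling; the names `IDENTITY_RE`, `parse_hsp_identity` and `recomputed_percent` are hypothetical and introduced only for illustration. It recomputes percent identity as identical positions divided by alignment length and prints it next to the reported value.

```python
import re

# Hypothetical pattern for fields such as "Identity = 403/471 (85.56%)".
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_identity(line):
    """Return (matches, aligned_length, reported_percent) from an HSP summary line."""
    m = IDENTITY_RE.search(line)
    if m is None:
        raise ValueError("no Identity field found")
    return int(m.group(1)), int(m.group(2)), float(m.group(3))

def recomputed_percent(matches, aligned_length):
    """Percent identity = identical positions / alignment length * 100, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Identity field taken from the Cucumis melo hit above.
line = "Identity = 403/471 (85.56%)"
matches, length, reported = parse_hsp_identity(line)
print(matches, length, reported, recomputed_percent(matches, length))
```

The same check can be applied to the other four hits by passing their Identity fields to `parse_hsp_identity`.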

The following BLAST results are available for this feature:
Match Name    E-value     Identity (%)    Description
SUN1_ARATH    1.3e-116    50.93           Protein SAD1/UNC-84 domain protein 1 OS=Arabidopsis thaliana GN=SUN1 PE=1 SV=1
SUN2_ARATH    9.4e-107    49.48           Protein SAD1/UNC-84 domain protein 2 OS=Arabidopsis thaliana GN=SUN2 PE=1 SV=1
SUN2_MOUSE    2.1e-18     26.18           SUN domain-containing protein 2 OS=Mus musculus GN=Sun2 PE=1 SV=3
SUN2_HUMAN    3.7e-18     28.49           SUN domain-containing protein 2 OS=Homo sapiens GN=SUN2 PE=1 SV=3
SUN1_HUMAN    4.8e-18     32.04           SUN domain-containing protein 1 OS=Homo sapiens GN=SUN1 PE=1 SV=3

Match Name          E-value     Identity (%)    Description
A0A0A0KCH3_CUCSA    1.9e-215    83.47           Uncharacterized protein OS=Cucumis sativus GN=Csa_6G051490 PE=4 SV=1
A0A067J9Z6_JATCU    6.4e-163    65.49           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04204 PE=4 SV=1
W9R9S3_9ROSA        6.7e-160    65.97           Uncharacterized protein OS=Morus notabilis GN=L484_012197 PE=4 SV=1
A0A151QPL8_CAJCA    6.4e-155    64.00           Protein unc-84 isogeny B OS=Cajanus cajan GN=KK1_047108 PE=4 SV=1
M5X263_PRUPE        7.9e-153    63.60           Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003932mg PE=4 SV=1

Match Name     E-value     Identity (%)    Description
AT5G04990.1    7.4e-118    50.93           SAD1/UNC-84 domain protein 1
AT3G10730.1    5.3e-108    49.48           SAD1/UNC-84 domain protein 2

Match Name                         E-value     Identity (%)    Description
gi|659113576|ref|XP_008456648.1|   3.0e-222    85.56           PREDICTED: SUN domain-containing protein 2 [Cucumis melo]
gi|449446333|ref|XP_004140926.1|   2.7e-215    83.47           PREDICTED: SUN domain-containing protein 2 [Cucumis sativus]
gi|568834331|ref|XP_006471288.1|   8.3e-164    66.11           PREDICTED: protein SAD1/UNC-84 domain protein 1 [Citrus sinensis]
gi|802795470|ref|XP_012092514.1|   9.2e-163    65.49           PREDICTED: SUN domain-containing protein 3-like [Jatropha curcas]
gi|703079999|ref|XP_010091304.1|   9.5e-160    65.97           hypothetical protein L484_012197 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR012919    SUN_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG01g21450.1    Cp4.1LG01g21450.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description     Source     Source Term       Source Description                  Alignment
IPR012919    SUN domain          PFAM       PF07738           Sad1_UNC                            coord: 323..459; score: 5.8
IPR012919    SUN domain          PROFILE    PS51469           SUN                                 coord: 293..461; score: 38
None         No IPR available    unknown    Coil              Coil                                coord: 211..231; scor
None         No IPR available    PANTHER    PTHR12911         SAD1/UNC-84-LIKE PROTEIN-RELATED    coord: 61..470; score: 2.0E
None         No IPR available    PANTHER    PTHR12911:SF27    SUBFAMILY NOT NAMED                 coord: 61..470; score: 2.0E
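
The `coord: start..end` entries above give residue ranges on the predicted protein. As a rough illustration, the Python sketch below turns such a range into a length and checks whether one hit lies inside another. It assumes the coordinates are 1-based and inclusive, which the page itself does not state; the helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

def parse_coord(text):
    """Parse 'coord: 323..459' into (start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coord field found")
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Number of residues covered by an inclusive range."""
    return end - start + 1

def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

pfam = parse_coord("coord: 323..459")     # PFAM PF07738 hit
profile = parse_coord("coord: 293..461")  # PROFILE PS51469 hit
print(span_length(*pfam), span_length(*profile), contains(profile, pfam))
```

Under that assumption, the PFAM SUN-domain hit covers 137 residues and falls entirely within the 169-residue PROFILE hit.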