Cp4.1LG15g07710 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG15g07710
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionFASCICLIN-like arabinogalactan protein 21, putative
LocationCp4.1LG15 : 7882855 .. 7883934 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTAAATGTTCATTCGAATGGTGGCATGCACCCATCTACTTCGCCGTCTCTGTAGTCTTAGCCTTCTTCGCCATCTCCACAGCTCTGAACTCCACGGCTACTTCCCACTCTGCAACTCCGCCGATCAAAGCCGTCGCCGACGAACTCTCCATAAACGCATCACGAGCCTTAAGAAGAGCCGGTTTCAACACCATTGCCACGCTCCTTCAGGTCTCCCCTGAACACTTCTTCTCCCCTCAAAACTCCACCATTTTCGCCATTAAAGATTCCGCCATCTCCAACACCTCTCTACCTCCATGGCTCCTCAAGAATCTTGTTCAATACCACACTTCCCCTTCTAAGCTTTCTATGGCTGACTTGTTAAAGAAGCCTCAAGGGGTCTGTCTACCGACCCTTTTGGCGCCAAAGAAAATTGCTATTACTAGAATGGATTCCACTGCTAGATTGGTTGAGATCAATCATGTTCTTGTTACAGACCCTGATATTTTTCTTGGTGGAAATGTTTCCATTCATGGCGTTCTTGGGCCATTTTCTCCTTTGGATCCTCTCGATGTTCATCAGGGATGGAGCTTTATTCAAGCGCCGTTCTGTGATTCAACCGCTACTTTAATTTCAGATTCTCTTGAAAACAAGAAGGGGGTTGAAGTTGAATGGAGAAGAATCATACGGTGGCTCAGTGCAAATGGGTTTGTTTCTTACGCAATCGGATTGCAATCTGTTCTCGAAGGGATTCTTCAAGATTTCGAAGGATTGAGATCGATTACCGTTTTTGCACCGCCGAATTTGGCCTCTGTGGCATCTTCATCGCCTGTTCTGACGCGAGCAGTGAGGCTTCACATTGTTCCTCGAATGGTTACATACAAATTCCTCGCTTCGTTGCCGGCCAGAACTTCGTTCAAGACGCTCGTCTCCGGCCAAGATCTTCAGGTCCTCGGAGGTGTTCGGGTTCCGAGAGGAACGGTGGTCGTTAATGGCGTGGAGATTGTCTCGCCGGAGATTTTCCGGTCAAGGAACTGTGTGATCCATGGGATTTCCCGGTCGCTTGAGACCGCCAGTTTACCTCATTTATCAAGGTGA

mRNA sequence

ATGGGTAAATGTTCATTCGAATGGTGGCATGCACCCATCTACTTCGCCGTCTCTGTAGTCTTAGCCTTCTTCGCCATCTCCACAGCTCTGAACTCCACGGCTACTTCCCACTCTGCAACTCCGCCGATCAAAGCCGTCGCCGACGAACTCTCCATAAACGCATCACGAGCCTTAAGAAGAGCCGGTTTCAACACCATTGCCACGCTCCTTCAGGTCTCCCCTGAACACTTCTTCTCCCCTCAAAACTCCACCATTTTCGCCATTAAAGATTCCGCCATCTCCAACACCTCTCTACCTCCATGGCTCCTCAAGAATCTTGTTCAATACCACACTTCCCCTTCTAAGCTTTCTATGGCTGACTTGTTAAAGAAGCCTCAAGGGGTCTGTCTACCGACCCTTTTGGCGCCAAAGAAAATTGCTATTACTAGAATGGATTCCACTGCTAGATTGGTTGAGATCAATCATGTTCTTGTTACAGACCCTGATATTTTTCTTGGTGGAAATGTTTCCATTCATGGCGTTCTTGGGCCATTTTCTCCTTTGGATCCTCTCGATGTTCATCAGGGATGGAGCTTTATTCAAGCGCCGTTCTGTGATTCAACCGCTACTTTAATTTCAGATTCTCTTGAAAACAAGAAGGGGGTTGAAGTTGAATGGAGAAGAATCATACGGTGGCTCAGTGCAAATGGGTTTGTTTCTTACGCAATCGGATTGCAATCTGTTCTCGAAGGGATTCTTCAAGATTTCGAAGGATTGAGATCGATTACCGTTTTTGCACCGCCGAATTTGGCCTCTGTGGCATCTTCATCGCCTGTTCTGACGCGAGCAGTGAGGCTTCACATTGTTCCTCGAATGGTTACATACAAATTCCTCGCTTCGTTGCCGGCCAGAACTTCGTTCAAGACGCTCGTCTCCGGCCAAGATCTTCAGGTCCTCGGAGGTGTTCGGGTTCCGAGAGGAACGGTGGTCGTTAATGGCGTGGAGATTGTCTCGCCGGAGATTTTCCGGTCAAGGAACTGTGTGATCCATGGGATTTCCCGGTCGCTTGAGACCGCCAGTTTACCTCATTTATCAAGGTGA

Coding sequence (CDS)

ATGGGTAAATGTTCATTCGAATGGTGGCATGCACCCATCTACTTCGCCGTCTCTGTAGTCTTAGCCTTCTTCGCCATCTCCACAGCTCTGAACTCCACGGCTACTTCCCACTCTGCAACTCCGCCGATCAAAGCCGTCGCCGACGAACTCTCCATAAACGCATCACGAGCCTTAAGAAGAGCCGGTTTCAACACCATTGCCACGCTCCTTCAGGTCTCCCCTGAACACTTCTTCTCCCCTCAAAACTCCACCATTTTCGCCATTAAAGATTCCGCCATCTCCAACACCTCTCTACCTCCATGGCTCCTCAAGAATCTTGTTCAATACCACACTTCCCCTTCTAAGCTTTCTATGGCTGACTTGTTAAAGAAGCCTCAAGGGGTCTGTCTACCGACCCTTTTGGCGCCAAAGAAAATTGCTATTACTAGAATGGATTCCACTGCTAGATTGGTTGAGATCAATCATGTTCTTGTTACAGACCCTGATATTTTTCTTGGTGGAAATGTTTCCATTCATGGCGTTCTTGGGCCATTTTCTCCTTTGGATCCTCTCGATGTTCATCAGGGATGGAGCTTTATTCAAGCGCCGTTCTGTGATTCAACCGCTACTTTAATTTCAGATTCTCTTGAAAACAAGAAGGGGGTTGAAGTTGAATGGAGAAGAATCATACGGTGGCTCAGTGCAAATGGGTTTGTTTCTTACGCAATCGGATTGCAATCTGTTCTCGAAGGGATTCTTCAAGATTTCGAAGGATTGAGATCGATTACCGTTTTTGCACCGCCGAATTTGGCCTCTGTGGCATCTTCATCGCCTGTTCTGACGCGAGCAGTGAGGCTTCACATTGTTCCTCGAATGGTTACATACAAATTCCTCGCTTCGTTGCCGGCCAGAACTTCGTTCAAGACGCTCGTCTCCGGCCAAGATCTTCAGGTCCTCGGAGGTGTTCGGGTTCCGAGAGGAACGGTGGTCGTTAATGGCGTGGAGATTGTCTCGCCGGAGATTTTCCGGTCAAGGAACTGTGTGATCCATGGGATTTCCCGGTCGCTTGAGACCGCCAGTTTACCTCATTTATCAAGGTGA

Protein sequence

MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRRAGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMADLLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSPLDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQSVLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSFKTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLPHLSR
BLAST of Cp4.1LG15g07710 vs. Swiss-Prot
Match: FLA21_ARATH (Fasciclin-like arabinogalactan protein 21 OS=Arabidopsis thaliana GN=FLA21 PE=2 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 1.2e-74
Identity = 168/364 (46.15%), Postives = 227/364 (62.36%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATP-PIKAVADELSINASRALR 60
           MG CS + +   +YF +S+ LAF AIST L S   S    P    + +  LS+NAS  LR
Sbjct: 1   MGCCSSDCF---VYFILSIALAFMAISTTLRSPPDSEPTIPIAFSSSSPSLSLNASNTLR 60

Query: 61  RAGFNTIATLLQVSPEHFFSPQ-NSTIFAIKDSAISNTS-LPPWLLKNLVQYHTSPSKLS 120
           ++ F  IATLL +SPE F S   N+T+FAI+D++  NTS L P  LK L+ YHT P  LS
Sbjct: 61  QSNFKAIATLLHISPEIFLSSSPNTTLFAIEDASFFNTSSLHPLFLKQLLHYHTLPLMLS 120

Query: 121 MADLLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGP 180
           M DLLKKPQG CLPTLL  K + I+ ++  +R  E+NHV +T PD+FLG ++ IHGV+GP
Sbjct: 121 MDDLLKKPQGTCLPTLLHHKSVQISTVNQESRTAEVNHVRITHPDMFLGDSLVIHGVIGP 180

Query: 181 FSPLDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIG 240
           FSPL P   H     I  P C S  T  + + E +  V ++W RI++ LS+NGFV +AIG
Sbjct: 181 FSPLQPHSDH----LIHTPLCQSDTTNKTSNNE-EVPVSIDWTRIVQLLSSNGFVPFAIG 240

Query: 241 LQSVLEGILQD---FEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASL 300
           L SVL  I+ D    + L  +T+ A PNL S++S+SP L   VR HI+ + +TYK  AS+
Sbjct: 241 LHSVLNRIVNDHNHHKNLTGVTILATPNLVSLSSASPFLYEVVRHHILVQRLTYKDFASM 300

Query: 301 PARTSFKTLVSGQDLQVL-GGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETAS 358
             + + KTL   QDL +    V    G  +++GVEIV P++F S N VIHGIS +LE   
Sbjct: 301 SDKATVKTLDPYQDLTITRRNVNSSGGDFMISGVEIVDPDMFSSSNFVIHGISHTLE--- 353

BLAST of Cp4.1LG15g07710 vs. TrEMBL
Match: A0A0A0K1P3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G045490 PE=4 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 6.2e-174
Identity = 312/364 (85.71%), Postives = 335/364 (92.03%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M KCSFEWWHAPI F++SVVLAFFAISTAL+S+ TSHS TPP K++AD+LS+NASRALRR
Sbjct: 1   MAKCSFEWWHAPIVFSISVVLAFFAISTALHSS-TSHSPTPPNKSMADDLSLNASRALRR 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSP KLSMAD
Sbjct: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPVKLSMAD 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           LLKKP+GVCLPTLL PKKIAIT+MDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP
Sbjct: 121 LLKKPRGVCLPTLLMPKKIAITKMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKK-----GVEVEWRRIIRWLSANGFVSYA 240
           LDPLD+ QGWSFIQ+P+CD+ AT+ISD  E        GVEVEWRRIIRWLSANGF+SYA
Sbjct: 181 LDPLDLRQGWSFIQSPYCDTNATMISDPFETNNGVVGVGVEVEWRRIIRWLSANGFISYA 240

Query: 241 IGLQSVLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLP 300
           IGLQ+VLEG+LQDFEGLRSITVFAPPNL+SVAS SPVL RAVRLHIVP+MVTYK LASLP
Sbjct: 241 IGLQTVLEGLLQDFEGLRSITVFAPPNLSSVASPSPVLNRAVRLHIVPQMVTYKSLASLP 300

Query: 301 ARTSFKTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLP 360
            RTS KTLVSGQD+++LGGVRVPRGTV VNGVEIVSPEIFRS NCVIHGISRSLE A LP
Sbjct: 301 TRTSLKTLVSGQDIEILGGVRVPRGTVKVNGVEIVSPEIFRSENCVIHGISRSLEIAGLP 360

BLAST of Cp4.1LG15g07710 vs. TrEMBL
Match: B9I6K5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s14840g PE=4 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 5.2e-104
Identity = 198/359 (55.15%), Postives = 262/359 (72.98%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M  CS  WWHAP+YF  S VLAF AISTA+NS   S++AT P +  ++ LS+NASR LR 
Sbjct: 1   MASCS-HWWHAPVYFIASAVLAFIAISTAMNSP--SNNATRPTRPTSNYLSLNASRTLRE 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           +GFN +ATLL +SPE FF   N+TIFAIKDS++ NTSLPPW LKNL+QYHTSP KLSM D
Sbjct: 61  SGFNIMATLLLISPEMFFLSPNTTIFAIKDSSLVNTSLPPWFLKNLLQYHTSPLKLSMED 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           + KKPQG C PTL+  KK+A+T++D+  RL EINHVLV+ PD+ L   ++IHGVL PFS 
Sbjct: 121 VFKKPQGSCFPTLVDRKKLAVTKIDAKERLAEINHVLVSHPDMVLERRITIHGVLAPFSS 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQS 240
           L   DV+ GW  IQAP CD+ ++L+SD+  N   + +EW RII  LS++ FVS+AIGL S
Sbjct: 181 LRSKDVYFGWESIQAPICDANSSLVSDA--NGPRIILEWTRIIHLLSSHRFVSFAIGLNS 240

Query: 241 VLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSF 300
           VL+ IL D + L S+T+FAPP L  VASSSP+L + VRLHI+P+  TY  LA+LP +   
Sbjct: 241 VLDRILADHKNLSSVTIFAPPELEFVASSSPMLEKIVRLHILPQRATYIELAALPDKQRL 300

Query: 301 KTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLPHLSR 360
           +TL+  +DL++  GV V +G + +NGVEI +PEIF S+  ++HGI+++ + A  P+ SR
Sbjct: 301 RTLLPDEDLKITKGVGVTQG-LAINGVEIAAPEIFSSKEFIVHGITQAFKIAKFPNASR 353

BLAST of Cp4.1LG15g07710 vs. TrEMBL
Match: M5X5K8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007875mg PE=4 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 2.2e-102
Identity = 198/351 (56.41%), Postives = 259/351 (73.79%), Query Frame = 1

Query: 9   WHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRRAGFNTIAT 68
           WHA +YF ++V+LAF AIS  L S A  +  T   K ++  +S+NASR LRRAGFN +AT
Sbjct: 7   WHAAVYFTMAVILAFIAISMTLRSEA--NDETSQTKPISHIVSLNASRTLRRAGFNIMAT 66

Query: 69  LLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMADLLKKPQGV 128
           LLQVSPE FFS  N+T+FA+KDSAISN +LPP LLKNL+QYHTSP +LSM DLL KPQG 
Sbjct: 67  LLQVSPELFFSSANATLFAVKDSAISNATLPPRLLKNLLQYHTSPLQLSMEDLLSKPQGA 126

Query: 129 CLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSPLDPLDVHQ 188
           CLPTL   K IAIT++D   R VEIN+VLV+ P++FL G +SIHGVLGPFS LDP DV++
Sbjct: 127 CLPTLYQQKSIAITKVDEKERSVEINNVLVSHPNLFLEGPISIHGVLGPFSALDPRDVYR 186

Query: 189 GWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQSVLEGILQD 248
           GW  IQ+P CDS + +ISD   + K + VEW RIIR L++NGFVS+AIGL SVL+GIL D
Sbjct: 187 GWDIIQSPACDSNSNMISDVPPDLKNM-VEWTRIIRLLNSNGFVSFAIGLHSVLDGILGD 246

Query: 249 FEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSFKTLVSGQD 308
            +GL+S+T+F PP+L   A  +P+L + VR H++P+  T + L SL  RT  +TL+ GQ 
Sbjct: 247 HKGLKSVTIFVPPSLELEAYPTPLLEKIVRFHVLPQKFTNRELESLAPRTLLRTLLHGQP 306

Query: 309 LQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLPHLSR 360
           L+V G V   +G +V++GV IV+P++F S   ++HGISR+LE   LP+ +R
Sbjct: 307 LEVTGAVDFMKG-LVISGVNIVAPDMFSSSKFIVHGISRALELDDLPNTAR 353

BLAST of Cp4.1LG15g07710 vs. TrEMBL
Match: A0A067DY61_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g042381mg PE=4 SV=1)

HSP 1 Score: 372.9 bits (956), Expect = 4.5e-100
Identity = 207/346 (59.83%), Postives = 255/346 (73.70%), Query Frame = 1

Query: 4   CSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRRAGF 63
           CS E W A  +F +SVVLA  AIS +L++    + + P  K V  +L  NAS+ALRR+GF
Sbjct: 5   CS-ECWRALAFFTISVVLACMAISMSLHAIPKDNGS-PSTKPVNYQLFSNASKALRRSGF 64

Query: 64  NTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMADLLK 123
           N IATLLQVSPE F S  NSTIFAI+DSAISNTSLPPWL K L+QYHTSP KLSM DLL 
Sbjct: 65  NIIATLLQVSPEIFLSSHNSTIFAIQDSAISNTSLPPWLFKKLLQYHTSPLKLSMNDLLM 124

Query: 124 KPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSPLDP 183
           KPQG CLPT L  KK+AIT++    RL+EIN+VLV+ PDIFL G++SIHGVL PFS LDP
Sbjct: 125 KPQGSCLPTFLHQKKVAITKIVVKERLIEINNVLVSRPDIFLEGSLSIHGVLEPFSSLDP 184

Query: 184 LDVHQGWSFIQAPFCDS-TATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQSVL 243
            ++H GW +IQ+P CDS ++TL+SD  E+K  V  EW +IIR LS+NGFVS+AIGL SV+
Sbjct: 185 QNIHPGWDYIQSPICDSFSSTLVSDITESKNMVN-EWTKIIRLLSSNGFVSFAIGLHSVI 244

Query: 244 EGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSFKT 303
           + IL+D   L S T+FAP + A VASSSP+L R VRLHI+P+  TYK LASLP +T  KT
Sbjct: 245 DQILEDNINLNSTTIFAPADFAVVASSSPLLDRIVRLHILPQRFTYKELASLPGKTLLKT 304

Query: 304 LVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRS 349
           LV  Q L + GG    +G   +NGV+I +PEIF S+  VIHGIS++
Sbjct: 305 LVPNQYLVISGGADFIQG-FDINGVQIFAPEIFSSKQFVIHGISQA 346

BLAST of Cp4.1LG15g07710 vs. TrEMBL
Match: W9R6F2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008733 PE=4 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 2.2e-99
Identity = 194/339 (57.23%), Postives = 249/339 (73.45%), Query Frame = 1

Query: 17  VSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRRAGFNTIATLLQVSPEH 76
           +SVVLAF AIST+L+S      +  P K    ELS+NAS+ALRRAGFN  ATL QVSPE 
Sbjct: 1   MSVVLAFIAISTSLHSKPGDPIS--PGKVAFHELSLNASKALRRAGFNFTATLFQVSPEI 60

Query: 77  FFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMADLLKKPQGVCLPTLLAP 136
           F S  NSTIFAI+D  ISN+SLPPWLLK+L+QYHT P  L M +LLKKPQG CLPTL   
Sbjct: 61  FLSSPNSTIFAIQDQVISNSSLPPWLLKDLLQYHTCPLNLPMDELLKKPQGSCLPTLHRK 120

Query: 137 KKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSPLDPLDVHQGWSFIQAP 196
           K IAIT++D   + +EINHV+V+ P++FLG  +S+HGVL PFS LDP DVHQGW+ IQAP
Sbjct: 121 KNIAITKIDLEEKSIEINHVVVSHPNVFLGKTISVHGVLEPFSSLDPQDVHQGWNIIQAP 180

Query: 197 FCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQSVLEGILQDFEGLRSIT 256
            CDST+ L+SD LE+K    VEW  I+R LS+NGF+ +AIGL SVL+GIL+D+ GL S+T
Sbjct: 181 ICDSTSVLVSDVLESKN--MVEWSWIVRLLSSNGFIPFAIGLNSVLDGILEDYRGLESVT 240

Query: 257 VFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSFKTLVSGQDLQVLGGVR 316
           +FAPP+L+S+ S+SP+L R VR HI+P+ +TY+ LA+LP+     TL     ++V G   
Sbjct: 241 IFAPPSLSSLTSASPLLKRTVRFHILPQRLTYQELAALPSGMLLTTLSPDLHVEVRGTAS 300

Query: 317 VPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLP 356
             +G +VVNGV+I +P++F S+N  IHGISR+ E    P
Sbjct: 301 FEQG-LVVNGVKIAAPDMFSSKNFTIHGISRAFEVVQAP 334

BLAST of Cp4.1LG15g07710 vs. TAIR10
Match: AT5G06920.1 (AT5G06920.1 FASCICLIN-like arabinogalactan protein 21 precursor)

HSP 1 Score: 281.6 bits (719), Expect = 6.9e-76
Identity = 168/364 (46.15%), Postives = 227/364 (62.36%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATP-PIKAVADELSINASRALR 60
           MG CS + +   +YF +S+ LAF AIST L S   S    P    + +  LS+NAS  LR
Sbjct: 1   MGCCSSDCF---VYFILSIALAFMAISTTLRSPPDSEPTIPIAFSSSSPSLSLNASNTLR 60

Query: 61  RAGFNTIATLLQVSPEHFFSPQ-NSTIFAIKDSAISNTS-LPPWLLKNLVQYHTSPSKLS 120
           ++ F  IATLL +SPE F S   N+T+FAI+D++  NTS L P  LK L+ YHT P  LS
Sbjct: 61  QSNFKAIATLLHISPEIFLSSSPNTTLFAIEDASFFNTSSLHPLFLKQLLHYHTLPLMLS 120

Query: 121 MADLLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGP 180
           M DLLKKPQG CLPTLL  K + I+ ++  +R  E+NHV +T PD+FLG ++ IHGV+GP
Sbjct: 121 MDDLLKKPQGTCLPTLLHHKSVQISTVNQESRTAEVNHVRITHPDMFLGDSLVIHGVIGP 180

Query: 181 FSPLDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIG 240
           FSPL P   H     I  P C S  T  + + E +  V ++W RI++ LS+NGFV +AIG
Sbjct: 181 FSPLQPHSDH----LIHTPLCQSDTTNKTSNNE-EVPVSIDWTRIVQLLSSNGFVPFAIG 240

Query: 241 LQSVLEGILQD---FEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASL 300
           L SVL  I+ D    + L  +T+ A PNL S++S+SP L   VR HI+ + +TYK  AS+
Sbjct: 241 LHSVLNRIVNDHNHHKNLTGVTILATPNLVSLSSASPFLYEVVRHHILVQRLTYKDFASM 300

Query: 301 PARTSFKTLVSGQDLQVL-GGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETAS 358
             + + KTL   QDL +    V    G  +++GVEIV P++F S N VIHGIS +LE   
Sbjct: 301 SDKATVKTLDPYQDLTITRRNVNSSGGDFMISGVEIVDPDMFSSSNFVIHGISHTLE--- 353

BLAST of Cp4.1LG15g07710 vs. TAIR10
Match: AT1G30800.1 (AT1G30800.1 Fasciclin-like arabinogalactan family protein)

HSP 1 Score: 48.5 bits (114), Expect = 9.9e-06
Identity = 41/120 (34.17%), Postives = 58/120 (48.33%), Query Frame = 1

Query: 67  ATLLQVSPEHFFSPQNSTIFAIKD-----------SAISNTSLPPWLLKNLVQYHTSPSK 126
           A  L    + F  P ++TIF   D           S   N +  P  L   V YH  P +
Sbjct: 54  ADFLSAVDDQFGIPLSATIFIPSDFDSADISSSSSSTTGNNNANPRRLS--VAYHIVPQR 113

Query: 127 LSMADL-LKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGV 175
           LS  DL L KP    LPTLL    I +T  +++     ++ VLV++PD+FL  +++IHGV
Sbjct: 114 LSFTDLRLFKPLSR-LPTLLPGNTIVVT--NNSVPGYALDGVLVSEPDLFLSSSIAIHGV 168

BLAST of Cp4.1LG15g07710 vs. NCBI nr
Match: gi|778723911|ref|XP_004144575.2| (PREDICTED: fasciclin-like arabinogalactan protein 21 [Cucumis sativus])

HSP 1 Score: 618.2 bits (1593), Expect = 8.9e-174
Identity = 312/364 (85.71%), Postives = 335/364 (92.03%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M KCSFEWWHAPI F++SVVLAFFAISTAL+S+ TSHS TPP K++AD+LS+NASRALRR
Sbjct: 1   MAKCSFEWWHAPIVFSISVVLAFFAISTALHSS-TSHSPTPPNKSMADDLSLNASRALRR 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSP KLSMAD
Sbjct: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPVKLSMAD 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           LLKKP+GVCLPTLL PKKIAIT+MDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP
Sbjct: 121 LLKKPRGVCLPTLLMPKKIAITKMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKK-----GVEVEWRRIIRWLSANGFVSYA 240
           LDPLD+ QGWSFIQ+P+CD+ AT+ISD  E        GVEVEWRRIIRWLSANGF+SYA
Sbjct: 181 LDPLDLRQGWSFIQSPYCDTNATMISDPFETNNGVVGVGVEVEWRRIIRWLSANGFISYA 240

Query: 241 IGLQSVLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLP 300
           IGLQ+VLEG+LQDFEGLRSITVFAPPNL+SVAS SPVL RAVRLHIVP+MVTYK LASLP
Sbjct: 241 IGLQTVLEGLLQDFEGLRSITVFAPPNLSSVASPSPVLNRAVRLHIVPQMVTYKSLASLP 300

Query: 301 ARTSFKTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLP 360
            RTS KTLVSGQD+++LGGVRVPRGTV VNGVEIVSPEIFRS NCVIHGISRSLE A LP
Sbjct: 301 TRTSLKTLVSGQDIEILGGVRVPRGTVKVNGVEIVSPEIFRSENCVIHGISRSLEIAGLP 360

BLAST of Cp4.1LG15g07710 vs. NCBI nr
Match: gi|659110734|ref|XP_008455382.1| (PREDICTED: fasciclin-like arabinogalactan protein 21 [Cucumis melo])

HSP 1 Score: 613.2 bits (1580), Expect = 2.8e-172
Identity = 312/364 (85.71%), Postives = 330/364 (90.66%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M KCSFEWWHAPI F++SVVLAFFAISTAL+S+ TSH+ATPP K++ADELS+NASRALRR
Sbjct: 1   MAKCSFEWWHAPIVFSISVVLAFFAISTALHSS-TSHTATPPNKSMADELSLNASRALRR 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           AGFNTIATLLQVSPEHFF PQNSTIFAIKDSAISNTSLPPWLLKNLV+YHTSP KLSM D
Sbjct: 61  AGFNTIATLLQVSPEHFFFPQNSTIFAIKDSAISNTSLPPWLLKNLVKYHTSPFKLSMTD 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           LLKKPQG CLPTLL PKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP
Sbjct: 121 LLKKPQGACLPTLLMPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKK-----GVEVEWRRIIRWLSANGFVSYA 240
           LDPLDV QGWS IQ+P+CDS  T+ISD  E        GVEVEWRRIIRWLSANGFVSYA
Sbjct: 181 LDPLDVRQGWSLIQSPYCDSNGTIISDPFEPNNGVVGVGVEVEWRRIIRWLSANGFVSYA 240

Query: 241 IGLQSVLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLP 300
           IGLQ+VLEG+LQDF GLRSITVFAPPNL SV S SPVL RAVRLHIVP+MVTYK LASLP
Sbjct: 241 IGLQTVLEGLLQDFGGLRSITVFAPPNLVSVTSPSPVLNRAVRLHIVPQMVTYKSLASLP 300

Query: 301 ARTSFKTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLP 360
           ARTS KTLVSGQD+++LGGVRVPRGTVVVNGVEIVSPEIFRS NCVIHGISRSLE A LP
Sbjct: 301 ARTSLKTLVSGQDIEILGGVRVPRGTVVVNGVEIVSPEIFRSENCVIHGISRSLEIAGLP 360

BLAST of Cp4.1LG15g07710 vs. NCBI nr
Match: gi|1009127869|ref|XP_015880919.1| (PREDICTED: fasciclin-like arabinogalactan protein 21 [Ziziphus jujuba])

HSP 1 Score: 412.1 bits (1058), Expect = 9.6e-112
Identity = 215/355 (60.56%), Postives = 260/355 (73.24%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M  C   WW AP+Y  VSV+LAF AIST L+S   S+ A PP K + +ELS+NASRALR 
Sbjct: 1   MAACCTRWWRAPVYITVSVMLAFLAISTTLHSN--SNGALPPRKPITNELSVNASRALRE 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           AGFN  ATLL+VSPE F S +N+TIFAIKD AI  TSLPPWLLK+L+QYHTSP  L+M D
Sbjct: 61  AGFNVFATLLRVSPELFLSSRNATIFAIKDPAIPYTSLPPWLLKDLLQYHTSPLSLTMDD 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           LLK PQG CLPTL   K +A+TR+    RLVEINHV V+ P+IF GG +S+HGVLGPFSP
Sbjct: 121 LLKMPQGGCLPTLHREKNMALTRIHLKERLVEINHVFVSHPNIFFGGPISVHGVLGPFSP 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQS 240
           LDP DVHQGW  IQAP CDS +TL+SD LE+K    VEW  IIR L++NGFVS+AIGLQ 
Sbjct: 181 LDPQDVHQGWDIIQAPICDSNSTLVSDVLESKN--MVEWSWIIRLLTSNGFVSFAIGLQY 240

Query: 241 VLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSF 300
           VL+ +L+D  GL S+T+FAPPNL  +A  SP++ + VR HIVP+  TY+ LA LPA T  
Sbjct: 241 VLDDVLEDNRGLDSVTIFAPPNLPFLAFPSPLIKQIVRFHIVPQRFTYQELAGLPAGTLL 300

Query: 301 KTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLP 356
            TLV G  L V G V   RG +V+NGV+IV+P+IF S + +IHGISR+     LP
Sbjct: 301 MTLVPGLYLDVTGAVSFKRG-MVINGVQIVAPDIFSSDSFIIHGISRAFHMVQLP 350

BLAST of Cp4.1LG15g07710 vs. NCBI nr
Match: gi|743819904|ref|XP_011020998.1| (PREDICTED: fasciclin-like arabinogalactan protein 21 [Populus euphratica])

HSP 1 Score: 388.3 bits (996), Expect = 1.5e-104
Identity = 199/359 (55.43%), Postives = 263/359 (73.26%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M  CS  WWHAP+YF  S VLAF AISTALNS   S++AT P +  ++ LS+NASR LR 
Sbjct: 1   MASCS-HWWHAPVYFIASAVLAFIAISTALNSP--SNNATRPTRPTSNYLSLNASRTLRE 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           +GFN +ATLL +SPE FF   N+TIFAIKDS++ NTSLPPW LKNL+QYHTSP KLS+ D
Sbjct: 61  SGFNIMATLLSISPEMFFVSPNTTIFAIKDSSLVNTSLPPWFLKNLLQYHTSPLKLSIED 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           + KKPQG C PTL+  KK+A+T++D+  RL EINHVLV+ PD+ L   ++IHGVL PFS 
Sbjct: 121 VFKKPQGSCFPTLVDRKKLAVTKIDAKERLAEINHVLVSHPDMVLERRIAIHGVLAPFSS 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQS 240
           L   DV+ GW  IQAP CD+ ++L+SD+  N   + +EW RII  LS++ FVS+AIGL S
Sbjct: 181 LRSKDVYFGWESIQAPICDANSSLVSDA--NAPRIILEWTRIIHLLSSHRFVSFAIGLNS 240

Query: 241 VLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSF 300
           VL+ IL D + L S+T+FAPP L  VASSSP+L + VRLHI+P+  TY  LA+LP +   
Sbjct: 241 VLDRILADHKNLSSVTIFAPPELEFVASSSPMLEKIVRLHILPQRATYIELAALPDKQRL 300

Query: 301 KTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLPHLSR 360
           +TL+ G+DL++  GV V +G + ++GVEI +PEIF S+  ++HGI+R+ + A  P+ SR
Sbjct: 301 RTLLPGEDLEINKGVDVTQG-LAIDGVEIATPEIFSSKEFIVHGITRAFKMAKFPNASR 353

BLAST of Cp4.1LG15g07710 vs. NCBI nr
Match: gi|224126887|ref|XP_002319951.1| (hypothetical protein POPTR_0013s14840g [Populus trichocarpa])

HSP 1 Score: 386.0 bits (990), Expect = 7.4e-104
Identity = 198/359 (55.15%), Postives = 262/359 (72.98%), Query Frame = 1

Query: 1   MGKCSFEWWHAPIYFAVSVVLAFFAISTALNSTATSHSATPPIKAVADELSINASRALRR 60
           M  CS  WWHAP+YF  S VLAF AISTA+NS   S++AT P +  ++ LS+NASR LR 
Sbjct: 1   MASCS-HWWHAPVYFIASAVLAFIAISTAMNSP--SNNATRPTRPTSNYLSLNASRTLRE 60

Query: 61  AGFNTIATLLQVSPEHFFSPQNSTIFAIKDSAISNTSLPPWLLKNLVQYHTSPSKLSMAD 120
           +GFN +ATLL +SPE FF   N+TIFAIKDS++ NTSLPPW LKNL+QYHTSP KLSM D
Sbjct: 61  SGFNIMATLLLISPEMFFLSPNTTIFAIKDSSLVNTSLPPWFLKNLLQYHTSPLKLSMED 120

Query: 121 LLKKPQGVCLPTLLAPKKIAITRMDSTARLVEINHVLVTDPDIFLGGNVSIHGVLGPFSP 180
           + KKPQG C PTL+  KK+A+T++D+  RL EINHVLV+ PD+ L   ++IHGVL PFS 
Sbjct: 121 VFKKPQGSCFPTLVDRKKLAVTKIDAKERLAEINHVLVSHPDMVLERRITIHGVLAPFSS 180

Query: 181 LDPLDVHQGWSFIQAPFCDSTATLISDSLENKKGVEVEWRRIIRWLSANGFVSYAIGLQS 240
           L   DV+ GW  IQAP CD+ ++L+SD+  N   + +EW RII  LS++ FVS+AIGL S
Sbjct: 181 LRSKDVYFGWESIQAPICDANSSLVSDA--NGPRIILEWTRIIHLLSSHRFVSFAIGLNS 240

Query: 241 VLEGILQDFEGLRSITVFAPPNLASVASSSPVLTRAVRLHIVPRMVTYKFLASLPARTSF 300
           VL+ IL D + L S+T+FAPP L  VASSSP+L + VRLHI+P+  TY  LA+LP +   
Sbjct: 241 VLDRILADHKNLSSVTIFAPPELEFVASSSPMLEKIVRLHILPQRATYIELAALPDKQRL 300

Query: 301 KTLVSGQDLQVLGGVRVPRGTVVVNGVEIVSPEIFRSRNCVIHGISRSLETASLPHLSR 360
           +TL+  +DL++  GV V +G + +NGVEI +PEIF S+  ++HGI+++ + A  P+ SR
Sbjct: 301 RTLLPDEDLKITKGVGVTQG-LAINGVEIAAPEIFSSKEFIVHGITQAFKIAKFPNASR 353

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
FLA21_ARATH1.2e-7446.15Fasciclin-like arabinogalactan protein 21 OS=Arabidopsis thaliana GN=FLA21 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A0A0K1P3_CUCSA6.2e-17485.71Uncharacterized protein OS=Cucumis sativus GN=Csa_7G045490 PE=4 SV=1[more]
B9I6K5_POPTR5.2e-10455.15Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s14840g PE=4 SV=1[more]
M5X5K8_PRUPE2.2e-10256.41Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007875mg PE=4 SV=1[more]
A0A067DY61_CITSI4.5e-10059.83Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g042381mg PE=4 SV=1[more]
W9R6F2_9ROSA2.2e-9957.23Uncharacterized protein OS=Morus notabilis GN=L484_008733 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G06920.16.9e-7646.15 FASCICLIN-like arabinogalactan protein 21 precursor[more]
AT1G30800.19.9e-0634.17 Fasciclin-like arabinogalactan family protein[more]
Match NameE-valueIdentityDescription
gi|778723911|ref|XP_004144575.2|8.9e-17485.71PREDICTED: fasciclin-like arabinogalactan protein 21 [Cucumis sativus][more]
gi|659110734|ref|XP_008455382.1|2.8e-17285.71PREDICTED: fasciclin-like arabinogalactan protein 21 [Cucumis melo][more]
gi|1009127869|ref|XP_015880919.1|9.6e-11260.56PREDICTED: fasciclin-like arabinogalactan protein 21 [Ziziphus jujuba][more]
gi|743819904|ref|XP_011020998.1|1.5e-10455.43PREDICTED: fasciclin-like arabinogalactan protein 21 [Populus euphratica][more]
gi|224126887|ref|XP_002319951.1|7.4e-10455.15hypothetical protein POPTR_0013s14840g [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000782FAS1_domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g07710.1Cp4.1LG15g07710.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000782FAS1 domainGENE3DG3DSA:2.30.180.10coord: 61..180
score: 2.0E-7coord: 222..349
score: 7.
IPR000782FAS1 domainPFAMPF02469Fasciclincoord: 244..349
score: 7.
IPR000782FAS1 domainSMARTSM00554fasc_3coord: 84..181
score: 3.9E-9coord: 256..352
score: 1.
IPR000782FAS1 domainPROFILEPS50213FAS1coord: 219..349
score: 9
IPR000782FAS1 domainunknownSSF82153FAS1 domaincoord: 42..180
score: 4.71E-11coord: 223..349
score: 1.06
NoneNo IPR availablePANTHERPTHR33985FAMILY NOT NAMEDcoord: 5..355
score: 6.4E
NoneNo IPR availablePANTHERPTHR33985:SF1FASCICLIN-LIKE ARABINOGALACTAN PROTEIN 21coord: 5..355
score: 6.4E

The following gene(s) are paralogous to this gene:

None