Cp4.1LG19g01890 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG19g01890
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: ARM repeat superfamily protein
Location: Cp4.1LG19: 1553034 .. 1555045 (+)
RNA-Seq Expression: Cp4.1LG19g01890
Synteny: Cp4.1LG19g01890
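
As a rough illustration of how the Location field can be read, the following Python sketch parses the location string and computes the span of the feature. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page; the function names parse_location and span_length are hypothetical helpers, not part of any database tool.

```python
import re


def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG19: 1553034 .. 1555045 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 2012 bases on the plus strand of Cp4.1LG19.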
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGGATCTGTTTCTCGTGTTGATTTCTATTTCTGTCAGGTCTTTTGTGGGCAAAAGTTTTAGTCCAATGCTAAGGAGAGAGCTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGAACGGCAATGAAAGCATTGAAGACTTATGTGAAGGAATTGGATTCCAAGGCTATTCCTGTTTTTCTTGCTCAAGTTTCTGAGAATAAAGAAACTGGTGCATTAACTGGGGAGTGTACTATTTCCCTCTATGAAGTTCTTGCTCGTGTTCATGGCGTTAATATCGTGCCGCAGATCGATCGGATTATGACTTCTATTATCAAAACTTTGGCTTCAAGTGCTGGCTCTTTCCCTCTTCAACAGGCCTGTTCTAAAGTTGTACCAGCCATTGCTAGATATGGGATTGATCCAACCACTCCTGATGATAAGAAGACTTATGTTATTCATTCTCTTTGTAATCCACTTTCTGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTGCTGGGGCCGCCCTCTGCTTGAAGGCTCTGGTCGATTCGGATAATTGGCGGTTCGCTTCCGACGAGATGGTTAATAAGGTGTGCCAGAATGTGGCTGGAGCTCTGGAGGAGAAGTTTACTCAAACCAATTCGCACATGGGGCTTGTTATGACACTAGCTAAGCGGAATCCTCGGATTGTCGAACCGTATGCTAGATTGTTGCTACAGGCCGGGCTGCGAATATTGAAATCTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCCTCCATTCAAATGATTAATTTCTTGATGAAGTGCTTAGATCCTTGGAGTATATTTTCTGAGCTGCAGACTATAACTGAGGAGATGGAGAATTGCCAATCTGATCAAATGGCTTATGTCAAAGGTGCAGCTTTTGAAACCCTGCAAACAGCAAGGAGAATAGCAGCGGATGAAGGGTCTAAAATGGGCAAATCACCAAGCTCGGTTACTGGATCGAACTTCATTGACCGCAGGAGAAGTCCATGGAGGAATGGTGGAAGCCGAACGCCCTCGTCCGAGTCTCCAGAATCCCAAACTCATGATTCATTCTTTGATTACGGGTCGCTTGATGGATCGCCCTTTTCGTCTAAACAAGCTTCTCTGAATTCTGGATTTGACCGTAGGAGTATGAACCGTAAACTTTGGAGATATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCCTGTCGTTGTTCTCGGATATCGCTCGTGGAACCGACGTCTCCGACACATTGTCTCTGCACTCTGAAAGTCATAAATTTGACCATCATGGTGAAGAATATGCAGATGACTTTGCAGGGTTCTTTCAAATGAGTCCTCCTAGACACAGACTATCAAGAAGCACTACAACCAGCCCTGTTGTAAGTTCTGACAGCAAACTGCCTGATTCATGTCAATAATCTTAGGACAAATTCTAGTTTAGTTCTGTGAACATTCATTAGCTGATATCTATCTTTCCTTTCTTATACCTCAGAGGTCTCGTAACTGCATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTCGTCCACTCTCTTCAAGATCTAAACGACGCAAACTCGGACTATGCTAGCAAAAGCTGCAAATGGAGGCAAAGGAGTTTGTCATTAGGCAATCTGGAGTGGAGTCCAAGATCATGTCATAATCAAAATGGGTCCCCAGATGATCAGAAACTTAGCAAAGACGACAGCAGCTCAGACAACGACAACAACAACGACAATGACAATGACAATGACAATGACGAACAATCACCAGGTGGTTCTGAATCAGTCTCTTCAACTGGTGGTGTTCCTGTCCAAGCTATGCCAGTGGTGGTGGCTCAACATAGCAAGATCAAAACTCAATACTCAGGCATTGAGATGGCATATAAGAAGACTGCTTTGAAACTGGTCTGTGGCTTCTCATTTTTGCTCTTCACAATATTCACTTCATTGCTCTGGATCAACGAGCAGGATCAAGGTACCTATCTT
GTACCAACATAA

mRNA sequence

ATGTTGGATCTGTTTCTCGTGTTGATTTCTATTTCTGTCAGGTCTTTTGTGGGCAAAAGTTTTAGTCCAATGCTAAGGAGAGAGCTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGAACGGCAATGAAAGCATTGAAGACTTATGTGAAGGAATTGGATTCCAAGGCTATTCCTGTTTTTCTTGCTCAAGTTTCTGAGAATAAAGAAACTGGTGCATTAACTGGGGAGTGTACTATTTCCCTCTATGAAGTTCTTGCTCGTGTTCATGGCGTTAATATCGTGCCGCAGATCGATCGGATTATGACTTCTATTATCAAAACTTTGGCTTCAAGTGCTGGCTCTTTCCCTCTTCAACAGGCCTGTTCTAAAGTTGTACCAGCCATTGCTAGATATGGGATTGATCCAACCACTCCTGATGATAAGAAGACTTATGTTATTCATTCTCTTTGTAATCCACTTTCTGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTGCTGGGGCCGCCCTCTGCTTGAAGGCTCTGGTCGATTCGGATAATTGGCGGTTCGCTTCCGACGAGATGGTTAATAAGGTGTGCCAGAATGTGGCTGGAGCTCTGGAGGAGAAGTTTACTCAAACCAATTCGCACATGGGGCTTGTTATGACACTAGCTAAGCGGAATCCTCGGATTGTCGAACCGTATGCTAGATTGTTGCTACAGGCCGGGCTGCGAATATTGAAATCTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCCTCCATTCAAATGATTAATTTCTTGATGAAGTGCTTAGATCCTTGGAGTATATTTTCTGAGCTGCAGACTATAACTGAGGAGATGGAGAATTGCCAATCTGATCAAATGGCTTATGTCAAAGGTGCAGCTTTTGAAACCCTGCAAACAGCAAGGAGAATAGCAGCGGATGAAGGGTCTAAAATGGGCAAATCACCAAGCTCGGTTACTGGATCGAACTTCATTGACCGCAGGAGAAGTCCATGGAGGAATGGTGGAAGCCGAACGCCCTCGTCCGAGTCTCCAGAATCCCAAACTCATGATTCATTCTTTGATTACGGGTCGCTTGATGGATCGCCCTTTTCGTCTAAACAAGCTTCTCTGAATTCTGGATTTGACCGTAGGAGTATGAACCGTAAACTTTGGAGATATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCCTGTCGTTGTTCTCGGATATCGCTCGTGGAACCGACGTCTCCGACACATTGTCTCTGCACTCTGAAAGTCATAAATTTGACCATCATGGTGAAGAATATGCAGATGACTTTGCAGGGTTCTTTCAAATGAGTCCTCCTAGACACAGACTATCAAGAAGCACTACAACCAGCCCTGTTAGGTCTCGTAACTGCATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTCGTCCACTCTCTTCAAGATCTAAACGACGCAAACTCGGACTATGCTAGCAAAAGCTGCAAATGGAGGCAAAGGAGTTTGTCATTAGGCAATCTGGAGTGGAGTCCAAGATCATGTCATAATCAAAATGGGTCCCCAGATGATCAGAAACTTAGCAAAGACGACAGCAGCTCAGACAACGACAACAACAACGACAATGACAATGACAATGACAATGACGAACAATCACCAGGTGGTTCTGAATCAGTCTCTTCAACTGGTGGTGTTCCTGTCCAAGCTATGCCAGTGGTGGTGGCTCAACATAGCAAGATCAAAACTCAATACTCAGGCATTGAGATGGCATATAAGAAGACTGCTTTGAAACTGGTCTGTGGCTTCTCATTTTTGCTCTTCACAATATTCACTTCATTGCTCTGGATCAACGAGCAGGATCAAGGTACCTATCTTGTACCAACATAA

Coding sequence (CDS)

ATGTTGGATCTGTTTCTCGTGTTGATTTCTATTTCTGTCAGGTCTTTTGTGGGCAAAAGTTTTAGTCCAATGCTAAGGAGAGAGCTTGCTAATCTTGATAAAGATGCTGATAGTCGCAGAACGGCAATGAAAGCATTGAAGACTTATGTGAAGGAATTGGATTCCAAGGCTATTCCTGTTTTTCTTGCTCAAGTTTCTGAGAATAAAGAAACTGGTGCATTAACTGGGGAGTGTACTATTTCCCTCTATGAAGTTCTTGCTCGTGTTCATGGCGTTAATATCGTGCCGCAGATCGATCGGATTATGACTTCTATTATCAAAACTTTGGCTTCAAGTGCTGGCTCTTTCCCTCTTCAACAGGCCTGTTCTAAAGTTGTACCAGCCATTGCTAGATATGGGATTGATCCAACCACTCCTGATGATAAGAAGACTTATGTTATTCATTCTCTTTGTAATCCACTTTCTGAATCTTTGTTGGGTTCTCAAGAGAGCCTCACTGCTGGGGCCGCCCTCTGCTTGAAGGCTCTGGTCGATTCGGATAATTGGCGGTTCGCTTCCGACGAGATGGTTAATAAGGTGTGCCAGAATGTGGCTGGAGCTCTGGAGGAGAAGTTTACTCAAACCAATTCGCACATGGGGCTTGTTATGACACTAGCTAAGCGGAATCCTCGGATTGTCGAACCGTATGCTAGATTGTTGCTACAGGCCGGGCTGCGAATATTGAAATCTGGGATGGTGGAGAAGAATTCTCAGAAAAGATTGTCCTCCATTCAAATGATTAATTTCTTGATGAAGTGCTTAGATCCTTGGAGTATATTTTCTGAGCTGCAGACTATAACTGAGGAGATGGAGAATTGCCAATCTGATCAAATGGCTTATGTCAAAGGTGCAGCTTTTGAAACCCTGCAAACAGCAAGGAGAATAGCAGCGGATGAAGGGTCTAAAATGGGCAAATCACCAAGCTCGGTTACTGGATCGAACTTCATTGACCGCAGGAGAAGTCCATGGAGGAATGGTGGAAGCCGAACGCCCTCGTCCGAGTCTCCAGAATCCCAAACTCATGATTCATTCTTTGATTACGGGTCGCTTGATGGATCGCCCTTTTCGTCTAAACAAGCTTCTCTGAATTCTGGATTTGACCGTAGGAGTATGAACCGTAAACTTTGGAGATATGAGAATGGTGGGGTTGATATATCCCTCAAGGATGGCCTGTCGTTGTTCTCGGATATCGCTCGTGGAACCGACGTCTCCGACACATTGTCTCTGCACTCTGAAAGTCATAAATTTGACCATCATGGTGAAGAATATGCAGATGACTTTGCAGGGTTCTTTCAAATGAGTCCTCCTAGACACAGACTATCAAGAAGCACTACAACCAGCCCTGTTAGGTCTCGTAACTGCATAAACGTCGAAGATATGATCTTCAAAACTCCTCGGAAGCTCGTCCACTCTCTTCAAGATCTAAACGACGCAAACTCGGACTATGCTAGCAAAAGCTGCAAATGGAGGCAAAGGAGTTTGTCATTAGGCAATCTGGAGTGGAGTCCAAGATCATGTCATAATCAAAATGGGTCCCCAGATGATCAGAAACTTAGCAAAGACGACAGCAGCTCAGACAACGACAACAACAACGACAATGACAATGACAATGACAATGACGAACAATCACCAGGTGGTTCTGAATCAGTCTCTTCAACTGGTGGTGTTCCTGTCCAAGCTATGCCAGTGGTGGTGGCTCAACATAGCAAGATCAAAACTCAATACTCAGGCATTGAGATGGCATATAAGAAGACTGCTTTGAAACTGGTCTGTGGCTTCTCATTTTTGCTCTTCACAATATTCACTTCATTGCTCTGGATCAACGAGCAGGATCAAGGTACCTATCTTGTACCAACATAA

Protein sequence

MLDLFLVLISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEGSKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTIFTSLLWINEQDQGTYLVPT
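
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below is a minimal illustration that assumes the standard nuclear genetic code; the function name translate is hypothetical, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption: nuclear genes
# of this organism are taken to use the standard table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
print(translate("ATGTTGGATCTGTTTCTC"))
print(translate("GTACCAACATAA"))
```

The two fragments translate to the start (MLDLFL) and the end (VPT, followed by a TAA stop codon) of the protein sequence shown above.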
Homology
BLAST of Cp4.1LG19g01890 vs. ExPASy Swiss-Prot
Match: Q5XVI1 (Protein SINE1 OS=Arabidopsis thaliana OX=3702 GN=SINE1 PE=1 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 4.0e-148
Identity = 341/625 (54.56%), Positives = 413/625 (66.08%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G + +P+LR+ELANLDKD +SR++AMKALK+YVK+LDSKAIP FLAQV E KET +L+G
Sbjct: 1   MGLNLNPILRQELANLDKDTESRKSAMKALKSYVKDLDSKAIPGFLAQVFETKETNSLSG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
           E TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFPLQQACSKV+PAIARYGIDP
Sbjct: 61  EYTISLYEILARVHGPNIVPQIDTIMSTIVKTLASSAGSFPLQQACSKVIPAIARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKVCQN 196
           TT +DKK  +IHSLC PL++SLL SQESLT+GAALCLKALVDSDNWRFASDEMVN+VCQN
Sbjct: 121 TTTEDKKRVIIHSLCKPLTDSLLASQESLTSGAALCLKALVDSDNWRFASDEMVNRVCQN 180

Query: 197 VAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRLSS 256
           V  AL+    QT+  MGLVM+LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLS+
Sbjct: 181 VVVALDSNSNQTHLQMGLVMSLAKHNPLIVEAYARLLIHTGLRILGFGVSEGNSQKRLSA 240

Query: 257 IQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEGSKM 316
           +QM+NFLMKCLDP SI+SE++ I +EME CQSDQMAYV+GAA+E + T++RIAA+  SKM
Sbjct: 241 VQMLNFLMKCLDPRSIYSEVELIIKEMERCQSDQMAYVRGAAYEAMMTSKRIAAELESKM 300

Query: 317 GKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTHDSFFDYGS-LDGSPFSSKQAS 376
            K   SVTGSNF        RN  S  P  S SPESQT  SF  Y S ++ SP S    S
Sbjct: 301 EKGCRSVTGSNF------SRRNCSSIVPDYSLSPESQTLGSFSGYDSPVESSPIS--HTS 360

Query: 377 LNSGFDRRSMNRKLWRY-ENGG-VDISLKDGLSLFSDIARG-TDVSDTLSLHSESHKFDH 436
            NS FDRRS+NRKLWR  ENGG VDISLKDG  LFS + +G T VSD       S    +
Sbjct: 361 CNSEFDRRSVNRKLWRRDENGGVVDISLKDG--LFSRVTKGSTTVSD-------SPLVPY 420

Query: 437 HGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRN-CINVEDM-IFKTPRKLVHSLQDLN 496
              E  D+F GF   S       R+TT SP R R+  IN ED  IF TPRKL+ SLQ  +
Sbjct: 421 DTCENGDEFEGFLMES------LRNTTPSPQRQRSRRINAEDFNIFSTPRKLISSLQYPD 480

Query: 497 DANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDND 556
           D + D++       Q  +  G  E          GS  + KL K                
Sbjct: 481 DVDLDHSD-----IQSPILRGERE-------KTIGSRKNPKLRK---------------- 540

Query: 557 NDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFL 616
                Q P   E++SST         + V++ +      +G +   K +  KLV   SF+
Sbjct: 541 -----QFPTMVETMSST---------ITVSEDTAQTQMITGKKKKKKMSYAKLVIAISFV 560

Query: 617 LFTIF-TSLLWINEQDQ-GTYLVPT 633
           +  +F T +L +N+ D  G Y VPT
Sbjct: 601 VVALFATVILMVNQDDDVGYYTVPT 560
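
The percentages in an HSP summary line are simply the match counts divided by the alignment length. The Python sketch below recomputes them from the summary line of the SINE1 hit above as a consistency check. The regular expression and the function name check_hsp_stats are illustrative assumptions, not part of BLAST itself, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches e.g. "Identity = 341/625 (54.56%), Positives = 413/625 (66.08%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positive percentages and compare to the reported ones."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    consistent = (
        abs(identity - float(ident_pct)) < 0.005
        and abs(positives - float(pos_pct)) < 0.005
    )
    return {"identity": identity, "positives": positives, "consistent": consistent}


result = check_hsp_stats(
    "Identity = 341/625 (54.56%), Positives = 413/625 (66.08%), Query Frame = 0"
)
print(result)
```

The same check can be applied to the summary line of any other hit on this page.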

BLAST of Cp4.1LG19g01890 vs. ExPASy Swiss-Prot
Match: Q9SQR5 (Protein SINE2 OS=Arabidopsis thaliana OX=3702 GN=SINE2 PE=1 SV=1)

HSP 1 Score: 324.3 bits (830), Expect = 3.0e-87
Identity = 175/297 (58.92%), Positives = 222/297 (74.75%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G++     R+ELANLDKD DS +TAM  L++ VK+LD+K + VF+AQ+S+ KE G  +G
Sbjct: 1   MGRNLGSAFRQELANLDKDPDSHKTAMSNLRSIVKDLDAKVVHVFVAQLSDVKEIGLESG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
             T+SL+E LAR HGV I P ID IM +II+TL+SS GS  +QQACS+ V A+ARYGIDP
Sbjct: 61  GYTVSLFEDLARAHGVKIAPHIDIIMPAIIRTLSSSEGSLRVQQACSRAVAAMARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGS--QESLTAGAALCLKALVDSDNWRFASDEMVNKVC 196
           TTP+DKKT VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VC
Sbjct: 121 TTPEDKKTNVIHSLCKPLSDSLIDSQHQQHLALGSALCLKSLVDCDNWRSASSEMVNNVC 180

Query: 197 QNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRL 256
           Q++A ALE   ++  SHM LVM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL
Sbjct: 181 QSLAVALEATSSEAKSHMALVMALSKHNPFTVEAYARLFVKSGLRILDLGVVEGDSQKRL 240

Query: 257 SSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAAD 312
            +IQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +
Sbjct: 241 LAIQMLNFLMKNLNPKSISSELELIYQEMEKYQKDQ-HYVKMAAHETMRQAERLICE 296

BLAST of Cp4.1LG19g01890 vs. NCBI nr
Match: XP_023517873.1 (protein SINE1-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023522699.1 protein SINE1-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1223 bits (3164), Expect = 0.0
Identity = 632/632 (100.00%), Positives = 632/632 (100.00%), Query Frame = 0

Query: 1   MLDLFLVLISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPV 60
           MLDLFLVLISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPV
Sbjct: 1   MLDLFLVLISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPV 60

Query: 61  FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQ 120
           FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQ
Sbjct: 61  FLAQVSENKETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQ 120

Query: 121 ACSKVVPAIARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSD 180
           ACSKVVPAIARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSD
Sbjct: 121 ACSKVVPAIARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSD 180

Query: 181 NWRFASDEMVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRI 240
           NWRFASDEMVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRI
Sbjct: 181 NWRFASDEMVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRI 240

Query: 241 LKSGMVEKNSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFE 300
           LKSGMVEKNSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFE
Sbjct: 241 LKSGMVEKNSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFE 300

Query: 301 TLQTARRIAADEGSKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDY 360
           TLQTARRIAADEGSKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDY
Sbjct: 301 TLQTARRIAADEGSKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDY 360

Query: 361 GSLDGSPFSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTL 420
           GSLDGSPFSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTL
Sbjct: 361 GSLDGSPFSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTL 420

Query: 421 SLHSESHKFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRK 480
           SLHSESHKFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRK
Sbjct: 421 SLHSESHKFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRK 480

Query: 481 LVHSLQDLNDANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDN 540
           LVHSLQDLNDANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDN
Sbjct: 481 LVHSLQDLNDANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDN 540

Query: 541 DNNNDNDNDNDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTAL 600
           DNNNDNDNDNDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTAL
Sbjct: 541 DNNNDNDNDNDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTAL 600

Query: 601 KLVCGFSFLLFTIFTSLLWINEQDQGTYLVPT 632
           KLVCGFSFLLFTIFTSLLWINEQDQGTYLVPT
Sbjct: 601 KLVCGFSFLLFTIFTSLLWINEQDQGTYLVPT 632

BLAST of Cp4.1LG19g01890 vs. NCBI nr
Match: XP_023517874.1 (protein SINE1-like isoform X2 [Cucurbita pepo subsp. pepo] >XP_023522700.1 protein SINE1-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1204 bits (3114), Expect = 0.0
Identity = 619/619 (100.00%), Positives = 619/619 (100.00%), Query Frame = 0

Query: 14  RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 73
           RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA
Sbjct: 9   RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 68

Query: 74  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 133
           LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG
Sbjct: 69  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 128

Query: 134 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 193
           IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV
Sbjct: 129 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 188

Query: 194 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 253
           CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR
Sbjct: 189 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 248

Query: 254 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 313
           LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG
Sbjct: 249 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 308

Query: 314 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 373
           SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA
Sbjct: 309 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 368

Query: 374 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 433
           SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG
Sbjct: 369 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 428

Query: 434 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 493
           EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS
Sbjct: 429 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 488

Query: 494 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 553
           DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND
Sbjct: 489 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 548

Query: 554 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 613
           EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI
Sbjct: 549 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 608

Query: 614 FTSLLWINEQDQGTYLVPT 632
           FTSLLWINEQDQGTYLVPT
Sbjct: 609 FTSLLWINEQDQGTYLVPT 627

BLAST of Cp4.1LG19g01890 vs. NCBI nr
Match: XP_022925214.1 (protein SINE1-like [Cucurbita moschata])

HSP 1 Score: 1176 bits (3042), Expect = 0.0
Identity = 607/619 (98.06%), Positives = 609/619 (98.38%), Query Frame = 0

Query: 14  RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 73
           RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA
Sbjct: 9   RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 68

Query: 74  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 133
           LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG
Sbjct: 69  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 128

Query: 134 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 193
           IDPTTPDDKKTYVIHSLCNPLSESLLG QESLTAGAALCLKALVDSDNWRFASDEMVNKV
Sbjct: 129 IDPTTPDDKKTYVIHSLCNPLSESLLGYQESLTAGAALCLKALVDSDNWRFASDEMVNKV 188

Query: 194 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 253
           CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR
Sbjct: 189 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 248

Query: 254 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 313
           LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG
Sbjct: 249 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 308

Query: 314 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 373
           SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA
Sbjct: 309 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 368

Query: 374 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 433
           SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKF HHG
Sbjct: 369 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFGHHG 428

Query: 434 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 493
           EEYADDFAGFFQM PPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLN+ANS
Sbjct: 429 EEYADDFAGFFQMGPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNEANS 488

Query: 494 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 553
           DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDN      DNDND
Sbjct: 489 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDN------DNDND 548

Query: 554 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 613
           EQSP GSESVSSTGGVPVQAMPVVVAQH+KIKTQYSGIEMAYKKTALKLVCGFSFLLFTI
Sbjct: 549 EQSPVGSESVSSTGGVPVQAMPVVVAQHTKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 608

Query: 614 FTSLLWINEQDQGTYLVPT 632
           FTSLLWINEQDQGTYLVPT
Sbjct: 609 FTSLLWINEQDQGTYLVPT 621

BLAST of Cp4.1LG19g01890 vs. NCBI nr
Match: XP_022966522.1 (protein SINE1-like [Cucurbita maxima])

HSP 1 Score: 1176 bits (3042), Expect = 0.0
Identity = 608/619 (98.22%), Positives = 614/619 (99.19%), Query Frame = 0

Query: 14  RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 73
           RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA
Sbjct: 9   RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 68

Query: 74  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 133
           LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG
Sbjct: 69  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 128

Query: 134 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 193
           IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV
Sbjct: 129 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 188

Query: 194 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 253
           CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGL+ILKSGMVEKNSQKR
Sbjct: 189 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLQILKSGMVEKNSQKR 248

Query: 254 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 313
           LSSIQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTARRIAADEG
Sbjct: 249 LSSIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 308

Query: 314 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 373
           SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA
Sbjct: 309 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 368

Query: 374 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 433
           SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDT+SLHSESHKFDHHG
Sbjct: 369 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTMSLHSESHKFDHHG 428

Query: 434 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 493
           EEYAD+FAGFFQMSPPRHRLSRSTTTSPVRSR+CINVEDMIFKTPRKLVHSLQDLNDANS
Sbjct: 429 EEYADEFAGFFQMSPPRHRLSRSTTTSPVRSRSCINVEDMIFKTPRKLVHSLQDLNDANS 488

Query: 494 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 553
           DYASKSCK RQRSLSLGNLEWSPRSCHNQNGSPD QKLSKDDSSSDNDNNN+N  DNDND
Sbjct: 489 DYASKSCKLRQRSLSLGNLEWSPRSCHNQNGSPDYQKLSKDDSSSDNDNNNNN--DNDND 548

Query: 554 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 613
           E+SPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI
Sbjct: 549 EKSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 608

Query: 614 FTSLLWINEQDQGTYLVPT 632
           FTSLLWINEQDQGTYLVPT
Sbjct: 609 FTSLLWINEQDQGTYLVPT 625

BLAST of Cp4.1LG19g01890 vs. NCBI nr
Match: KAG6595508.1 (Protein SINE1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1173 bits (3035), Expect = 0.0
Identity = 605/619 (97.74%), Positives = 610/619 (98.55%), Query Frame = 0

Query: 14  RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 73
           RSFVGKSFSPMLRRELAN DKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA
Sbjct: 71  RSFVGKSFSPMLRRELANFDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 130

Query: 74  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 133
           LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG
Sbjct: 131 LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 190

Query: 134 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 193
           IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV
Sbjct: 191 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 250

Query: 194 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 253
           CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR
Sbjct: 251 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 310

Query: 254 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 313
           LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG
Sbjct: 311 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 370

Query: 314 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 373
           SKMGKSPSSVTGSNFIDRRRSPWRNGGS++PSSESPESQTHDSFFDYGSLDGSPFSSKQA
Sbjct: 371 SKMGKSPSSVTGSNFIDRRRSPWRNGGSQSPSSESPESQTHDSFFDYGSLDGSPFSSKQA 430

Query: 374 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 433
           SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKF HHG
Sbjct: 431 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFGHHG 490

Query: 434 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 493
           EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSR+CINVEDMIFKTPRKLVHSLQDLN+ANS
Sbjct: 491 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRSCINVEDMIFKTPRKLVHSLQDLNEANS 550

Query: 494 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 553
            YASKSCKWRQRSLSLGNLEWSPRSCHNQNGSP+DQKLSKDDSSSDNDNNN      DND
Sbjct: 551 GYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPNDQKLSKDDSSSDNDNNN------DND 610

Query: 554 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 613
           EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI
Sbjct: 611 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 670

Query: 614 FTSLLWINEQDQGTYLVPT 632
           FTSLLWINEQDQGTYLVPT
Sbjct: 671 FTSLLWINEQDQGTYLVPT 683

BLAST of Cp4.1LG19g01890 vs. ExPASy TrEMBL
Match: A0A6J1HN73 (protein SINE1-like OS=Cucurbita maxima OX=3661 GN=LOC111466174 PE=4 SV=1)

HSP 1 Score: 1176 bits (3042), Expect = 0.0
Identity = 608/619 (98.22%), Positives = 614/619 (99.19%), Query Frame = 0

Query: 14  RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 73
           RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA
Sbjct: 9   RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 68

Query: 74  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 133
           LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG
Sbjct: 69  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 128

Query: 134 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 193
           IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV
Sbjct: 129 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 188

Query: 194 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 253
           CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGL+ILKSGMVEKNSQKR
Sbjct: 189 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLQILKSGMVEKNSQKR 248

Query: 254 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 313
           LSSIQMINFLMKCLDPWSIFSELQTI EEMENCQSDQMAYVKGAAFETLQTARRIAADEG
Sbjct: 249 LSSIQMINFLMKCLDPWSIFSELQTIIEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 308

Query: 314 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 373
           SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA
Sbjct: 309 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 368

Query: 374 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 433
           SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDT+SLHSESHKFDHHG
Sbjct: 369 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTMSLHSESHKFDHHG 428

Query: 434 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 493
           EEYAD+FAGFFQMSPPRHRLSRSTTTSPVRSR+CINVEDMIFKTPRKLVHSLQDLNDANS
Sbjct: 429 EEYADEFAGFFQMSPPRHRLSRSTTTSPVRSRSCINVEDMIFKTPRKLVHSLQDLNDANS 488

Query: 494 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 553
           DYASKSCK RQRSLSLGNLEWSPRSCHNQNGSPD QKLSKDDSSSDNDNNN+N  DNDND
Sbjct: 489 DYASKSCKLRQRSLSLGNLEWSPRSCHNQNGSPDYQKLSKDDSSSDNDNNNNN--DNDND 548

Query: 554 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 613
           E+SPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI
Sbjct: 549 EKSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 608

Query: 614 FTSLLWINEQDQGTYLVPT 632
           FTSLLWINEQDQGTYLVPT
Sbjct: 609 FTSLLWINEQDQGTYLVPT 625

BLAST of Cp4.1LG19g01890 vs. ExPASy TrEMBL
Match: A0A6J1EBI1 (protein SINE1-like OS=Cucurbita moschata OX=3662 GN=LOC111432522 PE=4 SV=1)

HSP 1 Score: 1176 bits (3042), Expect = 0.0
Identity = 607/619 (98.06%), Positives = 609/619 (98.38%), Query Frame = 0

Query: 14  RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 73
           RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA
Sbjct: 9   RSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGA 68

Query: 74  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 133
           LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG
Sbjct: 69  LTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYG 128

Query: 134 IDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKV 193
           IDPTTPDDKKTYVIHSLCNPLSESLLG QESLTAGAALCLKALVDSDNWRFASDEMVNKV
Sbjct: 129 IDPTTPDDKKTYVIHSLCNPLSESLLGYQESLTAGAALCLKALVDSDNWRFASDEMVNKV 188

Query: 194 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 253
           CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR
Sbjct: 189 CQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKR 248

Query: 254 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 313
           LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG
Sbjct: 249 LSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEG 308

Query: 314 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 373
           SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA
Sbjct: 309 SKMGKSPSSVTGSNFIDRRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSPFSSKQA 368

Query: 374 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFDHHG 433
           SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKF HHG
Sbjct: 369 SLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESHKFGHHG 428

Query: 434 EEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNDANS 493
           EEYADDFAGFFQM PPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLN+ANS
Sbjct: 429 EEYADDFAGFFQMGPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQDLNEANS 488

Query: 494 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDNDNDND 553
           DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDN      DNDND
Sbjct: 489 DYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDN------DNDND 548

Query: 554 EQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 613
           EQSP GSESVSSTGGVPVQAMPVVVAQH+KIKTQYSGIEMAYKKTALKLVCGFSFLLFTI
Sbjct: 549 EQSPVGSESVSSTGGVPVQAMPVVVAQHTKIKTQYSGIEMAYKKTALKLVCGFSFLLFTI 608

Query: 614 FTSLLWINEQDQGTYLVPT 632
           FTSLLWINEQDQGTYLVPT
Sbjct: 609 FTSLLWINEQDQGTYLVPT 621

BLAST of Cp4.1LG19g01890 vs. ExPASy TrEMBL
Match: A0A5A7UWA1 (ARM repeat superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold39G00310 PE=4 SV=1)

HSP 1 Score: 998 bits (2580), Expect = 0.0
Identity = 530/630 (84.13%), Positives = 566/630 (89.84%), Query Frame = 0

Query: 9   ISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSEN 68
           IS + RSF+ K+ SPMLRRE ANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSEN
Sbjct: 4   ISETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSEN 63

Query: 69  KETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA 128
           KETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA
Sbjct: 64  KETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA 123

Query: 129 IARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE 188
           IARYGIDPTTPDDKK +VI+SLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE
Sbjct: 124 IARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE 183

Query: 189 MVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEK 248
           MVNKVCQNVAGALEEK TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILK G+VEK
Sbjct: 184 MVNKVCQNVAGALEEKSTQTNSHMGLVMSLAKRNPRIVEPYARLLLQAGLRILKCGVVEK 243

Query: 249 NSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRI 308
           NSQKRLS+IQMINFLM+CLDPWSIFSELQ+I EEMENCQSDQM YVKGAAFETLQTA++I
Sbjct: 244 NSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKI 303

Query: 309 AADEGSKMGKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSP 368
            AD+GSKM KSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSP
Sbjct: 304 LADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSP 363

Query: 369 FSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESH 428
           FSS+QAS NS FDRRS+NRKLW YENGGVDISLKDGLSLFS++ RGTDVSDT+SLHS SH
Sbjct: 364 FSSRQASRNSAFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSLHSGSH 423

Query: 429 KFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQD 488
           KF H+GEEYADDF+GFFQMSPPR RLSRSTTTSP+RSR+ I VEDMIFKTPRKLVHSLQD
Sbjct: 424 KFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRSYIKVEDMIFKTPRKLVHSLQD 483

Query: 489 LNDANSDYASKSCKWRQRSLSLGNLEWSP-RSCHNQNGSPDDQKLSKDDSSSDNDNNNDN 548
           LN+ NSDYAS S + R RSLS GNLEWSP R+  N+NGS D++KLSK+D           
Sbjct: 484 LNETNSDYASGSSRRRHRSLSSGNLEWSPPRAFLNRNGSADERKLSKEDEDG-------- 543

Query: 549 DNDNDNDEQSPGGSESVSSTGGVP----VQAMPVVVAQHSKIKTQYSGIEMAYKKTALKL 608
             D DN EQS G SES+SST GVP    VQAMPV V   SKIK QY G+EMAYKKTALKL
Sbjct: 544 -LDIDNGEQSQGSSESISSTDGVPTHVDVQAMPVAVTCQSKIKPQYYGMEMAYKKTALKL 603

Query: 609 VCGFSFLLFTIFTSLLWINEQDQGTYLVPT 632
           VCGFSFLLFTIFTSLLWI++ DQG+YLVPT
Sbjct: 604 VCGFSFLLFTIFTSLLWIDDHDQGSYLVPT 624

BLAST of Cp4.1LG19g01890 vs. ExPASy TrEMBL
Match: A0A1S3B5D3 (uncharacterized protein LOC103485976 OS=Cucumis melo OX=3656 GN=LOC103485976 PE=4 SV=1)

HSP 1 Score: 998 bits (2580), Expect = 0.0
Identity = 530/630 (84.13%), Positives = 566/630 (89.84%), Query Frame = 0

Query: 9   ISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSEN 68
           IS + RSF+ K+ SPMLRRE ANLDKDADSRR+AMKAL+TYVKELDSKAIPVFLAQVSEN
Sbjct: 4   ISETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALRTYVKELDSKAIPVFLAQVSEN 63

Query: 69  KETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA 128
           KETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA
Sbjct: 64  KETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA 123

Query: 129 IARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE 188
           IARYGIDPTTPDDKK +VI+SLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE
Sbjct: 124 IARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE 183

Query: 189 MVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEK 248
           MVNKVCQNVAGALEEK TQTNSHMGLVM+LAKRNPRIVEPYARLLLQAGLRILK G+VEK
Sbjct: 184 MVNKVCQNVAGALEEKSTQTNSHMGLVMSLAKRNPRIVEPYARLLLQAGLRILKCGVVEK 243

Query: 249 NSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRI 308
           NSQKRLS+IQMINFLM+CLDPWSIFSELQ+I EEMENCQSDQM YVKGAAFETLQTA++I
Sbjct: 244 NSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKI 303

Query: 309 AADEGSKMGKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSP 368
            AD+GSKM KSPSSVTGSNFID RRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSP
Sbjct: 304 LADKGSKMDKSPSSVTGSNFIDHRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSP 363

Query: 369 FSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESH 428
           FSS+QAS NS FDRRS+NRKLW YENGGVDISLKDGLSLFS++ RGTDVSDT+SLHS SH
Sbjct: 364 FSSRQASRNSAFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSLHSGSH 423

Query: 429 KFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQD 488
           KF H+GEEYADDF+GFFQMSPPR RLSRSTTTSP+RSR+ I VEDMIFKTPRKLVHSLQD
Sbjct: 424 KFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRSYIKVEDMIFKTPRKLVHSLQD 483

Query: 489 LNDANSDYASKSCKWRQRSLSLGNLEWSP-RSCHNQNGSPDDQKLSKDDSSSDNDNNNDN 548
           LN+ NSDYAS S + R RSLS GNLEWSP R+  N+NGS D++KLSK+D           
Sbjct: 484 LNETNSDYASGSSRRRHRSLSSGNLEWSPPRAFLNRNGSADERKLSKEDEDG-------- 543

Query: 549 DNDNDNDEQSPGGSESVSSTGGVP----VQAMPVVVAQHSKIKTQYSGIEMAYKKTALKL 608
             D DN EQS G SES+SST GVP    VQAMPV V   SKIK QY G+EMAYKKTALKL
Sbjct: 544 -LDIDNGEQSQGSSESISSTDGVPTHVDVQAMPVAVTCQSKIKPQYYGMEMAYKKTALKL 603

Query: 609 VCGFSFLLFTIFTSLLWINEQDQGTYLVPT 632
           VCGFSFLLFTIFTSLLWI++ DQG+YLVPT
Sbjct: 604 VCGFSFLLFTIFTSLLWIDDHDQGSYLVPT 624

BLAST of Cp4.1LG19g01890 vs. ExPASy TrEMBL
Match: A0A0A0KYP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G192160 PE=4 SV=1)

HSP 1 Score: 989 bits (2557), Expect = 0.0
Identity = 525/630 (83.33%), Positives = 562/630 (89.21%), Query Frame = 0

Query: 9   ISISVRSFVGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSEN 68
           IS + RSF+ K+ SPMLRRE ANLDKDADSRR+AMKALKTYVKELDSKAIPVFLAQVSEN
Sbjct: 4   ISETQRSFMSKNLSPMLRREFANLDKDADSRRSAMKALKTYVKELDSKAIPVFLAQVSEN 63

Query: 69  KETGALTGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA 128
           KETGAL GECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA
Sbjct: 64  KETGALNGECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPA 123

Query: 129 IARYGIDPTTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE 188
           IARYGIDPTTPDDKK +VI+SLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE
Sbjct: 124 IARYGIDPTTPDDKKKHVIYSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDE 183

Query: 189 MVNKVCQNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEK 248
           MVNKVCQNVAGALEEK TQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILK G+VEK
Sbjct: 184 MVNKVCQNVAGALEEKSTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKCGVVEK 243

Query: 249 NSQKRLSSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRI 308
           NSQKRLS+IQMINFLM+CLDPWSIFSELQ+I EEMENCQSDQM YVKGAAFETLQTA++I
Sbjct: 244 NSQKRLSAIQMINFLMRCLDPWSIFSELQSIIEEMENCQSDQMPYVKGAAFETLQTAKKI 303

Query: 309 AADEGSKMGKSPSSVTGSNFID-RRRSPWRNGGSRTPSSESPESQTHDSFFDYGSLDGSP 368
            AD+GSKM KSPSSVTGSNF+D RRRSPWRNGGSRTPSSESPESQT DSFFDYGSL GSP
Sbjct: 304 LADKGSKMDKSPSSVTGSNFLDHRRRSPWRNGGSRTPSSESPESQTLDSFFDYGSLVGSP 363

Query: 369 FSSKQASLNSGFDRRSMNRKLWRYENGGVDISLKDGLSLFSDIARGTDVSDTLSLHSESH 428
           FSS+QAS NSGFDRRS+NRKLW YENGGVDISLKDGLSLFS++ RGTDVSDT+S++S SH
Sbjct: 364 FSSRQASRNSGFDRRSVNRKLWSYENGGVDISLKDGLSLFSEVTRGTDVSDTMSMYSGSH 423

Query: 429 KFDHHGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRNCINVEDMIFKTPRKLVHSLQD 488
           KF H+GEEYADDF+GFFQMSPPR RLSRSTTTSP+RSR+ INVEDMIFKTPRKLVHSLQD
Sbjct: 424 KFGHNGEEYADDFSGFFQMSPPRRRLSRSTTTSPLRSRSYINVEDMIFKTPRKLVHSLQD 483

Query: 489 LNDANSDYASKSCKWRQRSLSLGNLEWSP-RSCHNQNGSPDDQKLSKDDSSSDNDNNNDN 548
           LN+  SDYAS S + R RSLS GNLEWSP R+  NQNG  D+ KLSK+D           
Sbjct: 484 LNEGKSDYASGSSRCRHRSLSSGNLEWSPPRAFLNQNGFADEPKLSKEDEDG-------- 543

Query: 549 DNDNDNDEQSPGGSESVSSTGGVP----VQAMPVVVAQHSKIKTQYSGIEMAYKKTALKL 608
              N N EQS G  ES+SS  G P    VQA+PV VA  SK+K QY G+EMAYKKTALKL
Sbjct: 544 -LGNGNGEQSQGSYESISSADGAPTHVDVQAIPVAVACQSKMKPQYYGMEMAYKKTALKL 603

Query: 609 VCGFSFLLFTIFTSLLWINEQDQGTYLVPT 632
           VCGFSFLLFTIFTSLLWI++ DQG+YLVPT
Sbjct: 604 VCGFSFLLFTIFTSLLWIDDHDQGSYLVPT 624

BLAST of Cp4.1LG19g01890 vs. TAIR 10
Match: AT1G54385.1 (ARM repeat superfamily protein )

HSP 1 Score: 526.6 bits (1355), Expect = 2.8e-149
Identity = 341/625 (54.56%), Positives = 413/625 (66.08%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G + +P+LR+ELANLDKD +SR++AMKALK+YVK+LDSKAIP FLAQV E KET +L+G
Sbjct: 1   MGLNLNPILRQELANLDKDTESRKSAMKALKSYVKDLDSKAIPGFLAQVFETKETNSLSG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
           E TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFPLQQACSKV+PAIARYGIDP
Sbjct: 61  EYTISLYEILARVHGPNIVPQIDTIMSTIVKTLASSAGSFPLQQACSKVIPAIARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKVCQN 196
           TT +DKK  +IHSLC PL++SLL SQESLT+GAALCLKALVDSDNWRFASDEMVN+VCQN
Sbjct: 121 TTTEDKKRVIIHSLCKPLTDSLLASQESLTSGAALCLKALVDSDNWRFASDEMVNRVCQN 180

Query: 197 VAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRLSS 256
           V  AL+    QT+  MGLVM+LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLS+
Sbjct: 181 VVVALDSNSNQTHLQMGLVMSLAKHNPLIVEAYARLLIHTGLRILGFGVSEGNSQKRLSA 240

Query: 257 IQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEGSKM 316
           +QM+NFLMKCLDP SI+SE++ I +EME CQSDQMAYV+GAA+E + T++RIAA+  SKM
Sbjct: 241 VQMLNFLMKCLDPRSIYSEVELIIKEMERCQSDQMAYVRGAAYEAMMTSKRIAAELESKM 300

Query: 317 GKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTHDSFFDYGS-LDGSPFSSKQAS 376
            K   SVTGSNF        RN  S  P  S SPESQT  SF  Y S ++ SP S    S
Sbjct: 301 EKGCRSVTGSNF------SRRNCSSIVPDYSLSPESQTLGSFSGYDSPVESSPIS--HTS 360

Query: 377 LNSGFDRRSMNRKLWRY-ENGG-VDISLKDGLSLFSDIARG-TDVSDTLSLHSESHKFDH 436
            NS FDRRS+NRKLWR  ENGG VDISLKDG  LFS + +G T VSD       S    +
Sbjct: 361 CNSEFDRRSVNRKLWRRDENGGVVDISLKDG--LFSRVTKGSTTVSD-------SPLVPY 420

Query: 437 HGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRN-CINVEDM-IFKTPRKLVHSLQDLN 496
              E  D+F GF   S       R+TT SP R R+  IN ED  IF TPRKL+ SLQ  +
Sbjct: 421 DTCENGDEFEGFLMES------LRNTTPSPQRQRSRRINAEDFNIFSTPRKLISSLQYPD 480

Query: 497 DANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDND 556
           D + D++       Q  +  G  E          GS  + KL K                
Sbjct: 481 DVDLDHSD-----IQSPILRGERE-------KTIGSRKNPKLRK---------------- 540

Query: 557 NDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFL 616
                Q P   E++SST         + V++ +      +G +   K +  KLV   SF+
Sbjct: 541 -----QFPTMVETMSST---------ITVSEDTAQTQMITGKKKKKKMSYAKLVIAISFV 560

Query: 617 LFTIF-TSLLWINEQDQ-GTYLVPT 633
           +  +F T +L +N+ D  G Y VPT
Sbjct: 601 VVALFATVILMVNQDDDVGYYTVPT 560

BLAST of Cp4.1LG19g01890 vs. TAIR 10
Match: AT1G54385.2 (ARM repeat superfamily protein )

HSP 1 Score: 526.6 bits (1355), Expect = 2.8e-149
Identity = 341/625 (54.56%), Positives = 413/625 (66.08%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G + +P+LR+ELANLDKD +SR++AMKALK+YVK+LDSKAIP FLAQV E KET +L+G
Sbjct: 1   MGLNLNPILRQELANLDKDTESRKSAMKALKSYVKDLDSKAIPGFLAQVFETKETNSLSG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
           E TISLYE+LARVHG NIVPQID IM++I+KTLASSAGSFPLQQACSKV+PAIARYGIDP
Sbjct: 61  EYTISLYEILARVHGPNIVPQIDTIMSTIVKTLASSAGSFPLQQACSKVIPAIARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGSQESLTAGAALCLKALVDSDNWRFASDEMVNKVCQN 196
           TT +DKK  +IHSLC PL++SLL SQESLT+GAALCLKALVDSDNWRFASDEMVN+VCQN
Sbjct: 121 TTTEDKKRVIIHSLCKPLTDSLLASQESLTSGAALCLKALVDSDNWRFASDEMVNRVCQN 180

Query: 197 VAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRLSS 256
           V  AL+    QT+  MGLVM+LAK NP IVE YARLL+  GLRIL  G+ E NSQKRLS+
Sbjct: 181 VVVALDSNSNQTHLQMGLVMSLAKHNPLIVEAYARLLIHTGLRILGFGVSEGNSQKRLSA 240

Query: 257 IQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAADEGSKM 316
           +QM+NFLMKCLDP SI+SE++ I +EME CQSDQMAYV+GAA+E + T++RIAA+  SKM
Sbjct: 241 VQMLNFLMKCLDPRSIYSEVELIIKEMERCQSDQMAYVRGAAYEAMMTSKRIAAELESKM 300

Query: 317 GKSPSSVTGSNFIDRRRSPWRNGGSRTPS-SESPESQTHDSFFDYGS-LDGSPFSSKQAS 376
            K   SVTGSNF        RN  S  P  S SPESQT  SF  Y S ++ SP S    S
Sbjct: 301 EKGCRSVTGSNF------SRRNCSSIVPDYSLSPESQTLGSFSGYDSPVESSPIS--HTS 360

Query: 377 LNSGFDRRSMNRKLWRY-ENGG-VDISLKDGLSLFSDIARG-TDVSDTLSLHSESHKFDH 436
            NS FDRRS+NRKLWR  ENGG VDISLKDG  LFS + +G T VSD       S    +
Sbjct: 361 CNSEFDRRSVNRKLWRRDENGGVVDISLKDG--LFSRVTKGSTTVSD-------SPLVPY 420

Query: 437 HGEEYADDFAGFFQMSPPRHRLSRSTTTSPVRSRN-CINVEDM-IFKTPRKLVHSLQDLN 496
              E  D+F GF   S       R+TT SP R R+  IN ED  IF TPRKL+ SLQ  +
Sbjct: 421 DTCENGDEFEGFLMES------LRNTTPSPQRQRSRRINAEDFNIFSTPRKLISSLQYPD 480

Query: 497 DANSDYASKSCKWRQRSLSLGNLEWSPRSCHNQNGSPDDQKLSKDDSSSDNDNNNDNDND 556
           D + D++       Q  +  G  E          GS  + KL K                
Sbjct: 481 DVDLDHSD-----IQSPILRGERE-------KTIGSRKNPKLRK---------------- 540

Query: 557 NDNDEQSPGGSESVSSTGGVPVQAMPVVVAQHSKIKTQYSGIEMAYKKTALKLVCGFSFL 616
                Q P   E++SST         + V++ +      +G +   K +  KLV   SF+
Sbjct: 541 -----QFPTMVETMSST---------ITVSEDTAQTQMITGKKKKKKMSYAKLVIAISFV 560

Query: 617 LFTIF-TSLLWINEQDQ-GTYLVPT 633
           +  +F T +L +N+ D  G Y VPT
Sbjct: 601 VVALFATVILMVNQDDDVGYYTVPT 560

BLAST of Cp4.1LG19g01890 vs. TAIR 10
Match: AT3G03970.1 (ARM repeat superfamily protein )

HSP 1 Score: 324.3 bits (830), Expect = 2.1e-88
Identity = 175/297 (58.92%), Positives = 222/297 (74.75%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G++     R+ELANLDKD DS +TAM  L++ VK+LD+K + VF+AQ+S+ KE G  +G
Sbjct: 1   MGRNLGSAFRQELANLDKDPDSHKTAMSNLRSIVKDLDAKVVHVFVAQLSDVKEIGLESG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
             T+SL+E LAR HGV I P ID IM +II+TL+SS GS  +QQACS+ V A+ARYGIDP
Sbjct: 61  GYTVSLFEDLARAHGVKIAPHIDIIMPAIIRTLSSSEGSLRVQQACSRAVAAMARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGS--QESLTAGAALCLKALVDSDNWRFASDEMVNKVC 196
           TTP+DKKT VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VC
Sbjct: 121 TTPEDKKTNVIHSLCKPLSDSLIDSQHQQHLALGSALCLKSLVDCDNWRSASSEMVNNVC 180

Query: 197 QNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRL 256
           Q++A ALE   ++  SHM LVM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL
Sbjct: 181 QSLAVALEATSSEAKSHMALVMALSKHNPFTVEAYARLFVKSGLRILDLGVVEGDSQKRL 240

Query: 257 SSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAAD 312
            +IQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +
Sbjct: 241 LAIQMLNFLMKNLNPKSISSELELIYQEMEKYQKDQ-HYVKMAAHETMRQAERLICE 296

BLAST of Cp4.1LG19g01890 vs. TAIR 10
Match: AT3G03970.3 (ARM repeat superfamily protein )

HSP 1 Score: 324.3 bits (830), Expect = 2.1e-88
Identity = 175/297 (58.92%), Positives = 222/297 (74.75%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G++     R+ELANLDKD DS +TAM  L++ VK+LD+K + VF+AQ+S+ KE G  +G
Sbjct: 1   MGRNLGSAFRQELANLDKDPDSHKTAMSNLRSIVKDLDAKVVHVFVAQLSDVKEIGLESG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
             T+SL+E LAR HGV I P ID IM +II+TL+SS GS  +QQACS+ V A+ARYGIDP
Sbjct: 61  GYTVSLFEDLARAHGVKIAPHIDIIMPAIIRTLSSSEGSLRVQQACSRAVAAMARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGS--QESLTAGAALCLKALVDSDNWRFASDEMVNKVC 196
           TTP+DKKT VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VC
Sbjct: 121 TTPEDKKTNVIHSLCKPLSDSLIDSQHQQHLALGSALCLKSLVDCDNWRSASSEMVNNVC 180

Query: 197 QNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRL 256
           Q++A ALE   ++  SHM LVM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL
Sbjct: 181 QSLAVALEATSSEAKSHMALVMALSKHNPFTVEAYARLFVKSGLRILDLGVVEGDSQKRL 240

Query: 257 SSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAAD 312
            +IQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +
Sbjct: 241 LAIQMLNFLMKNLNPKSISSELELIYQEMEKYQKDQ-HYVKMAAHETMRQAERLICE 296

BLAST of Cp4.1LG19g01890 vs. TAIR 10
Match: AT3G03970.2 (ARM repeat superfamily protein )

HSP 1 Score: 324.3 bits (830), Expect = 2.1e-88
Identity = 175/297 (58.92%), Positives = 222/297 (74.75%), Query Frame = 0

Query: 17  VGKSFSPMLRRELANLDKDADSRRTAMKALKTYVKELDSKAIPVFLAQVSENKETGALTG 76
           +G++     R+ELANLDKD DS +TAM  L++ VK+LD+K + VF+AQ+S+ KE G  +G
Sbjct: 1   MGRNLGSAFRQELANLDKDPDSHKTAMSNLRSIVKDLDAKVVHVFVAQLSDVKEIGLESG 60

Query: 77  ECTISLYEVLARVHGVNIVPQIDRIMTSIIKTLASSAGSFPLQQACSKVVPAIARYGIDP 136
             T+SL+E LAR HGV I P ID IM +II+TL+SS GS  +QQACS+ V A+ARYGIDP
Sbjct: 61  GYTVSLFEDLARAHGVKIAPHIDIIMPAIIRTLSSSEGSLRVQQACSRAVAAMARYGIDP 120

Query: 137 TTPDDKKTYVIHSLCNPLSESLLGS--QESLTAGAALCLKALVDSDNWRFASDEMVNKVC 196
           TTP+DKKT VIHSLC PLS+SL+ S  Q+ L  G+ALCLK+LVD DNWR AS EMVN VC
Sbjct: 121 TTPEDKKTNVIHSLCKPLSDSLIDSQHQQHLALGSALCLKSLVDCDNWRSASSEMVNNVC 180

Query: 197 QNVAGALEEKFTQTNSHMGLVMTLAKRNPRIVEPYARLLLQAGLRILKSGMVEKNSQKRL 256
           Q++A ALE   ++  SHM LVM L+K NP  VE YARL +++GLRIL  G+VE +SQKRL
Sbjct: 181 QSLAVALEATSSEAKSHMALVMALSKHNPFTVEAYARLFVKSGLRILDLGVVEGDSQKRL 240

Query: 257 SSIQMINFLMKCLDPWSIFSELQTITEEMENCQSDQMAYVKGAAFETLQTARRIAAD 312
            +IQM+NFLMK L+P SI SEL+ I +EME  Q DQ  YVK AA ET++ A R+  +
Sbjct: 241 LAIQMLNFLMKNLNPKSISSELELIYQEMEKYQKDQ-HYVKMAAHETMRQAERLICE 296
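
The percentages in each HSP header follow directly from the counts printed beside them (matches divided by alignment length). As a minimal sketch, assuming the two-line header layout used in the alignments above, the statistics could be extracted and recomputed as follows. The function name `parse_hsp` and the returned dictionary keys are hypothetical and are not part of any BLAST tool.

```python
import re

# Hypothetical helper for the HSP header layout shown on this page.
# Matches e.g. "Score: 998 bits (2580), Expect = 0.0"
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),"
    r"\s*Expect\s*=\s*(?P<evalue>[\d.eE+-]+)"
)
# Matches e.g. "Identity = 530/630 (84.13%), Positives = 566/630 (89.84%)".
# The optional "i" also accepts the variant spelling "Postives".
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
    r".*?Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(header: str) -> dict:
    """Return score, E-value and recomputed percentages for one HSP header."""
    score = SCORE_RE.search(header)
    ident = IDENT_RE.search(header)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP header")
    length = int(ident["length"])
    return {
        "bits": float(score["bits"]),
        "raw_score": int(score["raw"]),
        "evalue": float(score["evalue"]),
        # Percent identity = identical positions / alignment length
        "identity_pct": round(100 * int(ident["ident"]) / length, 2),
        # Percent positives = identical or conservative positions / length
        "positives_pct": round(100 * int(ident["pos"]) / length, 2),
    }


if __name__ == "__main__":
    example = (
        "HSP 1 Score: 526.6 bits (1355), Expect = 2.8e-149\n"
        "Identity = 341/625 (54.56%), Positives = 413/625 (66.08%), Query Frame = 0"
    )
    print(parse_hsp(example))
```

Recomputing the percentages from the counts, rather than reading the printed values, gives a quick consistency check on each header.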

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q5XVI1          4.0e-148  54.56     Protein SINE1 OS=Arabidopsis thaliana OX=3702 GN=SINE1 PE=1 SV=1
Q9SQR5          3.0e-87   58.92     Protein SINE2 OS=Arabidopsis thaliana OX=3702 GN=SINE2 PE=1 SV=1

Match Name      E-value   Identity  Description
XP_023517873.1  0.0       100.00    protein SINE1-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023522699.1 prote...
XP_023517874.1  0.0       100.00    protein SINE1-like isoform X2 [Cucurbita pepo subsp. pepo] >XP_023522700.1 prote...
XP_022925214.1  0.0       98.06     protein SINE1-like [Cucurbita moschata]
XP_022966522.1  0.0       98.22     protein SINE1-like [Cucurbita maxima]
KAG6595508.1    0.0       97.74     Protein SINE1, partial [Cucurbita argyrosperma subsp. sororia]

Match Name      E-value   Identity  Description
A0A6J1HN73      0.0       98.22     protein SINE1-like OS=Cucurbita maxima OX=3661 GN=LOC111466174 PE=4 SV=1
A0A6J1EBI1      0.0       98.06     protein SINE1-like OS=Cucurbita moschata OX=3662 GN=LOC111432522 PE=4 SV=1
A0A5A7UWA1      0.0       84.13     ARM repeat superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A1S3B5D3      0.0       84.13     uncharacterized protein LOC103485976 OS=Cucumis melo OX=3656 GN=LOC103485976 PE=...
A0A0A0KYP2      0.0       83.33     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G192160 PE=4 SV=1

Match Name      E-value   Identity  Description
AT1G54385.1     2.8e-149  54.56     ARM repeat superfamily protein
AT1G54385.2     2.8e-149  54.56     ARM repeat superfamily protein
AT3G03970.1     2.1e-88   58.92     ARM repeat superfamily protein
AT3G03970.3     2.1e-88   58.92     ARM repeat superfamily protein
AT3G03970.2     2.1e-88   58.92     ARM repeat superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description      Source       Source Term    Source Description   Alignment
None       No IPR available     MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 317..331
None       No IPR available     MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 555..569
None       No IPR available     MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 310..356
None       No IPR available     MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 338..356
None       No IPR available     MOBIDB_LITE  mobidb-lite    disorder_prediction  coord: 513..569
None       No IPR available     PANTHER      PTHR12794      GEMIN2               coord: 18..632
None       No IPR available     PANTHER      PTHR12794:SF2  PROTEIN SINE1        coord: 18..632
IPR016024  Armadillo-type fold  SUPERFAMILY  48371          ARM repeat           coord: 19..304

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG19g01890.1  Cp4.1LG19g01890.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0000387      spliceosomal snRNP assembly
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005634      nucleus
cellular_component  GO:0032797      SMN complex