Cp4.1LG10g10430 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g10430
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DUF241 domain protein
Location: Cp4.1LG10 : 4046971 .. 4052384 (-)
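
The length of the gene span follows directly from the Location field. The sketch below assumes the `SEQID : START .. END (STRAND)` layout shown above and assumes the coordinates are inclusive; `parse_location` is a hypothetical helper for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts (layout assumed from this page)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG10 : 4046971 .. 4052384 (-)")
span = end - start + 1  # assumes both endpoints are included
print(seqid, strand, span)
```

Under the inclusive-coordinate assumption the span is 5414 bp on the minus strand.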
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGTAGTAGCTGTGGTGCGAAGCCAGATCCTGTTACTTATAAAATTTTGATAGATAATGTGTGTAATAGCAAGATCTACGCGGTTGGTAAGTTTGCTGTCACAATCGCACTTTTCTTGGCAGCGGACTTTGTGATAACTAGACACTCGCTTCTTAACAGCAAGTGAGTTAAGTCAATCTGTATTCGTGTCACCTACGGCCAGTCAATGTATGCAAACATACCTCTGACTCCAATTTGAGAAGAAGGTTTTAGAAATCGGAGACGGAGATTTAGAAGGAGACATTAAAAGCAAGATTGAGAGGATTTGAAAGCAATGTTATATAACATACAGGCAAAAGGCAACACAATAGCTTAAGCCGTATATAACTTTTTCGGGTATGGAGAATTTGACAACTAGTCGCATCTCTCTGTAGTACATTTTGAAGCGATTCAGGTTTGGTAGGCCGAGACGTGATTTCGCTGAATGGTCATACAAACAGAAAATACAATGTTAGACAATACAACATGGAATAACATAAAGGATTTTGACGTGGTATGGGCATGTAGCAATGGACAACACGGCTTGGACGAGCATGACCATGACACTTGGCAGGGGCAGCGAGGCAATTGGTGTCGGGCGCTTTGCAGGAGTGGTAGAATTGCTGAAGCCTATCGAGTGTTTGATTATGCAGTTGAGAGTAAAAGTATGACTGATGTTGCTGCATATTCAACGTTGGAGAGTACACTGAAGTCTCTGAAGAAGGCGAGGGAGCAAAGCCATGCTATATAACTGGTACAACTTTGGTCATTTCGTATTAATTGTTTGTATCTTTTGAAAATTTCTGGATTTTTTTTGCTTTCCTTAGTCTTTTACCTTACCCCCATTTGAAAATTTTCTGCTTTTACTCAATGTCGTTAAAGGGTGCCTTTTATTTTTCCTGAACGAACATTTATTTTGATGAATGAGCCAAATCTTGCCCTAACCGCCCAAGCTTAAGCCCACCGACCACTAACCGATATTGTCTTTTCTGGGCTTTCCCTTTCGAAATTTCCCGTTTGGGCTTTTAAAAAGTTTCTACAAGTTTATAAGGAAGGATTCATTCTACTCTCCAACTGTTCTCACAGATAGTAGATAGTCAAGGTCCCGCTGCTACTACTTTTGTTTGGATCTAGAATCTTGATGCCGACTCATTTCCTCAAGAGTGGGACAATTGTGAAGAAGAATATTCTTAGATTGGATAAACATTGTAATAGCATAAGCCCAAGCCCACCGGTATTAGAGATGTTTTTCCTCTTTAAACTTTTCCTTTGGACTTCCACTCAAGGTTTTAAAACACCTCTATTAGAGAGAGGTTTTCACACCCTTATAAAAATGTTCCGTTCTCCTGTTCTCCTCTCCAACCGATATGGGTTCTCACAATCCACCCCCTTTCGAGGTCCATCGTCTTCGCTGACACTCGTTTCTCTCTTCATTCAATATGGTATAAGTGTTGCAGCTTTAGCAGCTTAATGCAGCGACATCTTAAGTCTTCTCGAATGTTATGAATTGTGGTTATAGATTTATACTTTTGGAATGGATTAGAATATCAAGTATTCGACAATGTTGATGGCAGCTGATACCAAATTCTATTTCTTTACTCGTTCTGATGATAGCTGCCAAATCTCACGTTTCTAGTTCCTCTGATTTAGGCCTGGATCAACAACCTCAACAAGCATTTGGCTGGCTGGTGAACAAAGTGGTTTAGCCCGGGTTATCAGAGCTTGCTGGGTAAATTTTCATAGCTCAAACAATGGCAGGAGAATATGCTGAACCCGAAAAACTTCGATCAACTCATCTCAAAAGGTATGATTTATGAAAAACTTCGATCAACTCATCTCAAAAGGCATGATTTATGATTTGGGTCGAGTTGCTAACTTATTTAGGTTGAGTTTGAGTTGATTGTGCTGAGTATTAATAACAAAATACTTTCAATTAATTTTGCAAGAGAAACAAACAAAACTCTAGATTCTGGGTGATTT
TTAGGGCAATAAACATGATTTGTATAATAATCTTGAGTGGTCAAAGGGAATTGATTCATGTGGATTCTTTTTTCTAGAATTGGGAGACACCCACTCATTGCCAGGCTTTAATGTTGAGAACCCACACTGCAATGCCAAGCCATATGGTTTGATAAATAGAAATTATTATTATTTATTCTTATTATTATTTTGTACATCTGTACGTTTTTCTCTCCCTCTATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGGGTGACTGATAATAGAATCGTTGATTATTCTTATCCTTGAAGGAAGTGGATTGCATTACATTCATGCACCTCCACATTCATCCGCTACCGTAAAGATTTTTGCTCTGTTTTTCGCTCTGCGCCCTTCACATGTTCGACAAAATGCCTGACTGAACCTCTTCTCATGTCTCCCTCATGGCGTCTCACCCCGACCGGGGCCTCATTGTCTTCTCTCAATTTTTGACCATGTGGTTTGTTTTCTCTCATACCCACTTTCTCTTTTAAGCCATTTCCCTCTGTTATAATGAGAAATTTTACTCCTTATCTCTCAAACTGAAAATTGTGGTGTTCTATAGAGCTAGATCATGCAATAGTTTACTTGATTTTTGTGTTTGAATTCCATTGTAGAGTGACTTTTAAAGTTCTTACAAAGTGTTTGATGAGCATTTGTGGATTTTCATAACCAGCTCTTTAGGTAACTTTCTCTTCCCCCCATTGTGTTAGCCGCATGCCCCGCCCTTTGTCTGCTGCTGCATTACAAACATTACTATAATATTGGAATGGTAGTAGCAATGTTTGCTCATTGTAAGCCTAGATTCATGGCTATAGCTATTAACAAAGCCACCATTTAAAAGGGGGTAGCCACTTTTTATTTCCATGCACATTCTTCTTCATGGGTGGAGACAAAACATGGGTCGAGAATATTTAGGTTTGAATTTAGCTTATTCTTTTTTCACCCACA
TGATTATTCATCTGATTCATCTTAAATTATGTTGGGTTTAGGAGAGCCCAATTATTGGGGAGGCAGGGTGGCATTGGTATCATTATGGCCTGCAACTATGCCAACATGTTCAATACTTTATTATTGTATATTGTTGTCAGAGCTTGATGTTGGCCAATTGGTGGAGCATTTTATTAAAAAATTTGACATAGATGCTCAGTTTGTAGGCTGAAACTGTTACAAGAGAATCAAACAGAGCTGAGCCTTTGGTCTATAACGCTGTGACTTTGACATGCATGCATAGAGACTTTGTTAGCTTCTCTTTTCCATATAAATTGGCTGTTATGTTGGTTGGTTTTGCTCAAGCCATTCAATTCTCTTGTGAGTTTCAGCAATTTACAAACCAAGGATGGTGATCTCATCGATATTCACCGACGCCCACCAGCCTGTTAGATCGGTTAGCTTGCCTGCGAGGGTGGAGCTCGAGCCAGAACCATTGCTGGAGAGCCTAAAATCCTTTCAAGTTTCGTCTTCCAATGCGAAAACGACTCCTTTTGGACTCGAAGGGATCCGAGCTGCATTGGTTGGGCTAGCAGAGTTGTATAACTCTGTTGGAGAGCTTGTTCAGTCTTCTTCCACCCAGCAAGCTCTTGTTCACTATAAGGAGGGGAAGCTTGTGGAAGAGGCTTTAACTGAGTCTGTTATATTGATAGATTCTTGTTGCTCTGCAAGAGACATAATCCTTACGATGAAACAAAATATACAATCCCTTCAATCGGCTTTACGTCGAAAAGTAGCAGATTCGAGCATCGAAAGCCATGTTCGTGCCTACTTCAGCTTCCGAAGGAAAGCGAAGAAAGACATCGGAAGCTTCCTCGGTTCATTGAAGCAAATGCAAAGTAATAGAACAACAAGCTTCCCTTTATTGGATCTACCAAACCATGATTTGTTGCCTCTTATCAGACTGCTAAGAGAAGCAAGAACCATCAGCATCTCCATCTTTGGAGAGCTTCTAGCGTTCCTATCAACGTCGGTGACGAAGGGGAAGGCTAGCGGGTGGTCATTGGTCTCACAATTGATGCCAACGATCAGGTCGAGATCGGGTAAAGGACGGAAAATAGTAAATGAATTGGAGAGTGTGGATATTGGTCTCCATTCTCTCCTTGGCCATGGGAGAGAAAACGAAAGCAATGATAAAAAAGCTGAAGTTCAAATGGCACAAAGAAGGCTTAGAACATTGGCGTCAAGTTTTGAAGGAATAGAGAGTGAATTGGATTGCATGTTCAGATGTTTAGTTAAACACAGAGTGTGTTTCCTTAACATGTTAGTTCATTGAATTGTTTGAGATCTCAAACATTCCAAACTCTTGTACATTTTAGCCACTTTGTGCTCACAATTTTATCTAAGATAAATATAACCCGTCATTCTTTGATAG

mRNA sequence

ATGCGTACAATTTACAAACCAAGGATGGTGATCTCATCGATATTCACCGACGCCCACCAGCCTGTTAGATCGGTTAGCTTGCCTGCGAGGGTGGAGCTCGAGCCAGAACCATTGCTGGAGAGCCTAAAATCCTTTCAAGTTTCGTCTTCCAATGCGAAAACGACTCCTTTTGGACTCGAAGGGATCCGAGCTGCATTGGTTGGGCTAGCAGAGTTGTATAACTCTGTTGGAGAGCTTGTTCAGTCTTCTTCCACCCAGCAAGCTCTTGTTCACTATAAGGAGGGGAAGCTTGTGGAAGAGGCTTTAACTGAGTCTGTTATATTGATAGATTCTTGTTGCTCTGCAAGAGACATAATCCTTACGATGAAACAAAATATACAATCCCTTCAATCGGCTTTACGTCGAAAAGTAGCAGATTCGAGCATCGAAAGCCATGTTCGTGCCTACTTCAGCTTCCGAAGGAAAGCGAAGAAAGACATCGGAAGCTTCCTCGGTTCATTGAAGCAAATGCAAAGTAATAGAACAACAAGCTTCCCTTTATTGGATCTACCAAACCATGATTTGTTGCCTCTTATCAGACTGCTAAGAGAAGCAAGAACCATCAGCATCTCCATCTTTGGAGAGCTTCTAGCGTTCCTATCAACGTCGGTGACGAAGGGGAAGGCTAGCGGGTGGTCATTGGTCTCACAATTGATGCCAACGATCAGGTCGAGATCGGGTAAAGGACGGAAAATAGTAAATGAATTGGAGAGTGTGGATATTGGTCTCCATTCTCTCCTTGGCCATGGGAGAGAAAACGAAAGCAATGATAAAAAAGCTGAAGTTCAAATGGCACAAAGAAGGCTTAGAACATTGGCGTCAAGTTTTGAAGGAATAGAGAGTGAATTGGATTGCATGTTCAGATGTTTAGTTAAACACAGAGTGTGTTTCCTTAACATGTTAGTTCATTGAATTGTTTGAGATCTCAAACATTCCAAACTCTTGTACATTTTAGCCACTTTGTGCTCACAATTTTATCTAAGATAAATATAACCCGTCATTCTTTGATAG

Coding sequence (CDS)

ATGCGTACAATTTACAAACCAAGGATGGTGATCTCATCGATATTCACCGACGCCCACCAGCCTGTTAGATCGGTTAGCTTGCCTGCGAGGGTGGAGCTCGAGCCAGAACCATTGCTGGAGAGCCTAAAATCCTTTCAAGTTTCGTCTTCCAATGCGAAAACGACTCCTTTTGGACTCGAAGGGATCCGAGCTGCATTGGTTGGGCTAGCAGAGTTGTATAACTCTGTTGGAGAGCTTGTTCAGTCTTCTTCCACCCAGCAAGCTCTTGTTCACTATAAGGAGGGGAAGCTTGTGGAAGAGGCTTTAACTGAGTCTGTTATATTGATAGATTCTTGTTGCTCTGCAAGAGACATAATCCTTACGATGAAACAAAATATACAATCCCTTCAATCGGCTTTACGTCGAAAAGTAGCAGATTCGAGCATCGAAAGCCATGTTCGTGCCTACTTCAGCTTCCGAAGGAAAGCGAAGAAAGACATCGGAAGCTTCCTCGGTTCATTGAAGCAAATGCAAAGTAATAGAACAACAAGCTTCCCTTTATTGGATCTACCAAACCATGATTTGTTGCCTCTTATCAGACTGCTAAGAGAAGCAAGAACCATCAGCATCTCCATCTTTGGAGAGCTTCTAGCGTTCCTATCAACGTCGGTGACGAAGGGGAAGGCTAGCGGGTGGTCATTGGTCTCACAATTGATGCCAACGATCAGGTCGAGATCGGGTAAAGGACGGAAAATAGTAAATGAATTGGAGAGTGTGGATATTGGTCTCCATTCTCTCCTTGGCCATGGGAGAGAAAACGAAAGCAATGATAAAAAAGCTGAAGTTCAAATGGCACAAAGAAGGCTTAGAACATTGGCGTCAAGTTTTGAAGGAATAGAGAGTGAATTGGATTGCATGTTCAGATGTTTAGTTAAACACAGAGTGTGTTTCCTTAACATGTTAGTTCATTGA

Protein sequence

MRTIYKPRMVISSIFTDAHQPVRSVSLPARVELEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSALRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVNELESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLNMLVH
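
The protein sequence above is consistent with translating the CDS under the standard genetic code. A minimal sketch, assuming the standard nuclear code and using only the first eight and last four codons of the CDS listed above; `translate` is an illustrative helper, not the tool the database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)  # 'X' for ambiguous codons

cds_start = "ATGCGTACAATTTACAAACCAAGG"  # first 24 nt of the CDS above
cds_end = "TTAGTTCATTGA"                # last 12 nt of the CDS above
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first eight residues of the protein (MRTIYKPR) and to its last three residues followed by the stop codon (LVH*).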
BLAST of Cp4.1LG10g10430 vs. TrEMBL
Match: A0A0A0KMP4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G507490 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 2.8e-130
Identity = 252/309 (81.55%), Positives = 278/309 (89.97%), Query Frame = 1

Query: 9   MVISSIFTDAHQPVRSVSLPARVELEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVG 68
           MVI SIFT AHQPVRSVSLP RVEL+PEPLL+SLKSFQVSS NAKTTPFGLE I+AALVG
Sbjct: 1   MVIKSIFTGAHQPVRSVSLPTRVELKPEPLLQSLKSFQVSSCNAKTTPFGLEEIQAALVG 60

Query: 69  LAELYNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQS 128
           LAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL ESV+LIDSC SARDIILTMKQNIQ+
Sbjct: 61  LAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILTMKQNIQT 120

Query: 129 LQSALRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLL-DLPNHD 188
           LQSALRRK A+S +E+HVRAYFSFRRKAKKDIG+++  LK+M+++RTT+F LL D+ NHD
Sbjct: 121 LQSALRRKCANSIVENHVRAYFSFRRKAKKDIGNYISVLKRMENDRTTNFFLLWDIQNHD 180

Query: 189 LLPLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVN 248
           LLPLI+LLREAR++SISIFGELLAFLS  V KG A GWSLVSQLMP I+S SGKG+K VN
Sbjct: 181 LLPLIKLLREARSVSISIFGELLAFLSAPVVKGNARGWSLVSQLMPVIKSGSGKGQKTVN 240

Query: 249 ELESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHR 308
           ELE+VDI L+SLLG GR    ND KAEVQ+AQRR+ TLASSFEGIES LDCMFRCLVKHR
Sbjct: 241 ELENVDIALNSLLGQGRGTCGNDNKAEVQIAQRRIGTLASSFEGIESGLDCMFRCLVKHR 300

Query: 309 VCFLNMLVH 317
           VCFLNMLVH
Sbjct: 301 VCFLNMLVH 309
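
The percentages in the HSP summary line are ratios of matching columns to alignment length. The sketch below recomputes them from the counts reported for this hit; `hsp_percentages` is a hypothetical helper, and the `Label = n/m` pattern is assumed from the summary lines on this page.

```python
import re

def hsp_percentages(line):
    """Recompute percentages from 'Label = matches/length' pairs in an HSP summary line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 252/309 (81.55%), Positives = 278/309 (89.97%)"
print(hsp_percentages(line))
```

The recomputed values agree with the 81.55% identity and 89.97% positives reported for the Cucumis sativus match.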

BLAST of Cp4.1LG10g10430 vs. TrEMBL
Match: A0A061DF67_THECC (Selection and upkeep of intraepithelial T-cells protein 6, putative OS=Theobroma cacao GN=TCM_000100 PE=4 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 1.7e-63
Identity = 144/301 (47.84%), Positives = 208/301 (69.10%), Query Frame = 1

Query: 17  DAHQPVRSVSLPARVE---LEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAELY 76
           D HQPVRS+SLP+RV    ++ E  L  L++++ SS +A     G E I+  LVGLAELY
Sbjct: 9   DVHQPVRSISLPSRVHPASVKLEAALNHLEAWKTSSHSAAAVSSG-ETIQIGLVGLAELY 68

Query: 77  NSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSAL 136
           N V E++ S  T+Q L+HY+ GKLVEEAL ESV  +D+C   RD++L MK+++Q+LQSAL
Sbjct: 69  NCVQEIISSPQTKQKLLHYQNGKLVEEALDESVTFLDTCGKGRDLLLKMKEHVQTLQSAL 128

Query: 137 RRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLIR 196
           RR+  D SIE  V AY +FR+K KK++   LG+LK+++S +  S  LLD+  H LL +++
Sbjct: 129 RRRRGDLSIEIEVAAYINFRKKVKKELAKCLGALKEIES-KIGSSTLLDVDQH-LLMVVK 188

Query: 197 LLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVNELESVD 256
            LREA +I+IS+F  LL FLS    K +  GWS +S+L+PT    S K +K++NE+ SVD
Sbjct: 189 ALREASSITISVFQSLLLFLSMPSMKTRVRGWSKISKLIPTRFLSSEKEQKVMNEVGSVD 248

Query: 257 IGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLNM 315
           + ++S+ GH    +  D  AEV+M Q  L+TL +S +G E+ LDC+F+CLV++RV FLN+
Sbjct: 249 LAVYSINGH---LKIGDSMAEVEMMQMMLKTLDASIDGFEAGLDCIFKCLVQNRVTFLNI 303

BLAST of Cp4.1LG10g10430 vs. TrEMBL
Match: A0A0D2QVL1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G189500 PE=4 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 1.1e-60
Identity = 134/304 (44.08%), Positives = 203/304 (66.78%), Query Frame = 1

Query: 16  TDAHQPVRSVSLPARVE---LEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAEL 75
           +D HQPVRS+SLP+RV    ++ E  L  LK+++ SS +  T  F  E IR  LV LA+L
Sbjct: 8   SDVHQPVRSISLPSRVHPTCVKLEAALNHLKAWKTSSISTSTAGFSGETIRIGLVDLADL 67

Query: 76  YNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSA 135
           YN V E + S  TQ++LV Y+ G+LVEEAL ESV  +D+C  ARD++L MKQ++Q+LQSA
Sbjct: 68  YNCVRETITSPQTQRSLVQYQNGRLVEEALDESVTFLDTCGKARDLLLAMKQHVQTLQSA 127

Query: 136 LRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLI 195
           LRR+  DSSIE+ + AY +FR+  KK++   LG+LK+++    +S   LD+  H LL ++
Sbjct: 128 LRRRRGDSSIETQIAAYINFRKTVKKEVAKCLGALKKLERRFVSSSTPLDVDPH-LLMVV 187

Query: 196 RLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVNELESV 255
           ++LRE  +I+IS+F  LL FLS    K +  GWS +++L+P + S   +  K++NE+ +V
Sbjct: 188 KVLRETTSITISVFQSLLLFLSVPSMKTRVGGWSKITKLIPLLSSE--REHKVINEVGAV 247

Query: 256 DIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLN 315
           D+   S+ G   + ++     EV M QR L+ + ++ +G E+ LDC+FRCLV++RV FLN
Sbjct: 248 DLAFCSING---QLKNGGGMVEVDMLQRTLKAVGATIDGFETGLDCVFRCLVQNRVTFLN 305

Query: 316 MLVH 317
           ++ H
Sbjct: 308 IITH 305

BLAST of Cp4.1LG10g10430 vs. TrEMBL
Match: A0A061DF42_THECC (Selection and upkeep of intraepithelial T-cells protein 6, putative OS=Theobroma cacao GN=TCM_000099 PE=4 SV=1)

HSP 1 Score: 227.3 bits (578), Expect = 2.7e-56
Identity = 141/305 (46.23%), Positives = 204/305 (66.89%), Query Frame = 1

Query: 11  ISSIFTDAHQPVRSVSLPARVELEP-EPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGL 70
           ISS+   ++ PVRS+SLP+R++    E  L  LK+F++SS++ +T   G E I      L
Sbjct: 16  ISSVIHGSNVPVRSISLPSRLQPNSIEAELNELKTFRLSSAS-RTIHAGGETICTGFTRL 75

Query: 71  AELYNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSL 130
           A+LYN++ E+VQS  TQQAL H +  KLVEEAL +SV L+D+C +ARD+IL M + +Q L
Sbjct: 76  AKLYNNIEEIVQSPLTQQALHHQQNVKLVEEALDDSVGLLDACGTARDLILMMMEQVQDL 135

Query: 131 QSALRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLL 190
           QSALRR+  DS I S++ AY SFR+K +K+I   L  LK+++ N  T FPL ++  H L 
Sbjct: 136 QSALRRRGGDSCIGSNILAYISFRKKLQKNIAKTLRVLKRLECNIGT-FPLFNVDCH-LS 195

Query: 191 PLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVNEL 250
            +++  RE+  I+IS+F  LL+FLS  V K KA GWSL+S+L+P    RS   +KI NE+
Sbjct: 196 MVVKAQRESHAITISLFQSLLSFLSMPVLKTKAGGWSLISKLVPIAAERS---QKIFNEV 255

Query: 251 ESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVC 310
             VD  LH++ G  R+   ND   + Q+  +RL TL+ S +G+E+ LDC+FRCL+++RV 
Sbjct: 256 GIVDFTLHTVQGKLRK---NDATIDPQIELKRLETLSGSIKGLEAGLDCLFRCLIRNRVS 311

Query: 311 FLNML 315
            LN+L
Sbjct: 316 LLNIL 311

BLAST of Cp4.1LG10g10430 vs. TrEMBL
Match: M5VPI0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021917mg PE=4 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 3.5e-56
Identity = 148/304 (48.68%), Positives = 198/304 (65.13%), Query Frame = 1

Query: 21  PVRSVSLPARVELEPEPLLESLKS-----FQVSSSNAKTTPFGLEGIRAALVGLAELYNS 80
           PVRS+SLP+R+    + +   LK      F  ++  A ++P G E +   L GLAELYN 
Sbjct: 10  PVRSISLPSRLNPNSQKIESELKKLKTLRFSCAAEAASSSPLGSEALLEGLSGLAELYNC 69

Query: 81  VGELVQSSSTQQALVHYKEGK-LVEEALTESVILIDSCCSARDIILTMKQNIQSLQSALR 140
           + ELV S  TQQAL H+++ K LVEEAL  SV L+DSC +ARD++LTMK+++Q+LQSALR
Sbjct: 70  IEELVHSPLTQQALNHHQQCKTLVEEALDGSVGLLDSCGNARDLLLTMKEHVQNLQSALR 129

Query: 141 RKVA--DSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSN-RTTSFPLLDLPNHDLLPL 200
           R+     SSIES+V AY  FR+KAKK I   L  LK+M+SN    SF LLDL +H++  +
Sbjct: 130 RRRTGDSSSIESNVHAYICFRKKAKKSIAKSLRDLKKMESNINIGSFCLLDL-DHNVQIV 189

Query: 201 IRLLREARTISISIFGELLAFLSTSVTKG-KASGWSLVSQLMPTIRSRSGKGRKIVNELE 260
           ++LLRE   ++IS+F  L  FLS  +T   KAS W LVS+LM    + S  G+KI NE+ 
Sbjct: 190 LKLLRELSAVTISVFQSLCVFLSMPLTNNTKASKWCLVSKLMAVRFAASENGQKIYNEVG 249

Query: 261 SVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCF 315
           SVDI L SL  HG   +S+  K +VQ  Q RL TL  S  G+E  L+ +FRCL++HRV  
Sbjct: 250 SVDIALCSL--HGHMKKSDYAKTDVQGVQWRLDTLDCSISGLEGGLERLFRCLLQHRVSL 309

BLAST of Cp4.1LG10g10430 vs. TAIR10
Match: AT4G35680.1 (AT4G35680.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 146.0 bits (367), Expect = 4.0e-35
Identity = 107/307 (34.85%), Positives = 174/307 (56.68%), Query Frame = 1

Query: 19  HQPVRSVSLPARVELEPEPLLESLKSFQV----SSSNAKTTPFGLEGIRAALVGLAELYN 78
           HQPVRS SLP+R+      L  +L    +    SSS + +  FG E +   LV L ELY 
Sbjct: 15  HQPVRSASLPSRIHPLSVKLRTALSRLSIWRRSSSSISVSASFGYETVLVGLVNLTELYG 74

Query: 79  SVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSALR 138
            V EL++S   +  L+H++EGKL++E+L  SV+L+D     R++I+ M++++ +L+SALR
Sbjct: 75  CVHELLESPYVKHTLLHHQEGKLLDESLDGSVLLLDVYEGTREVIVAMREHVTNLKSALR 134

Query: 139 RKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLIRL 198
           RK    S+E   +AYF+ R+KAKK+I   + +LK+M++   ++    +      +    +
Sbjct: 135 RK---GSLEKEAKAYFNLRKKAKKEISKQINALKKMETRDIST----NTDQDSAIASTSV 194

Query: 199 LREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLM--PTIR-SRSGKGRKIVNELES 258
           LRE   I++S+F  LL FLST       + +     L+  P +  S S K   ++ E++S
Sbjct: 195 LRETIQITVSMFRHLLLFLSTIPPPPSPAIFKTTIGLLSIPFVSPSLSDKSLILIKEMKS 254

Query: 259 V-DIGLHSLLGHGRENESNDKKAEVQ-MAQRRLR--TLASSFEGIESELDCMFRCLVKHR 315
           + D+ L S+L      +S     EV+ M   ++R   +   F  +E+ELD + +CLVK+R
Sbjct: 255 LDDVFLGSIL------DSRKTLFEVETMENEKMRRDVVEDGFRDLEAELDSVSKCLVKNR 308

BLAST of Cp4.1LG10g10430 vs. TAIR10
Match: AT4G35660.1 (AT4G35660.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 116.3 bits (290), Expect = 3.4e-26
Identity = 103/310 (33.23%), Positives = 158/310 (50.97%), Query Frame = 1

Query: 12  SSIFTDAHQPVRSVSLPAR-VELEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLA 71
           SS     H P RS+SLP R +  + + + E LK  Q  +S++  +      I+  L  L 
Sbjct: 6   SSSVATTHVPARSISLPTRLIHPKAQRVEEELKKIQALNSSSSAS----SRIQLGLAKLV 65

Query: 72  ELYNSVGELVQSSST-QQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSL 131
           ELY+ V E V SS   QQAL   +  KLVE+AL ES++L+D     RD+I T+ ++IQ L
Sbjct: 66  ELYDFVNEQVISSPQGQQALRLCRNRKLVEDALDESIVLLDVSDFTRDLIGTLMEHIQEL 125

Query: 132 QSALRRKVAD-SSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLD--LPNH 191
           QSALRR+  + SS++S +R+Y SF +K+K +    + SL + Q+ +          L  H
Sbjct: 126 QSALRRRRGNLSSVQSEIRSYISFHKKSKTEAARQVKSLARRQTKKKAWVIKQSGGLDEH 185

Query: 192 DLLPLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGK--GRK 251
             + +  +LR++   +ISI   LL FLSTS    +     +       IRS  G+  GRK
Sbjct: 186 SSM-VSNILRQSNASTISILQSLLQFLSTSGENNEKKNGEIGCVDNSMIRSFFGRIIGRK 245

Query: 252 IVNELESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLV 311
           +V E+++      ++LG                   RL  +  S E I+ EL  + R L+
Sbjct: 246 MVKEIDA-----QTILG-------------------RLAMVNVSLEAIKDELSYLSRRLI 286

Query: 312 KHRVCFLNML 315
           +HR   LN++
Sbjct: 306 QHRASLLNIV 286

BLAST of Cp4.1LG10g10430 vs. TAIR10
Match: AT3G51410.1 (AT3G51410.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 115.9 bits (289), Expect = 4.4e-26
Identity = 99/304 (32.57%), Positives = 153/304 (50.33%), Query Frame = 1

Query: 16  TDAHQPVRSVSLPARVE---LEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAEL 75
           T    PVRS+SLP+R+     + +  L  +  FQ SS +        + + A+L+ L+EL
Sbjct: 33  TKLQMPVRSISLPSRIHHPSAKFQAALSQIHLFQNSSDS--------QSLHASLLNLSEL 92

Query: 76  YNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSA 135
           Y+S+ +L  S  T QA          E +L  S  L+DSC +AR+++LT+++++ +LQSA
Sbjct: 93  YHSLHQLNHSLPTAQA----------EHSLDVSATLLDSCDAARNLVLTLREHLLNLQSA 152

Query: 136 LRRKVADSSIESHVRAYFSFRRKAKKDIGS-FLGSLKQMQSNRTTSFPLLDLPNHDLLPL 195
           LRRK  D S+E  ++ YFSFR+K KK+     LG  K++  + TT               
Sbjct: 153 LRRK--DKSMEVQIKEYFSFRKKIKKETNKLLLGLKKKLDDSETTE-------------- 212

Query: 196 IRLLREARTISISIFGELLAFLSTSVT-KGKASGWSLVSQLMPTIRSRSGKGRKIVNELE 255
                    +S+SIF  L  FLST+ T K K      VS+L   I         I++EL+
Sbjct: 213 ---------LSVSIFRSLFMFLSTTSTMKTKTCSLKFVSRL---ISGGHRSSSSIMSELQ 272

Query: 256 SVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCF 315
           ++D  L S           D   E+   ++ L  L    E +E+ LD +F+ LV++RV  
Sbjct: 273 NLDAVLRS---------DGDNSKEI---KKMLERLEERTEELETALDSLFKSLVQYRVYL 278

BLAST of Cp4.1LG10g10430 vs. TAIR10
Match: AT2G17080.1 (AT2G17080.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 92.4 bits (228), Expect = 5.2e-19
Identity = 94/303 (31.02%), Positives = 142/303 (46.86%), Query Frame = 1

Query: 22  VRSVSLPARVELEPEPLLESL----KSFQVSSSNAKTTPFGLEGIRAALVGLAELYNSVG 81
           VRS S P+R   +   + E L     S Q SSS++ +       I   L  L EL+ S+ 
Sbjct: 7   VRSNSFPSRSHPQAAHVDEQLARLRSSEQASSSSSSS-------ICQRLDNLQELHESLD 66

Query: 82  ELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSALRRKV 141
           +L+    TQQAL      K VE+ L  S+ ++D C  ++D +  MK+ +  +QS LRRK 
Sbjct: 67  KLISRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKR 126

Query: 142 ADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLIRLLRE 201
            D S E  V+ Y + R+  KK       SLK  Q+            N D L +     E
Sbjct: 127 GDLSEE--VKKYLTSRKSLKKSFQKVQKSLKVTQAEDN---------NDDTLAVFG---E 186

Query: 202 ARTISISIFGELLAFLSTSVTKGKASGWSLVSQLM----PTIRSRSGKGRKIVNELESVD 261
           A  I++S+F  LL+++S S T    S WS+VS+LM     T  ++  +  K+ +E +S  
Sbjct: 187 AEAITLSLFDSLLSYMSGSKT---CSKWSVVSKLMNKKKVTCEAQENEFTKVDSEFQS-- 246

Query: 262 IGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLNM 317
                        E   K  +VQ        L S  + +E  L+ + + L+K+RV FLN+
Sbjct: 247 -------------EKTLKMDDVQ-------NLESCIQDLEDGLESLSKSLIKYRVSFLNI 263

BLAST of Cp4.1LG10g10430 vs. TAIR10
Match: AT4G35690.1 (AT4G35690.1 Arabidopsis protein of unknown function (DUF241))

HSP 1 Score: 90.9 bits (224), Expect = 1.5e-18
Identity = 78/300 (26.00%), Positives = 149/300 (49.67%), Query Frame = 1

Query: 22  VRSVSLPARVELEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAELYNSVGELVQ 81
           +RS+SLP+        + ESL   +V + N  T     E +   L GL ELYN   + ++
Sbjct: 10  LRSISLPSSSHPSTTGIEESLN--KVKTINTMTG--SSESVLMGLEGLEELYNCTEDFLK 69

Query: 82  SSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSALRRKV---A 141
             STQ+ +      + +EE L  S+ L+D C  +RD+++  +++++ +QS +RRK     
Sbjct: 70  MGSTQRVMSSSDGSEFMEEMLDGSLRLMDICSVSRDLMVETQEHVRGVQSCVRRKKVVGG 129

Query: 142 DSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHD-LLPLIRLLRE 201
           +  ++  V  Y  FR+  +K+    LGSLK +    ++S  + +    + L+ ++  +R+
Sbjct: 130 EDQLDVAVAGYVGFRKNMRKEAKRLLGSLKNIDGGLSSSSSVNNGEQEEHLVVVVDAMRQ 189

Query: 202 ARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRS-GKGRKIVNELESVDIGL 261
             ++S+++    L FLS     G+    ++ S+L   ++ +      +  NELE++D+ +
Sbjct: 190 VVSVSVAVLRSFLEFLS-----GRRQS-NIKSKLASVLKKKKVHHVEETKNELENLDLEI 249

Query: 262 HSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLNMLVH 317
                    N+   K  EV+M          S +G E +L+ +FR L++ R   LN++ H
Sbjct: 250 FC-----SRNDLQKKLEEVEM----------SIDGFEKKLEGLFRRLIRTRASLLNIISH 284

BLAST of Cp4.1LG10g10430 vs. NCBI nr
Match: gi|778719859|ref|XP_011658069.1| (PREDICTED: uncharacterized protein LOC101217557 [Cucumis sativus])

HSP 1 Score: 473.0 bits (1216), Expect = 4.0e-130
Identity = 252/309 (81.55%), Positives = 278/309 (89.97%), Query Frame = 1

Query: 9   MVISSIFTDAHQPVRSVSLPARVELEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVG 68
           MVI SIFT AHQPVRSVSLP RVEL+PEPLL+SLKSFQVSS NAKTTPFGLE I+AALVG
Sbjct: 1   MVIKSIFTGAHQPVRSVSLPTRVELKPEPLLQSLKSFQVSSCNAKTTPFGLEEIQAALVG 60

Query: 69  LAELYNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQS 128
           LAELYNSVGELVQSSSTQQALVHYKEGKLVEEAL ESV+LIDSC SARDIILTMKQNIQ+
Sbjct: 61  LAELYNSVGELVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILTMKQNIQT 120

Query: 129 LQSALRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLL-DLPNHD 188
           LQSALRRK A+S +E+HVRAYFSFRRKAKKDIG+++  LK+M+++RTT+F LL D+ NHD
Sbjct: 121 LQSALRRKCANSIVENHVRAYFSFRRKAKKDIGNYISVLKRMENDRTTNFFLLWDIQNHD 180

Query: 189 LLPLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVN 248
           LLPLI+LLREAR++SISIFGELLAFLS  V KG A GWSLVSQLMP I+S SGKG+K VN
Sbjct: 181 LLPLIKLLREARSVSISIFGELLAFLSAPVVKGNARGWSLVSQLMPVIKSGSGKGQKTVN 240

Query: 249 ELESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHR 308
           ELE+VDI L+SLLG GR    ND KAEVQ+AQRR+ TLASSFEGIES LDCMFRCLVKHR
Sbjct: 241 ELENVDIALNSLLGQGRGTCGNDNKAEVQIAQRRIGTLASSFEGIESGLDCMFRCLVKHR 300

Query: 309 VCFLNMLVH 317
           VCFLNMLVH
Sbjct: 301 VCFLNMLVH 309

BLAST of Cp4.1LG10g10430 vs. NCBI nr
Match: gi|659081068|ref|XP_008441133.1| (PREDICTED: uncharacterized protein LOC103485362 [Cucumis melo])

HSP 1 Score: 454.5 bits (1168), Expect = 1.5e-124
Identity = 243/309 (78.64%), Positives = 273/309 (88.35%), Query Frame = 1

Query: 9   MVISSIFTDAHQPVRSVSLPARVELEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVG 68
           MVI+SIFT AHQP+RSVSLP RVE E EPLL+SLKSFQVSS  AKTTP GLE I+ ALVG
Sbjct: 1   MVITSIFTGAHQPIRSVSLPTRVERESEPLLQSLKSFQVSSCKAKTTPLGLEEIQVALVG 60

Query: 69  LAELYNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQS 128
           LAELYNSVG+LVQSSSTQQALVHYKEGKLVEEAL ESV+LIDSC SARDIILTMKQ IQ+
Sbjct: 61  LAELYNSVGKLVQSSSTQQALVHYKEGKLVEEALNESVVLIDSCSSARDIILTMKQIIQT 120

Query: 129 LQSALRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLL-DLPNHD 188
           LQSALRRK A+S +ESHVRAYFS+RRKAKK+IGS++G LK+M+++RTT F LL D+ NHD
Sbjct: 121 LQSALRRKCANSIVESHVRAYFSYRRKAKKEIGSYIGVLKRMENDRTTDFFLLWDIQNHD 180

Query: 189 LLPLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVN 248
           LLP+I+LLREAR++SISIFGELLAFLS  V KGKA GWSLVSQLMP I+S SGKG+K VN
Sbjct: 181 LLPVIKLLREARSVSISIFGELLAFLSAPVVKGKARGWSLVSQLMPVIKSGSGKGQKTVN 240

Query: 249 ELESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHR 308
           E+E+VDI L+SLLG GR    ND KAEVQ+AQRR+ TLASSFEGIES LD MF+CLVKHR
Sbjct: 241 EMENVDIALNSLLGQGRGTCGNDNKAEVQIAQRRIGTLASSFEGIESGLDSMFKCLVKHR 300

Query: 309 VCFLNMLVH 317
           VCFLNMLVH
Sbjct: 301 VCFLNMLVH 309

BLAST of Cp4.1LG10g10430 vs. NCBI nr
Match: gi|590702135|ref|XP_007046553.1| (Selection and upkeep of intraepithelial T-cells protein 6, putative [Theobroma cacao])

HSP 1 Score: 251.1 bits (640), Expect = 2.5e-63
Identity = 144/301 (47.84%), Positives = 208/301 (69.10%), Query Frame = 1

Query: 17  DAHQPVRSVSLPARVE---LEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAELY 76
           D HQPVRS+SLP+RV    ++ E  L  L++++ SS +A     G E I+  LVGLAELY
Sbjct: 9   DVHQPVRSISLPSRVHPASVKLEAALNHLEAWKTSSHSAAAVSSG-ETIQIGLVGLAELY 68

Query: 77  NSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSAL 136
           N V E++ S  T+Q L+HY+ GKLVEEAL ESV  +D+C   RD++L MK+++Q+LQSAL
Sbjct: 69  NCVQEIISSPQTKQKLLHYQNGKLVEEALDESVTFLDTCGKGRDLLLKMKEHVQTLQSAL 128

Query: 137 RRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLIR 196
           RR+  D SIE  V AY +FR+K KK++   LG+LK+++S +  S  LLD+  H LL +++
Sbjct: 129 RRRRGDLSIEIEVAAYINFRKKVKKELAKCLGALKEIES-KIGSSTLLDVDQH-LLMVVK 188

Query: 197 LLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVNELESVD 256
            LREA +I+IS+F  LL FLS    K +  GWS +S+L+PT    S K +K++NE+ SVD
Sbjct: 189 ALREASSITISVFQSLLLFLSMPSMKTRVRGWSKISKLIPTRFLSSEKEQKVMNEVGSVD 248

Query: 257 IGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLNM 315
           + ++S+ GH    +  D  AEV+M Q  L+TL +S +G E+ LDC+F+CLV++RV FLN+
Sbjct: 249 LAVYSINGH---LKIGDSMAEVEMMQMMLKTLDASIDGFEAGLDCIFKCLVQNRVTFLNI 303

BLAST of Cp4.1LG10g10430 vs. NCBI nr
Match: gi|823191419|ref|XP_012491452.1| (PREDICTED: uncharacterized protein LOC105803659 [Gossypium raimondii])

HSP 1 Score: 241.9 bits (616), Expect = 1.5e-60
Identity = 134/304 (44.08%), Positives = 203/304 (66.78%), Query Frame = 1

Query: 16  TDAHQPVRSVSLPARVE---LEPEPLLESLKSFQVSSSNAKTTPFGLEGIRAALVGLAEL 75
           +D HQPVRS+SLP+RV    ++ E  L  LK+++ SS +  T  F  E IR  LV LA+L
Sbjct: 8   SDVHQPVRSISLPSRVHPTCVKLEAALNHLKAWKTSSISTSTAGFSGETIRIGLVDLADL 67

Query: 76  YNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNIQSLQSA 135
           YN V E + S  TQ++LV Y+ G+LVEEAL ESV  +D+C  ARD++L MKQ++Q+LQSA
Sbjct: 68  YNCVRETITSPQTQRSLVQYQNGRLVEEALDESVTFLDTCGKARDLLLAMKQHVQTLQSA 127

Query: 136 LRRKVADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPNHDLLPLI 195
           LRR+  DSSIE+ + AY +FR+  KK++   LG+LK+++    +S   LD+  H LL ++
Sbjct: 128 LRRRRGDSSIETQIAAYINFRKTVKKEVAKCLGALKKLERRFVSSSTPLDVDPH-LLMVV 187

Query: 196 RLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKIVNELESV 255
           ++LRE  +I+IS+F  LL FLS    K +  GWS +++L+P + S   +  K++NE+ +V
Sbjct: 188 KVLRETTSITISVFQSLLLFLSVPSMKTRVGGWSKITKLIPLLSSE--REHKVINEVGAV 247

Query: 256 DIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVKHRVCFLN 315
           D+   S+ G   + ++     EV M QR L+ + ++ +G E+ LDC+FRCLV++RV FLN
Sbjct: 248 DLAFCSING---QLKNGGGMVEVDMLQRTLKAVGATIDGFETGLDCVFRCLVQNRVTFLN 305

Query: 316 MLVH 317
           ++ H
Sbjct: 308 IITH 305

BLAST of Cp4.1LG10g10430 vs. NCBI nr
Match: gi|1009144039|ref|XP_015889582.1| (PREDICTED: uncharacterized protein LOC107424329 [Ziziphus jujuba])

HSP 1 Score: 240.4 bits (612), Expect = 4.4e-60
Identity = 145/309 (46.93%), Positives = 203/309 (65.70%), Query Frame = 1

Query: 9   MVISSIFTDAHQPVRSVSLPARVELEPEPLLESLKSFQV--SSSNAKTTPFGLEGIRAAL 68
           M  SS     HQPVRS+SLP+R+    + + E L   +    SS +   P   E I+  L
Sbjct: 1   MARSSATPCLHQPVRSISLPSRLHPNSQKIEEQLSKLKTWKLSSTSMGIPLARETIQLGL 60

Query: 69  VGLAELYNSVGELVQSSSTQQALVHYKEGKLVEEALTESVILIDSCCSARDIILTMKQNI 128
            GLAELYNS+ EL  S  TQ+AL+ ++  KLVEE L  S+ L+D+C +ARD++L MK+++
Sbjct: 61  TGLAELYNSIKELFHSPLTQKALLQHECRKLVEETLDGSIGLLDACGTARDLLLNMKKHL 120

Query: 129 QSLQSALRRK-VADSSIESHVRAYFSFRRKAKKDIGSFLGSLKQMQSNRTTSFPLLDLPN 188
           Q LQSA RR+   DSSIE++V+AY +FR+ AKKDI   + +LK M +N      LL++ +
Sbjct: 121 QDLQSAFRRRSTTDSSIETNVQAYITFRKLAKKDIVKSIRALKSMHANAVAFNYLLNV-D 180

Query: 189 HDLLPLIRLLREARTISISIFGELLAFLSTSVTKGKASGWSLVSQLMPTIRSRSGKGRKI 248
           H +L +I+LLRE  +I+ISIF  LL FLS  V K KA+GWSL+S+L+P   + S + +K+
Sbjct: 181 HHILMVIKLLRELSSITISIFWSLLMFLSVPVMKTKANGWSLISKLVPVTFAGSERAQKV 240

Query: 249 VNELESVDIGLHSLLGHGRENESNDKKAEVQMAQRRLRTLASSFEGIESELDCMFRCLVK 308
            NE+ +VDI L  L G+ R+N +   K  VQM QRRL TL    +G+E  LDC+FRCL++
Sbjct: 241 FNEVSNVDIALCYLDGNFRKNGA---KINVQMVQRRLETLDGVVDGLEGGLDCLFRCLIQ 300

Query: 309 HRVCFLNML 315
           HRV  LN+L
Sbjct: 301 HRVSLLNLL 305

The following BLAST results are available for this feature:

Database: TrEMBL
Match Name         E-value    Identity (%)  Description
A0A0A0KMP4_CUCSA   2.8e-130   81.55         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G507490 PE=4 SV=1
A0A061DF67_THECC   1.7e-63    47.84         Selection and upkeep of intraepithelial T-cells protein 6, putative OS=Theobroma...
A0A0D2QVL1_GOSRA   1.1e-60    44.08         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G189500 PE=4 SV=1
A0A061DF42_THECC   2.7e-56    46.23         Selection and upkeep of intraepithelial T-cells protein 6, putative OS=Theobroma...
M5VPI0_PRUPE       3.5e-56    48.68         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021917mg PE=4 SV=1

Database: TAIR10
Match Name         E-value    Identity (%)  Description
AT4G35680.1        4.0e-35    34.85         Arabidopsis protein of unknown function (DUF241)
AT4G35660.1        3.4e-26    33.23         Arabidopsis protein of unknown function (DUF241)
AT3G51410.1        4.4e-26    32.57         Arabidopsis protein of unknown function (DUF241)
AT2G17080.1        5.2e-19    31.02         Arabidopsis protein of unknown function (DUF241)
AT4G35690.1        1.5e-18    26.00         Arabidopsis protein of unknown function (DUF241)

Database: NCBI nr
Match Name                          E-value    Identity (%)  Description
gi|778719859|ref|XP_011658069.1|    4.0e-130   81.55         PREDICTED: uncharacterized protein LOC101217557 [Cucumis sativus]
gi|659081068|ref|XP_008441133.1|    1.5e-124   78.64         PREDICTED: uncharacterized protein LOC103485362 [Cucumis melo]
gi|590702135|ref|XP_007046553.1|    2.5e-63    47.84         Selection and upkeep of intraepithelial T-cells protein 6, putative [Theobroma c...
gi|823191419|ref|XP_012491452.1|    1.5e-60    44.08         PREDICTED: uncharacterized protein LOC105803659 [Gossypium raimondii]
gi|1009144039|ref|XP_015889582.1|   4.4e-60    46.93         PREDICTED: uncharacterized protein LOC107424329 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004320   DUF241_pln
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG10g10430.1   Cp4.1LG10g10430.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                             Source    Source Term     Source Description   Alignment
IPR004320   Protein of unknown function DUF241, plant   PFAM      PF03087         DUF241               coord: 66..313; score: 7.8
None        No IPR available                            PANTHER   PTHR33070       FAMILY NOT NAMED     coord: 4..314; score: 1.6
None        No IPR available                            PANTHER   PTHR33070:SF3   EXPRESSED PROTEIN    coord: 4..314; score: 1.6

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) cover this gene:
Gene              Organism                       Block
Cp4.1LG10g10430   Cucumber (Chinese Long) v3     cpecucB0074
Cp4.1LG10g10430   Cucumber (Chinese Long) v3     cpecucB0092
Cp4.1LG10g10430   Wax gourd                      cpewgoB0097
Cp4.1LG10g10430   Wax gourd                      cpewgoB0071
Cp4.1LG10g10430   Cucurbita pepo (Zucchini)      cpecpeB066
Cp4.1LG10g10430   Cucurbita pepo (Zucchini)      cpecpeB076
Cp4.1LG10g10430   Cucumber (Gy14) v1             cgycpeB0033
Cp4.1LG10g10430   Cucumber (Gy14) v1             cgycpeB0843
Cp4.1LG10g10430   Cucurbita maxima (Rimu)        cmacpeB557
Cp4.1LG10g10430   Cucurbita maxima (Rimu)        cmacpeB656
Cp4.1LG10g10430   Cucurbita maxima (Rimu)        cmacpeB866
Cp4.1LG10g10430   Cucurbita moschata (Rifu)      cmocpeB511
Cp4.1LG10g10430   Cucurbita moschata (Rifu)      cmocpeB606
Cp4.1LG10g10430   Cucurbita moschata (Rifu)      cmocpeB806
Cp4.1LG10g10430   Wild cucumber (PI 183967)      cpecpiB064
Cp4.1LG10g10430   Wild cucumber (PI 183967)      cpecpiB080
Cp4.1LG10g10430   Cucumber (Chinese Long) v2     cpecuB069
Cp4.1LG10g10430   Cucumber (Chinese Long) v2     cpecuB085
Cp4.1LG10g10430   Bottle gourd (USVL1VR-Ls)      cpelsiB054
Cp4.1LG10g10430   Bottle gourd (USVL1VR-Ls)      cpelsiB066
Cp4.1LG10g10430   Watermelon (Charleston Gray)   cpewcgB059
Cp4.1LG10g10430   Watermelon (Charleston Gray)   cpewcgB061
Cp4.1LG10g10430   Watermelon (97103) v1          cpewmB063
Cp4.1LG10g10430   Watermelon (97103) v1          cpewmB073
Cp4.1LG10g10430   Melon (DHL92) v3.5.1           cpemeB058
Cp4.1LG10g10430   Melon (DHL92) v3.5.1           cpemeB060
Cp4.1LG10g10430   Cucumber (Gy14) v2             cgybcpeB295
Cp4.1LG10g10430   Cucumber (Gy14) v2             cgybcpeB710
Cp4.1LG10g10430   Melon (DHL92) v3.6.1           cpemedB076
Cp4.1LG10g10430   Melon (DHL92) v3.6.1           cpemedB080
Cp4.1LG10g10430   Silver-seed gourd              carcpeB0554
Cp4.1LG10g10430   Silver-seed gourd              carcpeB0482
Cp4.1LG10g10430   Silver-seed gourd              carcpeB0913