Cp4.1LG04g08820.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG04g08820.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionEndonuclease/exonuclease/phosphatase domain-containing protein
LocationCp4.1LG04 : 10453069 .. 10455387 (+)
Sequence length1413
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACAAGAAGAAAAAAGAAAGAAAATAGTCACCTTGGCCTTTCAAGATTAGTCACGTGATCCAGGCATCGAGTTGTGGGCACTCACGTGCGTCGTCAAACCCCAGTGCCCTGCCCTCCCTTCGGACATGACCAAATTAAATTAGGATTTCTTTCCCCCTCCCTCTCTAATCTCAGCTACTTTTGATTCTTTACCCGTACCGGAAGGATCGAATGTACCGATAGGTTCTAGCTACTGCTGATCCTTTTTGGTTCTTAAAATCAGTAATCAACAAAAAGTAGGGATTTTCGAATTCTGGAAGAGAAAAATGAGTGTTACTTTGACTGTAATGACATTCAATCTCCATGATGATCAACCACCAGAGAGCTCAAATTCTTGGGAGAAGAGGAGGGACTTGTGTATCAGTGTCATCACTAGCTACTCGCCCGCGATTCTCTGCACTCAGCAAGGTAGCTCCCTTTTCCTTTACTCCTCTGCTACTCTACTTTATCGGATTTCATATTTGATCCTGTGAATGCCTTGTGGATCTTGATCATTGGTGCGATTTTGTTGCAATAATTGATCTGGGTCTTGTTTGAATCGAGTTTTCTTGGTGGGTTTTTGGTGCTTTATGATTTCATTCGAGTTTTGGGTTGTTTTGTTAGGAAGCTTCAATGGAGTAGTGTGGCCATTTTGGTGGAACTTTTTTTTGTGGACGAACTTCAATGATTGAGTTCAATACACTTGATCTAGGAAGAAACTAATGAACTGCTACCATTTCCTTCTCTGATTCCTTCATTTCTCTATTGGGTTCAGGTGTGAAATCTCAGTTGGACTTTCTTCAGCAGGGCTTGCCTGGTATGTTAGCTTGTTATTGTTCAGTTCATATTGGGTTAGAAGTTCTTTTTGTACGTTCTCTACGAATGTGGAGCTGAAATTTTTAACATCTTTGAGGGGAAGATCATAAAGCAAGTTCTTATAACTTAGAAGTATTAATTCTCTTTCAGGCTATGGCCAATTCGGAGTTTCAAGAAAAGGATCCAGAGACACATCAGATGAACACTGCACAATCTTCTACGACAAGGAGAAGGTATTGAATTTTGGTTCAAGTCTGATAATCCAGTGTGGTTTTTTATGCTTCACTGTTTGTTTCTTCGATTAAAAAGGTTGAGCTGTTAGAAGGTGGCACATTTTGGTTATCAGAATCACCTTCTGTCCCGGGAAGCATGTCGTGGGGTTCGATCGTTCCGCATATTGCAACGTGGGCGATATCCTTGAGATTTCATTATATTTTAACCTATCCATTAGACTTTGTGTTATTTGTTAACTTGACAGACTGCCCACACATTCGAACTGAAAGGAATCGAGCCTCCTGGGTTCTCATTTCAGATAGTGAATACGAGCATGGATGAGCTCAACCCTCGTGCTCGTAGACGAAGCGCTTTACTTACATGGCAGCACATTGCATCCTTACCTCCTAGCTTGCCAGTTATATATTGTGGAGGTTTCAACACAGAAAAGGAATCAACTACTGGTCGTTTTCTTCTTGGGAGATCCAGGTAACTGAAACAACATTTTCTGTCTTAATCAGCCTTATTAGCTCGCATAGTTCATTTATCTTAACAGATCTTGTCGTCCATGGTGGCAGAGAAAAAGGTGCAGTTGGGGACATGAGAGATACATGGGCGATTGCTCGGGCAAGGAAGAATGTTTCTCTTATTCGAACGTATCATGGCTTCAAAGGTATGCAATTTAGCTAAGACAAACTTGCTTAGGAGCGCGTTCGACCTTTGCTCGACTCGTTGAGACTCTTTTTTCAGGTGACAAACAGGGAGCTTTTGAATTTTTCAAGTTGATTCTAAGAGCACTCTGCCTTTGCTGGGATCGCCAGACCCAAGATCTACATGTAGATTGGATTCTTTTCAGAGGTAGATCTTTGATCCCTGTCGTGTGCGAAGTGGTAAACGATAATATCGACGGATTTTACCCGTCGTCTCACTACCCTTTGTTTTCCGAATTCATGCTTCCTCGAACCGTCAGAATGCTCGAAACAACAACTACTACTCAAGAGTGACACTAAATCACCATTTGTTATAACTCAGAATTCTCCATTGTTCTCTGTTTAAGCTCCAAATGATGTGTTTGTTGATGTAGAAGTTCATGTTTATAGCAGCAAGTTTTCTCTACTGGTTTACACTGTATGATAACCTTCAAATGACCACTTGAATTACTTCCAAAATGGTCATTGGTATTATTTTGCTTCTGAGAGAGTTCAAAGTTGTATCAAGTTTTCTTCTATGACAAGAAGTTGTTTCAAGAATGTGATTCATTGTTGATTTTCG

mRNA sequence

AACAAGAAGAAAAAAGAAAGAAAATAGTCACCTTGGCCTTTCAAGATTAGTCACGTGATCCAGGCATCGAGTTGTGGGCACTCACGTGCGTCGTCAAACCCCAGTGCCCTGCCCTCCCTTCGGACATGACCAAATTAAATTAGGATTTCTTTCCCCCTCCCTCTCTAATCTCAGCTACTTTTGATTCTTTACCCGTACCGGAAGGATCGAATGTACCGATAGGTTCTAGCTACTGCTGATCCTTTTTGGTTCTTAAAATCAGTAATCAACAAAAAGTAGGGATTTTCGAATTCTGGAAGAGAAAAATGAGTGTTACTTTGACTGTAATGACATTCAATCTCCATGATGATCAACCACCAGAGAGCTCAAATTCTTGGGAGAAGAGGAGGGACTTGTGTATCAGTGTCATCACTAGCTACTCGCCCGCGATTCTCTGCACTCAGCAAGGTGTGAAATCTCAGTTGGACTTTCTTCAGCAGGGCTTGCCTGGCTATGGCCAATTCGGAGTTTCAAGAAAAGGATCCAGAGACACATCAGATGAACACTGCACAATCTTCTACGACAAGGAGAAGACTGCCCACACATTCGAACTGAAAGGAATCGAGCCTCCTGGGTTCTCATTTCAGATAGTGAATACGAGCATGGATGAGCTCAACCCTCGTGCTCGTAGACGAAGCGCTTTACTTACATGGCAGCACATTGCATCCTTACCTCCTAGCTTGCCAGTTATATATTGTGGAGGTTTCAACACAGAAAAGGAATCAACTACTGGTCGTTTTCTTCTTGGGAGATCCAGAGAAAAAGGTGCAGTTGGGGACATGAGAGATACATGGGCGATTGCTCGGGCAAGGAAGAATGTTTCTCTTATTCGAACGTATCATGGCTTCAAAGGTGACAAACAGGGAGCTTTTGAATTTTTCAAGTTGATTCTAAGAGCACTCTGCCTTTGCTGGGATCGCCAGACCCAAGATCTACATGTAGATTGGATTCTTTTCAGAGGTAGATCTTTGATCCCTGTCGTGTGCGAAGTGGTAAACGATAATATCGACGGATTTTACCCGTCGTCTCACTACCCTTTGTTTTCCGAATTCATGCTTCCTCGAACCGTCAGAATGCTCGAAACAACAACTACTACTCAAGAGTGACACTAAATCACCATTTGTTATAACTCAGAATTCTCCATTGTTCTCTGTTTAAGCTCCAAATGATGTGTTTGTTGATGTAGAAGTTCATGTTTATAGCAGCAAGTTTTCTCTACTGGTTTACACTGTATGATAACCTTCAAATGACCACTTGAATTACTTCCAAAATGGTCATTGGTATTATTTTGCTTCTGAGAGAGTTCAAAGTTGTATCAAGTTTTCTTCTATGACAAGAAGTTGTTTCAAGAATGTGATTCATTGTTGATTTTCG

Coding sequence (CDS)

ATGAGTGTTACTTTGACTGTAATGACATTCAATCTCCATGATGATCAACCACCAGAGAGCTCAAATTCTTGGGAGAAGAGGAGGGACTTGTGTATCAGTGTCATCACTAGCTACTCGCCCGCGATTCTCTGCACTCAGCAAGGTGTGAAATCTCAGTTGGACTTTCTTCAGCAGGGCTTGCCTGGCTATGGCCAATTCGGAGTTTCAAGAAAAGGATCCAGAGACACATCAGATGAACACTGCACAATCTTCTACGACAAGGAGAAGACTGCCCACACATTCGAACTGAAAGGAATCGAGCCTCCTGGGTTCTCATTTCAGATAGTGAATACGAGCATGGATGAGCTCAACCCTCGTGCTCGTAGACGAAGCGCTTTACTTACATGGCAGCACATTGCATCCTTACCTCCTAGCTTGCCAGTTATATATTGTGGAGGTTTCAACACAGAAAAGGAATCAACTACTGGTCGTTTTCTTCTTGGGAGATCCAGAGAAAAAGGTGCAGTTGGGGACATGAGAGATACATGGGCGATTGCTCGGGCAAGGAAGAATGTTTCTCTTATTCGAACGTATCATGGCTTCAAAGGTGACAAACAGGGAGCTTTTGAATTTTTCAAGTTGATTCTAAGAGCACTCTGCCTTTGCTGGGATCGCCAGACCCAAGATCTACATGTAGATTGGATTCTTTTCAGAGGTAGATCTTTGATCCCTGTCGTGTGCGAAGTGGTAAACGATAATATCGACGGATTTTACCCGTCGTCTCACTACCCTTTGTTTTCCGAATTCATGCTTCCTCGAACCGTCAGAATGCTCGAAACAACAACTACTACTCAAGAGTGA

Protein sequence

MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGLPGYGQFGVSRKGSRDTSDEHCTIFYDKEKTAHTFELKGIEPPGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNTEKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLILRALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVRMLETTTTTQE
BLAST of Cp4.1LG04g08820.1 vs. TrEMBL
Match: A0A0B0P3J4_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_02575 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 4.4e-135
Identity = 230/297 (77.44%), Postives = 255/297 (85.86%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLH+D+  +S NSWEKRRDLCISVITSYSP ILCTQQGVKSQLD+LQQGL
Sbjct: 1   MSVSLTVMTFNLHEDESEDSPNSWEKRRDLCISVITSYSPIILCTQQGVKSQLDYLQQGL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKT----AHTFELK--------------GIEPP 120
           PGY QFG+SRKG +DTSDE CTIFYDKEK       TF L               G+EPP
Sbjct: 61  PGYDQFGISRKGPQDTSDECCTIFYDKEKVELIEGGTFWLSESPSVPGSTSWGSVGVEPP 120

Query: 121 GFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNTEKESTTGRFLLGR 180
           GFSFQ+VNTSMDE +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT+KESTTGRFLLGR
Sbjct: 121 GFSFQVVNTSMDEFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNTQKESTTGRFLLGR 180

Query: 181 SREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLILRALCLCWDRQTQD 240
           SRE G VGDMRD W  AR RKNVSLIRTYHGFKGDKQGA EF KL+ RALCLCWDRQTQD
Sbjct: 181 SREHGVVGDMRDVWPNARVRKNVSLIRTYHGFKGDKQGALEFLKLVFRALCLCWDRQTQD 240

Query: 241 LHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVRMLETTTTTQE 280
           LH+DWILFRGRSLIPV+C+VVNDN+DG+YPSSHYP+F+EF+LPRTVR++E  T+TQ+
Sbjct: 241 LHIDWILFRGRSLIPVLCQVVNDNMDGYYPSSHYPIFAEFLLPRTVRLMEPPTSTQD 297

BLAST of Cp4.1LG04g08820.1 vs. TrEMBL
Match: I1JGP9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_02G204100 PE=4 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 8.6e-131
Identity = 229/304 (75.33%), Postives = 248/304 (81.58%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLHDD+P +S NSWEKRRD+CISVITSYSP ILCTQQGVK+QLDFLQQGL
Sbjct: 1   MSVSLTVMTFNLHDDEPQDSPNSWEKRRDMCISVITSYSPIILCTQQGVKAQLDFLQQGL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKT----AHTFELK-----------GIEP---- 120
           PGY QFG+SRKG +DT+D+HCTIFYDKEK       TF L            G E     
Sbjct: 61  PGYDQFGISRKGPQDTTDQHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGSEVPCIA 120

Query: 121 ------------PGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
                       PGFSFQIVNT+MDE +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT
Sbjct: 121 TWATFQLKGVEPPGFSFQIVNTNMDEFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE G VGDMRD W  AR RKNVSLI TYHGFKGDKQG  EF KLI 
Sbjct: 181 QKESTTGRFLLGRSREHGVVGDMRDAWPSARVRKNVSLIHTYHGFKGDKQGTLEFLKLIF 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 274
           RALCLCWDRQTQDLH+DWILFRGRSLIPV CEVVNDNIDG+YPSSH+P+F+EFMLPRTVR
Sbjct: 241 RALCLCWDRQTQDLHIDWILFRGRSLIPVSCEVVNDNIDGYYPSSHFPIFAEFMLPRTVR 300

BLAST of Cp4.1LG04g08820.1 vs. TrEMBL
Match: B9IMD6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0018s11590g PE=4 SV=2)

HSP 1 Score: 473.4 bits (1217), Expect = 1.9e-130
Identity = 228/299 (76.25%), Postives = 249/299 (83.28%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLHDDQ  +S NSWEKR+DLCISV+TSYSP ILCTQQGVK+QLD+LQQ L
Sbjct: 1   MSVSLTVMTFNLHDDQAEDSPNSWEKRKDLCISVVTSYSPMILCTQQGVKTQLDYLQQCL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKTA----HTFELK----------------GIE 120
           PGYGQFG+SRKGS+D+ DEHCTIFYDKEK       TF L                  IE
Sbjct: 61  PGYGQFGISRKGSQDSLDEHCTIFYDKEKVELLEDGTFWLSESPSVPGSMSWGAAVPWIE 120

Query: 121 PPGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNTEKESTTGRFLL 180
           PPGFS QIVNT+MDE +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT KESTTGRFLL
Sbjct: 121 PPGFSLQIVNTNMDEFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNTHKESTTGRFLL 180

Query: 181 GRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLILRALCLCWDRQT 240
           GRS E G VGDMRDTW  A+ RKNVSL+ T+H FKGDKQGA EFFKLILRALCLCWDRQT
Sbjct: 181 GRSSEHGVVGDMRDTWPNAQVRKNVSLVHTFHDFKGDKQGALEFFKLILRALCLCWDRQT 240

Query: 241 QDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVRMLETTTTTQE 280
           QDLHVDWILFRGRSLIPV CEVVNDNI+G YPSSHYP+F+EFMLPR+VR+LE   T +E
Sbjct: 241 QDLHVDWILFRGRSLIPVQCEVVNDNINGRYPSSHYPIFAEFMLPRSVRLLEPPLTAEE 299

BLAST of Cp4.1LG04g08820.1 vs. TrEMBL
Match: A0A164ZN17_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_019393 PE=4 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 3.3e-130
Identity = 222/303 (73.27%), Postives = 250/303 (82.51%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           M+++LTVMTFNL +DQP +S NSW KRRDLCISVITSYSP I+CTQQGVKSQLD+LQQ L
Sbjct: 1   MNLSLTVMTFNLLEDQPEDSQNSWMKRRDLCISVITSYSPMIICTQQGVKSQLDYLQQCL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEK------------------------------- 120
           PGY QFG+SRKG+ DTSDEHCTIFYDKEK                               
Sbjct: 61  PGYDQFGISRKGTEDTSDEHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGCEVPCIA 120

Query: 121 TAHTFELKGIEPPGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
           T  TF+LKG+EPPGFSFQ+VNT+MDEL+PRARRRSALLTWQHIASLPPSLPV+YCGGFNT
Sbjct: 121 TWTTFQLKGVEPPGFSFQVVNTNMDELSPRARRRSALLTWQHIASLPPSLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE G VGDM+D W+ AR RKNVSLIRTYHGFKGDKQGA EF KL+ 
Sbjct: 181 QKESTTGRFLLGRSREHGVVGDMKDAWSNARVRKNVSLIRTYHGFKGDKQGALEFLKLVF 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 273
           RA CLCWDRQTQDLHVDWIL+RGR+L+PV  EVV+DNIDGFYPSSHYP+++EFMLPR+VR
Sbjct: 241 RAFCLCWDRQTQDLHVDWILYRGRALVPVSSEVVSDNIDGFYPSSHYPVYAEFMLPRSVR 300

BLAST of Cp4.1LG04g08820.1 vs. TrEMBL
Match: A0A0R0F973_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_17G073700 PE=4 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 3.3e-130
Identity = 230/304 (75.66%), Postives = 247/304 (81.25%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNL+DD+P +S NSWEKRRDLCISVITSYSP ILCTQQGVK+QLDFLQQGL
Sbjct: 1   MSVSLTVMTFNLYDDEPQDSPNSWEKRRDLCISVITSYSPIILCTQQGVKAQLDFLQQGL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKT----AHTFELK-----------GIEP---- 120
           PGY QFG+SRKG +DT+ EHCTIFYDKEK       TF L            G E     
Sbjct: 61  PGYDQFGISRKGPQDTTSEHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGSEVPCIA 120

Query: 121 ------------PGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
                       PGFSFQIVNT MDE +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT
Sbjct: 121 TWATFQLKGVEPPGFSFQIVNTIMDEFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE G VGDMRD W  AR RKNVSLIRTYHGFKGDKQG  EF KLI 
Sbjct: 181 QKESTTGRFLLGRSREHGVVGDMRDAWPSARVRKNVSLIRTYHGFKGDKQGTLEFLKLIF 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 274
           RALCLCWDRQTQDLH+DWILFRGRSLIPV CEVVNDNIDG+YPSSH+P+F+EFMLPRTVR
Sbjct: 241 RALCLCWDRQTQDLHIDWILFRGRSLIPVSCEVVNDNIDGYYPSSHFPIFAEFMLPRTVR 300

BLAST of Cp4.1LG04g08820.1 vs. TAIR10
Match: AT4G30900.1 (AT4G30900.1 DNAse I-like superfamily protein)

HSP 1 Score: 411.8 bits (1057), Expect = 3.4e-115
Identity = 201/305 (65.90%), Postives = 233/305 (76.39%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+L+VM+FNLHDD P ES NSW KR+DLC++VITSYSP +LCTQQGVKSQLD+LQQGL
Sbjct: 1   MSVSLSVMSFNLHDDLPEESPNSWLKRKDLCLTVITSYSPIVLCTQQGVKSQLDYLQQGL 60

Query: 61  P-----GYGQFGVSRKG--------SRDTSD--EHCTIFYDKEK---------------- 120
           P     G  + G             +++  +  E  T +  +                  
Sbjct: 61  PAYDQFGISRTGPGDANDEHCTIFFNKEKVELLEGGTFWLSESPSVPGSTAWGSAVPCIA 120

Query: 121 TAHTFELKGIEPPGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
           T  TF+LKG EPPGFSFQIVNT++DE++PRARRRSALLTWQHIASLPP+LPV+YCGGFNT
Sbjct: 121 TWATFQLKGAEPPGFSFQIVNTNLDEISPRARRRSALLTWQHIASLPPTLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE G VGDMRD W  AR RKNV+LIRTYH FKGDKQG  EF KLI 
Sbjct: 181 QKESTTGRFLLGRSREHGVVGDMRDAWPSARVRKNVALIRTYHDFKGDKQGTVEFLKLIF 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 275
           RALCLCWDRQTQDLH DWIL+RGRS++PV+CE+VND ID  YPSSHYP+F+EFMLPR+VR
Sbjct: 241 RALCLCWDRQTQDLHTDWILYRGRSIVPVMCEIVNDKIDDLYPSSHYPVFAEFMLPRSVR 300

BLAST of Cp4.1LG04g08820.1 vs. NCBI nr
Match: gi|728841835|gb|KHG21278.1| (Uncharacterized protein F383_02575 [Gossypium arboreum])

HSP 1 Score: 488.8 bits (1257), Expect = 6.3e-135
Identity = 230/297 (77.44%), Postives = 255/297 (85.86%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLH+D+  +S NSWEKRRDLCISVITSYSP ILCTQQGVKSQLD+LQQGL
Sbjct: 1   MSVSLTVMTFNLHEDESEDSPNSWEKRRDLCISVITSYSPIILCTQQGVKSQLDYLQQGL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKT----AHTFELK--------------GIEPP 120
           PGY QFG+SRKG +DTSDE CTIFYDKEK       TF L               G+EPP
Sbjct: 61  PGYDQFGISRKGPQDTSDECCTIFYDKEKVELIEGGTFWLSESPSVPGSTSWGSVGVEPP 120

Query: 121 GFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNTEKESTTGRFLLGR 180
           GFSFQ+VNTSMDE +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT+KESTTGRFLLGR
Sbjct: 121 GFSFQVVNTSMDEFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNTQKESTTGRFLLGR 180

Query: 181 SREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLILRALCLCWDRQTQD 240
           SRE G VGDMRD W  AR RKNVSLIRTYHGFKGDKQGA EF KL+ RALCLCWDRQTQD
Sbjct: 181 SREHGVVGDMRDVWPNARVRKNVSLIRTYHGFKGDKQGALEFLKLVFRALCLCWDRQTQD 240

Query: 241 LHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVRMLETTTTTQE 280
           LH+DWILFRGRSLIPV+C+VVNDN+DG+YPSSHYP+F+EF+LPRTVR++E  T+TQ+
Sbjct: 241 LHIDWILFRGRSLIPVLCQVVNDNMDGYYPSSHYPIFAEFLLPRTVRLMEPPTSTQD 297

BLAST of Cp4.1LG04g08820.1 vs. NCBI nr
Match: gi|694354423|ref|XP_009358437.1| (PREDICTED: uncharacterized protein LOC103949067 [Pyrus x bretschneideri])

HSP 1 Score: 483.0 bits (1242), Expect = 3.5e-133
Identity = 230/303 (75.91%), Postives = 250/303 (82.51%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLH+DQ  +S  SW+KRRDLCISVITSYSP ILCTQQGVKSQLD+LQQ L
Sbjct: 1   MSVSLTVMTFNLHEDQTEDSPYSWDKRRDLCISVITSYSPIILCTQQGVKSQLDYLQQCL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEK------------------------------- 120
           PGY QFG+SRKG  DTSDEHCTIFYDKEK                               
Sbjct: 61  PGYDQFGISRKGPEDTSDEHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGSEVPCIA 120

Query: 121 TAHTFELKGIEPPGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
           T  TF+LKG EPPGFSFQIVNT+MDE +PRARRRSALLTWQHIASLPP LPV+YCGGFNT
Sbjct: 121 TWVTFQLKGAEPPGFSFQIVNTNMDEFSPRARRRSALLTWQHIASLPPGLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE GAVGDMRD W  AR RKNVSLIRT+HGFKGDKQGA EF KL+ 
Sbjct: 181 QKESTTGRFLLGRSREHGAVGDMRDAWPNARVRKNVSLIRTFHGFKGDKQGALEFLKLVF 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 273
           RALCLCWDRQTQDLHVDWILFRGRSLIPV+CEVV+DNIDG+YPSSHYP+F+EFMLPRTVR
Sbjct: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVLCEVVSDNIDGYYPSSHYPIFAEFMLPRTVR 300

BLAST of Cp4.1LG04g08820.1 vs. NCBI nr
Match: gi|658063656|ref|XP_008367748.1| (PREDICTED: uncharacterized protein LOC103431379 [Malus domestica])

HSP 1 Score: 480.7 bits (1236), Expect = 1.7e-132
Identity = 229/303 (75.58%), Postives = 249/303 (82.18%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLH+DQ  +S  SW+KRRDLCISVITSYSP ILCTQQGVKSQLD+LQQ L
Sbjct: 1   MSVSLTVMTFNLHEDQTEDSPYSWDKRRDLCISVITSYSPIILCTQQGVKSQLDYLQQCL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEK------------------------------- 120
           PGY QFG+SRKG  DTSDEHCTIFYDKEK                               
Sbjct: 61  PGYDQFGISRKGPEDTSDEHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGSEVPCIA 120

Query: 121 TAHTFELKGIEPPGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
           T  TF+LKG EPPGFSFQIVNT+MDE +PRARRRSALLTWQHIASLPP LPV+YCGGFNT
Sbjct: 121 TWVTFQLKGAEPPGFSFQIVNTNMDEFSPRARRRSALLTWQHIASLPPGLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE GAVGDMRD W  AR RKNVSLIRT+HGFKGDKQGA EF KL+ 
Sbjct: 181 QKESTTGRFLLGRSREHGAVGDMRDAWPNARVRKNVSLIRTFHGFKGDKQGALEFLKLVF 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 273
           RALCLCWDRQTQDLHVDWILFRGRSL PV+CEVV+DNIDG+YPSSHYP+F+EFMLPRTVR
Sbjct: 241 RALCLCWDRQTQDLHVDWILFRGRSLSPVLCEVVSDNIDGYYPSSHYPIFAEFMLPRTVR 300

BLAST of Cp4.1LG04g08820.1 vs. NCBI nr
Match: gi|1021473638|ref|XP_016200253.1| (PREDICTED: uncharacterized protein LOC107641270 [Arachis ipaensis])

HSP 1 Score: 478.4 bits (1230), Expect = 8.5e-132
Identity = 233/304 (76.64%), Postives = 250/304 (82.24%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLHDDQ  +S NSWEKRRDLCISVITSYSP ILCTQQGVK+QLDFLQQGL
Sbjct: 1   MSVSLTVMTFNLHDDQAEDSPNSWEKRRDLCISVITSYSPIILCTQQGVKTQLDFLQQGL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKT----AHTFELK-----------GIEP---- 120
           PGY QFG+SRKG +DT+DEHCTIFYDKEK       TF L            G E     
Sbjct: 61  PGYDQFGISRKGPQDTTDEHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGSEVPCIA 120

Query: 121 ------------PGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
                       PGFSFQIVNT+MD+ +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT
Sbjct: 121 TWATFQLKGVEPPGFSFQIVNTNMDQFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE G VGDMRD W  AR RKNVSLIRTYHGFKGDKQGA EF KLIL
Sbjct: 181 QKESTTGRFLLGRSREHGVVGDMRDAWPSARVRKNVSLIRTYHGFKGDKQGALEFLKLIL 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 274
           RALCLCWDRQTQDLH+DWILFRGRSLIPV CEVVNDNIDG+YPSSH+P+F+EFMLPRTVR
Sbjct: 241 RALCLCWDRQTQDLHIDWILFRGRSLIPVSCEVVNDNIDGYYPSSHFPIFAEFMLPRTVR 300

BLAST of Cp4.1LG04g08820.1 vs. NCBI nr
Match: gi|1012001775|ref|XP_015932931.1| (PREDICTED: uncharacterized protein LOC107459225 [Arachis duranensis])

HSP 1 Score: 475.3 bits (1222), Expect = 7.2e-131
Identity = 232/304 (76.32%), Postives = 249/304 (81.91%), Query Frame = 1

Query: 1   MSVTLTVMTFNLHDDQPPESSNSWEKRRDLCISVITSYSPAILCTQQGVKSQLDFLQQGL 60
           MSV+LTVMTFNLHDDQ  +S NSWEKRRDLCISVITSYSP ILCTQQGVK+QLDFLQQGL
Sbjct: 1   MSVSLTVMTFNLHDDQAEDSPNSWEKRRDLCISVITSYSPIILCTQQGVKTQLDFLQQGL 60

Query: 61  PGYGQFGVSRKGSRDTSDEHCTIFYDKEKT----AHTFELK-----------GIEP---- 120
           PGY QFG+SRKG +DT+DEHCTIFYDKEK       TF L            G E     
Sbjct: 61  PGYDQFGISRKGPQDTTDEHCTIFYDKEKVELLEGGTFWLSESPSVPGSMSWGSEVPCIA 120

Query: 121 ------------PGFSFQIVNTSMDELNPRARRRSALLTWQHIASLPPSLPVIYCGGFNT 180
                       PGFSFQIVNT+MD+ +PRARRRSALLTWQHIASLPPSLPV+YCGGFNT
Sbjct: 121 TWATFQLKGVEPPGFSFQIVNTNMDQFSPRARRRSALLTWQHIASLPPSLPVVYCGGFNT 180

Query: 181 EKESTTGRFLLGRSREKGAVGDMRDTWAIARARKNVSLIRTYHGFKGDKQGAFEFFKLIL 240
           +KESTTGRFLLGRSRE G VGDMRD W  A  RKNVSLIRTYHGFKGDKQGA EF KLIL
Sbjct: 181 QKESTTGRFLLGRSREHGVVGDMRDAWPSACVRKNVSLIRTYHGFKGDKQGALEFLKLIL 240

Query: 241 RALCLCWDRQTQDLHVDWILFRGRSLIPVVCEVVNDNIDGFYPSSHYPLFSEFMLPRTVR 274
           RALCLCWDRQTQDLH+DWILFRGRSLIPV CEVVNDNIDG+YPSSH+P+F+EFMLPRTVR
Sbjct: 241 RALCLCWDRQTQDLHIDWILFRGRSLIPVSCEVVNDNIDGYYPSSHFPIFAEFMLPRTVR 300

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0B0P3J4_GOSAR4.4e-13577.44Uncharacterized protein OS=Gossypium arboreum GN=F383_02575 PE=4 SV=1[more]
I1JGP9_SOYBN8.6e-13175.33Uncharacterized protein OS=Glycine max GN=GLYMA_02G204100 PE=4 SV=1[more]
B9IMD6_POPTR1.9e-13076.25Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0018s11590g PE=4 SV=2[more]
A0A164ZN17_DAUCA3.3e-13073.27Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_019393 PE=4 SV=1[more]
A0A0R0F973_SOYBN3.3e-13075.66Uncharacterized protein OS=Glycine max GN=GLYMA_17G073700 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G30900.13.4e-11565.90 DNAse I-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|728841835|gb|KHG21278.1|6.3e-13577.44Uncharacterized protein F383_02575 [Gossypium arboreum][more]
gi|694354423|ref|XP_009358437.1|3.5e-13375.91PREDICTED: uncharacterized protein LOC103949067 [Pyrus x bretschneideri][more]
gi|658063656|ref|XP_008367748.1|1.7e-13275.58PREDICTED: uncharacterized protein LOC103431379 [Malus domestica][more]
gi|1021473638|ref|XP_016200253.1|8.5e-13276.64PREDICTED: uncharacterized protein LOC107641270 [Arachis ipaensis][more]
gi|1012001775|ref|XP_015932931.1|7.2e-13176.32PREDICTED: uncharacterized protein LOC107459225 [Arachis duranensis][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR005135Endo/exonuclease/phosphatase
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG04g08820Cp4.1LG04g08820gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG04g08820.1Cp4.1LG04g08820.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG04g08820.1:five_prime_utr:001Cp4.1LG04g08820.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG04g08820.1:cds:006Cp4.1LG04g08820.1:cds:006CDS
Cp4.1LG04g08820.1:cds:005Cp4.1LG04g08820.1:cds:005CDS
Cp4.1LG04g08820.1:cds:004Cp4.1LG04g08820.1:cds:004CDS
Cp4.1LG04g08820.1:cds:003Cp4.1LG04g08820.1:cds:003CDS
Cp4.1LG04g08820.1:cds:002Cp4.1LG04g08820.1:cds:002CDS
Cp4.1LG04g08820.1:cds:001Cp4.1LG04g08820.1:cds:001CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG04g08820.1:three_prime_utr:001Cp4.1LG04g08820.1:three_prime_utr:001three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005135Endonuclease/exonuclease/phosphatasePFAMPF03372Exo_endo_phoscoord: 8..253
score: 2.
IPR005135Endonuclease/exonuclease/phosphataseunknownSSF56219DNase I-likecoord: 3..267
score: 6.91
NoneNo IPR availablePANTHERPTHR12121CARBON CATABOLITE REPRESSOR PROTEIN 4coord: 2..273
score: 1.1
NoneNo IPR availablePANTHERPTHR12121:SF36ENDONUCLEASE/EXONUCLEASE/PHOSPHATASE DOMAIN-CONTAINING PROTEINcoord: 2..273
score: 1.1