CmoCh17G004300 (gene) Cucurbita moschata (Rifu)

Name: CmoCh17G004300
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Nuclear transport factor 2 (NTF2) family protein
Location: Cmo_Chr17 : 2861895 .. 2868031 (-)
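
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and using a hypothetical helper name, parse_location, that is not part of any database API:

```python
import re

def parse_location(text):
    """Parse a location such as 'Cmo_Chr17 : 2861895 .. 2868031 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr17 : 2861895 .. 2868031 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cmo_Chr17 - 6137
```

Under that assumption the gene spans 6137 bases on the minus strand of Cmo_Chr17.
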
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATTTTCCCGCAAGAAACTTGGGAAGAAGATATTTTCACCTCTCATGGCTGCCATTCGTTTTCGCGCGGCAAAGCTCCAAAGCATTCTGATTCAATCAACTACCAAACCATCCATGTTTGTAAATTGCAATGGCGACGATCTTCCCTTTTCAATCACGTACGTCTCTTCAAACATCTCTCAACGCCATTCGACCTAATCGCCCTCTCAAAATCTGCAAAATCCGATGTCAAGGGGAAAATCCCGCTACCGGTTCGCCGAAGAATCGAGAATCTAAGCCCGAGAATGCGGTGTTGAAGGTCGCTTGGTATGGCTCCGAGCTCTTGGGGATCGCCGCTTCATTTCTCCGTCCGCCGACGGATGTTGAAACGCCTGTTAGGGCGCAGGAGCTTGCGAGAGATGTGTCCGGTGCAATTCGTCGCCCTGTGATTGTGGAAACGATTAAGAAGGATTTTGAGCGGTCGTACTTCGTCACAGGTTTGGGAAATTTTTGTTTATGCGAGAGATTACTATTGTGCGTTGGTTTTGTCTCTGGATGTTTGGAAGTTCTAGAGTTCTAGGGTTCTTTTGAAATCAAGATAGTTTGTTTTGTTCAATTCAAGTTGTTCTTGAAACCATAAGCAGTAAGTTGAAACTAAATCTAAACCATAAATACTCAAATTATCGGAATTATATTGAAACTTTTTAAGATCATAGAGATTAAATTCAAATTTTATCCAAACCATGGAAGCTAAATTTGTAATTTAGCATAAGTAATGATTTTCATCCTCAATATTGTTGTCCTTAACTGTGTAATAGATGAACATATGTATATTACACTGCTAACGTGGGTATATAAGGAACAATGAAGTAGGTGAATTATAGAGTAGGTTGATATTAGATGACATTATAAAAGTTTTGAATACGACAAGAAACTACTAGGCACAAAATCGAAAGTTTAGGGATTGAATAAACATTTTTTTTAATTTAGGAACTTGTTAGACACAATATTAAGAGTTCGGGGTTTGTTAACATTTTTGTAAATTTAAAGATTTTTAGACACAAGCGACTAAATTTGTAATTCAACTTCAAAAGTTGATCGACTAAAAGTGACCTCTTCAAAGTAAAAGAATGAGTATCCCGTGAGGGGCAGACTCGTTGGCAAGGGCTTAGTTTCTCTTGGATATACTAGTTTAGAGGTCCCAAGTCCGATCCTTCGGGTGAGCTTAATACCAAAAACCCTCAATGTCTCACGGGTTCGAGCCTTGGGGAGGGTGCAGGTGCCCCCAAGTATAGGGATAAAAGCTCCGACTCCCAATCAAATAAAAGTAAGTTGAAGAACATTACATTAAAAAATACTTATTCAATGAATAAAAAAGTTAGAAAAGATTGCATAAAGAATCTAATTAGCAAAACAATGAAGTGAATGTGTTCATTTATATTAACTTTGGCTGACTCATTGTAGAACTTGTGAGCAGTTTTTCCAAGTTTTGTTCATTTGTATTTGGTGATGATCAGGGAACCTTACTGTTGAAGCTTATGAAGAGCAGTGCGAATTTGCTGATCCGGCTGGTTCTTTCAAAGGTCTTAGCCGATTTAAAAGAAACTGTACAAATTTTGGATCCCTTGTTGATAAGTCAAACATGAAGCTTACCAAATGGGAGGATTTTGAGGTGAGCTCAGTCTATAGAGATTACTGTTTTTTGTTTACTAAGCTTAGTGTTGAATTGTTGGTTGATTGGATTGAGTACAGGACAAGGGCATTGGACATTGGAAGTTCAGTTGTATCTTGTCATTTCCTTGGAGACCAATTCTATCTGGTATGTTTAATTTATGATGCTTCAGAGCTAGTTTGAGATAATTTCTTGGAATGGTGTTATTTCAGTGAAGTACTTCTGGAATGAGCACTAATTAAGTTTTTTCTCTGAGAACATTCTCTGCTTTGAAATGGTAATAAAAAGTAATTTTAACCGAAATCCCACGTCGGTTAGAGAGGAGAACAAAACATTTTTTATAAGAGT
GTGGAAACCTCTCTTTAGCATATGCGTTTTAAAAACCTTGAGGAGAATACCGGCCGGTGTGCCAACAAGGACGTTGGGTCCAGAAGGGGGTGGATTGTGAGATCCCACATCGGTTGGGGAGGAGAACGAAACATTTTTTATAAGGGTGTGGAAATCTTTCTCTAGCAGACGCGTTTTAAAAACCTTGAGTGAAAGCCCAAAAGGGAAAGCACAAAAAGGACAATATCTGCTAGCGGTGGGTTTGGGCCATTACAAATATTGGGGGTCCCACGTCGATTTGAGAAGGGAATGAGTGCCACCGAGGATGCTGGGCCCTGAAGGGGGGTGGATAGTGAGATCTCACATCGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACGCATTTTAAAAGCCTTGAGGGGAAAGCCCGAAAGGGAAACCCAAAGAGGACAATATCTGCTAGCAATGGGTTTTGATCGTTACAAATATTGGAGGGTCCCACATCAATTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACTTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACATCGGTTGGAGTGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGATGCGTTTTATAAACCTTGAGGGGAAGTCCAAACGGGAAATCTTAGAGATGACAATATCTGCTAGCGGTGGGTTTGGGTCATTACAAATATTGGGGGGTTCCACATCGATTAAAGAAGGGAACAAGTGCCAGTGGTGGGTTTGGGTCATTACAAATATTGGGGGGTTCCACATCGATTAAAGAAGGGAACAAGTGCCAGTGGACAATATCTGCTAGCAATGGGTTTTGATCGTTACAAATATTGGAGGGTCCCACATCAATTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACTTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACATCGGTTGGAGTGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGATGCGTTTTATAAACCTTGAGGGGAAGTCCAAACGGGAAATCTTAGAGATGACAATATCTGCTAGCGGTGGGTTTGGGTCATTACAAATATTGGGGGGTTCCACATCGATTAAAGAAGGGAACAAGTGCCAGTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGACAATATCTGCTAGCAATGGGTTTTGATCGTTACAAATATTGGAGGGTCCCACATCAATTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACATCGGTTGGAGTGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGATGCGTTTTATAAACCTTGAGGGGAAGTCCAAACGGGAAATCTTAGAGATGACAATATCTGCTAGCGGTGGGTTTGGGTCATTACAAATATTGGGGGGTTCCACATCGATTAAAGAAGGGAACAAGTGCCAGTGGACAATATCTGCTAGCAATGGGTTTTGATCGTTACAAATATTGGAGGGTCCCACATCAATTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACTTAGAGAAAGGAACAAGTGCCAGCGAGGACGCTGGGTCCCGAAGGGAGGTGGATCGTGAGATCCCACATCGGTTGGAGTGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGATGCGTTTTATAAACCTTGAGGGGAAGTCCAAACGGGAAATCTTAGAGATGACAATATCTGCTAGCGGTGGGTTTGGGTCATTACAAATATTGGGGGGTTCCACATCGATTAAAGAAGGGAACAAGTGCCAGTGGTGGGTTTGGGTCATTACAAATATTGGGGGGTTCCACATCGATTAAAGAAGGGAACAAGTGCCAGTGAGGATGCTGGGCCCAAAAAGGGGTGGATTGTGAGATCCGAAGCATTATTTATAAGGGTATAGAAACCTCTCATTGGCAGACAAAATTTAAAACCTTGAGGGGAAGCTTGAAAGTGGAAACCCAAATAGGACAATATCTGCTAGTGGTGGGCTTGGGCCGTTACCGTTCGAAATCATTTTGGACTATACAACCCCTAGTAAGATGAAAATTTAAGACTACCTTCAATTGATTAATGCATTTTCTCTTCAAAAAAGATTCATAGAGTTTAAGATAAGTTTTGTGTTAAATTTGTATAGAGGTATTGTGAAGTTATTGTGAATACTCTGAGAAAAATATTGAATATTGCAGCAACTGGATATACAGAGTATTATTTTGATGCTGGATCTGGAAAAGTAAGCAGGTAAGAATGAATACCCACTTTATTATAGGCAGACCTCTAAATCCATTGGTGTTTCTGAATCTTTGTTGCTAAAGAATCACCATTTCTCAGGCATGTAGAACACTGGAATGTTCCTAAAATGGCTTTACTGAACCAAATTTTGAGACCCACTCGAGCTTGGTTGTGGTTTAAGAAACCAGGTGCTGCCTGATTCTTCTTTCTCATTAAGAACACAGCTTAGATCATAAACCTTTTCTTGTCAATGTTTTAATTTGATCTAGAAATCATATTTTGTGGGGTAATGTAAGGAAATTATATGTAATAGAGAGTTTTT
TTTTTTTTTTTTCTTTTTGTTTTTGTTTCATAAAGTATTTGTAATAGATAGAGATTTATACAGATGGAACATGGAAGAATGAGATAGTATACAGCAGGTTGATTGGTGCCTACCTTTTTCTCTGAGTGGAATCTGCC

mRNA sequence

AAAATTTTCCCGCAAGAAACTTGGGAAGAAGATATTTTCACCTCTCATGGCTGCCATTCGTTTTCGCGCGGCAAAGCTCCAAAGCATTCTGATTCAATCAACTACCAAACCATCCATGTTTGTAAATTGCAATGGCGACGATCTTCCCTTTTCAATCACGTACGTCTCTTCAAACATCTCTCAACGCCATTCGACCTAATCGCCCTCTCAAAATCTGCAAAATCCGATGTCAAGGGGAAAATCCCGCTACCGGTTCGCCGAAGAATCGAGAATCTAAGCCCGAGAATGCGGTGTTGAAGGTCGCTTGGTATGGCTCCGAGCTCTTGGGGATCGCCGCTTCATTTCTCCGTCCGCCGACGGATGTTGAAACGCCTGTTAGGGCGCAGGAGCTTGCGAGAGATGTGTCCGGTGCAATTCGTCGCCCTGTGATTGTGGAAACGATTAAGAAGGATTTTGAGCGGTCGTACTTCGTCACAGGGAACCTTACTGTTGAAGCTTATGAAGAGCAGTGCGAATTTGCTGATCCGGCTGGTTCTTTCAAAGGTCTTAGCCGATTTAAAAGAAACTGTACAAATTTTGGATCCCTTGTTGATAAGTCAAACATGAAGCTTACCAAATGGGAGGATTTTGAGGACAAGGGCATTGGACATTGGAAGTTCAGTTGTATCTTGTCATTTCCTTGGAGACCAATTCTATCTGCAACTGGATATACAGAGTATTATTTTGATGCTGGATCTGGAAAAGTAAGCAGGCATGTAGAACACTGGAATGTTCCTAAAATGGCTTTACTGAACCAAATTTTGAGACCCACTCGAGCTTGGTTGTGGTTTAAGAAACCAGGTGCTGCCTGATTCTTCTTTCTCATTAAGAACACAGCTTAGATCATAAACCTTTTCTTGTCAATGTTTTAATTTGATCTAGAAATCATATTTTGTGGGGTAATGTAAGGAAATTATATGTAATAGAGAGTTTTTTTTTTTTTTTTTCTTTTTGTTTTTGTTTCATAAAGTATTTGTAATAGATAGAGATTTATACAGATGGAACATGGAAGAATGAGATAGTATACAGCAGGTTGATTGGTGCCTACCTTTTTCTCTGAGTGGAATCTGCC

Coding sequence (CDS)

ATGGCGACGATCTTCCCTTTTCAATCACGTACGTCTCTTCAAACATCTCTCAACGCCATTCGACCTAATCGCCCTCTCAAAATCTGCAAAATCCGATGTCAAGGGGAAAATCCCGCTACCGGTTCGCCGAAGAATCGAGAATCTAAGCCCGAGAATGCGGTGTTGAAGGTCGCTTGGTATGGCTCCGAGCTCTTGGGGATCGCCGCTTCATTTCTCCGTCCGCCGACGGATGTTGAAACGCCTGTTAGGGCGCAGGAGCTTGCGAGAGATGTGTCCGGTGCAATTCGTCGCCCTGTGATTGTGGAAACGATTAAGAAGGATTTTGAGCGGTCGTACTTCGTCACAGGGAACCTTACTGTTGAAGCTTATGAAGAGCAGTGCGAATTTGCTGATCCGGCTGGTTCTTTCAAAGGTCTTAGCCGATTTAAAAGAAACTGTACAAATTTTGGATCCCTTGTTGATAAGTCAAACATGAAGCTTACCAAATGGGAGGATTTTGAGGACAAGGGCATTGGACATTGGAAGTTCAGTTGTATCTTGTCATTTCCTTGGAGACCAATTCTATCTGCAACTGGATATACAGAGTATTATTTTGATGCTGGATCTGGAAAAGTAAGCAGGCATGTAGAACACTGGAATGTTCCTAAAATGGCTTTACTGAACCAAATTTTGAGACCCACTCGAGCTTGGTTGTGGTTTAAGAAACCAGGTGCTGCCTGA
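
The CDS above should translate to the protein used as the query in the BLAST alignments below. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first eight codons can be translated and compared with the start of the query sequence. The function name translate is illustrative only:

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 24 bases of the CDS listed above.
cds_start = "ATGGCGACGATCTTCCCTTTTCAA"
print(translate(cds_start))  # MATIFPFQ
```

The result, MATIFPFQ, agrees with the first eight residues of the query in the alignments.
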
BLAST of CmoCh17G004300 vs. TrEMBL
Match: A0A0A0K7I1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006760 PE=4 SV=1)

HSP 1 Score: 365.9 bits (938), Expect = 3.7e-98
Identity = 173/207 (83.57%), Positives = 186/207 (89.86%), Query Frame = 1

Query: 1   MATIFPFQSRTSLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKVAWY 60
           MATI  FQ  +SL TSLN+IRPN  L+IC+I CQG NP T SP N+ESKPENAVLKVAWY
Sbjct: 1   MATIVSFQPHSSLHTSLNSIRPNPSLRICRIHCQGNNPTTDSPNNQESKPENAVLKVAWY 60

Query: 61  GSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGNLTV 120
           GSELLGIAAS+LRPP DV+TP+RAQEL  DVSG+I RP+IVETIK+DF RSYFVTGNLT+
Sbjct: 61  GSELLGIAASYLRPPLDVQTPLRAQELTTDVSGSIPRPLIVETIKEDFRRSYFVTGNLTL 120

Query: 121 EAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCIL 180
           +AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWE FEDKGIGHWKFSCIL
Sbjct: 121 QAYEEQCEFADPAGSFKGLRRFKRNCTNFGSLVDKSNMKLTKWEGFEDKGIGHWKFSCIL 180

Query: 181 SFPWRPILSATGYTEYYFDAGSGKVSR 208
           SFPWRPILSATGYTEYYFDA SGKV R
Sbjct: 181 SFPWRPILSATGYTEYYFDARSGKVCR 207
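
The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal sketch, assuming the summary-line layout shown above and using a hypothetical helper name, hsp_percentages, recomputes them from the raw counts:

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from 'Name = matches/length' pairs in an HSP line.

    Fields without a 'matches/length' fraction (such as 'Query Frame = 1')
    are ignored.
    """
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", summary_line)
    return {
        name: round(100 * int(matches) / int(length), 2)
        for name, matches, length in pairs
    }

line = "Identity = 173/207 (83.57%), Positives = 186/207 (89.86%), Query Frame = 1"
print(hsp_percentages(line))  # {'Identity': 83.57, 'Positives': 89.86}
```

Here 173 identical positions over an alignment length of 207 gives 83.57%, and 186 positive-scoring positions gives 89.86%, matching the reported values.
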

BLAST of CmoCh17G004300 vs. TrEMBL
Match: F6I656_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02050 PE=4 SV=1)

HSP 1 Score: 345.1 bits (884), Expect = 6.7e-92
Identity = 166/238 (69.75%), Positives = 191/238 (80.25%), Query Frame = 1

Query: 1   MATIFPFQSRT---SLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKV 60
           M ++  F S T   S +   +  RPN   KI + RC GENP + S   +ES+PENA+LKV
Sbjct: 17  MTSLLSFTSVTPHISFRHRNDFFRPNYHHKIHRFRCDGENPRSNSSTAQESEPENALLKV 76

Query: 61  AWYGSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGN 120
           AWYGSELLGIAASF R P+ VE P RA +LA D SGA+ R  +VETIK+DF+RSYFVTGN
Sbjct: 77  AWYGSELLGIAASFFRSPSSVEAPERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGN 136

Query: 121 LTVEAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFS 180
           LT+ AYE+ CEFADPAGSF+GL RFKRNCTNFGSL+ KSNMKL KWEDFEDKGIGHW+FS
Sbjct: 137 LTLSAYEDDCEFADPAGSFRGLRRFKRNCTNFGSLIQKSNMKLMKWEDFEDKGIGHWRFS 196

Query: 181 CILSFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWFKK 236
           C+LSFPW+PILSATGYTEYYFD+ SGKV RHVEHWNVPKMALL QILRP+R + W KK
Sbjct: 197 CVLSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRPSRGF-WGKK 253

BLAST of CmoCh17G004300 vs. TrEMBL
Match: A0A061DSV4_THECC (Nuclear transport factor 2 family protein OS=Theobroma cacao GN=TCM_005275 PE=4 SV=1)

HSP 1 Score: 335.5 bits (859), Expect = 5.3e-89
Identity = 155/207 (74.88%), Positives = 177/207 (85.51%), Query Frame = 1

Query: 30  KIRCQGENPATGSPKNRESKPENAVLKVAWYGSELLGIAASFLRPPTDVETPVRAQ-ELA 89
           +IRC GENP +  P  +ES PENA+LKVAWYGSELLGIAASFLRPP++VE   +   +L 
Sbjct: 33  RIRCNGENPRSDLPTRQESAPENALLKVAWYGSELLGIAASFLRPPSNVEAAAKNDLKLG 92

Query: 90  RDVSGAIRRPVIVETIKKDFERSYFVTGNLTVEAYEEQCEFADPAGSFKGLSRFKRNCTN 149
            D SGAI R  +VETIK D+ERSYFVTG LT++AYEE CEFADPAGSFKGL RFKRNCTN
Sbjct: 93  LDGSGAIDRTAVVETIKNDYERSYFVTGQLTLDAYEENCEFADPAGSFKGLRRFKRNCTN 152

Query: 150 FGSLVDKSNMKLTKWEDFEDKGIGHWKFSCILSFPWRPILSATGYTEYYFDAGSGKVSRH 209
           FGSL++KSNMKL KWEDFE+KG+GHW+FSC++SFPWRPILSATGYTEYYFDA SGKV RH
Sbjct: 153 FGSLIEKSNMKLMKWEDFENKGVGHWRFSCVMSFPWRPILSATGYTEYYFDAQSGKVCRH 212

Query: 210 VEHWNVPKMALLNQILRPTRAWLWFKK 236
           VEHWNVPKMALL Q+LRPTR + W K+
Sbjct: 213 VEHWNVPKMALLKQLLRPTRGF-WLKR 238

BLAST of CmoCh17G004300 vs. TrEMBL
Match: A0A059C0X3_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00102 PE=4 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 2.9e-87
Identity = 159/244 (65.16%), Positives = 183/244 (75.00%), Query Frame = 1

Query: 1   MATIFPFQSRTSLQTSLNAIRPNRPL---KICKIRCQGENPATGSPKNR----ESKPENA 60
           MA I PF   T  ++S +      P    +I ++RC GENP  GS        E +PENA
Sbjct: 36  MAAILPFAPTTPRRSSGHKFASPFPASHPRIERLRCFGENPTRGSSAEAKPEPEPEPENA 95

Query: 61  VLKVAWYGSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYF 120
           +LK AWYGSELLGIAAS  RPP   E P R  ELA D +GA  R  +VETIK+D++RSYF
Sbjct: 96  LLKAAWYGSELLGIAASLFRPPASAEAPAREFELAGDGAGAFDRSAVVETIKEDYQRSYF 155

Query: 121 VTGNLTVEAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGH 180
           VTGNLT+ AYEE CEFADPAGSF+GL RFKRNCTNFGSL++KSNMKL KWEDFEDKG+GH
Sbjct: 156 VTGNLTLHAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLIEKSNMKLMKWEDFEDKGVGH 215

Query: 181 WKFSCILSFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWF 238
           W+FSCILSFPWRPILSATGYTEY+++A SGKV RHVEHW VPKM LL QI +P+R W W 
Sbjct: 216 WRFSCILSFPWRPILSATGYTEYFYNAQSGKVCRHVEHWKVPKMVLLKQIFKPSR-WAWE 275

BLAST of CmoCh17G004300 vs. TrEMBL
Match: A0A059BZM1_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00102 PE=4 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 1.4e-86
Identity = 158/242 (65.29%), Positives = 182/242 (75.21%), Query Frame = 1

Query: 1   MATIFPFQSRTSLQTSLNAIRPNRPL---KICKIRCQGENPATGSPKNR----ESKPENA 60
           MA I PF   T  ++S +      P    +I ++RC GENP  GS        E +PENA
Sbjct: 36  MAAILPFAPTTPRRSSGHKFASPFPASHPRIERLRCFGENPTRGSSAEAKPEPEPEPENA 95

Query: 61  VLKVAWYGSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYF 120
           +LK AWYGSELLGIAAS  RPP   E P R  ELA D +GA  R  +VETIK+D++RSYF
Sbjct: 96  LLKAAWYGSELLGIAASLFRPPASAEAPAREFELAGDGAGAFDRSAVVETIKEDYQRSYF 155

Query: 121 VTGNLTVEAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGH 180
           VTGNLT+ AYEE CEFADPAGSF+GL RFKRNCTNFGSL++KSNMKL KWEDFEDKG+GH
Sbjct: 156 VTGNLTLHAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLIEKSNMKLMKWEDFEDKGVGH 215

Query: 181 WKFSCILSFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWF 236
           W+FSCILSFPWRPILSATGYTEY+++A SGKV RHVEHW VPKM LL QI +P+R W W 
Sbjct: 216 WRFSCILSFPWRPILSATGYTEYFYNAQSGKVCRHVEHWKVPKMVLLKQIFKPSR-WAWE 275

BLAST of CmoCh17G004300 vs. TAIR10
Match: AT2G46100.1 (AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein)

HSP 1 Score: 299.7 bits (766), Expect = 1.6e-81
Identity = 136/208 (65.38%), Positives = 165/208 (79.33%), Query Frame = 1

Query: 21  RPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKVAWYGSELLGIAASFLRPPTDVET 80
           R NR  +   + C+G+NP      ++  +P+N +LK+AWYGSELLGIAAS  R P +   
Sbjct: 27  RTNRRFEATGVSCRGQNPTDEPQTSKGPEPDNVLLKIAWYGSELLGIAASVFRSP-ETSP 86

Query: 81  PVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGNLTVEAYEEQCEFADPAGSFKGLS 140
            V   E+  D SG   R  +V++IK+DF+RSYFVTGNLT E YEE+CEFADPAGSFKGL+
Sbjct: 87  IVTGFEVPVDCSGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLA 146

Query: 141 RFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCILSFPWRPILSATGYTEYYFDA 200
           RFKRNCTNFGSL++KSNMKL KWE+FEDKGIGHWKFSC++SFPW+PILSATGYTEYYFD 
Sbjct: 147 RFKRNCTNFGSLIEKSNMKLMKWENFEDKGIGHWKFSCVMSFPWKPILSATGYTEYYFDT 206

Query: 201 GSGKVSRHVEHWNVPKMALLNQILRPTR 229
            SGK+ RHVEHWNVPK+AL  Q+LRP+R
Sbjct: 207 ESGKICRHVEHWNVPKIALFKQLLRPSR 233

BLAST of CmoCh17G004300 vs. TAIR10
Match: AT3G04890.2 (AT3G04890.2 Uncharacterized conserved protein (DUF2358))

HSP 1 Score: 68.6 bits (166), Expect = 6.1e-12
Identity = 33/104 (31.73%), Positives = 51/104 (49.04%), Query Frame = 1

Query: 100 IVETIKKDFERSYFVTGNLTVEAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMK 159
           ++  ++ D+   YFVTG LT   Y + C F DP  SF+G   ++RN       ++ ++++
Sbjct: 80  VMGILRSDYRNFYFVTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIE 139

Query: 160 LTKWEDFEDKG----IGHWKFSCILSFPWRPILSATGYTEYYFD 200
           L   E  E       +  WK    L  PWRP++S  G T Y  D
Sbjct: 140 LQNMEKSESSQRNYILATWKLRTYLKLPWRPLISINGNTVYDLD 183

BLAST of CmoCh17G004300 vs. NCBI nr
Match: gi|659116628|ref|XP_008458172.1| (PREDICTED: uncharacterized protein LOC103497690 [Cucumis melo])

HSP 1 Score: 430.6 bits (1106), Expect = 1.7e-117
Identity = 203/237 (85.65%), Positives = 215/237 (90.72%), Query Frame = 1

Query: 1   MATIFPFQSRTSLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKVAWY 60
           MATI  FQ  +SLQTSL +IRPN  L+IC+I C+G NP T SP N+ESKPENAVLKVAWY
Sbjct: 1   MATIVSFQLHSSLQTSLYSIRPNPSLRICRIHCRGNNPTTDSPNNQESKPENAVLKVAWY 60

Query: 61  GSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGNLTV 120
           GSELLGIAASFLRPP+DV+TPVRAQEL  DVSGAI RP+IVETIK+DF RSYFVTGNLT+
Sbjct: 61  GSELLGIAASFLRPPSDVQTPVRAQELTTDVSGAIPRPLIVETIKEDFRRSYFVTGNLTL 120

Query: 121 EAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCIL 180
           EAYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCIL
Sbjct: 121 EAYEEQCEFADPAGSFKGLRRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCIL 180

Query: 181 SFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWFKKPG 238
           SFPWRPILSATGYT+YYFDA SGKV RHVEHWNVPKMALL QILRPTR WLWFKK G
Sbjct: 181 SFPWRPILSATGYTDYYFDARSGKVCRHVEHWNVPKMALLKQILRPTRDWLWFKKAG 237

BLAST of CmoCh17G004300 vs. NCBI nr
Match: gi|449441428|ref|XP_004138484.1| (PREDICTED: uncharacterized protein LOC101218208 [Cucumis sativus])

HSP 1 Score: 427.2 bits (1097), Expect = 1.9e-116
Identity = 200/237 (84.39%), Positives = 213/237 (89.87%), Query Frame = 1

Query: 1   MATIFPFQSRTSLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKVAWY 60
           MATI  FQ  +SL TSLN+IRPN  L+IC+I CQG NP T SP N+ESKPENAVLKVAWY
Sbjct: 1   MATIVSFQPHSSLHTSLNSIRPNPSLRICRIHCQGNNPTTDSPNNQESKPENAVLKVAWY 60

Query: 61  GSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGNLTV 120
           GSELLGIAAS+LRPP DV+TP+RAQEL  DVSG+I RP+IVETIK+DF RSYFVTGNLT+
Sbjct: 61  GSELLGIAASYLRPPLDVQTPLRAQELTTDVSGSIPRPLIVETIKEDFRRSYFVTGNLTL 120

Query: 121 EAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCIL 180
           +AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWE FEDKGIGHWKFSCIL
Sbjct: 121 QAYEEQCEFADPAGSFKGLRRFKRNCTNFGSLVDKSNMKLTKWEGFEDKGIGHWKFSCIL 180

Query: 181 SFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWFKKPG 238
           SFPWRPILSATGYTEYYFDA SGKV RHVEHWNVPKMALL QILRPTR WLWFKK G
Sbjct: 181 SFPWRPILSATGYTEYYFDARSGKVCRHVEHWNVPKMALLKQILRPTREWLWFKKAG 237

BLAST of CmoCh17G004300 vs. NCBI nr
Match: gi|700190486|gb|KGN45690.1| (hypothetical protein Csa_6G006760 [Cucumis sativus])

HSP 1 Score: 365.9 bits (938), Expect = 5.3e-98
Identity = 173/207 (83.57%), Positives = 186/207 (89.86%), Query Frame = 1

Query: 1   MATIFPFQSRTSLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKVAWY 60
           MATI  FQ  +SL TSLN+IRPN  L+IC+I CQG NP T SP N+ESKPENAVLKVAWY
Sbjct: 1   MATIVSFQPHSSLHTSLNSIRPNPSLRICRIHCQGNNPTTDSPNNQESKPENAVLKVAWY 60

Query: 61  GSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGNLTV 120
           GSELLGIAAS+LRPP DV+TP+RAQEL  DVSG+I RP+IVETIK+DF RSYFVTGNLT+
Sbjct: 61  GSELLGIAASYLRPPLDVQTPLRAQELTTDVSGSIPRPLIVETIKEDFRRSYFVTGNLTL 120

Query: 121 EAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFSCIL 180
           +AYEEQCEFADPAGSFKGL RFKRNCTNFGSLVDKSNMKLTKWE FEDKGIGHWKFSCIL
Sbjct: 121 QAYEEQCEFADPAGSFKGLRRFKRNCTNFGSLVDKSNMKLTKWEGFEDKGIGHWKFSCIL 180

Query: 181 SFPWRPILSATGYTEYYFDAGSGKVSR 208
           SFPWRPILSATGYTEYYFDA SGKV R
Sbjct: 181 SFPWRPILSATGYTEYYFDARSGKVCR 207

BLAST of CmoCh17G004300 vs. NCBI nr
Match: gi|225454326|ref|XP_002277369.1| (PREDICTED: uncharacterized protein LOC100258452 [Vitis vinifera])

HSP 1 Score: 345.1 bits (884), Expect = 9.6e-92
Identity = 166/238 (69.75%), Positives = 191/238 (80.25%), Query Frame = 1

Query: 1   MATIFPFQSRT---SLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKV 60
           M ++  F S T   S +   +  RPN   KI + RC GENP + S   +ES+PENA+LKV
Sbjct: 17  MTSLLSFTSVTPHISFRHRNDFFRPNYHHKIHRFRCDGENPRSNSSTAQESEPENALLKV 76

Query: 61  AWYGSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGN 120
           AWYGSELLGIAASF R P+ VE P RA +LA D SGA+ R  +VETIK+DF+RSYFVTGN
Sbjct: 77  AWYGSELLGIAASFFRSPSSVEAPERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGN 136

Query: 121 LTVEAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFS 180
           LT+ AYE+ CEFADPAGSF+GL RFKRNCTNFGSL+ KSNMKL KWEDFEDKGIGHW+FS
Sbjct: 137 LTLSAYEDDCEFADPAGSFRGLRRFKRNCTNFGSLIQKSNMKLMKWEDFEDKGIGHWRFS 196

Query: 181 CILSFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWFKK 236
           C+LSFPW+PILSATGYTEYYFD+ SGKV RHVEHWNVPKMALL QILRP+R + W KK
Sbjct: 197 CVLSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRPSRGF-WGKK 253

BLAST of CmoCh17G004300 vs. NCBI nr
Match: gi|297745341|emb|CBI40421.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 345.1 bits (884), Expect = 9.6e-92
Identity = 166/238 (69.75%), Positives = 191/238 (80.25%), Query Frame = 1

Query: 1   MATIFPFQSRT---SLQTSLNAIRPNRPLKICKIRCQGENPATGSPKNRESKPENAVLKV 60
           M ++  F S T   S +   +  RPN   KI + RC GENP + S   +ES+PENA+LKV
Sbjct: 1   MTSLLSFTSVTPHISFRHRNDFFRPNYHHKIHRFRCDGENPRSNSSTAQESEPENALLKV 60

Query: 61  AWYGSELLGIAASFLRPPTDVETPVRAQELARDVSGAIRRPVIVETIKKDFERSYFVTGN 120
           AWYGSELLGIAASF R P+ VE P RA +LA D SGA+ R  +VETIK+DF+RSYFVTGN
Sbjct: 61  AWYGSELLGIAASFFRSPSSVEAPERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGN 120

Query: 121 LTVEAYEEQCEFADPAGSFKGLSRFKRNCTNFGSLVDKSNMKLTKWEDFEDKGIGHWKFS 180
           LT+ AYE+ CEFADPAGSF+GL RFKRNCTNFGSL+ KSNMKL KWEDFEDKGIGHW+FS
Sbjct: 121 LTLSAYEDDCEFADPAGSFRGLRRFKRNCTNFGSLIQKSNMKLMKWEDFEDKGIGHWRFS 180

Query: 181 CILSFPWRPILSATGYTEYYFDAGSGKVSRHVEHWNVPKMALLNQILRPTRAWLWFKK 236
           C+LSFPW+PILSATGYTEYYFD+ SGKV RHVEHWNVPKMALL QILRP+R + W KK
Sbjct: 181 CVLSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRPSRGF-WGKK 237

The following BLAST results are available for this feature:

BLAST of CmoCh17G004300 vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0K7I1_CUCSA  3.7e-98  83.57         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006760 PE=4 SV=1
F6I656_VITVI      6.7e-92  69.75         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02050 PE=4 SV=...
A0A061DSV4_THECC  5.3e-89  74.88         Nuclear transport factor 2 family protein OS=Theobroma cacao GN=TCM_005275 PE=4 ...
A0A059C0X3_EUCGR  2.9e-87  65.16         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00102 PE=4 SV=1
A0A059BZM1_EUCGR  1.4e-86  65.29         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00102 PE=4 SV=1

BLAST of CmoCh17G004300 vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT2G46100.1  1.6e-81  65.38         Nuclear transport factor 2 (NTF2) family protein
AT3G04890.2  6.1e-12  31.73         Uncharacterized conserved protein (DUF2358)

BLAST of CmoCh17G004300 vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659116628|ref|XP_008458172.1|  1.7e-117  85.65         PREDICTED: uncharacterized protein LOC103497690 [Cucumis melo]
gi|449441428|ref|XP_004138484.1|  1.9e-116  84.39         PREDICTED: uncharacterized protein LOC101218208 [Cucumis sativus]
gi|700190486|gb|KGN45690.1|       5.3e-98   83.57         hypothetical protein Csa_6G006760 [Cucumis sativus]
gi|225454326|ref|XP_002277369.1|  9.6e-92   69.75         PREDICTED: uncharacterized protein LOC100258452 [Vitis vinifera]
gi|297745341|emb|CBI40421.3|      9.6e-92   69.75         unnamed protein product [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR018790  DUF2358
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009536      plastid
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh17G004300.1  CmoCh17G004300.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                      Source   Source Term    Source Description   Alignment
IPR018790  Protein of unknown function DUF2358  PFAM     PF10184        DUF2358              coord: 100..208; score: 8.0
None       No IPR available                     PANTHER  PTHR34123      FAMILY NOT NAMED     coord: 21..237; score: 1.1E
None       No IPR available                     PANTHER  PTHR34123:SF1  SUBFAMILY NOT NAMED  coord: 21..237; score: 1.1E

The following gene(s) are paralogous to this gene:

None