为了账号安全,请及时绑定邮箱和手机立即绑定

如何创建一个函数来删除 Python 中的特殊字符以进行特征工程?

如何创建一个函数来删除 Python 中的特殊字符以进行特征工程?

冉冉说 2022-11-29 16:58:47
我想创建一个函数来从熊猫数据框中删除特殊字符,但也传递一个参数来保留所需的字符。def strip_characters(c, req_char = ''):    spec_chars = ["!",'"',"#","%","&","'","(",")","*","+",",","-",".","/",":",";","<","=",">","?","@","[","\\","]","^","_","`","{","|","}","~","–"]    new_spec = spec_chars.remove(req_char)    for char in spec_chars:        c = c.str.replace(char, ' ')    return cdf['col'] = df['col'].apply(strip_characters,',')  # passing a comma to retain the character# df['col'] = ['Dining Room', 'Pre-War', 'Laundry in Building', '&Lobby']
查看完整描述

1 回答

?
慕桂英546537

TA贡献1848条经验 获得超10个赞

尝试这个,


import pandas as pd


df = pd.DataFrame({'col':['Dining Room', 'Pre-War', 'Laundry in Building', '&Lobby']})


# ([^) means match anything but word character

# "[^\w+|,]" to exclude specific character's from being replaced


df['col'].str.replace("[^\w+]"," ")


输出


0            Dining Room

1                Pre War

2    Laundry in Building

3                  Lobby


查看完整回答
反对 回复 2022-11-29
  • 1 回答
  • 0 关注
  • 86 浏览
慕课专栏
更多

添加回答

举报

0/150
提交
取消
意见反馈 帮助中心 APP下载
官方微信