我想创建一个函数来从熊猫数据框中删除特殊字符,但也传递一个参数来保留所需的字符。def strip_characters(c, req_char = ''): spec_chars = ["!",'"',"#","%","&","'","(",")","*","+",",","-",".","/",":",";","<","=",">","?","@","[","\\","]","^","_","`","{","|","}","~","–"] new_spec = spec_chars.remove(req_char) for char in spec_chars: c = c.str.replace(char, ' ') return cdf['col'] = df['col'].apply(strip_characters,',') # passing a comma to retain the character# df['col'] = ['Dining Room', 'Pre-War', 'Laundry in Building', '&Lobby']
1 回答
慕桂英546537
TA贡献1848条经验 获得超10个赞
尝试这个,
import pandas as pd
df = pd.DataFrame({'col':['Dining Room', 'Pre-War', 'Laundry in Building', '&Lobby']})
# ([^) means match anything but word character
# "[^\w+|,]" to exclude specific character's from being replaced
df['col'].str.replace("[^\w+]"," ")
输出
0 Dining Room
1 Pre War
2 Laundry in Building
3 Lobby
添加回答
举报
0/150
提交
取消