Python pandas 数据框中整列的子字符串

Question

提问by thenakulchawla

I have a pandas dataframe "df". In this dataframe I have multiple columns, one of which I have to substring. Lets say the column name is "col". I can run a "for" loop like below and substring the column:

我有一个熊猫数据框“df”。在这个数据框中，我有多个列，我必须对其中一列进行子串。假设列名是“col”。我可以运行如下所示的“for”循环并对列进行子字符串化：

for i in range(0,len(df)):
  df.iloc[i].col = df.iloc[i].col[:9]

But I wanted to know, if there is an option where I don't have to use a "for" loop, and do it directly using an attribute.I have huge amount of data, and if I do this, the data will take a very long time process.

但我想知道，如果有一个选项，我不必使用“for”循环，而是直接使用属性来做。我有大量的数据，如果我这样做，数据将需要一个很长的过程。

Answer 1

回答by ayhan

Use the straccessor with square brackets:

使用str带方括号的访问器：

df['col'] = df['col'].str[:9]

Or str.slice:

或str.slice：

df['col'] = df['col'].str.slice(0, 9)

Answer 2

回答by Radiumcola

I needed to convert a single column of strings of form nn.n% to float. I needed to remove the % from the element in each row. The attend data frame has two columns.

我需要将一列形式为 nn.n% 的字符串转换为浮点数。我需要从每一行的元素中删除 %。出席数据框有两列。

attend.iloc[:,1:2]=attend.iloc[:,1:2].applymap(lambda x: float(x[:-1]))

Its an extenstion to the original answer. In my case it takes a dataframe and applies a function to each value in a specific column. The function removes the last character and converts the remaining string to float.

它是原始答案的扩展。在我的情况下，它需要一个数据框并将一个函数应用于特定列中的每个值。该函数删除最后一个字符并将剩余的字符串转换为浮点数。

Python pandas 数据框中整列的子字符串

提问by thenakulchawla

回答by ayhan

回答by Radiumcola

相关推荐

最近更新

标签

Python pandas 数据框中整列的子字符串

提问by thenakulchawla

回答by ayhan

回答by Radiumcola

相关推荐

在 python 中的单词上拆分语音音频文件

加载 MySQLdb 模块时出错“您是否安装了 mysqlclient 或 MySQL-python？”

Python：将列表转换为带有索引的字典

Python numpy 数组类型错误：只有整数标量数组可以转换为标量索引

相关推荐

最近更新

标签