Python: trying to count the words in a string

Question

Asked by Harry Harry

I'm trying to analyze the contents of a string. If a word has punctuation mixed into it, I want to replace each punctuation character with a space.

For example, if Johnny.Appleseed!is:a*good&farmer is entered as input, it should report 6 words, but my code counts 0. I'm not sure how to remove the unwanted characters.

FYI: I'm using Python 3, and I can't import any libraries.

string = input("type something")
stringss = string.split()

    for c in range(len(stringss)):
        for d in stringss[c]:
            if(stringss[c][d].isalnum != True):
                #something that removes stringss[c][d]
                total+=1
print("words: "+ str(total))

Answer 1

Accepted answer by Ashwini Chaudhary

Simple loop-based solution:

strs = "Johnny.Appleseed!is:a*good&farmer"
lis = []
for c in strs:
    if c.isalnum() or c.isspace():
        lis.append(c)
    else:
        lis.append(' ')

new_strs = "".join(lis)
print(new_strs)          # prints: Johnny Appleseed is a good farmer
print(new_strs.split())  # prints: ['Johnny', 'Appleseed', 'is', 'a', 'good', 'farmer']

Better solution:

Using regex:

>>> import re
>>> from string import punctuation
>>> strs = "Johnny.Appleseed!is:a*good&farmer"
>>> r = re.compile('[{}]'.format(re.escape(punctuation)))  # escape so ], \, ^ and - are literal
>>> new_strs = r.sub(' ', strs)
>>> len(new_strs.split())
6
>>> # using re.split:
>>> strs = "Johnny.Appleseed!is:a*good&farmer"
>>> re.split(r'[^0-9A-Za-z]+', strs)
['Johnny', 'Appleseed', 'is', 'a', 'good', 'farmer']
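
One caveat with the re.split approach: if the input starts or ends with a separator, the result contains empty strings, so counting the list directly would overcount. A minimal sketch of one way to guard against that, assuming the same ASCII-only pattern as above (the helper name count_words and the sample input are made up for illustration):

```python
import re

def count_words(text):
    # Split on runs of non-alphanumeric ASCII characters, then drop the
    # empty strings that re.split produces at the edges of the input.
    return len([w for w in re.split(r'[^0-9A-Za-z]+', text) if w])

print(re.split(r'[^0-9A-Za-z]+', '!Johnny.Appleseed!'))  # ['', 'Johnny', 'Appleseed', '']
print(count_words('!Johnny.Appleseed!'))                 # 2
```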

Answer 2

Answered by Rushy Panchal

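def count_words(string):
    for ltr in ('!', '.'):  # add the rest of the punctuation here
        string = string.replace(ltr, ' ')  # reassign so each replacement builds on the previous one
    return len(string.split())  # split() with no argument ignores repeated spaces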
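
The same replace-then-split idea can also be done in one pass with the built-in str.maketrans and str.translate, which need no imports. This is a swapped-in technique, not what the answer above uses, and the default separators string below is an assumption that only covers the characters in the question's example:

```python
def count_words(text, separators="!.:*&"):
    # Build a table that maps every separator character to a space.
    table = str.maketrans(separators, " " * len(separators))
    # Apply the table once, then split on whitespace.
    return len(text.translate(table).split())

print(count_words("Johnny.Appleseed!is:a*good&farmer"))  # 6
```

Extending the separators argument is enough to cover more punctuation.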

Answer 3

Answered by Prashant Kumar

Here's a one-line solution that doesn't require importing any libraries.
It replaces non-alphanumeric characters (like punctuation) with spaces, and then splits the string.

Inspired by "Python strings split with multiple separators"

>>> s = 'Johnny.Appleseed!is:a*good&farmer'
>>> words = ''.join(c if c.isalnum() else ' ' for c in s).split()
>>> words
['Johnny', 'Appleseed', 'is', 'a', 'good', 'farmer']
>>> len(words)
6
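
It may be worth knowing what isalnum() treats as a separator before relying on this. A small sketch, wrapping the one-liner above in a hypothetical count_words helper, shows that apostrophes and underscores also split words:

```python
def count_words(s):
    # Replace every non-alphanumeric character with a space, then split.
    return len(''.join(c if c.isalnum() else ' ' for c in s).split())

print(count_words("don't stop"))  # 3: "don", "t", "stop"
print(count_words("snake_case"))  # 2: "_" is not alphanumeric
print(count_words(""))            # 0
```

If contractions should count as one word, the condition could be loosened to something like c.isalnum() or c == "'".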

Answer 4

Answered by Dotan

Try this: it builds word_list with re.findall, then creates a dictionary that maps each word to its number of appearances.

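import re
string = "Johnny.Appleseed!is:a*good&farmer"  # sample input
word_list = re.findall(r"[\w']+", string)
# Map each word to the number of times it appears
print({word: word_list.count(word) for word in word_list})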
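
Calling word_list.count(word) for every word rescans the list each time. For long inputs, a single pass with a plain dictionary produces the same mapping and needs no imports. A minimal sketch (the function name word_counts and the sample list are illustrative, not from the answer):

```python
def word_counts(words):
    counts = {}
    for word in words:
        # get() returns 0 the first time a word is seen.
        counts[word] = counts.get(word, 0) + 1
    return counts

print(word_counts(["a", "good", "a", "farmer"]))  # {'a': 2, 'good': 1, 'farmer': 1}
```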

Answer 5

Answered by TMoover

I know this is an old question, but how about this?

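string = "If Johnny.Appleseed!is:a*good&farmer"

# Characters treated as word separators
a = ["*", ":", ".", "!", ",", "&", " "]
new_string = ""

for i in string:
    if i not in a:
        new_string += i
    else:
        new_string += " "

# split() with no argument skips runs of spaces, so adjacent separators don't produce empty words
print(len(new_string.split()))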

Answer 6

Answered by sweet_sugar

How about using Counter from collections?

import re
from collections import Counter

string = "Johnny.Appleseed!is:a*good&farmer"  # sample input
words = re.findall(r'\w+', string)
print(Counter(words))
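
Counter gives per-word frequencies, so the total word count the question asks for is the sum of its values. A short sketch with a made-up sample sentence:

```python
import re
from collections import Counter

text = "a good farmer is a good friend"
counts = Counter(re.findall(r'\w+', text))

print(sum(counts.values()))   # total words: 7
print(len(counts))            # distinct words: 5
print(counts.most_common(2))  # [('a', 2), ('good', 2)]
```

Note that this requires imports, which the question rules out, so it is only an option where that restriction doesn't apply.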

Answer 7

Answered by alien ware

# Write a Python script to count the words in a given string.
s = input("Enter a string: ")
words = s.split()  # splits on whitespace only; punctuation is not treated as a separator
count = 0
for word in words:
    count += 1

print(f"total number of words in the string is: {count}")
