Python 熊猫创建一个包含 n 个元素的系列(顺序或随机)
声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow
原文地址: http://stackoverflow.com/questions/33828475/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me):
StackOverFlow
pandas create a series with n elements (sequential or randbetween)
提问by becinyus
I am trying to create a pandas
series.
我正在尝试创建一个pandas
系列。
One column of the series should contain n sequential numbers. [1, 2, 3, ..., n]
该系列的一列应包含 n 个序列号。 [1, 2, 3, ..., n]
One column should contain random numbers between k
and k+100
.
一列应包含k
和之间的随机数k+100
。
One column should contain random selection between strings in a list. ['A', 'B', 'C', ... 'Z']
一列应包含列表中字符串之间的随机选择。 ['A', 'B', 'C', ... 'Z']
采纳答案by jezrael
There can be a lot of solutions. In the comments of the code block (#
) you will find a few links for more information:
可以有很多解决方案。在代码块 ( #
)的注释中,您会找到一些链接以获取更多信息:
import pandas as pd
import numpy as np
import random
import string
k = 5
N = 10
#http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.randint.html
#http://stackoverflow.com/a/2257449/2901002
df = pd.DataFrame({ 'A' : range(1, N + 1 ,1),
'B' : np.random.randint(k, k + 100 , size=N),
'C' : pd.Series(random.choice(string.ascii_uppercase) for _ in range(N)) })
print df
# A B C
#0 1 60 O
#1 2 94 L
#2 3 10 W
#3 4 94 X
#4 5 60 O
#5 6 20 K
#6 7 58 Y
#7 8 40 I
#8 9 49 X
#9 10 65 S
Numpysolution:
麻木解决方案:
import pandas as pd
import numpy as np
k = 5
N = 10
alphabet = list('ABCDEFGHIJKLMNOPQRSTUVWXYZ')
#http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.choice.html
df = pd.DataFrame({ 'A' : range(1, N + 1 ,1),
'B' : np.random.randint(k, k + 100 , size=N),
'C' : np.random.choice(np.array(alphabet, dtype="|S1"), N) })
print df
# A B C
#0 1 16 U
#1 2 76 X
#2 3 101 N
#3 4 61 F
#4 5 52 J
#5 6 62 A
#6 7 99 L
#7 8 23 N
#8 9 75 D
#9 10 16 Q
回答by Tom Ron
import pandas
n = 30
k = 40
pandas.DataFrame([(i, random.randint(k, k+100), chr(random.randint(ord('A'), ord('Z')))) for i in xrange(0, n)
If you want you specify the column names otherwise it is set to 0,1,2
如果您想指定列名,否则将其设置为 0,1,2