How to push CSV data to MongoDB using Python

Question

Asked by Viswanathan

I am trying to push CSV data into MongoDB using Python. I'm a beginner with Python and MongoDB. I used the following code:

import csv
import json
import pandas as pd
import sys, getopt, pprint
from pymongo import MongoClient
#CSV to JSON Conversion
csvfile = open('C://test//final-current.csv', 'r')
jsonfile = open('C://test//6.json', 'a')
reader = csv.DictReader( csvfile )
header= [ "S.No", "Instrument Name", "Buy Price", "Buy Quantity", "Sell Price", "Sell Quantity", "Last Traded Price", "Total Traded Quantity", "Average Traded Price", "Open Price", "High Price", "Low Price", "Close Price", "V" ,"Time"]
#fieldnames=header
output=[]
for each in reader:
    row={}
    for field in header:
        row[field]=each[field]
    output.append(row)

json.dump(output, jsonfile, indent=None, sort_keys=False , encoding="UTF-8")
mongo_client=MongoClient() 
db=mongo_client.october_mug_talk
db.segment.drop()
data=pd.read_csv('C://test//6.json', error_bad_lines=0)
df = pd.DataFrame(data)
records = csv.DictReader(df)
db.segment.insert(records)

But the output comes out in this format:

/* 0 */
{
  "_id" : ObjectId("54891c4ffb2a0303b0d43134"),
  "[{\"AverageTradedPrice\":\"0\"" : "BuyPrice:\"349.75\""
}

/* 1 */
{
  "_id" : ObjectId("54891c4ffb2a0303b0d43135"),
  "[{\"AverageTradedPrice\":\"0\"" : "BuyQuantity:\"3000\""
}

/* 2 */
{
  "_id" : ObjectId("54891c4ffb2a0303b0d43136"),
  "[{\"AverageTradedPrice\":\"0\"" : "ClosePrice:\"350\""
}

/* 3 */
{
  "_id" : ObjectId("54891c4ffb2a0303b0d43137"),
  "[{\"AverageTradedPrice\":\"0\"" : "HighPrice:\"0\""
}

What I actually want is one document per _id, with all the other columns shown as fields of that document, e.g.:

 "_id" : ObjectId("54891c4ffb2a0303b0d43137")
    AveragetradedPrice :0
    HighPrice:0
    ClosePrice:350
    buyprice:350.75

Please help me out. Thanks in advance.

Answer 1

Accepted answer by Viswanathan

Thank you for the suggestion. This is the corrected code:

import csv
from pymongo import MongoClient

# Connect to the local MongoDB server and start from an empty collection
mongo_client = MongoClient()
db = mongo_client.october_mug_talk
db.segment.drop()

# Columns to copy from each CSV row into the MongoDB document
header = ["S No", "Instrument Name", "Buy Price", "Buy Quantity", "Sell Price", "Sell Quantity", "Last Traded Price", "Total Traded Quantity", "Average Traded Price", "Open Price", "High Price", "Low Price", "Close Price", "V", "Time"]

# DictReader maps each CSV row to {column name: value}
with open('C://test//final-current.csv', 'r') as csvfile:
    reader = csv.DictReader(csvfile)
    for each in reader:
        row = {}
        for field in header:
            row[field] = each[field]

        # insert_one replaces insert(), which was removed in PyMongo 4
        db.segment.insert_one(row)
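
Note that csv.DictReader returns every value as a string, so the documents store "349.75" rather than 349.75, while the desired output in the question shows plain numbers. Below is a minimal sketch of one way to convert numeric-looking values before inserting. The helper names `parse_value` and `rows_to_documents` and the sample data are hypothetical, made up for illustration; they are not part of the original answer.

```python
import csv
import io


def parse_value(text):
    """Return an int or float when the text looks numeric, else the stripped string."""
    if not isinstance(text, str):
        # DictReader uses None (or a list) for missing or extra fields
        return text
    text = text.strip()
    for cast in (int, float):
        try:
            return cast(text)
        except ValueError:
            continue
    return text


def rows_to_documents(csvfile):
    """Turn each CSV row into a dict with numeric values converted."""
    reader = csv.DictReader(csvfile)
    return [{key: parse_value(value) for key, value in row.items()} for row in reader]


# Hypothetical sample standing in for the real CSV file
sample = io.StringIO("Instrument Name,Buy Price,Buy Quantity\nABC,349.75,3000\n")
documents = rows_to_documents(sample)
print(documents)
```

With a real file, the resulting list could be passed to `db.segment.insert_many(documents)` to insert all rows in one call instead of one call per row.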

Answer 2

Answered by deenaik

There is a better way that needs fewer imports, assuming your CSV has a header row.

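from pymongo import MongoClient
import csv

# DB connectivity
client = MongoClient('localhost', 27017)
db = client.db
collection = db.collection

# Placeholder: set this to the path of your CSV file
FILEPATH = 'yourcsv.csv'

# Function to parse csv to dictionary
def csv_to_dict():
    result = {}
    with open(FILEPATH, newline='') as csvfile:
        reader = csv.DictReader(csvfile)
        for row in reader:
            # 'First_value' stands for the name of the column used as the key
            key = row.pop('First_value')
            result[key] = row
    return result

# Final insert statement: stores all rows together as a single document
collection.insert_one(csv_to_dict())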

Hope that helps.

Answer 3

Answered by Adil

The easiest way is to use pandas. My code is:

import json
import pymongo
import pandas as pd
myclient = pymongo.MongoClient()

df = pd.read_csv('yourcsv.csv',encoding = 'ISO-8859-1')   # loading csv file
df.to_json('yourjson.json')                               # saving to json file
jdf = open('yourjson.json').read()                        # loading the json file 
data = json.loads(jdf)                                    # reading json file

Now you can insert this JSON into your MongoDB database :-]
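
The snippet stops before the actual insert. One thing to watch: `DataFrame.to_json` with its default orientation writes a DataFrame column by column, so `data` ends up as a single nested dict of the form {column: {row index: value}} rather than a list of rows. Below is a minimal sketch of reshaping it into one document per row, using only the standard library. `columns_to_records` is a hypothetical helper name and the sample dict is made up for illustration.

```python
def columns_to_records(data):
    """Reshape {column: {row_index: value}} into a list of per-row dicts."""
    records = {}
    for column, values in data.items():
        for row_index, value in values.items():
            records.setdefault(row_index, {})[column] = value
    # Rows come out in the order their indexes first appeared
    return list(records.values())


# Made-up sample in the shape json.loads() returns for a column-oriented file
data = {
    "Instrument Name": {"0": "ABC", "1": "XYZ"},
    "Buy Price": {"0": 349.75, "1": 120.5},
}
records = columns_to_records(data)
print(records)
```

The resulting list could then be passed to `insert_many` on a collection, for example `myclient['your_db']['your_collection'].insert_many(records)`, where the database and collection names are placeholders. Writing the file with `df.to_json('yourjson.json', orient='records')` should avoid the reshaping step, since that orientation produces a list of row objects directly.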

Answer 4

Answered by Perfect

Why insert the data one row at a time? Take a look at this:

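import pandas as pd
from pymongo import MongoClient

# Replace <your_credentials> with your own connection details
client = MongoClient(<your_credentials>)
database = client['YOUR_DB_NAME']
collection = database['your_collection']

def csv_to_json(filename, header=0):
    # header=0 takes the column names from the first row of the file;
    # header=None would give integer column labels, and MongoDB field names must be strings
    data = pd.read_csv(filename, header=header)
    # One dict per row, ready for insert_many
    return data.to_dict('records')

collection.insert_many(csv_to_json('your_file_path'))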

Be aware that this may crash your app when the file is too big, because the whole file is loaded into memory before the insert.
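
One way to soften the memory problem is to read and insert the file in batches instead of all at once. Below is a minimal sketch, assuming the CSV has a header row. It uses the standard `csv` module rather than pandas, and `insert_in_batches`, `batch_size`, and the stand-in `FakeCollection` are hypothetical names for illustration. Any object with an `insert_many` method works here, including a pymongo collection.

```python
import csv
import io
from itertools import islice


def insert_in_batches(csvfile, collection, batch_size=1000):
    """Insert CSV rows into `collection` batch by batch; return the row count."""
    reader = csv.DictReader(csvfile)
    total = 0
    while True:
        # Pull at most batch_size rows so only one batch is held in memory
        batch = [dict(row) for row in islice(reader, batch_size)]
        if not batch:
            break
        collection.insert_many(batch)
        total += len(batch)
    return total


class FakeCollection:
    """Stand-in for a pymongo collection so the sketch runs without a server."""

    def __init__(self):
        self.batches = []

    def insert_many(self, documents):
        self.batches.append(list(documents))


# Made-up sample data in place of a real file
sample = io.StringIO("name,price\nA,1\nB,2\nC,3\n")
fake = FakeCollection()
count = insert_in_batches(sample, fake, batch_size=2)
print(count, [len(b) for b in fake.batches])
```

With a real database, pass `database['your_collection']` in place of the stand-in and an open file object in place of the sample.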
