scrapy
Python/Scrapy question: How to get cleaner results?
My task for a project is to data mine a website for specific names. My experience with python isn\'t high. When I scraped all the names, they come out in this format:[详细]
2023-03-18 06:54 分类:问答Scrapy, hash tag on URLs
I\'m on the middle of a scrapping project using Scrapy. I realized that Scrapy strips the URL from a hash tag to the end.[详细]
2023-03-18 01:34 分类:问答ImportError: Scrapy for python no longer works, No module named scrapy.cmdline
Running Scrapy yields me the error: ImportError: No module named scrapy.cmdline I looked and I do have that m开发者_如何学编程odule in the correct location. Can this issue be resolved by creating a[详细]
2023-03-17 06:41 分类:问答Scrapy Crawl URLs in Order
So, my problem is relatively simple.I have one spider crawling multiple sites, and I need it to return the data in the order I write it in my code.It\'s posted below.[详细]
2023-03-16 21:25 分类:问答Python Scrapy: Convert relative paths to absolute paths
I have amended the code based on solutions offered below by the great folks here; I get the error shown below the code here.[详细]
2023-03-16 05:24 分类:问答HTTPCACHE not working when deploying scrapy project to scrapyd
When I try to use HTTPCACHE with scrapyd I get the following error: [scr开发者_如何转开发apy] WARNING: Disabled Httpcache Middlware: unable to find scrapy.cfg file to infer project data dirThe proble[详细]
2023-03-16 04:06 分类:问答Scraping from urls which contain regular expression
I\'ve been trying to scrape data from the website: http://uk.ratemyteachers.com/. I want to get information from a certain number of teachers who I do not know the name from the website.[详细]
2023-03-16 02:09 分类:问答Scrapy Python spider: Storing results in Latin-1, not in unicode
Currently my spider fetches results as needed but encodes them in unicode (UTF-8, I believe). When I save these results to a csv, I have a ton of cleaning to do as a result,开发者_开发问答 with all th[详细]
2023-03-15 15:30 分类:问答Scraping data straight to Cassandra database
Can anyone help me with Cassandra databa开发者_如何学运维se? Actually, I want to through the scraped data directly toCassandra. Currently, I am using scrapy for parsing data from different websites.[详细]
2023-03-15 12:35 分类:问答Trying to Use an ItemExporter in Scrapy
I\'m trying to implement some sort of Item Exporter in my code.My basic code is right now to scrape si.com for batting averages, just as an example.The results are presented in one long row, and I\'d[详细]
2023-03-15 06:51 分类:问答