Discussions

ANSWERED

Simple issue: an entire basic HTML website sits behind a password page - how?

How can I log in to the site with 80legs and let it crawl after it has authenticated?
ANSWERED

Inconsistent crawling for same set of data

Hi, we were scraping one of the websites using our own app and URL list. We are now observing some weird behavior: for the same set of URLs, we get different outputs from one crawl to the next. Is it because all the 80legs IPs are blacklisted by that particular website? Kindly reply. Thanks!
ANSWERED

Crawl limit is lower than my plan

I tried to start a new crawl this morning, but for some reason I am being limited to 10,000 URLs instead of the 100,000 URLs in my pricing plan. (Crawls were working normally yesterday.)

HELP !!!

Hi, I'm trying the product and created a crawl with the following link: http://www.bing.es/search?q=Agile+Coach+en+Madrid&count=100&first=600 I also set it to extract emails and go 10 levels deep, but nothing happens (it returns 0 and says completed). What am I doing wrong? You can check all the crawls in my account and you'll see that none of them work.
ANSWERED

I'm trying to create a crawl and keep getting an error.

I've attempted to create a crawl and keep getting errors. I'm trying to scrape for psychic and intuitive guide advisors on sites like yelp.com and etsy.com. I'd appreciate your help with getting this set up.
ANSWERED

Amazon scraping

Is it possible to crawl Amazon and get buy box prices and other info using a list of ASINs? If possible, how?

Throttling requests?

Is it possible to throttle the request rate?
ANSWERED

Crawl not starting

I have a long list of URLs, and the crawl status says "STARTED", but the number of URLs crawled stays at 0.

Not crawling every page

I have found that when crawling some URLs, not all of the pages get crawled. For example, anderstore.com only returns 2 pages, but there are many more. The crawl level is set to 5. Also, the URL www.sedgemoorfire.co.uk doesn't return any image links with the default image crawler. Any ideas on how to fix this?

Why am I getting an error?

My dashboard says "There has been an error. No payment information on file. Visit the 80legs portal to reactive your account." and I can't add any URLs to a URL list or execute any jobs.