Montreal, Québec, Canada

Roberto Rocha

Data storyteller and educator

Menu
  • Home
  • CV
  • Blog
  • Data services
  • Training
  • Contact

Month: April 2018

setting up AWS Lambda cloudwatch events in Lambda lambda cron job lambda environment variables other settings Lambda Lambda with S3 access Lambda logs lambda_handler

Setting up a Selenium web scraper on AWS Lambda with Python

April 29, 2018February 3, 2022 Roberto 110 Comments

IMPORTANT UPDATE This post is outdated now that AWS Lambda allows users to create and distribute layers with all sorts of plugins and packages, including Selenium and chromedriver. This simplifies a lot of the process. Here’s a post on how to make such a layer. And here’s a list of useful pre-packaged layers. This post […]

Posted in Tutorials
Read More

Recent Posts

  • Pair programming with LLMs: putting 5 leading models to the test
  • How to use ChatGPT Vision to turn handwritten forms into data
  • Using ChatGPT to clean data: an experiment
  • How to extract entities from raw text with Spacy: 3 approaches using Canadian data
  • Getting tabular data from unstructured text with GPT-3: an ongoing experiment

Recent Comments

  • Aditya Sharma on Using Python’s calendar module for scraping date-based data
  • Jed Clark on Using NLP to analyze open-ended responses in surveys
  • Roberto on Using ChatGPT to clean data: an experiment
  • Chris on Using ChatGPT to clean data: an experiment
  • Jacques Dufort on Using NLP to analyze open-ended responses in surveys
Theme: Albar by Kaira