We have already mentioned Apify in our email prospecting tools, for the Salesdorado email finder.Īpify is a platform that allows you to execute code on a medium scale, without having to manage anything on the server setup. Apify : to scrape between 100 lines - Little web culture required (no-code) Note that using a Spreadsheet opens the door to dynamic processes to refresh or enrich your data dynamically.
This is what is used in Salesdorado's lead scorer to get the title of the domain homepage associated with a contact's email address.
You can, for example, retrieve all the H2 titles of the article you are reading by writing =importxml("", "//h2") to a cell in a Google Sheets spreadsheet. Although not widely used, xPath queries can be used to retrieve structured data from the content of web pages. You can scrap quite easily using xPath, Google Sheets and the =importxml function. Thanks to the XPath syntax (very important in webscraping, and not specific to this use by Google Spreadsheets), you can obtain any element of a web page very easily. Here again, a rather "silly" use case, but Google Spreadsheets allows you to do a lot of things thanks to the ImportXML function. Google Spreadsheets: under 1000 rows, but with some complicated elements to retrieve
If you are looking for postcodes, common first names, telephone codes, it takes a minute with this method. You can copy and paste all the tables that are on Wikipedia into an Excel file or a Google Spreadsheet, for example. It may sound silly, but we often forget how well copy and paste works. 7 of these 10 methods require no (or almost no) prior knowledge. From the eternal copy and paste (which works much better than you might think), to more complex methods for larger projects. In this article, we will present 10 methods and tools for web scraping. Personalise the customer experience automatically, etc.It is a useful method in many situations: Web scraping is the extraction of data from a website in a structured way.