Этот материал находится в платной подписке. Оформи премиум подписку и смотри или слушай Web Crawling with Nodejs (H&M, Amazon, LinkedIn, AliExpress), а также все другие курсы, прямо сейчас!
Премиум
  1. Урок 1. 00:03:43
    What is the web crawling and web scraping and how is it different ?
  2. Урок 2. 00:04:57
    Legality of web scraping and web crawling
  3. Урок 3. 00:01:19
    Tools we will be using during development
  4. Урок 4. 00:02:33
    Methods of web crawling and web scraping
  5. Урок 5. 00:06:13
    Finding hidden API using Chrome Dev Tools
  6. Урок 6. 00:07:35
    Testing hidden API inside Postman, and finding other section API endpoints
  7. Урок 7. 00:01:46
    Initializing NPM + some info about Nodejs Request and Needle
  8. Урок 8. 00:02:21
    Creating our HTTP request with needle inside Nodejs
  9. Урок 9. 00:03:25
    Adding User-Agent header to get past denial in nodejs
  10. Урок 10. 00:04:06
    Creating MongoDB cluster for saving data
  11. Урок 11. 00:05:21
    Connecting to MongoDB cluster from Nodejs
  12. Урок 12. 00:04:54
    Saving data to MongoDB
  13. Урок 13. 00:07:36
    Getting all products in MongoDB using a loop with offset variable and pagesize
  14. Урок 14. 00:05:25
    Finding hidden API using Chrome Dev Tools
  15. Урок 15. 00:04:26
    Making API request from Postman with correct headers
  16. Урок 16. 00:05:50
    Making API request from Nodejs using Fetch API
  17. Урок 17. 00:04:59
    Getting many items using a for loop and sleep function
  18. Урок 18. 00:03:04
    Saving AliExpress products to MongoDB
  19. Урок 19. 00:01:09
    Intro to project
  20. Урок 20. 00:04:13
    Why are we using HTTP requests and not Puppeteer?
  21. Урок 21. 00:01:27
    Initializing NPM + installing jest, cheerio and needle npm packages
  22. Урок 22. 00:02:40
    Writing our reuseable httpRequest module for our testing and crawling
  23. Урок 23. 00:05:12
    Creating our test HTML file (check resources for URL)
  24. Урок 24. 00:03:27
    Setting up testing and intro to testing
  25. Урок 25. 00:04:17
    Writing our first test for our HTML parser
  26. Урок 26. 00:05:38
    Getting title from product page and making our test pass
  27. Урок 27. 00:02:51
    Getting the price from product page
  28. Урок 28. 00:06:41
    Making our second test and getting product links from page
  29. Урок 29. 00:08:39
    Writing out our actual webcrawling in 6 minutes!
  30. Урок 30. 00:03:08
    Setup so we only crawl only unique product ID's
  31. Урок 31. 00:05:22
    Adding a new test case for different layout + outtro
  32. Урок 32. 00:00:32
    Intro to project
  33. Урок 33. 00:00:34
    Initializing project with puppeteer and cheerio packages
  34. Урок 34. 00:02:44
    Opening Puppeteer browser and navigating to URL
  35. Урок 35. 00:06:05
    Login to Linkedin using Puppeteer
  36. Урок 36. 00:04:48
    Getting profile links on a LinkedIn profile
  37. Урок 37. 00:04:33
    Building web crawler loop for Puppeteer