Java Web Crawler: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
Line 19: | Line 19: | ||
* [https://stackoverflow.com/questions/11559464/ EhCache overflow to disk at specific path] | * [https://stackoverflow.com/questions/11559464/ EhCache overflow to disk at specific path] | ||
* [https://javabeat.net/enablecaching-spring/ <code>@EnableCaching</code> Annotation in Spring] | * [https://javabeat.net/enablecaching-spring/ <code>@EnableCaching</code> Annotation in Spring] | ||
* [https://stackoverflow.com/questions/12836114/ Selenium Webdriver Remote Setup] | |||
* [https://dimitr.im/spring-boot-cache-ehcache Using EhCache 3 with Spring boot] | * [https://dimitr.im/spring-boot-cache-ehcache Using EhCache 3 with Spring boot] | ||
* [https://underthehood.meltwater.com/blog/2016/11/09/using-docker-with-selenium-server-to-run-your-browser-tests/ Using Selenium-Server on Docker] | |||
* [https://examples.javacodegeeks.com/enterprise-java/spring/boot/spring-boot-ehcache-example/ Spring Boot Ehcache Example] | * [https://examples.javacodegeeks.com/enterprise-java/spring/boot/spring-boot-ehcache-example/ Spring Boot Ehcache Example] | ||
* [https://www.baeldung.com/spring-boot-evict-cache Cache Eviction in Spring Boot] | * [https://www.baeldung.com/spring-boot-evict-cache Cache Eviction in Spring Boot] | ||
* [https://www.scrapingbee.com/blog/introduction-to-chrome-headless/ Chrome Headless with Java] | |||
|} | |} |
Revision as of 04:54, 9 October 2020
A web crawler, or spider, is a type of bot that's typically operated by search engines like Google and Bing. Their purpose is to index the content of websites all across the Internet so that those websites can appear in search engine results.