Commit Graph

100 Commits

Author SHA1 Message Date
Musab Gültekin 40f673f2e2 Fixed README. More Go versions added for testing 2019-06-15 22:35:51 +03:00
Musab Gültekin ddff3aee25 Request cancellations support added to Middlewares.
Some core functions refactored as middlewares.
Fixed race condition in exporting system. Now, only one goroutine will be responsible for exporting. This fixes concurrency issues on writing.
2019-06-15 22:27:46 +03:00
Musab Gültekin 83a7b9eb87 Merge pull request #4 from NMelis/master
Create CONTRIBUTING.md
2019-06-15 18:09:58 +03:00
Musab Gültekin f65456f18c Update CONTRIBUTING.md 2019-06-15 18:08:27 +03:00
Musab Gültekin 7b23596a2d Middleware support added. HTML Parsing disable option added.
Goroutine leaks will be tested using leaktest lib.
2019-06-15 17:55:40 +03:00
Melis Nurlan 2e29c47acd Create CONTRIBUTING.md
could add a description of how to become a contributor?
2019-06-15 19:43:49 +07:00
Musab Gültekin 4799b0f7b4 Fixed goroutine leaks. Updated travis build 2019-06-14 17:30:49 +03:00
Musab Gültekin f5b3b0d049 Fixed race conditions on exporters.
MaxIdleConns limit disabled to support unlimited requests to all hosts.
MaxIdleConnsPerHost limit increased to speed up requests to same host.
2019-06-14 16:10:36 +03:00
Musab Gültekin 83bfb01856 Merge pull request #3 from isacikgoz/master
Update README.md
2019-06-14 15:34:31 +03:00
Musab Gültekin b2f32b8830 Merge branch 'master' into master 2019-06-14 15:32:36 +03:00
Musab Gültekin c6ea82447a Travis-CI configuration added. Now, we'll test coverage on every commit. 2019-06-14 15:24:15 +03:00
Musab Gültekin 6caf1effd6 Rendered field exported to support rendered requests on Do function. Data races fixed. 2019-06-14 15:23:56 +03:00
Ibrahim Serdar Acikgoz 7360ffa3c9 Update README.md 2019-06-14 14:57:53 +03:00
Musab Gültekin 1a7d480b36 JS Rendered requests with Chrome support added 2019-06-13 22:08:45 +03:00
Musab Gültekin 76a687e193 Do function refactored 2019-06-13 20:26:07 +03:00
Musab Gültekin 7c43585769 Fixed license copyright 2019-06-13 15:48:02 +03:00
Musab Gültekin 8a6e19a031 New requests on StartRequests func will be made using Geziyor's methods. Not Requests chan
Options field exported.
2019-06-13 14:06:37 +03:00
Musab Gültekin 184081d3bf README updated for more advanced usage. Updated tests. 2019-06-12 22:22:01 +03:00
Musab Gültekin d56ea161a5 Making new requests on StartRequestsFunc is simplified by using channels 2019-06-12 21:54:57 +03:00
Musab Gültekin f7f4e401e2 Metadata adding on requests support added. StartRequests function implemented. 2019-06-12 21:30:45 +03:00
Musab Gültekin bd8d58576f Start requests function implemented. 2019-06-12 12:40:38 +03:00
Musab Gültekin 2f6cb06982 Disabling charset detection implemented. 2019-06-12 11:44:31 +03:00
Musab Gültekin a311a0f998 CSV exporter support added. Not finished for map type. 2019-06-11 20:42:22 +03:00
Musab Gültekin bbdc3bcacd Exporters made optional, as some scrapers only want to see data in console. 2019-06-11 18:59:37 +03:00
Musab Gültekin b8305d5e1a Limiting body reading support implemented. 2019-06-11 16:19:30 +03:00
Musab Gültekin 3790295658 Multiple Exporters and custom Exporters support added. 2019-06-11 16:10:49 +03:00
Musab Gültekin e4e8723426 Callback are now mandatory as almost all the scrapers use it. 2019-06-11 14:24:48 +03:00
Musab Gültekin ca2414c5c8 Request callbacks added.
Recover from all panics and continue scraping.
Only parse HTML if response is HTML.
2019-06-09 21:13:30 +03:00
Musab Gültekin 7abc7a370d Disabling logs support added. 2019-06-09 19:14:46 +03:00
Musab Gültekin b973c1c064 Request delays support added 2019-06-09 14:24:53 +03:00
Musab Gültekin 2263108838 User-Agent change support added. 2019-06-09 13:43:17 +03:00
Musab Gültekin 9263877339 Exporting data all types support added. 2019-06-09 13:22:20 +03:00
Musab Gültekin d967555b62 Global and Domain Concurrency limit implemented. Updated README 2019-06-09 11:53:40 +03:00
Musab Gültekin a9aaf86df3 Automatic determining response and decoding it. 2019-06-09 10:46:32 +03:00
Musab Gültekin f2fba85266 Timeout support added 2019-06-08 20:36:43 +03:00
Musab Gültekin 2e3bd18430 Options refactored to its own file. Timeout increased to 60 sec 2019-06-08 20:36:43 +03:00
Musab Gültekin b90908066b Head API added. Opt renamed to Options. Tests updated. More documentation added. 2019-06-08 20:36:43 +03:00
Musab Gültekin 95d97436bf Create LICENSE.txt 2019-06-08 19:59:49 +03:00
Musab Gültekin 815ae7eec5 Do request support added. Updated docs. 2019-06-08 19:45:48 +03:00
Musab Gültekin 54c7d3550f Gezer renamed to Geziyor 2019-06-08 17:14:10 +03:00
Musab Gültekin c525e0d7d0 Don't visit already visited URLs. Update README 2019-06-08 17:04:00 +03:00
Musab Gültekin edbddf74d8 Cache added to options 2019-06-08 16:01:31 +03:00
Musab Gültekin ca197ff06a Caching added.
JSON File export will append, not truncate.
2019-06-08 15:29:09 +03:00
Musab Gültekin 9e61a96412 Sync and Async requests support added by using go keyword. 2019-06-07 21:31:18 +03:00
Musab Gültekin ee7d498f22 Allowed domains support added 2019-06-07 21:02:31 +03:00
Musab Gültekin 5271095c8e Export chan added. We can now export data by sending to this chan. 2019-06-07 20:52:31 +03:00
Musab Gültekin e58b08cbd6 Relative URL handling added to Response type. 2019-06-07 15:44:14 +03:00
Musab Gültekin 944bd3bada Options used instead of direct parameter passing.
Tests updated.
2019-06-06 23:49:42 +03:00
Musab Gültekin 6358b87472 Use parse function to parse responses, instead of channels.
Parse response as HTML Document using goquery.
Added simple README.
2019-06-06 22:48:57 +03:00
Musab Gültekin 1c96048082 Initial commit 2019-06-06 17:11:19 +03:00