Skip to content

Commit 89fa840

Browse files
committed
Added i686 PhantomJS. Error encoding gbk with Requests GET
1 parent 0fed838 commit 89fa840

File tree

4 files changed

+367
-16
lines changed

4 files changed

+367
-16
lines changed

config.py

Lines changed: 5 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -6,26 +6,15 @@
66

77
request_delay = 0
88

9-
use_selenium = True
9+
use_selenium = False
1010

1111
browser_name = "PhantomJS"
1212

1313
urls_to_crawl = [
1414
{
15-
"url": "http://madeinheights.com",
16-
"follow_links_containing": "madeinheights.com",
17-
"regex_filters": [ r"st.ry" ]
18-
},
19-
{
20-
"url": "http://www.china.com.cn",
21-
"follow_links_containing": "www.china.com.cn",
22-
"regex_filters": [ r"/2014-07/\d\d/" ],
15+
"url": "http://www.cuyoo.com/",
16+
"follow_links_containing": "www.cuyoo.com",
17+
"regex_filters": [ r"/article" ],
2318
"ignore_query_strings": True,
24-
}
25-
## {
26-
## "url": "http://www.cuyoo.com/",
27-
## "follow_links_containing": "cuyoo.com",
28-
## "regex_filters": [ r"/article" ],
29-
## "ignore_query_strings": True,
30-
## },
19+
},
3120
]

phantomjs/ChangeLogLinuxi686

Lines changed: 360 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,360 @@
1+
Please see also http://phantomjs.org/releases.html.
2+
3+
2014-01-25: Version 1.9.7
4+
5+
* Reverted to GhostDriver 1.1.0 instead of 1.1.1 (issue 11915)
6+
* Fixed another warning of obsolete userSpaceScaleFactor on OS X 10.9 (issue 11612)
7+
8+
2014-01-20: Version 1.9.6
9+
10+
* Updated GhostDriver to version 1.1.1 (issue 11877, 11893)
11+
12+
2014-01-19: Version 1.9.3
13+
14+
* Fixed CoreText performance note on OS X 10.9 (issue 11418)
15+
* Fixed warning of obsolete userSpaceScaleFactor on OS X 10.9 (issue 11612)
16+
17+
2013-09-06: Version 1.9.2
18+
19+
* Fixed graphical artifacts with transparent background on Windows (issue 11276, 11007, 11366)
20+
* Updated GhostDriver to version 1.0.4 (issue 11452)
21+
22+
2013-06-04: Version 1.9.1
23+
24+
Critical bug fixes:
25+
26+
* Fixed problems with specifying proxy server (issue 10811, 11117)
27+
* Fixed UTF-8 encoding with system.stdout and system.stderr (issue 11162)
28+
* Ensured that onResourceReceived will be always invoked (issue 11163)
29+
* Fixed module loading from an absolute path on Windows (issue 11165)
30+
* Fixed typo in the command-line option for setting the cache size (11219)
31+
* Fixed possible crash when handling network requests (issue 11252, 11388)
32+
33+
2013-03-20: Version 1.9.0 "Sakura"
34+
35+
New features
36+
37+
* Added spawn and execFile to execute external programs (issue 10219)
38+
* Added the ability to abort network requests (issue 10230)
39+
* Added system access to stdin, stdout, and stderr (issue 10333)
40+
* Added support for custom CA certificates location (issue 10916)
41+
* Added seek function to the File stream (issue 10937)
42+
* Implemented file read for a specified number of bytes (issue 10938)
43+
* Added a callback to handle network error (issue 10954, 10997)
44+
* Added custom encoding support when opening a page (issue 11043)
45+
* Implemented require.stub() support for a factory function (issue 11044)
46+
* Added page loading indicator and progress (issue 11091)
47+
* Added a timeout option for network requests (issue 11129)
48+
49+
Improvements
50+
51+
* Fixed the build on FreeBSD (issue 10597)
52+
* Ensured a consistent 72 dpi for Linux headless rendering (issue 10659)
53+
* Fixed possible PDF error due to invalid CreationDate field (issue 10663)
54+
* Fixed crash when uploading non existing files (issue 10941)
55+
* Improved the autocomplete internal of the interactive/REPL mode (issue 10943)
56+
* Fixed possible crash when accessing inline frames (issue 10947)
57+
* Changed Linux binary package setup to be built on CentOS 5 (issue 10963)
58+
* Extended SSL ignore setting to synchronous XHR (issue 10985)
59+
* Added convenient constants for modifier keys (issue 11056)
60+
* Fixed incorrect date handling in the cookies (issue 11068)
61+
* Updated GhostDriver to version 1.0.3 (issue 11146)
62+
63+
Examples
64+
65+
* Fixed invalid data URI in the netsniff example (issue 10740)
66+
* Implemented a new weather example (issue 10794)
67+
* Fixed rendering issues in render_multi_url (issue 11021)
68+
* Fixed proper event sequence in page_events example (issue 11028)
69+
* Miscellanous tweaks (issue 11082)
70+
71+
2013-03-02: Version 1.8.2
72+
73+
Critical bug fixes:
74+
75+
* Fixed possible PDF error due to invalid CreationDate field (issue 663)
76+
* Fixed crash when uploading non existing files (issue 941)
77+
* Fixed possible crash when accessing inline frames (issue 947)
78+
* Extended SSL ignore setting to synchronous XHR (issue 985)
79+
* Fixed incorrect date handling in the cookies (issue 1068)
80+
81+
2013-01-06: Version 1.8.1
82+
83+
Critical bug fix:
84+
85+
* Mac OS X: Fix possible crash when using some TrueType fonts (issue 690)
86+
87+
2012-12-21: Version 1.8.0 "Blue Winter Rose"
88+
89+
New features
90+
91+
* Integrated GhostDriver as the WebDriver implementation (issue 49)
92+
* Added an option to specify the SSL protocol (issue 174)
93+
* Added encoding support for WebServer's response (issue 505)
94+
* Added process ID (PID) to the System module (issue 769)
95+
* Added properties to obtain page and frame title (issue 799)
96+
* Added page navigation methods (issue 808)
97+
* Added support for modifier keys in keyboard events (issue 835)
98+
* Added onFilePicker callback for more generic file upload API (issue 843)
99+
* Added the ability to set the page content and location (issue 909)
100+
101+
Improvements
102+
103+
* Fixed date parsing in ISO8601 format (issue 187, 267)
104+
* Fixed window.location (issue 530, 632)
105+
* Deregistered multiple callback handler (issue 807)
106+
* Fixed sending of double-click events (issue 848)
107+
* Increases maximum number of redirects (issue 849)
108+
* Fixed keycodes sent for lowercase characters (issue 852)
109+
* Fixed a regression in table row page break (issue 880)
110+
* Completed the CoffeeScript version of the examples (issue 907)
111+
* Updated Qt to version 4.8.4 (issue 918)
112+
* Fixed potential hang in some example scripts (issue 922)
113+
114+
2012-09-22: Version 1.7.0 "Blazing Star"
115+
116+
New features
117+
118+
* Added a module system modelled after CommonJS/Node.js (issue 47)
119+
* Added support for window pop-up (issue 151)
120+
* Static build on Linux (issue 413)
121+
* Added run-time detection of SSL support (issue 484)
122+
* Added more events support (issue 492, 712)
123+
* Added support for disabling automatic proxy detection (issue 580)
124+
* Provided page closing callback (issue 678)
125+
* Added methods to access URL, frames URL, frame Content (issue 758)
126+
* Added more cookies-related API (issue 761)
127+
128+
Improvements
129+
130+
* Refactored command-line options handling (issue 55)
131+
* Improved the workflow for producing release builds (issue 599)
132+
* Improved cookies API and implementation (issue 603, 761)
133+
* Improved frame switching API (issue 654)
134+
* Fixed iframe handling regression (issue 683)
135+
* Fixed OS version number with Windows 8 and Mountain Lion (issue 684, 688)
136+
* Fixed HAR navigation info in the netsniff example (issue 733)
137+
* Fixed compile warnings with Visual Studio (issue 744)
138+
* Removed hacks for static linking on Windows (issue 753)
139+
* Added ICO image handling on Windows (issue 779)
140+
* Fixed font antialiasing on Windows (issue 785)
141+
* Improved Jasmine test runner for Jasmine 1.2 (issue 792)
142+
143+
2012-07-22: Version 1.6.1
144+
145+
Bug fixes
146+
147+
* Don't build the deploy in debug mode (issue 599)
148+
* Fixed building on Windows (issue 424)
149+
* Fixed remote inspector when building statically (issue 430)
150+
151+
2012-06-20: Version 1.6.0 "Lavender"
152+
153+
New features
154+
155+
* Added support for passing arguments to WebPage's evaluate (issue 132)
156+
* Added callbacks for JavaScript onConfirm and onPrompt (issue 133)
157+
* Added stack trace when error occurs (issue 166)
158+
* Added support for local storage path and quota (issue 300)
159+
* Added initial support for cookies handling (issue 354)
160+
* Added support for header footer when printing the page (issue 410, 512)
161+
* Added headers support in the loading request (issue 452)
162+
* Added support to render the web page as base64-encoded string (issue 547)
163+
* Added hooks for navigation event (issue 562)
164+
* Added command-line option to show debug messages (issue 575)
165+
* Added support for the zoom factor for web page rendering (issue 579)
166+
* Added crash reporter for Mac OS X and Linux, based on Google Breakpad (issue 576)
167+
* Added 'os' object to the system module (issue 585)
168+
* Added support for asynchronous evaluation (issue 593)
169+
170+
Improvements
171+
172+
* Fixed remote debugging to work on Mac OS X and Windows (issue 430)
173+
* Fixed web server getting the dropped connection for empty response (issue 451)
174+
* Fixed text rendered as boxes (squares) on headless Linux (issue 460)
175+
* Updated Qt to version 4.8.2 (issue 495)
176+
* Updated CoffeeScript compiler to version 1.3.3 (issue 496)
177+
* Fixed the build script to detect and use MAKEFLAGS (issue 503)
178+
* Fixed the build script to properly pass Qt config flags (issue 507)
179+
* Changed Info.plist to be embedded in Mac OS X executable (issue 528)
180+
* Fixed wrong module require in the imagebin example (issue 536)
181+
* Fixed example scripts to exit with the right exit code (issue 544)
182+
* Fixed build failure with glib 2.31.0+ (issue 559)
183+
* Fixed error handler failures in some cases (issue 589)
184+
* Fixed Twitter-related examples to work with the new site (issue 609)
185+
186+
2012-03-20: Version 1.5.0 "Ghost Flower"
187+
188+
New features
189+
190+
* Added interactive mode, also known as REPL (issue 252)
191+
* Added setting for web security, to allow cross domain XHR (issue 28)
192+
* Added error handler for WebPage object (issue 166)
193+
* Added support for custom HTTP header in the network request (issue 77)
194+
* Added support for read write encoding in the file system module (issue 367)
195+
* Added remote debugging support on Linux (issue 6)
196+
* Added support for proxy authentication (issue 105)
197+
* Added System module, to retrieve environment variables (issue 271) and arguments (issue 276)
198+
* Added fs.readLink function (issue 329)
199+
* Added support for reading and writing binary data (issue 400)
200+
* Added support to retrieve request data in the WebServer? module (issue 340)
201+
* Added support for individual top/bottom/left/right print margins (issue 388)
202+
* Added command-line option --help (issue 347)
203+
* Added short command-line options -v and -h (issue 408)
204+
* Removed support for Flash and other plugins (issue 418)
205+
206+
Bug fixes
207+
208+
* Fixed multiple console.log arguments (issue 36)
209+
* Fixed file upload (issue 307)
210+
* Fixed the web server instance to be asynchronous (issue 326) and still support Keep Alive (issue 416)
211+
* Workaround Qt 4.8.0 crash due to empty URL scheme (issue 365)
212+
* Fixed a Content-Type problem where POST does not work (issue 337)
213+
* Fixed reading body request in the web server even without specific Content-Type (issue 439)
214+
* Fixed Jasmine test runner with Jasmine 1.1 (issue 402)
215+
* Fixed request URL formatting in the web server (issue 437)
216+
* Don't display debugging and warning messages (issue 323)
217+
218+
2011-12-31: Version 1.4.1
219+
220+
Bug fixes
221+
222+
* Fix setting the proxy type (issue 266)
223+
* Workaround for file upload regression (issue 307)
224+
* Fix extraneous messsages in non-debug mode (issue 323)
225+
226+
2011-12-22: Version 1.4.0 "Glory of the Snow"
227+
228+
New features
229+
230+
* Added embedded HTTP server (issue 115)
231+
* Added convenient build script for Linux (issue 197)
232+
* Added support for SOCKS5 proxy (issue 266)
233+
* Updated CoffeeScript compiler to version 1.2 (issue 312)
234+
235+
Bug fixes
236+
237+
* Fix potential crash in QUrl with Qt 4.8 (issue 304)
238+
* Fix bug in CookieJar with QSettings and string (PyPhantomJS issue 10)
239+
* Prevent showing the icon on Mac OS X Dock (issue 281)
240+
241+
Examples
242+
243+
* Added a new example to detect browsers sniffing (issue 263)
244+
* Added HTTP server example (issue 115)
245+
246+
2011-09-23: Version 1.3.0 "Water Lily"
247+
248+
Bug fixes
249+
250+
* Fixed open() and POST method, without specifying the finished handler
251+
* Fixed script execution warning dialog (issue 165)
252+
* Added WebPage.release() to free the web page from memory (issue 154)
253+
* Added special handling of about:blank (issue 235)
254+
* Made a separate network access manager for each page (issue 190)
255+
256+
New features
257+
258+
* Introduced file system API based on CommonJS Filesystem proposal (issue 129)
259+
* Added support for persistent cookies (issue 91)
260+
* Added event handling, currently only for mouse events (issue 234)
261+
* Added page scroll position (issue 162)
262+
* Added HTTP authentication support (issue 45)
263+
* Added callback for page initialization (issue 143)
264+
* Added support to specify script and output encoding (issue 186)
265+
* Added option to allow local content to do cross-domain access (issue 28)
266+
* Added support to apply configurations from a JSON file (issue 180)
267+
* Added a convenient WebPage initialization construction (issue 206)
268+
* Added option to limit the size of disk cache (issue 220)
269+
270+
Examples
271+
272+
* Added a new example on using Modernizr to detect features (issue 144)
273+
* Fixed pizza.js example to use Mobile Yelp (issue 200)
274+
* Fixed netsniff.coffee example due to wrong indentation (issue 225)
275+
* Added an example to show live network traffic (issue 227)
276+
* Added an example demonstrating different output encodings (issue 186)
277+
278+
2011-06-21: Version 1.2.0 "Birds of Paradise"
279+
280+
Version 1.2.0 is a major update. It introduces a whole set of new API.
281+
282+
Bug fixes
283+
284+
* Fixed rendering a very large web page (issue 54)
285+
* Fixed reporting of CoffeeScript compile error (issue 125)
286+
287+
New features
288+
289+
* Added callback for console message (issue 12)
290+
* Improved security model via WebPage object (issue 41)
291+
* Added support for POST, HEAD, PUT, and DELETE (issue 88)
292+
* Scripts filename is now passed as phantom.scriptName
293+
* Added callback to capture resource requests and responses (issue 2)
294+
* Added the ability to load external JavaScript (issue 32)
295+
296+
Examples
297+
298+
* Ported examples to use WebPage object
299+
* Added a new example to upload an image to imagebin.org
300+
* Added a new example to show HTTP POST feature
301+
* Added a new example to sniff network traffic and save it in HAR format
302+
303+
304+
2011-04-27: Version 1.1.0 "Cherry Blossom"
305+
306+
Fixed the script loading to use UTF-8 encoding (Yasuhiro Matsumoto).
307+
308+
Added check for system proxy setting (Yasuhiro Matsumoto).
309+
310+
Fixed building with Cygwin and Qt 4.5 (John Dalton).
311+
312+
Added a new example: driver for QUnit tests (Łukasz Korecki).
313+
314+
Fixed issue #20: problem with JPG transparent color (Alessandro Portale).
315+
316+
Fixed issue #9: ignore first line starting with #! (Matthias, aka fourplusone).
317+
318+
Fixed issue #7: support for file upload for form submission (Matthias, aka fourplusone).
319+
320+
Fixed issue #35: support for disabling images loading (Ariya Hidayat).
321+
322+
Fixed issue #14: enable or disable plugins (Ariya Hidayat).
323+
324+
Added a new example: using Canvas to produce the color wheel (Ariya Hidayat).
325+
326+
Added support for rasterizing as GIF image (Ariya Hidayat).
327+
328+
Added support for CoffeeScript (Ariya Hidayat).
329+
330+
Fixed issue #19: option for setting the proxy (Clint Berry, Ariya Hidayat).
331+
332+
Python implementation using PyQt (James Roe).
333+
334+
Fixed issue #17: Specify paper size for PDF export (Alessandro Portale).
335+
336+
Fixed issue #60: Win32 and OS/2 icon files (Salvador Parra Camacho).
337+
338+
Added clipping rectangle to the render function (Wouter de Bie).
339+
340+
Added an example on sychronous waiting (Gabor Torok).
341+
342+
Added command line option to use disk cache (Jon Turner).
343+
344+
Added text extracting example (Weston Ruter).
345+
346+
Fixed issue #93: Build with Qt < 4.7 (Ariya Hidayat).
347+
348+
Ported all examples to CoffeeScript (Robert Gieseke).
349+
350+
2011-01-17: Version 1.0.0
351+
352+
Initial launch.
353+
354+
The API is centralized at the 'phantom' object (as child of
355+
window object) which has the properties: args, content,
356+
loadStatus, state, userAgent, version, viewportSize, and
357+
the following functions: exit, open, render, sleep.
358+
359+
Several examples are included, among others: web page rasterizer,
360+
weather service, headless test framework driver, and many others.

phantomjs/phantomjs_linux_i686

36.9 MB
Binary file not shown.

0 commit comments

Comments
 (0)