Skip to content

Commit c462210

Browse files
committed
Changed the linkregex to the suggestion in gtoffoli's issue theanti9#7 temporarily while i restructure/rewrite the entire project
1 parent a2abec1 commit c462210

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

PyCrawler.py

100755100644
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@
5252

5353
# Compile keyword and link regex expressions
5454
keywordregex = re.compile('<meta\sname=["\']keywords["\']\scontent=["\'](.*?)["\']\s/>')
55-
linkregex = re.compile('<a.*\shref=[\'"](.*?)[\'"].*?>')
55+
linkregex = re.compile('<a\s(?:.*?\s)*?href=[\'"](.*?)[\'"].*?>')
5656
if domains:
5757
domainregex = re.compile(domains)
5858
else:

0 commit comments

Comments
 (0)