| User | Post |
|
3:30 am June 18, 2009
| Vincent
Admin
| | UK | |
|
| posts 209 |
|
|
While ScrapeGenius has got to be one of the best scrapers on the net – and it's totally free – keep your eyes peeled for the pro version; it will blow you away!
|
|
|
12:51 am June 20, 2009
| Target Market
Member
| | Roving keyboard | |
|
| posts 3 |
|
|
Hey Vince,
re: Free version
All I want to do is extract urls which have ONLY the 2 specific keywords (the widget name has 2 words). In short, what I'm trying to do is locate ALL of the sites worldwide which are either owned by a specific widget owner or mention the widget owner or his widget within a URL link. I've tried several variations of what I think I should do, i.e. applying the “exact match” option, putting “widget name” within quotes, quoting each word separately as “widget” “name”, and entering widget name without quotes.
Point is, every time results return I end up with a url list which includes urls with either one or both widget words independently, oftentimes without any relevance at all to the widget, since both words on their own are common words.
Don't get me wrong, I do have some of what I need: I can look at the url list and notice that perhaps the first 30 or so are exact matches. But further down the list of, say, 900 results returned, the results are inconsistent, so I have to hunt manually.
All I need are the urls that mention both names in the widget, preferably in sequence and in complete isolation.
Hope this makes sense. Perhaps there is a better way; is ScrapeGenius the right “widget” for this job?
Target Market
|
|
|
4:50 pm June 22, 2009
| Vincent
Admin
| | UK | |
|
| posts 209 |
|
|
Just to double-check, in the advanced tab you should have:
'filter urls' checked
'all of these words' in the drop-down box
widget,name (no quotes but one separating comma) in the dialogue box
You should also have a shedload of keywords in the Google box. Personally I don't use the other two boxes, as I don't think they work very well. (Not the software, the search engines!)
Can you tell me what happens when you try this please?
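For anyone who wants to double-check an exported url list by hand, the 'all of these words' idea can be sketched in a few lines of Python. This is only an illustration of the filtering logic, not how ScrapeGenius does it internally; the function names and the example urls are made up for the sketch, and the "in sequence" check assumes the two words are separated by at most one non-alphanumeric character.

```python
import re

def contains_all_words(url, words):
    """True if every word appears somewhere in the url (case-insensitive)."""
    lowered = url.lower()
    return all(w.lower() in lowered for w in words)

def contains_words_in_sequence(url, words):
    """True if the words appear in order, joined directly or by a single
    non-alphanumeric character (e.g. 'widget-name', 'widget_name')."""
    pattern = r"[^a-z0-9]?".join(re.escape(w.lower()) for w in words)
    return re.search(pattern, url.lower()) is not None

# Hypothetical harvested urls, for illustration only.
urls = [
    "http://example.com/widget-name-review",
    "http://example.com/name/of/another/widget",
    "http://example.com/widget",
]
words = ["widget", "name"]

all_match = [u for u in urls if contains_all_words(u, words)]
in_sequence = [u for u in urls if contains_words_in_sequence(u, words)]
```

Here `all_match` keeps the first two urls (both words present, in any order), while `in_sequence` keeps only the first one, which is closer to what the original question asks for.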
|
|
|
5:25 pm June 22, 2009
| Target Market
Member
| | Roving keyboard | |
|
| posts 3 |
|
|
You state: “You should also have a shedload of keywords in the Google box.”
OK, obviously I'm missing something. Aside from the name of my widget, what other keywords should be in my list?
|
|
|
11:53 am June 30, 2009
| Target Market
Member
| | Roving keyboard | |
|
| posts 3 |
|
|
Still no action. What say ye?
|
|
|
1:27 am July 1, 2009
| Vincent
Admin
| | UK | |
|
| posts 209 |
|
|
One keyword will generate x number of urls. After that, it is not possible to harvest urls because G's SERPs suddenly turn into a big fat zero. Try it yourself – G is one big con: it may tell you that there are ten gazillion results for your search, but have you ever tried viewing them? You will never get past 1000. Anyone could release a search engine that does that. Just cobble together a basic database and then, whenever anyone does a search, tell them that there are 10 billion results – but only let them ever see 1000.
To get round this problem, you have to do multiple searches, with each search having the potential to return 1000 urls. In order to do multiple searches you need multiple keywords. The more keywords you have, the more urls you will harvest.
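As a rough sketch of the arithmetic, here is the idea in Python. It assumes the 1000-result ceiling per search described above; the modifier words and function names are invented for the example, and no actual searching is done here, only building the keyword list and merging whatever url lists come back.

```python
RESULTS_CAP = 1000  # per-search ceiling, as described in the post above

def build_queries(base_phrase, modifiers):
    """Combine the quoted base phrase with extra words to get many searches."""
    quoted = f'"{base_phrase}"'
    return [quoted] + [f"{quoted} {m}" for m in modifiers]

def merge_results(result_lists):
    """Merge url lists from several searches, dropping duplicates
    while keeping the order in which urls were first seen."""
    seen = set()
    merged = []
    for results in result_lists:
        for url in results[:RESULTS_CAP]:
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged

# Hypothetical modifiers, for illustration only.
queries = build_queries("widget name", ["review", "forum", "blog"])

# Upper bound on the harvest: one capped result list per query.
max_harvest = len(queries) * RESULTS_CAP
```

With one base phrase and three modifiers there are four searches, so the ceiling rises from 1000 to 4000 urls. The real total will be lower, since the same url tends to turn up under several keywords.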
|
|
|
7:24 pm August 18, 2009
| bigmark1972
New Member
| | | |
|
| posts 1 |
|
|
I can't seem to get anything out of Google. I am trying to harvest WordPress blogs…
|
|