This forum lets our users get support, share idea and communicate with other users. Only paid users can see all topics and have the full access to the  forum, if you are not paid user of our softwares, you only can see few posts and only can post on free softwares part.  If you are paid user of our any software, please register a forum account using the same name and email with your software account , we will upgrade your forum account to paid account in few hours, then you can have full access to the forum.
You must be logged in to post Login Register

Full (Pro) Version

UserPost

3:30 am
June 18, 2009


Vincent

Admin

UK

posts 209

1

While ScrapeGenius has got to be one of the best scrapers on the net – and it's totally free – keep your eyes peeled for the pro version, it will blow you away!

12:51 am
June 20, 2009


Target Market

Member

Roving keyboard

posts 3

2

Hey Vince,

re: Free version

Alll I want to do is extract urls which have ONLY the 2 specific keywords.  (widget has 2 words) In short what I'm trying to do is locate ALL of the sites which are either owned by a specific widget owner or are mentioning the widget owner or his widget worldwide within a URL link.  I've conducted several variations of what I think I should do, ie, apply the “exact match”  apply “widget name” within quotes also tried “widget” “name” this way, and tried widget name without quotes.

Point is, every time results return I end up with a url list which includes urls with either one or both widget words independantly and often times without any relevance at all to the widget, since both words independantly are common words.


Don't get me wrong I do have some of what I need because I can look at the url and notice perhaps the first 30 or so may be exact matches, but further down the list of say 900 results returned, the results are inconsistent, then I have to manually hunt.

All I need are the urls that mention both names in the widget preferably in sequence, complete isolation.


Hope this makes sense, perhaps there is a better way, is ScrapeGenius the right “widget” for this job? Cool

Target Market

4:50 pm
June 22, 2009


Vincent

Admin

UK

posts 209

3

Just to double-check, in the advanced tab you should have:

'filter urls' checked

'all of these words' in the drop-down box

widget,name (no quotes but one separating comma) in the dialogue box

You should also have a shedload of keywords in the Google box. Personally I don't use the other two boxes, I don't think they work very well. (Not the software, the SE's!) 


Can you tell me what happens when you try this please?

5:25 pm
June 22, 2009


Target Market

Member

Roving keyboard

posts 3

4

You state “You should also have a shedload of keywords in the Google box”


Ok, obviously I'm missing something.  Aside from the name of my widget what other keywords should be in my list?

11:53 am
June 30, 2009


Target Market

Member

Roving keyboard

posts 3

5

Still no action. What say ye?

1:27 am
July 1, 2009


Vincent

Admin

UK

posts 209

6

One keyword will generate x number of urls. After that, it is not possible to harvest urls because G's SERPS suddenly turns in to a big fat zero. Try it yourself – G is one big con, it may tell you that there are ten gazillion results for your search but have you ever tried viewing them? You will never get past 1000. Anyone could release a search engine that does that. Just cobble together a basic database and then whenever anyone does a search, tell them that there 10 billion results – but only let them ever see 1000.


To get round this problem, you have to do multiple searches, with each search having the potential to return 1000 urls. In order to do multiple searches you need multiple keywords. The more keywords you have, the more urls you will harvest.

7:24 pm
August 18, 2009


bigmark1972

New Member

posts 1

7

I cant seem to get anything out of Google. I am trying to harvest wordpress blogs…