Thousands of Duplicate URL's being created by Alexandria

Started by Clancy, 17 July 2015, 01:26:26

Previous topic - Next topic
A financial contribution is greatly appreciated as a support, to help us to keep live the project.
If you like this project you can donate some piece of BitCoin to this address: bc1qy5tgq6tvrckac2a57unxvqcnxamrvhduve9sj9

Clancy

Hi:

I paid a consultant to create and install the Alexandria Book Lib for me on my Joomla 3.x site.  All well and good, EXCEPT, when I started doing site crawls, for SEO and for the redirects, (from my old site), thousands upon thousands (I stopped the Crawler at 16,000+) of duplicate URL's were being created, almost all, it seems, some variant of Alexandria. For example: 

http://www.booknook.biz/ebook_services/list/category/allington_maynard

This, which shouldn't exist.  Our "eBook services" page is in a completely different category and area of the site; and the author has NOTHING to do with that.  This happens over and over (I even posted about this in the Joomla.org forums, http://forum.joomla.org/viewtopic.php?f=712&t=888280  trying to find out how on earth this was happening.)  Sometimes, an author or book gets appended to some utterly unrelated URL, or the other way around. 

Is there something that we don't understand to fix this?  It's costing me real issues with Google, which is giving me CRAWL errors on these.  I'm getting penalized. 

HELP?

Clancy

A financial contribution is greatly appreciated as a support, to help us to keep live the project.
If you like this project you can donate some piece of BitCoin to this address: bc1qy5tgq6tvrckac2a57unxvqcnxamrvhduve9sj9

federica

Hi Clancy,
joomla articles are also affected by the same issue.

Try to access to http://www.booknook.biz/booknook-services/1-ciccio
the page is valid but it should return a 404 error
Al mondo ci sono 10 tipi di persone, quelli che hanno capito il codice binario e quelli che non l'hanno capito.
Informatizzati [url="https://informatizzati.org"]https://informatizzati.org[/url]
Stacca la spina [url="https://disconnessi.org"]https://disconnessi.org[/url]

Clancy

Quote from: federica on 17 July 2015, 10:09:03
Hi Clancy,
joomla articles are also affected by the same issue.

Try to access to http://www.booknook.biz/booknook-services/1-ciccio
the page is valid but it should return a 404 error

Hi, Federica:

Well, that's as may be, but bluntly, I'm not having issues with any other components but this one, in terms of thousands of urls' being created.  And there doesn't seem to be, for example, any instance in which a Zoo module item (say, the blog) combines with a Rok Sprocket item; everything that happens seems to have an ABL "segment" in it.  Therefore, I have to look at ABL for the answers.  I find it positively mind-boggling that there are webmasters who seem to not even KNOW that this is occurring; don't they crawl their own sites, looking for 404's, broken links, etc.?  So, my question is:  is this fixable?  Or are you saying that even though ABL seems to be the most affected, it's not ABL's job to fix it?  (n.b.:  I took a dupe copy of my site, and I removed the ABL module--and the problem went away. So....???)

Also:  http://www.booknook.biz/booknook-services/1-ciccio . Why do you feel that this should be a valid URL?  I have no category named "1" and I have nothing named "ciccio," that I know of? When I regex the site, using S&R for Joomla, I find nothing named "Ciccio."  Can you expand?

Clancy

federica

Joomla allows access to many urls for the same content.
The following links are valid urls and goes all to the homepage without 404 error:
http://www.booknook.biz/booknook-services/1-ciccio
http://www.booknook.biz/booknook-services/1-pippo
http://www.booknook.biz/blog/15-what-boneheaded-mistake-will-longmire-make-this-week

Is not only an ABL problem, is a general problem.

Can you tell me the different urls google sees for one ABL content?
Al mondo ci sono 10 tipi di persone, quelli che hanno capito il codice binario e quelli che non l'hanno capito.
Informatizzati [url="https://informatizzati.org"]https://informatizzati.org[/url]
Stacca la spina [url="https://disconnessi.org"]https://disconnessi.org[/url]

A financial contribution is greatly appreciated as a support, to help us to keep live the project.
If you like this project you can donate some piece of BitCoin to this address: bc1qy5tgq6tvrckac2a57unxvqcnxamrvhduve9sj9