I18n setups of Path Auto

Hi,

I am looking from i18N versions of the "strings to remove" of Path Auto plugin.

Is there a repository?

By the way, this is the one I created for Spanish:

un,una,la,el,es,a,ante,con,de,desde,en,entre,hacia, hasta,para,por,según,sin,so,sobre,tras,este,esta,aquel,aquella,su,sus,mi,tu,mis,tus

Groups:
Login to post comments

Not so simple

arhak - Mon, 2008-10-27 19:43

The question is not so simple as it might seem.
It is needed to analyze the grammar in deep.
For instance:
Why including un and una and not uno, unos, unas, etc as well?

Lets think about the sentence unos y ceros (about binary data of course)
the resulting path would be only ceros
Is that always desired?

A bigger list for your considerations, meaning each word has to be considered because depending on the sentences many can play different roles.

For example "se nos hizo tarde" is not the same as "esto es lo que ".
What would you expect for titles as "this is what I know", or "how to do it", the phrase "how to" is very popular in SEO, would you left out the word como?

y, o, u, pero

un, uno, unos, una, unas

el, lo, los, la, las, del, al

a, ante, con, contra, cuando, de, desde, hacia, hasta, para, por, según, si, sobre, tras, ..

mi, mis, tu, tus, su, sus, suyo, suyos, suya, suyas, tal, cual, algo, algún, alguno, alguna, no, ningún, ninguno, ninguna

cuan, cuanto, cuanta, también, tanto, así

que, esto, estos, esta, estas, es, eso, esos, esa, esas, ella, ellas, se, nos

PS: there are a lot of word missing yet (even for consideration only)
BTW I think this discussion should be in spanish


No semantics considered

miro_dietiker's picture
miro_dietiker - Tue, 2008-10-28 00:17

The job of omitting "unwanted?" words is per definition not trivial.

Since pathauto does no semantic analisys of the text, omitting of words can never have the claim to preserve the meaning. It simply means removing words with low relevance regarding SEO aspects (e.g. too short or too common).
Especially when removing negations things obviously change meaning.

I'm pretty sure there are people around who completely object to remove those.

If there is a repository it should be classified into different types of words while the discussion about families of words should be led back into a general discussion about "which classes of words to be removed?"

Greeting - Miro