Blacklist of search headwords and application rules

Contains list of sensitive words and their respective headwords to be blocked from being stored or shared to third parties.

In case you want to report more headwords that you feel that should be included in this list, please write to data@thegooddata.org

https://docs.google.com/spreadsheet/ccc?key=0Aten5ipBZkcTdGpEVnF2dk8xdGRDNjFOazUxMkdtR0E

Application rules

  1. In the query: Take out accents. Change hyphen, dots and apostrophes with spaces. Ad a space at the end of the query and then review that there are no consecutive spaces between words
  2. Then take the full list of sensitive headwords and check that they are not contained in the query. As long as one is included, discard the whole query as sensitive
  3. Consider all headwords as prefixes, that means that the should be a start of a word. In some cases those headwords have a space at the end, that means that it is not a prefix, but a full word