Hi, I'm looking for methods (good, but not too complicated) to anonymize data. Does anyone have experience with this? My raw input corpus keeps growing, and the next step is to get rid of the private data, of which there is actually a lot. One idea is to build a dictionary of private terms and parse the files against it. On the other hand, should I run the analysis on the anonymized data or on the original raw data?
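
To make the dictionary idea concrete, here is a minimal Python sketch of what I have in mind. It is only an illustration under assumptions: the function names (`build_pattern`, `anonymize`), the `<ID_n>` placeholder format, and the sample names are all made up for this post, and it only catches terms that are already in the dictionary.

```python
import re

def build_pattern(private_terms):
    """Compile one regex that matches any term in the dictionary."""
    if not private_terms:
        raise ValueError("the dictionary of private terms is empty")
    # Longest terms first, so a full name wins over a shorter term inside it.
    ordered = sorted(private_terms, key=len, reverse=True)
    alternatives = "|".join(re.escape(term) for term in ordered)
    return re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)

def anonymize(text, pattern, mapping):
    """Replace each private term with a stable placeholder.

    `mapping` is filled in as terms are seen, so the same term always
    gets the same placeholder across all files that share the mapping.
    """
    def replace(match):
        key = match.group(0).lower()
        if key not in mapping:
            # Hypothetical placeholder format, chosen only for this sketch.
            mapping[key] = f"<ID_{len(mapping) + 1}>"
        return mapping[key]
    return pattern.sub(replace, text)

# Example with made-up names
terms = ["Alice Smith", "Bob"]
mapping = {}
pattern = build_pattern(terms)
result = anonymize("Alice Smith met Bob. bob called Alice Smith.", pattern, mapping)
print(result)
```

Reusing the same `mapping` for every file keeps the placeholders consistent across the corpus, so whatever analysis runs on the anonymized text can still tell that two mentions refer to the same entity.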