Most meta data is straight-up boring: author’s name, number of revisions, date created, etc. However, this information also includes tracked changes in your document, your revision history, and deleted comments. All word processors attach meta data by default, including Microsoft Word, OpenOffice.org, and Corel WordPerfect.

The discovery of a document’s meta data could be detrimental to your case or contract negotiation. Several states have deemed the searching of meta data to be unethical. Others have placed the burden of removing such data on the document creator. Regardless of your jurisdiction’s practice, it is always better to be safe than sorry.

Fortunately, removing meta data from your documents is a piece of cake.

Cleansing documents with Word

Microsoft’s integrated Document Inspector is designed to remove meta data from Word, PowerPoint, and Excel files, but it should have no problem handling OpenOffice.org or WordPerfect documents if you open them in Word. Here’s a quick run-down on how to work this amazingly simple feature:

  1. Open the Office document that you want to inspect for hidden data or personal information.
  2. Click the Microsoft Office Button, click Save As, and save a copy of your original document. It is a good idea to use the Document Inspector on a copy of your original document because it is not always possible to restore the data that Document Inspector removes.
  3. In the copy of your document, click the Microsoft Office Button, point to Prepare, and then click Inspect Document.
  4. In the Document Inspector dialog box, select the check boxes to choose the types of hidden content that you want to look for. (Microsoft has more information about what information Document Inspector can find and remove.)
  5. Click Inspect.
  6. Review the results of the inspection in the Document Inspector dialog box.
  7. Click Remove All next to the inspection results for the types of hidden content that you want to remove from your document.

Cleansing documents with OpenOffice.org

OpenOffice.org works a little differently. OpenDocument Format files (and Microsoft’s new OOXML files) are not really individual files, but many files packed into one compressed archive (like a .zip file). If you are a Linux user, open up an .odt file with an archive manager to see what I mean. To perform the same operation in Windows, back up your original file and then change the extension from “.odt” to “.zip”. To delete the meta data, you will need to delete the meta.xml file inside the archive.

Screenshot of .ODT in archive manager
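If you would rather not delete meta.xml by hand, the same step can be scripted. The following is only a minimal sketch in Python using the standard zipfile module; the function name strip_meta and the file names are made up for illustration. It writes a new copy of the archive without meta.xml and leaves the original untouched. It does not edit META-INF/manifest.xml, which may still list meta.xml, so test the result on a copy in your own word processor before relying on it.

```python
import zipfile


def strip_meta(src_path, dst_path, skip=("meta.xml",)):
    """Copy an ODF archive to dst_path, leaving out the entries named in skip."""
    with zipfile.ZipFile(src_path) as src, zipfile.ZipFile(dst_path, "w") as dst:
        # infolist() preserves archive order, so 'mimetype' stays the first entry.
        for item in src.infolist():
            if item.filename in skip:
                continue
            # The 'mimetype' entry is kept uncompressed; everything else is deflated.
            if item.filename == "mimetype":
                method = zipfile.ZIP_STORED
            else:
                method = zipfile.ZIP_DEFLATED
            dst.writestr(item, src.read(item.filename), compress_type=method)
```

A call such as strip_meta("brief.odt", "brief-clean.odt") would produce the cleaned copy; both file names here are placeholders.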

Each time you edit and save an .odt file, the meta.xml file is regenerated. Thus, you must remove the meta.xml file each time the .odt file is edited. Before deleting it, you can double-click meta.xml in the archive manager to open it in your browser window and see exactly what meta data is attached to the document.
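To see what meta.xml holds without opening the archive by hand, a short script can print its fields. This is a rough sketch, not an official tool: the function name read_meta is hypothetical, and it assumes the usual OpenDocument layout in which the fields sit under an office:meta element.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the office:* elements in OpenDocument files.
OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0"


def read_meta(odt_path):
    """Return (field name, text) pairs for each element under office:meta."""
    with zipfile.ZipFile(odt_path) as pkg:
        if "meta.xml" not in pkg.namelist():
            return []  # already stripped, nothing to report
        root = ET.fromstring(pkg.read("meta.xml"))
    meta = root.find("{%s}meta" % OFFICE_NS)
    if meta is None:
        return []
    fields = []
    for child in meta:
        name = child.tag.split("}")[-1]  # drop the XML namespace prefix
        fields.append((name, (child.text or "").strip()))
    return fields
```

Looping over read_meta("brief.odt") and printing each pair gives a quick list of what a recipient could read, such as the creator and the number of editing cycles. The file name is a placeholder.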

While removing the meta.xml file will remove most personal information from the document, it will not remove tracked changes. These recorded changes are not meta data at all, but actual data that you can view and print.

To stop recording this data, go to Edit > Changes and uncheck “Record.” This will prevent OpenOffice.org from recording any further changes made by any user. If changes have already been recorded, you can view them simply by clicking Edit > Changes > Show. To delete all recorded changes to your document, click Edit > Changes > Accept or Reject. You will then be presented with a pop-up box that will allow you to delete any change according to author, type (insertion or deletion), date, and comment.
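Before sending a file out, it may be worth confirming that no recorded changes are left in it. The sketch below counts the change records stored in content.xml. It assumes the document stores each recorded change as a text:changed-region element, which is how OpenDocument text files normally record them; the function name count_tracked_changes is made up for this example.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the text:* elements in OpenDocument files.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def count_tracked_changes(odt_path):
    """Count text:changed-region elements in the document's content.xml."""
    with zipfile.ZipFile(odt_path) as pkg:
        root = ET.fromstring(pkg.read("content.xml"))
    return sum(1 for _ in root.iter("{%s}changed-region" % TEXT_NS))
```

A result of zero suggests the recorded changes have all been accepted or rejected; anything higher means there is still something for the other side to read.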

To ensure anonymity and prevent the dissemination of privileged information, it is best not to use the record changes feature in OpenOffice.org.

Cleansing documents with WordPerfect

To save a WordPerfect document without meta data, choose File > Save without meta data. Piece of cake. For more information on how to remove existing meta data from a WordPerfect document, read Corel’s excellent instructional article.

Remove hidden data and personal information from Office documents | Microsoft

5 Comments

  1. Catherine Mulcahey says:

    Please help. I have Microsoft Office:Mac and can’t find Document Inspector.

    • Guzman Rejon says:

      MetaClean is a powerful tool to view, search, remove and edit metadata of Microsoft Office (Word, Excel, PowerPoint and Visio), OpenOffice (word processors, spreadsheets and presentations) and PDF documents.

      Compatible with Microsoft Windows, Linux, Unix and Mac OS X.

      MetaClean is specially designed to clean hundreds of documents at once.

      More info http://www.adarsus.com/en/metaclean.html

  2. luc prévost says:

    Bonjour Mme Mulcahey,

    Go to Preferences then click on Security.
    Select “Remove personally identifiable information from the file on save” and “Warn before printing, saving or sending a document that contains tracked changes or comments”.

    Et voila!

    luc

  3. tom says:

    For the mac:

    It may work to simply set Word 2008 on mac to “remove personal information from this file on save” under Preferences in the Word menu, Personal Settings, Security, Privacy options. If you only wish to enforce the security measure on a final save it may be sufficient to select the “Warn before printing, saving, or sending a file that contains tracked changes or comments” check box.

    This doesn’t provide the fine-grained security selectivity that I understand Word provides on the Windows platforms, but would seem to get the job done.

    Alternatively, there’s this service:

    PDFs can contain metadata that should ideally be scrubbed as well, so probably a good idea to look at all filetypes that may be shared.

  4. Will Geer says:

    Thanks for the help guys. I just got my MacPro in today, but unfortunately it does not have Word. A quick Google search pulled it up for me though.
