Hi, everyone! Thanks to those who took time to attend today's session ... and, if you missed it, check it out on-demand anytime after tomorrow... Meanwhile, I promised a lot of follow-up info to the folks who attended today, so here it is...
First, let me say that we covered A LOT today. That was not to overwhelm you, but to give you all the information. Just because you know that a type of metadata or hidden content could exist, doesn't mean it is in your document ... and doesn't mean you should be worried about it. The important thing is to have the information -- know what the possibilities are -- so that you can make an educated, confident decision about how you want to handle the hidden data issue for your documents. If you do feel overwhelmed at all by all of the content we covered, I suspect that going through the lists and links provided here will put it all into place. However, if questions or concerns remain after you read this post -- just post a comment and I'll do my best to clear them right up for you...
- To get the Microsoft free hidden data add-in for Word, Excel, and PowerPoint and\or the free Redaction add-in for Word, click here: http://office.microsoft.com/en-us/officeupdate/CD010773571033.aspx ... and you can always get to the general Office downloads site by office.microsoft.com/downloads.
- For the steps to resolving many types of metadata and hidden content discussed today, as well as more discussion on the options available for resolving that content, see my earlier blog post The Whole Truth About Metadata in Office Documents.
- As discussed, you don't need to know VBA to deal with metadata and hidden content, but if you do want to write some simple macros to automate some cleanup steps for yourself and you've never written macros before ... check out an earlier webcast I gave on the basics of Word VBA for document production (most of what you'll learn there applies to Excel and PowerPoint as well):
To view that VBA webcast, click here
- For the sample files used in that webcast, click here
You'll find a couple of macro tips as well, as promised, at the end of this post. But first ...
The list of metadata and hidden data types demonstrated in today's session: If you need to review the steps for determining if these exist in your files or how to remove them ... review today's session on demand. Also, many of these items are in the earlier blog article linked above ... and of course, post a comment with a question if you want the steps for any specific item in this list.
REMEMBER: Just because it's on this list doesn't make it a privacy threat and doesn't mean you have to worry about it. This list includes things that are actual metadata and that are hidden content commonly considered metadata. It is not exhaustive, because I won't include things I think are mentioned as metadata to scare people, but are incredibly unlikely (in my opinion) to be a privacy threat -- like Pivot Cache, slide timings, or style names.
THESE ARE NOT IN ANY PARTICULAR ORDER WITHIN EACH PROGRAM. Decide for yourself which items are important or not. Also be careful before you remove any type of metadata or hidden content, as it might not be a privacy risk, but removing it could cause some functionality in the program or an add-in program you use to not function (for example - custom document properties, bookmarks, the name of the attached template, or document variables). The important thing is simply to be aware of what your files MIGHT contain, so that you can confidently check for that content and, if you feel it's necessary, remove that content...
Notice that the lists below include both actual metadata and hidden content you place in a document but need to be aware if it's there ...
TIP: Consider creating 2 lists for yourself ... one list of items that you want to check for all documents and another, longer list to use just for extremely sensitive documents...
Word
• File Summary info
• Routing Slip
• Versions
• Custom Document Properties
• Fields
• Bookmarks
• Comments
• For tablet users … ink comments and annotations
• Hyperlinks
• Tracked Changes
• Embedded or Linked Objects\files
• Embedded linguistic data
• Fast Save Data
• Hidden Text
• Text with white font color applied, or shading the same color as text
• Content in headers and footers that have been turned off or are not visible because of the length of the header\footer
• Footnotes unseen when viewing the document in Normal view
• Style names
• Name of the document template
• If the file is a template – content saved in AutoText entries
Excel
• File Properties Summary info
• Routing slip
• Custom document properties
• Hidden rows, columns, sheets
• Frozen panes
• Embedded files and objects
• Tracked changes (for shared workbooks only)
• Comments
• For tablet users – ink annotations
• Conditional formatting – or other formatting - applied to obscure content
• Headers and footers are not visible in all views
• Hyperlinks
PowerPoint
• File Properties Summary info
• Custom Document Properties
• Fast Saves
• Comments
• Tablet users - ink annotations
• Saved header, footer, or date content that is not being shown on the applicable slide or master
• Formatting, such as white font color, that obscures text but does not remove it
• Notes page content
• Slides set to be hidden during a slide show
For those who want them ... some sample macro tips follow ...
Using the VB Editor to resolve metadata and other hidden content:
These are just a few tips to get you started. If you've never used the VB Editor to write or edit macros, I suggest starting with the VBA basics webcast I mentioned earlier in this post. And remember, this is just to save time (which can surely be important!)... but you DON'T HAVE TO write VBA to manage metadata and hidden for yourself.
Okie dokie ...
The general tips that follow (how to use the Immediate Window and how to write a For Each>>>Next loop apply to Word, Excel, and PowerPoint (any program in which you can write VBA, actually) -- but these specific examples are all Word properties just to keep it simple...
For on-the-fly document checking, use the Immediate Window:
- on the View menu click Immediate, or press Ctrl+G (note that Ctrl+G isn't a toggle command ... so it won't turn that window off).
To ask for information, start with a question mark, followed immediately by the line of code you want the information on. Press ENTER to get your information (the immediate window only deals with 1 line at a time). For example:
Find out how many comments are in a document:
?ActiveDocument.Bookmarks.Count
Find out how many hyperlinks are in a document:
?ActiveDocument.Hyperlinks.Count
To remove all of a type of item from a document at once:
Use a simple For Each...Next loop. For example:
Delete all hyperlinks from the document (the macro samples that follow need to go in a VBA module):
Sub DelHyp()
Dim alk as Hyperlink
For each alk in ActiveDocument.Hyperlinks
alk.Delete
Next
End Sub
Delete all comments from all currently open documents at once:
Sub DelCmts()
Dim acm as Comment
Dim adc as Document
For each adc in Documents
For each acm in adc.Comments
acm.Delete
Next acm
Next adc
End Sub
(Note that this interface doesn't let me indent lines of code, but good coding practice would indent the lines within the nested loops above for easy reading)
_____________________
If you need help with a macro for any particular metadata or hidden data feature in Word, Excel, or PowerPoint ... post a comment asking for it and I'll post either the answer or a link to the answer ...
Meanwhile - have a great Thursday everyone!!!