|
ABSTRACT
We present a demonstration-based system for automatically generating succinct step-by-step visual tutorials of photo manipulations. An author first demonstrates the manipulation using an instrumented version of GIMP that records all changes in interface and application state. From the example recording, our system automatically generates tutorials that illustrate the manipulation using images, text, and annotations. It leverages automated image labeling (recognition of facial features and outdoor scene structures in our implementation) to generate more precise text descriptions of many of the steps in the tutorials. A user study comparing our automatically generated tutorials to hand-designed tutorials and screen-capture video recordings finds that users are 20--44% faster and make 60--95% fewer errors using our tutorials. While our system focuses on tutorial generation, we also present some initial work on generating content-dependent macros that use image recognition to automatically transfer selection operations from the example image used in the demonstration to new target images. While our macros are limited to transferring selection operations we demonstrate automatic transfer of several common retouching techniques including eye recoloring, whitening teeth and sunset enhancement.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Maneesh Agrawala , Doantam Phan , Julie Heiser , John Haymaker , Jeff Klingner , Pat Hanrahan , Barbara Tversky, Designing effective step-by-step assembly instructions, ACM SIGGRAPH 2003 Papers, July 27-31, 2003, San Diego, California
|
 |
2
|
|
 |
3
|
Lawrence Bergman , Vittorio Castelli , Tessa Lau , Daniel Oblinger, DocWizards: a system for authoring follow-me documentation wizards, Proceedings of the 18th annual ACM symposium on User interface software and technology, October 23-26, 2005, Seattle, WA, USA
[doi> 10.1145/1095034.1095067]
|
 |
4
|
Michael Bolin , Matthew Webber , Philip Rha , Tom Wilson , Robert C. Miller, Automation and customization of rendered web pages, Proceedings of the 18th annual ACM symposium on User interface software and technology, October 23-26, 2005, Seattle, WA, USA
[doi> 10.1145/1095034.1095062]
|
| |
5
|
Booher, H. 1975. Relative comprehensibility of pictoral information and printed words in proceduralized instructions. In Human Factors, vol. 17, 266--277.
|
| |
6
|
Allen Cypher , Daniel C. Halbert , David Kurlander , Henry Lieberman , David Maulsby , Brad A. Myers , Alan Turransky, Watch what I do: programming by demonstration, MIT Press, Cambridge, MA, 1993
|
 |
7
|
|
| |
8
|
|
| |
9
|
Susan M. Harrison, A comparison of still, animated, or nonillustrated on-line help with written or spoken instructions in a graphical user interface, Proceedings of the SIGCHI conference on Human factors in computing systems, p.82-89, May 07-11, 1995, Denver, Colorado, United States
[doi> 10.1145/223904.223915]
|
 |
10
|
Julie Heiser , Doantam Phan , Maneesh Agrawala , Barbara Tversky , Pat Hanrahan, Identification and validation of cognitive design principles for automated generation of assembly instructions, Proceedings of the working conference on Advanced visual interfaces, May 25-28, 2004, Gallipoli, Italy
[doi> 10.1145/989863.989917]
|
 |
11
|
|
| |
12
|
|
| |
13
|
Huang, G., Ramesh, M., Berg, T., and Learned-Miller, E. 2007. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. UMass, Amherst Technical Report 07--49.
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
 |
17
|
|
 |
18
|
|
| |
19
|
|
 |
20
|
|
 |
21
|
Tessa Lau , Lawrence Bergman , Vittorio Castelli , Daniel Oblinger, Sheepdog: learning procedures for technical support, Proceedings of the 9th international conference on Intelligent user interfaces, January 13-16, 2004, Funchal, Madeira, Portugal
[doi> 10.1145/964442.964464]
|
| |
22
|
|
| |
23
|
Lieberman, H. 2001. Your Wish is My Command: Giving Users the Power to Instruct their Software. Morgan Kaufmann.
|
 |
24
|
Greg Little , Tessa A. Lau , Allen Cypher , James Lin , Eben M. Haber , Eser Kandogan, Koala: capture, share, automate, personalize business processes on the web, Proceedings of the SIGCHI conference on Human factors in computing systems, April 28-May 03, 2007, San Jose, California, USA
[doi> 10.1145/1240624.1240767]
|
| |
25
|
Meng, C., Yasue, M., Imamiya, A., and Mao, X. 1998. Visualizing histories for selective undo and redo. 459--464.
|
 |
26
|
|
 |
27
|
|
| |
28
|
Novick, L. R., and Morse, D. L. 2000. Folding a fish, making a mushroom: The role of diagrams in executing assembly procedures. Memory and Cognition 28, 7, 1242--56.
|
 |
29
|
|
| |
30
|
Su, S. 2007. Visualizing, editing, and inferring structure in 2D graphics. In UIST 2007 Doctoral Symposium.
|
 |
31
|
Michael Terry , Matthew Kay , Brad Van Vugt , Brandon Slack , Terry Park, Ingimp: introducing instrumentation to an end-user open source application, Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems, April 05-10, 2008, Florence, Italy
[doi> 10.1145/1357054.1357152]
|
| |
32
|
Varis, L. 2006. Skin. Wiley Publishing.
|
| |
33
|
Zhou, Y., Gu, L., and Zhang, H. 2003. Bayesian tangent shape model: Estimating shape and pose parameters via bayesian inference. In Proc. CVPR, 109--116.
|
|