|
ABSTRACT
This paper presents a complete system for
constructing panoramic image mosaics from sequences of images.
Our mosaic representation associates a transformation matrix with
each input image, rather than explicitly projecting all of the images
onto a common surface (e.g., a cylinder). In particular, to
construct a full view panorama, we introduce a rotational
mosaic representation that associates a rotation matrix (and
optionally a focal length) with each input image. A patch-based
alignment algorithm is developed to quickly align two images given
motion models. Techniques for estimating and refining camera focal
lengths are also presented. In order to reduce accumulated registration errors, we apply
global alignment (block adjustment) to the whole sequence of images,
which results in an optimally registered image mosaic.
To compensate for small amounts of motion parallax introduced by
translations of the camera and other unmodeled distortions, we use a
local alignment (deghosting) technique which warps each image based
on the results of pairwise local image registrations. By combining both
global and local alignment, we significantly improve the quality of
our image mosaics, thereby enabling the creation of full view
panoramic mosaics with hand-held cameras. We also present an inverse texture mapping algorithm for
efficiently extracting environment maps
from our panoramic image mosaics. By mapping the mosaic onto an arbitrary
texture-mapped polyhedron surrounding the origin, we can explore the
virtual environment using standard 3D graphics viewers and hardware without
requiring special-purpose players.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Anandan, P. 1989. A computational framework and an algorithm for the measurement of visual motion. <i>International Journal of Computer Vision</i>, 2(3):283-310.
|
| |
2
|
Anandan, P., Sawhrey, H., Shashua, A., and Simoncelli, E. (Eds.) 1995. In <i>IEEE Workshop on Representations of Visual Scenes</i>, Cambridge, Massachusetts, IEEE Computer Society Press.
|
| |
3
|
Ayache, N. 1989. <i>Vision Stéréoscopique et Perception Multisensorielle </i>. InterEditions: Paris.
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
Danielsson, P.E. 1980. Euclidean distance mapping. <i>Computer Graphics and Image Processing</i>, 14:227-248.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
 |
14
|
|
| |
15
|
|
| |
16
|
Hansen, M., Anandan, P., Dana, K., van der Wal, G., and Burt, P. 1994. Real-time scene stabilization and mosaic construction. In <i>IEEE Workshop on Applications of Computer Vision (WACV'94)</i>, Sarasota, Florida, pp. 54-62.
|
| |
17
|
|
| |
18
|
Heckbert, P. 1989. <i>Fundamentals of Texture Mapping and Image Warping</i>. Master's Thesis, The University of California at Berkeley.
|
| |
19
|
Horn, B.K.P. 1987. Closed-form solution of absolute orientation using unit quaternions. <i>Journal of the Optical Society of America A</i>, 4(4):629-642.
|
| |
20
|
Horn, B.K.P., Hilden, H.M., and Negahdaripour, S. 1988. Closed-form solution of absolute orientation using orthonormal matrices. <i>Journal of the Optical Society of America A</i>, 5(7):1127-1135.
|
| |
21
|
|
| |
22
|
|
| |
23
|
Irani, M., Hsu, S., and Anandan, P. 1995b. Video compression using mosaic representations. <i>Signal Processing: Image Communication </i>, 7:529-552.
|
| |
24
|
|
| |
25
|
Kang, S.B. 1997. A survey of image-based rendering techniques. Technical Report 97/4, Digital Equipment Corporation, Cambridge Research Lab.
|
| |
26
|
|
| |
27
|
Kumar, R., Anandan, P., and Hanna, K. 1994. Shape recovery from multiple views: A parallax based approach. In <i>Image Understanding Workshop</i>, Morgan Kaufmann Publishers: Monterey, CA, pp. 947-955.
|
| |
28
|
|
 |
29
|
|
| |
30
|
Lee, M.-C. et al. 1997. A layered video object coding system using sprite and affine motion model. <i>IEEE Transactions on Circuits and Systems for Video Technology</i>, 7(1):130-145.
|
 |
31
|
|
| |
32
|
Lucas, B.D. and Kanade, T. 1981. An iterative image registration technique with an application in stereo vision. In <i>Seventh International Joint Conference on Artificial Intelligence (IJCAI-81)</i>, Vancouver, pp. 674-679.
|
| |
33
|
Malde, H.E. 1983. Panoramic photographs. <i>American Scientist</i>, 71(2):132-140.
|
| |
34
|
Mann, S. and Picard, R.W. 1994. Virtual bellows: Constructing high-quality images from video. In <i>First IEEE International Conference on Image Processing (ICIP-94)</i>, Austin, Texas, pp. 363-367.
|
 |
35
|
|
| |
36
|
Meehan, J. 1990. <i>Panoramic Photography</i>. Watson-Guptill.
|
| |
37
|
Milgram, D.L. 1977. Adaptive techniques for photomosaicking. <i>IEEE Transactions on Computers</i>, C-26(11):1175-1180.
|
| |
38
|
Moffitt, F.H. and Mikhail, E.M. 1980. <i>Photogrammetry</i>, 3rd edn. Harper & Row: New York.
|
| |
39
|
|
| |
40
|
|
| |
41
|
|
| |
42
|
|
| |
43
|
|
| |
44
|
Quam, L.H. 1984. Hierarchical warp stereo. In <i>Image Understanding Workshop</i>, Science Applications International Corporation, New Orleans, Louisiana, pp. 149-155.
|
| |
45
|
|
| |
46
|
|
| |
47
|
Sawhney, H.S. 1994. Simplifying motion and structure analysis using planar parallax and image warping. In <i>Twelfth International Conference on Pattern Recognition (ICPR'94)</i>, Jerusalem, Israel, IEEE Computer Society Press, pp. 403-408.
|
| |
48
|
|
| |
49
|
|
| |
50
|
Shi, J. and Tomasi, C. 1994. Good features to track. In <i>IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'94)</i>, Seattle, Washington, IEEE Computer Society, pp. 593-600.
|
| |
51
|
Shum, H.-Y. and Szeliski, R. 1997. Panoramic image mosaicing. Technical Report MSR-TR-97-23, Microsoft Research.
|
| |
52
|
|
| |
53
|
|
| |
54
|
Szeliski, R. 1994. Image mosaicing for tele-reality applications. In <i>IEEE Workshop on Applications of Computer Vision (WACV'94)</i>, Sarasota, Florida, IEEE Computer Society, pp. 44-53.
|
| |
55
|
|
| |
56
|
Szeliski, R. 1996. Video mosaics for virtual environments. <i>IEEE Computer Graphics and Applications</i>, 16(2):22-30.
|
| |
57
|
|
| |
58
|
|
| |
59
|
Tsai, R.Y. 1987. A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses. <i>IEEE Journal of Robotics and Automation</i>, RA-3(4):323-344.
|
 |
60
|
|
 |
61
|
|
 |
62
|
|
| |
63
|
|
| |
64
|
Wolf, P.R. 1974. <i>Elements of Photogrammetry</i>. McGraw-Hill: New York.
|
| |
65
|
|
| |
66
|
|
CITED BY 40
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
GuoJun Liu , XiangLong Tang , H. D. Cheng , JianHua Huang , JiaFeng Liu, A novel approach for tracking high speed skaters in sports using a panning camera, Pattern Recognition, v.42 n.11, p.2922-2935, November, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ayman Kaheel , Motaz El-Saban , Mahmoud Refaat , Mostafa Ezz, Mobicast: a system for collaborative event casting using mobile phones, Proceedings of the 8th International Conference on Mobile and Ubiquitous Multimedia, p.1-8, November 22-25, 2009, Cambridge, United Kingdom
|
|
|
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.1
Multimedia Information Systems
Subjects:
Artificial, augmented, and virtual realities
Additional Classification:
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.3
COMPUTER GRAPHICS
I.3.7
Three-Dimensional Graphics and Realism
Subjects:
Virtual reality
I.4
IMAGE PROCESSING AND COMPUTER VISION
I.4.8
Scene Analysis
Subjects:
Range data
General Terms:
Algorithms,
Experimentation,
Measurement,
Theory
Keywords:
global alignment,
image mosaics,
local alignment,
panoramas,
parametric motion estimation,
virtual environment modeling
|