Simplest Visual Organs - MIT Media Lab

Mitsubishi Electric Research Laboratories Raskar 2007 

Less is More: 

Coded Computational Photography 

Projector 

Pos=0 

Pos=255 

Tags 

Ramesh Raskar 

Mitsubishi Electric Research Labs (MERL) 

Cambridge, MA 

Simplest Visual Organs 

Larval Trematode Worm 

‘Single Pixel’ Camera 

1



‘Single Pixel’ Camera 


? 


2

Special Aperture 

? 


Special Aperture 

The aperture of a 100 mm lens is modified 

Insert a coded mask with chosen binary pattern 

Rest of the camera is unmodified 

3

LED 

In Focus Photo 

Out of Focus Photo: Open Aperture 

4

Out of Focus Photo: Coded Aperture 

Bokeh 

5

Out of Focus Photo: Coded Aperture 

Captured Blurred Photo 

7

Refocused on Person 

Blurred Photos 

Open Aperture 

Coded Aperture, 7 * 7 Mask 

8

After Removing De-Focus Blur 

Open Aperture 

Coded Aperture, 7 * 7 Mask 

Motion Blurred Photo 

9

Short Exposure, Traditional, MURA 

Shutter 

Captured Single Photo 

Deblurred Result 

Dark and noisy 

Banding artifacts and some spatial frequencies are lost 

10

Blurring == Convolution 

Traditional Camera: Shutter is OPEN: Box Filter 

Sharp Photo, Blurred Photo 

Fourier Transform of the PSF == Sinc Function of ω (its zeros are the lost frequencies) 

Flutter Shutter: Shutter is OPEN and CLOSED 

Sharp Photo, Blurred Photo 

Fourier Transform of the PSF == Broadband Function 

Preserves High Spatial Frequencies 
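
As a rough numerical check of the box-filter versus flutter-shutter comparison, the sketch below computes the discrete Fourier transform magnitude of two exposure patterns. The 8-slot pattern `CODE` is a made-up illustration, not the sequence used in the flutter shutter camera, and the 16-point zero-padded DFT is an assumption chosen to keep the example small.

```python
import cmath


def dft_magnitudes(seq, n):
    """Return |DFT| of seq after zero-padding it to n samples."""
    padded = list(seq) + [0] * (n - len(seq))
    return [
        abs(sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(padded)))
        for k in range(n)
    ]


N = 16                            # DFT length (assumption)
BOX = [1] * 8                     # shutter open for 8 consecutive time slots
CODE = [1, 1, 1, 0, 1, 0, 0, 1]   # hypothetical open/closed pattern

box_mag = dft_magnitudes(BOX, N)
code_mag = dft_magnitudes(CODE, N)

# The box spectrum contains (numerical) zeros; this coded spectrum does not.
print("smallest |DFT|, box :", min(box_mag))
print("smallest |DFT|, code:", min(code_mag))
```

Frequencies where the magnitude is zero cannot be restored by any deblurring step, which is why a code is chosen to keep the whole spectrum away from zero.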

11

Flutter Shutter Camera 

Raskar, Agrawal, Tumblin [SIGGRAPH 2006] 

LCD opacity switched in coded sequence 

Traditional Exposure: Deblurred Image 

Coded Exposure: Deblurred Image 

Image of Static Object 

12

Deblurred Images 

13


14

Application: Aerial Imaging 

Sharpness versus Image Pixel Brightness 

T=100ms 

T = 0 

Long Exposure: 

Short Exposure: 

Flutter Shutter 

Shutter Open 

Shutter Closed 

Time 

Sharp image with 

sufficient brightness 

Motion Blur 

Defocus Blur 

15

Coded Exposure 

Coded Aperture 

Temporal 1-D broadband code: 

Motion Deblurring 

Spatial 2-D broadband mask: 

Focus Deblurring 

Less is More 

Blocking Light 

== More Information 

Coding in Time 

Coding in Space 
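
To illustrate why a broadband code makes the blur reversible, here is a minimal sketch of 1-D deblurring by division in the Fourier domain. It assumes a noise-free, circular blur and reuses a hypothetical 8-slot code; the names `circular_blur` and `deblur` are illustrative and do not come from the talk.

```python
import cmath


def dft(x, inverse=False):
    """Naive DFT (or inverse DFT) of a short sequence."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [
        sum(v * cmath.exp(sign * 2j * cmath.pi * k * t / n)
            for t, v in enumerate(x))
        for k in range(n)
    ]
    return [v / n for v in out] if inverse else out


def circular_blur(signal, kernel):
    """Circular convolution of signal with kernel (motion blur model)."""
    n = len(signal)
    return [
        sum(kernel[j] * signal[(i - j) % n] for j in range(len(kernel)))
        for i in range(n)
    ]


def deblur(blurred, kernel):
    """Invert the blur by dividing spectra; needs a kernel with no zeros."""
    n = len(blurred)
    kernel_spectrum = dft(list(kernel) + [0] * (n - len(kernel)))
    blurred_spectrum = dft(blurred)
    ratio = [b / k for b, k in zip(blurred_spectrum, kernel_spectrum)]
    return [v.real for v in dft(ratio, inverse=True)]


sharp = [0, 0, 5, 9, 2, 0, 0, 7, 7, 0, 0, 0, 3, 0, 0, 0]
code = [1, 1, 1, 0, 1, 0, 0, 1]   # hypothetical shutter code

blurred = circular_blur(sharp, code)
recovered = deblur(blurred, code)
print([round(v, 6) for v in recovered])
```

With an 8-slot box kernel in place of `code`, the division would hit zero-valued spectrum entries, so the same inversion would fail.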

16

Codes at Work 

• Imaging 

– Aperture Modification 

• Without Lens 

– Astronomy [Fenimore and Gottesman, ’89, Skinner, ’88] 

– Nuclear Medicine Imaging [Zhang et al.’99] 

– Lensless Imaging, [Zomet & Nayar, CVPR’06] 

• With Lens 

– Range Imaging, [Johnson et al.’00, Hiura and Matsuyama’98, Farid and Simoncelli’98] 

– Wavefront Coding, CDM Optics 

– Levin et al. Siggraph’07 

• Illumination 

– Global Direct Separation , [Nayar, Guru, Grossberg, Raskar, Sig’06] 

– Veiling Glare Removal , [Talvala, Adams, Levoy, Sig’07] 

• Audio 

– Reverberation Analysis 

• Radar 

– Chirps for ranging 


17

Coded Aperture in Nature ? 


Turbellarian Worm 

Less is More .. 

• Coded Exposure 

– Motion Deblurring 

• Coded Aperture 

– Focus Deblurring 

• Optical Heterodyning 

– Light Field Capture 

• Coded Illumination 

– Motion Capture 

– Multi-flash: Cartoons 

[Figure labels: Projector, Tags, Pos=0, Pos=255] 

18

Computational Photography 

1. Epsilon Photography 

– Multi-photos by perturbing camera parameters 

– HDR, panorama 

– ‘Ultimate camera’: (Photo-editors) 

2. Coded Photography 

– Single/few snapshot 

– Reversible encoding of data 

– Additional sensors/optics/illum 

– ‘Scene analysis’ : (Consumer software?) 

3. Impossible Photos 

– Beyond single view/illum 

– Not mimic human eye 

– ‘New art form’ 

[ Agrawal, Raskar, Nayar, Li Siggraph05 ] 

No-flash 

Flash Result Reflection Layer 

Gradient Vector Projection 

19




20

Mask? 

Mask 

Sensor 

Mask 

Sensor 

Full Resolution Digital Refocusing: Coded Aperture Camera 

4D Light Field from 2D Photo: Heterodyne Light Field Camera 

Capturing Light Field Inside a Camera 

21

Capturing Light Field Inside a Camera 

Lenslet-based Light Field camera 

[Adelson and Wang, 1992, Ng et al. 2005 ] 

Stanford Plenoptic Camera [Ng et al 2005] 

Contax medium format camera 

Kodak 16-megapixel sensor 

Adaptive Optics microlens array 

125 μm square-sided microlenses 

4000 × 4000 pixels ÷ 292 × 292 lenses ≈ 14 × 14 pixels per lens 
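
The pixel budget quoted for the plenoptic camera can be checked with a few lines of arithmetic. This is only a restatement of the numbers on the slide; the variable names are illustrative.

```python
SENSOR_PIXELS_PER_SIDE = 4000
LENSLETS_PER_SIDE = 292

# Pixels behind each microlens along one side (not an exact integer).
pixels_per_lenslet = SENSOR_PIXELS_PER_SIDE / LENSLETS_PER_SIDE

# Rounded to the 14 x 14 figure used on the slide.
rays_per_lenslet = round(pixels_per_lenslet) ** 2

print(f"{pixels_per_lenslet:.1f} pixels per lenslet side")
print(f"{rays_per_lenslet} angular samples per lenslet")
print(f"{LENSLETS_PER_SIDE ** 2} spatial samples in the final image")
```

The trade-off is visible in the last line: the refocusable image has 292 × 292 spatial samples even though the sensor has 4000 × 4000 pixels.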

22

Digital Refocusing 

[Ng et al 2005] 

Can we achieve this with a Mask alone? 

Heterodyne Light Field Camera 

Mask 

Sensor 

Scanner sensor 

Mask 

23

How to Capture 4D Light Field with 2D Sensor ? 

What should be the pattern of the mask ? 

Optical Heterodyning 

[Diagram, radio analogy: Baseband Audio Signal, High Freq Carrier (100 MHz), Incoming Signal, 99 MHz, Reference Carrier, Receiver: Demodulation] 

[Diagram, camera: Object, Main Lens, Mask, Sensor; Photographic Signal (Light Field), Carrier, Incident Modulated Signal, Reference Carrier, Software Demodulation, Recovered Light Field] 

24

Captured 2D Photo: Encoding due to Cosine Mask 

Traditional Camera vs Heterodyne Camera 

Traditional Camera Photo -> 2D FFT -> Magnitude of 2D FFT 

Heterodyne Camera Photo -> 2D FFT -> Magnitude of 2D FFT 

25

Computing 4D Light Field 

2D Sensor Photo, 1800*1800 -> 2D FFT -> 2D Fourier Transform, 1800*1800 

9*9=81 spectral copies 

Rearrange 2D tiles into 4D planes, 200*200*9*9 -> 4D IFFT -> 4D Light Field, 200*200*9*9 
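
The "rearrange 2D tiles into 4D planes" step can be sketched as plain index bookkeeping. The function below is a hypothetical helper, not code from the talk: it covers only the rearrangement, leaves out the 2D FFT before it and the 4D IFFT after it, and assumes the spectral copies sit on a regular grid of equal square tiles. It returns the angular indices first, whereas the slide writes the shape as 200*200*9*9.

```python
def tiles_to_4d(spectrum, n_angular, n_spatial):
    """Split a square 2D array into n_angular x n_angular tiles.

    Returns lf with lf[i][j][u][v] == spectrum[i*n_spatial + u][j*n_spatial + v],
    where (i, j) selects the tile and (u, v) the sample inside it.
    """
    size = n_angular * n_spatial
    if len(spectrum) != size or any(len(row) != size for row in spectrum):
        raise ValueError("spectrum must be (n_angular * n_spatial) squared")
    return [
        [
            [
                [spectrum[i * n_spatial + u][j * n_spatial + v]
                 for v in range(n_spatial)]
                for u in range(n_spatial)
            ]
            for j in range(n_angular)
        ]
        for i in range(n_angular)
    ]


# Slide sizes: 9 tiles of 200 samples per side fill the 1800-sample spectrum.
assert 9 * 200 == 1800

# Toy example: a 6 x 6 array split into 3 x 3 tiles of 2 x 2 samples.
toy = [[r * 6 + c for c in range(6)] for r in range(6)]
lf = tiles_to_4d(toy, n_angular=3, n_spatial=2)
print(lf[1][2])   # the tile in tile-row 1, tile-column 2
```

In a real pipeline the tile ordering would also have to match how the spectrum is centred, which this sketch does not address.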

A Theory of Mask-Enhanced Camera 

Main Lens 

Object Mask Sensor 

• Mask == Light Field Modulator 

• Intensity of ray gets multiplied by Mask 

• Convolution in Frequency domain 
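
A small 1-D experiment shows the effect of these three bullets: multiplying a signal by a cosine mask places copies of its spectrum around the mask frequency. The sample count `N`, the carrier `F0` and the toy signal are assumptions for illustration; a real light signal would be non-negative and multi-dimensional.

```python
import cmath
import math

N = 32    # number of samples (assumption)
F0 = 8    # mask frequency in cycles per N samples (assumption)

# Toy band-limited signal: one cosine at 1 cycle per N samples.
signal = [math.cos(2 * math.pi * t / N) for t in range(N)]

# Physically realizable mask: transmittance between 0 and 1.
mask = [0.5 * (1 + math.cos(2 * math.pi * F0 * t / N)) for t in range(N)]

# The mask multiplies the intensity sample by sample.
modulated = [s * m for s, m in zip(signal, mask)]


def occupied_bins(x, tol=1e-6):
    """Indices of DFT bins whose magnitude exceeds tol."""
    n = len(x)
    mags = [
        abs(sum(v * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, v in enumerate(x)))
        for k in range(n)
    ]
    return [k for k, m in enumerate(mags) if m > tol]


print(occupied_bins(signal))      # [1, 31]
print(occupied_bins(modulated))   # [1, 7, 9, 23, 25, 31]
```

The original bins 1 and 31 survive, and shifted copies appear at F0 ± 1 and N - F0 ± 1, which is the convolution with the mask's impulses in the frequency domain.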

26

Related Work 

• Light Field Capture 

– Gortler et al., Levoy & Hanrahan, SIG’96, Isaksen et al. SIG’00 

– Light Field Microscopy: Levoy et al. SIG’06 

– Integral Photography 

• Lippmann’08, Ives’30, Georgiev et al. EGSR’06, Okano et al.’97 

– Camera arrays: Wilburn et al. SIG’05 

– Flatbed Scanner + Lenslet array: Yang, 2000 

– Light Field Video Camera: Wilburn et al.’02 

– Programmable Aperture: Liang et al. ICIP 2007 

– Plenoptic Camera 

• Wang and Adelson’92 

• Ng et al.’05 

Fourier Slice Theorem [Ren Ng, SIGGRAPH 2005] 

Photo = Slice of Light Field in Fourier Domain 

[Figure: band-limited light field in the Fourier plane, axes f x and f θ, band limits f x0 and f θ0, with the sensor slice] 

27

How to Capture 2D Light Field with 1D Sensor ? 

[Figure: Fourier Light Field Space, axes f x and f θ, band-limited light field with limits f x0 and f θ0, and the sensor slice] 

Extra sensor bandwidth cannot capture extra dimension of the light field 

[Figure: same axes, with the extra sensor bandwidth marked along the sensor slice] 

28

Solution: Modulation Theorem 

Make spectral copies of 2D light field 

[Figure: spectral copies of the light field in the f x, f θ plane, with the Modulation Function] 

29

Sensor Slice captures entire Light Field 

[Figure: Modulated Light Field in the f x, f θ plane, band limits f x0 and f θ0, with the Modulation Function] 

Demodulation to recover Light Field 

1D Fourier Transform of Sensor Signal 

Reshape 1D Fourier Transform into 2D 

30

Modulation Function == Sum of Impulses 

Physical Mask = Sum of Cosines 

[Figure: impulses of the modulation function in the f x, f θ plane, band limits f x0 and f θ0] 

Cosine Mask Used 

Mask Tile, 1/f 0 

31

Where to place the Mask? 

[Figure: Mask and Sensor at two mask positions, each with its Mask Modulation Function drawn in the f x, f θ plane] 

32


[Figure: Mask, Sensor, distances v and d, and the Mask Modulation Function at angle α] 

α = (d/v) (π/2) 
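
The formula can be evaluated directly. The slide only labels `d` and `v` in the figure, so the reading used here (`d` is the mask-to-sensor distance, `v` is the aperture-to-sensor distance, and 0 <= d <= v) is an assumption, as is the function name.

```python
import math


def mask_angle(d, v):
    """Angle alpha = (d / v) * (pi / 2) of the mask modulation function.

    Assumed meaning: d is the mask-to-sensor distance and v is the
    aperture-to-sensor distance, in the same units, with 0 <= d <= v.
    """
    if v <= 0 or not 0 <= d <= v:
        raise ValueError("expected v > 0 and 0 <= d <= v")
    return (d / v) * (math.pi / 2)


# Hypothetical distances in millimetres.
print(math.degrees(mask_angle(0.0, 50.0)))    # mask on the sensor
print(math.degrees(mask_angle(25.0, 50.0)))   # mask halfway
print(math.degrees(mask_angle(50.0, 50.0)))   # mask at distance v
```

Under that reading, α sweeps from 0 to π/2 as the mask moves from the sensor to distance v.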

Captured 2D Photo: Encoding due to Cosine Mask 

33


Only cone in focus 

Captured Photo 


34

Captured 2D Photo, divided by Image of White Lambertian Plane 

Result: Full resolution 2D image of Focused Scene Parts 
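
The division by a calibration image of a white Lambertian plane can be sketched as a per-pixel ratio. The toy data and the helper name `divide_by_calibration` are assumptions; the sketch models the mask as a fixed per-pixel attenuation, which is meant to apply only to the in-focus parts of the scene.

```python
def divide_by_calibration(photo, white, eps=1e-6):
    """Divide a photo by a calibration image, pixel by pixel.

    eps guards against division by zero where the calibration is dark.
    """
    return [
        [p / max(w, eps) for p, w in zip(photo_row, white_row)]
        for photo_row, white_row in zip(photo, white)
    ]


# Toy 2 x 2 example (all values are made up).
scene = [[0.2, 0.8], [0.5, 1.0]]            # in-focus scene brightness
attenuation = [[1.0, 0.5], [0.25, 0.5]]     # per-pixel effect of the mask

photo = [[s * a for s, a in zip(s_row, a_row)]
         for s_row, a_row in zip(scene, attenuation)]
white = attenuation                          # photo of a uniform white plane

recovered = divide_by_calibration(photo, white)
print(recovered)
```

In this idealized model the ratio returns the scene values exactly; with real photos, noise in the dark regions of the calibration image would be amplified.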

MERL 

Mask-Enhanced Cameras: Heterodyned Light Fields & Coded Aperture 

Veeraraghavan, Raskar, Agrawal, Mohan & Tumblin 

Differences with Plenoptic Camera 

Plenoptic Camera (Microlens array, Sensor) 

• Micro-lens array 

• Samples individual rays 

• Needs high alignment precision 

• Wasted pixels 

Heterodyne Camera (Mask, Sensor) 

• Narrowband Cosine Mask 

• Samples coded combination of rays 

• More flexible 

• No wastage 

35

Coding and Modulation in Camera Using Masks 

Mask? 

Sensor 

Mask 

Sensor 

Mask 

Sensor 

Coded Aperture for Full Resolution Digital Refocusing 

Heterodyne Light Field Camera 

Coded Imaging 








36

Projector-based Displays 

[Figure: timeline of projector-based displays, 1998 to 2003, for Single Projector and Multiple Projectors: Planar, Non-planar, Curved, Objects, Pocket-Proj] 

Vicon Optical Motion Capture 

Body-worn markers, High-speed IR Camera 

Medical Rehabilitation, Athlete Analysis, Performance Capture, Biomechanical Analysis 

37

Coded Illumination: High Speed Motion Capture 

- To increase tracking speed, code position: Non-colocated emitters 

- To work in Ambient Light, code time: 455 kHz modulation 

- Invisible, code wavelength: Infrared 

[Figure labels: Projector, Tags, Pos=0, Pos=255] 

Projector 

Light Meters 

Pos=0 

Tags 

- Distributed, wireless 

- Real-time location 

- Incident light reading 

Pos=255 

- Annotate Event Photos 

- Coded Illumination 

- Capture image location of imperceptible tags 

- Works in ambient light, 500 Hz 

38

Labeling Space 

Each location receives a unique temporal code 

But 60 Hz video projector is too slow 

Fast Switching using Non-colocated Emitters for Structured Light 

Fixed Masks + Blinking LEDs 

Time multiplex, Freq or CDMA ? 

[Figure labels: Projector, Tags, Tag, Pos=0, Pos=255] 

39

Fast Switching using Non-colocated Emitters for Structured Light 

How Labeling Works 

Light 

source 

Optics 

Screen 

GrayCode Mask 

pos=0 

pos=15 

The light sources blink one by one, so each position on the screen receives a different light pattern. 

4 lights give 4-bit position resolution (pos=0 to pos=15). 
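
The labeling scheme can be sketched with the standard binary-reflected Gray code: each of the 4 LEDs contributes one bit, and a tag that records which LEDs it saw can decode its position. The function names and the choice of the first LED as the most significant bit are assumptions for illustration.

```python
def to_gray(n):
    """Binary-reflected Gray code of a non-negative integer."""
    return n ^ (n >> 1)


def from_gray(g):
    """Inverse of to_gray."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n


def led_pattern(pos, bits=4):
    """Bits a tag at position pos sees, one per LED (first LED = MSB)."""
    g = to_gray(pos)
    return [(g >> (bits - 1 - b)) & 1 for b in range(bits)]


def decode_position(seen_bits):
    """Recover the position from the sequence of bits a tag recorded."""
    g = 0
    for bit in seen_bits:
        g = (g << 1) | bit
    return from_gray(g)


for pos in range(16):
    print(pos, led_pattern(pos))
```

Neighbouring positions differ in exactly one bit, so a tag sitting on the boundary between two stripes is off by at most one position.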

40


LED 

Optics 

Screen 

GrayCode Mask 

pos=0 

pos=15 

1 LED for 1 Bit pattern 




41







42

Coded Illumination Projector 

Focusing Optics 

Condensing Optics 

Light Source 

Gray code Slide 

The Gray code pattern 

Photosensing Tag 

43

2D Location 

3D Location 

X data 

X2 data 

X data 

Y data 

Y data 

Emitter Complexity 

Optical 

Motion 

Capture 

Receiver Complexity 

44

Imperceptible tags, Ambient Lighting, Id per marker 

Prakash [Raskar, Nii, Summet et al Siggraph 2007] 

High Speed Tracking 

45

Lightmeters: Realistic Editing + Blurring 

46

Coded Illumination for Motion Capture 

• 500 Hz Tracking 

• Id for each Marker Tag 

• Capture in Natural Environment 

– Visually imperceptible tags 

– Photosensing Tag can be hidden under clothes 

– Ambient lighting is ok 

• Unlimited Number of Tags Allowed 

• Base station and tags cost only a few tens of dollars 

Acknowledgements 

• Amit Agrawal, MERL 

• Jack Tumblin, Northwestern U. 

• Shree Nayar, Columbia U. 

• MERL 

– Jay Thornton, Keisuke Kojima 

• Mitsubishi Electric Japan 

– Kazuhiko Sumi, Haruhisa Okuda 

• Coded Aperture and Light Field 

– Ashok Veeraraghavan, Ankit Mohan 

• Prakash, Motion Capture 

– Masahiko Inami, Hideaki Nii, Yuki Hashimoto, Jay Summet, Erich Bruns, Paul Dietz, Bert de Decker, Philippe Bekaert 

• Prof Yagi, Prof Ikeuchi and ACCV Organizers 

47

Future of Coding Light 

• How to block light in other ways? 

– Time, Space, Illumination .. Wavelength? On Sensor? 

• What other blockers? 

– Dynamic masks (LCDs), non-planar or colored masks? 

• Applications 

– Estimate params in presence of low pass convolution 

– Light Field Applications: lens aberration, microscopy 







Coded Photography 



– Multi-flash: Shape Contours 

Projector 

Pos=0 

Tags 

• Epsilon->Coded->Impossible Photos 

Pos=255 

48

Blind Camera 

Sascha Pohflepp, University of the Arts, Berlin, 2006 

49








50

Multi-flash Camera for Detecting Depth Edges 

Left Top Right Bottom 

Depth 

Edges 

Canny Edges 

Depth Edges 

51

Car Manuals 

52

What are the problems with ‘real’ photo in conveying information ? 

Why do we hire artists to draw what can be photographed ? 

Shadows 

Clutter 

Many Colors 

Highlight Shape Edges 

Mark moving parts 

Basic colors 

53

Shadows 

A New Problem 

Highlight Edges 

Clutter 

Mark moving parts 

Many Colors 

Basic colors 

Gestures 

Input Photo Canny Edges Depth Edges 

54

Depth Edges with MultiFlash 

Raskar, Tan, Feris, Jingyi Yu, Turk – ACM SIGGRAPH 2004 

55

Depth Discontinuities 

Internal and external 

Shape boundaries, Occluding contour, Silhouettes 

57

Depth 

Edges 

Canny 

Our Method 

58

Photo 

Result 

Our Method 

Canny Intensity 

Edge Detection 

59
