Introduction to Augmented Reality

Technology Conference


Fivos DOGANIS

whoami

🤝 linkedin.com/in/fivosdoganis
🧑‍🔬 fivos.doganis@3ds.com
📧 fivos.doganis@gmail.com

cover photo: www.instagram.com/steveroe_

University of Hull

  • Master of Science by Research (2001)
    Augmented Reality in Archaeology: Registration Issues

IRCAD (2002 - 2003)

  • Institut de Recherche contre les Cancers de l'Appareil Digestif
  • Startup
    • Virtual-Surg team
  • Augmented Reality Research Engineer

Dassault Systèmes (2003+)

  • 3D Visualization Engineer
    • Scenegraph, Materials
    • Geometry, Tessellation
  • Virtual and Augmented Reality (XR) Engineer
  • XR Research Engineer
  • XR Research Manager

Dassault Systèmes

From Shape to Life

Course prerequisites

  • Math 🔢
  • Programming 💻
    • JavaScript notions, or any similar language (HTML kept minimal)
  • 3D Web API 🧊
    • THREE.js notions strongly recommended (see Web 3D course)
    • alternatives: Babylon.js, WebGL, WebGPU
  • Desktop + Smartphone (or, even better, a XR HMD!) 📱 🥽

📅 Planning

  • AR Intro (20 min)

  • Theory (30 min)

    • THREE.js intro (10 min)
    • WebXR intro (10 min)
  • Practice (1.5 hours)

    • Simplest 3D webpage (Canvas, WebGL, THREE.js)
    • SceneGraph
    • Simple XR app (interactions, environment)

Definitions

Reality
Virtual Reality
Augmented Reality
Mixed Reality
Extended Reality
Metaverse
Spatial Computing
...

Definitions

Milgram, Paul; H. Takemura; A. Utsumi; F. Kishino (1994). "Augmented Reality: A class of displays on the reality-virtuality continuum".

Google's version

Properties of an AR system ⭐

(according to Azuma, 2001)

  • combines real and virtual objects in a real environment
  • runs interactively, and in real time
  • registers (aligns) real and virtual objects with each other.

Not AR:

  • special effects in movies
    • technology close to AR
    • not real time ❌
    • not in a real environment ❌
  • Google Glass
    • combines real and virtual objects in a real environment ✅
    • no registration ❌
    • it's a HUD (Head-Up Display)
      • can still be useful! (maintenance, sports etc.)

Google Glass concept video (2012)

This is not AR!!!

Definitions ⭐

  • VR : Virtual Reality Jaron Lanier, 1987

  • AR : Augmented Reality Thomas P. Caudell, 1990

  • MR : Mixed Reality

    • marketing term used by Microsoft
    • ⚠️ no clear definition! ➡️ Term must be defined before use!
  • XR : X = { eXtended / Cross (+) / Any (*) / A+V } Reality

    • recent generic term which encompasses AR and VR

AR | VR

AR | VR

Effects of VR

  • Isolates the user from the real world
  • Teleports the user to another world, which is entirely virtual

Tiltbrush

ITI

Effects of AR

  • The user stays in the real world
  • AR enhances the real world with contextual information
  • Augmented user: acquires new senses!
  • Information becomes visible
    • spatialized information overlaid on top of the real world

Google Maps AR

The browser of the future? 🧐

Audi AR manual (Metaio, 2012)

Takeaways ⭐

  • VR immerses the user in a virtual wolrd
  • AR brings virtual objects into the real world

Choosing the right paradigm

  • Immersion useful ?

    • Yes ➡️ VR
    • No ➡️ 3D
  • Immersion and real environment useful ?

    • Yes ➡️ AR
    • No ➡️ VR
  • Keep in mind continuum to pick the right paradigm to create the best possible experience

Choosing the right paradigm

History

Understand technological evolutions to anticipate the future

History

Key milestones

Prehistory (1966) ⭐

Markers (1999)

  • Monochrome markers
  • ARToolkit created by Hirokazu Kato

  • Alternatives: ARTag, ArUco
  • PC + Webcam

NFT, GPS (2005)

  • NFT: Natural Feature Tracking
    • Color photo tracking
  • Wikitude, Layar (GPS)
    • no image processing needed with GPS!
  • Vuforia
  • Marketing use-cases
  • PC, mobile phones, tablets

SLAM, 3D (2015)

  • 3D environment tracking
  • SLAM: Simultaneous Location And Mapping ⭐
  • 3D object tracking
  • Deep Learning
  • Occlusion 3D
  • ARKit, ARCore
  • Smartphones, HoloLens, Azure Kinect

Azure Kinect + HoloLens 2

HoloLens 2

Apple LiDAR

iPad Pro 2020, iPhone 12 Pro

Apple LiDAR vs FaceID

Near future

  • Form-factor: glasses 😎
  • AI
    • contextual assistance
    • understands both environment and user
  • 5G
    • application and information streaming (Edge Computing)
  • Spatialized Web: AR Cloud

Far future

  • AR will replace or complement smartphones
    • users will raise their heads again
      • but will they see better?
  • Contact Lens (Mojo Vision)
  • Ambient Computing
  • Ubiquitous Computing
  • Smart Cities

MAMAA Strategies

Meta, Alphabet, Microsoft, Apple, Amazon
and others!

AR [is for] adding shared meaning in the interaction between people.
Johnny Lee, Google I/O 2017

  • Dropped mobile VR (Cardboard 💀, Daydream 💀)
  • Dropped Tango 💀, to reach more devices: rely on RGB camera + AI
  • API ARCore, competes with Apple's ARKit
  • Google wants to provide cross-platform AR services
  • Google + Qualcomm + Samsung XR Headset coming in 2025

The Web connects the world's information, and AR connects information with the physical world. So together they can be applied to solve real life problems.
Andrey Doronichev, Google I/O 2017

Google Glass Enterprise Edition 2

Google Glass killed, for the second time, in 2023 💀

Gorillaz mobile AR app

I’m excited about AR [...] My view is it’s the next big thing, and it will pervade our entire lives.
Tim Cook, Apple CEO, 2020, via Silicon Republic

Hardware

  • Adds LiDAR for a robust SLAM (e.g. white walls scenario)

  • Extends its 'wearables' category

    • AirPods
    • Apple Watch
    • Apple Vision Pro
      • unveiled in June 2023
      • released in February 2024

People Occlusion + Scene Understanding (iOS)

fill

Eye tracking (visionOS)

"Spatial Computing", "EyeSight", Real Virtual Continuum

Avatars

Collaboration

Oculus Infinite Office

fill

Meta Quest 3: focus on AR

Reverse Passthrough prototype (CAD render)
video

fill

Michael Abrash in 2019

Project Aria

Project Orion (2024)
$10 000 prototype

Tracking techniques

  • Marker
    • accurate, fast ✅
    • tangible, printable ✅
    • need to display a marker to enable AR ❌
    • non-aesthetic ❌
    • can be hard to detect (low lighting, motion blur, occlusions) ❌
  • NFT: same but
    • more aesthetic, easier to embed in the real world (ads) ✅
    • more robust to occlusions ✅

Valve VR HMD early prototype

Tracking techniques

  • SLAM ⭐ : NFT evolution + reconstruction
    • more natural markerless experience ✅
    • partial scene reconstruction ✅
      • allows advanced functionalities (occlusions, collisions etc.)
    • not very accurate ❌
      • drift, loop closure
      • scene reconstructed and refined in real-time
      • difficult to define the origin of the scene
        • stable anchor points required

Tracking techniques

  • 3D object detection in a real scene
    • using computer vision (lighting, edges, silhouette)
      • generic algorithm ✅
      • but slow, especially during initial registration ❌
    • using Deep Learning
      • faster initial detection ✅
      • more robust regarding occlusions and lighting changes ✅
      • not generic: requires per model training ❌

Tracking techniques

Conclusion

  • No tracking technique is ideal
  • Keep them all in mind and choose the right one according to:
    • the scenario of the AR experience
      • industrial context, consumer, generic or specific
    • constraints
      • indoor, outdoor, mobile

Rendering

Rendering

  • Realistic or not
  • Lighting
    • detect the direction and intensity of real lights
    • fast environment reconstruction to simulate reflections (SLAM + AI)
  • Occlusions

Interactions

  • The missing part of the equation
  • Often neglected (cf. NReal)
  • Myth of the dying mouse (p. 17)
    • each form factor has an optimal interaction technique
    • most headsets handle hand tracking, but also offer controller, keyboard and mouse support!
  • The XR equivalent of the mouse has not been invented yet!

Interaction techniques

  • Screen, when using a smartphone 📱
    • not very immersive but accurate, and provides tactile feedback
  • Controllers with buttons 🎮
    • great haptic feedback but not immersive
  • HoloLens GGV : Gaze, Gesture, Voice 👀 ✋ 👄
    • natural interactions, with no external hardware
    • great but tiring, lacks privacy ("hey Cortana!"), and accuracy
  • Tangible interactions ✏️ 🔲
    • markers or accessories to add some tactile feedback ⬇️

Interactions

Conclusion

  • Immersive AR interactions have yet to be invented!
  • No interaction paradigm has become a standard yet
  • We must guide the users and try to understand their intent

End

of part 1!

Questions?

3D Reconstruction

Photo Credits

https://unsplash.com/photos/RgPVZvA4wBM

https://unsplash.com/photos/r2CAjGQ0gSI

www.instagram.com/steveroe_

# ![height:110px](https://www.isep.fr/wp-content/themes/isep/img/logo_isep.svg)

![bg right](https://d3i71xaburhd42.cloudfront.net/2f341ad9dcd2a14822db65acd8862f05301b9e2e/3-Figure3-1.png)

![bg right](https://veronikasadventure.com/wp-content/uploads/2024/02/2_acropolis-3d-representations-audio-self-guided-tour-w-ar.jpg)

![bg right](https://d201n44z4ifond.cloudfront.net/wp-content/uploads/sites/6/2019/10/16165332/Screen-Shot-2019-10-15-at-12.56.22-PM.png)

![bg right](https://blogs.3ds.com/perspectives/wp-content/uploads/sites/22/image_o2.jpg)

![bg right](https://www.3ds.com/assets/invest/styles/profile/public/2021-11/virtual-twin-healthcare.jpg.webp?itok=RF1_IOmU)

![bg left:50%](https://events.3ds.com/sites/default/files/styles/playground_experience_medium/public/2019-03/immersive-collaboration-experience-tribe-2-experience-1_0.PNG)

![bg](https://media.wired.com/photos/59267509f3e2356fd80094d1/master/pass/Singapore_Bishan_windsim_HP.jpg)

![bg fit](https://mms.businesswire.com/media/20141112006720/en/441072/4/Dassault_Systemes_Living_Heart_1.jpg)

![bg 110%](http://www.cao.fr/images_cp/plein/6772.jpg)

https://marknb00.medium.com/what-is-mixed-reality-60e5cc284330

fr

![bg fit](https://i0.wp.com/www.createursdemondes.fr/wp-content/uploads/2017/03/arPrincipe.jpg)

![bg right](https://thumbor.sd-cdn.fr/hvoAvvyScQUi2YBbwZMOdoskUTY=/fit-in/540x1000/cdn.sd-cdn.fr/wp-content/uploads/2019/12/cropped_cat_on.gif)

![bg right fit](https://i.redd.it/93fuwypcyp941.png)

nreal

edge computing

![bg fit](https://d3i71xaburhd42.cloudfront.net/bc1aeff95e4b856f7c867450cb98a33bf3d2ada6/9-Figure9-1.png)

![bg](https://thumbs.gfycat.com/GeneralSmugAmericanwarmblood-size_restricted.gif)

openvslam https://github.com/xdspacelab/openvslam/issues/108

![bg 140%](https://i.ytimg.com/vi/jNbYcw_dmcQ/maxresdefault.jpg)

Cover

https://unsplash.com/photos/muiuZ6cKtlA https://unsplash.com/photos/6Avhuh6UP2Y https://unsplash.com/photos/UVP-NlZEf0Y https://unsplash.com/photos/Ib2e4-Qy9mQ https://unsplash.com/photos/3MjyZPUZKIQ

Project cover

https://unsplash.com/photos/msnyz9L6gs4 https://unsplash.com/photos/T6BsBZdGwbg https://unsplash.com/photos/8r3Otv1zy0s https://unsplash.com/photos/eft_khJJgug https://unsplash.com/photos/qnBMlkav-j8 https://unsplash.com/photos/QJv-TlL1T9M https://unsplash.com/photos/KBDTG8IvlpI https://unsplash.com/photos/beIw89byFlw https://unsplash.com/photos/bs4qtd2NsGI https://unsplash.com/photos/lPbq-op9zno https://unsplash.com/photos/6vEqcR8Icbs https://unsplash.com/photos/qRkImTcLVZU https://unsplash.com/photos/Evp4iNF3DHQ https://unsplash.com/photos/7wBFsHWQDlk https://unsplash.com/photos/Vq2HnMA0Bp4 https://unsplash.com/photos/V_7xg72F3ls https://unsplash.com/photos/RPFL38ZZikA https://unsplash.com/photos/Ksn5ggA3L8s https://unsplash.com/photos/9Eheu3sIgrM