Achieving Robotic Data Efficiency Through Machine-Centric FDCT Vision Processing

  • Yair Wiseman

Research output: Contribution to journalArticlepeer-review

Abstract

To enhance a robot’s capacity to perceive and interpret its environment, an advanced vision system tailored specifically for machine perception was developed, moving away from human-oriented visual processing. This system improves robotic functionality by incorporating algorithms optimized for how computerized devices process visual information. Central to this paper’s approach is an improved Fast Discrete Cosine Transform (FDCT) algorithm, customized for robotic systems, which enhances object and obstacle detection in machine vision. By prioritizing higher frequencies and eliminating less critical lower frequencies, the algorithm sharpens focus on essential details. Instead of adapting the data stream for human vision, the FDCT and quantization tables were adjusted to suit machine vision requirements, achieving a file size reduction to about one-third of the original while preserving highly relevant data for robotic processing. This innovative approach significantly improves robots’ ability to navigate complex environments, perform tasks such as object recognition, motion detection, and obstacle avoidance with greater accuracy and efficiency.

Original languageEnglish
Article number518
JournalSensors
Volume26
Issue number2
DOIs
StatePublished - Jan 2026
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2026 by the author.

Keywords

  • FDCT
  • H.264
  • quantization tables
  • real-time data processing
  • robots
  • vision system
  • visual perception

Fingerprint

Dive into the research topics of 'Achieving Robotic Data Efficiency Through Machine-Centric FDCT Vision Processing'. Together they form a unique fingerprint.

Cite this