Discover how computer vision evolved from handcrafted algorithms to multimodal AI and why the future of visual intelligence is moving to the edge.

Written by

Calin Ciobanu
Co-founder and CTO
Computer vision has evolved more in the last 15 years than in the previous 50.
From handcrafted algorithms to multimodal AI models capable of interpreting both text and imagery, we’ve moved from teaching machines to “see” to enabling them to “understand.”
But this evolution hasn't been linear; it has happened in distinct stages. And understanding that journey helps explain where the next breakthroughs will come from.
1. The Era of Handcrafted Algorithms (pre-2010)
Before deep learning, computer vision was dominated by handcrafted features and mathematical heuristics.
Engineers relied on predictable, explainable algorithms: edge detection, segmentation, and statistical pattern recognition, all of which could be designed and implemented mathematically.
These systems worked, but only in narrow conditions.
They lacked adaptability and struggled to generalize beyond what they were explicitly programmed to see.
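To make the "handcrafted features" idea concrete, here is a minimal sketch of one classic technique mentioned above, the Sobel edge detector: a fixed 3x3 filter pair, designed by hand rather than learned from data. The image values and helper names are illustrative, not from any particular library.

```python
import math

# Hand-designed Sobel kernels: horizontal and vertical gradient filters.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]

def sobel_magnitude(image):
    """Gradient magnitude at each interior pixel of a 2D grayscale image."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(SOBEL_X[j][i] * image[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(SOBEL_Y[j][i] * image[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            out[y][x] = math.hypot(gx, gy)
    return out

# A vertical step edge: left half dark, right half bright.
img = [[0, 0, 0, 255, 255, 255] for _ in range(5)]
edges = sobel_magnitude(img)
```

The filter responds strongly only at the dark-to-bright transition, which illustrates both the strength and the brittleness of this era: the detector is fully explainable, but it only ever finds what its designer anticipated.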
2. The Neural Network Revolution (2010–2020)
Everything changed around 2012 with ImageNet and AlexNet.
This was when Geoffrey Hinton, Fei-Fei Li, and other researchers proved that neural networks could outperform traditional algorithms by a wide margin.
The world shifted to Convolutional Neural Networks (CNNs): lightweight, flexible architectures capable of detecting, recognizing, and classifying images across thousands of categories.
CNNs brought computer vision to phones, cars, and retail cameras. They made AI practical.
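The key difference from the previous era is that a CNN's filters are learned from data rather than designed by hand, but the underlying operation is the same sliding-window convolution. A minimal sketch of that core building block, with an illustrative (not learned) kernel:

```python
def conv2d(image, kernel):
    """Valid-mode 2D convolution: no padding, stride 1."""
    kh, kw = len(kernel), len(kernel[0])
    h, w = len(image), len(image[0])
    out = []
    for y in range(h - kh + 1):
        row = []
        for x in range(w - kw + 1):
            row.append(sum(kernel[j][i] * image[y + j][x + i]
                           for j in range(kh) for i in range(kw)))
        out.append(row)
    return out

def relu(feature_map):
    """Nonlinearity applied after each convolution layer."""
    return [[max(0.0, v) for v in row] for row in feature_map]

# In a trained CNN this kernel would be learned; here it is a toy filter
# that responds to bright-above-dark horizontal transitions.
kernel = [[1, 1, 1], [0, 0, 0], [-1, -1, -1]]
image = [[10] * 4, [10] * 4, [0] * 4, [0] * 4]
feature_map = relu(conv2d(image, kernel))
```

Stacking many such layers, each with dozens of learned filters, is what let CNNs generalize where handcrafted pipelines could not.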
Then came transformers first in language (Attention Is All You Need, 2017), then in vision.
Transformers can outperform CNNs in accuracy and flexibility, but at a cost: they are computationally heavy, memory-intensive, and dependent on powerful GPUs.
Despite that, they’ve become the backbone of the latest Vision Transformers (ViTs), powering everything from autonomous vehicles to large-scale image analytics.
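The cost mentioned above comes largely from the attention mechanism itself: every token (or image patch) scores itself against every other, so compute grows quadratically with sequence length. A minimal sketch of scaled dot-product attention, the operation at the core of "Attention Is All You Need"; the tiny Q/K/V values are illustrative:

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(Q, K, V):
    """Scaled dot-product attention: each query attends over all keys,
    so n queries against n keys costs O(n^2) score computations."""
    d = len(Q[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in K]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out

# One query "patch" attending over two keys; it matches the first key
# more strongly, so the output leans toward the first value vector.
Q = [[1.0, 0.0]]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[10.0, 0.0], [0.0, 10.0]]
out = attention(Q, K, V)
```

For a ViT, a higher-resolution image means more patches, and that quadratic score matrix is exactly why these models demand powerful GPUs at scale.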
3. The Multimodal Era
The newest frontier combines vision and language.
These multimodal models can look at an image and describe it in words or generate new images from text.
They’re incredibly capable, but also incredibly heavy. Running them at scale requires massive compute power, which brings us to the most exciting direction today: edge computing.
4. The Shift to the Edge
The next big challenge is no longer about making AI smarter; it's about making it smaller.
Over the last year, Apple, Samsung, Meta, and Microsoft have all published papers showing how large models can be compressed to run locally on phones.
This is the shift I find most exciting because it directly intersects with what we’re building at OmniShelf.
At OmniShelf, we’ve managed to compress and optimize computer vision models so efficiently that they can run on hardware equivalent to a Samsung S7 while maintaining over 95% recognition accuracy.
That means scanning hundreds of products on a shelf, live, in under 15 seconds.
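OmniShelf's compression pipeline isn't detailed here, so the following is only a generic sketch of one standard technique behind running large models on phone-class hardware: post-training 8-bit quantization, which stores float32 weights as int8 for roughly a 4x size reduction (real pipelines combine this with pruning, distillation, and per-channel scales).

```python
def quantize(weights):
    """Map a list of float weights to int8 range [-127, 127]
    using a single symmetric scale factor."""
    scale = max(abs(w) for w in weights) / 127.0 or 1.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights at inference time."""
    return [v * scale for v in q]

# Illustrative weight values, not from any real model.
weights = [0.82, -0.41, 0.05, -1.27, 0.33]
q, scale = quantize(weights)
restored = dequantize(q, scale)
```

Each weight now fits in one byte instead of four, and the rounding error is bounded by half the scale factor, which is why accuracy can stay high even after aggressive compression.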
The implications for retail are huge.
We can deliver real-time insights without relying on cloud processing, reducing latency, cost, and data transfer, all while improving privacy and resilience.
Even with all this progress, one limitation remains: data quality.
Computer vision models, no matter how advanced, don’t have common sense.
They operate on probability, so if the data is biased or of poor quality, the results will be too.
That’s why at OmniShelf, our focus isn’t only on the model architecture but also on domain-specific, high-quality data pipelines. Because the smarter the data, the smarter the AI.
We're entering an era where AI doesn't just live in the cloud; it lives in the devices around us.
Computer vision is becoming distributed, efficient, and context-aware.
For me, that's what makes this field so fascinating: we're finally reaching a point where machines can "see" as fast as we can, and increasingly, right where we are.
At OmniShelf, we're extending the frontier of computer vision by making cutting-edge AI accessible at the edge. Stay tuned: the next phase of visual intelligence is happening right on the shelf.