屈喜文, 韩瑶妹, 胡冕军. 基于单目视觉的空中手写数据采集系统[J]. 信息与控制, 2024, 53(3): 339-352. DOI: 10.13976/j.cnki.xk.2024.3066
引用本文: 屈喜文, 韩瑶妹, 胡冕军. 基于单目视觉的空中手写数据采集系统[J]. 信息与控制, 2024, 53(3): 339-352. DOI: 10.13976/j.cnki.xk.2024.3066
QU Xiwen, HAN Yaomei, HU Mianjun. In-air Handwriting Data Acquisition System Based on Monocular Vision[J]. INFORMATION AND CONTROL, 2024, 53(3): 339-352. DOI: 10.13976/j.cnki.xk.2024.3066
Citation: QU Xiwen, HAN Yaomei, HU Mianjun. In-air Handwriting Data Acquisition System Based on Monocular Vision[J]. INFORMATION AND CONTROL, 2024, 53(3): 339-352. DOI: 10.13976/j.cnki.xk.2024.3066

基于单目视觉的空中手写数据采集系统

In-air Handwriting Data Acquisition System Based on Monocular Vision

  • 摘要: 针对基于视觉的空中手写系统使用的3维传感器体积庞大、价格昂贵的缺点, 提出了一种新型的基于单目视觉的空中手写数据采集系统, 该系统用体积小、价格便宜的单目摄像头代替Kinect、Leap Motion等3维传感器, 更适合于手机、空调、电视等家用设备集成和大规模推广使用。为了解决目前基于单目视觉的空中手写系统每次只能写入单个字符或短文本, 用户书写文本长度受到限制的问题, 提出了一种坐标系虚拟滑动技术。首先, 该系统使用背景差分法和HSV(hue, saturation, value)颜色空间阴影消除法从2维视频帧中获取完整的手部轮廓; 其次, 利用手的结构特点将手掌和手指进行分割, 利用构建的指尖模板从分割出的手指中进行匹配来确定指尖的位置; 最后, 连接连续视频帧中的指尖位置形成空中手写字符轨迹, 通过提出的坐标系虚拟滑动技术使用户可以进行连续书写从而形成文本。实验结果表明, 所提系统可以让用户自由地在空中连续书写, 对指尖的检测准确率达到了96.8%。

     

    Abstract: Aiming at the shortcomings of the large and expensive three-dimensional (3D) sensor used in a vision-based in-air handwriting system, we propose a new type of in-air handwriting data acquisition system based on monocular vision. The system uses a small and cheap monocular camera to replace the 3D sensor such as Kinect and Leap Motion, which is more suitable for integration and large-scale promotion with household devices such as mobile phones, air conditioners, and televisions. To solve the problem that existing in-air handwriting systems based on monocular vision can only write a single character or short text each time, and the length of the user's written text is limited, we propose a coordinate system virtual sliding technology. First, the system uses the background difference method and the HSV color space shadow elimination method to obtain the complete hand contour from the two-dimensional video frame. Second, we separate the palm and finger using the structural characteristics of the hand. Using the constructed fingertip template, we determine the position of the fingertip by matching the segmented fingers. Finally, we connect the fingertip positions in continuous video frames to form in-air handwritten character trajectories. Through the proposed coordinate system virtual sliding technology, users can write continuously to form texts. The experimental results show that the system can allow users to write continuously in the air freely, and the detection accuracy of fingertips reaches 96.8%.

     

/

返回文章
返回