Vision-based tactile sensors (VBTSs) are widely used in robotic tasks, because of the high spatial resolution they offer and their relatively low manufacturing costs. However, variations in their sensing mechanisms, structural dimension, and other parameters lead to significant performance disparities between VBTSs currently in use. This makes it challenging to optimize VBTSs for specific tasks, as both the initial choice and subsequent fine-tuning are hindered by the lack of standardized metrics. To address this issue, we present TacEva, a comprehensive evaluation framework for the quantitative analysis of VBTS performance. We define a set of performance metrics that capture and quantify the key characteristics displayed in typical application scenarios. For each metric, we designed an experimental pipeline that provides a structured procedure for performance quantification. We then applied this evaluation approach to multiple VBTSs with distinct sensing mechanisms. The results show that the proposed framework yields a thorough evaluation of each design, and provides quantitative indicators for each performance dimension. This enables researchers to pre-select the most appropriate VBTS on a task by task basis, and also offers performance-guided insights for the optimization of VBTS design.
This interactive demo illustrates how different VBTS sensing mechanisms respond to a ball pressing a soft surface: IMM (intensity + lighting), MDM (marker displacement), and IMM+MDM (combined).
Drag the ball in any side-view panel.
Side View
Camera View
Side view: ball presses gel, producing contact shadow + photometric intensity changes.
Side View
Camera View
Side view markers spread laterally and compress vertically as deformation grows.
Side View
Camera View
Hybrid side view overlays IMM lighting behavior with MDM marker displacement.
Side View
Camera View
Transparent-surface effect: background remains visible while marker displacement is tracked.
This table summarizes reported metric coverage across VBTS works from the appendix (Table A1), grouped by the sensing-mechanism taxonomy in ref. [8]: IMM, MDM, and MFM (including hybrids). Application-level interpretation (Table A2) is shown later below the main evaluation sections.
| Mechanism | Work / Sensor | Res | FOV | Gel | FPS | Calib | SR | S | Rep | Rspatial | Rlight |
|---|---|---|---|---|---|---|---|---|---|---|---|
| IMM | DIGIT [29] | Y | Y | N | Y | N | Y | N | N | N | N |
| IMM | DTact [49] | Y | Y | Y | Y | Y | Y | Y | N | N | Y |
| IMM | OmniTact [30] | Y | Y | N | Y | N | N | N | N | N | N |
| IMM | Minsight [50] | Y | Y | Y | Y | Y | Y | Y | N | Y | N |
| IMM | ThinTact [32] | Y | Y | Y | Y | N | Y | P | N | P | N |
| IMM | InSight [51] | Y | Y | Y | Y | Y | P | Y | N | Y | N |
| IMM | GelTip [34] | N | N | Y | N | P | Y | N | N | N | N |
| MDM | GelForce [28] | Y | Y | Y | Y | Y | Y | Y | N | N | N |
| MDM | ChromaTouch [52] | Y | Y | Y | Y | Y | N | Y | N | N | N |
| MDM | TacTip [10] | Y | Y | Y | Y | Y | P | Y | N | Y | N |
| MDM | Deltac [53] | Y | Y | Y | N | Y | N | Y | N | P | N |
| MDM | Tac3D [54] | Y | Y | Y | Y | Y | N | Y | N | N | N |
| IMM+MDM | GelSight [3] | Y | Y | Y | Y | Y | Y | N | N | N | N |
| IMM+MDM | GelSlim [39] | Y | Y | N | Y | Y | N | N | Y | Y | Y |
| IMM+MDM | UV-Tac [55] | Y | Y | Y | Y | Y | P | Y | N | P | Y |
| IMM+MDM | DenseTac 2.0 [31] | Y | Y | Y | Y | Y | N | N | N | N | N |
| IMM+MFM | VisTac [56] | Y | Y | Y | Y | P | P | N | N | N | N |
| IMM+MFM | Finger-STS [57] | Y | Y | Y | N | N | Y | N | N | N | N |
| IMM+MFM | TIRgel [58] | Y | Y | N | Y | N | Y | N | N | N | Y |
| IMM+MFM | HiVTac [59] | Y | Y | Y | Y | Y | N | Y | N | Y | N |
| MDM+MFM | ViTacTip [11] | Y | Y | N | N | Y | Y | Y | N | N | Y |
| MDM+MFM | FingerVision [12] | Y | Y | Y | Y | N | N | P | N | P | P |
| MDM+MFM | SpecTac [60] | N | Y | Y | Y | Y | N | N | N | N | N |
| MDM+MFM | VPTS [61] | Y | Y | Y | Y | Y | N | N | N | N | N |
| IMM+MDM+MFM | MagicTac [41, 62] | Y | Y | Y | Y | Y | Y | N | Y | N | Y |
| IMM+MDM+MFM | F-Touch [9] | Y | Y | Y | Y | Y | N | N | Y | N | Y |
| Evaluation / Review | PBR-Design [36] | Y | Y | Y | Y | Y | Y | N | N | Y | Y |
| Evaluation / Review | VT-Review [63] | N | N | N | N | N | N | N | N | N | N |
| Evaluation / Review | Eval [37] | N | N | N | Y | N | Y | Y | N | N | N |
| Evaluation / Review | Sparsh [38] | N | N | N | Y | N | Y | N | N | P | P |
| Evaluation / Review | TacEva (Ours) | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y |
Definition. Two sequential steps with the sensor on a robot: (1) Surface geometry via first-contact mapping with a 10 mm spherical indenter; (2) Force/position mapping from synchronized images and 6‑axis F/T labels across randomized normal + shear stimuli.
Protocol. Probe the surface on a grid (≈0.1 mm steps) until contact (threshold ≈0.02 N), then indent to safe depths per device while adding small x–y displacements. Train a common ResNet‑18 baseline (70/20/10 split) to regress $(P_x, P_y, P_z, F_x, F_y, F_z)$. Report MAE, $R^2$, and sMAPE: $$\text{sMAPE} = \frac{1}{n} \sum_{i=1}^{n} \frac{|y_i - \hat{y}_i|}{\frac{|y_i| + |\hat{y}_i|}{2} + \epsilon} \times 100\%$$
Analysis. ViTacTip minimizes absolute force errors; GelSight variants excel in Pz; marker‑free GelSightWM is strong in Fz/Pz but weaker in Fxy; MagicTac is competitive in Pxy yet noisier in Fz.
Definition. Ability to distinguish closely spaced features. We report accuracy as a function of tolerance $\epsilon$ using a grating‑classification task:
Protocol. 3D‑printed dot/line gratings (≈0.05–2.0 mm). 100 presses per sample with randomized yaw. Train classifier; sweep ε.
Analysis. Above ≈5 mm, all near‑perfect. At 0.05 mm, GelSight/GelSightWM ≈99%, MagicTac ≈98%, ViTacTip ≈80% — reflecting gel stiffness/geometry and effective pixel density.
Evaluation using dot and line grating samples, with spacing from 0 mm (flat) to 2 mm, to determine the minimum resolvable feature size. All four sensors were benchmarked using grating-based samples for spatial resolution assessment.
Dot and Line samples from 0.0625 mm to 2 mm spacing - examples shown below
Definition. Normal compliance: $S = \Delta z / F$ (mm/N). Uniformity (0–1): $U = 1 / (1 + \sigma/|\mu|)$ from binned sensitivity means.
Protocol. Reuse calibration data; bin by (x,y); compute mean S per bin to form maps; aggregate μ, σ for U.
Analysis. ViTacTip is most sensitive but less uniform (edge‑enhanced S); GelSight/MagicTac are stiffer with higher U.
Definition. Stability of error across location and depth. Compute MAE per radial bin and per depth bin; robustness (lower is better):
Protocol. Collect a held‑out grid (≈1.6k points) with the same probing pattern; evaluate by bins over normalized radius/depth.
Analysis. ViTacTip holds force errors flat across the surface; planar gels show edge growth (notably in Fz and Pxy). Depth improves Pxy after shallow contact.
Definition. Sensitivity of prediction error to illumination changes (transparent/semi‑transparent devices). Example metric:
Protocol. Test under four scenes (diffuse/point/mixed; varying intensity). Compare to training‑light baseline using mean grayscale intensity.
Analysis. ViTacTip's errors grow under bright point sources; MagicTac's intensities shift less but error variance can rise due to grid interactions with external light.
Definition. Across $N$ repeats at $K$ points and $D$ depths, per‑channel variability (lower is better):
Protocol. K≈100 random points, step 0.1 mm to max depth; N=10 repeats per (point,depth).
Analysis. ViTacTip is most repeatable for forces and competitive for positions; GelSight is strongest in Pz; MagicTac is intermediate for position and higher variance for force.
Compare reconstructed surfaces across units of the same type via rigid alignment and nearest‑neighbor distances inside the common hull; the mean absolute surface gap summarizes manufacturing consistency.
Quantify the area between load/unload F–Δz curves (trapezoidal rule) over the overlap range at multiple surface points; ViTacTip shows measurable, spatially varying hysteresis, while GelSight variants/MagicTac show no clear hysteresis under our protocol.
| Mechanism | Sensor / Paper | Geometry-Focused Evaluations | Force-Focused Evaluations | Robustness-Critical Evaluations |
|---|---|---|---|---|
| IMM | DIGIT [29] | Pose control; contact and pose tracking | - | - |
| IMM | DTact [49] | Contact reconstruction; pose estimation; object recognition | - | Illumination robustness |
| IMM | OmniTact [30] | Connector insertion; contact-angle estimation | - | - |
| IMM | Minsight [50] | Tactile servoing; lump detection | Force estimation | - |
| IMM | ThinTact [32] | Delicate grasping; insertion; sliding-pose manipulation | Gentle force regulation | - |
| IMM | InSight [51] | Shape; orientation; posture sensing | Multi-contact force mapping | - |
| IMM | GelTip [34] | Contact localization | - | - |
| MDM | GelForce [64] | - | Traction-field estimation | - |
| MDM | ChromaTouch [52] | Curvature estimation (3D displacement field) | - | - |
| MDM | TacTip [10] | In-hand rolling/reorientation; edge following; localization; JND discrimination | - | - |
| MDM | DelTact [54] | Contact reconstruction | Force mapping | - |
| MDM | Tac3D [53] | Spatial resolution; contact reconstruction | Force estimation; friction-coefficient estimation; real-time mapping | - |
| IMM+MDM | GelSight [3] | Contact reconstruction; texture recognition; USB insertion | Force/stiffness estimation; slip detection | - |
| IMM+MDM | GelSlim [33,39,65] | Pose and shape reconstruction; insertion | Force/torque estimation; force-controlled manipulation; slip detection | Durability (>3000 grasps) |
| IMM+MDM | DenseTac 2.0 [31,44] | Dense contact reconstruction | Force/torque estimation | - |
| IMM+MDM | UV-Tac [55] | Contact localization; alignment | Normal and shear force mapping | UV/white-light switching |
| IMM+MFM | VisTac [56] | Contact reconstruction/localization; pose estimation; insertion | - | - |
| IMM+MFM | Finger-STS [57] | Object/texture recognition | Dense slip detection; bead-maze tasks | - |
| IMM+MFM | TIRgel [58] | Object classification | - | Ambient-brightness resistance |
| IMM+MFM | HiVTac [59] | Pose estimation; deformation analysis | Force estimation; grasping | - |
| MDM+MFM | ViTacTip [11] | Grating classification; pose estimation; localization | Force estimation | Lighting robustness (GAN-based) |
| MDM+MFM | FingerVision [12,66-68] | Surface reconstruction; deformation field | Force estimation; slip/vibration detection | Grasp-stability tests (shaking) |
| MDM+MFM | SpecTac [60] | 3D triangulation; feature matching | Force estimation | SIFT feature detection |
| MDM+MFM | VPTS [61] | Contact reconstruction; proximity exploration | - | Vision-proximity-tactile fusion |
| IMM+MDM+MFM | MagicTac [41,62] | Grating classification; pose estimation; contact localization | Force estimation | Lighting robustness; manufacture error; wear and tear |
| IMM+MDM+MFM | F-Touch [9] | Object/texture recognition | Force/torque estimation | - |
| Evaluation / Review | PBR-Design [36] | Contact reconstruction (embossed text, grasping); VBTS design | - | Design robustness (optical stability) |
| Evaluation / Review | VT-Review [63] | 3D visuo-tactile contact reconstruction | Force estimation | Robustness across textures and marker layouts |
| Evaluation / Review | Eval [37] | Edge detection; contact reconstruction | Minimum detectable force; sensitivity mapping; slip and frequency tests | Temperature/material dependency; real-world grasping |
| Evaluation / Review | Sparsh [38] | Pose tracking; texture recognition | Force estimation/mapping; slip-accumulation; bead-maze tasks; grasp stability | - |
| Evaluation / Review | TacEva (Ours) | Spatial resolution; contact localization | Force estimation; force sensitivity mapping | Repeatability; spatial and lighting robustness |
Selection guide: ViTacTip — best for low‑force, deep/soft contacts and force repeatability; sensitive to lighting and weaker in ultra‑fine resolution. MagicTac — fast, strong planar localization; force estimates noisier; control lighting when possible. GelSight — highest camera resolution and stable depth (Pz); modest frame rate and edge effects. GelSightWM — practical choice when shear is secondary; robust Pz/Fz without markers.
@article{taceva,
title = {TacEva: A Performance Evaluation Framework for Vision-Based Tactile Sensors},
author = {Cong, Qingzheng and Oh, Steven and Fan, Wen and Luo, Shan and Althoefer, Kaspar and Zhang, Dandan},
journal = {Advanced Intelligent Systems},
year = {2026},
pages = {e202501179},
doi = {10.1002/aisy.202501179},
url = {https://doi.org/10.1002/aisy.202501179}
}