Optimizing Submillimeter 3D Modeling with Auxiliary Lighting and Artificial Textures: An SfM-Based Approach

Francisco Roza de Moraes; Irineu da Silva

doi:10.12688/f1000research.157676.1

Research Article

[version 1; peer review: 1 approved, 1 approved with reservations]

Francisco Roza de Moraes ¹, Irineu da Silva¹

PUBLISHED 04 Dec 2024

¹ Department of Transportation Engineering, University of Sao Paulo Sao Carlos School of Engineering, São Carlos, State of São Paulo, 13563-120, Brazil

Francisco Roza de Moraes
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – Original Draft Preparation

Irineu da Silva
Roles: Formal Analysis, Project Administration, Supervision, Writing – Review & Editing

Abstract

Background

This study examines the influence of auxiliary lighting configurations and artificial surface textures on the quality of 3D models generated using Structure from Motion (SfM) in an indoor laboratory setting.

Method

Experiments were conducted by capturing images of concrete, metal, and wooden specimens at a one-meter distance. Various lighting setups, including vertical and adjacent auxiliary lighting models, were tested to determine their impact on model accuracy. In addition, complex artificial textures, such as checkerboard patterns, were applied to the specimens to assess their effect on 3D model precision.

Results

Our results demonstrate that optimal lighting and artificial textures significantly enhance the accuracy of 3D models, especially for materials with uniform textures, such as painted metal. For materials with more varied textures, such as concrete and wood, improvements were notable but less pronounced. The combination of auxiliary lighting and artificial textures improved model quality by approximately 40% for high-texture materials and up to 60% for uniform-texture materials. Furthermore, the study highlights the role of image file formats in the SfM process. While RAW images stored in TIFF format offered a slight advantage over lossless JPEG in terms of model accuracy, the difference may not be substantial enough to justify the larger file size in situations where submillimeter precision is not required.

Conclusions

Overall, our findings emphasize the importance of tailored lighting and texturing strategies for achieving high-precision 3D models in SfM applications. These results are particularly relevant for structural testing and other applications that demand high-fidelity 3D reconstructions, providing a foundation for more accurate and reliable models.

Keywords

Structure from Motion, Auxiliary Lighting, Surface Artificial Texture, Storage Format, Scale Bars, Cloud Points, Root Mean Square Error

Corresponding author: Francisco Roza de Moraes

Competing interests: No competing interests were disclosed.

Grant information: This study was funded by the Brazilian Federal Agency for Support and Evaluation of Graduate Education (CAPES) - Finance Code 001 – process number: 88882.379118/2019-01.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2024 Roza de Moraes F and da Silva I. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Roza de Moraes F and da Silva I. Optimizing Submillimeter 3D Modeling with Auxiliary Lighting and Artificial Textures: An SfM-Based Approach [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2024, 13:1479 (https://doi.org/10.12688/f1000research.157676.1) First published: 04 Dec 2024, 13:1479 (https://doi.org/10.12688/f1000research.157676.1) Latest published: 04 Dec 2024, 13:1479 (https://doi.org/10.12688/f1000research.157676.1)

Introduction

The Structure from Motion (SfM) technique, developed from significant advances in Computer Vision, has become a widely adopted method for three-dimensional modeling from sets of two-dimensional images. The combination of low-cost equipment with user-friendly computational applications has contributed to the widespread popularity of this 3D modeling technique across various fields. In the academic context, for instance, the versatility of the SfM technique has enabled the production of high-quality models, facilitating detailed research, enhancing the documentation of historical heritage, and promoting the precise analysis of architectural and natural structures.

Creus et al. (2021) noted that the development of high-quality 3D products depends on factors such as the precision of control points, the use of camera calibration techniques, and the quality of the image set used. The latter factor is often partially neglected by users who, when capturing images in long-distance environments, tend to focus solely on the configuration of the photographic equipment without adequately assessing the characteristics of the scene and the object to be imaged.

However, in short-distance environments with varying lighting conditions and objects with low levels of texture, these characteristics are crucial for obtaining high-quality images, which are fundamental for achieving accurate modeling. This is because the SfM technique relies on the correlation of points between images to estimate three-dimensional information about the scene.

Therefore, to efficiently execute the 3D modeling process, especially for indoor environments with short-distance captures, the analyzed environment must have consistent lighting without variations in brightness or shadowed regions. The captured object must also have a sufficiently detailed surface texture to generate accurate correlation points.

Furthermore, the image storage format is another factor that is sometimes overlooked in SfM works but can significantly influence the quality of the modeling. Depending on the settings used by users, this can lead to either a loss of scene information or excessive consumption of storage space (Reznicek et al. 2016).

In the literature, there are few 3D modeling studies focused on the use of the SfM technique in indoor and short-range acquisition environments that comprehensively address these factors. Therefore, this study aims to investigate the effect of different auxiliary lighting configurations combined with the application of artificial textures on specimens made of varied materials (concrete, metal, and wood), simulating the use of the technique in laboratory testing of structural beams requiring millimeter accuracy or less.

These materials were selected due to their widespread use in structural beam testing and the distinct characteristics of their surface textures, which facilitated an accurate evaluation of the lighting configurations and artificial textures applied. Additionally, captures were conducted at an approximate distance of one meter to replicate the laboratory environment and adhere to the safety restrictions inherent to structural test protocols.

The experiments underscored the benefits of using well-placed auxiliary lighting and the improvements provided by employing semi-closed patterns of artificial textures, which increased the accuracy of the generated models. In terms of storage formats, the use of RAW format (TIFF) showed a slight advantage over the lossless JPG format. Therefore, the results show that this technique, with careful consideration of the factors addressed in this study, is applicable to the modeling of structural beam tests.

Efficient capture techniques for SfM

The Structure-from-Motion (SfM) technique for 3D modeling has been extensively utilized across numerous application domains, primarily due to its high degree of automation enabled by advanced computer vision methods, which contribute to the development of efficient workflows. Figure 1 illustrates a summarized overview of the standard SfM workflow.

Figure 1. Workflow of the Structure from Motion Multi-Views Stereo process for a set of images to produce 3D modeling.

The process begins with the acquisition of a series of images from multiple viewpoints, ensuring significant overlap between consecutive captures. Feature detection is performed on each image, followed by the matching of these features across the entire image set. The SfM phase then employs these redundant feature points to execute Bundle Adjustment, which, through the incorporation of control points or scale bars, generates a sparse point cloud that approximates the geometry of the object or scene (James et al. 2017). This sparse representation is further refined during the Multi-View Stereo stage, where additional feature detection and matching techniques are applied to produce a dense point cloud. The result is a high-fidelity three-dimensional reconstruction of the captured object or scene (Luhmann et al. 2020).

As demonstrated in the workflow, the set of input images is integral to the feature detection stages and is repeatedly utilized during the Multi-View Stereo (MVS) process. Consequently, the characteristics and quality of these images, which may vary in configuration and parameters, are crucial in determining the effectiveness and accuracy of the 3D modeling process. According to Hafeez et al. (2018), the comprehensiveness and precision of the resultant 3D models, facilitated by SfM, are intricately tied to both the quantity and quality of discernible points of interest within the images. Therefore, the assurance of high accuracy of the derived products requires the set of images to adequately portray the scene under consideration with an elevated level of detail and quality.

Image quality is inherently related to the lighting attributes of the captured scene, as argued by Dauvin et al. (2018). The authors highlighted the importance of uniform lighting throughout the region of interest in the scene to facilitate high-quality camera calibration, 3D modeling processes, and more detailed photographic capture of the surface of an object or scene. This factor enables the identification of points of correspondence between images and increases the accuracy and stability of the 3D reconstruction. Therefore, complementing scene capture with adequate lighting is fundamental to improving the effectiveness of element detection for objects characterized by intricate surface detail with variations in texture and tone. In many cases, solar illumination alone may be insufficient for objects with sparsely detailed surfaces.

Numerous studies in different domains, including those by Capéran et al. (2012), Kwak et al. (2013), Mishra et al. (2017), Nietiedt et al. (2020), and Nielsen et al. (2023) have addressed the challenge of low object detail using artificial textures projected or applied to object surfaces. This approach has yielded remarkable results, enhanced surface detail, and introduced new patterns, thereby facilitating the detection of a larger number of salient points of interest.

Besides lighting and surface texture, the choice of storage format is another factor that influences the representation of the imaged object. The JPG or JPEG (Joint Photographic Experts Group) format is preferred in widely used capture systems. However, although it offers a wide range of colors with minimal storage requirements, its data compression process can result in loss of information or reduced image quality. Such data loss adversely affects the image correlation processes that are essential to the effectiveness of the SfM technique.

To address the quality degradation caused by JPG, some professionals have used photographic equipment that produces uncompressed digital images, known as RAW files. Such files preserve the fidelity of the data captured by the camera sensor, enabling a more accurate representation of the scene. However, the superior quality of RAW images, often stored in TIFF (Tagged Image File Format), results in significantly higher storage space requirements compared to JPEG. Morgan et al. (2017) and Verma and Bourke (2019) investigated those file formats. The former study reported consistency in model quality between them; however, the raw format offered advantages in facilitating extensive image processing due to a richer data representation. Analyses conducted in the latter study revealed a slight superiority in positional accuracy of models generated in the raw format attributed to enhanced detection of salient points of interest. Notably, a more detailed scene representation can potentially yield more effective and higher-quality 3D models.

Methods

In this study, we utilized Agisoft Metashape Pro (www.agisoft.com) for 3D processing, a commercial software extensively cited in the literature. Alternatively, open-source software options such as Meshroom (alicevision.org), OpenMVG (openmvg.readthedocs.io), and COLMAP (colmap.github.io) can also be employed for 3D model generation using SfM techniques.

The primary aim of the research is to enhance modeling accuracy by optimizing parameters during the photographic acquisition phase. Consequently, the workflow and software configurations are not discussed in detail within this paper. For more information on the configurations and workflow of the technique, see Leon et al. (2015), James et al. (2017), and Tinkham and Swayze (2021).

Evaluated test objects

The photographs were taken in the Geomatics Laboratory of the Transport Engineering Department of the São Carlos School of Engineering at the University of São Paulo. The selected test site has both natural and artificial lighting, which, according to the objectives of this study, influenced the photographic capture settings.

Samples of three materials, namely concrete, metal, and wood, simulated elements commonly used in laboratory testing of structural beams. The dimensions of the objects were 140 cm × 30 cm × 4 cm for the concrete beam, 140 cm × 40 cm × 1 cm for the metal structure, and 140 cm × 18 cm × 4 cm for the wooden beam.

The regions of interest for each object were defined as the surface planes of the respective objects in the captured photographs. Eight sets of acrylic rulers with checkerboard patterns were randomly placed around these regions to facilitate the automatic detection of reference points by the modeling software.

These acrylic plates of known dimensions functioned as scale bars (SBs) for calibrating and scaling the resulting 3D models. Another set of three acrylic rulers with checkerboard patterns and known dimensions, used as control bars (CBs), was placed within the region of interest of the test specimens to verify the quality of the 3D modeling. The bars had different lengths and were placed at different positions relative to the axes of the specimens being analyzed. Figure 2 shows the regions of interest and the configuration of SBs and CBs for each material.

Figure 2. Layout of the arrangement of positional elements, with Scale Bars in blue and Control Bars in yellow for (a) Concrete, (b) Metal, and (c) Wood.

Because of its significant dimensions and weight, a robust support structure was necessary to ensure the metal specimen’s stability and safety during imaging. This arrangement resulted in the support structure and the metal specimen being fixed in the center of the room. Consequently, the other specimens were placed next to the metal specimen, creating a consistent environment for analyzing all variables in this research.

Positioning of auxiliary lighting

In laboratory tests, the standard lighting of the location can cause variations in the color of the same region in different images due to the internal environmental conditions of the facilities. Farhadmanesh et al. (2021) addressed how such variations in representation can negatively affect the effectiveness of the interest point detection (SIFT-like algorithms), as discussed in Badano et al. (2015) and Lurie et al. (2017).

To meet the light quality requirements of this study, two auxiliary lighting units (softboxes) were used, each with a 7,000-lumen LED lamp and a color temperature of 5,000 Kelvin. The softboxes were arranged in three different positioning configurations relative to the specimen to simulate different structural test environments (see Figure 3).

Figure 3. Lighting Equipment Positions: (a) Standard configuration (b) Vertical alignment with the specimen along the camera capture line, (c) Adjacent placement on the sides of the object, and (d) Positioning below the object.

The initial “Standard” configuration (Figure 3a) depicts the environment in its natural state, utilizing only the room’s ceiling lighting and without any additional light sources. In the “Vertical” model (Figure 3b), auxiliary lighting was positioned along the camera’s line of sight and directed towards the specimen to illuminate the object vertically and avoid shadowing caused by the acquisition process.

In the “Adjacent” configuration (Figure 3c), softboxes were positioned at the sides of the test object, simulating tests in which other equipment is required between the camera and the object, so that the obstruction of lighting does not create shadowed areas.

Finally, the “Beneath” configuration (Figure 3d) shows an arrangement in which softboxes are positioned below the object to replicate experimental environments where space limitations prevent positioning the lighting sources in the “Vertical” or “Adjacent” configurations.

Reconstruction sets were generated for each lighting scenario on multiple specimens to evaluate the impact of the proposed lighting configurations on the quality of the resulting 3D models. Given that the primary objective of this study is to enhance the image capture process, no post-processing techniques were employed to adjust colors or lighting.

This approach was chosen to ensure proper light exposure of the objects and environment during capture, producing images rich in detectable and correlated features for the SfM and Multi-View Stereo algorithms, as discussed in O’Connor (2018) and Pena-Villasenin et al. (2019), without relying on labor-intensive post-processing steps.

The combinations utilized are presented in Table 1, which lists the adopted lighting configurations and the number of images generated at each acquisition stage.

Table 1. Processing combinations for various materials and lighting configurations, detailing image quantities obtained.

ID	Material	Lighting model	No. of images
CNA	Concrete	Standard (A)	42
CNB	Concrete	Vertical (B)	42
CNC	Concrete	Adjacent (C)	42
CND	Concrete	Beneath (D)	42
MNA	Metal	Standard (A)	42
MNB	Metal	Vertical (B)	42
MNC	Metal	Adjacent (C)	42
MND	Metal	Beneath (D)	42
WNA	Wood	Standard (A)	42
WNB	Wood	Vertical (B)	42
WNC	Wood	Adjacent (C)	42
WND	Wood	Beneath (D)	42

Artificial texture patterns

Artificial texture patterns were applied to the surface of the specimens to enhance the detection of points of interest in the materials used, highlighting details of natural texture and color contrast on the object, thus facilitating the detection and correlation of these points between photographic sets.

In Reiss and Tommaselli (2011) and Detchev et al. (2014), image projectors were used to create random patterns on the surfaces of the analyzed objects. In the present study, however, image projection would hinder the automatic detection of the reference bars, which carry checkerboard patterns, and would interfere with the auxiliary lighting configurations under analysis. Therefore, two patterns, a checkerboard pattern (T1) and a more complex pattern (T2), were drawn on the surface of the objects. Figure 4 shows the sets of natural textures of the samples and the artificial patterns used.

Figure 4. Natural and artificial textures associated with each specimen analyzed.

(a) Natural texture of the concrete; (b) T1 artificial texture on concrete; (c) T2 artificial texture on concrete; (d) Natural texture of the metal; (e) T1 artificial texture on metal; (f) T2 artificial texture on metal; (g) Natural texture of the wood; (h) T1 artificial texture on wood; (i) T2 artificial texture on wood.

Due to the rich surface texture of the concrete and wood samples, a 4 cm × 4 cm checkerboard pattern (T1) and a second pattern (T2) of diagonally cut squares were drawn on them in white chalk. The metal object analyzed, on the other hand, had a more uniform surface due to the texture of the material and the primer applied to protect it from corrosion. On the metal sample, the same drawing patterns were applied with a red permanent marker to contrast with the color of the surface.

The use of white chalk for the concrete and wood specimens, along with a red permanent marker for the metal specimen, was strategically chosen to create a high contrast between the colors of the test specimens and the artificial texture patterns. This deliberate contrast, coupled with the varied pattern representations on the object surfaces and precise exposure to auxiliary lighting, is designed to optimize the detection of a substantial number of elements by SIFT-type algorithms as discussed in Kanan and Cottrell (2012) and Wang et al. (2023).

Data acquisition and storage formats

In SfM, the photo acquisition stage is key to producing high-quality 3D models (Caldera-Cordero and Polo 2018). Accurate image capture is essential for ensuring precise alignment, detailed reconstruction, and reliable measurement outcomes, particularly in structural analysis and laboratory testing scenarios.

The photographic capture process for this study incorporated scale bar (SB) configurations, image overlap percentages, and camera calibration techniques to facilitate the analysis of the relevant variables. These configurations and processes align with the methodologies outlined in de Moraes and da Silva (2024).

For each analysis, images were acquired following an ordered variation in camera positioning, establishing a capture method similar to a regular vertical grid. Figure 5 shows the photographic capture process for the region of interest on the concrete object using only vertical captures.

Figure 5. Photographic capture process of the concrete specimen highlighting the blue squares that symbolize each image acquired during the procedure.

A full-frame Canon EOS R camera, paired with a Canon RF 24-105 mm f/4L zoom lens, was employed for the photographic process. The camera was configured to manual mode, with a fixed focal length of 35 mm to ensure consistent settings across all image captures. Exposure compensation was adjusted to +1 EV to enhance the brightness of all captures. This setup provided precise control over exposure parameters and captured regions with sufficient detail to meet the submillimeter precision requirements of the study.

To minimize image noise, each shot was taken at ISO 100. An aperture of f/11 was selected to obtain a greater depth of field and ensure sharpness across the entire field of view. To further ensure the stability of the images, a tripod and a 5-second shutter timer were used.

A 1-meter capture distance was adopted in the experiments to evaluate the SfM technique as used in structural testing. This distance keeps the Ground Sample Distance (GSD) at a submillimeter level while providing a more efficient capture process. The GSD values obtained for the experiments were approximately 0.15 mm. The images were recorded at 6,720 × 4,480 pixels, with a 35 mm focal length, on a camera equipped with a 36 mm × 24 mm full-frame sensor.
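The reported GSD can be checked with a short calculation. The following is a minimal sketch under a simple pinhole-camera assumption, taking a nominal sensor width of 36 mm for the full-frame sensor; the function names are illustrative and are not part of the authors' workflow.

```python
def ground_sample_distance(sensor_width_mm, image_width_px,
                           focal_length_mm, distance_mm):
    """Object-space size of one pixel (mm), assuming a pinhole camera."""
    pixel_pitch_mm = sensor_width_mm / image_width_px
    return pixel_pitch_mm * distance_mm / focal_length_mm


def footprint_mm(gsd_mm, image_width_px, image_height_px):
    """Width and height (mm) of the object area covered by one image."""
    return gsd_mm * image_width_px, gsd_mm * image_height_px


# Capture settings stated in the text; the 36 mm sensor width is an assumption.
gsd = ground_sample_distance(sensor_width_mm=36.0, image_width_px=6720,
                             focal_length_mm=35.0, distance_mm=1000.0)
width, height = footprint_mm(gsd, 6720, 4480)

print(f"GSD = {gsd:.3f} mm/pixel")                      # GSD = 0.153 mm/pixel
print(f"Footprint = {width:.0f} mm x {height:.0f} mm")  # Footprint = 1029 mm x 686 mm
```

Under these assumptions the result agrees with the approximately 0.15 mm reported above, and a single image covers roughly 1.03 m × 0.69 m of the specimen plane.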

The photographic images were in RAW format. However, the image sets were converted to JPG and TIFF formats to evaluate the impact of different storage formats on the quality of 3D modeling. According to Detchev et al. (2014), Morgan et al. (2017) and Verma and Bourke (2019), these two formats are widely used in projects employing SfM. We chose the commercial software Adobe Photoshop (www.adobe.com/br/products/photoshop.html) to convert the raw files captured by the camera to TIFF and JPG formats, because it is easy to use. However, open-source computer applications such as RawTherapee (www.rawtherapee.com), darktable (www.darktable.org), and GIMP (www.gimp.org) also successfully meet our needs.

A Starrett EC799A-8 digital caliper was used to measure the lengths of the sets of SBs and CBs in order to scale and check the 3D models developed. The instrument measured the distances between the markings on the rulers and the targets of the reference elements to sub-millimeter accuracy. It offers a high level of precision, with error margins of ± 0.02 mm for measurements up to 10 cm and ± 0.03 mm for measurements above 10 cm (Company 2007).

Five measurements were taken for each positional element to determine the length of the SBs and CBs accurately. The accuracy of the reference elements, combined with the positional accuracy of the SfM modeling, played a crucial role in determining the accuracy values used in the evaluation of the developed models. Figure 6 shows the mean values of the length of each bar obtained from the set of measurements, together with the standard deviation for each positional element. All measurements of the bars and images utilized in this study are freely accessible in the OSFHome repository at the following link: https://osf.io/k82ar/.
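The mean and standard deviation of the five caliper readings per bar can be obtained as sketched below. The readings are hypothetical placeholders, not values from the study (the real measurements are in the repository cited above), and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the text does not state which estimator was used.

```python
from statistics import mean, stdev

# Hypothetical caliper readings (mm), five per positional element.
readings_mm = {
    "SB1": [100.02, 100.01, 100.03, 100.02, 100.02],
    "CB1": [150.00, 150.02, 149.99, 150.01, 149.98],
}

# Mean length and sample standard deviation for each bar.
summary = {
    name: (mean(values), stdev(values))
    for name, values in readings_mm.items()
}

for name, (avg, sd) in summary.items():
    print(f"{name}: mean = {avg:.3f} mm, std = {sd:.4f} mm")
```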

Figure 6. Representation of the average lengths and Standard Deviation of the measurements of each positional element obtained from five measurements.

Quality assessment

Initially, from a set of input images, the SfM process generates a point cloud referenced to the camera coordinate system, resulting in an inaccurate representation of real-world objects. Therefore, a scaling procedure is essential to scale the point cloud relative to a specific reference unit and ensure the positional accuracy of the three-dimensional product (Luhmann et al. 2020). Different formats and positioning configurations of SBs with known lengths were used to scale and refine the generated products.
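The scaling step can be illustrated with a simplified sketch. It estimates a single least-squares scale factor from scale bars whose lengths are known in millimeters and measured in arbitrary model units, then applies it to the points. This is an illustrative simplification, not the procedure implemented in the software used in the study, and all values are hypothetical.

```python
def scale_factor(reference_lengths_mm, model_lengths):
    """Least-squares scale s minimising sum((s * m - r) ** 2)."""
    if len(reference_lengths_mm) != len(model_lengths) or not model_lengths:
        raise ValueError("need matching, non-empty length lists")
    numerator = sum(r * m for r, m in zip(reference_lengths_mm, model_lengths))
    denominator = sum(m * m for m in model_lengths)
    return numerator / denominator


def scale_points(points, s):
    """Apply a uniform scale to a list of (x, y, z) points."""
    return [(s * x, s * y, s * z) for x, y, z in points]


# Hypothetical scale bars: caliper lengths (mm) and lengths in model units.
reference = [100.0, 150.0, 200.0]
model = [0.50, 0.75, 1.00]

s = scale_factor(reference, model)
scaled = scale_points([(0.0, 0.0, 0.0), (0.5, 0.25, 1.0)], s)
print(s)       # 200.0
print(scaled)  # [(0.0, 0.0, 0.0), (100.0, 50.0, 200.0)]
```

Using several bars rather than one averages out the measurement error of any individual bar, which is consistent with the use of multiple SBs described above.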

A series of procedures were applied to evaluate the length of the positional elements used to estimate the positional accuracy of the models. First, the lengths were obtained by measuring the generated virtual models and compared with the values obtained from the digital caliper measurements. As discussed by Garcia and Oliveira (2021), the RMSE value (equation 1) was used to analyze the error and evaluate the accuracy of the distance prediction.

RMSE = \sqrt{\frac{\sum_{i=1}^{n} (E_{i} - R_{i})^{2}}{n}}    (1)

where n is the number of samples, E_i is the estimated value at position i, and R_i is the value measured at position i.
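As a minimal sketch, the RMSE of equation 1 can be computed as follows, with E_i taken as the bar lengths measured on the 3D model and R_i as the caliper lengths. The sample values are hypothetical and serve only to illustrate the calculation.

```python
import math


def rmse(estimated, reference):
    """Root mean square error between estimated and reference values."""
    if len(estimated) != len(reference) or not estimated:
        raise ValueError("need matching, non-empty value lists")
    squared_errors = [(e - r) ** 2 for e, r in zip(estimated, reference)]
    return math.sqrt(sum(squared_errors) / len(estimated))


# Hypothetical control-bar lengths (mm): model measurements vs. caliper values.
model_mm = [150.10, 199.85, 250.20]
caliper_mm = [150.00, 200.00, 250.00]

error_mm = rmse(model_mm, caliper_mm)
print(f"RMSE = {error_mm:.3f} mm")  # RMSE = 0.155 mm
```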

Analyzing the RMSE values together with the adjustment quality of each model, expressed by the maximum and minimum values of the covariance matrix, enables a comprehensive evaluation of the entire 3D reconstruction process.

Results and discussions

This section provides the results and discusses the experiments conducted.

Assessment between different lighting configurations

Image sets with an overlap of approximately 80% were used to generate different 3D models. Camera calibration parameters were obtained using the pre-self-calibration method with the maximum number of SBs (8), in environments similar to the region of interest and with consistent lighting configurations for each analysis. Figures 7 and 8 show, for the concrete samples, the RMSE values for these configurations and the maximum and minimum diagonal values of the covariance matrix, respectively.
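To give a sense of what an overlap of about 80% implies for the capture grid, the sketch below estimates the spacing between adjacent camera stations. It assumes a pinhole camera with a nominal 36 mm × 24 mm sensor, the 35 mm focal length and 1 m distance described in the Methods, and the same overlap in both directions; these are illustrative assumptions and the result is not a value reported by the authors.

```python
def image_footprint_mm(sensor_size_mm, focal_length_mm, distance_mm):
    """Object-space extent (mm) covered along one sensor dimension."""
    return sensor_size_mm * distance_mm / focal_length_mm


def station_spacing_mm(footprint, overlap):
    """Distance between adjacent camera stations for a given overlap fraction."""
    if not 0.0 <= overlap < 1.0:
        raise ValueError("overlap must be in [0, 1)")
    return footprint * (1.0 - overlap)


footprint_w = image_footprint_mm(36.0, 35.0, 1000.0)  # along the sensor width
footprint_h = image_footprint_mm(24.0, 35.0, 1000.0)  # along the sensor height

spacing_w = station_spacing_mm(footprint_w, 0.80)
spacing_h = station_spacing_mm(footprint_h, 0.80)
print(f"Spacing: {spacing_w:.0f} mm (width), {spacing_h:.0f} mm (height)")
# Spacing: 206 mm (width), 137 mm (height)
```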

Figure 7. RMSE values of CBs for different lighting configurations in a concrete specimen.

The RMSE values are consistent across configurations, at approximately 0.12 mm.

Figure 8. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each set of 3D modeling of the concrete specimens, with best results in the CNB configuration - maximum value of 0.33 mm² and minimum one of 0.13 mm².

Consistent RMSE values (Figure 7), between 0.11 mm and 0.12 mm, were observed for the positional quality of 3D modeling for the different lighting setups in the concrete specimens. However, in terms of the quality of the modeling adjustment (Figure 8), a significant improvement was achieved when auxiliary lighting was used in the photographic capture process compared to the natural lighting configuration (CNA). The maximum and minimum values of the covariance matrix averaged 0.34 mm² and 0.13 mm², respectively, when additional lighting was used, whereas the natural configuration gave significantly higher values.

These patterns, which indicate an improvement in RMSE values and the quality of 3D model adjustment when additional lighting was used, are attributed to the spectral characteristics of the material used as the test specimen. According to Senevirathne et al. (2021), the reflectance of concrete is influenced by factors such as mixture composition, texture, and surface color of the object. These factors lead to a higher reflectance rate and hence more accurate 3D modeling of objects even under limited lighting conditions, as observed in the experiments.

As photographic capture relies on the amount of light reflected from objects to ensure a clearer process, the generated models showed a higher quality of adjustment under higher lighting intensities (e.g. in CNB, CNC, and CND configurations), due to the greater detail of the analyzed objects and areas of interest displayed in the image sets from these configurations, resulting in more accurate modeling.

Figures 9 and 10 show the results for RMSE and maximum and minimum values of the Covariance Matrix, respectively, for the different lighting configurations on metallic samples.

Figure 9. RMSE values of CBs under various lighting conditions in a metal sample.

The assessment yielded values ranging from the least favorable (0.32 mm) in MNA to the most favorable (0.19 mm) in MNB.

Figure 10. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each 3D modeling set.

The best results for the metal object were a maximum value of 0.41 mm² and a minimum value of 0.19 mm².

Metallic specimens are commonly coated with a layer of protective paint because they are prone to oxidation and rust formation. Although the paint protects the metal from corrosion, as a side effect it also smooths the surface of the analyzed object, further reducing the textural properties of the specimens, as discussed by Sudarsanan et al. (2019).

The experiments showed a slight improvement in positional quality when any form of additional lighting (MNB, MNC, and MND) was used compared to its non-use (MNA). The RMSE values for the use of additional lighting ranged from 0.19 mm to 0.25 mm and were 0.32 mm when the MNA setup was used.

Regarding the maximum and minimum values of the Covariance Matrix for the different lighting configurations of metallic objects, an improvement was obtained when any form of lighting assistance (MNB, MNC, and MND) was used. The elements of the Covariance Matrix showed maximum values of the order of 0.4 mm², which is similar to the value obtained when no additional lighting (MNA) was used. However, compared to the standard model of the test specimen, the minimum values improved by about 0.1 mm² when any lighting aid was used.

Finally, the results of the 3D modeling of the wooden specimens provided RMSE values and maximum and minimum values of the Covariance Matrix as shown in Figures 11 and 12, respectively.

Figure 11. RMSE values of CBs under various lighting conditions in a wood sample.

The assessment yielded uniform values across the lighting configurations, ranging between 0.12 mm and 0.15 mm.

Figure 12. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each 3D modeling set.

The RMSE values of the positional quality of the modeling across the different lighting arrangements in the wooden test bodies showed uniformity, ranging between 0.12 mm and 0.15 mm. Additionally, the quality of adjustments improved when any form of auxiliary lighting (WNB, WNC, and WND) was used in the photographic capture process, compared to the natural configuration (WNA).

The results of the quality analysis of the experiments with the wooden specimen can be attributed to the different structural characteristics of the materials. As discussed by Feng et al. (2019), the texture of wood shows a wide range of color variations and patterns that can coexist in a single artifact, allowing for detailed photographic capture comparable to that observed in concrete objects. The variety of texture and contrast of the objects’ surfaces facilitated a more detailed capture process, particularly when appropriate lighting was used, improving the recognition of elements in the image sets used and resulting in highly accurate modeling.

The analyses revealed that the integration of lighting assistance substantially improved the quality of the 3D modeling for the three materials examined. To support a more comprehensive selection of the most suitable configuration, the number of elements detected in each set of images was also evaluated across the configurations and materials used. Figure 13 displays the number of sparse cloud points obtained after the detected elements had been filtered.

Figure 13. Values related to the sparse point cloud for each material under different lighting configurations employed in photographic capture.

Data suggest a slight superiority of Vertical and Adjacent lighting configurations across the three materials studied.

For a given material, the number of points obtained varied only slightly between lighting configurations. However, the most favorable results were obtained using the “Vertical (B)” and “Adjacent (C)” lighting configurations. Taken together with the previous analyses, this indicates a slight advantage for these lighting configurations in terms of quality parameters.

Although its results differed only minimally from those of the other configurations, the “Beneath (D)” configuration posed significant challenges to equipment installation and usage within a laboratory setting. Space constraints and safety considerations in structural testing, particularly beneath the test specimens, compromise the practicality of implementing Model D. Although the configuration offers advantages, laboratories must carefully assess its adoption in terms of safety and test feasibility.

Given how close the values are, auxiliary lighting should be positioned directly in front of the object or adjacent to the region of interest for photographic captures aimed at modeling objects with submillimeter precision.

Analysis of the use of different artificial textures

Because the previous findings underscored the advantages of the “Vertical (B)” and “Adjacent (C)” lighting aids during the photographic capture stages, new capture and processing sets were produced with artificial textures applied to the specimens. Each material analyzed in the experiment, namely, concrete, metal, and wood, was examined with two distinct texture patterns (T1 and T2) and the natural pattern inherent to each specimen.

Figure 14 shows the modeling results of the concrete specimens, in which two different lighting configurations and three texture models were analyzed. The artificial texture models were drawn with white chalk: the first (T1) used a checkered pattern to accentuate details of the surface of the object, and the second (T2) used a denser version of that pattern.

Figure 14. RMSE values of CBs for different texture configurations in a concrete specimen.

RMSE values were consistent across the configurations, at approximately 0.11 mm.

The results showed a notable equilibrium in the total RMSE values for all configurations, averaging around 0.11 mm. However, the covariance matrix values (Figure 15) showed differences between the artificial and natural texture models, especially in the maximum values. There was an improvement of approximately 0.1 mm² when artificial texture models were used.

Figure 15. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each 3D modeling from the concrete set.

Superior results were achieved, with a maximum value of 0.22 mm² and a minimum value of 0.08 mm².

Figure 16 shows the results of the RMSE quality metrics for modeling metallic samples. Texture patterns were created in the analysis using red permanent markers to enhance the contrast with the color of the protective paint applied to the samples. The first artificial texture (T1) aimed to highlight detail by incorporating a checkerboard pattern on the surface. The second model (T2) intensified the pattern introduced by T1.

Figure 16. RMSE values of CBs for different texture configurations in a metal specimen.

The assessment yielded values ranging from the least favorable (around 0.21 mm) in MT2B and MT2C to the most favorable (0.16 mm) in MT1B and MT1C.

The T1 texture configuration showed a slight improvement in the RMSE results, with average values of 0.16 mm compared to the other texture configurations, which achieved average values of 0.21 mm. As shown in Figure 17, there was a significant improvement in both the maximum and minimum values of the Covariance Matrix when each artificial texture model was used.

Figure 17. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each 3D modeling from the metallic set.

Superior results were achieved, with a maximum value of 0.36 mm² and a minimum value of 0.14 mm².

Figure 18 shows the RMSE results of wooden object modeling with artificial texture configurations similar to those used in the concrete specimen experiments. The artificial texture models on the wooden specimen were created using white chalk in two different patterns to accentuate the surface details of the object. The first pattern (T1) is checkered, while the second (T2) is denser.

Figure 18. RMSE values of CBs for different texture configurations in a wood specimen.

RMSE values were consistent across the configurations, at approximately 0.12 mm.

The behavior of the wooden specimens was remarkably similar to that of the concrete objects. Although the RMSE values did not show significant variations with the application of different texture settings, the adjustment accuracy of the 3D models improved. Figure 19 shows the maximum and minimum values of the diagonal of the Covariance Matrix for modeling the wooden specimen, emphasizing a notable improvement in quality in terms of maximum values when an artificial texture model was applied.

Figure 19. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each 3D modeling from a wood set.

Superior results were achieved, with a maximum value of 0.19 mm² and a minimum value of 0.09 mm².

Analysis of the results for the three materials showed similar behavior, despite the distinctive characteristics associated with the natural texture patterns and color of each sample. The use of artificial texture patterns resulted in more accurate 3D modeling with less variation in the maximum and minimum values, as shown by the covariance matrix values. However, no significant gains in modeling were observed when positional quality was analyzed using RMSE values.
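The two indicators can diverge because they summarize different things: RMSE condenses positional error into a single figure, whereas the diagonal of a covariance matrix holds the estimated variances of the adjusted quantities. As a minimal sketch of how maximum and minimum values might be read from such a matrix, the Python example below uses two small 3×3 matrices with hypothetical entries in mm². The matrices, their size, and the helper name are assumptions for illustration and are not taken from this study.

```python
def diagonal_extremes(cov):
    """Return (max, min) of the diagonal of a square covariance matrix."""
    n = len(cov)
    if n == 0 or any(len(row) != n for row in cov):
        raise ValueError("covariance matrix must be square and non-empty")
    diagonal = [cov[i][i] for i in range(n)]
    return max(diagonal), min(diagonal)

# Hypothetical 3x3 covariance matrices (mm^2) for X, Y, Z coordinates.
cov_natural = [
    [0.30, 0.02, 0.01],
    [0.02, 0.12, 0.03],
    [0.01, 0.03, 0.18],
]
cov_textured = [
    [0.20, 0.01, 0.01],
    [0.01, 0.08, 0.02],
    [0.01, 0.02, 0.15],
]

for label, cov in (("natural", cov_natural), ("textured", cov_textured)):
    var_max, var_min = diagonal_extremes(cov)
    print(f"{label}: max={var_max:.2f} mm^2, min={var_min:.2f} mm^2, "
          f"spread={var_max - var_min:.2f} mm^2")
```

A narrower spread between the maximum and minimum variances is one simple way to express the "less variation" described above, although the study's own software may report these values differently.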

The consistency of the positional quality values is due to the image sets used in the analysis. The use of artificial lighting in all analyzed sets resulted in highly accurate modeling, as previously investigated, and combined with artificial textures, led to the acquisition of point clouds with low representation errors, as discussed by Hafeez et al. (2018).

The analysis suggests that artificial textures tend to improve the accuracy of 3D object modeling and facilitate the detection of elements between images, resulting in a more detailed representation. The experiments involved the use of light tools and artificial textures with significant contrast to the materials analyzed. It is therefore the responsibility of the user to determine the most appropriate pattern and methods for representing the texture of an object, considering the specific requirements of their experiments in terms of feasibility and potential benefits. In engineering laboratories, particularly for structural testing requiring submillimeter precision, the use of artificial textures with checkerboard patterns and a high degree of repetition is recommended to improve the quality of 3D modeling.

Analysis of different storage formats

Additional 3D modeling processes explored the effects of using different storage formats (TIFF and JPG) in sets of images captured in indoor environments at close range. Artificial textures (T1) and lighting support (B – Vertical) were adopted for the three test specimen materials.

The combinations made, including formats and various configurations, are presented in Table 2, along with the corresponding amounts of storage used by each format. The image sets indicate that the average size of files in TIFF format was approximately fifteen times larger than those in JPG format.

Table 2. Processing combinations for specimen materials, storage formats, and average image size for 3D modeling.

ID	Material	Texture	Lighting model	Format	Size per image (MB)
CT1B – TIFF	Concrete	T1	B	TIFF	84.1
CT1B – JPG	Concrete	T1	B	JPG	5.78
MT1B – TIFF	Metal	T1	B	TIFF	84.1
MT1B – JPG	Metal	T1	B	JPG	5.78
WT1B – TIFF	Wood	T1	B	TIFF	84.1
WT1B – JPG	Wood	T1	B	JPG	5.78
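The storage trade-off in Table 2 can be checked with simple arithmetic. The Python sketch below takes the two per-image sizes from the table and computes their ratio, along with the total size of an image set. The set size of 100 images is a hypothetical figure chosen for illustration, since the number of images per set is not stated in this section.

```python
TIFF_MB = 84.1  # average size per image, from Table 2
JPG_MB = 5.78   # average size per image, from Table 2

def size_ratio(large_mb, small_mb):
    """How many times larger one file is than another."""
    return large_mb / small_mb

def set_size_gb(per_image_mb, n_images):
    """Total size of an image set in GB, taking 1 GB = 1000 MB."""
    return per_image_mb * n_images / 1000.0

ratio = size_ratio(TIFF_MB, JPG_MB)
n_images = 100  # hypothetical set size, not reported in this section

print(f"TIFF/JPG size ratio: {ratio:.1f}x")
print(f"{n_images} TIFF images: {set_size_gb(TIFF_MB, n_images):.2f} GB")
print(f"{n_images} JPG images:  {set_size_gb(JPG_MB, n_images):.2f} GB")
```

The ratio of about 14.6 is consistent with the "approximately fifteen times" figure quoted in the text.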

Figure 20 shows the quality values associated with the different storage formats of the image sets for the optimal lighting and texture configurations previously examined in this study.

Figure 20. RMSE values of CBs for the different file storage formats (TIFF and JPG) across all materials when artificial texture T1 and lighting condition B (Vertical) were used.

The TIFF configuration yielded better RMSE values in the assessment compared to the 3D models with the use of JPG images.

In comparison to JPG, the RMSE values for each specimen of the materials analyzed showed a slight improvement when TIFF was used. Such a trend is in line with the findings reported by Detchev et al. (2014), who observed that improvements in positional accuracy when using raw storage formats, as opposed to JPG, are typically at the submillimeter level. Therefore, where file storage and transfer are important considerations, the adoption of formats such as TIFF may not offer benefits substantial enough to offset the larger storage requirements.

The RMSE values did not reveal any significant differences that would justify the selection of a specific storage format. However, the results of the Covariance Matrix showed substantial disparities, as shown in Figure 21. The 3D modeling with images in TIFF format showed a better adjustment quality compared to models using JPG images.

Figure 21. The maximum and minimum values obtained from the Covariance Matrix indicate the quality of adjustment across all materials when artificial texture T1 and lighting condition B (Vertical) were used.

The TIFF configuration yielded better maximum and minimum values in the assessment compared to the respective 3D model that used JPG images.

The maximum and minimum values were notably more accurate and precise when the raw format was used. Morgan et al. (2017) reported this behavior, recognizing the higher level of detail in images in TIFF format and choosing raw formats over compressed ones. This decision was based on the superior capacity of TIFF for accommodating post-processing techniques while maintaining the integrity of the raw image data with no compression and information loss typical of JPG format.

The TIFF storage format is recommended because of its potential for higher efficiency in image post-processing techniques, especially in 3D modeling tasks requiring submillimeter precision. However, due to its rapid acquisition and lower storage space requirements, JPG can be a viable alternative when precision requirements are less stringent and extensive image post-processing activities are unnecessary.

Limitations of the study

This study, centered on the three-dimensional modeling of objects within the context of laboratory structural testing, has identified some limitations inherent to the experimental approach, particularly with SfM techniques.

Capture Distance: The experimental protocol required photographic captures to be conducted at approximately 1 meter from the objects under study. This distance, dictated by stringent safety protocols in the laboratory environment, inevitably constrained the resolution and detail of the 3D models produced. It is recognized that shorter capture distances would likely enhance model quality by increasing image resolution and enabling a more comprehensive representation of the object. Consequently, it is imperative to establish minimum capture distances that meet safety standards and optimize the quality of 3D reconstructions.

Artificial Texture Patterns: The application of artificial texture patterns in this study was intended to enhance the visibility of surface features on the test specimens, a critical factor for SfM algorithms. However, closed or overly repetitive texture patterns can obscure finer surface details, such as cracks or microfractures, which are essential for accurate structural analysis. It is therefore crucial for laboratory professionals to select texture patterns that balance enhancing surface visibility and preserving the detectability of critical surface features. This selection is vital for ensuring the robustness of the 3D models generated through computer vision techniques.

Auxiliary Lighting Positioning: The complex environment of the structural laboratory, characterized by the presence of various equipment and sensors, poses significant challenges for the effective positioning of auxiliary lighting. Inadequate lighting arrangements can introduce shadows and uneven illumination, which can degrade the quality of the images captured and, consequently, the accuracy of the 3D models produced. Proper lighting placement is essential to mitigate these effects, ensuring consistent illumination and minimizing shadowing that could compromise the integrity of the SfM process.
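Returning to the capture-distance limitation above, the link between distance and resolution can be illustrated with the standard pinhole approximation, in which the object-space footprint of one pixel grows linearly with distance. The Python sketch below assumes a hypothetical camera; the pixel pitch and focal length are illustrative values and do not describe the equipment used in this study.

```python
def ground_sampling_distance_mm(pixel_pitch_mm, focal_length_mm, distance_mm):
    """Approximate object-space size of one pixel under a pinhole model."""
    if pixel_pitch_mm <= 0 or focal_length_mm <= 0 or distance_mm <= 0:
        raise ValueError("all parameters must be positive")
    return pixel_pitch_mm * distance_mm / focal_length_mm

# Hypothetical camera: 0.004 mm (4 micrometre) pixel pitch, 50 mm lens.
PIXEL_PITCH_MM = 0.004
FOCAL_LENGTH_MM = 50.0

for distance_mm in (1000.0, 750.0, 500.0):
    gsd = ground_sampling_distance_mm(PIXEL_PITCH_MM, FOCAL_LENGTH_MM, distance_mm)
    print(f"distance {distance_mm / 1000:.2f} m -> {gsd:.3f} mm per pixel")
```

Under these assumed parameters, halving the distance from 1 m to 0.5 m halves the pixel footprint from 0.08 mm to 0.04 mm, which is the mechanism behind the expectation that shorter capture distances would improve model quality.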

In summary, professionals engaged in such experimental work must possess a thorough understanding of the specific requirements and limitations of the tests being conducted, as well as the characteristics of the laboratory environment. This understanding is crucial to avoid suboptimal capture processes that could lead to reduced modeling quality or necessitate the repetition of experiments. By addressing these limitations, the fidelity and reliability of 3D models generated through SfM can be significantly improved.

Conclusion

This study explored the impact of different configurations on the optimization of the close-range photographic capture process in indoor environments. The configurations were examined for the generation of high-quality image sets suitable for the SfM technique in the 3D modeling of specimens with the submillimeter positional accuracy required for laboratory structural testing.

To assess the quality levels achieved and identify the configurations that have the greatest impact on the photographic capture process, multiple capture sets were created using different lighting configurations, artificial textures, and image storage formats for three different specimen materials.

An analysis of the quality values suggested that more accurate results are obtained when Vertical and Adjacent auxiliary lighting models are used, since their adoption significantly improved the positional RMSE values and model adjustment quality, especially for metallic specimens characterized by more uniform textures. However, specimens made of materials with high texture variation (e.g., concrete and wood) only showed significant improvements in the adjustment quality.

Artificial textures, characterized by checkered patterns with contrasting colors, applied to the surface of the specimens behaved similarly to auxiliary lighting. The benefits were associated with improvements in the quality of adjustment of the three-dimensional products generated. The combination of auxiliary lighting and artificial textures led to an approximately 40% improvement in modeling quality for materials with high texture variation. For materials with a more uniform texture, such as the metallic sample, improvements in modeling quality reached around 60% when the two analyzed configurations were adopted.

The quality values obtained from the evaluation of two different image file formats, RAW (stored in TIFF) and lossless JPEG, indicate a slight superiority in the quality of 3D products for the RAW format (stored in TIFF) compared to the lossless JPEG file format. However, in situations where submillimeter accuracy is not required, the lossless JPEG format may be justified due to its smaller file size. Additionally, if lossy compression is used, it is recommended that the reader conduct a preliminary assessment of the quality of the results by employing the procedures and methods proposed in this research.

The analyses highlighted the improvements in the quality of 3D products obtained by SfM with auxiliary lighting and artificial texture patterns. Regarding the storage format (TIFF or JPG), the results showed a slight advantage for TIFF. However, it is the user’s responsibility to determine their needs and assess whether the difference in storage space is justified, as TIFF requires more storage space than JPG.

Data availability

Underlying data

OSFHome: Dataset of images SfM - FRM - Lighting and Artificial texture - JPG (DOI 10.17605/OSF.IO/K82AR) (de Moraes 2024).

The project contains the following underlying data:

• concrete test specimen (12 sets of images in JPG format, which varied according to the lighting and artificial texture configurations adopted in the study).
• metal test specimen (12 sets of images in JPG format, which varied according to the lighting and artificial texture configurations adopted in the study).
• wood test specimen (12 sets of images in JPG format, which varied according to the lighting and artificial texture configurations adopted in the study).

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Reporting guidelines

Zenodo: Optimizing Submillimeter 3D Modeling with Auxiliary Lighting and Artificial Textures: An SfM-Based Approach. DOI: https://doi.org/10.5281/zenodo.13937284

The project contains the following reporting guidelines:

• STROBE Statement Utilized in the Preparation of the Article Titled “Optimizing Submillimeter 3D Modeling with Auxiliary Lighting and Artificial Textures: An SfM Approach”

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Acknowledgements

The authors thank the São Carlos School of Engineering for all the support. This study was financed by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES).

References

Badano A, et al.: Consistency and standardization of color in medical imaging: a consensus report. J. Digit. Imaging. 2015; 28: 41–52.
Caldera-Cordero JM, Polo ME: Analysis of free image-based modelling systems applied to support topographic measurements. Surv. Rev. 2018; 51(367): 300–309.
Capéran P, et al.: Optical 3-dimensional measurements on a frp beam tested at serviceability limit. Compos. Struct. 2012; 94(12): 3465–3477.
Company S: Ec799 electronic calipers 165. Athol, Massachusetts: Starret L.S; 2007.
Creus P, Sanislav I, Dirks P: Application of sfm-mvs for mining geology: Capture set-up and automated processing using the dugald river znpb-ag mine as a case study. Eng. Geol. 2021; 293: 106314.
Dauvin L, et al.: Optimization of temperature, targets, and illumination for high precision photogrammetric measurements. IEEE Sensors J. 2018; 18(4): 1449–1456.
Detchev I, et al.: Deformation monitoring with off-the-shelf digital cameras for civil engineering fatigue testing. Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci. 2014; XL-5: 195–202.
Farhadmanesh M, et al.: Highway asset and pavement condition management using mobile photogrammetry. Transp. Res. Rec. 2021; 2675(9): 296–307.
Feng X, et al.: Surface design of wood-based board to imitate wood texture using 3d printing technology. Bioresources. 2019; 14(4): 8196–8211.
Garcia MVY, Oliveira HC: The influence of flight configuration, camera calibration, and ground control points for digital terrain model and orthomosaic generation using unmanned aerial vehicles imagery. Boletim de ciências geodésicas. 2021; 27: e2021015.
Hafeez J, et al.: The effect of patterns on image-based modelling of texture-less objects. Metrol. Meas. Syst. 2018; 25(4): 755–767.
James MR, Robson S, Smith MW: 3-d uncertainty-based topographic change detection with structure-from-motion photogrammetry: precision maps for ground control and directly georeferenced surveys. Earth Surf. Process. Landf. 2017; 42(12): 1769–1788.
Kanan C, Cottrell GW: Color-to-grayscale: does the method matter in image recognition? PLoS One. 2012; 7(1): e29740.
Kwak E, et al.: Precise photogrammetric reconstruction using model-based image fitting for 3d beam deformation monitoring. J. Surv. Eng. 2013; 139(3): 143–155.
Leon J, et al.: Measuring coral reef terrain roughness using ‘structure-from-motion’ close-range photogrammetry. Geomorphology. 2015; 242: 21–28. Geomorphology in the Geocomputing Landscape: GIS, DEMs, Spatial Analysis and statistics.
Luhmann T, et al.: Close-range photogrammetry and 3d imaging. Berlin, Boston: De Gruyter; 2020.
Lurie KL, et al.: 3d reconstruction of cystoscopy videos for comprehensive bladder records. Biomed. Opt. Express. 2017; 8(4): 2106–2123.
Mishra SR, et al.: A simple image-based deformation measurement technique in tensile testing of geotextiles. Geosynth. Int. 2017; 24(3): 306–320.
de Moraes FR, da Silva I: Assessment of submillimeter precision via structure from motion technique in close-range capture environments. arxiv preprint arxiv:2409.15602. 2024.
de Moraes FR: Dataset of images SfM - FRM - Lighting and Artificial texture - JPG. 2024, November 19.
Morgan JA, Brogan DJ, Nelson PA: Application of structure-from-motion photogrammetry in laboratory flumes. Geomorphology. 2017; 276: 125–143.
Nielsen MS, et al.: Quantifying the influence of surface texture and shape on structure from motion 3d reconstructions. Sensors. 2023; 23(1).
Nietiedt S, et al.: Accuracy investigations of image matching techniques by means of a textured dumbbell artefact. The international archives of the photogrammetry, remote sensing and spatial information sciences, XLIII-B2-2020. 2020; pp. 791–796.
O’Connor J: Impact of image quality on sfm photogrammetry: colour, compression and noise. Kingston University; 2018. Thesis (PhD).
Pena-Villasenin S, Gil-Docampo M, Ortiz-Sanz J: Professional sfm and tls vs a simple sfm photogrammetry for 3d modelling of rock art and radiance scaling shading in engraving detection. J. Cult. Herit. 2019; 37: 238–246.
Reiss ML, Tommaselli AM: A low-cost 3d reconstruction system using a singleshot projection of a pattern matrix. Photogramm. Rec. 2011; 26(133): 91–110.
Reznicek J, Luhmann T, Jepping C: Influence of raw image preprocessing and other selected processes on accuracy of close-range photogrammetric systems according to vdi 2634. The international archives of the photogrammetry, remote sensing and spatial information sciences, XLI-B5. 2016; pp. 107–113.
Senevirathne D, et al.: Effects of pavement texture and colour on urban heat islands: An experimental study in tropical climate. Urban Clim. 2021; 40: 101024.
Sudarsanan N, et al.: Digital image correlation technique for measurement of surface strains in reinforced asphalt concrete beams under fatigue loading. J. Mater. Civ. Eng. 2019; 31(8): 04019135.
Tinkham WT, Swayze NC: Influence of agisoft metashape parameters on uas structure from motion individual tree detection from canopy height models. Forests. 2021; 12(2): 250.
Verma AK, Bourke MC: A method based on structure-from-motion photogrammetry to generate sub-millimetre-resolution digital elevation models for investigating rock breakdown features. Earth Surf. Dyn. 2019; 7(1): 45–66.
Wang T, et al.: Contrast enhancement-based preprocessing process to improve deep learning object task performance and results. Appl. Sci. 2023; 13(19): 10760.

Competing interests

No competing interests were disclosed.

Grant information

This study was funded by the Brazilian Federal Agency for Support and Evaluation of Graduate Education (CAPES) - Finance Code 001 – process number: 88882.379118/2019-01.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Published: 04 Dec 2024, 13:1479

https://doi.org/10.12688/f1000research.157676.1

© 2024 Roza de Moraes F and da Silva I. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite this article

Roza de Moraes F and da Silva I. Optimizing Submillimeter 3D Modeling with Auxiliary Lighting and Artificial Textures: An SfM-Based Approach [version 1; peer review: 1 approved, 1 approved with reservations]. F1000Research 2024, 13:1479 (https://doi.org/10.12688/f1000research.157676.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

Open Peer Review

Reviewer Report 23 Aug 2025

Ivan Nikolov, Aalborg University, Aalborg, Denmark

Approved with Reservations

https://doi.org/10.5256/f1000research.173163.r402790

The paper looks into how lighting placement and surface texture change the quality of close range SfM 3D reconstructions on three types of materials - wood, concrete, metal. The research is quite interesting and can be very useful as an overview of the requirements that engineers should have to do with their setups before they can extract good 3D surface analysis captures.

There are some parts in the paper that are currently lacking and require additional work:

The authors briefly discuss their choice of Agisoft Metashape Pro. Even though it's extensively cited in the literature, the software is extremely expensive (especially the Pro version) and it has comparable results to free software solutions like Meshroom, RealityScan, and 3DF Zephyr. It also has comparable results to Pix4D, another paid software solution. The authors should either compare the results to at least the free SfM software out there or cite research that compares it. The reviewer has proposed paper citations for the latter option (please check).
In addition, no explanation has been given on the specific settings used in Metashape Pro to obtain the reconstructions, making the replicability impossible.
Furthermore, the authors should discuss why not use other ways for 3D reconstruction for indoor close range, like structured light, stereo cameras, time-of-flight cameras, solid state lidars, etc.

For the testing, the authors need to mention the light luminosity used in their research again to have better replicability.

It is generally not advised to take photos of objects for SfM reconstruction by just taking them in a line with overlap, without any rotation in the camera. This most of the time results in poorer reconstruction than if you also have a rotation in the camera axis. Why did the authors choose to do this capture configuration?

Do the authors also have ground truth representations of the objects that are being scanned? Maybe captured through a different scanning method or having 3D CAD models. Normally, to better compare 3D reconstruction quality, a comparison to ground truth objects is advised. Then the distance between the objects can be calculated, there can be a comparison where the errors are on the surface of the object, how much noise there is, etc. It will be a good idea to have such a comparison in the paper.

The authors are using a caliper to capture real-life distances. How many measurements were done for each place of measurement? Normally, when a human-led measurement like this is done, we need to know the standard deviation of the human error. These errors have been shown to propagate through the calculations and introduce errors into the scaling of the 3D model (please check the third proposed article for more information).

When applying the texture to the surfaces, the reviewer imagines it was done by hand. This can lead to errors and places in the 3D reconstruction that have noise or holes. Why wasn't something like a projector or laser projector used to project the patterns on the surface? Then the uniformity of the pattern would be guaranteed.

Currently, there are no visuals of the reconstructions from the different experiments, making it hard to visually judge how well they are achieved. Please add these.

Is the work clearly and accurately presented and does it cite the current literature?
Partly

Is the study design appropriate and is the work technically sound?
Yes

Are sufficient details of methods and analysis provided to allow replication by others?
Partly

If applicable, is the statistical analysis and its interpretation appropriate?
Not applicable

Are all the source data underlying the results available to ensure full reproducibility?
No

Are the conclusions drawn adequately supported by the results?
Yes

References

1. Kingsland K: Comparative analysis of digital photogrammetry software for cultural heritage. Digital Applications in Archaeology and Cultural Heritage. 2020; 18.
2. Nikolov I, Madsen C: Benchmarking Close-range Structure from Motion 3D Reconstruction Software Under Varying Capturing Conditions. 10058: 15-26.
3. Nikolov I, Madsen C: Calculating Absolute Scale and Scale Uncertainty for SfM Using Distance Sensor Measurements. 168-192.

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Deep learning, 3D reconstruction, SfM, photogrammetry, computer graphics

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Author Response 16 Jan 2026

Francisco Roza de Moraes, Department of Transportation Engineering, University of Sao Paulo Sao Carlos School of Engineering, São Carlos, 13563-120, Brazil

We thank the reviewer for the detailed and technically insightful comments, which significantly improved the methodological transparency and robustness of the manuscript. Responses to each comment are provided below.
Comment 1
The authors briefly discuss their choice of Agisoft Metashape Pro. Even though it is extensively cited in the literature, the software is extremely expensive (especially the Pro version) and has comparable results to free SfM solutions such as Meshroom, RealityScan, and 3DF Zephyr, as well as to other commercial software such as Pix4D. The authors should either compare the results with at least free SfM software or cite research that performs such comparisons.
Response:
We thank the reviewer for this important and constructive comment. The suggested references comparing Agisoft Metashape with alternative SfM solutions were carefully reviewed and incorporated into the revised manuscript.
The choice of Agisoft Metashape Pro in this study was primarily motivated by its availability in our laboratory infrastructure and by the research team’s prior experience with the software, which facilitated a consistent and controlled experimental workflow. Importantly, the objective of the study was not to benchmark SfM software packages, but rather to investigate the influence of photographic acquisition conditions, such as auxiliary lighting, artificial texture patterns, and storage formats, on the quality of close-range SfM-based 3D reconstructions.
As clarified in the revised manuscript, other computational solutions, both commercial and open source (e.g., Meshroom, COLMAP, OpenMVG, RealityScan, and 3DF Zephyr), can produce comparable results and would also meet the methodological requirements of the proposed experiments. The conclusions drawn in this work are therefore not software-dependent and can be transferred to alternative SfM pipelines.
Comment 2
In addition, no explanation has been given of the specific settings used in Metashape Pro to obtain the reconstructions, making replication impossible.
Response:
We thank the reviewer for highlighting this important point regarding replicability. In the revised version of the manuscript, the processing workflow and the specific configuration parameters used in Agisoft Metashape Pro were explicitly described.
A dedicated paragraph was added to the Methods section detailing the alignment, dense cloud generation, mesh reconstruction, and tiled model parameters applied consistently across all datasets. These settings were kept fixed for all experiments to ensure comparability between lighting configurations, artificial texture patterns, and storage formats.
By explicitly reporting these parameters, the revised manuscript now allows full replication of the reconstruction workflow using Agisoft Metashape Pro or equivalent SfM software.
Comment 3
Furthermore, the authors should discuss why other approaches to indoor close-range 3D reconstruction were not used, such as structured light, stereo cameras, time-of-flight cameras, solid-state lidars, etc.
Response:
We thank the reviewer for this relevant and insightful comment. The choice of Structure-from-Motion (SfM) as the core reconstruction technique in this study was deliberate and aligned with the broader objectives of the doctoral research from which this work originates.
The primary aim of this research was to investigate how photographic acquisition conditions, such as auxiliary lighting, artificial texture patterns, and image storage formats, affect the quality and reliability of SfM-based 3D reconstructions in laboratory-scale structural testing environments. SfM was selected because it is a passive, image-based technique that combines flexibility, scalability, and relatively low hardware cost, while allowing the use of standard photographic equipment and adaptable camera positioning under laboratory safety constraints.
Alternative close-range 3D reconstruction approaches, including structured light systems, stereo camera setups, time-of-flight sensors, and solid-state LiDARs, were considered within the broader research framework and, in some cases, evaluated in separate experimental stages. However, these techniques typically require specialized and often costly hardware, controlled illumination conditions, fixed sensor geometries, or limited operational ranges, which can reduce their applicability in structural laboratories characterized by space limitations, safety restrictions, and variable experimental configurations.
As future work, the research group is currently acquiring additional equipment, including stereo camera systems, dedicated lenses, and advanced illumination setups, which will allow a systematic comparison between SfM and alternative close-range 3D sensing technologies. Additionally, LiDAR-based depth sensing, available in modern smartphones, will be investigated as a complementary solution for selected laboratory scenarios. These investigations will be reported in future publications.
To clarify the scope of the present study, a brief discussion was added to the revised manuscript emphasizing that the objective was not to benchmark different 3D reconstruction technologies, but to optimize SfM acquisition strategies within realistic laboratory constraints.
Comment 4
For the testing, the authors should reiterate the light luminosity used in their research to enhance replicability.
Response:
We thank the reviewer for this comment regarding replicability. In the revised version of the manuscript, the auxiliary lighting specifications were explicitly reported.
The manuscript now states that two auxiliary lighting units (softboxes) were used, each equipped with a 7,000-lumen LED lamp with a color temperature of 5,000 K. These parameters were kept constant across all experiments involving auxiliary lighting to ensure consistency and reproducibility.
By explicitly reporting both the luminous flux and the color temperature of the light sources, the revised manuscript provides sufficient information for other researchers to replicate the illumination conditions adopted in this study.
Comment 5
It is generally not advised to photograph objects for SfM reconstruction only along a line with overlap, without any rotation of the camera. In most cases, this results in a poorer reconstruction than when rotation about the camera axis is also included. Why did the authors choose this capture configuration?
Response:
We thank the reviewer for this important methodological observation. The capture configuration adopted in this study was not arbitrary but was based on previous experimental evaluations conducted by the authors.
The use of a controlled capture geometry with overlapping images acquired along a regular grid, without intentional camera axis rotation, was motivated by earlier investigations reported by Moraes and da Silva, in which different acquisition strategies were systematically tested under similar laboratory conditions. In those experiments, no significant improvements in positional accuracy or model adjustment quality were observed when random camera orientations or additional rotations were introduced.
Given that the primary objective of the present study was to isolate and evaluate the effects of auxiliary lighting, artificial texture patterns, and image storage formats on SfM reconstruction quality, a controlled and repeatable acquisition strategy was adopted. This approach reduced additional sources of variability and facilitated direct comparison between experimental configurations.
In addition, the selected capture strategy reflects practical constraints commonly encountered in laboratory structural testing, where camera positioning is often limited by safety protocols, equipment layout, and restricted access around the specimen. Under these conditions, a regular and well-controlled acquisition geometry provides a robust and reproducible solution without compromising the validity of the comparative analyses performed.
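For illustration only, the spacing of camera stations on such a regular grid can be related to the image overlap through a simple pinhole-camera calculation. The sketch below is not taken from the manuscript: the sensor width, focal length, and overlap value are hypothetical placeholders, and only the one-meter object distance follows the setup described in the study.

```python
def image_footprint_mm(sensor_width_mm, focal_length_mm, distance_mm):
    """Width of the object plane covered by one image (pinhole model)."""
    if focal_length_mm <= 0 or distance_mm <= 0:
        raise ValueError("Focal length and distance must be positive.")
    return sensor_width_mm * distance_mm / focal_length_mm


def station_spacing_mm(footprint_mm, overlap):
    """Distance between adjacent camera stations for a given overlap fraction."""
    if not 0.0 <= overlap < 1.0:
        raise ValueError("Overlap must be in the range [0, 1).")
    return footprint_mm * (1.0 - overlap)


# Hypothetical camera parameters (assumptions, not values from the study).
SENSOR_WIDTH_MM = 23.5   # assumed sensor width
FOCAL_LENGTH_MM = 35.0   # assumed lens focal length
OVERLAP = 0.8            # assumed 80% overlap between adjacent images
DISTANCE_MM = 1000.0     # one-meter camera-to-specimen distance

footprint = image_footprint_mm(SENSOR_WIDTH_MM, FOCAL_LENGTH_MM, DISTANCE_MM)
spacing = station_spacing_mm(footprint, OVERLAP)
print(f"Footprint: {footprint:.1f} mm, station spacing: {spacing:.1f} mm")
```

Under these assumptions, the same spacing rule applies independently along each grid axis, using the sensor height for the vertical direction.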
Comment 6
Do the authors also have ground truth representations of the objects that are being scanned, for example captured through a different scanning method or available as 3D CAD models? Normally, to better assess 3D reconstruction quality, a comparison to ground truth objects is advised. The distance between the reconstruction and the ground truth can then be calculated, showing where the errors lie on the surface of the object, how much noise there is, etc. Such a comparison would be a valuable addition to the paper.
Response:
We thank the reviewer for this relevant and constructive suggestion. In the present study, a full geometric ground truth representation of the specimens, such as high-resolution 3D CAD models or reference scans obtained with alternative sensing technologies, was not available.
Instead, the assessment of reconstruction quality was based on physically measurable reference elements incorporated into the experimental setup. Scale bars and control bars with known lengths were positioned within the region of interest and measured using a high-precision digital caliper. These elements provided reliable reference values for evaluating positional accuracy through distance-based metrics, such as RMSE, which are commonly adopted in close-range photogrammetry and SfM-based studies.
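As a minimal sketch of the distance-based metric mentioned above, the RMSE can be computed from pairs of control-bar lengths, one measured in the scaled 3D model and one measured with the caliper. The function name and the numerical values are hypothetical and serve only to illustrate the calculation; they are not data from the study.

```python
import math


def length_rmse(model_lengths_mm, reference_lengths_mm):
    """Root-mean-square error between model-derived and reference lengths (mm)."""
    if len(model_lengths_mm) != len(reference_lengths_mm):
        raise ValueError("Both sequences must contain the same number of lengths.")
    if not model_lengths_mm:
        raise ValueError("At least one length pair is required.")
    squared_errors = [
        (model - reference) ** 2
        for model, reference in zip(model_lengths_mm, reference_lengths_mm)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical values for illustration only.
reference = [100.00, 150.00, 200.00]  # caliper lengths of control bars (mm)
model = [100.10, 149.90, 200.20]      # same bars measured in the scaled model (mm)

rmse_mm = length_rmse(model, reference)
print(f"RMSE = {rmse_mm:.3f} mm")
```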
This approach was selected because the primary objective of the research was not to perform a full surface-to-surface comparison between reconstructed models and an external ground truth, but rather to analyze the relative impact of different acquisition conditions, including lighting configurations, artificial texture patterns, and storage formats, under controlled and repeatable laboratory conditions.
While surface-based comparisons against a full geometric ground truth can provide additional insights into noise distribution and local deviations, implementing such analyses would require complementary sensing systems or reference models that were outside the scope of the present work. This limitation is now explicitly acknowledged in the manuscript.
Comment 7
The authors use a caliper to capture real-life distances. How many measurements were taken at each measurement location? Normally, when a human-led measurement like this is made, the standard deviation of the human error needs to be known. These errors have been shown to propagate through the calculations and introduce errors into the scaling of the 3D model.
Response:
We thank the reviewer for this important comment regarding measurement uncertainty and error propagation.
In the present study, each scale bar and control bar was measured five independent times using a high-precision digital caliper. The repeated measurements allowed the estimation of the mean value and the corresponding standard deviation for each reference length. The observed standard deviations were on the order of hundredths of a millimeter, which is consistent with the manufacturer’s specifications of the instrument and within the accuracy level required for the submillimeter analyses performed in this work.
These reference measurements were subsequently used for model scaling and accuracy assessment. By relying on averaged values obtained from repeated measurements, the influence of operator-induced variability was minimized. The resulting uncertainty associated with the physical measurements was therefore significantly smaller than the variations observed in the SfM reconstruction metrics, ensuring that the scaling process did not dominate the error budget of the 3D models.
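The reasoning above can be sketched numerically. The example below summarizes five repeated caliper readings of one reference bar and gives a first-order estimate of how the reference uncertainty carries over to a length measured in the scaled model. It assumes a single scale bar and ignores the uncertainty of measuring the bar in the model itself; the readings and the 250 mm model length are hypothetical, not values from the study.

```python
import math
import statistics


def summarize_repeats(readings_mm):
    """Return mean, sample standard deviation, and standard error of the mean (mm)."""
    if len(readings_mm) < 2:
        raise ValueError("At least two readings are needed for a standard deviation.")
    mean = statistics.mean(readings_mm)
    std = statistics.stdev(readings_mm)       # sample standard deviation (n - 1)
    sem = std / math.sqrt(len(readings_mm))   # uncertainty of the averaged value
    return mean, std, sem


def scaled_length_uncertainty(length_mm, ref_mean_mm, ref_sem_mm):
    """First-order uncertainty passed from the reference length to a scaled model length."""
    # The scale factor is proportional to the reference length, so its relative
    # uncertainty equals ref_sem / ref_mean and multiplies every scaled length.
    return length_mm * ref_sem_mm / ref_mean_mm


# Hypothetical caliper readings of one scale bar (mm).
readings = [100.02, 99.98, 100.01, 99.99, 100.00]
mean_mm, std_mm, sem_mm = summarize_repeats(readings)

# Uncertainty contributed to a hypothetical 250 mm length in the scaled model.
u_mm = scaled_length_uncertainty(250.0, mean_mm, sem_mm)
print(f"mean={mean_mm:.3f} mm, std={std_mm:.4f} mm, scaled uncertainty={u_mm:.4f} mm")
```

With readings that scatter by hundredths of a millimeter, the contribution to a scaled length remains at the same order of magnitude, which is the sense in which the scaling step does not dominate the error budget.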
This measurement strategy is consistent with common practices in close-range photogrammetry and laboratory-scale SfM studies, where repeated caliper measurements are used to control human-induced uncertainty and ensure reliable reference data for model evaluation.
Comment 8
When applying the texture to the surfaces, the reviewer imagines it was done by hand. This can lead to errors and places in the 3D reconstruction that have noise or holes. Why wasn't something like a projector or a laser projector used to project the patterns on the surface? Then the uniformity of the pattern would be guaranteed.
Response:
We thank the reviewer for this relevant observation. The use of projected patterns, including digital or laser projectors, was considered during the experimental design phase of this research.
However, the adoption of projected textures was not pursued due to practical and methodological constraints associated with the laboratory environment and the objectives of the study. First, the use of projectors directly conflicted with the auxiliary lighting configurations under investigation. Since this work explicitly evaluates the influence of different lighting arrangements on SfM reconstruction quality, introducing a projected pattern would act as an additional and uncontrolled light source, altering the illumination distribution, contrast, and radiometric consistency of the scene. This would compromise the isolation of lighting-related variables that were central to the experimental design.
Second, the experimental protocol required a relatively large number of image acquisitions for each configuration, all performed using a single camera and under strict safety and access constraints typical of structural testing laboratories. During preliminary tests, maintaining consistent projection geometry and intensity over extended acquisition times proved challenging. Small changes in projector alignment, occlusions caused by camera repositioning, or gradual variations in projection intensity over time introduced inconsistencies that negatively affected feature detection and image matching.
In addition, the presence of reference elements such as scale bars and control bars with checkerboard patterns posed further challenges. Projected patterns interfered with the automatic detection of these reference targets, reducing their reliability for scaling and accuracy assessment.
For these reasons, manually applied artificial texture patterns were selected as a controlled and stable alternative. Although hand-applied patterns may introduce local variability, they provided consistent contrast throughout the acquisition process, did not interfere with the auxiliary lighting configurations, and ensured reliable detection of reference elements across all datasets.
To clarify this methodological choice, the revised manuscript explicitly defines artificial textures as manually applied patterns and acknowledges the trade-offs associated with alternative projection-based approaches.
Competing Interests: No competing interests were disclosed.
Response:
We thank the reviewer for this relevant and insightful comment. The choice of Structure-from-Motion (SfM) as the core reconstruction technique in this study was deliberate and aligned with the broader objectives of the doctoral research from which this work originates.
The primary aim of this research was to investigate how photographic acquisition conditions, such as auxiliary lighting, artificial texture patterns, and image storage formats, affect the quality and reliability of SfM-based 3D reconstructions in laboratory-scale structural testing environments. SfM was selected because it is a passive, image-based technique that combines flexibility, scalability, and relatively low hardware cost, while allowing the use of standard photographic equipment and adaptable camera positioning under laboratory safety constraints.
Alternative close-range 3D reconstruction approaches, including structured light systems, stereo camera setups, time-of-flight sensors, and solid-state LiDARs, were considered within the broader research framework and, in some cases, evaluated in separate experimental stages. However, these techniques typically require specialized and often costly hardware, controlled illumination conditions, fixed sensor geometries, or limited operational ranges, which can reduce their applicability in structural laboratories characterized by space limitations, safety restrictions, and variable experimental configurations.
As future work, the research group is currently acquiring additional equipment, including stereo camera systems, dedicated lenses, and advanced illumination setups, which will allow a systematic comparison between SfM and alternative close-range 3D sensing technologies. Additionally, LiDAR-based depth sensing, available in modern smartphones, will be investigated as a complementary solution for selected laboratory scenarios. These investigations will be reported in future publications.
To clarify the scope of the present study, a brief discussion was added to the revised manuscript emphasizing that the objective was not to benchmark different 3D reconstruction technologies, but to optimize SfM acquisition strategies within realistic laboratory constraints.
Comment 4
For the testing, the authors should reiterate the light luminosity used in their research to enhance replicability.
Response:
We thank the reviewer for this comment regarding replicability. In the revised version of the manuscript, the auxiliary lighting specifications were explicitly reported.
The manuscript now states that two auxiliary lighting units (softboxes) were used, each equipped with a 7,000-lumen LED lamp with a color temperature of 5,000 K. These parameters were kept constant across all experiments involving auxiliary lighting to ensure consistency and reproducibility.
By explicitly reporting both the luminous flux and the color temperature of the light sources, the revised manuscript provides sufficient information for other researchers to replicate the illumination conditions adopted in this study.
Comment 5
It is generally not advised to take photos of objects for SfM reconstruction by just taking them in a line with overlap, without any rotation in the camera. This most of the time results in poorer reconstruction than if you also have a rotation in the camera axis. Why did the authors choose to do this capture configuration?
Response:
We thank the reviewer for this important methodological observation. The capture configuration adopted in this study was not arbitrary but was based on previous experimental evaluations conducted by the authors.
The use of a controlled capture geometry with overlapping images acquired along a regular grid, without intentional camera axis rotation, was motivated by earlier investigations reported by Moraes and da Silva, in which different acquisition strategies were systematically tested under similar laboratory conditions. In those experiments, no significant improvements in positional accuracy or model adjustment quality were observed when random camera orientations or additional rotations were introduced.
Given that the primary objective of the present study was to isolate and evaluate the effects of auxiliary lighting, artificial texture patterns, and image storage formats on SfM reconstruction quality, a controlled and repeatable acquisition strategy was adopted. This approach reduced additional sources of variability and facilitated direct comparison between experimental configurations.
In addition, the selected capture strategy reflects practical constraints commonly encountered in laboratory structural testing, where camera positioning is often limited by safety protocols, equipment layout, and restricted access around the specimen. Under these conditions, a regular and well-controlled acquisition geometry provides a robust and reproducible solution without compromising the validity of the comparative analyses performed.
Comment 6
Do the authors also have ground truth representations of the objects that are being scanned? Maybe captured through a different scanning method or having 3D CAD models. Normally, to better compare 3D reconstruction quality, a comparison to ground truth objects is advised. Then the distance between the objects can be calculated, and there can be a comparison where the errors are on the surface of the object, how much noise there is, etc. It will be a good idea to have such a comparison in the paper.
Response:
We thank the reviewer for this relevant and constructive suggestion. In the present study, a full geometric ground truth representation of the specimens, such as high-resolution 3D CAD models or reference scans obtained with alternative sensing technologies, was not available.
Instead, the assessment of reconstruction quality was based on physically measurable reference elements incorporated into the experimental setup. Scale bars and control bars with known lengths were positioned within the region of interest and measured using a high-precision digital caliper. These elements provided reliable reference values for evaluating positional accuracy through distance-based metrics, such as RMSE, which are commonly adopted in close-range photogrammetry and SfM-based studies.
This approach was selected because the primary objective of the research was not to perform a full surface-to-surface comparison between reconstructed models and an external ground truth, but rather to analyze the relative impact of different acquisition conditions, including lighting configurations, artificial texture patterns, and storage formats, under controlled and repeatable laboratory conditions.
While surface-based comparisons against a full geometric ground truth can provide additional insights into noise distribution and local deviations, implementing such analyses would require complementary sensing systems or reference models that were outside the scope of the present work. This limitation is now explicitly acknowledged in the manuscript.
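As an illustration of the distance-based check described in this response, the following is a minimal sketch of how an RMSE over control-bar lengths could be computed. The function name `length_rmse` and all numeric values are hypothetical and are not data or code from the study; the sketch only assumes that each control bar has one caliper reference length and one length measured in the scaled 3D model, both in millimeters.

```python
import math


def length_rmse(model_mm, reference_mm):
    """Root-mean-square error between model-derived and reference lengths (mm)."""
    if len(model_mm) != len(reference_mm) or not model_mm:
        raise ValueError("inputs must be non-empty and of equal length")
    # Squared difference for each control bar (model length minus caliper length).
    squared = [(m - r) ** 2 for m, r in zip(model_mm, reference_mm)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical values for illustration only (not measurements from the study).
reference = [100.00, 150.00, 200.00]   # caliper lengths of three control bars
model = [100.03, 149.96, 200.00]       # same bars measured in the 3D model

print(f"RMSE = {length_rmse(model, reference):.4f} mm")
```

Because the metric only uses a few known lengths, it summarizes positional accuracy at the reference elements and does not describe surface noise elsewhere on the specimen, which is the limitation acknowledged above.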
Comment 7
Authors are using a caliper to capture real-life distances. How many measurements were done for each place of measurement? Normally, when a human-led measurement like this is done, we need to know the standard deviation of the human error. These errors have been shown to propagate through the calculations and cause errors in the scaling of the 3D model.
Response:
We thank the reviewer for this important comment regarding measurement uncertainty and error propagation.
In the present study, each scale bar and control bar was measured five independent times using a high-precision digital caliper. The repeated measurements allowed the estimation of the mean value and the corresponding standard deviation for each reference length. The observed standard deviations were on the order of hundredths of a millimeter, which is consistent with the manufacturer’s specifications of the instrument and within the accuracy level required for the submillimeter analyses performed in this work.
These reference measurements were subsequently used for model scaling and accuracy assessment. By relying on averaged values obtained from repeated measurements, the influence of operator-induced variability was minimized. The resulting uncertainty associated with the physical measurements was therefore significantly smaller than the variations observed in the SfM reconstruction metrics, ensuring that the scaling process did not dominate the error budget of the 3D models.
This measurement strategy is consistent with common practices in close-range photogrammetry and laboratory-scale SfM studies, where repeated caliper measurements are used to control human-induced uncertainty and ensure reliable reference data for model evaluation.
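The repeated-measurement procedure described in this response can be sketched as follows. This is a minimal example under stated assumptions: the function name `summarize_repeats` and the five readings are hypothetical and are not values from the study, and the standard error of the mean is shown only as one common way to express the uncertainty of an averaged reference length.

```python
import math
import statistics


def summarize_repeats(readings_mm):
    """Return (mean, sample standard deviation, standard error of the mean) in mm."""
    if len(readings_mm) < 2:
        raise ValueError("at least two readings are needed for a standard deviation")
    mean = statistics.mean(readings_mm)
    std = statistics.stdev(readings_mm)        # sample standard deviation (n - 1)
    sem = std / math.sqrt(len(readings_mm))    # uncertainty of the averaged value
    return mean, std, sem


# Hypothetical caliper readings of one scale bar (illustration only).
readings = [100.01, 100.03, 99.99, 100.02, 100.00]
mean, std, sem = summarize_repeats(readings)
print(f"mean = {mean:.3f} mm, std = {std:.3f} mm, sem = {sem:.3f} mm")
```

Comparing the standard error of each reference length with the RMSE of the reconstruction gives a quick indication of whether the caliper measurements, rather than the SfM processing, could dominate the error budget.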
Comment 8
When applying the texture to the surfaces, the reviewer imagines it was done by hand. This can lead to errors and places in the 3D reconstruction that have noise or holes. Why wasn't something like a projector or a laser projector used to project the patterns on the surface? Then the uniformity of the pattern would be guaranteed.
Response:
We thank the reviewer for this relevant observation. The use of projected patterns, including digital or laser projectors, was considered during the experimental design phase of this research.
However, the adoption of projected textures was not pursued due to practical and methodological constraints associated with the laboratory environment and the objectives of the study. First, the use of projectors directly conflicted with the auxiliary lighting configurations under investigation. Since this work explicitly evaluates the influence of different lighting arrangements on SfM reconstruction quality, introducing a projected pattern would act as an additional and uncontrolled light source, altering the illumination distribution, contrast, and radiometric consistency of the scene. This would compromise the isolation of lighting-related variables that were central to the experimental design.
Second, the experimental protocol required a relatively large number of image acquisitions for each configuration, all performed using a single camera and under strict safety and access constraints typical of structural testing laboratories. During preliminary tests, maintaining consistent projection geometry and intensity over extended acquisition times proved challenging. Small changes in projector alignment, occlusions caused by camera repositioning, or gradual variations in projection intensity over time introduced inconsistencies that negatively affected feature detection and image matching.
In addition, the presence of reference elements such as scale bars and control bars with checkerboard patterns posed further challenges. Projected patterns interfered with the automatic detection of these reference targets, reducing their reliability for scaling and accuracy assessment.
For these reasons, manually applied artificial texture patterns were selected as a controlled and stable alternative. Although hand-applied patterns may introduce local variability, they provided consistent contrast throughout the acquisition process, did not interfere with the auxiliary lighting configurations, and ensured reliable detection of reference elements across all datasets.
To clarify this methodological choice, the revised manuscript explicitly defines artificial textures as manually applied patterns and acknowledges the trade-offs associated with alternative projection-based approaches.
Competing Interests: No competing interests were disclosed.

Reviewer Report 10 Feb 2025

Aleksandar Ašonja, University Business Academy, Cvećarska, Serbia

Approved

https://doi.org/10.5256/f1000research.173163.r356962

In general, the research topic is very current. Some chapters of the manuscript need to be revised to make them clearer and generally acceptable to readers. I suggest the following minimal changes and additions to make the manuscript acceptable for indexing.
1) Do not write the manuscript using personal pronouns (we…) and possessive adjectives (our…). The manuscript should be written in the third person and in the past tense.
2) The title and keywords describe the research well.
3) The proposal is to expand the abstract with some of the research results.
4) The introductory chapter is clearly written. This chapter should be expanded by introducing more recent research.
5) The work methodology is not very clear. In the work methodology, it should be noted: What scientific methods, techniques, analyses, software, devices and other equipment are used for research.
6) The research results are clear and describe the research well.
7) In the conclusion, it should be stated what was the scientific justification of the research.
8) The conclusion should highlight what the continuation of the research could be.
9) The reference should be expanded by introducing additional references.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Industrial Engineering, Mechanical Engineering, Renewable Energy Sources, Agricultural Engineering, Energy and Environment.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Author Response 16 Jan 2026

Francisco Roza de Moraes, Department of Transportation Engineering, University of Sao Paulo Sao Carlos School of Engineering, São Carlos, 13563-120, Brazil

We thank the reviewer for the careful evaluation of the manuscript and for the constructive comments, which helped improve clarity, structure, and scientific rigor. Each comment is addressed point by point below.

Comment 1
Do not write the manuscript in personal pronouns (we…). The manuscript should be written in the third person and in the past tense.
Response:
The manuscript was revised to remove personal pronouns and to ensure consistent use of the third person and past tense throughout all sections, following standard scientific writing conventions.

Comment 2
The proposal is to expand the abstract with some of the research results.
Response:
The abstract was expanded to include a concise summary of the main experimental results, highlighting the effects of auxiliary lighting, artificial texture patterns, and image storage formats on SfM-based 3D modeling quality.

Comment 3
The introductory chapter is clearly written. This chapter should be expanded by introducing more recent research.
Response:
The Introduction was expanded with additional recent and relevant references addressing close-range SfM applications, lighting conditions, and feature detection robustness, strengthening the contextual background of the study.

Comment 4
The work methodology is not very clear. In the work methodology, it should be noted: What scientific methods, techniques, analyses, software, devices, and other equipment are used for research.
Response:
The Methods section was revised to improve clarity and reproducibility. The experimental setup, lighting configurations, artificial texture patterns, camera parameters, capture distance, calibration strategy, and evaluation metrics were explicitly described.

Comment 5
It should be specified which materials, software, devices, and other equipment are used for research.
Response:
All materials, software, and equipment used in the study were explicitly detailed, including camera and lens specifications, auxiliary lighting characteristics, calibration instruments, and processing software.

Comment 6
In the conclusion, it should be stated what the scientific justification of the research is.
Response:
The Conclusion was revised to clearly state the scientific contribution of the study, emphasizing the systematic evaluation of capture configurations for improving SfM-based 3D modeling accuracy in laboratory environments.

Comment 7
The conclusion should highlight what the continuation of the research could be.
Response:
The Conclusion now includes perspectives for future research, such as extending the analysis to other materials, capture distances, texture strategies, and laboratory conditions.

Comment 8
The reference should be expanded by introducing additional references.
Response:
The reference list was expanded with additional recent and relevant publications to better support the discussion and contextualize the findings.
Competing Interests: No competing interests were disclosed.

Reviewer Report

23 Aug 2025 | for Version 1

Ivan Nikolov, Aalborg University, Aalborg, Denmark

Approved With Reservations

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

No
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Deep learning, 3D reconstruction, SfM, photogrammetry, computer graphics

Responses (1)

Author Response

16 Jan 2026

Francisco Roza de Moraes, Department of Transportation Engineering, University of Sao Paulo Sao Carlos School of Engineering, São Carlos, 13563-120, Brazil

We thank the reviewer for the detailed and technically insightful comments, which significantly improved the methodological transparency and robustness of the manuscript. Responses to each comment are provided below.
Comment 1
The authors briefly discuss their choice of Agisoft Metashape Pro. Even though it is extensively cited in the literature, the software is extremely expensive (especially the Pro version) and has comparable results to free SfM solutions such as Meshroom, RealityScan, and 3DF Zephyr, as well as to other commercial software such as Pix4D. The authors should either compare the results with at least free SfM software or cite research that performs such comparisons.
Response:
We thank the reviewer for this important and constructive comment. The suggested references comparing Agisoft Metashape with alternative SfM solutions were carefully reviewed and incorporated into the revised manuscript.
The choice of Agisoft Metashape Pro in this study was primarily motivated by its availability in our laboratory infrastructure and by the research team’s prior experience with the software, which facilitated a consistent and controlled experimental workflow. Importantly, the objective of the study was not to benchmark SfM software packages, but rather to investigate the influence of photographic acquisition conditions, such as auxiliary lighting, artificial texture patterns, and storage formats, on the quality of close-range SfM-based 3D reconstructions.
As clarified in the revised manuscript, other computational solutions, both commercial and open source (e.g., Meshroom, COLMAP, OpenMVG, RealityScan, and 3DF Zephyr), can produce comparable results and would also meet the methodological requirements of the proposed experiments. The conclusions drawn in this work are therefore not software-dependent and can be transferred to alternative SfM pipelines.
Comment 2
In addition, no explanation has been given on the specific settings used in Metashape Pro to obtain the reconstructions, making the replicability impossible.
Response:
We thank the reviewer for highlighting this important point regarding replicability. In the revised version of the manuscript, the processing workflow and the specific configuration parameters used in Agisoft Metashape Pro were explicitly described.
A dedicated paragraph was added to the Methods section detailing the alignment, dense cloud generation, mesh reconstruction, and tiled model parameters applied consistently across all datasets. These settings were kept fixed for all experiments to ensure comparability between lighting configurations, artificial texture patterns, and storage formats.
By explicitly reporting these parameters, the revised manuscript now allows full replication of the reconstruction workflow using Agisoft Metashape Pro or equivalent SfM software.
Comment 3
Furthermore, the authors should discuss why not use other ways for 3D reconstruction for indoor close range, like structured light, stereo cameras, time-of-flight cameras, solid-state lidars, etc.
Response:
We thank the reviewer for this relevant and insightful comment. The choice of Structure-from-Motion (SfM) as the core reconstruction technique in this study was deliberate and aligned with the broader objectives of the doctoral research from which this work originates.
The primary aim of this research was to investigate how photographic acquisition conditions, such as auxiliary lighting, artificial texture patterns, and image storage formats, affect the quality and reliability of SfM-based 3D reconstructions in laboratory-scale structural testing environments. SfM was selected because it is a passive, image-based technique that combines flexibility, scalability, and relatively low hardware cost, while allowing the use of standard photographic equipment and adaptable camera positioning under laboratory safety constraints.
Alternative close-range 3D reconstruction approaches, including structured light systems, stereo camera setups, time-of-flight sensors, and solid-state LiDARs, were considered within the broader research framework and, in some cases, evaluated in separate experimental stages. However, these techniques typically require specialized and often costly hardware, controlled illumination conditions, fixed sensor geometries, or limited operational ranges, which can reduce their applicability in structural laboratories characterized by space limitations, safety restrictions, and variable experimental configurations.
As future work, the research group is currently acquiring additional equipment, including stereo camera systems, dedicated lenses, and advanced illumination setups, which will allow a systematic comparison between SfM and alternative close-range 3D sensing technologies. Additionally, LiDAR-based depth sensing, available in modern smartphones, will be investigated as a complementary solution for selected laboratory scenarios. These investigations will be reported in future publications.
To clarify the scope of the present study, a brief discussion was added to the revised manuscript emphasizing that the objective was not to benchmark different 3D reconstruction technologies, but to optimize SfM acquisition strategies within realistic laboratory constraints.
Comment 4
For the testing, the authors should reiterate the light luminosity used in their research to enhance replicability.
Response:
We thank the reviewer for this comment regarding replicability. In the revised version of the manuscript, the auxiliary lighting specifications were explicitly reported.
The manuscript now states that two auxiliary lighting units (softboxes) were used, each equipped with a 7,000-lumen LED lamp with a color temperature of 5,000 K. These parameters were kept constant across all experiments involving auxiliary lighting to ensure consistency and reproducibility.
By explicitly reporting both the luminous flux and the color temperature of the light sources, the revised manuscript provides sufficient information for other researchers to replicate the illumination conditions adopted in this study.
Comment 5
It is generally not advised to take photos of objects for SfM reconstruction by just taking them in a line with overlap, without any rotation in the camera. This most of the time results in poorer reconstruction than if you also have a rotation in the camera axis. Why did the authors choose to do this capture configuration?
Response:
We thank the reviewer for this important methodological observation. The capture configuration adopted in this study was not arbitrary but was based on previous experimental evaluations conducted by the authors.
The use of a controlled capture geometry with overlapping images acquired along a regular grid, without intentional camera axis rotation, was motivated by earlier investigations reported by Moraes and da Silva, in which different acquisition strategies were systematically tested under similar laboratory conditions. In those experiments, no significant improvements in positional accuracy or model adjustment quality were observed when random camera orientations or additional rotations were introduced.
Given that the primary objective of the present study was to isolate and evaluate the effects of auxiliary lighting, artificial texture patterns, and image storage formats on SfM reconstruction quality, a controlled and repeatable acquisition strategy was adopted. This approach reduced additional sources of variability and facilitated direct comparison between experimental configurations.
In addition, the selected capture strategy reflects practical constraints commonly encountered in laboratory structural testing, where camera positioning is often limited by safety protocols, equipment layout, and restricted access around the specimen. Under these conditions, a regular and well-controlled acquisition geometry provides a robust and reproducible solution without compromising the validity of the comparative analyses performed.
Comment 6
Do the authors also have ground truth representations of the objects that are being scanned? Maybe captured through a different scanning method or having 3D CAD models. Normally, to better compare 3D reconstruction quality, a comparison to ground truth objects is advised. Then the distance between the objects can be calculated, and there can be a comparison where the errors are on the surface of the object, how much noise there is, etc. It will be a good idea to have such a comparison in the paper.
Response:
We thank the reviewer for this relevant and constructive suggestion. In the present study, a full geometric ground truth representation of the specimens, such as high-resolution 3D CAD models or reference scans obtained with alternative sensing technologies, was not available.
Instead, the assessment of reconstruction quality was based on physically measurable reference elements incorporated into the experimental setup. Scale bars and control bars with known lengths were positioned within the region of interest and measured using a high-precision digital caliper. These elements provided reliable reference values for evaluating positional accuracy through distance-based metrics, such as RMSE, which are commonly adopted in close-range photogrammetry and SfM-based studies.
This approach was selected because the primary objective of the research was not to perform a full surface-to-surface comparison between reconstructed models and an external ground truth, but rather to analyze the relative impact of different acquisition conditions, including lighting configurations, artificial texture patterns, and storage formats, under controlled and repeatable laboratory conditions.
While surface-based comparisons against a full geometric ground truth can provide additional insights into noise distribution and local deviations, implementing such analyses would require complementary sensing systems or reference models that were outside the scope of the present work. This limitation is now explicitly acknowledged in the manuscript.
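As a minimal sketch of the distance-based check described in this response, the RMSE between control-bar lengths measured in the reconstructed model and their caliper reference values could be computed as shown below. The function name `rmse` and all numeric values are hypothetical and serve only as an illustration; they are not data from the study, and the exact formulation used in the manuscript may differ.

```python
import math


def rmse(measured, reference):
    """Root-mean-square error between paired length values (same units)."""
    if len(measured) != len(reference) or not measured:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(m - r) ** 2 for m, r in zip(measured, reference)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical lengths in millimetres, for illustration only.
caliper_mm = [100.00, 150.00, 200.00]  # reference lengths from the caliper
model_mm = [100.03, 149.96, 200.00]    # same bars measured in the 3D model

error_mm = rmse(model_mm, caliper_mm)
print(f"RMSE = {error_mm:.4f} mm")
```

A single RMSE value per configuration makes it straightforward to rank lighting, texture, and storage-format setups against each other, although it does not show where on the surface the deviations occur.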
Comment 7
The authors are using a caliper to capture real-life distances. How many measurements were done for each place of measurement? Normally, when a human-led measurement like this is done, we need to know the standard deviation of the human error. These errors have been shown to propagate through the calculations and cause errors in the scaling of the 3D model.
Response:
We thank the reviewer for this important comment regarding measurement uncertainty and error propagation.
In the present study, each scale bar and control bar was measured five independent times using a high-precision digital caliper. The repeated measurements allowed the estimation of the mean value and the corresponding standard deviation for each reference length. The observed standard deviations were on the order of hundredths of a millimeter, which is consistent with the manufacturer’s specifications of the instrument and within the accuracy level required for the submillimeter analyses performed in this work.
These reference measurements were subsequently used for model scaling and accuracy assessment. By relying on averaged values obtained from repeated measurements, the influence of operator-induced variability was minimized. The resulting uncertainty associated with the physical measurements was therefore significantly smaller than the variations observed in the SfM reconstruction metrics, ensuring that the scaling process did not dominate the error budget of the 3D models.
This measurement strategy is consistent with common practices in close-range photogrammetry and laboratory-scale SfM studies, where repeated caliper measurements are used to control human-induced uncertainty and ensure reliable reference data for model evaluation.
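The repeated-measurement procedure described in this response can be illustrated with a short sketch. It assumes five caliper readings of one reference bar and uses the sample standard deviation (n - 1 denominator); the response does not state which estimator was used, so that choice, the helper name `summarize_readings`, and the readings themselves are hypothetical.

```python
import math
import statistics


def summarize_readings(readings_mm):
    """Return mean, sample standard deviation, and standard error of the mean."""
    n = len(readings_mm)
    mean = statistics.mean(readings_mm)
    std = statistics.stdev(readings_mm)  # sample estimator, n - 1 denominator
    sem = std / math.sqrt(n)             # uncertainty of the averaged length
    return mean, std, sem


# Hypothetical caliper readings in millimetres, for illustration only.
readings_mm = [150.01, 150.03, 149.99, 150.02, 150.00]

mean_mm, std_mm, sem_mm = summarize_readings(readings_mm)
relative_scale_uncertainty = sem_mm / mean_mm

print(f"mean = {mean_mm:.3f} mm, std = {std_mm:.4f} mm, sem = {sem_mm:.4f} mm")
print(f"relative scale uncertainty = {relative_scale_uncertainty:.2e}")
```

Comparing the standard error of the averaged reference length with the RMSE of the reconstruction gives a rough indication of whether the scaling step is a minor or a dominant term in the error budget.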
Comment 8
When applying the texture to the surfaces, the reviewer imagines it was done by hand. This can lead to errors and places in the 3D reconstruction that have noise or holes. Why wasn't something like a projector or a laser projector used to project the patterns on the surface? Then the uniformity of the pattern would be guaranteed.
Response:
We thank the reviewer for this relevant observation. The use of projected patterns, including digital or laser projectors, was considered during the experimental design phase of this research.
However, the adoption of projected textures was not pursued due to practical and methodological constraints associated with the laboratory environment and the objectives of the study. First, the use of projectors directly conflicted with the auxiliary lighting configurations under investigation. Since this work explicitly evaluates the influence of different lighting arrangements on SfM reconstruction quality, introducing a projected pattern would act as an additional and uncontrolled light source, altering the illumination distribution, contrast, and radiometric consistency of the scene. This would compromise the isolation of lighting-related variables that were central to the experimental design.
Second, the experimental protocol required a relatively large number of image acquisitions for each configuration, all performed using a single camera and under strict safety and access constraints typical of structural testing laboratories. During preliminary tests, maintaining consistent projection geometry and intensity over extended acquisition times proved challenging. Small changes in projector alignment, occlusions caused by camera repositioning, or gradual variations in projection intensity over time introduced inconsistencies that negatively affected feature detection and image matching.
In addition, the presence of reference elements such as scale bars and control bars with checkerboard patterns posed further challenges. Projected patterns interfered with the automatic detection of these reference targets, reducing their reliability for scaling and accuracy assessment.
For these reasons, manually applied artificial texture patterns were selected as a controlled and stable alternative. Although hand-applied patterns may introduce local variability, they provided consistent contrast throughout the acquisition process, did not interfere with the auxiliary lighting configurations, and ensured reliable detection of reference elements across all datasets.
To clarify this methodological choice, the revised manuscript explicitly defines artificial textures as manually applied patterns and acknowledges the trade-offs associated with alternative projection-based approaches.

Competing Interests

No competing interests were disclosed.

Reviewer Report

10 Feb 2025 | for Version 1

Aleksandar Ašonja, University Business Academy, Cvećarska, Serbia

Approved

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Industrial Engineering, Mechanical Engineering, Renewable Energy Sources, Agricultural Engineering, Energy and Environment.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, or sometimes more significant revisions, are required to address specific details and improve the paper's academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] Badano A, et al.: Consistency and standardization of color in medical imaging: a consensus report. J. Digit. Imaging. 2015; 28: 41–52.

[2] Caldera-Cordero JM, Polo ME: Analysis of free image-based modelling systems applied to support topographic measurements. Surv. Rev. 2018; 51(367): 300–309.

[3] Capéran P, et al.: Optical 3-dimensional measurements on a frp beam tested at serviceability limit. Compos. Struct. 2012; 94(12): 3465–3477.

[4] Company S: Ec799 electronic calipers 165. Athol, Massachusetts: Starret L.S; 2007.

[5] Creus P, Sanislav I, Dirks P: Application of sfm-mvs for mining geology: Capture set-up and automated processing using the dugald river znpb-ag mine as a case study. Eng. Geol. 2021; 293: 106314.

[6] Dauvin L, et al.: Optimization of temperature, targets, and illumination for high precision photogrammetric measurements. IEEE Sensors J. 2018; 18(4): 1449–1456.

[7] Detchev I, et al.: Deformation monitoring with off-the-shelf digital cameras for civil engineering fatigue testing. Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci. 2014; XL-5: 195–202.

[8] Farhadmanesh M, et al.: Highway asset and pavement condition management using mobile photogrammetry. Transp. Res. Rec. 2021; 2675(9): 296–307.

[9] Feng X, et al.: Surface design of wood-based board to imitate wood texture using 3d printing technology. Bioresources. 2019; 14(4): 8196–8211.

[10] Garcia MVY, Oliveira HC: The influence of flight configuration, camera calibration, and ground control points for digital terrain model and orthomosaic generation using unmanned aerial vehicles imagery. Boletim de ciências geodésicas. 2021; 27: e2021015.

[11] Hafeez J, et al.: The effect of patterns on image-based modelling of texture-less objects. Metrol. Meas. Syst. 2018; 25(4): 755–767.

[12] James MR, Robson S, Smith MW: 3-d uncertainty-based topographic change detection with structure-from-motion photogrammetry: precision maps for ground control and directly georeferenced surveys. Earth Surf. Process. Landf. 2017; 42(12): 1769–1788.

[13] Kanan C, Cottrell GW: Color-to-grayscale: does the method matter in image recognition? PLoS One. 2012; 7(1): e29740.

[14] Kwak E, et al.: Precise photogrammetric reconstruction using model-based image fitting for 3d beam deformation monitoring. J. Surv. Eng. 2013; 139(3): 143–155.

[15] Leon J, et al.: Measuring coral reef terrain roughness using ‘structure-from-motion’ close-range photogrammetry. Geomorphology. 2015; 242: 21–28. Geomorphology in the Geocomputing Landscape: GIS, DEMs, Spatial Analysis and statistics.

[16] Luhmann T, et al.: Close-range photogrammetry and 3d imaging. Berlin, Boston: De Gruyter; 2020.

[17] Lurie KL, et al.: 3d reconstruction of cystoscopy videos for comprehensive bladder records. Biomed. Opt. Express. 2017; 8(4): 2106–2123.

[18] Mishra SR, et al.: A simple image-based deformation measurement technique in tensile testing of geotextiles. Geosynth. Int. 2017; 24(3): 306–320.

[19] de Moraes FR, da Silva I: Assessment of submillimeter precision via structure from motion technique in close-range capture environments. arxiv preprint arxiv:2409.15602. 2024.

[20] de Moraes FR: Dataset of images SfM - FRM - Lighting and Artificial texture - JPG. 2024, November 19.

[21] Morgan JA, Brogan DJ, Nelson PA: Application of structure-from-motion photogrammetry in laboratory flumes. Geomorphology. 2017; 276: 125–143.

[22] Nielsen MS, et al.: Quantifying the influence of surface texture and shape on structure from motion 3d reconstructions. Sensors. 2023; 23(1).

[23] Nietiedt S, et al.: Accuracy investigations of image matching techniques by means of a textured dumbbell artefact. The international archives of the photogrammetry, remote sensing and spatial information sciences, XLIII-B2-2020. 2020; pp. 791–796.

[24] O’Connor J: Impact of image quality on sfm photogrammetry: colour, compression and noise. Kingston University; 2018. Thesis (PhD).

[25] Pena-Villasenin S, Gil-Docampo M, Ortiz-Sanz J: Professional sfm and tls vs a simple sfm photogrammetry for 3d modelling of rock art and radiance scaling shading in engraving detection. J. Cult. Herit. 2019; 37: 238–246.

[26] Reiss ML, Tommaselli AM: A low-cost 3d reconstruction system using a singleshot projection of a pattern matrix. Photogramm. Rec. 2011; 26(133): 91–110.

[27] Reznicek J, Luhmann T, Jepping C: Influence of raw image preprocessing and other selected processes on accuracy of close-range photogrammetric systems according to vdi 2634. The international archives of the photogrammetry, remote sensing and spatial information sciences, XLI-B5. 2016; pp. 107–113.

[28] Senevirathne D, et al.: Effects of pavement texture and colour on urban heat islands: An experimental study in tropical climate. Urban Clim. 2021; 40: 101024.

[29] Sudarsanan N, et al.: Digital image correlation technique for measurement of surface strains in reinforced asphalt concrete beams under fatigue loading. J. Mater. Civ. Eng. 2019; 31(8): 04019135.

[30] Tinkham WT, Swayze NC: Influence of agisoft metashape parameters on uas structure from motion individual tree detection from canopy height models. Forests. 2021; 12(2): 250.

[31] Verma AK, Bourke MC: A method based on structure-from-motion photogrammetry to generate sub-millimetre-resolution digital elevation models for investigating rock breakdown features. Earth Surf. Dyn. 2019; 7(1): 45–66.

[32] Wang T, et al.: Contrast enhancement-based preprocessing process to improve deep learning object task performance and results. Appl. Sci. 2023; 13(19): 10760.



Figure 8. Maximum and minimum values from the Covariance Matrix indicate the quality of adjustment for each set of 3D modeling of the concrete specimens, with best results in the CNB configuration - maximum value of 0.33 mm² and minimum one of 0.13 mm².