Automated Machine Learning (AutoML) for Multi-Model Data Fusion
Abstract
The development of autonomous vehicle (AV) technology depends heavily on sensor systems that provide accurate and reliable perception of the environment. This paper examines the integration of multi-model data fusion, here of visual and audio data, to improve AV perception performance. The research examines the limitations of single-sensor modalities, especially in edge scenarios such as occlusions, bad weather, or poor visibility, and proposes a multi-model fusion strategy to overcome these challenges. Using Automated Machine Learning (AutoML) methods, the system tunes the fusion model to improve accuracy, reduce false negatives, and increase precision for infrequent events. Experimental findings show that the fused model outperforms vision-only and audio-only systems, with a strong decrease in false negatives and a 12% boost in precision for identifying rare objects, including emergency vehicle sirens. The fusion system also meets real-time processing requirements, with a total latency of 32 ms. Robustness testing further shows that the fusion model performs consistently even in noisy environments. This research highlights the advantages of multi-sensor fusion and AutoML for autonomous vehicle systems and presents a path toward more resilient and flexible AV perception capabilities.

Keywords:
Autonomous Vehicles, Multi-model Fusion, Audio-Visual Perception, AutoML, Real-Time Object Detection.

1. Introduction
The emergence of Autonomous Vehicles (AVs) has revolutionized the transportation industry, aiming to enhance the safety, efficiency, and accessibility of driving. One of the integral aspects of AV technology is perceiving the external world through multiple sensors. Historically, AVs have depended on cameras, LiDAR, radar, and other sensors to capture the information needed for navigation and obstacle detection [1]. Yet each of these sensors represents the world differently, and the difficulty is how to fuse this information meaningfully to improve the AV's decision-making capabilities [12].
Human drivers intuitively use a fusion of senses (vision, hearing, and tactile sensation) to drive through intricate environments. This biological inspiration motivates multi-model data fusion in autonomous vehicles. Multi-model fusion is the integration of information from different sensors, including visual, audio, LiDAR, and radar, to produce a richer perception of the environment [8]. This fusion ensures the vehicle can operate in challenging conditions where depending on a single kind of sensor, such as vision, is not enough (e.g., low visibility, occlusions, or glare) [4].
One of the newer methods for enhancing the efficiency and effectiveness of multi-model fusion is the application of Automated Machine Learning (AutoML). AutoML automates model development processes such as hyperparameter tuning, feature selection, and model compression, all of which are essential for developing fusion systems that function well in real time [2]. AutoML algorithms simplify the intricate process of combining data from different modalities, enabling AVs to improve decision-making in dynamic environments where speed and precision are crucial [11].
2. The Role of AutoML in Multi-model Data Fusion
The multi-model data fusion process involves a number of challenges, especially in aligning and fusing heterogeneous data sources. AutoML plays a central role in automating the most important tasks that would otherwise need manual tuning and adjustment, thereby speeding up development and improving model accuracy [10].

2.1 Hyperparameter Optimization for Cross-model Alignment
In multi-model fusion, synchronizing information from various types of sensors is vital for sound decision-making. For instance, visual information from cameras, LiDAR point cloud data, and radar signals all capture the same environment but in unique ways [12]. In order to combine these data sources into one homogeneous output, the models must be properly tuned.
AutoML assists by automating the hyperparameter optimization process. Hyperparameters govern the structure and learning procedure of fusion models, e.g., the number of layers in a neural network, the choice of suitable feature extraction techniques, or the learning rate [14]. AutoML frameworks search for the best configuration automatically, thereby minimizing the need for manual experimentation and enabling better alignment of the various sensor modalities [1].

2.2 Feature Selection to Reduce Dimensionality
Multi-model data can be very complex and high-dimensional. For example:
• LiDAR sensors generate 3D point clouds
• Radar sensors provide distance and velocity measurements
• Visual data consists of high-resolution images
Merging all this data creates a vast amount of information that is not only difficult to handle but can also result in inefficiencies and computational overhead [11].
AutoML is central to feature selection, which serves to decrease the dimensionality of the data. By automatically selecting the most informative features from every sensor modality, AutoML ensures that only the most significant information is used in the fusion process [4]. This decreases computational expense, accelerates processing, and increases the accuracy of the fusion model by concentrating on the most informative features.

2.3 Model Compression for Real-Time Deployment
In autonomous cars, it is critical that the fusion models handle data in real time. The complexity of multi-model models can result in high computational requirements, which might be challenging to satisfy with the processing power of embedded systems in cars [2].
AutoML addresses this through model compression methods (a short pruning and quantization sketch follows this section):
• Pruning: eliminating redundant components of the model
• Quantization: reducing the precision of model parameters
• Knowledge distillation: transferring knowledge from large models to small ones
By automating model compression, AutoML enables the fusion system to run efficiently on resource-constrained devices without compromising performance [14].
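As an illustration of the hyperparameter search described in Section 2.1, the sketch below runs a minimal AutoML-style random search over a fusion model's learning rate, depth, and modality weighting. The search space, the train_and_evaluate stub, and its toy scoring rule are assumptions made for this example, not the configuration used in this study.

```python
import math
import random

# Illustrative search space for a vision+audio fusion model; the ranges are
# assumptions for this sketch, not the values used in the paper.
SEARCH_SPACE = {
    "learning_rate": (1e-4, 1e-2),   # sampled log-uniformly
    "num_layers": [2, 3, 4, 5],      # depth of the fusion head
    "vision_weight": (0.3, 0.9),     # audio weight is 1 - vision_weight
}

def sample_config():
    """Draw one random hyperparameter configuration from the search space."""
    lo, hi = SEARCH_SPACE["learning_rate"]
    return {
        "learning_rate": 10 ** random.uniform(math.log10(lo), math.log10(hi)),
        "num_layers": random.choice(SEARCH_SPACE["num_layers"]),
        "vision_weight": random.uniform(*SEARCH_SPACE["vision_weight"]),
    }

def train_and_evaluate(config):
    """Placeholder for training the fusion model and returning a validation
    score; a real implementation would fit the model on the fused data."""
    # Toy surrogate: rewards mid-range learning rates and balanced weights.
    return (1.0
            - abs(math.log10(config["learning_rate"]) + 3.0) * 0.1
            - abs(config["vision_weight"] - 0.6) * 0.2)

def random_search(n_trials=20):
    """Basic AutoML-style loop: try configurations, keep the best one."""
    best_score, best_config = float("-inf"), None
    for _ in range(n_trials):
        config = sample_config()
        score = train_and_evaluate(config)
        if score > best_score:
            best_score, best_config = score, config
    return best_config, best_score

if __name__ == "__main__":
    config, score = random_search()
    print("best config:", config, "score:", round(score, 3))
```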
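The pruning and quantization steps listed in Section 2.3 can be sketched with PyTorch's built-in utilities. The two-layer stand-in fusion head and the 30% sparsity target below are illustrative assumptions; they are not the compression settings used in this work.

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# Stand-in fusion head: a small MLP over concatenated vision+audio features.
# The layer sizes are illustrative only.
model = nn.Sequential(
    nn.Linear(256, 128),
    nn.ReLU(),
    nn.Linear(128, 4),   # e.g., car / pedestrian / siren / background
)

# Pruning: zero out the 30% smallest-magnitude weights in each Linear layer.
for module in model:
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # make the pruned weights permanent

# Quantization: convert Linear layers to int8 for faster CPU inference.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

# Smoke test on a dummy fused feature vector.
features = torch.randn(1, 256)
print(quantized(features).shape)  # torch.Size([1, 4])
```

Dynamic quantization of the linear layers is typically the cheapest compression step to try on CPU-bound embedded targets; knowledge distillation would require a separate teacher-student training loop and is not shown here.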
3. Challenges in Multi-model Fusion
While multi-model data fusion presents impressive promise, several challenges must be overcome for successful integration of different data sources [1]:

3.1 Sensor Heterogeneity
Sensors output data in fundamentally different forms:
• Cameras: 2D images
• LiDAR: 3D point clouds
• Radar: velocity measurements
Integrating these diverse data types requires advanced techniques such as manifold learning, which maps data from each modality into a common latent space for fusion [8]. AutoML expedites this by automatically selecting suitable models for learning unified representations [10].

3.2 Temporal Synchronization
Sensors operate at different sampling rates:
• Cameras: typically 30 FPS
• LiDAR/Radar: often 10-20 Hz
This temporal misalignment can cause fusion errors. AutoML automates time-warping techniques to align sensor timestamps, ensuring all data corresponds to the same time intervals [12].

3.3 Confidence Calibration
Sensor reliability varies with environmental conditions:
• Cameras: less reliable in low light
• Radar: more robust in adverse weather
AutoML handles confidence calibration by dynamically adjusting sensor weights based on real-time performance monitoring [11]. This ensures the fusion system prioritizes the most trustworthy data sources at any given moment [8].

4. Bayesian Fusion Framework
In multi-model fusion, perhaps the best approach for fusing information from disparate sensors is a Bayesian framework [4]. This probabilistic method enables the system to compensate for the uncertainty in sensor information and to make decisions based on the probability of different outcomes [1].
The Bayesian fusion model is expressed as:

P(Decision | x, y) = P(x | Decision) P(y | Decision) P(Decision) / P(x, y)

Where:
• P(Decision | x, y) is the posterior probability of a decision given the sensor evidence
• P(x | Decision) and P(y | Decision) are the sensor likelihood functions [12]
• P(x, y) is the joint probability of the multi-sensor data
• P(Decision) is the prior probability [11]
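A worked instance of the Bayesian rule in Section 4, with invented numbers purely for illustration: suppose the prior probability of an emergency vehicle being present is 0.05, the audio model's likelihood of the observed signal given that event is 0.9, and the vision model's likelihood of the observed image given the event is 0.4.

```python
# Illustrative numbers only; not measurements from the paper.
prior = {"emergency": 0.05, "no_emergency": 0.95}

# Likelihood of the observed audio (x) and image (y) under each hypothesis.
likelihood_audio = {"emergency": 0.90, "no_emergency": 0.05}
likelihood_vision = {"emergency": 0.40, "no_emergency": 0.10}

# Unnormalized posterior: P(x|D) * P(y|D) * P(D) for each decision D.
unnormalized = {
    d: likelihood_audio[d] * likelihood_vision[d] * prior[d]
    for d in prior
}
evidence = sum(unnormalized.values())  # P(x, y), the normalizing constant

posterior = {d: v / evidence for d, v in unnormalized.items()}
print(posterior)  # {'emergency': ~0.79, 'no_emergency': ~0.21}
```

Even though the vision likelihood alone is weak, combining it with the audio evidence pushes the posterior well above the prior, which is the behaviour the fusion system relies on for occluded emergency vehicles.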
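To illustrate the timestamp alignment problem from Section 3.2, the sketch below matches each camera frame to the nearest LiDAR sweep by timestamp and drops pairs whose offset exceeds a tolerance. The 30 FPS / 10 Hz streams and the 25 ms tolerance are assumptions for the example; the paper does not specify its alignment parameters.

```python
import numpy as np

# Simulated timestamps (seconds): camera at ~30 FPS, LiDAR at ~10 Hz.
camera_ts = np.arange(0.0, 2.0, 1 / 30)
lidar_ts = np.arange(0.0, 2.0, 1 / 10)

TOLERANCE = 0.025  # maximum allowed offset between paired samples (25 ms)

def align_streams(ts_a, ts_b, tolerance):
    """Pair each timestamp in ts_a with the nearest timestamp in ts_b,
    keeping only pairs that fall within the tolerance."""
    pairs = []
    for i, t in enumerate(ts_a):
        j = int(np.argmin(np.abs(ts_b - t)))   # nearest-neighbour match
        if abs(ts_b[j] - t) <= tolerance:
            pairs.append((i, j, float(ts_b[j] - t)))
    return pairs

matched = align_streams(camera_ts, lidar_ts, TOLERANCE)
print(f"{len(matched)} of {len(camera_ts)} camera frames matched to LiDAR sweeps")
```

An AutoML layer could tune the tolerance, or a learned time-warping function, rather than fixing it by hand.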
5. AutoML Optimization for Multi-model Fusion
The optimization objective for real-time fusion can be formulated as:

min over w, θ of:  Σ_j L(D(x_j, y_j; w, θ), Label_j) + λ∥θ∥²

Where:
• w represents the modality weighting factors [8]
• θ denotes the fusion model hyperparameters [14]
• L is the loss function measuring prediction accuracy
• λ controls the regularization strength [2]

6. Methodology
This section describes the methodology employed in this research for integrating multi-model data into autonomous vehicle (AV) perception systems. The methodology consists of several key stages, including data acquisition, multi-model fusion, and optimization procedures, all of which are crucial for enhancing the system's detection and classification performance in diverse conditions.

6.1 Data Acquisition
For this study, both visual and auditory datasets were utilized. The visual data was synthetically generated, while the auditory data was created with varying noise levels to simulate real-world environments. This data served as input to the respective modality-specific models, which were subsequently fused for enhanced detection capabilities.

6.1.1 Visual Data Generation
Visual data was synthesized with the Blender 3D rendering tool. This allowed for the creation of realistic scenes that represent typical autonomous driving environments, such as urban streets, highways, and intersections. Objects of interest in these scenes included vehicles, pedestrians, traffic signals, and emergency vehicles. Each image was rendered at a resolution of 1920x1080 pixels, providing high-quality input for the vision model.
In total, 10,000 images were generated, covering a broad spectrum of possible driving scenarios, including varying traffic densities, weather conditions (rain, fog), and lighting variations (daytime, nighttime). These images were used for training and testing the visual model designed for object detection and classification.

6.1.2 Audio Data Generation
The auditory data, specifically siren sounds, was created using the PyAudio library. The audio samples were generated at a sampling rate of 16 kHz, typical for real-time audio processing. Each audio clip lasted 5 seconds, mimicking emergency vehicle sirens encountered in an urban setting.
The generated audio samples were subjected to various noise levels, with signal-to-noise ratios (SNRs) ranging from 0 dB to 20 dB, simulating real-world conditions where background noise might interfere with the audio signal. This diversity of noise levels ensured that the system could handle a variety of auditory inputs under different environmental conditions.

6.2 Multi-model Fusion
The goal of this work is to combine visual and auditory data to improve decision-making accuracy, particularly in challenging scenarios where one modality may fail. The fusion approach used here relies on a weighted ensemble model, where the final output combines the individual contributions from the visual and auditory sensors in a shared latent space. In this formulation:
• ϕ_vision(x) is a transformation that projects visual data x into a shared latent space
• ϕ_audio(y) is a transformation that projects auditory data y into the same latent space
• R^d denotes the shared latent space in which both visual and auditory data are represented
By transforming both types of data into a common latent space, it becomes possible to combine them effectively for more accurate predictions.
Each modality i is also assigned a reliability score r_i derived from its detection performance, where:
• Precision_i is the precision of modality i, indicating the proportion of true positive predictions made by the modality
• Recall_i is the recall of modality i, representing the proportion of actual positives correctly detected by the modality
The reliability score r_i is used to adjust the weight w_i assigned to each modality during the fusion process. This ensures that more reliable modalities contribute more to the final decision.
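The SNR-controlled noise described in Section 6.1.2 can be reproduced with a few lines of NumPy. The synthetic siren below is a plain 700 Hz tone used only as a stand-in signal; the paper's actual siren waveforms and noise recordings are not reproduced here.

```python
import numpy as np

SAMPLE_RATE = 16_000          # 16 kHz, as in Section 6.1.2
DURATION = 5.0                # seconds per clip

def add_noise_at_snr(signal, snr_db):
    """Add white Gaussian noise scaled so the result has the requested SNR."""
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10 ** (snr_db / 10))
    noise = np.random.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise

# Stand-in "siren": a 700 Hz tone (illustrative only).
t = np.linspace(0.0, DURATION, int(SAMPLE_RATE * DURATION), endpoint=False)
siren = 0.5 * np.sin(2 * np.pi * 700 * t)

# Generate clips across the 0-20 dB SNR range used in the experiments.
clips = {snr: add_noise_at_snr(siren, snr) for snr in range(0, 21, 5)}
print({snr: round(float(np.std(clip)), 3) for snr, clip in clips.items()})
```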
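One simple instantiation of the weighted ensemble described in Section 6.2 is a convex combination of the two latent projections followed by a shared classifier. The projection dimensions, the softmax decision head, and the example weights below are assumptions for this sketch; the paper does not spell out its exact fusion equation.

```python
import numpy as np

D_LATENT = 64  # assumed dimension of the shared latent space R^d

rng = np.random.default_rng(0)
# Stand-ins for the learned projections phi_vision and phi_audio:
# here, fixed random linear maps from each modality's feature space to R^d.
W_vision = rng.normal(size=(D_LATENT, 256))   # 256-dim visual features (assumed)
W_audio = rng.normal(size=(D_LATENT, 128))    # 128-dim audio features (assumed)
W_head = rng.normal(size=(3, D_LATENT))       # classifier over 3 classes (assumed)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def fuse(x_vision, y_audio, w_vision=0.6, w_audio=0.4):
    """Weighted ensemble in the shared latent space:
    z = w_v * phi_vision(x) + w_a * phi_audio(y), then a shared decision head."""
    z = w_vision * (W_vision @ x_vision) + w_audio * (W_audio @ y_audio)
    return softmax(W_head @ z)

probs = fuse(rng.normal(size=256), rng.normal(size=128))
print(probs, probs.sum())  # class probabilities summing to 1
```

In the full system the weights w_i would be set from the reliability scores r_i and tuned by AutoML rather than fixed by hand.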
6.3 Optimization with AutoML
The optimization of the fusion model is performed using AutoML techniques. The goal is to automatically find the optimal weights w_i and hyperparameters θ for the fusion model by minimizing the following objective function:

min over w, θ of:  Σ_j L(D(x_j, y_j; w, θ), Label_j) + λ∥θ∥²

Where:
• L(·,·) is the loss function used to quantify the error between the predicted output and the true label
• D(x_j, y_j; w, θ) is the decision function for the j-th sample, incorporating both visual and auditory data
• λ is the regularization parameter that controls the complexity of the model and prevents overfitting
• θ represents the hyperparameters of the model, such as learning rates and filter sizes
Through AutoML, the optimal combination of weights and hyperparameters is determined automatically, allowing for efficient training of the fusion model.
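A minimal sketch of the objective in Section 6.3, assuming a cross-entropy loss and a grid search over the modality weight; the synthetic dataset, the candidate weights, and the fixed regularization strength are invented for illustration only.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy validation set: per-modality class probabilities and true labels.
# Purely synthetic; stands in for the outputs of the vision and audio models.
N_CLASSES = 3
vision_probs = rng.dirichlet(np.ones(N_CLASSES), size=200)
audio_probs = rng.dirichlet(np.ones(N_CLASSES), size=200)
labels = rng.integers(0, N_CLASSES, size=200)

LAMBDA = 1e-3  # regularization strength (assumed)

def objective(w_vision, theta):
    """Sum_j L(D(x_j, y_j; w, theta), Label_j) + lambda * ||theta||^2,
    with D as a simple probability blend and L as cross-entropy."""
    fused = w_vision * vision_probs + (1.0 - w_vision) * audio_probs
    nll = -np.log(fused[np.arange(len(labels)), labels] + 1e-12).sum()
    return nll + LAMBDA * float(np.dot(theta, theta))

# AutoML stand-in: exhaustive search over a small grid of weights and a
# single illustrative hyperparameter vector theta.
theta = np.array([0.1, 0.5])
best = min(((objective(w, theta), w) for w in np.linspace(0.0, 1.0, 11)),
           key=lambda t: t[0])
print(f"best vision weight: {best[1]:.1f}, objective: {best[0]:.2f}")
```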
7. Experiments and Results
The experiments evaluate the proposed fusion system under challenging conditions for autonomous vehicles (AVs). They include both synthetic data generation and evaluations of sensor performance, fusion accuracy, real-time processing, and robustness under noise.

7.1 Synthetic Data Generation
We generated synthetic datasets for both visual and auditory inputs to test the fusion system under controlled, replicable conditions.

7.1.1 Visual Data Generation
The visual data was synthesized using the Blender 3D rendering platform. The generated scenes included a range of objects typically encountered by AVs, such as cars, pedestrians, and emergency vehicles like ambulances. These objects were embedded in different types of environments, with various weather conditions such as rain and fog to simulate low-visibility scenarios.
The dataset included 10,000 images, each with a resolution of 1920x1080 pixels. These images were annotated to identify the presence and location of key objects. The diversity of the scenes was intentionally varied to include complex backgrounds, occlusions, and changes in lighting conditions to mirror real-world driving situations.

7.2 Modality-Specific Models

7.2.1 Vision Model (YOLOv8)
We applied the YOLOv8 object detection algorithm to the synthetic visual dataset. YOLOv8 was chosen for its ability to perform real-time object detection with high accuracy. The evaluation metric used was mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5:
mAP@0.5 = 0.85
From the confusion matrix, we observed the following precision and recall values for the various classes (see Fig. 1):
Class         Precision   Recall
Car           92%         88%
Pedestrian    81%         79%
Siren         78%         72%

Fig. 1: Combined precision and recall metrics by object class. The outer ring shows precision values (Car: 92%, Pedestrian: 81%, Siren: 78%); the inner ring shows recall values (88%, 79%, and 72%, respectively).

These results highlight the strengths of the model in detecting cars and pedestrians but also point to room for improvement in detecting sirens, where auditory input could provide a valuable complement.

7.2.2 Audio Model (CNN)
The audio model was a convolutional neural network (CNN) designed to classify siren sounds. The model was trained on spectrograms of the audio clips, and the network consisted of five convolutional layers. Several performance metrics were calculated for the audio model:
Accuracy = 82% (F1-score = 0.80)
ROC-AUC = 0.89
The CNN performed relatively well at detecting siren sounds but faced challenges in distinguishing them from other types of background noise, especially in scenarios with low SNR.

7.3 Fusion Performance
The fused model was evaluated against the individual modalities, where TP_vision and TP_audio are the true positives from each individual model, TP_both represents the true positives detected by both models, and N is the total number of samples in the test set.
The fusion model resulted in the following improvements:
• A 15% reduction in false negatives compared to the vision-only model
• A 12% increase in precision for rare classes, such as sirens
This highlights how combining complementary sensor data can improve detection accuracy, especially for rare or challenging events.

7.4 Real-Time Processing
Real-time processing is a key requirement for autonomous vehicle systems, where timely decision-making is essential for safe navigation. The total processing time of the fusion system was measured and compared to the real-time requirements of an AV.
The total latency was computed as the sum of the individual latencies for the vision model, the audio model, and the fusion process:
Total Latency = t_vision + t_audio + t_fusion = 15 ms + 10 ms + 7 ms = 32 ms
The breakdown of latencies is as follows:
• YOLOv8 (vision model): 15 ms (optimized using TensorRT)
• Audio CNN: 10 ms (optimized using ONNX Runtime)
• Fusion: 7 ms (performed using matrix operations on the GPU)
With a total processing time of 32 ms, the fusion system meets the latency requirements for real-time AV systems, which typically need to operate under 100 ms.
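The latency budget in Section 7.4 can be checked with a simple timing harness like the one below. The sleep-based stand-ins for the three stages only mimic the reported 15 ms / 10 ms / 7 ms figures; in a real deployment they would be replaced by the TensorRT, ONNX Runtime, and GPU fusion calls.

```python
import time

def vision_stage():
    time.sleep(0.015)   # stand-in for the TensorRT-optimized YOLOv8 pass

def audio_stage():
    time.sleep(0.010)   # stand-in for the ONNX Runtime audio CNN pass

def fusion_stage():
    time.sleep(0.007)   # stand-in for the GPU fusion step

def timed(stage):
    """Run one stage and return its wall-clock latency in milliseconds."""
    start = time.perf_counter()
    stage()
    return (time.perf_counter() - start) * 1000.0

latencies = {s.__name__: timed(s) for s in (vision_stage, audio_stage, fusion_stage)}
total = sum(latencies.values())
print(latencies, f"total = {total:.1f} ms", "OK" if total < 100 else "over budget")
```

The paper reports the sequential sum of the three stages; running the vision and audio stages in parallel would only tighten this budget further.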
7.5 Robustness Analysis
We tested the robustness of the fusion system by adding Gaussian noise to the input data, simulating noisy environmental conditions. The noise was modeled as:
x_noisy = x + N(0, σ²), σ ∈ [0, 20]
where σ is the standard deviation of the Gaussian noise. The fusion system's performance was evaluated at different noise levels, and the fusion model demonstrated superior resilience to noise compared to the individual modalities. This indicates that combining the vision and audio data helps mitigate the impact of noise and provides a more reliable output.

Figure: Accuracy comparison by noise level.

7.7 Failure Modes
We identified two primary failure modes during testing:
• High noise levels: when noise levels exceeded 20 dB, the fusion system's performance deteriorated to that of the vision-only model. This suggests that, under extreme noise conditions, the audio modality no longer provides a significant benefit.
• Temporal misalignment: significant delays (greater than 50 ms) between the visual and audio data led to an 8% decrease in accuracy. This demonstrates the importance of precise temporal synchronization for optimal fusion performance.

7.8 Computational Cost
Finally, we assessed the computational cost of the fusion system by calculating the number of floating-point operations (FLOPs) required for each component. The breakdown is as follows:
• Vision: 45 GFLOPs/frame
• Audio: 3 GFLOPs/clip
• Fusion: 0.5 GFLOPs
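Returning to the robustness protocol of Section 7.5, the sketch below shows an evaluation loop in which Gaussian noise of increasing standard deviation is added to the inputs before scoring a model. The predict stub and the synthetic test batch are placeholders; only the noise model x_noisy = x + N(0, σ²) follows the paper.

```python
import numpy as np

rng = np.random.default_rng(7)
images = rng.uniform(0.0, 255.0, size=(50, 32, 32))   # toy stand-in test batch
labels = rng.integers(0, 2, size=50)

def predict_stub(batch):
    """Placeholder classifier: thresholds the mean pixel value. A real run
    would call the vision-only, audio-only, or fused model here."""
    return (batch.mean(axis=(1, 2)) > 127.5).astype(int)

def accuracy_under_noise(sigma):
    noisy = images + rng.normal(0.0, sigma, size=images.shape)  # x + N(0, sigma^2)
    return float((predict_stub(noisy) == labels).mean())

for sigma in (0, 5, 10, 15, 20):   # sigma in [0, 20] as in Section 7.5
    print(f"sigma={sigma:2d}  accuracy={accuracy_under_noise(sigma):.2f}")
```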
8. Future Work

8.1 Integration of additional sensor modalities
Beyond the vision and audio data studied here, other sensor modalities could strengthen the fusion system. LiDAR, for example, provides precise depth data that helps distinguish between objects in poor conditions such as fog, rain, or night driving. Similarly, tactile sensors might provide feedback for vehicles moving through constricted areas or responding to shifts in the road surface. Subsequent research can explore the integration of these other modalities with vision and audio sensors by leveraging state-of-the-art machine learning algorithms, e.g., deep reinforcement learning or attention mechanisms, to assess the relative importance of each sensor depending on the environment.

8.2 Improved sensor integration methods
This research used a fusion model that averaged visual and audio data through a straightforward weighted ensemble method. More advanced fusion methods, however, may yield better results, especially with difficult datasets coming from different sources. Methods such as attention-based mechanisms, which focus on certain sensor inputs based on context, and multi-task learning methodologies that share knowledge across different types of sensors may provide more intelligent ways of merging the data streams.
In addition, innovative methods for the alignment and synchronization of temporal data received from different sensors are needed to support data that arrives at different rates. Investigating more sophisticated time-warping techniques, as well as eliminating issues associated with sensor drift and delay, will be essential for real-time use in dynamic, complicated environments.

8.3 Handling extreme environmental conditions
The present system was tested in a controlled environment, where it was subjected to varying noise levels and to conditions such as occlusions and glare. However, real-world environments present a much broader spectrum of challenges than the laboratory. Autonomous vehicles will need to cope with various adverse conditions, such as driving on rainy or snowy days, in direct sunlight, and through densely populated cityscapes with many moving objects. Future work should center on assessing the performance of multi-model fusion systems under extreme weather conditions. This may include the development of simulation environments that are closer to actual conditions, or obtaining data from self-driving cars driven in different weather and traffic scenarios. It is also important to improve the robustness of fusion models to sudden changes in illumination, noise, and object motion, since this will be critical for real-world deployment.

8.4 Real-time adaptation and learning
One promising field of future research is the development of real-time adaptive systems capable of learning from the environment as the vehicle is driven. With machine learning algorithms such as online learning and meta-learning, the fusion system could dynamically adapt and improve its performance as it accumulates more data in real time. For instance, the fusion system may adjust the weighting of modalities according to sensor reliability, which might vary with road conditions or traffic situations [17]. This capability can also be used for sensor configuration optimization, allowing the vehicle to turn specific sensors on or off (e.g., reducing the use of audio in quiet conditions) depending on the situation. This would not only improve performance but also conserve computational resources [13].
While multi-model fusion has brought progress in accuracy and resilience, computational efficiency remains an issue, especially in real-time systems. The current fusion system is computationally intensive, especially in its vision and audio components. Reducing the floating-point operations (FLOPs) required for computation while maintaining performance remains a major challenge [16]. Future work may focus on developing more efficient fusion algorithms or leveraging breakthroughs in hardware, including edge computing, domain-specific AI chips, or low-power sensors. Pruning or quantization can also be used to reduce the size and computational needs of deep learning models, making them more deployable on embedded AV systems [15].
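As a rough sketch of the reliability-driven weight adaptation mentioned in Section 8.4, the snippet below keeps an exponential moving average of each modality's recent agreement with the final decision and renormalizes the fusion weights from it. The update rule and its smoothing factor are illustrative assumptions, not a mechanism evaluated in this paper.

```python
ALPHA = 0.1  # smoothing factor for the reliability estimate (assumed)

class AdaptiveWeights:
    """Track per-modality reliability online and derive fusion weights."""

    def __init__(self, modalities):
        self.reliability = {m: 0.5 for m in modalities}  # neutral prior

    def update(self, agreed):
        """`agreed` maps modality -> True/False: did it match the fused decision?"""
        for m, ok in agreed.items():
            self.reliability[m] = (1 - ALPHA) * self.reliability[m] + ALPHA * float(ok)

    def weights(self):
        total = sum(self.reliability.values())
        return {m: r / total for m, r in self.reliability.items()}

w = AdaptiveWeights(["vision", "audio"])
# Simulated feedback: vision keeps agreeing with the fused output, audio does
# not (e.g., a stretch of very noisy audio).
for _ in range(10):
    w.update({"vision": True, "audio": False})
print(w.weights())  # vision weight drifts upward, audio weight downward
```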
8.5 Improved certainty estimation
The need for dynamic confidence calibration across modalities brings to the forefront the necessity of sustaining trustworthy fusion in ambiguous or uncertain situations. The experiments clearly indicate that the performance of the system depends on the stability of the vision and audio sensors, which may be influenced by various environmental factors. More sophisticated methods of confidence calibration, including Bayesian inference and uncertainty modeling, can be explored in future research to dynamically adjust the fusion process [9]. In addition, confidence scores could be used more effectively to inform decision-making, allowing the AV to make better decisions when presented with incongruent sensor information, for example, where the audio model perceives a siren but the vision model fails to perceive the source owing to occlusions [6].

8.6 Safety and ethical considerations
As autonomous vehicles are deployed in practice, their safety and ethical implications become increasingly important. Multi-model sensor fusion can contribute to safety by providing additional layers of information, but it also raises new issues with data privacy, transparency of decision-making, and accountability. The incorporation of additional sensors and data sources necessitates the development of strong ethical frameworks to guarantee that these systems function fairly, transparently, and in accordance with legal and regulatory requirements [7]. Future studies should address these issues by developing systems that not only excel at fusing various sensory inputs but also enable users and stakeholders to comprehend the system's decision-making processes. This can be done through the development of explainable AI (XAI) methods tailored for multi-model fusion systems in AVs [5].

8.7 Integration with urban mobility systems
The long-term goal of AV technology is to create a smooth transportation system that maximizes efficiency and safety in cities. This paper focuses mainly on the sensory aspect of AV systems, but further research might consider how multi-model fusion systems fit into the broader idea of smart cities. This includes connecting autonomous vehicles with other transportation systems, including public transit and traffic management systems, to enable cooperative decision-making. AV collaboration may involve sharing sensory data or coordinating actions in real time, most notably in complex scenarios such as intersection control, emergency response, or avoiding pedestrian collisions. Future studies may explore how multi-model fusion systems can be extended to allow vehicles to interact with other vehicles or infrastructure in real time [3].
9. Conclusion
This study focused on the integration of multiple sensor modalities, particularly visual and auditory data, to enhance the perception systems of autonomous vehicles (AVs). The primary aim was to explore how combining different types of sensory data could improve the vehicle's ability to understand its environment, especially in complex scenarios where a single sensor modality might fall short. The results indicate that multi-model fusion offers a viable solution to several challenges faced by AVs, including situations involving visual occlusions, glare, or difficult weather conditions.
The experimental results demonstrated significant performance improvements when combining vision and audio. The visual model, YOLOv8, achieved a mean average precision (mAP) of 0.85, while the auditory model, a convolutional neural network (CNN), yielded an accuracy of 82%. When fused, these models resulted in a 15% reduction in false negatives compared to the vision-only model and a 12% increase in precision, particularly for rare events such as emergency sirens. Additionally, the fusion system was able to meet the real-time processing requirements with a total latency of just 32 ms, showing its practical feasibility for autonomous driving. Despite the introduction of noise, the fusion system demonstrated robust performance, maintaining its accuracy even as the signal-to-noise ratio decreased.
The findings of this research underline the importance of multi-sensor integration in autonomous vehicle systems. By combining data from both visual and auditory sources, the system gains a richer understanding of the environment, which improves its decision-making capabilities in more challenging conditions. AutoML techniques were used to optimize the fusion models, which ensures that the system is adaptable to a variety of sensor configurations and dynamic environmental conditions. Audio sensors, being less affected by environmental factors like fog or poor lighting, provide a complementary strength to the visual sensors, making the fusion system more reliable.
While the results are promising, there are still several avenues for future work. For example, expanding the fusion framework to incorporate other sensor types, such as radar or thermal imaging, could further improve robustness. These additional sensors would be particularly useful in scenarios where visual and auditory sensors may not provide sufficient data, such as in extreme weather conditions. Additionally, the experiments conducted in this study relied on synthetic data, and future research should focus on testing the system with real-world sensor data to ensure its practical viability in real autonomous vehicles operating in live traffic.
Further advancements in AutoML could also play a crucial role in the continuous adaptation of autonomous systems. By integrating mechanisms like online learning, the fusion model could adjust dynamically as new data is acquired, optimizing the system in real time. Lastly, there is room to improve the computational efficiency of the fusion system. While the system demonstrated satisfactory latency and accuracy, optimizing the model to reduce computational overhead will be crucial for deployment on embedded platforms with limited resources. Exploring model compression techniques, such as pruning or knowledge distillation, could help address this challenge and make the system more feasible for real-world applications.
In summary, multi-model data fusion for autonomous vehicle perception has shown great potential in enhancing both the accuracy and resilience of the system. By integrating vision and auditory data, the AV system can overcome the limitations of individual sensors and perform better in challenging environments. The use of AutoML optimization further ensures the system's ability to adapt to varying sensor configurations, making it a promising candidate for real-world autonomous vehicles. As research continues, further testing with real-world data, the inclusion of additional sensor modalities, and computational optimizations will be critical steps in bringing robust, multi-sensor autonomous driving systems closer to deployment.

References
[1] F. Butt, J. Chattha, J. Ahmad, M. Zia, M. Rizwan, and I. Naqvi. On the integration of enabling wireless technologies and sensor fusion for next-generation connected and autonomous vehicles. IEEE Access, 10:14643–14668, 2022.
[2] J. Gu, A. Lind, T. Chhetri, M. Bellone, and R. Sell. End-to-end multimodal sensor dataset collection framework for autonomous vehicles. In 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), pages 2792–2797, Bilbao, Spain, 2023. IEEE.
[3] Zhiren Huang, Ximan Ling, Pu Wang, Fan Zhang, Yingping Mao, Tao Lin, and Fei-Yue Wang. Modeling real-time human mobility based on mobile phone and transportation data fusion. Transportation Research Part C: Emerging Technologies, 96:251–269, 2018.
[4] Y. Li, Z. Zhao, Y. Chen, and R. Tian. A practical large-scale roadside multi-view multi-sensor spatial synchronization framework for intelligent transportation systems. TechRxiv, 2023.
[5] Yanfang Ling, Jiyong Li, Lingbo Li, and Shangsong Liang. Bayesian domain adaptation with Gaussian mixture domain-indexing. Advances in Neural Information Processing Systems, 37:87226–87254, 2024.
[6] Tyron L. Louw, Natasha Merat, and Andrew Hamish Jamson. Engaging with highly automated driving: To be or not to be in the loop? In 8th International Driving Symposium on Human Factors in Driver Assessment, Training and Vehicle Design, Leeds, 2015.
[7] Tauheed Khan Mohd, Nicole Nguyen, and Ahmad Y. Javaid. Multi-modal data fusion in enhancing human-machine interaction for robotic applications: a survey. arXiv preprint arXiv:2202.07732, 2022.
[8] R. Nabati, L. Harris, and H. Qi. CFTrack: Center-based radar and camera fusion for 3D multi-object tracking. arXiv preprint arXiv:2107.05150, 2021.
[9] R. Nabati and H. Qi. CenterFusion: Center-based radar and camera fusion for 3D object detection. In IEEE WACV, pages 1527–1536, 2021.
[10] N. Piperigkos, A. Lalos, and K. Berberidis. Graph Laplacian extended Kalman filter for connected and automated vehicles localization. In IEEE ICPS, pages 328–333, 2021.
[11] D. Qiao and F. Zulkernine. Adaptive feature fusion for cooperative perception using LiDAR point clouds. In IEEE WACV, 2023.
[12] C. Wang, S. Liu, X. Wang, and X. Lan. Time synchronization and space registration of roadside LiDAR and camera. Electronics, 12(3):537, 2023.
[13] Haojie Wang, Jidong Zhai, Mingyu Gao, Feng Zhang, Tuowei Wang, Zixuan Ma, Shizhi Tang, Liyan Zheng, Wen Wang, Kaiyuan Rong, et al. Optimizing DNNs with partially equivalent transformations and automated corrections. IEEE Transactions on Computers, 72(12):3546–3560, 2023.
[14] A. Yusupov, S. Park, and J. Kim. Synchronized delay measurement of multi-stream analysis over data concentrator units. Electronics, 14(1):81, 2024.
[15] Zhaoyun Zhang and Jingpeng Li. A review of
artificial intelligence in embedded systems.
Micromachines, 14(5):897, 2023.
[16] Fei Zhao, Chengcui Zhang, and Baocheng Geng. Deep multimodal data fusion. ACM Computing Surveys, 56(9):1–36, 2024.
[17] Hao Zhao, Yuejiang Liu, Alexandre Alahi, and
Tao Lin. On pitfalls of test-time adaptation. arXiv
preprint arXiv:2306.03536, 2023.