ChatTracer: Large Language Model Powered Real-time Bluetooth Device Tracking System

Read original: arXiv:2403.19833 - Published 7/10/2024 by Qijun Wang, Shichen Zhang, Kunzhe Song, Huacheng Zeng

ChatTracer: Large Language Model Powered Real-time Bluetooth Device Tracking System

Introduction

The paper introduces ChatTracer, a first-of-its-kind system that combines large language models (LLMs) with wireless sensor networks (WSNs) to provide real-time Bluetooth device tracking. ChatTracer uses an array of radio sniffing nodes to listen to Bluetooth signals emitted by nearby devices. It processes the received Bluetooth packets to extract physical and payload features, which are then fed into an LLM to generate human-like textual responses to user queries.

The authors conducted extensive measurements involving 60 popular Bluetooth devices and over 100 million BLE advertisement packets. They found that Android devices broadcast at least 120 BLE packets per minute, while Apple devices transmit more aggressively at a higher power, with most devices sending 300–1500 packets per minute.

To implement ChatTracer, the authors addressed two main challenges: grouping data packets from individual Bluetooth devices and fine-tuning the LLM for the localization application. They used a combination of physical features (time gap, AoA, RSS, CFO) and payload features (vendor, model, status, activity) to search for the source device in the database. For fine-tuning the LLM, they employed supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF).

The authors built a prototype of ChatTracer and evaluated its performance in three scenarios: an apartment, a laboratory, and a shopping mall. ChatTracer outperformed existing model-based and learning-based localization approaches, achieving median localization errors of 41 cm, 58 cm, and 98 cm in the respective scenarios. It also provides a ChatGPT-like interface for users to access real-time sensory information.

Understanding Bluetooth

The paper discusses the widespread use of Bluetooth Low Energy (BLE) technology in mobile devices and how it can be used for tracking locations and activities of these devices. By the end of 2024, it is estimated that 5.9 billion Bluetooth devices will be shipped. ChatTracer, a BLE-based tracking system, listens to BLE signals from nearby devices to track their locations in a non-invasive manner.

The paper explains the differences between Bluetooth Classic (BTC) and Bluetooth Low Energy (BLE), focusing on BLE Advertisement (Adv) packets. These packets are fundamental to the Bluetooth protocol and are used for device discovery, connection initiation, and efficient communication between Bluetooth devices. The format and data fields of BLE Adv packets have been standardized.

The authors conducted extensive experiments to understand the BLE Adv packets from Apple devices. They found that all Apple devices broadcast BLE Adv packets at a high rate, even when inactive or powered off. By decoding the Apple Continuity Messages (ACM) in the BLE Adv packets, a sniffing node can profile an Apple device's activity and user's activity.

Android devices also transmit BLE Adv packets, but not as many types as Apple devices. Measurements showed that Android devices broadcast a significant number of BLE packets per minute, both when in use and not in use. Other companies, such as Sony and Microsoft, also have their own implementations of Bluetooth chipsets, confirming the widespread availability of BLE Adv packets among mobile devices from different vendors.

ChatTracer: System Architecture

Figure 4. ChatTracer’s system architecture.

ChatTracer is a Bluetooth device tracking system that uses distributed radio receivers to sniff Bluetooth Low Energy (BLE) advertisement packets. The system extracts physical-layer features such as timestamp, received signal strength, carrier frequency offset, and angle of arrival from the packets. It also decodes the payload to obtain information about the device model, status, and activities. The packets are then grouped based on their device ID and stored in a database.

ChatTracer employs a fine-tuned large language model, Mistral-7B, which interacts with users through human-like text. The model is fine-tuned to improve its performance for this specific application.

The combination of device location, status, and activities obtained through ChatTracer can be used to profile people's activities in a non-invasive manner. The system has potential applications in various settings such as shopping malls, airports, libraries, and supermarkets.

BLE Packet Grouping

The paper proposes a new grouping strategy for identifying the source device of a received BLE packet in a database. The strategy combines physical and payload features of the packet.

Key points:

BLE packets from the same device are well-aligned in time, with a small error caused by clock jitters. However, using only time alignment for grouping can lead to errors.
The proposed grouping strategy has three steps: a. Adv_address matching: If the packet's adv_address matches an existing device, it is grouped with that device. b. Payload feature matching: Fixed features like vendor, model, and color are used to exclude impossible devices and reduce the search space. c. Feature weighted scoring: Physical-layer features (CFO, AoA, RSS) are combined to determine the source device using a weighted scoring system.
If the highest score is below a predefined threshold, a new device ID is created in the database for the packet. Otherwise, the packet is inserted under the device with the highest score.

The strategy aims to achieve reliable packet grouping by utilizing multiple features, as relying solely on time alignment or individual features can lead to errors.

Fine-Tuning LLM for ChatTracer

The paper discusses ChatTracer, which uses the Mistral-7B language model as its base. Mistral-7B's tasks include understanding user queries, inferring device location from BLE packet features, generating device trajectories, and providing device status and activity information. The model does not require fine-tuning for understanding user queries and providing device information, as it has been pre-trained for these tasks. However, fine-tuning is necessary for inferring device location and generating trajectories, which involve reasoning, regression, denoising, and outlier removal. The paper employs two fine-tuning techniques: SFT (Supervised Fine-Tuning) to improve localization accuracy and RLHF (Reinforcement Learning with Human Feedback) to enhance overall performance. The fine-tuning strategy is illustrated in a diagram in Figure 6.

Figure 6. ChatTracer’s fine-tuning diagram.

The paper describes the process of fine-tuning a large language model called Mistral-7B for localization and tracking tasks using supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). The SFT process involves collecting data from various scenarios, annotating the data using contextual templates, and fine-tuning the model using the annotated data. Few-shot chain-of-thought (CoT) reasoning is also employed to enhance the model's reasoning capabilities.

To address issues observed during testing, such as refused answers, performance degradation in unseen scenarios, and hallucinations, the authors employ RLHF. This process involves training a reward model (RM) using human annotations to evaluate the output of the fine-tuned model. The RM is then used to improve the overall performance of the model through reinforcement learning using the Proximal Policy Optimization (PPO) algorithm.

The paper provides details on the training process, including the loss functions, hyperparameters, and optimization techniques used for both the SFT and RLHF stages. Through this process, the domain knowledge and reasoning capability of humans can be transferred to the large language model for localization and tracking tasks.

Implementation

The Bluetooth sniffing node for ChatTracer uses a BladeRF 2.0 micro xA4 software-defined radio device, which supports a 61.44MHz sampling rate, 2×2 MIMO channels, and USB 3.0 connections. The device is equipped with a BT-200 Bias-Tee Low Noise Amplifier and two 12 dBi omnidirectional antennas for its receiving RF chains. The BladeRF converts the radio signal to baseband I/Q signals and sends them to an Intel NUC mini PC, which extracts the physical and payload features of identified BLE packets. The mini PC sends the BLE packet features and a timestamp to a MySQL database through the public Internet. The sniffing node has a 20-meter radius for BLE packet detection.

The trained language model (Mistral-7B) was fine-tuned on a computer server using four Nvidia V100 32GB GPUs and deployed on a MacBook Pro for real-time inference. End users can interact with the model via the Internet.

The project website will provide downloadable source code, including a C++ software suite for decoding Bluetooth radio signals from SDR devices and extracting BLE packet features, optimized to work with various SDR devices. The fine-tuned Mistral-7B model will also be published for public access.

(a) Apartment.

Performance Evaluation

The section conducts experiments to evaluate ChatTracer's performance in AoA estimation, localization accuracy compared to state-of-the-art Bluetooth localization approaches, and realistic application scenarios and unseen scenes.

Experiments were conducted in an apartment, laboratory, and shopping mall. The average AoA estimation error was assessed at different device orientations, distances, and moving speeds. ChatTracer outperformed baseline approaches GC-Loc, W-GAN, and AoA Triangulation with Kalman filter in localization accuracy. ChatTracer's median error was 41 cm, 58 cm, and 98 cm in the apartment, laboratory, and shopping mall, respectively, lower than the baseline approaches.

ChatTracer was deployed in a university dining hall to examine its practicality and generalizability. It accurately counted new visitors, analyzed customer tastes at food stations, and tracked individuals in unseen scenarios. ChatTracer also generated data summaries to assist in visitor flow analysis.

Ablation studies showed that Chain-of-Thought (CoT) reasoning and Reinforcement Learning from Human Feedback (RLHF) improved ChatTracer's performance in seen and unseen scenarios. Environmental layout data significantly improved localization accuracy, while device model and activity information reduced hallucination rate and increased answer rate.

Related Work

The paper discusses the advancements in large language models (LLMs) and the growing interest in open-source alternatives like LLaMA, Vicuna, and Mistral. Fine-tuning LLMs is a crucial strategy for customizing these models to suit specific tasks across diverse fields, including medicine, multimodality, and autonomous driving. The authors highlight that while LLMs have been applied in many areas, they have not been studied with wireless sensor networks (WSNs). Their work differs from existing literature by creating a new application of LLMs and demonstrating it in realistic scenarios.

The paper also explores Bluetooth-based indoor localization, which has gained attention due to its low cost, low power consumption, and widespread availability. One common approach is to utilize the received signal strength (RSS) of Bluetooth Low Energy (BLE) beacons for distance estimation and apply trilateration algorithms to estimate the target device's position. An alternative strategy uses the angle of arrival (AoA) of BLE signals, calculating phase differences from synchronized antenna arrays to determine the direction and applying triangulation algorithms for position estimation.

The authors introduce ChatTracer, the first system that uses LLMs for localization, demonstrating superior performance compared to existing approaches.

Conclusion

The paper explores the integration of Large Language Models (LLMs) with Wireless Sensor Networks (WSNs) to expand LLMs' knowledge of the physical world and transform human interaction with WSNs. It introduces ChatTracer, a real-time Bluetooth device tracking system powered by LLMs. ChatTracer consists of three main components: sniffing nodes, a database, and an LLM. The system's novelties include a reliable and efficient Bluetooth Low Energy (BLE) packet grouping algorithm and an LLM fine-tuning strategy combining Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). A prototype of ChatTracer was built and evaluated in three realistic scenarios, confirming its superior localization accuracy compared to existing approaches. The experiments also demonstrate a new way of integrating information and facilitating human interaction with WSNs.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

ChatTracer: Large Language Model Powered Real-time Bluetooth Device Tracking System

Qijun Wang, Shichen Zhang, Kunzhe Song, Huacheng Zeng

Large language models (LLMs) have transformed the way we interact with cyber technologies. In this paper, we study the possibility of connecting LLM with wireless sensor networks (WSN). A successful design will not only extend LLM's knowledge landscape to the physical world but also revolutionize human interaction with WSN. To the end, we present ChatTracer, an LLM-powered real-time Bluetooth device tracking system. ChatTracer comprises three key components: an array of Bluetooth sniffing nodes, a database, and a fine-tuned LLM. ChatTracer was designed based on our experimental observation that commercial Apple/Android devices always broadcast hundreds of BLE packets per minute even in their idle status. Its novelties lie in two aspects: i) a reliable and efficient BLE packet grouping algorithm; and ii) an LLM fine-tuning strategy that combines both supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). We have built a prototype of ChatTracer with four sniffing nodes. Experimental results show that ChatTracer not only outperforms existing localization approaches, but also provides an intelligent interface for user interaction.

7/10/2024

Large Language Models in Wireless Application Design: In-Context Learning-enhanced Automatic Network Intrusion Detection

Han Zhang, Akram Bin Sediq, Ali Afana, Melike Erol-Kantarci

Large language models (LLMs), especially generative pre-trained transformers (GPTs), have recently demonstrated outstanding ability in information comprehension and problem-solving. This has motivated many studies in applying LLMs to wireless communication networks. In this paper, we propose a pre-trained LLM-empowered framework to perform fully automatic network intrusion detection. Three in-context learning methods are designed and compared to enhance the performance of LLMs. With experiments on a real network intrusion detection dataset, in-context learning proves to be highly beneficial in improving the task processing performance in a way that no further training or fine-tuning of LLMs is required. We show that for GPT-4, testing accuracy and F1-Score can be improved by 90%. Moreover, pre-trained LLMs demonstrate big potential in performing wireless communication-related tasks. Specifically, the proposed framework can reach an accuracy and F1-Score of over 95% on different types of attacks with GPT-4 using only 10 in-context learning examples.

5/21/2024

💬

Large Language Models for Wearable Sensor-Based Human Activity Recognition, Health Monitoring, and Behavioral Modeling: A Survey of Early Trends, Datasets, and Challenges

Emilio Ferrara

The proliferation of wearable technology enables the generation of vast amounts of sensor data, offering significant opportunities for advancements in health monitoring, activity recognition, and personalized medicine. However, the complexity and volume of this data present substantial challenges in data modeling and analysis, which have been tamed with approaches spanning time series modeling to deep learning techniques. The latest frontier in this domain is the adoption of Large Language Models (LLMs), such as GPT-4 and Llama, for data analysis, modeling, understanding, and generation of human behavior through the lens of wearable sensor data. This survey explores current trends and challenges in applying LLMs for sensor-based human activity recognition and behavior modeling. We discuss the nature of wearable sensors data, the capabilities and limitations of LLMs to model them and their integration with traditional machine learning techniques. We also identify key challenges, including data quality, computational requirements, interpretability, and privacy concerns. By examining case studies and successful applications, we highlight the potential of LLMs in enhancing the analysis and interpretation of wearable sensors data. Finally, we propose future directions for research, emphasizing the need for improved preprocessing techniques, more efficient and scalable models, and interdisciplinary collaboration. This survey aims to provide a comprehensive overview of the intersection between wearable sensors data and LLMs, offering insights into the current state and future prospects of this emerging field.

8/2/2024

LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces

Xiaomin Ouyang, Mani Srivastava

Most studies on machine learning in sensing systems focus on low-level perception tasks that process raw sensory data within a short time window. However, many practical applications, such as human routine modeling and occupancy tracking, require high-level reasoning abilities to comprehend concepts and make inferences based on long-term sensor traces. Existing machine learning-based approaches for handling such complex tasks struggle to generalize due to the limited training samples and the high dimensionality of sensor traces, necessitating the integration of human knowledge for designing first-principle models or logic reasoning methods. We pose a fundamental question: Can we harness the reasoning capabilities and world knowledge of Large Language Models (LLMs) to recognize complex events from long-term spatiotemporal sensor traces? To answer this question, we design an effective prompting framework for LLMs on high-level reasoning tasks, which can handle traces from the raw sensor data as well as the low-level perception results. We also design two strategies to enhance performance with long sensor traces, including summarization before reasoning and selective inclusion of historical traces. Our framework can be implemented in an edge-cloud setup, running small LLMs on the edge for data summarization and performing high-level reasoning on the cloud for privacy preservation. The results show that LLMSense can achieve over 80% accuracy on two high-level reasoning tasks such as dementia diagnosis with behavior traces and occupancy tracking with environmental sensor traces. This paper provides a few insights and guidelines for leveraging LLM for high-level reasoning on sensor traces and highlights several directions for future work.

4/1/2024

ChatTracer: Large Language Model Powered Real-time Bluetooth Device Tracking System

Introduction

Understanding Bluetooth

ChatTracer: System Architecture​​​​​

BLE Packet Grouping

Fine-Tuning LLM for ChatTracer

Implementation

Performance Evaluation

Related Work

Conclusion

Related Papers

ChatTracer: Large Language Model Powered Real-time Bluetooth Device Tracking System

Large Language Models in Wireless Application Design: In-Context Learning-enhanced Automatic Network Intrusion Detection

Large Language Models for Wearable Sensor-Based Human Activity Recognition, Health Monitoring, and Behavioral Modeling: A Survey of Early Trends, Datasets, and Challenges

LLMSense: Harnessing LLMs for High-level Reasoning Over Spatiotemporal Sensor Traces

ChatTracer: System Architecture