1 Introduction

In recent years, deep learning language models trained on large amounts of centralized data have achieved impressive performance on various NLP downstream tasks (e.g., medical case analysis [1], sentiment analysis [2], next-word prediction in mobile keyboards [3]). This success is mainly due to advanced machine learning techniques and large-scale data collection. For instance, Zeng et al. trained a medical language model on the Chinese MedDialog (MedDialog-CN) dataset, which contains 3.4 million real-world medical consultation records [4]. The medical language model can release encoded vector-form embeddings of symptom descriptions for developing intelligent medical systems. However, deep learning language models efficiently capture critical text information from the training dataset and retain sensitive information such as diseases in the vector-form embeddings, resulting in potential privacy disclosure risks. Pan et al. [5] proposed a simple exploratory analysis task to recover sensitive text information from a language model's text representation. Their experimental results demonstrate that an adversary with nearly zero domain knowledge can infer sensitive plain text information from unprotected text embeddings (see Fig. 1). Once an attacker can access the vector-form embeddings encoded by the medical language model, a massive privacy breach of patients may follow, and the patients would suffer direct and indirect damage to their reputation and property.

Fig. 1

Overview of the regular language model training and publishing process with sensitive text. The attacker can obtain the sensitive plain text from the unprotected text embedding

Several approaches have been proposed to provide privacy guarantees for language models and prevent privacy leakage. Differential privacy is the most commonly used method to protect the privacy of users' text data. An effective way to reduce the memorization of training data is to apply differentially private training techniques [6]. Shokri et al. [7] first trained differentially private deep models and used the Stochastic Gradient Descent (SGD) algorithm to optimize the model with formal differential privacy (i.e., DP-SGD). McMahan et al. [8] applied DP-SGD to train a Long Short-Term Memory (LSTM) language model and achieve a user-level privacy guarantee.

Recently, sizeable pre-trained language models such as the Bidirectional Encoder Representations from Transformers (BERT) [9] have provided state-of-the-art performance. However, BERT is too large to deploy on mobile devices. Sanh et al. [10] proposed a lightweight BERT language model pre-trained with knowledge distillation, called DistillBERT, which has fewer parameters and faster inference. Unfortunately, applying differentially private mechanisms to this pre-trained language model leads to a significant decline in prediction accuracy [11] because of the DistillBERT model's normalization layer.

Another way to tackle privacy concerns is sensitive data desensitization based on a generative model. Generative models such as Generative Adversarial Networks (GANs) [12] can learn the distribution of a realistic dataset and generate new synthetic data. Recently, there have been several studies on differentially private GANs [13]. Differentially private GANs can synthesize high-quality simulated data as a substitute for the original sensitive image data [14, 15]. Zhang et al. [16] proposed a differentially private generative adversarial network (DP-SeqGAN) that can generate new text data similar to the original data distribution. However, GAN-based models cannot be trained stably and effectively on text data. Moreover, such models consume large privacy budgets even under a small noise scale, so their privacy-preserving capability is questionable.

To generate synthetic, high-utility text data with privacy preservation, we propose the Differentially Private Recurrent Variational AutoEncoder (DP-RVAE), a text generation model based on the variational autoencoder [17], which is more suitable for text generation than GAN-based models. Our model can reconstruct the sensitive text and generate desensitized text data for language model training. DP-RVAE can be trained stably and efficiently while achieving a better privacy-accuracy trade-off.

Previous works try to protect privacy in a centralized setting. However, participants may be unwilling to transfer their data to a central server due to data privacy policies [18] and concerns about data abuse. Consequently, the central server cannot directly collect data from individual clients to train a well-performing language model. To address this problem, Federated Learning (FL) is a promising approach that utilizes data across multiple devices to train a deep learning model and achieve data-driven Deep Learning (DL) solutions [19]. A language model such as DistillBERT can be jointly trained on the individual client side with the federated learning paradigm while keeping personal data local and private from other users or a central server. Despite this property of FL, there is still a risk of data leakage: an attacker can recover data from the updated model parameters [20]. Furthermore, a model trained on a distributed dataset in the federated learning setting suffers a significant accuracy degradation compared to one trained on a centralized dataset [21]. To solve these problems, we extend DP-RVAE to the federated learning setting [22]; our framework allows DP-RVAE to be trained on each client and produce privacy-preserving synthetic text for NLP downstream tasks. The server can directly collect the synthetic text data produced by DP-RVAE from each client, and a language model such as DistillBERT can be trained more efficiently on the centralized synthetic text data than with typical NLP federated learning.

In summary, our main contributions are as follows:

  • We propose the Differentially Private Recurrent Variational AutoEncoder (DP-RVAE), a model that generates high-utility surrogate text data with a DP guarantee.

  • To improve the utility of the synthetic text data, we introduce the original text as the conditional input of the decoder module while applying noise perturbation and word dropout to preserve privacy.

  • We extend DP-RVAE to the federated learning scenario. DP-RVAE is deployed on the client side to learn individual features across multiple clients and generates high-utility synthetic text to train the language model on the central server while providing strong differential privacy guarantees for each client.

  • We evaluate the proposed DP-RVAE on the text classification task, achieving average test accuracy 5.90% and 3.94% higher than the typical approach in the centralized and federated learning scenarios, respectively, across various privacy budgets. We also evaluate the defense capability of DP-RVAE against keyword inference attacks. DP-RVAE lowers the average attack accuracy by 15.2% compared with the typical differentially private approach, and the attacker can barely infer sensitive information from the sentence embeddings.

The remainder of this paper is organized as follows. Section 2 reviews related work on generative models, differentially private approaches in the NLP field, and federated learning. Section 3 reviews the basics of differential privacy and the recurrent variational autoencoder model. Section 4 describes the details of the proposed DP-RVAE model and its extension to federated learning. Section 5 presents the utility and evaluation experiments for DP-RVAE and its federated learning extension. Conclusions and future work are given in Section 6.

2 Related Work

The natural language processing (NLP) field uses machines to analyze human language text data. NLP downstream tasks have improved significantly with pre-trained language models such as BERT, and many specific domains have begun to use such models in their tasks [23]. However, datasets from particular fields, such as medical records [24], are quite sensitive and private, which raises privacy concerns about collecting and sharing data to train the model. Recent works apply differential privacy mechanisms to prevent deep learning models from disclosing sensitive information in the training dataset [7]. Moreover, in a distributed scenario, FL methods enable many individual clients to train their models jointly while keeping their local data decentralized and private from a central server [25].

2.1 Data Generative Models

Generative models can learn a joint distribution and generate synthetic data. Generative models built on deep neural networks can approximate a likelihood function and take various forms [26, 27] for modelling high-dimensional data such as text, audio, and images. The Variational AutoEncoder (VAE) [27] is a variant of the regularized autoencoder and one of these generative models. The VAE architecture generally involves a deep latent-variable model and an inference model, allowing for efficient latent-variable inference and synthesis. The latent-variable model is a generative model over the dataset. The inference model, called the encoder, approximates the posterior distribution of the generative model's latent variables. The main difference from the original autoencoder lies in the encoder: in the original autoencoder, the encoder compresses the input data into a low-dimensional representation, whereas the encoder inside the VAE maps the input data to the parameters of a Gaussian probability density [28].

Dai et al. [29] proposed an innovative encoder-decoder architecture called the sequence-to-sequence model, which uses RNNs as the backbones of the encoder and decoder and has been successful in supervised NLP downstream tasks. Bowman et al. [30] adapted the sequence-to-sequence architecture and combined it with the variational autoencoder for text generation, proposing the Recurrent Variational AutoEncoder (RVAE) model, which can reconstruct an actual sentence and generate synthetic text. Subsequent studies [31,32,33] improved the RVAE and achieved better performance.

A related generative model, the Generative Adversarial Network (GAN) [26], has mainly been applied in the computer vision field [34]. A GAN is composed of a generator and a discriminator that play a zero-sum game (a concept from game theory) during the training phase. Through this adversarial process, the generator and discriminator can converge and achieve acceptable accuracy. However, training stability and time consumption remain significant challenges. Liu et al. [35] proposed an extended GAN architecture based on a self-conditioned method to stabilize GAN training. Zhao et al. [36] utilized a differentiable augmentation approach to enhance the efficiency of GAN training. Unfortunately, these attempts focus on the computer vision domain. In the NLP field, only a few studies [37, 38] address text generation, and these GAN-based models still cannot work well over an essentially discontinuous space. The stability and training-cost problems of GAN-based text generation have no practical solutions.

2.2 Natural Language Processing with Differential Privacy

Deep learning techniques can represent text data well in natural language processing (NLP). In many NLP tasks, the input text data is first encoded into a dense vector by a language model and then applied to NLP downstream tasks such as sentiment analysis and medical case analysis. Devlin et al. [9] showed that the text representation can significantly improve the downstream tasks' accuracy. In many works, researchers pre-train a language model on their private dataset to learn a text representation and publish it for a broad set of NLP tasks such as text classification.

However, text representations still pose a privacy leakage risk. Several experiments [39, 40] showed that users' private information can be easily extracted from the text representations produced by a language model via membership inference attacks and model inversion attacks. Preotiuc-Pietro et al. [41] showed that an author's information can be predicted from linguistic cues in the text. Furthermore, Pan et al. [5] used a model inversion attack to reconstruct sensitive information from the text representation. Carlini et al. [42] demonstrated that it is possible to attack widely used language models such as BERT through training data extraction techniques. Carlini et al. [43] pointed out that the privacy leakage from language models is due to the strong memorization ability of neural networks; more concretely, the text representation retains too much information from the training dataset.

To tackle the privacy concerns above, several studies trained differentially private models [7, 44] to provide a strong privacy guarantee. Feyisetan et al. [45] injected Laplacian noise into the word embedding vectors. McMahan et al. [8] applied differential privacy to model training for the next-word prediction task on user-adjacent datasets. From another perspective, training models via adversarial learning can also enhance the robustness and privacy of neural representations in language models [46, 47]. However, although the methods above can preserve private information in the text data, they decrease the accuracy of the downstream NLP task. Basu et al. [11] directly applied gradient noise to provide differential privacy protection for a recent pre-trained language model such as DistillBERT and observed that the accuracy decreases drastically.

Desensitizing the dataset is also a promising way to provide a privacy guarantee for sensitive datasets. Phan et al. [48] used an autoencoder model and introduced noise into the objective function to encode the raw dataset into dense representation vectors and publish the noised representation vectors for analysis tasks. In the same way, Chen et al. [49] perturbed the gradients while training a variational autoencoder to achieve a more robust guarantee for the training dataset.

Generating simulated data with a distribution similar to the original data is another way to desensitize a sensitive dataset [15, 50]. Chen et al. [51] trained a differentially private GAN model on the sensitive dataset, which can generate synthetic data without sensitive information to replace the realistic training dataset for a classification model. However, GAN-based models cannot be trained stably with gradient perturbation and have no practical results on discrete data such as text.

2.3 Federated Learning Framework

With the development of decentralized mobile devices and servers, privacy and security on mobile devices [52] have received much attention in both research and practice [53]. Federated learning is a promising privacy-preserving approach [54] that allows individual clients to collaboratively train models and share gradients to update the global model while keeping their data local. The objective of federated learning is to aggregate the local models from the clients and optimize the global model. Several federated learning algorithms have been proposed to optimize the global model. Federated Averaging (FedAvg) is a standard algorithm that divides the training phase into rounds and updates the global model [22]. Federated proximal (FedProx) adds a proximal term to the FedAvg algorithm and improves convergence stability [55]. Federated Optimization (FedOPT) is a generalized version of FedAvg and can converge more efficiently in fewer rounds [56]. In our work, we use the FedOPT algorithm to update the global model in the federated learning setting. The federated learning framework thus has a promising future for addressing the privacy risk of data collection [57], especially in the NLP field, where users are unwilling to upload sensitive text to a third-party central server.

However, there is still limited research on federated learning for NLP. Sui et al. [58] processed medical text data via federated learning for an extraction task. Liu et al. [54] used a pre-trained BERT model within a federated learning framework to analyze medical notes collected from multiple silos. Lin et al. [21] applied the BERT model to NLP downstream tasks in the federated learning setting; compared to the centralized setup, there is a considerable accuracy gap under the same dataset and task. In short, prior NLP works in federated learning mainly focus on their specific tasks and neglect the privacy threats from membership inference attacks [59] and reconstruction attacks [60].

3 Preliminaries

3.1 Differential Privacy

Differential privacy (DP) can provide strong privacy guarantees for sensitive data analysis. With differential privacy preservation, attackers can hardly recover the information in the dataset. Consider a randomized mechanism \({\mathscr{M}}\) with output range \(\mathcal {R}\). The mechanism \({\mathscr{M}}\) is (𝜖,δ)-DP [61] if it satisfies the following inequality, which is the formal definition of differential privacy.

$$ \operatorname{Pr}[\mathcal{M}(S) \in \mathcal{O}] \leq e^{\epsilon} \operatorname{Pr}\left[\mathcal{M}\left( S^{\prime}\right) \in \mathcal{O}\right]+\delta $$
(1)

For any two adjacent datasets S and \(S^{\prime }\) that differ by only one sample, the inequality holds for any subset of outputs \(\mathcal {O} \subseteq \mathcal {R}\). \(\operatorname {Pr}[{\mathscr{M}}(S) \in \mathcal {O}]\) is the probability that the algorithm produces an output in \(\mathcal {O}\). In our case, the recurrent variational autoencoder corresponds to the mechanism \({\mathscr{M}}\). The parameter 𝜖 is the privacy budget, an upper bound on the privacy loss. The parameter δ is the failure probability of the differential privacy mechanism \({\mathscr{M}}\). Smaller 𝜖 and δ give stronger privacy guarantees.

The typical differential privacy method for neural networks is to inject noise into the gradients during the training phase. The privacy budget is a metric that estimates the privacy preservation level of the deep learning model. The Renyi Differential Privacy (RDP) [62] accounting mechanism provides a tighter estimate of the privacy budget consumed when training a deep learning model. For any two adjacent datasets S and \(S^{\prime }\), a randomized mechanism \({\mathscr{M}}\) is (α,𝜖)-RDP if it satisfies the following equation.

$$ D_{\alpha}\left( \mathcal{M}(S) \| \mathcal{M}\left( S^{\prime}\right)\right) \leq \epsilon $$
(2)

where \(D_{\alpha }\left ({\mathscr{M}}(S) \| {\mathscr{M}}\left (S^{\prime }\right )\right )\) is the Renyi divergence, defined as follows, and the parameter α > 1 is the order of the RDP guarantee.

$$ D_{\alpha}(\mathcal{M}(S) \| \mathcal{M}(S^{\prime})) \triangleq \frac{1}{\alpha-1} \log E_{x \sim \mathcal{M}(S^{\prime})}\left( \frac{\mathcal{M}(S)}{\mathcal{M}(S^{\prime})}\right)^{\alpha} $$
(3)

The RDP provides the more convenient composition and post-processing properties to account for the privacy budget over a sequence of differentially private mechanisms.

Theorem 1 (Composition)

For a sequence of k mechanisms \({\mathscr{M}}_{1}, {\mathscr{M}}_{2}, \ldots , {\mathscr{M}}_{k}\), where each mechanism \({\mathscr{M}}_{i}\) satisfies (α,𝜖i)-RDP, the composition of \({\mathscr{M}}_{1}, {\mathscr{M}}_{2}, \ldots , {\mathscr{M}}_{k}\) satisfies \((\alpha , {\sum }_{i=1}^{k}\epsilon _{i})\text {-RDP}\).

Theorem 2 (Post-processing)

If a randomized mechanism \({\mathscr{M}}\) satisfies the (α,𝜖)-RDP, for any subsequent function F of mechanism \({\mathscr{M}}\) will satisfy the (α,𝜖)-RDP.
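
To make the accounting concrete, the following minimal sketch composes the per-step RDP of a Gaussian mechanism (Theorem 1) and converts the result to an (𝜖, δ)-DP guarantee with the standard conversion 𝜖 = 𝜖RDP + log(1/δ)/(α − 1). The per-step RDP formula α/(2σ²) assumes unit sensitivity and omits the subsampling amplification used by a full RDP accountant; the function names and numbers are illustrative, not part of our implementation.

```python
import numpy as np

# Hedged sketch: Renyi-DP accounting for T Gaussian mechanisms of unit
# sensitivity with noise multiplier sigma. Theorem 1 composes RDP additively
# over steps; the conversion eps = rdp + log(1/delta)/(alpha - 1) gives
# (eps, delta)-DP. Subsampling amplification is omitted here.

def gaussian_rdp(alpha: float, sigma: float) -> float:
    # RDP of one Gaussian mechanism (sensitivity 1) at order alpha.
    return alpha / (2.0 * sigma ** 2)

def compose_and_convert(steps: int, sigma: float, delta: float,
                        orders=np.arange(2, 64)) -> float:
    # Theorem 1 (composition): sum the per-step RDP at each order.
    rdp = np.array([steps * gaussian_rdp(a, sigma) for a in orders])
    # Convert each order to an (eps, delta)-DP guarantee and keep the tightest.
    eps = rdp + np.log(1.0 / delta) / (orders - 1)
    return float(eps.min())

# Example: 1,000 noisy steps with sigma = 1.0 and delta = 1e-5.
print(compose_and_convert(steps=1000, sigma=1.0, delta=1e-5))
```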

3.2 Recurrent Variational Autoencoder

The recurrent variational autoencoder can efficiently approximate inference in directed probabilistic models. Given an observed text dataset X = {x(1),x(2),…,x(N)}, each text sample x(i) is a sequence of words and can be denoted by \(\boldsymbol {x}^{(i)}=\left \{x_{1}, x_{2}, \ldots , x_{L}\right \}\), where L is the number of words in the text. The goal of the model is to estimate the parameters 𝜃 by maximizing the marginal log-likelihood.

$$ \log p_{\theta}(\boldsymbol{X})=\sum\limits_{n=1}^{N} \log {\int}_{\boldsymbol{z}} p(\boldsymbol{z}) p_{\theta}\left( \boldsymbol{x}^{(n)} \mid \boldsymbol{z}\right) \mathrm{d} \boldsymbol{z} $$
(4)

\(p(\boldsymbol {z})\) is the prior distribution of the latent variable z, where z is sampled from a multivariate diagonal Gaussian distribution. Because of the integral inside the marginal log-likelihood, the equation is intractable and we cannot directly use the gradient descent method to optimize the parameters 𝜃. We therefore optimize the evidence lower bound (ELBO) of the marginal log-likelihood, obtained by introducing an approximate posterior distribution qϕ(z∣x) of the true posterior p𝜃(z∣x).

$$ \begin{array}{@{}rcl@{}} \log p_{\theta}(\boldsymbol{x}) &\geq & \mathbb{E}_{q_{\phi}(\boldsymbol{z}\mid \boldsymbol{x})}\left[\log p_{\theta}(\boldsymbol{x} \mid \boldsymbol{z})\right] \\ &&-\mathcal{D}_{KL}\left( q_{\phi}(\boldsymbol{z} \mid \boldsymbol{x})\|p(\boldsymbol{z})\right) \end{array} $$
(5)

We use an encoder consisting of a single-layer LSTM combined with two fully-connected layers to predict the posterior distribution \(q_{\phi }\left (\boldsymbol {z}\mid \boldsymbol {x}\right )\). More concretely, the posterior distribution \(q_{\phi }\left (\boldsymbol {z}\mid \boldsymbol {x}\right )\) is assumed to be a multivariate diagonal Gaussian distribution.

$$ q_{\phi}(\boldsymbol{z}\mid\boldsymbol{x})=\mathcal{N}\left( \boldsymbol{z}; \mu_{\phi}(\boldsymbol{h}), \sigma_{\phi}(\boldsymbol{h})\right) $$
(6)

The functions μϕ and σϕ are both linear layers that predict the mean and variance of the multivariate diagonal Gaussian distribution from the hidden state vector h, which is the final state output of the LSTM encoder mapping the sequential text input \(\boldsymbol {x}=\left \{x_{1}, x_{2}, \ldots , x_{L}\right \}\).

The operation of sampling the latent variable z from \(q_{\phi }\left (\boldsymbol {z}\mid \boldsymbol {x}\right )\) is not differentiable, so the gradient cannot be computed and back-propagated through it. We therefore apply the reparameterization trick and rewrite the sampling operation as \(\boldsymbol {z}=\mu _{\phi }(\boldsymbol {x})+ \beta {\Sigma }_{\phi }^{\frac {1}{2}}(\boldsymbol {x})\), where β is sampled from \(\mathcal {N}(0, \boldsymbol {I})\).

The decoder module is also an LSTM layer; it takes the latent variable z as its initial hidden state, consumes the text sequence input, and generates a new text sample, thereby modeling the distribution p𝜃(x∣z) conditioned on the latent variable z. The model can be trained with stochastic gradient descent by maximizing the following objective (equivalently, minimizing its negative as the loss), where N is the batch size.

$$ \begin{aligned} \operatorname{Loss}(\boldsymbol{X} ; \phi, \theta)=& \sum\limits_{i=1}^{N} \mathbb{E}_{q_{\phi}\left( \boldsymbol{z}\mid\boldsymbol{x}^{(i)}\right)}\left[\log p_{\theta}\left( \boldsymbol{x}^{(i)} \mid \boldsymbol{z}\right)\right] \\ &-\sum\limits_{i=1}^{N} \mathcal{D}_{KL}\left( q_{\phi}\left( \boldsymbol{z}\mid\boldsymbol{x}^{(i)}\right) \| p(\boldsymbol{z})\right) \end{aligned} $$
(7)

The first term of the equation encourages the model to reconstruct the original text input. The second term is the Kullback-Leibler (KL) divergence, which measures the similarity between the approximate posterior qϕ(z∣x) and the prior p(z).
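
To make the architecture concrete, the following minimal PyTorch sketch implements the pieces described above: an LSTM encoder with two linear heads for the mean and (log-)variance of qϕ(z∣x), the reparameterized sampling of z, an LSTM decoder initialized from z, and the negative ELBO of Eq. (7). Layer sizes, module names, and the conditional decoder input are illustrative assumptions, not the authors' exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Minimal sketch of the recurrent variational autoencoder described above
# (Eqs. 5-7); dimensions and names are assumptions.

class RVAE(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256, z_dim=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hid_dim, batch_first=True)   # q_phi(z | x)
        self.mu = nn.Linear(hid_dim, z_dim)                          # mean head
        self.logvar = nn.Linear(hid_dim, z_dim)                      # log-variance head
        self.z_to_h = nn.Linear(z_dim, hid_dim)                      # z -> decoder initial state
        self.decoder = nn.LSTM(emb_dim, hid_dim, batch_first=True)   # p_theta(x | z)
        self.out = nn.Linear(hid_dim, vocab_size)

    def forward(self, x, cond):
        # x, cond: (batch, seq_len) integer token ids; cond is the decoder's
        # conditional input (the original or masked text, see Section 4.3).
        _, (h, _) = self.encoder(self.embed(x))     # final hidden state h
        mu, logvar = self.mu(h[-1]), self.logvar(h[-1])
        std = torch.exp(0.5 * logvar)
        z = mu + std * torch.randn_like(std)        # reparameterization trick
        h0 = torch.tanh(self.z_to_h(z)).unsqueeze(0)
        dec, _ = self.decoder(self.embed(cond), (h0, torch.zeros_like(h0)))
        return self.out(dec), mu, logvar            # logits: (batch, seq_len, vocab)

def neg_elbo(logits, targets, mu, logvar):
    # Negative ELBO of Eq. (7): reconstruction loss + KL(q_phi(z|x) || N(0, I)).
    rec = F.cross_entropy(logits.transpose(1, 2), targets, reduction="mean")
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return rec + kl
```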

4 Proposed Method

This section describes how to introduce differential privacy mechanisms into the recurrent variational autoencoder so that it generates high-utility desensitized text data that is differentially private and from which attackers can hardly infer the original information. In addition, we apply our method to protect personal text data privacy in both centralized and federated learning scenarios.

4.1 Sensitive Text Privacy Preservation Through DP-RVAE

The proposed approach, DP-RVAE, takes the sensitive text data as input and reconstructs it into desensitized text data to protect the sensitive data. The synthetic text data can be used to train the language model while maintaining high utility (Fig. 2). To make federated learning suitable for mobile device environments, we use DistillBERT [10], a smaller and faster version of BERT, as the language model in our case.

Fig. 2

Overview of proposed differentially private synthetic text data generated with the DP-RVAE approach

We train the DP-RVAE generative model on a sensitive dataset D using the differentially private algorithm DP-SGD [7]. DP-SGD injects noise into the stochastic gradients during training to keep the user-level data private, so we can provide a strong guarantee for personal sensitive text data via DP-RVAE. The text generated by DP-RVAE can then be used to train the DistillBERT language model for downstream prediction tasks.

Our model is based on the recurrent variational autoencoder. It mainly involves two modules, an encoder and a decoder, both single-layer LSTMs, to adapt the original autoencoder to text data.

4.2 Differential Privacy with RVAE

In this section, we introduce the details of the differential privacy mechanisms in DP-RVAE (see Fig. 3). We train with the loss function of the recurrent variational autoencoder (Eq. 7) over a mini-batch of N samples X = {x1,x2,…,xN}, where each sample xi is a sentence containing sequential word tokens represented by an integer vector. We pass the word tokens xi into the word embedding module to get a floating-point embedding vector \(\boldsymbol {e}_{\boldsymbol {x}_{\boldsymbol {i}}}\). We use the stochastic gradient descent algorithm to update the model parameters ϕ and 𝜃, where ϕ is the parameter of the encoder module and 𝜃 is the parameter of the decoder module. Suppose a mini-batch of N latent variables Z = {z1,z2,…,zN} is sampled from the encoder \(q_{\phi }\left (\boldsymbol {z} \mid \boldsymbol {e}_{\boldsymbol {x}}\right )\). We then compute the average gradient ηd of the decoder for the batch.

$$ \operatorname{\boldsymbol{\eta}_{d}} = \frac{1}{N} \sum\limits_{i=1}^{N} \nabla_{\theta} \log p_{\theta}\left( \boldsymbol{e}_{\boldsymbol{x}_{\boldsymbol{i}}} \mid \boldsymbol{z}_{i}\right) $$
(8)
Fig. 3

The DP-RVAE architecture

Then we compute the average gradient ηe of the encoder for a batch in the same way.

$$ \operatorname{\boldsymbol{\eta}_{e}} = \frac{1}{N} \sum\limits_{i=1}^{N} \nabla_{\phi}\left[\log p_{\theta}\left( \boldsymbol{e}_{\boldsymbol{x}_{\boldsymbol{i}}} \mid \boldsymbol{z}_{i}\right)-\text{KL}\left[q_{\phi}\left( \boldsymbol{z}_{i} \mid \boldsymbol{e}_{\boldsymbol{x}_{\boldsymbol{i}}}\right) \| p(\boldsymbol{z}_{i})\right]\right] $$
(9)

In our work, we only perturb the gradients of the encoder module, which achieves better results while still ensuring security. First, we clip the gradient of the encoder module with the gradient norm bound parameter C.

$$ \operatorname{\boldsymbol{\eta}_{e}}=\operatorname{Clip}\left( C, \operatorname{\boldsymbol{\eta}_{e}}\right) $$
(10)

Secondly, we add Laplace noise to the gradient of the encoder module to make it differentially private, where σs is the noise scale.

$$ \operatorname{\boldsymbol{\eta}_{e}}=\operatorname{\boldsymbol{\eta}_{e}}+\operatorname{Laplace}\left( 0, {\sigma_{s}^{2}} C^{2} \boldsymbol{I}\right) $$
(11)

Finally, we apply the stochastic gradient descent algorithm to optimize all model parameters. Note that the model's training is differentially private because of the post-processing theorem of differential privacy. To keep the model differentially private, we use the <unk> token as the conditional text input of the decoder module, which preserves the sensitive input text while still generating personalized text.
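
A minimal sketch of the encoder-side perturbation in Eqs. (10)-(11) is shown below: the batch-averaged encoder gradient is clipped to L2 norm C and then perturbed with per-coordinate Laplace noise. Reading Laplace(0, σs²C²I) as Laplace noise of scale σs·C is our assumption, and the function and parameter names are illustrative.

```python
import torch

# Hedged sketch of the encoder gradient clipping and Laplace perturbation
# (Eqs. 10-11); the scale sigma_s * C is an assumed parameterization.

def perturb_encoder_gradients(encoder_params, C=1.0, sigma_s=0.5):
    grads = [p.grad for p in encoder_params if p.grad is not None]
    # Eq. (10): rescale so that the global gradient norm is at most C.
    total_norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
    clip_coef = (C / (total_norm + 1e-12)).clamp(max=1.0)
    for g in grads:
        g.mul_(clip_coef)
        # Eq. (11): add Laplace noise to every coordinate of the clipped gradient.
        noise = torch.distributions.Laplace(0.0, sigma_s * C).sample(g.shape)
        g.add_(noise.to(g.device))

# Assumed usage inside a training step:
#   loss.backward()
#   perturb_encoder_gradients(model.encoder.parameters())
#   optimizer.step()
```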

4.3 Noise Perturbing and Word Dropout

In the previous section, the entire conditional text input of the decoder module consists of <unk> tokens, so the generated result may be random and ultimately change the meaning of the input text. We therefore use the original text, instead of all <unk> tokens, as the conditional input of the decoder module. To ensure the model training still satisfies differential privacy, we introduce word dropout and noise perturbation mechanisms to process the original text before it becomes the conditional input of the decoder module. Under the word dropout mechanism, each word in the original text has an equal probability of being replaced with the <unk> token. Furthermore, we add Laplace noise to the conditional input of the decoder module. The combination of word dropout and noise perturbation trades off utility and privacy and has been proved to be a formal differential privacy mechanism [63].

For a sensitive text input of the encoder module \(\boldsymbol {x}=\left \{x_{1}, x_{2}, \ldots , x_{L}\right \}\), we first use the word dropout method to randomly mask words in the text input. More concretely, we apply a mask vector \(\boldsymbol {I}_{mask}\in{\left \{0,1\right \}}^{L}\) with dropout probability ρ to the input text as x ⊙ Imask, where the positions of the zeros in Imask follow a uniform distribution. The word xi is replaced by the <unk> token if the element at the corresponding position of Imask equals 0. When the dropout rate ρ = 1, all words are replaced by the <unk> token, as described in the previous section. Moreover, we employ an embedding layer to represent the feature of each word, which can be denoted by:

$$ \boldsymbol{F}=\operatorname{Embedding}(\boldsymbol{x})=\left\{\boldsymbol{e}_{x_{1}},\ \boldsymbol{e}_{x_{2}},\ldots,\boldsymbol{e}_{x_{L}}\right\} $$
(12)

Then we inject Laplace-distributed noise into the word features F so that the mechanism satisfies the 𝜖-DP guarantee as follows.

$$ \boldsymbol{\hat{F}}=\operatorname{T}(\boldsymbol{F})=\boldsymbol{F}+\operatorname{Laplace}(\gamma) $$
(13)

where \(\gamma =\frac {\Delta f}{\epsilon }\) is the scale of the Laplace noise, Δf is the sensitivity of the differential privacy mechanism, and 𝜖 is the privacy budget. Here we bound the sensitivity of each element of the text by 1 (i.e., Δf = 1), and T(F) is differentially private.

The combination of the word dropout operation with the differentially private noise perturbation is still differentially private, and the privacy budget is reduced to:

$$ \epsilon=\ln \left[(1-\rho) e^{\frac{1}{\gamma}}+\rho\right] $$
(14)

According to the composition theorem of differential privacy, we can still train DP-RVAE while satisfying differential privacy. We denote the generated text Y for the input text x as:

$$ \boldsymbol{Y}=\text{DP-RVAE}\left( \boldsymbol{x}, \operatorname{T}\left( \boldsymbol{x} * \boldsymbol{I}_{\text {mask}}\right), \rho, \gamma\right) $$
(15)
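
The following minimal sketch illustrates the decoder conditional-input pipeline of Eqs. (12)-(15): words are randomly replaced with <unk> (word dropout), embedded, and perturbed with Laplace noise of scale γ; the effective privacy budget of Eq. (14) is also computed. Token ids, the embedding matrix, and the function names are illustrative assumptions.

```python
import math
import numpy as np

# Hedged sketch of word dropout plus Laplace noise on the decoder's
# conditional input (Eqs. 12-15); names and ids are illustrative.

def dp_conditional_input(token_ids, embedding_matrix, unk_id, rho=0.6, gamma=0.001):
    token_ids = np.asarray(token_ids)
    drop = np.random.rand(len(token_ids)) < rho                   # drop with prob. rho
    masked = np.where(drop, unk_id, token_ids)                    # replace with <unk>
    feats = embedding_matrix[masked]                              # Eq. (12): F = Embedding(x)
    noisy = feats + np.random.laplace(0.0, gamma, feats.shape)    # Eq. (13): T(F)
    return masked, noisy

def effective_budget(rho, gamma):
    # Eq. (14): privacy budget of the combined word-dropout + Laplace mechanism.
    return math.log((1.0 - rho) * math.exp(1.0 / gamma) + rho)
```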

4.4 DP-RVAE with Federated Learning

In the federated learning setup (Fig. 4), we deploy DP-RVAE to s clients. In the initial round, we use a public corpus to train the DP-RVAE on the central server and then broadcast it to each client. The language model DistillBERT is pre-trained. We initialize the classifier randomly and broadcast it to each client for prediction tasks.

Fig. 4

The DP-RVAE in the federated learning setting

In each round, the clients receive the latest global DP-RVAE model MGlobal and DistillBERT LM from the server. We train the DP-RVAE on the sensitive dataset Di on the client side. The local model Mi learns the features of individual text, and its parameters \({\widetilde {\theta }}_{i}\) are updated. DP-RVAE then generates a corresponding personalized synthetic dataset \({\widetilde {D}}_{i}\), and the local DistillBERT language model LM performs prediction on the generated synthetic dataset \({\widetilde {D}}_{i}\).

Each client uploads the updated parameters \({\widetilde {\theta }}_{i}\) and the synthetic dataset \({\widetilde {D}}_{i}\) to the central server. The server aggregates the updated parameters and synthetic datasets from all clients. For the global DP-RVAE, we use the FedOPT federated learning algorithm to update the model parameters, which accounts for the large amount of heterogeneous text data from different clients. The server first calculates the aggregated local model parameter change as \({\Delta }={{\sum }_{i}^{S}} p_{i} {\widetilde {\theta }}_{i} / {{\sum }_{i}^{S}} p_{i}\), where pi is the weight of client i. For simplicity, we assume all clients have the same weight. The global DP-RVAE MGlobal is then updated according to the aggregated parameter change Δ. The centralized synthetic dataset \(\widetilde {D}\) can be directly used to fine-tune the DistillBERT language model. With this paradigm, the language model can improve its performance and reduce the negative impact of the distributed dataset in the federated learning setting.
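
A minimal sketch of the server-side update is given below. It treats the weighted average of client parameter changes as a pseudo-gradient and applies a server learning rate, which is the usual FedOPT-style formulation; with equal weights and a server learning rate of 1 it reduces to plain parameter averaging. The dict layout, the weights pi, and the choice of a plain-SGD server optimizer are assumptions.

```python
import numpy as np

# Hedged sketch of a FedOPT-style server update for the global DP-RVAE.
# global_params: dict name -> np.ndarray; client_params: list of such dicts.

def fedopt_server_update(global_params, client_params, weights=None, server_lr=1.0):
    s = len(client_params)
    w = np.ones(s) / s if weights is None else np.asarray(weights, dtype=float)
    w = w / w.sum()
    updated = {}
    for name, g in global_params.items():
        # Aggregated change: Delta = sum_i p_i * (theta_tilde_i - theta_global).
        delta = sum(wi * (c[name] - g) for wi, c in zip(w, client_params))
        updated[name] = g + server_lr * delta    # server "gradient" step
    return updated
```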

5 Experiment and Security Analysis

In this section, we report the experimental results of DP-RVAE on two text classification datasets, Tweets Depression Sentiment [64] and IMDB Reviews [65]. We utilize Opacus for our experiments and analyze the trade-off between privacy and the utility of the generated text. First, we use the generated synthetic text data for downstream NLP tasks with the DistillBERT language model and compare its prediction accuracy to a benchmark differentially private DistillBERT model trained on the real-world datasets, evaluating utility while consuming the same privacy budget in the centralized setting. Second, we perform a keyword inference attack experiment to demonstrate the privacy-preserving capability of DP-RVAE. Finally, we test DP-RVAE with our proposed federated learning paradigm and compare it to typical NLP federated learning with DistillBERT on the same NLP tasks.

5.1 Implementation Details

We describe the exact model’s hyper-parameters settings and include all of the details of the datasets for implementing the mechanisms and models practically.

Tweets Depression Sentiment Dataset

The Tweets Depression Sentiment dataset [64] includes tweets scraped from Twitter for detecting depression tendency on the web; data cleaning was performed while scraping. There are 2,477 samples for training and 619 samples for testing.

IMDB Reviews Dataset

The IMDB Reviews dataset [65] contains 50,000 samples for binary sentiment classification, substantially more data than previous benchmark datasets, and each sample has an accurate sentiment annotation.

Medical Description Dataset

The medical description data comes from the CMS public healthcare records. Following the practice of Pan et al. [5], we pre-process the textual Healthcare Common Procedure Coding System (HCPCS) descriptions and use word matching to find the sentences that include the 10 keywords (e.g., head, hand, and face). We obtain a medical description dataset containing 200,000 sentences and use these sentences to pre-train the DistillBERT language model for the sentence embeddings.

Hyper-parameters Settings

For DP-RVAE, we set the noise scale γ = 0.001, the word dropout rate ρ = 0.6 in the encoder module, and the clip norm bound C = 1.0 in the decoder module. We use the DP-SGD optimizer for DP-RVAE with a learning rate of 0.05. For the differentially private benchmark model, we use the Adam optimizer with a learning rate of 0.0001. We use two fully-connected neural network layers as the classifier model and the cross-entropy loss as the text classification model's loss function.
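
As a minimal sketch of how DP training can be wired with Opacus (the library used in our experiments), the snippet below attaches a PrivacyEngine to a toy model, optimizer, and data loader and reads back the spent 𝜖 from its RDP-based accountant. Note that Opacus injects Gaussian noise into per-sample gradients, whereas Eq. (11) describes Laplace noise on the averaged encoder gradient; the toy model, data, and hyper-parameter values below are placeholders, not our exact configuration.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
from opacus import PrivacyEngine

# Hedged sketch of DP-SGD training with Opacus; toy model and data only.
model = nn.Linear(16, 2)                                  # stand-in for the DP-RVAE
optimizer = torch.optim.SGD(model.parameters(), lr=0.05)
data = TensorDataset(torch.randn(256, 16), torch.randint(0, 2, (256,)))
loader = DataLoader(data, batch_size=32)

privacy_engine = PrivacyEngine()
model, optimizer, loader = privacy_engine.make_private(
    module=model, optimizer=optimizer, data_loader=loader,
    noise_multiplier=1.0,     # noise scale
    max_grad_norm=1.0,        # gradient clipping bound C
)

for x, y in loader:           # one DP training epoch
    optimizer.zero_grad()
    loss = nn.functional.cross_entropy(model(x), y)
    loss.backward()
    optimizer.step()

print("epsilon spent:", privacy_engine.get_epsilon(delta=1e-5))
```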

5.2 The Utility Evaluation of the Synthetic Data

For the utility evaluation of the synthetic text dataset generated by our model, we first train DP-RVAE on the realistic dataset. Utility here means the performance of the classifier model on downstream tasks when trained on the synthetic data, and we use accuracy as the evaluation metric. To obtain the same labels as the actual dataset, we apply the word dropout and noise perturbation mechanism to the original text used as the additional input of the encoder module. Then we train the pre-trained language model with the classifier of the specific NLP task on the synthetic dataset. We track the privacy budget spent in our algorithm with the Renyi-DP accountant [62]. For each dataset, we use 80% of the data for training and the rest for testing, with a batch size of 32. As the benchmark, we choose the differentially private pre-trained DistillBERT language model with two fully-connected layers, a Tanh activation function, and a Sigmoid output for prediction, trained directly on the realistic dataset. We run the experiment with different target privacy budget values; a privacy budget of \(\epsilon =\infty \) means no privacy guarantee, and the failure probability is δ = 1 × 10− 5.

As shown in Table 1, our model performs better on both datasets under a low privacy budget. Compared with the DP-DistillBERT model, for example, our DP-RVAE reaches an average test accuracy of 54.10% versus 48.20% for DP-DistillBERT, an improvement of 5.90% under a highly tight privacy level (i.e., 𝜖 = 0.5). Without a privacy budget (i.e., \(\epsilon =\infty \)), our model's average test accuracy is 72.13% while the benchmark DP-DistillBERT reaches 77%, a reduction of only 4.87%. This indicates that the synthetic text data generated by our DP-RVAE has only a small gap with the actual data.

Table 1 Average test accuracy of models trained in a centralized setting

According to the experimental results, our model can generate high-utility synthetic text, and DistillBERT can still learn the feature information from the synthetic text data. More experimental details on the Tweets Depression Sentiment dataset are shown in Fig. 5: for a fixed and formal privacy level \(\left (\epsilon \le 7.5\right )\), our model consistently and significantly outperforms DP-DistillBERT. As the privacy budget increases, DP-DistillBERT gradually approaches DistillBERT trained on the original text data; consequently, it achieves higher accuracy than DP-RVAE but with almost no privacy guarantee.

Fig. 5

Test accuracy comparison of models for various privacy budgets 𝜖 on Tweets Depression Sentiment dataset

In addition, we find that as the privacy budget decreases, it becomes harder for the classifier to make the right decision, which indicates that the text features become more difficult for the language model to capture. From another perspective, this provides more robust privacy: even if attackers hijack the synthetic text, they still cannot obtain sensitive information.

The impact of the failure probability parameter δ on the average accuracy of DP-RVAE is shown in Fig. 6. The test accuracy with different values of δ is almost equal under the same privacy budget. We also observe that a larger δ can result in a more significant bias in test accuracy. According to the definition of differential privacy, a larger δ means more noise injection when the privacy budget 𝜖 is fixed. We therefore conclude that the noise scale can affect DP-RVAE's performance.

Fig. 6

Test accuracy comparison of failure probability δ for various privacy budgets 𝜖 on Tweets Depression Sentiment dataset with DP-RVAE

To demonstrate the effectiveness of DP-RVAE in generating high-utility sentences, we sample generated sentences under three privacy budgets: 𝜖 = 0.5, 𝜖 = 5, and \(\epsilon =\infty \) (i.e., without DP).

As shown in Table 2, as the privacy budget decreases, the generated text becomes more confusing and the <unk> token occurs more frequently. Despite that, we can still distinguish the sentiment of the generated text, which is similar to that of the original text. The generated text prevents the disclosure of sensitive information such as the "1st Birthday party" and "headache" states.

Table 2 A sample of generated text for various privacy budgets 𝜖 on Tweets Depression Sentiment dataset

5.3 Keyword Inference Attack Experiment

We evaluate the defence capability of DP-RVAE with the DistillBERT language model against the deep artificial neural network (DANN) based keyword inference attack proposed by Pan et al. [5] on the medical description dataset. We assume the attacker knows the ten exact sensitive keywords (e.g., head, hand, and face) and infers whether a target sentence embedding contains a specific keyword. We report the average attack accuracy over the ten sensitive keywords under different privacy budgets.

From Table 3, the average keyword inference attack accuracy against our DP-RVAE decreases by 15.2% compared to the typical differentially private approach on the Medical Description dataset. The keyword inference attack accuracy is only 8.63% when the privacy budget 𝜖 is set to 1, so the attacker can barely obtain the critical sensitive keyword information from the text generated by our DP-RVAE. Figure 7 shows the overall average accuracy of the keyword inference attack. DP-DistillBERT and DP-RVAE can weaken the keyword inference attack to random guessing when the privacy budget is below 1.5, in contrast to DistillBERT without privacy guarantees. The attack accuracy against our DP-RVAE is 53% even under a high privacy budget (i.e., 𝜖 = 2.5), which is 18% lower than against the DP-DistillBERT model.

Table 3 Average accuracy of keyword inference attack on the medical description dataset
Fig. 7

Keyword inference attack average accuracy comparison of models for various privacy budgets 𝜖 on Medical Description dataset

To demonstrate the defence capability of DP-RVAE, we show more detailed results of the keyword inference attack experiment for each keyword in Fig. 8. For DistillBERT, several specific keywords, such as the word hip, can be obtained by the attackers easily; for example, the attack accuracy for the word hip reaches around 86%. When the privacy budget is 1, the attack accuracy for the word hip decreases to 23%. We also observe that DP-RVAE provides better defence: the attack accuracy for each keyword ranges from 11% to 25%, so DP-RVAE achieves a more robust defence capability than the DP-DistillBERT model. When the privacy budget is 0.5, the attackers can barely infer any keyword from the sensitive sentence embedding; in particular, the attack accuracy for the word hip decreases significantly to 8.2%.

Fig. 8

Keyword inference attack accuracy for each keyword comparison of models on the medical description dataset in privacy budgets 𝜖 = 0.5,1,1.5

5.4 An Experiment in Federated Learning

In the federated learning setting, we use the same configuration for each client. We partition the dataset into N equal parts at random to simulate the experiment. To validate the effectiveness of our paradigm, we compare it to the typical federated learning with DistillBERT on the NLP task mentioned in [21]. We run the experiment in the cross-silo setting, and the same clients are selected in each round.

In more detail, we use 10 clients for the text classification task on the Tweets Depression Sentiment and IMDB Reviews datasets. In each round, the number of local epochs is set to 2. The DP-RVAE and DistillBERT hyper-parameter settings are the same as in the previous centralized experiment. We apply differential privacy locally and aggregate each client's model with the Federated Optimization (FedOPT) algorithm. We retain 80% of the dataset as training data and randomly split it into 10 equal parts; the remaining 20% of the dataset is used for testing on the server side. The model training phase stops when the privacy budget is exhausted.
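
For reference, a minimal sketch of this cross-silo data split is shown below: 80% of the samples are shuffled and divided equally among the 10 clients, and the remaining 20% are kept as the server-side test set. The function name and the fixed random seed are illustrative.

```python
import numpy as np

# Hedged sketch of the 80/20 split with 10 equal random client shards.
def split_for_clients(num_samples, num_clients=10, train_frac=0.8, seed=0):
    rng = np.random.default_rng(seed)
    idx = rng.permutation(num_samples)
    cut = int(train_frac * num_samples)
    client_shards = np.array_split(idx[:cut], num_clients)  # equal random shards
    test_idx = idx[cut:]                                     # held out on the server
    return client_shards, test_idx
```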

From the results presented in Table 4, our paradigm achieves higher accuracy than typical NLP federated learning under a lower privacy budget. Our federated learning paradigm with DP-RVAE improves the average test accuracy by 4.94% on the two datasets under a high privacy guarantee (i.e., 𝜖 = 0.5). We argue that our DP-RVAE can still generate high-quality, personalized synthetic text data for each client and can thereby simulate a realistic dataset. Furthermore, without a privacy guarantee (i.e., \(\epsilon =\infty \)), the test accuracy of our paradigm improves by 6.33% on the Tweets Depression Sentiment dataset and by 1.96% on the IMDB Reviews dataset compared to the typical federated learning framework.

Table 4 Average test accuracy of models trained in a federated learning setup

Additionally, our paradigm achieves higher accuracy under various privacy budgets (see Fig. 9) than the typical DP-DistillBERT with federated learning. We argue that the language model performs better when trained centrally, even on a simulated dataset, than when trained on a real dataset in the distributed setup.

Fig. 9

Test accuracy comparison of models for various privacy budgets 𝜖 on the Tweets Depression Sentiment dataset in the federated learning setting. The FL-DistillBERT is built with DistillBERT trained in the federated learning setting without DP guarantee

Moreover, we compare the test classification accuracy of the original federated learning with DP-DistillBERT and our DP-RVAE federated learning paradigm under various communication rounds R on the Tweets Depression Sentiment dataset. Notably, we fix the total privacy budget at 𝜖 = 30, and the number of local epochs is set to 1 in each communication round for the model training phase (see Fig. 10).

Fig. 10

Test accuracy comparison of models for various rounds R when exhausting a fixed privacy budget (𝜖 = 30) on the Tweets Depression Sentiment dataset in the federated learning setting

As a result, our DP-RVAE with DistillBERT in the federated learning paradigm begins to converge and achieves a test accuracy of 62% at round R = 3. Within the given privacy budget 𝜖 = 30, the original DP-DistillBERT federated learning approach exhausts the privacy budget at round R = 8, and its highest test accuracy is only 56%. From this comparison, we can conclude that our federated learning paradigm achieves better test accuracy under a given privacy budget. In other words, our federated learning paradigm consumes less privacy budget and fewer rounds for a more robust privacy guarantee while achieving better performance.

5.5 Analysis of DP-RVAE Model

5.5.1 Computation Complexity Analysis

Our DP-RVAE preserves text privacy for the NLP tasks by generating simulated data. The computation of DP-RVAE comprises the RVAE module, the Laplace noise injection, and the word dropout mechanism. The RVAE module is a standard sequence-to-sequence model with two single-layer LSTMs, so its cost is linear in the text sequence length. Hence, the time complexity of the RVAE module is O(m + t) per sample, where m is the input text length and t is the generated text length. The time complexity of the Laplace noise injection and the word dropout mechanism is small and proportional to the data size n. As a result, the total time complexity of DP-RVAE is O((m + t)n + n).

In comparison, the time complexity of DP-DistillBERT is O(m2dn + n), where m is the input length, d is the dimension of the hidden vectors, and n is the complexity of the noise injection operation, which is equal to the data size. Thus, training DP-RVAE takes less time than training DP-DistillBERT for one epoch. In the federated learning setting, training the DP-DistillBERT model with the typical federated learning method would place a heavy computing burden on the clients. On the contrary, our paradigm trains DP-RVAE on the client side and generates the synthetic text once, reducing the clients' computation costs.

As opposed to our federated learning paradigm, another mainstream privacy-preserving federated learning framework is based on the Secure Multi-party Aggregation (SMA) method proposed by Bonawitz et al. [66]. The updated DistillBERT model parameters from each client can be aggregated safely by the SMA. However, this method causes additional computational costs for the client device. More specifically, each client performs 2s key agreements and creates t-out-of-n Shamir secret shares, then generates s − 1 values for every other client for each entry in the input vector by stretching one pseudorandom generator (PRG) seed each. Consequently, each client's additional computational cost is O(s2 + sn), where s is the number of clients and n is the data size. Compared with our paradigm, this is an enormous computation cost.

5.5.2 Security Analysis

For the security analysis, we mainly consider two potential privacy attacks against the deep learning model: gradient leakage attacks and keyword inference attacks. First, we consider that an honest-but-curious client can act as a passive attacker to infer other clients' sensitive information from the gradients when the DP-RVAE parameters are aggregated in the federated learning setting. Assuming the attacker can access the updated DP-RVAE parameters or gradients, they can use gradient leakage attack methods to obtain the plain text, even though there is no practical attack of this kind on NLP federated learning yet and most existing research focuses on the computer vision domain.

In this case, differential privacy with gradient noise perturbation can effectively prevent the gradient leakage attack. A mechanism provides differentially private preservation if it satisfies the (𝜖,δ)-DP definition. Differential privacy, introduced in Section 3.1, is a strictly mathematical definition of data privacy that assumes the attacker has exhaustive background knowledge, and it can therefore prevent any such attack in theory. Following this, we perform a differential privacy analysis for the DP-RVAE model, which satisfies differential privacy and allows quantitative privacy analysis.

For the encoder module, we adopt the stochastic gradient descent training algorithm proposed by Abadi et al. [7] to implement differential privacy. Based on the RDP accounting mechanism, the encoder module is \(\left (\mathcal {O}(q \epsilon \sqrt {t})+\frac {\log (1 / \delta )}{\alpha -1}, \delta \right )\)-differentially private, where t is the number of training steps and q is the training data sampling probability. We employ the noise perturbation and word dropout mechanisms to make the decoder module 𝜖-differentially private, with a privacy budget of \(\epsilon =\ln [(1-\rho ) \exp (\frac {\Delta f}{\gamma })+\rho ]\). According to Theorem 1 (i.e., the composition theorem), the combination of the encoder and decoder modules still achieves a differential privacy guarantee for sensitive data. In conclusion, our DP-RVAE can prevent the gradient leakage attack in the federated learning setting.

Secondly, the keyword inference attack is another potential privacy attack, a variant of the model inversion attack. We suppose that the attacker has compromised the clients and can access the outputs of the DistillBERT language model when the victim client utilizes the trained DistillBERT to predict a specific NLP task. The attacker could then reconstruct the plain text from the text embedding and judge whether a sensitive keyword belongs to the original input text, which is extremely dangerous for language models. In our approach, DistillBERT uses the synthetic text generated by DP-RVAE to train and predict the downstream tasks. The generated text data is still differentially private according to Theorem 2 (i.e., the post-processing theorem), so the client can utilize the generated text to train the DistillBERT language model without privacy leakage. The attacker can barely infer the sensitive keyword from the text embedding even when DistillBERT is trained without privacy-preserving algorithms. In addition, we have experimented on DP-RVAE with the DistillBERT language model under the keyword inference attack [5] on the medical dataset to demonstrate its privacy security (see Section 5.3).

6 Conclusion and Future Work

This paper presents DP-RVAE, which generates simulated text for downstream task model training with a formal differential privacy guarantee. In addition, we propose a training paradigm based on DP-RVAE in federated learning. Our experimental results show that DP-RVAE can generate high-utility text data, and the language model can be trained on the synthetic text effectively. Even though each client has a limited dataset in the federated learning setting, the proposed training paradigm can obtain better accuracy on the NLP downstream tasks. Because of the absence of a real-world non-IID NLP dataset, we could not further evaluate the performance of our DP-RVAE on non-IID data in the federated learning setting. As future work, we plan to explore how to generate more customized text to improve the accuracy of the NLP tasks. We also intend to optimize DP-RVAE for lower computational complexity so that it can be better applied to mobile devices in the federated learning setting.