## Abstract

Lymphocyte-specific protein tyrosine kinase (LCK) is a key activator of T cells; however, little is known about the specific autoregulatory mechanisms that control its activity. We have constructed a model of LCK autophosphorylation and phosphorylation by the regulating kinase CSK. The model was fit to existing experimental data in the literature that presents an *in vitro* reconstituted membrane system, which provides more physiologically relevant kinetic measurements than traditional solution-based systems. The model is able to predict a robust mechanism of LCK autoregulation. It provides insights into the molecular causes of key site-specific phosphorylation differences between distinct experimental conditions. Probing the model also provides new hypotheses regarding the influence of individual binding and catalytic rates, which can be tested experimentally. This minimal model is required to elucidate the mechanistic interactions of LCK and CSK and can be further expanded to better understand T cell activation from a systems perspective. Our computational model enables the evaluation of LCK protein interactions that mediate T cell activation on a more quantitative level, providing new insights and testable hypotheses.

## Introduction

Lymphocyte-specific protein tyrosine kinase (LCK) is a key regulator of T cell activation and differentiation.6,38 LCK helps to activate healthy T cells against diseased cells in the body by phosphorylating immunotyrosine activating motifs (ITAMS) on the CD3ζ chain of the T cell receptor (TCR).25 Mutations in the LCK gene can lead to autoimmune disease14 and contribute to cancer.7 Recently, LCK has been shown to play an important and complex role in the activation of chimeric antigen receptor (CAR) engineered T cells.22 CARs are engineered proteins that contain a variety of T cell signaling domains linked to an extracellular antibody single chain variable fragment (scFv). These proteins can activate T cells against a tumor-associated antigen to eradicate cancer cells.22,37 As CARs are adapted and modified to more specifically target different types of cancer cells, understanding the detailed mechanisms that govern their activation has become more important. Despite its strong role in regulating T cell signaling, little is known about the specific mechanisms that control LCK catalytic activity.

LCK is a multi-domain protein that can catalyze the phosphorylation of many substrates in T cells, including itself. LCK has two main phosphorylation sites, the tyrosine residues Y394 and Y505. Y394 is located close to the kinase domain, and, therefore, has been shown to play a significant role in substrate specificity.23 Y505 is located near the C-terminal tail of the protein. When phosphorylated, this tail is thought to fold up and bind in *cis*, locking the molecule in a “closed” conformation.9 Therefore, it is commonly accepted that phosphorylation at Y394 (denoted as LCK species P_{394}U_{505}) increases the catalytic activity of LCK and phosphorylation at Y505 (species U_{394}P_{505}) decreases catalytic activity.44 It has been shown that the unphosphorylated and doubly phosphorylated forms of LCK (species U_{394}U_{505} and P_{394}P_{505}, respectively) retain an intermediate catalytic activity when acting on some substrates,15 although they may have more complex kinetics on others. These four forms of LCK distribute and aggregate differently within cells,34 and, while all four forms exist in resting T cells, efforts to calculate the exact ratios of the species have been inconclusive.2,30

Several proteins have been shown to control LCK phosphorylation. For example, C-terminal Src kinase (CSK) is a regulatory kinase that phosphorylates LCK specifically at Y505.41 In addition, several phosphatases act on LCK and CSK, most notably CD45 and PTPN22.45 It is commonly accepted that LCK can autophosphorylate at Y394,44 but it has only recently been appreciated that LCK can also autophosphorylate at Y505.15

The kinetics of these LCK phosphorylation and dephosphorylation reactions determine the pool of catalytically active LCK available to control T cell activation *in vivo*. Traditionally, the kinetics of these reactions are studied experimentally with recombinant proteins in solution3,33; however, inside the cell, LCK is largely bound to the plasma membrane, in a two-dimensional density distribution.18,46 This binding to the plasma membrane can profoundly influence a protein’s kinetics in several ways: (i) by altering the conformation of the protein, opening or closing available binding pockets, (ii) by changing the diffusion kinetics, which can alter the rate at which the enzyme encounters its substrate, and (iii) by altering the spatial segregation of certain groups of proteins in densely packed membranes, which may alter the ratio of active to inactive molecules in the system.4 Indeed, Hui *et al.* showed that the kinetics of LCK phosphorylation are vastly different when LCK is able to autophosphorylate on a membrane surface compared to in solution. These differences are both qualitative, in the order of phosphorylation of the two sites, and quantitative, in the rates of phosphorylation.15

In order to better understand the mechanisms through which LCK is regulated on the cell membrane, we have developed a computational model of LCK autophosphorylation and phosphorylation by the regulating kinase CSK. The model is fit to experimental data from the two-dimensional reconstituted membrane system developed by Hui and Vale.15 This data uses two concentrations of LCK: 500 molecules/μm^{2}, which corresponds to a physiological level of LCK in the cell, and 50 molecules/μm^{2}. One concentration of CSK is used, 500 molecules/μm^{2}, which is slightly higher than the maximal amount of CSK present in the cells. Modeling this minimal system will allow us to predict the fundamental mechanisms of LCK activation and improve our understanding of the differences between two- and three-dimensional enzyme kinetics. Several computational models have been developed to study the early phosphorylation events in T cell activation1,29,40; however, none of them have accounted for the different species of LCK or the effects that the various catalytic and binding activities of these different species will have on T cell activation. The model of LCK activation that we have developed provides a basis for understanding LCK phosphorylation and catalytic activity and can be implemented in larger models of T cell signaling. Thus our work will enable a better understanding of how LCK autoregulation affects the control of T cell activation in the context of TCR antigen discrimination and CAR signaling.

## Materials and Methods

### Data Extraction

Experimental data was extracted from Hui and Vale15 using the MATLAB GRABIT program (The MathWorks Inc., Natick, MA). To correspond to the data, all simulations were normalized by the simulated amount of LCK at 90 min.

### Model Structure

Several different models were tested to find the simplest mechanism that is able to reproduce the data. Each model was progressively more complex. First, a mechanism in which the LCK phosphorylation events are directly catalyzed without a binding step to form intermediate dimers, a model described by 18 kinetic parameters, was implemented. Subsequently, a Michaelis–Menten mechanism in which the individual LCK enzymatic species have the same catalytic rate regardless of the substrate, a model involving 25 kinetic parameters, was implemented. Neither of these models was able to both reproduce training data used for parameter estimation and predict data not used in the fitting process (results not shown).

The third model implemented a Michaelis–Menten mechanism in which all of the LCK species have different binding and catalytic rates. This mechanism was able to fit the training data and predict new data. In this model, it is assumed that the phosphorylation events are primarily governed by two main factors: the strength of the interaction between the enzyme-substrate pair (i.e., the dissociation constant, *k*
_{
d
}), and the catalytic rate of the enzyme on the substrate. The dissociation constant is the dissociation rate (*K*
_{
off
}) divided by the association rate (*K*
_{
on
}). To avoid over parameterizing the model, we assume the association rate to be the same for all of the 16 LCK pairs, reducing the number of LCK binding parameters from 32 to 17. In Michaelis–Menten kinetics, the catalytic rate is generally the rate limiting step, so *K*
_{
on
} was kept constant for all LCK–LCK or LCK–CSK binding pairs, as it is not expected to be rate limiting. The simplification of the association rates is also supported by studies showing that *K*
_{
on
} generally falls within a relatively small range (about one order of magnitude) for many different protein interactions.31,39 However, we still allow the *k*
_{
d
} values to differ between the LCK dimers by implementing a different *K*
_{
off
} for each pair. By estimating the same association rate and different dissociation rates for the different LCK dimers, each binding pair can remain bound for different amounts of time depending on the strength of the individual interactions, allowing the *k*
_{
d
} to span its full physiologically relevant range (more than 10 orders of magnitude).26,32

### Numerical Implementation of the Model

The model used to fit the training data is comprised of 23 non-linear ordinary differential equations (ODEs) (Supplementary File 1), and the model used for generating predictions with the catalytically inactive LCK is composed of 73 ODEs. The equations were written as a set of rules in BioNetGen,11 and implemented in MATLAB (The MathWorks Inc., Natick, MA). The model BioNetGen file is provided in Supplementary File 2, and the catalytically inactive model BioNetGen file is provided in Supplementary File 3.

### Parameter Estimation

Binding, on and off, and catalytic rates were estimated in an unbiased approach to find parameter sets that could both qualitatively differentiate between the rate of phosphorylation of Y394 and Y505 of LCK and quantitatively provide the best fit to the data.

Due to the large number of parameters to be fit, 38, and a lack of prior information about their possible ranges, a two-step approach was used to fit the model. First, a series of parameter sets was calculated by minimizing the weighted sum of the squared residual for a hybrid objective function (WSSR_{hybr}) that accounts for both the quantitative fit to the data and the qualitative order of the phosphorylation curves of the two substrate sites (Y394 and Y505).20 Without the addition of the qualitative order of the curves in the hybrid WSSR, all of the optimal parameter sets over-fit the conditions in which Y394 is phosphorylated faster than Y505 (High LCK, High LCK + CSK, and Low LCK) without capturing the increase in Y505 phosphorylation in the Low LCK + CSK case. Parameter sets that did not capture that increase were penalized by having a higher WSSR_{hybr}.

The WSSR_{hybr} is calculated by adding the WSSR for each data point to the WSSR for the distance between the Y394 and Y505 curves in each experimental condition:

where \(C_{exp,i}^{Y394}\) and \(C_{exp,i}^{Y505}\) are the *i*th experimentally measured LCK phosphorylation data point for Y394 or Y505, respectively. \(C_{sim,i}^{Y394}\) and \(C_{sim,i}^{Y505}\) are the simulated LCK phosphorylation at the *i*th time point. \(W_{i}^{Y394}\), \(W_{i}^{Y505}\), and \(W_{i}^{diff}\) are weighting terms, taken as 1/\(C_{exp,i}^{Y394}\), 1/\(C_{exp,i}^{Y505}\), and \(1/\left( {C_{\exp ,i}^{Y394} - C_{\exp ,i}^{Y505} } \right)\), respectively. *n* is the total number of experimental measurements. The minimization is subject to the upper and lower bounds of the free parameters, *θ*.

Particle swarm optimization (PSO)17 was used to find 1000 parameter sets that reached a minimum in the WSSR for the hybrid objective function. Briefly, PSO is able to efficiently search a parameter space by mimicking the ways in which groups of animals make decisions, for example how a colony of bees finds a new nesting site. Many particles move around the parameter space communicating their WSSR at each position. With each iteration, the positions and velocities of the particles are updated such that they approach a minimal WSSR. We used 31 particles searching a 38-dimensional bounded parameter space, with the particles starting at random points in the parameter space. Each iteration, a WSSR is calculated for every particle and the particles’ positions and velocities are then updated based on the their current WSSR and the global minimal WSSR. The algorithm is terminated when the global minimum WSSR remains constant for 50 iterations.

Next, the parameter sets from the hybrid WSSR were tailored to fit a quantitative WSSR (WSSR_{quant}). The 1000 hybrid parameter sets were then used in the second step as inputs to a more local parameter estimation approach, performed by the MATLAB lsqnonlin function. This algorithm solves the non-linear least squares problem using the trust-region-reflective optimization algorithm, minimizing the WSSR_{quant}:

where the variables are the same as those used in the WSSR_{hybr} function.

### Clustering

Parameter set clustering was done using the MATLAB kmeans function. The optimal number of clusters was determined using the silhouette method.36 The silhouette plots measure the confidence that a given point lies in the cluster to which it is assigned, with each point getting a score from −1 to 1. We used the sum of the silhouette plot to calculate the optimal number of clusters, which was found to be three.

### Sensitivity Analysis

The extended Fourier amplitude sensitivity test (eFAST), a global variance-based sensitivity analysis, was used to understand how different parameters (“model inputs”) affect model predictions (“model outputs”). This method has been used previously to analyze computational biological models.13,24 In this method, the values of all of the inputs are varied together at different frequencies within a specified range and the model outputs are recorded. We varied the parameters 100-fold up and down from their median values, shown in Table 1. The Fourier transform of the output indicates which parameter frequencies contribute most, thus, which parameters are most sensitive. Varying all of the parameters together allows us to calculate two different indices of sensitivity: the first-order FAST indices, *Si*, a measurement of the local sensitivity of individual inputs, and the total FAST indices, *STi*, a measurement of the global sensitivity which accounts for second and higher-order interactions between multiple inputs. A greater total index than first-order index indicates that an input is more important in combination with other parameters than alone. We implemented the eFAST method using MATLAB code developed by Kirschner and colleagues.27

### Statistical Analysis

All statistical analyses were determined with a one-way analysis of variance (ANOVA) using Graphpad Prism version 6 for Mac (GraphPad Software, San Diego, CA).

## Results

### Model Construction

We have constructed a model of LCK autophosphorylation and phosphorylation by the kinase CSK. Below, we describe the salient features of the model, and full details are provided in the Methods section. Hui *et al.* experimentally proved that LCK, starting from a pool of unphosphorylated LCK, is able to phosphorylate itself in *trans*, and that this is the predominant form of phosphorylation.15 Accordingly, our model assumes that each LCK substrate site, Y394 and Y505, must be phosphorylated by a catalytic site on a different LCK molecule. This is done by implementing a Michaelis–Menten mechanism in which each pair of enzyme and substrate LCK species has different binding and catalytic rates. This model is characterized by 38 kinetic parameters.

In the model, each of the four LCK species, referred to by their phosphorylation status as U_{394}U_{505}, P_{394}U_{505}, U_{394}P_{505}, and P_{394}P_{505}, are able to bind and phosphorylate the four substrate sites, Y394 of U_{394}U_{505} and U_{394}P_{505} and Y505 of U_{394}U_{505} and P_{394}U_{505}, with different kinetics. To simulate this, the catalytic domain of one LCK species in the model can interact with the Y394 or Y505 residue from another LCK species. This results in several possible dimer conformations between an LCK pair that each has the same association rate (*K*
_{
on
}), but have different dissociation rates (*K*
_{
off
}). The phosphorylation reactions can also be catalyzed at different rates depending on both the enzyme and substrate; subsequently, each of the 16 LCK dimer intermediates has a different catalytic rate (*K*
_{
cat
}). As an illustrative example, the interactions for a representative pair of LCK species, U_{394}U_{505} and P_{394}U_{505}, are shown in Fig. 1a. These two species have 3 different phosphorylation sites, Y394 on U_{394}U_{505} and Y505 on U_{394}U_{505} and P_{394}U_{505}, resulting in three different intermediate dimers. After binding with the same rate of association (*K*
_{
on
}), each of these species can unbind (*K*
_{
off,1
}, *K*
_{
off,2
}, *K*
_{
off,3
}) or catalyze a phosphorylation reaction (*K*
_{
cat,1
}, *K*
_{
cat,2
}, *K*
_{
cat,3
}) with different rates.

Like the LCK homodimers, CSK is able to associate with all of its substrates with the same association rate and has different dissociation and catalytic rates, depending on the substrate (Fig. 1b). It has been widely established in the literature that CSK is only able to phosphorylate LCK at Y505.41 Therefore, there are only two substrates available to CSK in the model (U_{394}U_{505} and P_{394}U_{505}), resulting in two intermediate CSK-LCK dimers.

The complete list of interacting pairs in the model and their respective dissociation and catalytic rates is shown in Table 1. In total, the model includes 23 unique species, the concentrations of which are described by 23 ordinary differential equations (ODEs) (Supplemental File 1). The association rate for all LCK pairs (*K*
_{
on
}) has a median value of 8.9 × 10^{−4} μm^{2}/molecules·s and a 90% confidence interval of 6.8 × 10^{−4}–1.9 × 10^{−3} μm^{2}/molecules·s. The association rate for all CSK-LCK pairs (K_{on,CSK}) has a median value of 5.9 × 10^{−4} μm^{2}/molecules·s and a 90% confidence interval of 4.4 × 10^{−4}–1.2 × 10^{−3}μm^{2}/molecules·s. The procedure used to estimate these rates is described in detail below.

### Determining the Optimal Parameter Sets

The model was fit to quantitative western blot data of LCK phosphorylation at Y394 and Y505 in a two-dimensional membrane reconstituted system obtained by Hui and Vale.15 We used four sets of Y394 and Y505 site-specific phosphorylation data to train the model: 500 molecules LCK/μm^{2}, 500 molecules LCK/μm^{2} + 500 molecules CSK/μm^{2}, 50 molecules LCK/μm^{2}, and 50 molecules LCK/μm^{2} + 500 molecules CSK/μm^{2}. A fifth set of site-specific data in which 50% of the LCK in the system is catalytically inactive was used as model validation, to test that the model parameters are able to predict data not used in the fitting process.

As most kinase kinetic studies are performed in solution, we could not directly apply any previous assumptions for the range of parameter values in this two-dimensional system. While there are numerical techniques that enable conversion of three-dimensional kinetic parameters to a two-dimensional system, there is no kinetic binding data of LCK–LCK dimers in either a two-dimensional or three-dimensional system to start from, apart from the autophosphorylation described in the paper by Hui and Vale. Additionally, Hui and Vale compared the autophosphorylation of LCK in solution to their two-dimensional membrane system and found that the phosphorylation kinetics for Y394 and Y505 do not change proportionately when transitioning between the two systems. In the membrane system, Y394 is phosphorylated much faster than Y505, while in solution it appears that Y505 is phosphorylated faster. In the solution data there is an initial rapid jump in Y505 phosphorylation above that of Y394, followed by a plateau and then a second phase in which both sites are rapidly phosphorylated, while in the membrane data the phosphorylation of Y505 is much steadier. These different dynamics imply that it would not be straightforward to inter-convert two-dimensional and three-dimensional kinetics. Therefore, we used an unsupervised fitting procedure in which the parameters are allowed to vary within very wide bounds (10^{−20} to 10^{10} μm^{2}/molecules·s for *K*
_{
on
}, and 10^{−20} to 10^{10} 1/s for *K*
_{
off
} and *K*
_{
cat
}).

We used a two-step fitting procedure to search the large parameter space and find parameter sets that can both qualitatively and quantitatively describe the training data. In the first step, we used particle swarm optimization (PSO) to minimize a hybrid weighted sum of the squared residuals (WSSR) objective function.20 This hybrid WSSR was used to optimize both the quantitative fit to the data points as well as a qualitative readout of the difference between the curves of phospho-Y394 and phospho-Y505 in each experimental setting (see methods for more detail). We used PSO to obtain 1000 parameter sets that could describe the differences in the rates of phosphorylation of the two sites for the different experimental conditions. PSO is a global optimization technique that enables efficient exploration of the parameter space.17 The hybrid WSSR values ranged from 3.3 × 10^{1} to 3.8 × 10^{9}, with a median value of 4.1 × 10^{2}. However, in the second step, all of the parameter sets obtained using PSO, regardless of their hybrid WSSR value, were tailored to better quantitatively fit the data using the MATLAB lsqnonlin function (MathWorks Inc., Natick, MA). Specifically, the parameter sets from PSO were used as starting points to minimize an objective function that calculates the WSSR between the experimental data and the model predictions. Each of the 1000 PSO parameter sets was tailored twice, resulting in 2000 parameter sets. The frequency distributions of the quantitative WSSR values are shown in Fig. 2a, ranging from 6.3 × 10^{0} to 1.1 × 10^{3}.

We used three criteria to determine the optimal parameter sets used for model simulations. Due to the large bounds, the majority of the 2000 parameter sets represented local minima that did not properly capture the data. Therefore, we first considered the parameter sets that were close to a global minimum, with respect to the training data, using the cumulative density function (CDF) of the WSSR. Secondly, we wanted to ensure that the parameter sets chosen were able to predict data not used in the fitting, termed validation data. Therefore, we used the CDF of the WSSR with respect to the validation data to find parameter sets that were able to predict the validation data well. Thirdly, we selected parameter sets that matched a molecularly detailed aspect of the Hui *et al.* data: in the high LCK experimental conditions, nearly 100% of the LCK is doubly phosphorylated by 90 min. To do this, we removed the parameter sets in which less than 90% of the LCK, in the high LCK condition, was doubly phosphorylated by 90 min.

Figures 2a and 2b shows CDF plots for the distributions of the quantitative WSSR values, which are used to find the parameter sets with good fits and predictions, respectively. The general trend is a sigmoidal function, with a tail at the beginning containing the parameter sets that all have low WSSR values. We started with the CDF plot for the training data set, and chose the end of the first step in the function as the cutoff for good fitting parameter sets (Fig. 2a, purple region, WSSR < 7.1). Then, taking only those parameter sets in the purple region, we calculated the WSSR values for the predicted data, which had a WSSR ranging from 2.3 to 8.9. Figure 2b shows the CDF function of these values. A similar cutoff point was chosen for parameter sets that had a good fit to the validation data (WSSR < 3.3), indicated by the yellow region.

The model fitting and parameter selection procedures described above resulted in 33 optimal parameter sets. However, these parameter sets showed high variability, and the median parameter values were not able to reproduce the data. Therefore, we clustered these parameter sets into three groups using the MATLAB kmeans function (Fig. 2c). The three clusters’ predictions of the validation data are shown in Supplemental Fig. 1, with median quantitative WSSR values of 6.8, 7.0, and 6.9 for the green, blue, and red clusters, respectively. These three groups provided different hypotheses for the kinetics of LCK phosphorylation. Although the green cluster had the lowest WSSR, it contained highly variable parameter sets without a significant difference between any of the catalytic rates (data not shown). Without statistical significance, no clear parameter values or mechanistic trends can be observed. The blue and red clusters did show significant differences between the parameters for different LCK species, indicating that specific LCK species interact with different kinetics (Supplemental Fig. 2). The primary difference between the blue and red clusters was the kinetics for the interactions of enzyme P_{394}U_{505} with Y505 on U_{394}U_{505} and enzyme P_{394}P_{505} with Y505 on P_{394}U_{505}. These differences effectively switched the contribution of these two reactions to the phosphorylation of Y505 in the training data simulations. Significantly, the red cluster had a lower WSSR value, indicating that it was able to match the training data and predict the validation data better than the blue cluster; therefore, the 20 parameter sets within the red cluster were determined to be optimal and were used in subsequent simulations.

### Model Fitting

Using the 20 optimal parameter sets, the model is able to accurately match the experimental data from Hui and Vale (Fig. 3). In Figs. 3a–3c, LCK Y394 is phosphorylated faster than Y505, and there is more phospho-Y394 in the system than phospho-Y505 at each time point. Conversely, in Fig. 3d, these rates are reversed, and there is more Y505 in the system. The model is able to capture this switch in the rates of Y505 and Y394 phosphorylation. In the high LCK data without or with CSK (Figs. 3a, 3b, respectively), there is a very sharp increase in Y394 phosphorylation within the first 15 s, followed by a much slower increase in phosphorylation. The model is able to capture this biphasic nature as well. Additionally, the model can refine the sharp response in the low LCK + CSK condition, compared to the high LCK conditions (Fig. 3d, blue line).

There are clear statistical differences between many of the estimated kinetic rates, despite high variability in their fitted values (Fig. 4). The majority of the estimated parameter values vary over a wide range, sometimes over 10 orders of magnitude or more. However, all of the sets are able to reproduce the data used in parameter fitting. Comparing the rates of different enzymes catalyzing the phosphorylation of a single substrate is of particular interest (Figs. 4a and 4c), as these comparisons enable a better understanding of the catalytic activity of individual LCK species. In general, the catalytic rates between enzyme species on a single substrate vary more than the dissociation rates for that substrate. Most of the dissociation rates are relatively high compared to the rate of association, with the exception of the dissociation rate of P_{394}P_{505} enzyme with P_{394}U_{505} substrate, which is consistently lower than the rate of association. This indicates that the P_{394}P_{505}-P_{394}U_{505} dimer can remain bound for a longer period than most others. The catalytic rate of U_{394}P_{505} is always statistically lower than the catalytic rate of all other species. Interestingly, both the LCK and CSK association rates, which are shared between the dimer pairs, are highly conserved, with only one parameter set falling outside of a tight range of less than two-fold (Fig. 4b).

### Model Validation

In addition to fitting the training data and estimating the optimal parameter values, the model with the optimal parameter sets is able to match validation data not used in the fitting. Hui and Vale quantified the amount of phospho-Y394 and -Y505 when 50% of the LCK in the system (250 molecules LCK/μm^{2}) was made catalytically inactive by a point mutation in the ATP binding site. To simulate this condition, we included a separate LCK species that can be phosphorylated at Y394 and Y505, but cannot act as an enzyme to catalyze the phosphorylation of other molecules. This model assumes that the catalytically inactive LCK interacts with the same kinetic parameters as active LCK. Figure 5 shows that the optimal parameter sets from the model fitting also provide a good match to the experimental validation data not used in the fitting, capturing the sharp early increase in phospho-Y394, the variable slope of phospho-Y505 levels, and the finding that the rate of Y394 is higher than that of Y505. This lends confidence to the predictive ability of the optimized model.

### Sensitivity Analysis

Despite the large variation in many of the parameters, the model is robust and can withstand a high level of biological variability. This is evident in the tight range of model fits and predictions shown in Figs. 3 and 5, respectively (i.e., the small band illustrating the 90% confidence intervals). Additionally, using the median parameter values from all 20 optimal parameter sets, the model is still able to recreate the data within the 50% confidence interval. This indicates significant model robustness, even considering a high level of biological variability.

To further quantify how sensitive the model is to the parameters, we performed a parameter sensitivity analysis using the extended Fourier amplitude sensitivity test (eFAST).27 The eFAST analysis is a global variance based method in which all of the parameters are varied together at different frequencies. The Fourier transform of the output can then be analyzed to determine which frequencies, and thus which parameters, have the most influence on the model outputs. This method calculates the first order eFAST indices (*Si*), a measurement of the local sensitivity of each parameter, as well as the total eFAST indices (*STi*), which takes into account the effects of higher order interactions between parameters.

Results from the global sensitivity analysis further quantify the robustness of the model. We performed the eFAST analysis to determine the sensitivity of the total phospho-Y394 and total phospho-Y505 in the system, at specific time points, with respect to all 38 kinetic parameters. There were no significant differences between the eFAST results for the high and low concentrations of LCK. Additionally, the qualitative results for the LCK specific parameters did not change with the addition of CSK; therefore, we only show the indices for the condition of high LCK + CSK (Fig. 6). The first order indices were slightly lower than the total indices, but qualitatively the same. The values of the sensitivity indices show that the levels of phosphorylated Y394 and Y505 are most sensitive to the association rates of the LCK species with each other and with CSK, the dissociation rate of CSK and U_{394}U_{505}, and the catalytic activity of CSK for U_{394}U_{505}. These are the same parameters for which the estimated values obtained from the model fitting have a narrow distribution. However, the levels of phospho-Y394 and -Y505 are insensitive to most of the parameters, justifying the large deviation in the distributions of the parameters’ estimated values (Fig. 4).

### Predicted Mechanism of LCK Activation

The molecular detail of our model allows us to make predictions regarding the mechanisms of LCK autophosphorylation and phosphorylation by CSK. Specifically, we can compare the median value of the estimated catalytic rates for each of the LCK enzyme species, P_{394}U_{505}, U_{394}P_{505}, U_{394}U_{505}, and P_{394}P_{505}, as shown in Figs. 7a, 7b, 7c, and 7d, respectively, with a summary of the pairwise interactions shown in Fig. 7e. The parameter estimation reveals that P_{394}U_{505} has the highest overall catalytic activity (i.e., this enzyme species only has red or purple arrows pointing to the phosphorylation reactions, Fig. 7a). Conversely, U_{394}P_{505} has the lowest activity (Fig. 7b). These results are in agreement with what has been shown in the literature for the catalytic rates of these enzymes on other substrates, such as the CD3z chain ITAMs.9,15 The catalytic activity of the U_{394}U_{505} and P_{394}P_{505} species varies depending on the substrate. U_{394}U_{505} preferentially phosphorylates itself and catalyzes phosphorylation at Y394 and Y505 with approximately the same rate. In comparison, P_{394}P_{505} shows a strong preference for site Y394 of U_{394}U_{505}. CSK is estimated to have higher catalytic activity against P_{394}U_{505} than U_{394}U_{505}, predicting that CSK will phosphorylate P_{394}U_{505} more readily than U_{394}U_{505}, which has been validated experimentally.5,41

The estimated catalytic activities identify specific molecular interactions that produce a negative feedback loop in LCK activation. The P_{394}U_{505} species, the most catalytically active form of the LCK, preferentially phosphorylates Y505, compared to Y394, with a difference of over three orders of magnitude (Fig. 7a, red arrows). This is significant because Y505 is thought to be the inhibitory site, generally, and its phosphorylation reduces the catalytic activity of LCK, thus providing a possible form of negative feedback. Doubly phosphorylated LCK, P_{394}P_{505}, on the other hand, preferentially phosphorylates U_{394}U_{505} to P_{394}U_{505}, increasing the overall catalytic activity of the pool of LCK. It has been shown that other intermolecular feedback mechanisms do play an important role in controlling and tailoring the T cell response42; however, these specific autoregulatory feedback mechanisms have not been identified before. Thus the model predicts, for the first time, that competing intramolecular feedback loops could help stabilize and control the overall activity of LCK. It will be important to see if these effects are still significant when more complex interactions between phosphatases and other substrates are taken into account.

The model also predicts the relative amounts of each LCK species in the system over time. In general, LCK starts in a completely unphosphorylated form and transitions through a singly phosphorylated intermediate to end with all of the LCK doubly phosphorylated (Fig. 8). The results are shown for both high and low LCK concentrations with and without CSK.

The predicted levels of the LCK species are directly influenced by the binding kinetics. The model predicts that a significant amount of P_{394}U_{505} and P_{394}P_{505} can remain bound together when they are both present in the system (Figs. 8i and 8k). This is due to the low dissociation rate for enzyme P_{394}P_{505} bound to substrate P_{394}U_{505} (Fig. 4a). U_{394}P_{505} and U_{394}U_{505} are also able to bind together, but this association is not as strong as that of P_{394}P_{505} and P_{394}U_{505} and seems to be primarily a result of the low catalytic activity for all binding conformations of these pairs. A similar, but even smaller, binding interaction is present between U_{394}P_{505} and P_{394}U_{505}. When CSK is added to the system, it serves as a sink for LCK, binding to it and keeping it in the system for longer. Significantly, much more U_{394}U_{505} is bound to CSK than P_{394}U_{505}, as shown by the sharp increase in bound U_{394}U_{505} when CSK is present (comparing Figs. 8l to 8k, blue lines), compared to the more modest increase in P_{394}U_{505} (comparing Figs. 8l to 8k, green lines).

The model allows us to explore the detailed mechanisms that govern previously unexplained features of the data. The presence of CSK significantly influences the levels of the LCK species. Adding CSK to the system greatly reduces the total amount of P_{394}U_{505}, leaving almost none of this highly active form free to interact with other species (comparing Figs. 8h to 8g, red lines). In the low LCK conditions, CSK also increases the amount of U_{394}P_{505}, greatly reducing the overall catalytic activity of the total pool of LCK in this experimental condition (comparing Figs. 8d to 8c, green lines). In comparison, in the high LCK + CSK experimental simulation, the total amount of U_{394}P_{505} does not significantly change compared to the high LCK condition (comparing Figs. 8b to 8a, green lines), while the amount of P_{394}U_{505} is still greatly reduced (Figs. 8b to 8a, red lines). The model predicts that this difference in the change of intermediate U_{394}P_{505} is responsible for the shift in the curves of phospho-Y394 and phospho-Y505 in Fig. 3d of the model training data sets.

## Discussion and Conclusion

We have constructed a model of LCK activation *via* autophosphorylation and phosphorylation by the kinase CSK based on data from an *in vitro* two-dimensional membrane system.15 LCK is an important regulator of T cell activation, and quantifying the kinetics that govern its activity will allow us to better understand and engineer T cells for therapeutic purposes. The kinetics of LCK phosphorylation in this *in vitro* membrane system are very different from those that occur in more commonly used solution systems. It is believed that this two-dimensional system more accurately reflects what occurs *in vivo*, as much of the LCK in T cells is bound to the inside of the membrane.46 Most prior biological computational signaling models have relied on enzyme solution kinetic parameters for initial estimates. However, we believe that by focusing on two-dimensional kinetics that are more representative of what occurs *in vivo*, we can create models that are more predictive.

Our model is able to fit the data of membrane bound LCK phosphorylation well, both quantitatively and qualitatively. Additionally, we are able to identify the specific kinetic parameters that most significantly control LCK phosphorylation. One limitation of the model is its inability to match early time point experimental measurements of the phospho-Y505 curve in the low LCK + CSK experimental condition (Fig. 3d). However, there are no error bars for the experimental data, and it is possible that these data may have some experimental error. The recombinant LCK protein used in this system is autophosphorylated as it is expressed, so it must first be dephosphorylated before the start of the experiment. Hui *et al.* used mass spectrometry to measure the efficiency of this dephosphorylation and found that a small amount of Y505 is still phosphorylated at the start of the experiment (~1.5%); however, many graphs show that the initial phosphorylation of Y505 is much higher than that, up to ~20%. This large variability in the starting concentration of phospho-Y505 could lead to an overestimation of the initial rate of Y505 phosphorylation in the low LCK + CSK condition. Additionally, the data are derived from quantitative western blotting, and there may be error in the band intensity readings, particularly for early time points where the band intensity is very close to background. For these reasons, it is possible that the initial increase in phospho-Y505 in the low LCK + CSK condition does not truly reflect LCK kinetics.

We applied an unbiased approach to fit the parameters, generating a set of optimal parameter values that are able to reproduce the data. Despite high variability in the fitted parameters, there are statistically significant differences between the estimated dissociation rates of the LCK dimers and catalytic activities of the LCK enzymes. The statistical analysis along with the global sensitivity analysis indicate that the proposed mechanism of LCK activation implemented in the model is robust and predictive. The model predictions reveal that the levels of individual LCK species can remain within a tight range, even with high variability in the parameter rates. This model robustness is biologically relevant, since it has been shown that the local microenvironment around a pool of LCK in the cell changes dramatically depending on the state of the cell and the proximity of other molecules.12,16,19,35

The model brings many new insights into the autoregulatory mechanisms of LCK with respect to the binding of LCK dimers. For example, the model predicts that P_{394}U_{505} and P_{394}P_{505} are able to form a relatively strong dimer compared to P_{394}U_{505}-U_{394}P_{505} or U_{394}U_{505}-U_{394}P_{505}. Additionally, there are pairs that do not significantly dimerize at all. A crystal structure of the LCK SH2 and SH3 domains show that these domains can homodimerize, and that this binding may be stabilized by the addition of the phosphorylated Y505 tail.10 However, it does not provide any information about how interactions from other domains, particularly the domain containing Y394, control the extent of this dimerization. The model parameters specify which dimers are able to bind more strongly, and from that, we can infer the role that these other domains play in LCK binding. Since the P_{394}U_{505}-P_{394}P_{505} dimer is stronger than P_{394}U_{505}-U_{394}P_{505}, we can hypothesize that the phospho-Y505 tail of P_{394}P_{505} may be more amenable to stabilizing the P_{394}U_{505}-P_{394}P_{505} dimer interaction than that of U_{394}P_{505}. This may be because the phospho-Y394 in P_{394}P_{505} keeps the molecule in a partially open conformation while the tail of U_{394}P_{505} is held in a closed conformation through *cis* binding.9 Also, since U_{394}U_{505} and P_{394}U_{505} do not dimerize with themselves or each other, we can conclude that the stabilization from the phospho-Y505 tail is important in the LCK intermolecular interactions.

The model also predicts that CSK plays a very strong role in controlling the distribution of LCK species in the model, which could be important for controlling LCK activity *in vivo*. Figure 8 shows that CSK is able to bind to LCK and increase the amount of U_{394}U_{505} and U_{394}P_{505} in the system while reducing the amount of P_{394}U_{505} and P_{394}P_{505}. It is known that clusters of T cell signaling molecules reside close to each other on lipid rafts inside the T cell membrane.12,19 The composition of these clusters changes as the T cell becomes activated and the immunological synapse begins to form.16 Keeping LCK clustered with CSK before synapse formation could act as a control mechanism to reduce aberrant LCK signaling in unstimulated cells. Once the synapse forms and CSK is sequestered outside of the synapse region, enough LCK can accumulate to lead to high levels of active P_{394}U_{505}. More studies need to be done to better understand how the possible LCK autoregulatory feedback mechanisms and CSK function *in vivo* when there are more substrates and phosphatases in the system.

The model also predicts new binding relationships between CSK and LCK that have not been identified experimentally. The model indicates that CSK is able to bind more strongly to U_{394}U_{505} than to P_{394}U_{505} (Fig. 8l). Conversely, studies of LCK binding to CSK in solution have shown that CSK is able to bind to P_{394}U_{505}, but not U_{394}U_{505}.5 Combined, these results suggests that CSK binding to U_{394}U_{505} could be an effect of the two-dimensional membrane system, indicating a significant difference between the mechanisms that occur in solution and those that are able to take place in a more physiologically relevant membrane bound arrangement.

Comparing the model simulations to data from LCK phosphorylation in solution continues to shed light on the differences between studying molecular kinetics in solution and in the native two-dimensional distribution. Hui *et al.* compared their experimental membrane system to a traditional solution system. The authors found that the rates at which the two LCK sites were phosphorylated were significantly different, and that the measured kinetics for levels of phospho-Y394 and -Y505 did not change proportionately. In the membrane system, for high LCK, Y394 is rapidly phosphorylated while Y505 slowly increases in a more steady manner (Fig. 3a). In solution, however, Y394 and Y505 phosphorylation both remain at their starting levels for about 10 min and then both increase very rapidly.

The model indicates that distinct mechanistic interactions can potentially contribute to differences in the LCK phosphorylation kinetics that occur in two-dimensions compared to solution. Our model and estimated parameter sets were obtained by fitting LCK phosphorylation data from the *in vitro* reconstituted membrane system developed by Hui and Vale.15 We also attempted to fit data for LCK phosphorylation measure in solution. Since a key distinction between the two-dimensional and solution-based systems is that the species’ amounts are given in units of density rather than concentration, we attempted to fit the in solution data by only adjusting the association rates. The association rates are the only parameters that depend on the amount of a species (i.e., *K*
_{
on
} has units of μm^{2}/molecules·s), whereas the dissociation and catalytic rates do not depend on concentration. We followed the same parameter fitting procedure described above using each of the membrane bound optimal parameter sets described in the paper as starting values to minimize the quantitative WSSR equation with the MATLAB lsqnonlin function; however, we were unable to fit the solution data. We then expanded our fitting of the solution data to include the association and dissociation rates, or the association, dissociation, and catalytic rates. The data still could not be fit with the mechanism used in the model. Although more experiments need to be done to properly compare the differences between LCK in solution and on the two-dimensional membrane surface, we believe this could point to a difference not only in the parameter values but also in the mechanism of LCK phosphorylation between the two settings.

Excitingly, the fitted model generates testable hypotheses, and the experimental *in vitro* two-dimensional membrane system can be used to explore some of these model predictions. The estimated parameter values and model predictions support the presence of both negative and positive autoregulated feedback on the catalytic activity of LCK, which have not been described previously. The negative feedback comes from catalytically active P_{394}U_{505} preferentially phosphorylating other LCK molecules at the Y505 inhibitory site. The positive feedback comes from the moderately active P_{394}P_{505} species preferentially pushing doubly unphosphorylated LCK to the active P_{394}U_{505} form (Figs. 7a and 7d). These new feedback mechanisms, hypothesized by the model predictions, can be tested with targeted experiments that focus specifically on the catalytic activities of individual phospho-LCK species. We can do this by mixing LCK that is either doubly phosphorylated or specifically phosphorylated only at Y394 with other, catalytically inactive, LCK species. It is also possible to test model hypotheses about the significance of bound dimers, like CSK and U_{394}U_{505}, by inserting domain deficient mutants, such as LCK lacking the SH2 or SH3 domains, into the two-dimensional membrane system. Thus, a systems biology approach of using an optimized and validated computational model in combination with quantitative experimental approaches can provide new and relevant biological insight into LCK activation.

It is possible to improve and strengthen the model by adding new proteins into this same *in vitro* membrane reconstituted system and performing model parameter estimation, as we have done here. This will allow us to better understand how individual proteins combine to produce the functions of the system as a whole. For example to better understand the mechanisms of LCK activation, we can incorporate dephosphorylating events, through proteins like CD45 and PTPN22, into the model to see how that action impacts the overall levels of individual LCK species.28 Having more data to fit the model will also help to more specifically identify the LCK kinetic parameters, many of which are still highly variable in the current model. We can also study more specific mechanisms of LCK by adding its substrates, CD3ζ, ZAP-70, and SHP-1 into the system. The model also serves as a starting point for studying the order and kinetics of LCK-mediated phosphorylation of the six CD3ζ ITAM tyrosine phosphorylation sites.8,21,43 We believe that the model provides a quantitative framework for studying many different protein interactions relevant to T cell signaling, particularly those involving LCK.

In summary, the model is a predictive tool that can be used to examine the dynamics of LCK autoregulation. As we continue to expand the model, we can use it to make new predictions about the larger systems that govern T cell activation and explore key biological hypotheses, like those described above. Many of the mechanistic questions described in this paper have proven difficult to investigate experimentally; however, using the computational framework described here, we will be able to explore these issues on a more quantitative level, providing insights and new testable hypotheses.

## References

Altan-Bonnet, G., and R. N. Germain. Modeling t cell antigen discrimination based on feedback control of digital erk responses.

*PLoS Biol.*3(11):e356, 2005.Ballek, O., J. Valecka, J. Manning, and D. Filipp. The pool of preactivated lck in the initiation of t-cell signaling: a critical re-evaluation of the lck standby model.

*Immunol. Cell Biol.*93(4):384–395, 2015.Bergman, M., T. Mustelin, C. Oetken, J. Partanen, N. A. Flint, K. E. Amrein, M. Autero, P. Burn, and K. Alitalo. The human p50csk tyrosine kinase phosphorylates p56lck at tyr-505 and down regulates its catalytic activity.

*EMBO J.*11(8):2919–2924, 1992.Berry, H. Monte carlo simulations of enzyme reactions in two dimensions: fractal kinetics and spatial segregation.

*Biophys. J.*83(4):1891–1901, 2002.Bougeret, C., T. Delaunay, F. Romero, P. Jullien, H. Sabe, H. Hanafusa, R. Benarous, and S. Fischer. Detection of a physical and functional interaction between csk and lck which involves the sh2 domain of csk and is mediated by autophosphorylation of lck on tyrosine 394.

*J. Biol. Chem.*271(13):7465–7472, 1996.Brownlie, R. J., and R. Zamoyska. T cell receptor signalling networks: branched, diversified and bounded.

*Nat. Rev. Immunol.*13(4):257–269, 2013.Burnett, R. C., J. C. David, A. M. Harden, M. M. Le Beau, J. D. Rowley, and M. O. Diaz. The lck gene is involved in the t(1;7)(p34;q34) in the t-cell acute lymphoblastic leukemia derived cell line, hsb-2.

*Genes Chromosomes Cancer*3(6):461–467, 1991.Chae, W. J., H. K. Lee, J. H. Han, S. W. Kim, A. L. Bothwell, T. Morio, and S. K. Lee. Qualitatively differential regulation of t cell activation and apoptosis by t cell receptor zeta chain itams and their tyrosine residues.

*Int. Immunol.*16(9):1225–1236, 2004.Chakraborty, A. K., and A. Weiss. Insights into the initiation of tcr signaling.

*Nat. Immunol.*15(9):798–807, 2014.Eck, M. J., S. K. Atwell, S. E. Shoelson, and S. C. Harrison. Structure of the regulatory domains of the src-family tyrosine kinase lck.

*Nature*368(6473):764–769, 1994.Faeder, J. R., M. L. Blinov, and W. S. Hlavacek. Rule-based modeling of biochemical systems with bionetgen.

*Methods Mol. Biol.*500:113–167, 2009.Filipp, D., B. L. Leung, J. Zhang, A. Veillette, and M. Julius. Enrichment of lck in lipid rafts regulates colocalized fyn activation and the initiation of proximal signals through tcr alpha beta.

*J. Immunol.*172(7):4266–4274, 2004.Finley, S. D., M. Dhar, and A. S. Popel. Compartment model predicts vegf secretion and investigates the effects of vegf trap in tumor-bearing mice.

*Front Oncol.*3:196, 2013.Goldman, F. D., Z. K. Ballas, B. C. Schutte, J. Kemp, C. Hollenback, N. Noraz, and N. Taylor. Defective expression of p56lck in an infant with severe combined immunodeficiency.

*J. Clin. Invest.*102(2):421–429, 1998.Hui, E., and R. D. Vale. In vitro membrane reconstitution of the t-cell receptor proximal signaling network.

*Nat. Struct. Mol. Biol.*21(2):133–142, 2014.Huppa, J. B., and M. M. Davis. T-cell-antigen recognition and the immunological synapse.

*Nat. Rev. Immunol.*3(12):973–983, 2003.Iadevaia, S., Y. Lu, F. C. Morales, G. B. Mills, and P. T. Ram. Identification of optimal drug combinations targeting cellular networks: integrating phospho-proteomics and computational network analysis.

*Cancer Res.*70(17):6704–6714, 2010.Ilangumaran, S., S. Arni, G. van Echten-Deckert, B. Borisch, and D. C. Hoessli. Microdomain-dependent regulation of lck and fyn protein-tyrosine kinases in t lymphocyte plasma membranes.

*Mol. Biol. Cell*10(4):891–905, 1999.Kabouridis, P. S. Lipid rafts in t cell receptor signalling (review).

*Mol. Membr. Biol.*23(1):49–57, 2006.Kanodia, J., D. Chai, J. Vollmer, J. Kim, A. Raue, G. Finn, and B. Schoeberl. Deciphering the mechanism behind fibroblast growth factor (fgf) induced biphasic signal-response profiles.

*Cell Commun. Signal.*12:34, 2014.Kersh, E. N., A. S. Shaw, and P. M. Allen. Fidelity of t cell activation through multistep t cell receptor zeta phosphorylation.

*Science*281(5376):572–575, 1998.Kofler, D. M., M. Chmielewski, G. Rappl, A. Hombach, T. Riet, A. Schmidt, A. A. Hombach, C. M. Wendtner, and H. Abken. Cd28 costimulation impairs the efficacy of a redirected t-cell antitumor attack in the presence of regulatory t cells which can be overcome by preventing lck activation.

*Mol. Ther.*19(4):760–767, 2011.Laham, L. E., N. Mukhopadhyay, and T. M. Roberts. The activation loop in lck regulates oncogenic potential by inhibiting basal kinase activity and restricting substrate specificity.

*Oncogene*19(35):3961–3970, 2000.Linderman, J. J., N. A. Cilfone, E. Pienaar, C. Gong, and D. E. Kirschner. A multi-scale approach to designing therapeutics for tuberculosis.

*Integr. Biol. (Camb)*7(5):591–609, 2015.Love, P. E., and S. M. Hayes. Itam-mediated signaling by the t-cell antigen receptor.

*Cold Spring Harb. Perspect. Biol.*2(6):a002485, 2010.Mangialavori, I., M. Ferreira-Gomes, M. F. Pignataro, E. E. Strehler, and J. P. Rossi. Determination of the dissociation constants for Ca

^{2+}and calmodulin from the plasma membrane Ca^{2+}pump by a lipid probe that senses membrane domain changes.*J. Biol. Chem.*285(1):123–130, 2010.Marino, S., I. B. Hogue, C. J. Ray, and D. E. Kirschner. A methodology for performing global uncertainty and sensitivity analysis in systems biology.

*J. Theor. Biol.*254(1):178–196, 2008.McNeill, L., R. J. Salmond, J. C. Cooper, C. K. Carret, R. L. Cassady-Cain, M. Roche-Molina, P. Tandon, N. Holmes, and D. R. Alexander. The differential regulation of lck kinase phosphorylation sites by cd45 is critical for t cell receptor signaling responses.

*Immunity*27(3):425–437, 2007.Mukhopadhyay, H., S.-P. Cordoba, P. K. Maini, P. A. van der Merwe, and O. Dushek. Systems model of t cell receptor proximal signaling reveals emergent ultrasensitivity.

*PLoS Comput. Biol.*9(3):e1003004, 2013.Nika, K., C. Soldani, M. Salek, W. Paster, A. Gray, R. Etzensperger, L. Fugger, P. Polzella, V. Cerundolo, O. Dushek, T. Hofer, A. Viola, and O. Acuto. Constitutively active lck kinase in t cells drives antigen receptor signal transduction.

*Immunity*32(6):766–777, 2010.Northrup, S. H., and H. P. Erickson. Kinetics of protein-protein association explained by brownian dynamics computer simulation.

*Proc. Natl. Acad. Sci. USA*89(8):3338–3342, 1992.Piran, U., and W. J. Riordan. Dissociation rate constant of the biotin-streptavidin complex.

*J. Immunol. Methods*133(1):141–143, 1990.Ramer, S. E., D. G. Winkler, A. Carrera, T. M. Roberts, and C. T. Walsh. Purification and initial characterization of the lymphoid-cell protein-tyrosine kinase p56lck from a baculovirus expression system.

*Proc. Natl. Acad. Sci. USA*88(14):6254–6258, 1991.Rossy, J., D. M. Owen, D. J. Williamson, Z. Yang, and K. Gaus. Conformational states of the kinase lck regulate clustering in early t cell signaling.

*Nat. Immunol.*14(1):82–89, 2013.Rossy, J., D. J. Williamson, C. Benzing, K. Gaus. The integration of signaling and the spatial organization of the T cell synapse.

*Front. Immunol.*3:352, 2012.Rousseeuw, P. J. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis.

*J. Comput. Appl. Math.*20:53–65, 1987.Sadelain, M., R. Brentjens, and I. Riviere. The basic principles of chimeric antigen receptor design.

*Cancer Discov.*3(4):388–398, 2013.Salmond, R. J., A. Filby, I. Qureshi, S. Caserta, and R. Zamoyska. T-cell receptor proximal signaling via the src-family kinases, lck and fyn, influences t-cell activation, differentiation, and tolerance.

*Immunol. Rev.*228(1):9–22, 2009.Schlosshauer, M., and D. Baker. Realistic protein-protein association rates from a simple diffusional model neglecting long-range interactions, free energy barriers, and landscape ruggedness.

*Protein Sci.*13(6):1660–1669, 2004.Sjölin-Goodfellow, H., M. P. Frushicheva, Q. Ji, D. A. Cheng, T. A. Kadlecek, A. J. Cantor, J. Kuriyan, A. K. Chakraborty, A. R. Salomon, and A. Weiss. The catalytic activity of the kinase zap-70 mediates basal signaling and negative feedback of the t cell receptor pathway.

*Sci. Signal.*8(377):49, 2015.Sondhi, D., W. Xu, Z. Songyang, M. J. Eck, and P. A. Cole. Peptide and protein phosphorylation by protein tyrosine kinase csk: insights into specificity and mechanism.

*Biochemistry*37(1):165–172, 1998.Stefanova, I., B. Hemmer, M. Vergelli, R. Martin, W. E. Biddison, and R. N. Germain. Tcr ligand discrimination is enforced by competing erk positive and shp-1 negative feedback pathways.

*Nat. Immunol.*4(3):248–254, 2003.van Oers, N. S., B. Tohlen, B. Malissen, C. R. Moomaw, S. Afendis, and C. A. Slaughter. The 21- and 23-kd forms of tcr zeta are generated by specific itam phosphorylations.

*Nat. Immunol.*1(4):322–328, 2000.Yamaguchi, H., and W. A. Hendrickson. Structural basis for activation of human lymphocyte kinase lck upon tyrosine phosphorylation.

*Nature*384(6608):484–489, 1996.Zikherman, J., C. Jenne, S. Watson, K. Doan, W. Raschke, C. C. Goodnow, and A. Weiss. Cd45-csk phosphatase-kinase titration uncouples basal and inducible t cell receptor signaling during thymic development.

*Immunity*32(3):342–354, 2010.Zimmermann, L., W. Paster, J. Weghuber, P. Eckerstorfer, H. Stockinger, and G. J. Schütz. Direct observation and quantitative analysis of lck exchange between plasma membrane and cytosol in living t cells.

*J. Biol. Chem.*285(9):6063–6070, 2010.

## Acknowledgments

The authors would like to acknowledge Dr. Ronald D. Vale and Dr. Enfu Hui for their help in clarifying the experimental data. This work was supported by the National Cancer Institute of the National Institutes of Health under Award Number F31CA200242 (to J.A.R.).

### Conflict of interest

J. Rohrs, P. Wang, and S. Finley declare that they have no conflicts of interest.

### Ethical Standards

No human or animal studies were carried out by the authors for this article.

## Author information

### Authors and Affiliations

### Corresponding author

## Additional information

Associate Editor Michael R. King oversaw the review of this article.

**Stacey D. Finley** is the Gabilan Assistant Professor of Biomedical Engineering at the University of Southern California, where she directs the Computational Systems Biology Laboratory. Her research group develops mechanistic models of biological processes and utilizes the models to gain insight into the dynamics and regulation of biological systems and enable the development of novel therapeutics for pathological conditions. Dr. Finley’s research interests include applying computational modeling to investigate tumor angiogenesis, tumor metabolism, and cancer immunotherapy. Dr. Finley received her B.S. in Chemical Engineering from Florida A & M University and obtained her Ph.D. in Chemical Engineering from Northwestern University. She completed postdoctoral training at Johns Hopkins University in the Department of Biomedical Engineering, where she was awarded postdoctoral fellowships from the NIH National Research Service Award and the UNCF/Merck Science Initiative. Dr. Finley was named a 2015 Emerging Scholar and is currently a Keystone Symposia Fellow. Most recently, she received an NSF CAREER award. Dr. Finley has a joint appointment in the Department of Chemical Engineering and Materials Science and is an associate member of the USC Norris Comprehensive Cancer Center.

This article is part of the 2016 Young Innovators Issue.

## Electronic supplementary material

Below is the link to the electronic supplementary material.

### 12195_2016_438_MOESM1_ESM.tiff

Supplemental File 1: Ordinary Differential Equations. This file lists the 23 ordinary differential equations from the model of LCK + CSK, as well as a table of the symbols representing each species described by the ODEs. Supplementary material 1 (TIFF 199216 kb)

### 12195_2016_438_MOESM2_ESM.tiff

Supplemental File 2: LCK + CSK Model BioNetGen Source Code. BioNetGet source code to generate the MATLAB executable ODEs for the model of LCK + CSK in a two-dimensional reconstituted membrane system.. Supplementary material 2 (TIFF 84183 kb)

### 12195_2016_438_MOESM3_ESM.pdf

Supplemental File 3: 50% Inactive LCK Model BioNetGen Source Code. BioNetGet source code to generate the MATLAB executable ODEs for the model of 50% inactive LCK + 50% wild type LCK in a two-dimensional reconstituted membrane system. Supplementary material 3 (PDF 44 kb)

### 12195_2016_438_MOESM4_ESM.bngl

Predictions for three parameter set clusters. Results of model validation of the three different parameter clusters, green (a), blue (b), and red (c), from Figure 2c. This validation data, taken from the literature (Hui, 2014), uses a reconstituted in vitro membrane system of LCK phosphorylation in which 50% of the LCK is catalytically inactive due to a point mutation at the ATP binding site and 50% is normally active. The model fits (lines) to the data (dots) are shown with 50% and 90% confidence intervals (dark and light shaded areas, respectively), for phospho-Y394 (green) and phospho-Y505 (purple). The experimental data is normalized by the western blot band intensity at 90 minutes, and the simulations are normalized by the concentration of LCK at the end of the 90 minutes simulation. Supplementary material 4 (BNGL 7 kb)

### 12195_2016_438_MOESM5_ESM.bngl

Comparison of Red Cluster and Blue Cluster Parameters. The comparison of estimated parameter sets for the blue and red cluster of Figure 2c are shown for the (a) binding parameters, (b) catalytic parameters, and (c) CSK parameters. The data show the values of the parameter sets with the median and range. Statistically significant differences between the clusters are denoted by bars above the points, with different numbers of stars representing different levels of statistical significance as calculated by a one-way ANOVA. Supplementary material 5 (BNGL 14 kb)

## Rights and permissions

**Open Access** This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

## About this article

### Cite this article

Rohrs, J.A., Wang, P. & Finley, S.D. Predictive Model of Lymphocyte-Specific Protein Tyrosine Kinase (LCK) Autoregulation.
*Cel. Mol. Bioeng.* **9**, 351–367 (2016). https://doi.org/10.1007/s12195-016-0438-7

Received:

Accepted:

Published:

Issue Date:

DOI: https://doi.org/10.1007/s12195-016-0438-7

### Keywords

- Systems biology
- Computational modeling
- T cell signaling
- Parameter estimation