Analysing Student Programs in the PHP Intelligent Tutoring System

Weragama, Dinesha; Reye, Jim

doi:10.1007/s40593-014-0014-z

Analysing Student Programs in the PHP Intelligent Tutoring System

Research Article
Published: 04 February 2014

Volume 24, pages 162–188, (2014)
Cite this article

Download PDF

International Journal of Artificial Intelligence in Education Aims and scope Submit manuscript

Analysing Student Programs in the PHP Intelligent Tutoring System

Download PDF

Dinesha Weragama¹ &
Jim Reye¹

6703 Accesses
18 Citations
Explore all metrics

Abstract

Programming is a subject that many beginning students find difficult. The PHP Intelligent Tutoring System (PHP ITS) has been designed with the aim of making it easier for novices to learn the PHP language in order to develop dynamic web pages. Programming requires practice. This makes it necessary to include practical exercises in any ITS that supports students learning to program. The PHP ITS works by providing exercises for students to solve and then providing feedback based on their solutions. The major challenge here is to be able to identify many semantically equivalent solutions to a single exercise. The PHP ITS achieves this by using theories of Artificial Intelligence (AI) including first-order predicate logic and classical and hierarchical planning to model the subject matter taught by the system. This paper highlights the approach taken by the PHP ITS to analyse students’ programs that include a number of program constructs that are used by beginners of web development. The PHP ITS was built using this model and evaluated in a unit at the Queensland University of Technology. The results showed that it was capable of correctly analysing over 96 % of the solutions to exercises supplied by students.

The PHP Intelligent Tutoring System

Algorithmic Debugging and Literate Programming to Generate Feedback in Intelligent Tutoring Systems

A Literature Review of Tutoring Systems: Pedagogical Approach and Tools

Introduction

Programming is a subject that needs to be studied by students in many disciplines. Students in a first-level programming class vary widely in their prior knowledge of relevant subject matter. The differing knowledge levels of these students need to be taken into account when designing a course to teach programming. Intelligent Tutoring Systems (ITSs) are suitable means of addressing the diverse prior knowledge of these individuals.

With the current popularity of the World Wide Web, more and more students are showing an interest in learning to develop web pages. Many resources that teach the basics of web development can be found online (“PHP Tutorial,” undated; “W3Schools online web tutorials,” undated). However, these do not customise their instruction based on the diverse prior knowledge of the students. No ITSs that teach web development have been published in the literature. The skills required to develop web pages are somewhat different from the skills required to develop stand-alone applications (described later). Such differences need to be taken into account when developing an ITS to teach web programming.

Programming in any form is a practical subject which many beginners find very difficult (Miliszewska and Tan 2007; Truong et al. 2003). Therefore, any course that teaches programming needs to include practical programming exercises. The students’ solutions to these exercises must be analysed and appropriate feedback should be provided in order to maximise learning. For this to be possible, a particular programming language needs to be selected. Of the many available web development languages, PHP is one of the most popular (“TIOBE Programming Community Index for December 2012,”). Therefore, PHP has been selected as the language taught by this ITS.

A major challenge that is encountered when designing an ITS that teaches programming is that a programming exercise rarely has a unique solution. This is exemplified by the simple PHP program segments shown in Table 1 (Weragama 2013). All three of these programs are solutions to an exercise asking the students to first display the text ‘Welcome!’ on a web page, then add 3 to whatever the value existing in the variable $y and store it into a variable $x and finally display the value of variable $x on the web page. There is no way of knowing which of these methods a student will use when answering the exercise. In order to teach effectively, the ITS should be capable of identifying all these solutions as semantically equivalent and therefore identifying them as correct.

Table 1 Different methods of writing the same program

Full size table

Researchers have taken different approaches to solving this problem as described later in this paper. However, these methods are difficult to use for analysing PHP programs for several reasons. Some of the methods are very specific to a certain programming language. Others can identify some alternative solutions but have difficulty as the number of possibilities increase. The programs in Table 1 show another problem encountered when dealing with PHP programming. It is possible to inter-mingle HTML and PHP code within each other and any analysis process needs to be capable of handling such code. The rest of this paper describes the design of a knowledge base that can handle alternative PHP programming code solutions to a single programming exercise and results of an evaluation of the PHP ITS, which implements this knowledge base.

The main emphasis of this research is on analysing alternative solutions to programming exercises correctly rather than on the pedagogical aspects of the ITS. However, the pedagogical aspects have been addressed (briefly) in order to show that the knowledge base designed in this manner can effectively be used in an ITS to teach PHP programming.

The rest of the paper provides details of how the PHP ITS is used to analyse alternative solutions to the same exercise. The next section describes the program analysis process used by the ITS, in detail. This is followed by a section that outlines the interfaces of the PHP ITS and a section that describes how the system was evaluated under practical use and presents the results of the analysis. The second-last section compares the PHP ITS to related work while the final section discusses the conclusions of this research and offers some suggestions for future work.

Program Analysis

The knowledge base of the PHP ITS has been designed to handle alternative solutions to a given programming exercise using concepts of Artificial Intelligence (AI) to model states and changes of state. It uses a set of predicates based on First Order Logic, and a set of rules and actions based on AI approaches, in order to analyse programs written by students.

Figure 1 shows a schematic representation of how the AI problem is formulated in this ITS. A state is represented by a set of facts and each fact is specified as an instance of a predicate. Each exercise specification contains an Overall Goal and an Initial State. The Overall Goal is the set of facts that need to be present in the final state of the program in order for it to be identified as correct. The Initial State contains a set of facts that represent the state of the exercise before the student’s program is analysed. This is an empty set for exercises where the student needs to write the entire code to create a web page. However, the PHP ITS contains some gap exercises, where the student is required to complete a part of the code while the other part of the code is supplied by the exercise. This means that a certain set of facts are already present before the student’s program segment is analysed. Such facts are given as the Initial State.

Once a student submits a solution to an exercise, it is converted into an Abstract Syntax Tree (AST) to make it easier to analyse (Weragama and Reye 2012a, b). This AST is then walked through node by node, creating facts that are relevant to the functionality of each node. The rules that are defined in the knowledge base are activated when certain facts are created in the knowledge base, to create additional facts. Similarly, the actions defined are activated when certain types of nodes are encountered, resulting in more facts. The final set of facts obtained in this manner, after analysing all the nodes of the AST, is known as the final state.

The final state is then compared against the Overall Goal to see if all the facts in the Overall Goal are present in the final state. If so, the student’s program is identified as correct. If some facts in the Overall Goal are not present in the final state, these facts are used to identify the errors in the student’s program and to provide appropriate feedback.

A common mistake made by many beginning students is to include unnecessary code in their programs. Such unnecessary statements are identified by maintaining a set of related statuses throughout the fact creation process. A new status is created each time a significant change in the set of facts occurs. When a new fact is created, a check is made to see whether it is dependent of any facts created during a previous status. If so, a link is created between the current status and this previous status. This status flow is analysed as a final step to identify any statuses that do not contribute to the Overall Goal of the program. If any such statuses are present, the program statements that resulted in the creation of these statuses are identified as unnecessary to achieve the Overall Goal and relevant feedback is provided.

It can be seen that this method of program analysis depends on the facts created during the AST walking process. This means that it should be possible to use the knowledge base created here to analyse programs written in other 3GL programming languages. The amount of additional work involved would depend on the number of differences between the AST produced by (whatever) the other language and PHP. Although this seems to be the case, the current work does not investigate this possibility.

Example of Program Analysis

This section discusses the analysis process of an example PHP program in detail. As discussed above, the knowledge base used for program analysis consists of a set of predicates, rules and actions. The exact predicates, rules and actions that come into effect are very much dependent on the type of program statements used so this analysis will consider only those relevant to the example. Consider a gap exercise where the student has been supplied code that creates a form with a textbox and a submit button. The student is required to write code to add 5 to the value entered in the textbox and display the result when the form is submitted. For simplicity, they are permitted to assume that a numeric value was entered in the textbox so it is not necessary to validate this data.

Initial State

Since this is a gap exercise, the Initial State will specify the facts that are created as a result of the form definition. The unique ids of the objects are assigned by the system during their creation and the names of the textbox and submit button are provided by the Initial State. Therefore, the facts shown in Fig. 2 are created in the system at this point.

The value stored in an HTML input element is accessed through PHP using a super-global array. In order to understand how this is handled in the knowledge base, it is first necessary to explain how PHP arrays in general are modeled. The relevant predicates and their relationships are shown in Fig. 3.

An array is actually a collection of objects, and each element in the array behaves in exactly the same method as a normal variable. Therefore, an array element is modeled as a sub-type of the Variable object called an ArrayVariable. Ordinary variables that are accessed using a name only are modeled as another sub-type of the Variable object called a SimpleVariable. Each Variable has a unique id given by the HasVariableId predicate and a value given by the HasValue predicate. Each SimpleVariable also has a name given by the HasName predicate.

An ArrayVariable is actually a relationship between an Array and a Key so it is a reification (or objectification) of a predicate that relates these two object types. The corresponding predicate is known as HasElement. When accessing an array element through PHP, it is possible to use any form of expression within the brackets after the array name. Therefore, the Key is related to an Expression through the HasKeyExpression predicate.

The array itself can be one of two sub-types: a UserDefinedArray or a PreDefinedArray, where the later is a standard part of the PHP language. Each UserDefinedArray has a name given by the HasArrayName predicate. Several types of PreDefinedArrays are found in PHP but for the purpose of this description, consider only FormArrays which are arrays that are created for accessing values entered in HTML forms. Two types of FormArrays are possible, based on the method of data access used by the form: $_GET and $_POST.

At this point it is necessary to explain a sub-type of the Expression object known as a SimpleExpression. These are actually VariableExprs and LiteralExprs. Each VariableExpr is connected to the corresponding VariableId through the HasVariable predicate. Each LiteralExpr is connected to another type of object known as a Literal which has a value given by the HasLitValue predicate.

The Initial State of the exercise specification also contains facts that are relevant to the FormArray created as a result of the form, in addition to the facts directly related to the elements of the form. These facts are shown in Fig. 4. It should be noted here that although the ArrayElements are created as variables, they are not assigned values in the Initial State of the program. The reason for this is that they will only contain values once the form is submitted and the supplied code does not specify any form submissions.

Overall Goal

The next part of the exercise specification is the Overall Goal. This specifies the required output of the program in terms of the predicates described above. In this exercise, the value entered in the textbox should be increased by 5 and displayed on the web page, only when the form is submitted. PHP uses a pre-defined function ‘isset’ to check whether a variable has been assigned a value. In order to check whether a form has been submitted, this function is called with the parameter set to a name of an input element within the form, usually the submit button. Therefore, it is first necessary to see how pre-defined function calls are modeled in order to model the Overall Goal.

A call to any function, whether pre-defined or user-defined, is modeled as a FunctionCall object with a unique id. A FunctionCall needs to call some Function and this relationship is given by the CallsFunction predicate. The Function has a name given by the HasFunctionName predicate. A function can have any number of parameters given by the HasParameter predicate that takes in three arguments, the FunctionId, a ParamPosition and a ParameterVariableId. Since the parameters in a function definition must always take the form of variables, they are modeled as a special sub-type of the Variable object known as a ParameterVariable.

A function call must contain the same number of parameters as the function that it calls. These are modeled using the HasParamExpression predicate. This takes in the FunctionCallId, the ParamPosition and an ExpressionId as parameters. An ExpressionId is used here since the actual parameters within a function call can be any form of expression. A FunctionCall can have a return value given by the HasReturnValue predicate. Very often, a call to a function can be replaced by an expression. Therefore, a FunctionCall is related to a sub-type of Expression known as FunctionExpr through the HasFunctionCall predicate.

Using these predicates, it is possible to now define the Overall Goal of the program. The Overall Goal needs to specify that the output should only occur when the form is submitted so it is conditional. Such conditional goals are specified using implications. The Overall Goal for this exercise is given in Fig. 5. In broad terms, the left hand side of the implication specifies that the ‘isset’ function should take in a parameter with the value ‘submit’ and should return the value True. In other words, the variable associated with the submit button should have a value or the form should have been submitted. The right hand side of the implication specifies that, if this is the case, the value of the variable associated with the textbox, or in other words the value entered in the textbox before the form was submitted, should be increased by 5 and the resultant value should be displayed. The OnPage predicate is a special predicate that is used to specify any data displayed on the web page. The first parameter specifies the displayed data and the second parameter specifies the position of the displayed data.

Walking the AST

The next step in the program analysis process is to convert a solution to the exercise submitted by a student into an AST. For the purpose of explanation consider a very common solution to this exercise as shown in Table 2.

Table 2 A solution to example exercise

Full size table

The first node encountered during the AST walking process of this program is a conditional node corresponding to the if statement. The condition of this selection statement is a FunctionExpr which in turn calls a function so the following facts corresponding to this function call are created. Note that the VariableExpr corresponding to the parameter of the function call accesses the ArrayVariable corresponding to the submit button that has already been created during the Initial State.

In this case, the FunctionCall accesses a pre-defined function. The knowledge base stored information regarding the number of parameters and functionality of pre-defined functions. This information is now accessed to create the rest of the predicates that correspond to the ‘isset’ function, resulting in the following facts.

Processing within the if part of the selection structure progresses only if the condition within the selection statement is True. Therefore, the value of the FunctionExpr can be taken to be true within the if part. Each Expression has a value given by the ValueOf predicate so the following fact is True within the if section.

At this point, a special rule within the knowledge base is activated. Although many rules within the knowledge base are quite general, this rule is specific to the ‘isset’ function since it is extensively used during forms programming using PHP. It specifies that, if the ‘isset’ function returns True, the value of the variable that is the parameter to the ‘isset’ function will also be set to True. This is because the ‘isset’ function returns True only if the variable has been assigned a value. The rule used in this case is shown in Fig. 6.

Since the system now contains facts that correspond to all the premises of the rule, it is activated to create a fact that corresponds to the conclusion so the following fact is created.

Considering the semantics of PHP form processing, all variables corresponding to input elements in the form are set when the submit button is set to True, or in other words, when the form is submitted. Another rule, shown in Fig. 7, is used to set symbolic values to these variables at this point since the actual input values are not known. For convenience, the rule sets each variable to the name of the corresponding InputElement. The resulting fact in this case is shown below. Note that, since the value of the FunctionExpr is True only within the if part of the selection statement, these facts are also only True within this scope.

Next, the AST nodes corresponding to the statements within the if part of the selection statement are analysed. The only node here corresponds to an echo statement, which activates an action. The corresponding Display action is shown in Fig. 8. It can be seen that this action takes in an ExpressionId as an argument so an expression is created for whatever needs to be displayed. In the case of the program in Table 2, the expression is an addition. This has been modeled as an AddExpr which is a sub-type of the CalculateExpression object, which in turn is a sub-type of the Expression object. Many CalculateExpressions contain two sub-expressions on either side so the object is again a reification of a predicate that relates the two sub-expressions.

In this case, the sub-expression on the left hand side of the AddExpr is a VariableExpr and the one on the right hand side is a LiteralExpr so the following facts are created.

It can be seen from Fig. 8 that, in order to find the pre-conditions of the action, it is necessary to find the value of the expression with id ExprId1. A set of rules have been defined in the knowledge base to find the value of many types of expressions. Figure 9 shows some of these rules which are useful for this explanation.

These rules are now activated due to facts that currently exist in the system, resulting in the following facts.

Note that Add(x,y,z) is a predicate which is true if the result of adding x and y is z. Such predicates are necessary since it is often necessary to deal with symbolic values during program analysis as described above.

Now, the pre-conditions of the Display action are satisfied. Note that rC is a variable which does not occur as a direct result of analysis of the student’s solution. It is a variable which holds a running counter showing the current position where any text is displayed on a web page. This is necessary in cases where the order of display of elements is important. Since the pre-conditions are now satisfied, facts relevant to the effect of the action are now created as below.

All the nodes of the AST have now been analysed, resulting in the final state of the program as shown in Fig. 10. Note that, in the interest of space, only facts relevant to comparing against the Overall Goal are presented here. The final stage of program analysis is to compare this final state against the Overall Goal given in Fig. 5. It can be seen that, all the facts in the Overall Goal are present in the final state when FUNCID1 = FuncId1, FUNCALLID1 = FuncCallId1, VAREXPRID1 = VarExprId1, VARID1 = VarId2, ARRID1 = ArrId1, KEYID1 = KeyId2, KEYEXPRID1 = KeyExprId2, LITID1 = LitId2, KEYID2 = KeyId1, VARID2 = VarId1, KEYEXPRID2 = KeyExprId2 and LITID2 = LitId1. Therefore, the student’s program is identified as correct.

Alternative Solutions to the Exercise

The above section discussed how the program in Table 2 is analysed by the PHP ITS. However, it is quite likely that a student entered another, equally correct solution. The program in Table 3 shows another example of a correct solution to the exercise. On inspection, it can be seen that the first difference between the programs occur within the if statement. Therefore, the analysis is the same as in the previous case up to this point.

Table 3 Another solution to example exercise

Full size table

In the program in Table 2, a Display action was immediately activated once within the if section. In the program in Table 3, this is replaced by an assignment statement, which is again modeled as an action as shown in Fig. 11. The arguments of this action are the name of the variable on the left hand side of the assignment statement, and the id of the expression on the right hand side. This expression can take any form but the pre-condition specifies that it should have a value. The value of any expression can be found using one or more of the rules used for this purpose. The effect in this case is dependent on whether the variable on the left hand side of the assignment statement already exists or not. If it does, the value of this variable is updated to the value of the expression. If the variable doesn’t exist, it is first created and the relevant predicates to set the name and value are then created (In PHP, variables are not declared. Each variable is created when you first assign a value to it).

In the case of the program in Table 3, the expression on the right hand side is a VariableExpr and it accesses the ArrayVariable corresponding to the textbox so the following fact is created.

Now, using the rules in Fig. 9, the ValueOf this expression is found as below.

The pre-conditions of the Assign action are now satisfied. In this case, a variable with the name on the left hand side, $a, does not exist so it is created, so the following facts come into existence.

Next, the node corresponding to the echo statement is analysed. In this case, the expression that is within the statement is somewhat different from the previous case so the following facts are created.

Again using the rules in Fig. 9, the following facts are created.

So it can be seen that the resulting final state is exactly the same as in the previous example. This means that, even though the form of the expressions used in the program in Table 3 are different from those used in the program in Table 2 , the program is still identified as correct. Other types of expression are handled in a similar manner using different rules in order to identify all semantically equivalent programs as correct.

Incorrect Solutions to the Exercise

To show the other side of the coin, we now describe how incorrect solutions to the exercise are handled by the knowledge base. Table 4 shows two incorrect solutions to this exercise. Program a displays the value entered in the textbox without incrementing it and Program b contains an unnecessary program statement where the value of the textbox is stored in a variable but the variable is never used.

Table 4 Incorrect solutions to exercise

Full size table

The analysis of Program a continues as in the case of the program in Table 2. The only difference occurs when the Display action is activated. Here, the expression created is just a VariableExpr referring to the ArrayVariable corresponding to the textbox. No calculations are performed so the ValueOf the expression is the value stored in the textbox. Therefore, the final state of the program is as shown in Fig. 12. On comparing this against the Overall Goal in Fig. 5, it can be seen that a component of the goal, the Add fact, is missing so that value in the OnPage fact is incorrect. This is used to identify the fact that the value displayed on the web page is incorrect and appropriate feedback is provided. Similarly, based on which components of the goal, or which sub-goals, are missing, specific feedback about the error is provided.

Identifying the error in Program b is handled somewhat differently. In this case, the Overall Goal is fully satisfied by the final state. However, there is a statement that does not contribute in any form to achieving this final state. As briefly described earlier, a series of statuses and their relationships are maintained during program analysis, in order to identify such unnecessary statements. If the Overall Goal is satisfied, these statuses are checked to see whether any exist that are not related to the status where the Overall Goal is satisfied. Any such statuses are identified as being created by unnecessary program statements.

The Initial State of any program creates a new status known as status 0. All facts created during the initial status are related to status 0. In this case, the creation of the ArrayVariables happens in this status so they are related to status 0. A new status is created each time a selection statement is encountered so a new status, status 1 is created for the if statement. The condition within this if statement refers to a variable created during status 0 so a link is maintained between status 0 and status 1.

Next, a new status, status 2 is created for the if part of the selection statement. Since this is a part of the main selection statement, it is linked to the relevant status, status 1. Any statuses resulting from program statements within the if part of the selection statement are linked to status 2. In this case, the first assignment statement creates a new status, status 3. Since it uses the value from a variable created in status 0, a link is created between these statuses. The echo statement within the if part creates a new status, status 4. Since this again accesses a variable created in status 0, these statuses are linked. The final status flow for Program b is shown in Fig. 13.

Status 4 is the status where the Overall Goal is satisfied. When considering the status flow, it can be seen that all statuses, except status 3 have links that terminate in status 4. Status 3 however, is at the end of a flow and does not link to status 4. This means that it does not contribute to the Overall Goal. Therefore, the statement that created this status, the assignment, is an unnecessary program statement. It is therefore identified as an error and appropriate feedback is provided.

In the case of the program in Table 2, the status flow is similar except for the fact that status 3 does not exist since there is no corresponding statement. Therefore, the status flow shows statuses that are all linked to the status where the Overall Goal is satisfied so no unnecessary program statements are present.

Types of Constructs Handled by the PHP ITS

The preceding parts of this paper describe how the PHP ITS handles different types of expressions, selection structures and pre-defined functions. The knowledge base is capable of handling several more types of constructs that are frequently encountered during novice PHP programming. This section summarizes how some of these constructs are handled.

Selection structures allow for many variations in writing programs. Conditions within selection structures allow for a multitude of semantically equivalent programs. For example, the condition $x > 10 can be written as $x > =11, !($x < =10), (10 < $x) and many other ways. It is necessary that the knowledge base be capable of recognizing all these variations. This is done using a set of rules that are used to convert between equivalent expressions. The number of rules is very small compared to the number of possible ways in which the expression can be written. The possibilities of writing equivalent selection structures are further increased by the fact that it is possible to write many independent if statements, if-else statements, nested if statements and switch statements that are semantically equivalent. Once such a program is converted to a set of facts as described above, it is possible to flatten out the set of facts by removing all levels of nesting. The Overall Goal in this case is also specified as a set of facts where all forms of nesting are removed. This makes it possible to identify the program as correct, no matter what levels of nesting are used.

In order to explain this further, consider an example where the student needs to write a program segment to display the grade based on a given mark. If the marks are greater than 80, the grade is an ‘A’. If the marks are between 60 and 79, the grade is a ‘B’ and if the marks are between 50 and 59 the grade is a ‘C’. In all other cases, the grade is a ‘F’. Figure 14 shows how the overall goal for this case is set up. Assume that the initial state is set up so that the VariableId of the variable holding marks is VarId1 and its initial value is val_m.

A correct program to achieve this objective can be written in a multitude of ways. Table 5 shows two correct solutions to the exercise, although they may not be ideal. The overall goal for this exercise is specified using a fully flattened state. In other words, each possible condition is enumerated separately, with the corresponding result for each such condition. In order to understand how Program a is analysed, consider its elseif ($marks > =60) condition. The ‘else’ in this case implies that the condition opposite to the condition of the corresponding ‘if’ (i.e. $marks > 80) is satisfied here. This means that, when analyzing this section of code, LessThan(val_m,80) becomes a component of the predicate to the premise of the implication, where val_m is taken to be the initial value of the mark. There is also an additional ‘if’ condition (i.e. $marks > =60). The predicate corresponding to this, GreaterThanOrEqual(val_m,60) is also a component of the premise since the grade is ‘B’ only obtained when this condition is satisfied too. Therefore, the complete premise of the implication is a combination of these two predicates, i.e. GreaterThanOrEqual(val_m,60) ∧ LessThan(val_m,80). Therefore, the program analysis also results in a flattened state and the corresponding sub-goals are seen to be satisfied. Therefore, both these programs are identified as correct by the system.

Table 5 Correct solutions to nesting exercise

Full size table

The above program considered the use of pre-defined functions. User defined functions are handled in a similar manner but special methods are necessary to analyze the functions themselves for correctness. In this case, hierarchical planning concepts in AI are utilized to handle the function definitions. The requirements of the function are given as a set of conditions of sub-plans which needs to be satisfied in order for the function to be identified as correct. Once the function is identified as correct, a special predicate is created and the Overall Goal checks for this predicate.

A similar method is utilized for analyzing loops. Again, conditions of sub-plans are used to check the correctness of the loop before checking for the correctness of the entire program (Weragama and Reye 2013). However, loops pose some additional problems. It is possible to classify loops based on their functionality (Reye et al. 2013). All types of loops cannot be handled by the present knowledge base. Definite loops, where the number of iterations are known before the actual execution of the loop and also loops that perform some action on each item in a collection independently can be handled. These account for over 50 % of loops encountered in real-world programming (Stavely 1993).

Another difficulty mentioned when handling PHP programs is the possibility of interleaving PHP and HTML code within each other. This problem has been handled by conducting the AST walking process in two steps instead of one. During the first step, all PHP statements are analysed. Some PHP sections could result in displaying text on the HTML web page but could depend on values of PHP variables. These statements are converted into the relevant HTML form. In the second step, all HTML statements, including those that result from the execution of PHP statements, are analysed.

Therefore, the knowledge base of the PHP ITS is capable of handling a large portion of the constructs encountered by beginners of PHP programming. It is difficult to say how much effort will be needed to expand the knowledge base to handle new constructs. Further research needs to be carried out in order to determine this.