Previous chapter To contents Next chapter

Chapter 5, Operators

To make it easier to write Pike, and to make the code somewhat shorter, some functions can be called with just one or two characters in the code. These functions are called operators and we have already seen how they work in plenty of examples. In this chapter I will describe in detail what they do. The operators are divided into categories depending on their function, but beware that some operators have meanings that go way beyond the scope of the category they are in.

5.1 Arithmetic operators

The arithmetic operators are the simplest ones, since they work just like you remember from math in school. The arithmetic operators are:

Function Syntax Identifier Returns
Addition a + b `+ the sum of a and b
Subtraction a - b `- b subtracted from a
Negation - a `- minus a
Multiplication a * b `* a multiplied by b
Division a / b `/ a divided by b
Modulo a % b `% the remainder of a division between a and b

The third column, "Identifier" is the name of the function that actually evaluates the operation. For instance, a + b can also be written as `+(a, b). I will show you how useful this can be at the end of this chapter.

When applied to integers or floats these operators do exactly what they are supposed to do. The only operator in the list not known from basic math is the modulo operator. The modulo operator returns the remainder from an integer division. It is the same as calculating a - floor(a / b) * b. floor rounds the value down to closest lower integer value. Note that the call to floor isn't needed when operating on integers, since dividing two integers will return the result as an integer and it is always rounded down. For instance, 8 / 3 would return 2.

If all arguments to the operator are integers, the result will also be an integer. If one is a float and the other is an integer, the result will be a float. If both arguments are float, the result will of course be a float.

However, there are more types in Pike than integers and floats. Here is the complete list of combinations of types you can use with these operators:

Operation Returned type Returned value
int + int int the sum of the two values
float + int
int + float
float + float
 float the sum of the two values
string + string
int + string
float + string
string + int
string + float
 string In this case, any int or float is first converted to a string. Then the two strings are concatenated and the resulting string is returned.
array + array array The two arrays are concatenated into a new array and that new array is returned.
mapping + mapping mapping A mapping with all the index-value pairs from both mappings is returned. If an index is present in both mappings the index-value pair from the right mapping will be used.
multiset + multiset multiset A multiset with all the indices from both multisets is returned.
int - int int The right value subtracted from the left.
float - int
int - float
float - float
 float The right value subtracted from the left.
string - string string A copy of the left string with all occurrences of the right string removed.
array - array array A copy of the left array with all elements present in the right array removed. Example: ({2,1,4,5,3,6,7}) - ({3,5,1}) will return ({2,4,6,7}).
mapping - mapping mapping A new mapping with all index-value pairs from the left mapping, except those indices that are also present in the right mapping.
multiset - multiset multiset A copy of the left multiset without any index present in the left multiset.
int int Same as 0 - int.
float float Same as 0 - float.
int * int int the product of the two values
float * int
int * float
float * float
 float the product of the two values
array(string) * string string All the strings in the array are concatenated with the string on the right in between each string. Example: ({"foo","bar"})*"-" will return "foo-bar".
array(array) * array array All the arrays in the left array are concatenated with the array on the right in between each array. Example: ({ ({"foo"}) ,({"bar"})})*({"-"}) will return ({ "foo","-","bar" }).
string * int string This operation will concatenate the string N times. Example: "foo"*3 will return "foofoofoo".
array * int string This operation will concatenate the array N times. Example: ({"foo"})*3 will return ({"foo","foo","foo"}).
int / int int The right integer divided by the left integer rounded towards minus infinity.
float / int
int / float
float / float
 float The right value divided by the left value.
string / string array(string) In symmetry with the multiplication operator, the division operator can split a string into pieces. The right string will be split at every occurrence of the right string and an array containing the results will be returned. Example: "foo-bar"/"-" will return ({"foo","bar"})
string / int array(string) This will split the string into pieces. The size of the pieces is given by the integer. Only complete pieces will be included in the result, the 'reminder' is discarded. Example: "foo-bar"/2 will return ({"fo","o-","ba"})
string / float array(string) This is similar to dividing a string with an integer, but it allows fraction-sized segments and the reminder will always be included. Example: "foo-bar"/2.5 will return ({"fo","o-b","ar"})
array / int array(array) This is similar to dividing a string with an integer, but splits an array. Example: ({1,2,3,4,5,6,7})/2 will return ({({1,2}),({3,4}),({5,6})})
array / float array(array) You should be able to predict what this does by now. :) Example: ({1,2,3,4,5,6,7,8})/2.5 will return ({({1,2}),({3,4,5}),({6,7}),({8})})
int % int int The remainder of a division. If a and b are integers, a%b is the same as a-(a/b)*b
float % float
int % float
float % int
 float The remainder of a division. If a and b are floats, a%b is the same as a-floor(a/b)*b
string % int string The remainder of a string division. Example: "foo-bar"%2 will return "r"
array % int string The remainder of an array division. Example: ({1,2,3,4,5,6,7})%2 will return ({7})

5.2 Comparison operators

The arithmetic operators would be hard to use for anything interesting without the ability to compare the results to each other. For this purpose there are six comparison operators:

Function Syntax Identifier Returns
Same a == b `== 1 if a is the same value as b, 0 otherwise
Not same a != b `!= 0 if a is the same value as b, 1 otherwise
Greater than a > b `>  1 if a is greater than b, 0 otherwise
Greater than or equal to a >= b `>= 1 if a is greater to or equal to b, 0 otherwise
Lesser than a < b `<  1 if a is lesser than b, 0 otherwise
Lesser than or equal to a <= b `<= 1 if a is lesser than or equal to b, 0 otherwise

The == and != operators can be used on any type. For two values to be same they must be the same type. Therefore 1 and 1.0 are not same. Also, for two values of pointer types to be the same the two values must be pointers to the same object. It is not enough that the two objects are of the same size and contain the same data.

The other operators in the table above can only be used with integers, floats and strings. If you compare an integer with a float, the int will be promoted to a float before the comparison. When comparing strings, lexical order is used and the values of the environment variables LC_CTYPE and LC_LANG are respected.

5.3 Logical operators

Logical operators are operators that operate with truth values. In Pike any value except zero is considered true. Logical operators are a very basic part of Pike. They can also decide which arguments to evaluate and which not to evaluate. Because of this the logical operators do not have any identifiers and can not be called as normal functions. There are four logical operators:

Function Syntax Returns
And a && b If a is false, a is returned and b is not evaluated. Otherwise, b is returned.
Or a || b If a is true, a is returned and b is not evaluated. Otherwise, b is returned.
Not ! a Returns 0 if a is true, 1 otherwise.
If-else a ? b : c If a is true, b is returned and c is not evaluated. Otherwise c is returned and b is not evaluated.

5.4 Bitwise/set operators

These operators are used to manipulate bits as members in sets. They can also manipulate arrays, multisets and mappings as sets.

Function Syntax Identifier Returns
Shift left a << b `<< Multiplies a by 2 b times.
Shift right a >> b `>> Divides a by 2 b times.
Inverse (not) ~ a `~ Returns -1-a.
Intersection (and) a & b `& All elements present in both a and b.
Union (or) a | b `| All elements present in a or b.
Symmetric difference (xor) a ^ b `^ All elements present in a or b, but not present in both.

The first three operators can only be used with integers and should be pretty obvious. The other three, intersection, union and symmetric difference, can be used with integers, arrays, multisets and mappings. When used with integers, these operators considers each bit in the integer a separate element. If you do not know about how bits in integers work I suggest you go look it up in some other programming book or just don't use these operators on integers.

When intersection, union or symmetric difference is used on an array each element in the array is considered by itself. So intersecting two arrays will result in an array with all elements that are present in both arrays. Example: ({7,6,4,3,2,1}) & ({1, 23, 5, 4, 7}) will return ({7,4,1}). The order of the elements in the returned array will always be taken from the left array. Elements in multisets are treated the same as elements in arrays. When doing a set operation on a mapping however, only the indices are considered. The values are just copied with the indices. If a particular index is present in both the right and left argument to a set operator, the one from the right side will be used. Example: ([1:2]) | ([1:3]) will return ([1:3]).

5.5 Indexing

The index and range operators are used to retrieve information from a complex data type.

Function Syntax Identifier Returns
Index a [ b ] `[] Returns the index b from a.
Lookup a ->identifier `-> Looks up the identifier. Same as a["identifier"].
Assign index a [ b ] = c `[]=; Sets the index b in a to c.
Assign index a ->identifier = c `->= Sets the index "identifier" in a to c.
Range a [ b .. c ] `[..] Returns a slice of a starting at the index b and ending at c.
Range a [ .. c ] `[..] Returns a slice of a starting at the beginning of a and ending at c.
Range a [ b .. ] `[..] Returns a slice of a from the index b to the end of a.

The index operator can be written in two different ways. It can be written as ob [ index ] or ob->identifier. However, the latter syntax is equal to ob [ "identifier" ]. You can only index strings, arrays, mapping, multisets and objects, and some of these can only be indexed on certain things as shown in this list:

Operation Returns
string[int] Returns the ascii value of the Nth character in the string.
array[int] Return the element in the array corresponding to the integer.
array[int]=mixed Sets the element in the array to the mixed value.
mapping[mixed]
mapping->identifier
 Returns the value associated with the index, 0 if it is not found.
mapping[mixed]=mixed
mapping->identifier=mixed
 Associate the second mixed value with the first mixed value.
multiset[mixed]
multiset->identifier
 Returns 1 if the index (the value between the brackets) is present in the multiset, 0 otherwise.
multiset[mixed]=mixed
multiset->identifier=mixed
 If the mixed value is true the index is added to the multiset. Otherwise the index is removed from the multiset.
object[string]
object->identifier
 Returns the value of the named identifier in the object.
object[string]=mixed
object->identifier=mixed
 Set the given identifier in the object to the mixed value. Only works if the identifier references a variable in the object.
program[string]
program->identifier
 Returns the value of the named constant identifier in the program.
string[int..int] Returns a piece of the string.
array[int..int] Returns a slice of the array.

When indexing an array or string it is sometimes convenient to access index from the end instead of from the beginning. This function can be performed by using a negative index. Thus arr[-i] is the same as arr[sizeof(arr)-i]. Note however that this behavior does not apply to the range operator. Instead the range operator clamps it's arguments to a suitable range. This means that a[b..c] will be treated as follows:

5.6 The assignment operators

There is really only one assignment operator, but it can be combined with lots of other operators to make the code shorter. An assignment looks like this:
variable = expression;
The variable can be a local variable, a global variable or an index in an array, object, multiset or mapping. This will of course set the value stored in variable to expression. Note that the above is also an expression which returns the value of the expression. This can be used in some interesting ways:
variable1 = variable2 = 1; // Assign 1 to both variables
variable1 =(variable2 = 1); // Same as above

// Write the value of the expression, if any
if(variable = expression)
    write(variable);
Using assignments like this can however be confusing to novice users, or users who come from a Pascal or Basic background. Especially the if statement can be mistaken for if(variable == expression) which would mean something completely different. As I mentioned earlier, the assignment operator can be combined with another operator to form operators that modify the contents of a variable instead of just assigning it. Here is a list of all the combinations:

Syntax Same as Function
variable += expression  variable = variable + expression Add expression to variable
variable -= expression  variable = variable - expression Subtract expression from variable
variable *= expression  variable = variable * expression Multiply variable with expression
variable /= expression  variable = variable / expression Divide variable by expression
variable %= expression  variable = variable % expression Modulo variable by expression
variable <<= expression  variable = variable << expression Shift variable expression bits left
variable >>= expression  variable = variable >> expression Shift variable expression bits right
variable |= expression  variable = variable | expression Or variable with expression
variable &= expression  variable = variable & expression And variable with expression
variable ^= expression  variable = variable ^ expression Xor variable with expression

In all of the above expressions variable can actually be any of type of assignable values. Assignable values are also known as lvalues and here is a list of lvalues:

Lvalue type Syntax Valid assignment type
a local or global variable   identifier  same as variable
an element in an array   array [ int ] any type
elements in elements in an array   array [ string ] any type
This is like map(arr, `[]=,string_indexing_element, assignment_element)
an element in an string   string [ int ] integer
an element in a mapping   mapping[mixed] or mapping->identifier  any type
an element in a multiset   multiset[mixed] or multiset->identifier  true / false
a variable in an object   object[string] or object->identifier  same type as named variable
a list of lvalues   [ lvalue, lvalue ]   an array, first value in the array will be assigned to the first lvalue in the list, second value in the array to the second value in the list etc.

5.7 The rest of the operators

Now there are only a couple of operators left. I have grouped them together in this section, not because they are not important, but because they do not fit in any particular categories.

Function Syntax Identifier Returns
       
splice @ a none Sends each element in the array a as an individual argument to a function call.
Increment ++ a none Increments a and returns the new value of a.
Decrement -- a none Decrements a and returns the new value of a.
Post increment a ++ none Increments a and returns the old value of a.
Post decrement a -- none Decrements a and returns the old value of a.
casting (type) a none Tries to convert a into a value of the specified type.
Null a, b none Evaluates a and b, then returns b.

The most important of these operators is the calling operator. It is used to call functions. The operator itself is just a set of parenthesis placed after the expression that returns the function. Any arguments to the function should be placed between the parenthesis, separated by commas. We have already seen many examples of this operator, although you might not have realized it was an operator at the time. The function call operator can do more than just calling functions though; if the 'function' is in fact an array, the operator will loop over the array and call each element in the array and returns an array with the results. If on the other hand, the 'function' is a program, the operator will clone an object from the program and call create() in the new object with the arguments given. In fact, the function clone is implemented like this:

object clone(mixed p, mixed ... args) { ( (program)p )(@args); }

On the subject of function calls, the splice operator should also be mentioned. The splice operator is an at sign in front of an expression. The expression should always be an array. The splice operator sends each of the elements in the array as a separate argument to the function call. The splice operator can only be used in an argument list for a function call.

Then there are the increment and decrement operators. The increment and decrement operators are somewhat limited: they can only be used on integers. They provide a short and fast way to add or subtract one to an integer. If the operator is written before the variable (++a) the returned value will be what the variable is after the operator has added/subtracted one to it. If the operator is after the variable (a++) it will instead return the value of the variable before it was incremented/decremented.

Casting is used to convert one type to another, not all casts are possible. Here is a list of all casts that actually _do_ anything:

casting from to operation
int string Convert the int to ASCII representation
float string Convert the float to ASCII representation
string int Convert decimal, octal or hexadecimal number to an int. Note that this will only work with decimal numbers in future versions.
string float Convert ASCII number to a float.
string program String is a filename, compile the file and return the program. Results are cached.
string object This first casts the string to a program, (see above) and then clones the result. Results are cached.
object type This calls the function 'cast' with a string containing the type as an argument.
string array Same as doing values(string)
array(int) string This does the inverse of the operation above. Ie. it constructs a string from an array of integers.
array array(type) This recursively casts all values in the array to type.
mapping array Same as Array.transpose(({indices(mapping),values(mapping)). Example: (array)([1:2,3:4]) will return ({ ({1,2}), ({3,4}) })
multiset array Same as doing indices(multiset).
int float Returns a float with the same value as the integer.
float int Returns the integer closest to the float.
function object Same as function_object(function).

You can also use the cast operator to tell the compiler things. If a is a variable of type mixed containing an int, then the expression (int)a can be used instead of a and that will tell the compiler that the type of that expression is int.

Last, and in some respect least, is the comma operator. It doesn't do much. In fact, it simply evaluates the two arguments and then returns the right hand one. This operator is mostly useful to produce smaller code, or to make defines that can be used in expressions.

5.8 Operator precedence

When evaluating an expression, you can always use parenthesis to tell the compiler in which order to evaluate things. Normally, the compiler will evaluate things from left to right, but it will evaluate operators with higher priority before those with lower. The following table shows the relative priority of all the operators in descending order:

(a) a() a[b] a->b a[b..c] ({}) ([]) (<>)
!a ~a (type)a ++a --a
a++ a--
a*b a/b a%b
a+b a-b
a>>b a<<b
a>b a>=b a<b a<=b
a==b a!=b
a&b
a^b
a|b
&&
||
a?b:c
=
@a
,

Examples:

The expression is evaluated in this order:
1+2*2   1+(2*2)
1+2*2*4   1+((2*2)*4)
(1+2)*2*4   ((1+2)*2)*4
1+4,c=2|3+5   (1+4),(c=((2|3)+5))
1+5 & 4 == 3   (1+(5 & 4)) == 3
c=1,99   (c=1),99
!a++ + ~--a()  (!(a++)) + (~((--a)()))

5.9 Operator functions

As mentioned earlier a + b can just as well be written as `+(a, b). Together with the function map which calls a function for every index in an array and the splice operator this can be used to create some very very fast and compact code. Let's look at some examples:
map(arr, `-)
This will return an array with each element negated.
map(text/"\n",`/," ")
This will divide a text into lines, each line will then be mapped through `/ and divided into an array of words.
`+(0, @arr)
This will add all the integers in the array arr together.
int abs(int a) { return ( a>0 ? `+ : `-)(a); }
This is a rather absurd but working function which will return the absolute value of a.

Previous chapter To contents Next chapter