Programming in D - Integers and Arithmetic Operations

Integers and Arithmetic Operations

We have seen that the if and while statements allow programs to make decisions by using the bool type in the form of logical expressions. In this chapter, we will see arithmetic operations on the integer types of D. These features will allow us to write much more useful programs.

Although arithmetic operations are a part of our daily lives and are actually simple, there are very important concepts that a programmer must be aware of in order to produce correct programs: the bit length of a type, overflow (wrap), and truncation.

Before going further, I would like to summarize the arithmetic operations in the following table as a reference:

Operator	Effect	Sample
++	increments by one	++variable
--	decrements by one	--variable
+	the result of adding two values	first + second
-	the result of subtracting 'second' from 'first'	first - second
*	the result of multiplying two values	first * second
/	the result of dividing 'first' by 'second'	first / second
%	the remainder of dividing 'first' by 'second'	first % second
^^	the result of raising 'first' to the power of 'second' (multiplying 'first' by itself 'second' times)	first ^^ second

Most of those operators have counterparts that have an = sign attached: +=, -=, *=, /=, %=, and ^^=. The difference with these operators is that they assign the result to the left-hand side:

    variable += 10;

That expression adds the value of variable and 10 and assigns the result to variable. In the end, the value of variable would be increased by 10. It is the equivalent of the following expression:

    variable = variable + 10;

I would like also to summarize two important concepts here before elaborating on them below.

Overflow: Not all values can fit in a variable of a given type. If the value is too big for the variable we say that the variable overflows. For example, a variable of type ubyte can have values only in the range of 0 to 255; so when assigned 260, the variable overflows, wraps around, and its value becomes 4. (Note: Unlike some other languages like C and C++, overflow for signed types is legal in D. It has the same wrap around behavior of unsigned types.)

Similarly, a variable cannot have a value that is less than the minimum value of its type.

Truncation: Integer types cannot have values with fractional parts. For example, the value of the int expression 3/2 is 1, not 1.5.

We encounter arithmetic operations daily without many surprises: if a bagel is $1, two bagels are $2; if four sandwiches are $15, one sandwich is $3.75, etc.

Unfortunately, things are not as simple with arithmetic operations in computers. If we don't understand how values are stored in a computer, we may be surprised to see that a company's debt is reduced to $1.7 billion when it borrows $3 billion more on top of its existing debt of $3 billion! Or when a box of ice cream serves 4 kids, an arithmetic operation may claim that 2 boxes would be sufficient for 11 kids!

Programmers must understand how integers are stored in computers.

Integer types

Integer types are the types that can have only whole values like -2, 0, 10, etc. These types cannot have fractional parts, as in 2.5. All of the integer types that we saw in the Fundamental Types chapter are the following:

Type	Number of Bits	Initial Value
byte	8	0
ubyte	8	0
short	16	0
ushort	16	0
int	32	0
uint	32	0
long	64	0L
ulong	64	0LU

The u at the beginning of the type names stands for "unsigned" and indicates that such types cannot have values less than zero.

Although they are equal to 0; 0L and 0LU are manifest constants typed as long and ulong, respectively.

Number of bits of a type

In today's computer systems, the smallest unit of information is called a bit. At the physical level, a bit is represented by electrical signals around certain points in the circuitry of a computer. A bit can be in one of two states that correspond to different voltages in the area that defines that particular bit. These two states are arbitrarily defined to have the values 0 and 1. As a result, a bit can have one of these two values.

As there aren't many concepts that can be represented by just two states, a bit is not a very useful type. It can only be useful for concepts with two states like heads or tails or whether a light switch is on or off.

If we consider two bits together, the total amount of information that can be represented multiplies. Based on each bit having a value of 0 or 1 individually, there are a total of 4 possible states. Assuming that the left and right digits represent the first and second bit respectively, these states are 00, 01, 10, and 11. Let's add one more bit to see this effect better; three bits can be in 8 different states: 000, 001, 010, 011, 100, 101, 110, 111. As can be seen, each added bit doubles the total number of states that can be represented.

The values to which these eight states correspond are defined by conventions. The following table shows these values for the signed and unsigned representations of 3 bits:

Bit State	Unsigned Value	Signed Value
000	0	0
001	1	1
010	2	2
011	3	3
100	4	-4
101	5	-3
110	6	-2
111	7	-1

We can construct the following table by adding more bits:

Bits	Number of Distinct Values	D Type	Minimum Value	Maximum Value
1	2
2	4
3	8
4	16
5	32
6	64
7	128
8	256	byte ubyte	-128 0	127 255
...	...
16	65536	short ushort	-32768 0	32767 65535
...	...
32	4294967296	int uint	-2147483648 0	2147483647 4294967295
...	...
64	18446744073709551616	long ulong	-9223372036854775808 0	9223372036854775807 18446744073709551615
...	...

I skipped many rows in the table and indicated the signed and unsigned versions of the D types that have the same number of bits on the same row (e.g. int and uint are both on the 32-bit row).

Choosing a type

D has no 3-bit type. But such a hypothetical type could only have 8 distinct values. It could only represent concepts such as the value of a die, or the week's day number.

On the other hand, although uint is a very large type, it cannot represent the concept of an ID number for each living person, as its maximum value is less than the world population of 7 billion. long and ulong would be more than enough to represent many concepts.

As a general rule, as long as there is no specific reason not to, you can use int for integer values.

Overflow

The fact that types can only hold values within a limited range may cause unexpected results. For example, although adding two uint variables with values of 3 billion each should produce 6 billion, because that sum is greater than the maximum value that a uint variable can hold (about 4 billion), this sum overflows. Without any warning, only the difference of 6 and 4 billion gets stored. (A little more accurately, 6 minus 4.3 billion.)

Truncation

Since integers cannot have values with fractional parts, they lose the part after the decimal point. For example, assuming that a box of ice cream serves 4 kids, although 11 kids would actually need 2.75 boxes, the fractional part of that value cannot be stored in an integer type, so the value becomes 2.

I will show limited techniques to help reduce the risk of overflow and truncation later in the chapter.

`.min` and `.max`

I will take advantage of the .min and .max properties below, which we have seen in the Fundamental Types chapter. These properties provide the minimum and maximum values that an integer type can have.

Increment: `++`

This operator is used with a single variable (more generally, with a single expression) and is written before the name of that variable. It increments the value of that variable by 1:

import std.stdio;

void main() {
    int number = 10;
    ++number;
    writeln("New value: ", number);
}

New value: 11

The increment operator is the equivalent of using the add-and-assign operator with the value of 1:

    number += 1;      // same as ++number

If the result of the increment operation is greater than the maximum value of that type, the result overflows and becomes the minimum value. We can see this effect by incrementing a variable that initially has the value int.max:

import std.stdio;

void main() {
    writeln("minimum int value   : ", int.min);
    writeln("maximum int value   : ", int.max);

    int number = int.max;
    writeln("before the increment: ", number);
    ++number;
    writeln("after the increment : ", number);
}

The value becomes int.min after the increment:

minimum int value   : -2147483648
maximum int value   : 2147483647
before the increment: 2147483647
after the increment : -2147483648

This is a very important observation because the value changes from the maximum to the minimum as a result of incrementing and without any warning! This effect is called overflow. We will see similar effects with other operations.

Decrement: `--`

This operator is similar to the increment operator; the difference is that the value is decreased by 1:

    --number;   // the value decreases by 1

The decrement operation is the equivalent of using the subtract-and-assign operator with the value of 1:

    number -= 1;      // same as --number

Similar to the ++ operator, if the value is the minimum value to begin with, it becomes the maximum value. This effect is called overflow as well.

Addition: +

This operator is used with two expressions and adds their values:

import std.stdio;

void main() {
    int number_1 = 12;
    int number_2 = 100;

    writeln("Result: ", number_1 + number_2);
    writeln("With a constant expression: ", 1000 + number_2);
}

Result: 112
With a constant expression: 1100

If the sum of the two expressions is greater than the maximum value of that type, it overflows and becomes a value that is less than both of the expressions:

import std.stdio;

void main() {
    // 3 billion each
    uint number_1 = 3000000000;
    uint number_2 = 3000000000;

    writeln("maximum value of uint: ", uint.max);
    writeln("             number_1: ", number_1);
    writeln("             number_2: ", number_2);
    writeln("                  sum: ", number_1 + number_2);
    writeln("OVERFLOW! The result is not 6 billion!");
}

maximum value of uint: 4294967295
             number_1: 3000000000
             number_2: 3000000000
                  sum: 1705032704
OVERFLOW! The result is not 6 billion!

Subtraction: `-`

This operator is used with two expressions and gives the difference between the first and the second:

import std.stdio;

void main() {
    int number_1 = 10;
    int number_2 = 20;

    writeln(number_1 - number_2);
    writeln(number_2 - number_1);
}

-10
10

It is again surprising if the actual result is less than zero and is stored in an unsigned type. Let's rewrite the program using the uint type:

import std.stdio;

void main() {
    uint number_1 = 10;
    uint number_2 = 20;

    writeln("PROBLEM! uint cannot have negative values:");
    writeln(number_1 - number_2);
    writeln(number_2 - number_1);
}

PROBLEM! uint cannot have negative values:
4294967286
10

It is a good guideline to use signed types to represent concepts that may ever be subtracted. As long as there is no specific reason not to, you can choose int.

Multiplication: `*`

This operator multiplies the values of two expressions; the result is again subject to overflow:

import std.stdio;

void main() {
    uint number_1 = 6;
    uint number_2 = 7;

    writeln(number_1 * number_2);
}

Division: `/`

This operator divides the first expression by the second expression. Since integer types cannot have fractional values, the fractional part of the value is discarded. This effect is called truncation. As a result, the following program prints 3, not 3.5:

import std.stdio;

void main() {
    writeln(7 / 2);
}

For calculations where fractional parts matter, floating point types must be used instead of integers. We will see floating point types in the next chapter.

Remainder (modulus): %

This operator divides the first expression by the second expression and produces the remainder of the division:

import std.stdio;

void main() {
    writeln(10 % 6);
}

A common application of this operator is to determine whether a value is odd or even. Since the remainder of dividing an even number by 2 is always 0, comparing the result against 0 is sufficient to make that distinction:

    if ((number % 2) == 0) {
        writeln("even number");

    } else {
        writeln("odd number");
    }

Power: ^^

This operator raises the first expression to the power of the second expression. For example, raising 3 to the power of 4 is multiplying 3 by itself 4 times:

import std.stdio;

void main() {
    writeln(3 ^^ 4);
}

Arithmetic operations with assignment

All of the operators that take two expressions have assignment counterparts. These operators assign the result back to the expression that is on the left-hand side:

import std.stdio;

void main() {
    int number = 10;

    number += 20;  // same as number = number + 20; now 30
    number -= 5;   // same as number = number - 5;  now 25
    number *= 2;   // same as number = number * 2;  now 50
    number /= 3;   // same as number = number / 3;  now 16
    number %= 7;   // same as number = number % 7;  now  2
    number ^^= 6;  // same as number = number ^^ 6; now 64

    writeln(number);
}

Negation: `-`

This operator converts the value of the expression from negative to positive or positive to negative:

import std.stdio;

void main() {
    int number_1 = 1;
    int number_2 = -2;

    writeln(-number_1);
    writeln(-number_2);
}

-1
2

The type of the result of this operation is the same as the type of the expression. Since unsigned types cannot have negative values, the result of using this operator with unsigned types can be surprising:

    uint number = 1;
    writeln("negation: ", -number);

The type of -number is uint as well, which cannot have negative values:

negation: 4294967295

Plus sign: `+`

This operator has no effect and exists only for symmetry with the negation operator. Positive values stay positive and negative values stay negative:

import std.stdio;

void main() {
    int number_1 = 1;
    int number_2 = -2;

    writeln(+number_1);
    writeln(+number_2);
}

1
-2

Post-increment: `++`

Note: Unless there is a strong reason not to, always use the regular increment operator (which is sometimes called the pre-increment operator).

Contrary to the regular increment operator, it is written after the expression and still increments the value of the expression by 1. The difference is that the post-increment operation produces the old value of the expression. To see this difference, let's compare it with the regular increment operator:

import std.stdio;

void main() {
    int incremented_regularly = 1;
    writeln(++incremented_regularly);      // prints 2
    writeln(incremented_regularly);        // prints 2

    int post_incremented = 1;

    // Gets incremented, but its old value is used:
    writeln(post_incremented++);           // prints 1
    writeln(post_incremented);             // prints 2
}

The writeln(post_incremented++); statement above is the equivalent of the following code:

    int old_value = post_incremented;
    ++post_incremented;
    writeln(old_value);                    // prints 1

Post-decrement: `--`

Note: Unless there is a strong reason not to, always use the regular decrement operator (which is sometimes called the pre-decrement operator).

This operator behaves the same way as the post-increment operator except that it decrements.

Operator precedence

The operators we've discussed above have all been used in operations on their own with only one or two expressions. However, similar to logical expressions, it is common to combine these operators to form more complex arithmetic expressions:

    int value = 77;
    int result = (((value + 8) * 3) / (value - 1)) % 5;

As with logical operators, arithmetic operators also obey operator precedence rules. For example, the * operator has precedence over the + operator. For that reason, when parentheses are not used (e.g. in the value + 8 * 3 expression), the * operator is evaluated before the + operator. As a result, that expression becomes the equivalent of value + 24, which is quite different from (value + 8) * 3.

Using parentheses is useful both for ensuring correct results and for communicating the intent of the code to programmers who may work on it in the future.

The operator precedence table will be presented later in the book.

Detecting overflow

Although it uses functions and ref parameters, which we have not covered yet, I would like to mention here that the core.checkedint module contains arithmetic functions that detect overflow. Instead of operators like + and -, this module uses functions: adds and addu for signed and unsigned addition, muls and mulu for signed and unsigned multiplication, subs and subu for signed and unsigned subtraction, and negs for negation.

For example, assuming that a and b are two int variables, the following code would detect whether adding them has caused an overflow:

import core.checkedint;

void main() {
    // Let's cause overflow for test purposes
    int a = int.max - 1;
    int b = 2;

    // This variable will become 'true' if the addition
    // operation inside the 'adds' function overflows:
    bool hasOverflowed = false;
    int result = adds(a, b, hasOverflowed);

    if (hasOverflowed) {
        // We must not use 'result' because it has overflowed
        // ...

    } else {
        // We can use 'result'
        // ...
    }
}

There is also the std.experimental.checkedint module that defines the Checked template but both its usage and its implementation are too advanced at this point in the book.

Preventing overflow

If the result of an operation cannot fit in the type of the result, then there is nothing that can be done. Sometimes, although the ultimate result would fit in a certain type, the intermediate calculations may overflow and cause incorrect results.

As an example, let's assume that we need to plant an apple tree per 1000 square meters of an area that is 40 by 60 kilometers. How many trees are needed?

When we solve this problem on paper, we see that the result is 40000 times 60000 divided by 1000, being equal to 2.4 million trees. Let's write a program that executes this calculation:

import std.stdio;

void main() {
    int width  = 40000;
    int length = 60000;
    int areaPerTree = 1000;

    int treesNeeded = width * length / areaPerTree;

    writeln("Number of trees needed: ", treesNeeded);
}

Number of trees needed: -1894967

Not to mention it is not even close, the result is also less than zero! In this case, the intermediate calculation width * length overflows and the subsequent calculation of / areaPerTree produces an incorrect result.

One way of avoiding the overflow in this example is to change the order of operations:

    int treesNeeded = width / areaPerTree * length ;

The result would now be correct:

Number of trees needed: 2400000

The reason this method works is the fact that all of the steps of the calculation now fit the int type.

Please note that this is not a complete solution because this time the intermediate value is prone to truncation, which may affect the result significantly in certain other calculations. Another solution might be to use a floating point type instead of an integer type: float, double, or real.

Preventing truncation

Changing the order of operations may be a solution to truncation as well. An interesting example of truncation can be seen by dividing and multiplying a value with the same number. We would expect the result of 10/9*9 to be 10, but it comes out as 9:

import std.stdio;

void main() {
    writeln(10 / 9 * 9);
}

The result is correct when truncation is avoided by changing the order of operations:

    writeln(10 * 9 / 9);

This too is not a complete solution: This time the intermediate calculation could be prone to overflow. Using a floating point type may be another solution to truncation in certain calculations.

Exercises

Write a program that takes two integers from the user, prints the integer quotient resulting from the division of the first by the second, and also prints the remainder. For example, when 7 and 3 are entered, have the program print the following equation:
```
7 = 3 * 2 + 1
```
Modify the program to print a shorter output when the remainder is 0. For example, when 10 and 5 are entered, it should not print "10 = 5 * 2 + 0" but just the following:
```
10 = 5 * 2
```
Write a simple calculator that supports the four basic arithmetic operations. Have the program let the operation to be selected from a menu and apply that operation to the two values that are entered. You can ignore overflow and truncation in this program.
Write a program that prints the values from 1 to 10, each on a separate line, with the exception of value 7. Do not use repeated lines as in the following code:
```
import std.stdio;

void main() {
    // Do not do this!
    writeln(1);
    writeln(2);
    writeln(3);
    writeln(4);
    writeln(5);
    writeln(6);
    writeln(8);
    writeln(9);
    writeln(10);
}
```
Instead, imagine a variable whose value is incremented in a loop. You may need to take advantage of the is not equal to operator != here.

... the solutions

[ ↢ Prev ] [ Next ↣ ]