C vs Pascal

Recently, due to a job change, I became re-immersed in Pascal after a long absence. Prior to this I'd spent a long time in C, though of the two languages I'd actually learned Pascal first. (But after FORTRAN, which was the norm at the time in school. In fact, I didn't learn C until after college, well after I'd learned BASIC and FORTH. Prior to college I'd first learned assembly language; FORTRAN, in college, was second.) In some ways Pascal (learned third) was like coming home. But, in probably more ways, it was more like "What is this shit?" I do prefer C, and it's not imprinting, the 'baby duck syndrome'.

Oh, the Pascal I'm using now is a practical Pascal, not the original teaching language I first learned, which is deficient in many ways.

While there is a very fine wikipedia page comparing these two languages, this is my personal opinion of these two third-generation languages. I like third-generation languages. Because of my interest in performance and state, necessary for high scale and fault tolerant computing, their general lack of higher-level abstraction means that you can pretty much always tell what a program is doing, and how long and what resources it is going to need while doing it; the general flow of control is not hidden. (To me, high-abstraction languages are a lot like handing tasks off to graduate assistants. I may be thinking that he'll get right on it, whereas he might think that the waves are bitchin' gnarly right now, and he'll do my task maybe tomorrow. Sometime. Sure, to avoid this I can be very specific in my task assignments and specifications, but often it's easier to just do it myself than to fully specify all the necessary parameters. The same, I find, can be true of computer languages.)

I'm going to ignore both languages' Object modeling. (And thus C++ and its ilk.) Just the straight procedural languages, which is where I have always been, and in fact still am, working. (Use of object models can conflict with both high-scale and fault tolerance, making them more difficult or even impossible.)

I'm going to ignore most purely syntactic differences, such as ":=" and "!=" vs "=" and "<>", whether the pointer indication character comes first or last, or whether types or names come first in declarations, union versus variant records, and Pascal's terminating period. Po-tay-toe, po-tah-toe. I'm also going to ignore attributes of specific compiler/IDE products, such as convenience, speed, or quality of generated code. These can all change, and often are extremely important, but have nothing to do with the qualities of the language itself.

NB: Both C and Pascal have learned from each other over the years, and some of what once might have been clear advantages of one language over the other are no more, in any significant way. (C gained const and enum, Pascal gained continue, break, and case's else, etc.) These languages are now more alike than different.

Advantage: C

Pre-processor provides a nice amount of system-configuration meta-processing. It's built right in to the language, you don't have to deploy system-specific meta-tools, like Makefiles and macro processors. (Those are much more capable, but they're also usually a lot more work. They also tend to be highly system-specific, so portability can be a real problem. C just managed to strike a nice balance of ease of use and sufficient power, built right in. I believe that the C preprocessor was one of the big reasons for the success of C, overcoming Pascal's inital lead in the industry. In response, practical Pascals later adopted some of this themselves.)
The #include directive allows for one to construct a semantic whole out of fragments placed in multiple files, controlled by #if constructs. The #define directive, besides being used for type definitions and primordial enumerations, can be 'abused' for textual aliasing. It can even be used for system configuration duties, by multiply #include-ing configuration files written in a configuration meta-language of sorts, using #define to determine what flavor of configuration is being pulled in at that moment. (This is commonly used when configuring RTOSs.)
And, of course, the pre-processor and the easily-abused ternary and comma operators make it the undisputed champion of obfuscated language contests between the two languages. (Maybe that's not a plus!)
Code generation. Pascal was designed so that a single-pass compiler could do the job. This can result in a very fast compiler, which is nice, and so Pascal typically focused on generating code fast, not on generating fast code (which requires extra compile-time processing). C, having always been a multi-pass compiler, felt more free to spend extra time on code optimization. For products that needed to be fast, C was a better choice. For products that needed to be developed fast, Pascal was a better choice.
The ternary and comma operators. Rarely needed, but when used properly they enhance the understandability of code, without introducing performance or storage penalties, or requiring huge ladders of refactoring into more conventional code or the introduction of a lot of intermediate variables, which I think is ultimately far more obfuscatory than using some compact semi-cryptic, but learnable, notation.
Similarly, case fall-throughs, cunningly exploited by Duff's Device. Properly used these result in less code duplication than the alternative. (I'm a big fan of the Don't Repeat Yourself programming paradigm, at least in principle. Note that by using goto you can get the same effect in Pascal, with a lot more syntactic noise.)
Explicit subroutine call semantics of "f()" (the parentheses) make it very clear that flow of control is going elsewhere, whether or not arguments are necessary. (In C "x = y;" is always a 'cheap' operation; in Pascal "x := y" could be cheap, or could be a hideously expensive function call with deep side-effects. There's no way to tell just by looking, you have to do deeper analysis, at every level due to nested functions, in order to know the true cost.)
Pascal's "result := X" uses a magic variable name, which is less obvious than C's explicit keyword-based "return X;". If you use Pascal's older "f := X" form where you assign results to the function's own name, you have to be aware of exactly what function you're inside of in order to recognize what is going on. Also, recursion can be a bit tricky if the function doesn't have any arguments. This is one place where the presence (or absence) of parentheses in Pascal is crucial. "f := f()" is how you call for recursion. I find this all quite messy.
Pascal's pesky 'no, not here, there' semicolon. A fixed terminator (C) is a lot less annoying to deal with than a context-sensitive separator. It was particularly irritating in a non-interactive punched-card environment, where I first learned an early Standard Pascal that didn't have the more relaxed semicolon rules. (Originally, if a semicolon wasn't required it was prohibited, i.e. null statements were prohibited, but now it's tolerated in many places where it's not necessary.) You had to stand in line for the punch; you had to stand in line to submit your job; you had to wait around for your output... only to find out that your trivial addition of an ELSE clause to a conditional meant that you had forgotten to go back and re-type the preceding card in order to remove the now-prohibited semicolon.
Start Over. Really, quite infuriating.
```
if X then
   Y;
else  {plus these two}
   Z; {new cards/lines}
```
Will not compile. It must be changed to:
```
if X then
   Y  {<== Note absence of formerly-required semicolon!}
else
   Z;
```
Feh!
Also note that when using conditional compilation (which Pascal now supports) that pesky semicolon can be a real problem. Consider:
```
if X then
   Y;
{$IFDEF ANAL}
else
   Z;
{$ENDIF}
```
Would not compile properly if you intended to include the optional error checking. (With C this structure would compile properly.) Instead you'd need something a bit obfuscatory like:
```
if X then
   Y  {<== Not here,}
{$IFDEF ANAL}
else
   Z  {<== and not here,}
{$ENDIF}
;  {<== but here.  WTF, Pascal!}
```
```
if X then begin
   Y;
end else begin
   Z;
end;
```
is just too damned wordy. (Oh look: there's an extra semicolon, and don't you forget it. Yay!) C's braces are much less... abrasive.
Assignment itself has a value. In spite of gcc's bias against it, there are many places where "if (x = y()) z();" is just what you wanted. This compact assign-and-test notation is especially attractive for programmers (like me) who learned assembly language first, where partial results ("x", above) are easily saved for later re-use, and where condition-code-based conditional execution is the norm. In C, "x = y = z;" is a clever side-effect of regular assignment, and can be used anywhere. (It's also another example of "Don't Repeat Yourself", on a small scale.)
Consider a situation where we are going to be concatenating two strings, but the result cannot be overlong, so we're going to preferentially punish the longer of the two components. In C it's pretty short, and moderately efficient. (If we really cared about runtime efficiency we'd factor the string lengths out of the loop entirely, or even eliminate the loop altogether, but that would be more complex, which carries its own problems. We favor here the simplest form, and modest efficiency. As a general policy Good Enough, with a reasonable path for getting Better if it should later turn out to be necessary, is often the best design choice.)
In C:
```
while (((len1 = strlen(s1)) + (len2 = strlen(s2))) > 26)
   if (len1 > len2)
      s1[len1-1] = 0;
   else
      s2[len2-1] = 0;
```
vs Pascal's similar-sized (but far less efficient):
```
while length(s1) + length(s2) > 26 do
   if length(s1) > length(s2) then
      setLength(s1, length(s1) - 1)
   else
      setLength(s2, length(s2) - 1);
```
But! Five different string length calls per loop iteration, versus two. (And only one semicolon.) Any Pascal expression that tries to approach C's runtime efficiency here will be somewhat larger due to the syntactic requirements of the language, which will tend to obscure the intrinsic simplicity of what we are trying to do here. Perhaps:
```
while True do begin
   len1 := length(s1);
   len2 := length(s2);
   if len1 + len2 <= 26 then break;
   if len1 > len2 then
      setLength(s1, len1 - 1)
   else
      setLength(s2, len2 - 1);
end;
```
Bulky. Or, consider another, more pithy example. In C:
```
while (p = next())
   free(p);
```
vs Pascal's equally efficient but much more verbose:
```
while True do begin
   p := next;
   if p <> nil then
      free(p)
   else
      break;
end;
```
or:
```
repeat
   p := next;
   if p <> nil then
      free(p)
until p = nil;
```
Which language best embodies the simple concept: "While there is an X, do Y to it?"
Multiple (chain) assignment, and compile-time initialization of data structures offer considerable clarity and efficiency, if properly used.
Short-circuit evaluation of booleans.
In C:
```
if (p && *p) {...
```
vs Pascal's much bulkier:
```
if p <> nil then
   if ^p <> 0 then ...
```
Pascal is quite a bit wordier. More to the point, you can't even put the Pascal conditional chain everywhere you might want to, like you can with the C idiom. Base Pascal prohibits short-circuit evaluation of booleans.
Many Pascal compilers now offer an option for short-circuit evaluation, where you could then express:
```
if (p <> nil) and (^p <> 0) then ...
```
without risking dereferencing a null pointer, but this behavior is optional, controlled by a compiler pragma, so you have to do more work to engage it, but then just how certain are you that the correct mode has been selected? And that you haven't damaged nearby code that perhaps was relying on the opposite behavior? And that nobody's ever going to change compilers on you?
Separate bitwise and logical operators. E.g. '&', '|', and '~' versus '&&', '||', and '!'. (Pascal has only and, or, and not.)
This difference really only exists because short-circuiting of logical operators is explicit. (Consider that there is no '^^' operator, because there is no short-circuit possible for exclusive-or.) Otherwise the default bitwise values for True/False (1/0) mean that using bitwise operators for logic would often work too, if short-circuit evaluation were not necessary for any reason.
Which is to say, this:
```
if ((a == b) | (c == d)) {
   ...
```
is logically equivalent to:
```
if ((a == b) || (c == d)) {
   ...
```
if short-circuiting were not necessary. However, because C treats any non-zero value as True, many otherwise valid logical expressions using bitwise (instead of logical) operators would actually fail. Consider:
```
if (a & b) {
   ...
```
If a were 1 (True) and b were 2 (also True), this expression, if intended to express logical conjunction, would evaluate to 0 (False)—Fail! This expression is perfectly legal C for detecting if any bits are set in common (intersection) between a and b, so it's not like the compiler should try to flag this usage, as it has no way of knowing which behavior you actually had in mind. (Bitwise intersection versus logical conjunction.)
It is usually best if logical operations are done exclusively with the logical operators. Consider that 'a' and '~a' are both True for any non-zero value! Bugs of this sort are common for a novice C programmer. C, like any powerful tool, has sharp edges; keep your hands out of the blades! (Short-circuit evaluation is expressly prohibited in Standard Pascal, however it is provided as an extension in many newer Pascals.)
Deeper operator precedence tree. C has 15 levels; Pascal, with fewer operators, has only 4. Pascal's paucity of precedence means that almost all even slightly complex expressions need parenthesization in order to function correctly. C's expressions can read more naturally without too many parentheses, thus looking a little cleaner, but even so sometimes you need 'un-natural' parentheses to make it function correctly. (Complex expressions can be a pitfall, no matter what language you're using.)
The lack of nested function definitions. While for teaching purposes Pascal's hierarchical function definition is an advantage, in a practical language it can be more of a disadvantage. In practice, one wants well-defined shareable library functions that can be used (and re-used) in multiple places. You don't want to be pasting (nearly?) identical variants of source code all throughout your programs. (Don't repeat yourself!) C's simple 2-level scoping encourages the building of portable libraries of reusable code.
You also want to know what data connections there are in your code, and because Pascal's nested functions have implicit access to all their containing functions' variables, all the way back up the chain, this means that the data connection graph is fairly complex, and somewhat hidden. This, IMHO, is unnecessary freedom, and can ultimately result in the programmer not knowing what is truly going on. More bugs, and much more difficulty in refactoring, hampering restructuring that might make sense logically, but which in practice would result in too much work, and so does not get done.
Which is to say, Pascal offers you the near-unlimited opportunity to step on (overload) method and variable names you don't even know are there. You think you're looking at one thing when examining your code, but what the compiler actually built (according to its rules) might be something else entirely. More bugs that must be found at runtime. C doesn't do this.
In this case, I argue that simpler is better. Complexity is one of the true sources of problems; unnecessary complexity is just stupid. Having more features is not necessarily better; sometimes it's just giving you more rope with which to hang yourself.
Case-sensitivity. Casing conventions can be used for semantic meaning, whereas Pascal's loosey-goosey casing means that you can't know what you're looking at just by looking. Casing differences might have significant meaning, or it might just be sloppiness. All search, analysis, and IDE tools used with Pascal need to be similarly case-insensitive. Computer programming is generally about exact specifications and behavior, which is diametrically opposed to sloppy casing. (Or indeed, sloppy anything!)
C allows dynamic for-loop control variables, and manipulation thereof; Pascal does not. This means things like parsers of variable-length items (TLV data structures, etc.) are harder to write in Pascal. C can step by any amount, in either direction—even both directions or not at all in a single loop; Pascal can only step through fixed integer ranges by ones, though in either direction. Pascal once touted this as a performance advantage, at least on machines that had low-level loop instructions, but modern code generators and optimizers in C do even better, without the restrictions.
Also, outside the scope of the loop itself, Pascal doesn't guarantee the value of the loop control variable, whereas in C it's always just another variable, like any other. (This is, I believe, due to the implementation of such loops on an early mainframe computer that Pascal grew up with. Once the variable was pushed into a loop control instruction it didn't come back out again afterwards.)
C allows you to write variable-argument functions such as printf(). In Pascal, such functions are few (e.g. WriteLn()) and are built in to the compiler; you cannot write your own. (If using an Object Pascal you might be able to get around this, sometimes, by writing a lot of type-specific code.)
C has the volatile declaration, which allows you to cleanly write device drivers and inter-thread communications without having to delve into assembly language. (C's bitfields, though non-portable, are also of use when writing device drivers. Do be aware, though, that volatile itself is somewhat controversial, in that it doesn't go deep enough for some uses, on [at least some of] the more complicated CPU's.)

Advantage: Pascal

Set operations. A clear win, no argument. The abstraction level is not high enough to cause performance or storage problems, unless you're being stupid about your set sizes. Most Pascal compilers impose reasonable limits on the set size anyway. (256 is common.)
Scalar range checking, ditto.
for-in loop for traversing strings, arrays, and sets, etc. Ditto. (Only in newer Pascals.)
Strings are intrinsic to the language. You just use them, and they work. (However, there are inherent performance and storage issues that might or might not be a problem depending on your needs, so care might be necessary. And there are now multiple, conflicting, encoding systems out there to complicate your day...)
The 'with X do' construct, which I have long thought of as a significant Pascal advantage. It saves a lot of typing. On the other hand, it's also a disadvantage, because it introduces yet another layer of implicit context that can cloud the issue of exactly which variables are being referred to. C seemed a lot cruder, but if 'X' is messy enough that repeated typing of it is repugnant, you can always introduce a short variable "(p->)" to mitigate that, while not clouding the issue of to what, exactly, you're referring. And, if you need to be working with two of these structures, as in a copy or comparison situation, which is not exactly unheard-of, Pascal's shortcut is of no help whatsoever.
This point I'm still waffling about, but I'm a lot less impressed with with than I used to be. It looks great in trivial examples, but falls down a lot when you start trying to do real work. Still, I find I use it quite a bit anyway. Do consider the following (stripped-down) scenario:
```
type rec = record id : integer; timeStamp : TDateTime; end;
var timeStamp : TDateTime; // Current timestamp for marking things.

procedure stampRec(var thing : rec);
begin
   with thing do
      timeStamp := timeStamp;
end;
```
This is syntactically clean and will even compile, but won't do what is intended. The compiler could choose to do any of the following (with-less) assignments:
```
      timeStamp :=       timeStamp;
      timeStamp := thing.timeStamp;
thing.timeStamp :=       timeStamp;
thing.timeStamp := thing.timeStamp;
```
only two of which are potentially useful, and you don't know which it'll choose. (The third is likely what you had in mind, but the fourth is likely what it would choose to do.) If you're going to use with you have to avoid using variables with the same names as fields, including fields you don't necessarily know/care about, even if that's what would otherwise make the most sense. Information hiding is definitely a double-edged sword.
You can't willy-nilly declare new variables all throughout a named function. This 'advantage' of C I believe is actually a detriment, in that it clouds the storage requirements of a function and can result in inadvertent variable name overloading and subsequent bugs.
Separation of variable declaration from variable use. With C (especially the later dialects) it can be very difficult to discern where all the allocation requirements are, and where all the generated code is. They can curl around each other like a basket of snakes. Not so with Pascal.
Likewise, while C's 'new' prototypes are a boon, how they were introduced was not. I loathe the fact that you must use the new-style function definitions when using prototypes. When rummaging through established code I rarely need to see the types of function arguments, and would much prefer the simpler argument lists of K&R C. If there's a typing problem I can always look at the type declarations, otherwise I can skip over them with ease. The compiler should be smart enough to do lint-style type checking, regardless of the form of the function definition. (In fact, the DIAB C compiler, pre-Standard, allowed mixed use of prototypes and K&R function definitions. There was no ambiguity, and no conflict. However, using the register keyword in prototypes to change the function ABI while not doing so in the function definition was a recipe for disaster.)
Being able to declare an API and its implementation in the same file, with the compiler enforcing consistency, means there's a lot less room for error. For most cases (non-library) this is a clear win.
Only when you need to use the C preprocessor's magic, like for system configuration, are there advantages to separate .c and .h files. Also, for libraries, where you are interfacing to code whose source is not provided, you end up needing two files anyway: one for the library provider which has declaration and implementation together, and one for the library user that has only declaration. Now you have two files with the same (!) information in them both, which is bad. (Don't Repeat Yourself!) In this use case C has the advantage. (The .h declaration file is shared, the .c implementation file is kept private, and the compiler itself will check the declarations against the definitions when building the library. You maybe did have to Repeat Yourself somewhat, but at least an automated tool is checking your work.)
Sub-functions. These can really help with factoring out common code fragments, without having to do a full extraction into a common library-able routine as you might have to do with C. E.g.:
```
function deleteTransaction(tranNumber : integer; workDate : TDateTime) : Boolean;
   function delFrom(dbTable : string) : Boolean;
   begin
      result := False;
      if Query('DELETE FROM ' + dbTable + ' WHERE register = ' +
               QI(regNumber) + ' AND workDate = ' +
               QD(workDate) + ' AND tranNumber = ' +
               QI(tranNumber)) then exit;
      result := True;
   end;
begin
   result := delFrom('transaction') and delFrom('lineItem) and delFrom('payment');
end;
```
The sub-function naturally has access to all the parent's variables and arguments, so you don't have to craft a linkage— this is something Pascal does for you. Just extract the fragment of common code and refer to it multiple times. In this example we apply the same operation to multiple SQL tables, guaranteed, and we get a composite result status.
On the other hand, if you should have crafted a library routine you end up with a lot of duplicate code, and multiple maintenance issues. (Don't Repeat Yourself!) Also, these little sub-functions should be just that: little. If they get too big you're probably doing something wrong, and obfuscating the structure of your program rather than clarifying it.
Oh, and beware the presence of any compiler pragma that allows short-circuit AND/OR evaluation. If it were on for this example function it could destroy the guarantee that all database tables be modified together! (This is another win for C: having two sets of Boolean evaluators, making this behavior explicit.) The invocation of the sub-function might better be:
```
begin
{$PUSH}{$B-} // Under no circumstances allow short-circuit evaluation here!
   result := delFrom('transaction') and delFrom('lineItem') and delFrom('payment');
{$POP}
end;
```
in order to make the intended behavior explicit. (This is the
Free Pascal pragma. Others could vary.)
Types vs names order. This is an insignificant difference, except that due to the binding precedence rules of C's pointer indication ('*') it's somewhat annoying that while it's simple to declare several pointers in Pascal:
```
var p1, p2, p3 : ^Char;
```
in C the obvious (but naive) declaration:
```
char* p1, p2, p3;
```
does not get you the same thing. You must use:
```
char *p1, *p2, *p3;
```
to declare several pointers. Not really a big deal, and you won't get far trying to compile your code if you got it wrong, but a minor irritant nonetheless. Normally only a problem for C novices.
Pascal's packed keyword allows you to tell the Pascal compiler that record (structure) storage space is at a premium, versus performance. However, this doesn't allow you to actually control the packing, but rather expresses your vague preference. C gives you this ability too, but less explicitly, and so it usually requires more work on your part.

Conclusion

C, no contest. (And this is not because I learned it first. As mentioned above, I learned Pascal first!)

There's a reason that C is the language in which most (if not all, now) newer languages are implemented, at least at first. Its data and control structures align very well to how almost all machines actually work at the low level, so C lives up to one description of it as a "high-level assembly language." With C you are very close to the machine, and can extract the ultimate in performance. Yes, that comes at a cost, but if you're mapping an abstraction to machine level, for whatever reason, that cost must be paid, by somebody. C allows you to get very close to the machine, if that's what you want, yet at the same time it allows you to easily construct very high levels of abstraction, all without hiding from you what's going on. The choices are up to you, and that freedom is C's ultimate strength.

However, Pascal's also a perfectly good choice, there's not enough wrong with it to make one want to avoid it, if there are other good reasons for using it.

Extra Credit

... if you noticed that C's advantage of a lack of sub-functions contradicts Pascal's advantage of a presence of sub-functions! It's the whole gift vs curse thing. Sometimes life is like that...

Return to Site Home