The operating system is the most important and among the first of the software subsystems developed for a new computer. It is commonly the most complex of the programs ever developed for a given computer system; it defines the functions available to all other programs. This is why operating systems have so greatly influenced software technology.
The influence of operating systems has been felt in two ways. First, operating systems set the environment for applying all other software technology, whether the application is to user software or to system software. Second, operating systems research has in fact motivated much of the development of software technology, especially large software systems. Many early software problems were first formalized and treated as operating systems problems. Their solutions were then generalized and applied to other forms of software.
This paper surveys some of the influence of operating systems research on software technology. After a short historical discussion, it turns to the significant results of operating systems research. These results include user interfaces, resource allocation, and process management. The final part of the paper reviews current research (and trends) in operating systems. It is of course impossible to cover adequately all aspects of operating systems research in a short paper. We hope that the comments by the discussants and the bibliography of important papers will help to fill the gaps.
Early operating systems were developed as control programs (or control systems) for their computers. These systems had two broad objectives: to make the machine simpler to use, and to increase the system's productivity.
These two objectives required that the operating system completely control all system resources. Minor modifications to early computer hardware easily provided the operating system with some control. Centralized control over the physical resources permits implementing abstract resources which are simpler to use; and centralized schedulers permit the operating system to effectively increase the system's productivity.
Techniques to improve the efficiency and productivity of computer systems were of primary concern in early operating systems research. The disparity in processing speeds of the central processor unit (CPU) and its input/output (I/O) devices suggested sharing or multiplexing of resources as a means of using the resources more efficiently. Multiprogramming, which implements the sharing of the central processor and main memory among computer users, quickly became a dominant theme, producing extensive research on resource management, scheduling, modeling, measurement, and simulation. This research sought to evaluate and improve existing policies and mechanisms.
Resource sharing must not affect the correct execution of the user programs. A user's view of the system must be independent of the number of users of the system and, hence, of the actual state of the physical hardware. Each user is thus presented with an idealized virtual machine -- an abstraction of the underlying computer which does not depend on the presence of other users in the system [Buzen 1973; Popek 1974]. Many early operating systems implemented abstract devices as a tool to insulate users from the details of hardware and as a means of providing greater reliability. The abstract device was often more convenient to use than the actual device. An example is a file system which abstracts from the physical properties of disks and tapes, implementing user-named logical files.
Multiprogrammed operating systems implement multiple, concurrently operating virtual machines. The term "process" refers to a program executing on a virtual machine. Operating systems research has produced theoretical and practical results for managing processes. The study of cooperating communicating parallel processes was the first step in a progression of increasingly sophisticated design structures for operating systems.
We see in retrospect that abstraction and virtualization have been central themes in operating systems research. The early concern for efficient operation led to multiprogramming which, in turn, produced both a need for further abstractions and virtualization and spawned a large research literature on scheduling, modeling, and other aspects of performance.
Operating systems research has had an enormous impact on many areas of computer science. Five are most notable.
Programming languages which implement concepts such as monitors (Concurrent Pascal [Brinch Hansen 1976]) and abstract data types (Alphard [Wulf 1976], Clu [Liskov 1977], Gypsy [Ambler 1977]) are now appearing [Jones 1977]. It is not accidental that the designers of these language systems all have strong research roots in operating systems. The use of these concepts can have profound effects on the programming process. This has been demonstrated in the operating system environment. These concepts allow formal approaches to the specification and proof of correctness [Hoare 1974; Robinson 1975] and implementation of reliable systems [Linden 1976]. Brinch Hansen reports the implementation of a complete operating system in 1300 lines of Concurrent Pascal.
Operating systems research has steadily improved the efficiency and effectiveness of computer systems. This improvement has come both from insight and understanding of computing processes and also from improved techniques of resource allocation. The most significant visible result has been the development of successful time-sharing operating systems, bringing interactive computing to the individual at a reasonable cost. Supporting this highly visible impact is a large literature on resource scheduling and process control.
Another area of significant visibility has been structuring methods for large software projects. This is currently a highly active area of research. It is discussed fully in the section on Current Research.
The ultimate justification for computers is to enhance human capabilities by speeding up computations and performing routine clerical activities. Interactive computing, an essential feature of "time-sharing" computer systems [Wilkes 1975], allocates computer capacity as needed to humans. This major result of operating systems research has influenced software technology enormously.
There are two kinds of time-sharing systems. A general-purpose time-sharing system offers a spectrum of services which help users to develop, test, store, execute, and share programs. A special-purpose time-sharing system offers some particular type or class of service. An example is a "single language system", in which the user may write and execute programs only in a single language (such as Basic or APL). Another example is a transaction system, in which the user has access to a specific set of functions defined on a data base (such as an airline reservation system, a bank teller system, or a medical information system).
The most famous early time-sharing system was CTSS at MIT [Corbato 1962]. Other general-purpose systems were developed shortly thereafter at Bolt Beranek and Newman for the DEC PDP-1, and at the University of Manchester for the TITAN project [McKeag 1976]. Another well known system of the same period was built at the University of California at Berkeley for an XDS 940 [Lampson 1966]. These early systems and their principles have been treated by Wilkes. Several special-purpose time-sharing systems were also developed in this period.
Operating systems development by major vendors for noninteractive, or "batch", processing was well under way when experimental time-sharing systems were being designed. Consequently, time-sharing capabilities were not originally included in these major systems. So successful -- and salable -- have been time-sharing systems, that now every major vendor operating system is either interactive or supports a time-sharing subsystem. Interactive computing is even more popular with minicomputers; the UNIX operating system [Ritchie 1974] for the PDP-11 minicomputer is an example. Some large processors, notably the IBM 360/67 and the GE 645 were designed especially to support interactive computing.
The current technical and commercial impact of interactive computing is vast. The essential function of a computer -- efficiently collecting and distributing information for people -- is served naturally by these systems. It is difficult to conceive of systems such as an airline reservation system operating in a batch mode. Applications such as military command and control would be impossible without rapid interaction. Interactive computing has contributed strongly to program design and development through on-line file systems and such programs as text editors and incremental compilers. Interactive program development and use have been found to improve human productivity in many installations.
The theme of multiprogramming appearing in most operating systems research has been inspired by the desire to utilize expensive resources efficiently by sharing them. The separate (virtual) machines of the users are in fact simulated on a common, shared, physical hardware system. The resource allocation policies used to implement virtual machines have comprised CPU (central processing unit) scheduling, primary memory management, and to a lesser extent, auxiliary memory management. Each of these areas is discussed in the following sections.
Effective CPU scheduling was a requirement for the transition from the batch processing systems of 1960-1965 to the multiprogramming and time-sharing systems of the late 1960s and early 1970s. There are two fundamental scheduling objectives: maximizing processor efficiency and throughput, and minimizing the response time for short transactions in interactive systems. These objectives are not always compatible.
The CPU scheduling problem introduced some new factors not previously considered in the general scheduling literature: the service time required by an individual request is not known a priori; the service time distributions are highly hyperexponential; preemption is required for fair sharing and responsiveness. Thus, operating systems research has motivated new work in scheduling theory.
It was discovered early that scheduling first the job whose CPU service request is shortest minimizes overall response time (especially for short jobs). Preempting jobs with long CPU service requests allows "I/O-bound" jobs to proceed to the I/O devices of the system, thereby increasing throughput significantly [Stevens 1968; Ryder 1969; Marshall 1969]. Predictive and quantum-based scheduling prevent long jobs from monopolizing the CPU.
In a study of predictive schedulers, Sherman, Baskett, and Browne demonstrated that straightforward prediction, when coupled with preemption of long jobs, gives near-optimal processor utilization. They also showed that a simple round robin scheduler with a properly chosen quantum size may produce CPU utilization almost as high as the most sophisticated predictors (see also [Lassater 1973; Lering 1975; Cho 1975]). Most modern commercial operating systems use some form of predictive CPU scheduling with preemption.
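The effect of quantum-based preemption on short jobs can be seen in a small simulation. The sketch below (in Python, with an invented job mix; not drawn from any system discussed here) compares mean completion times under first-come-first-served and round robin scheduling when all jobs arrive together:

```python
from collections import deque

def fcfs(jobs):
    """First-come-first-served: completion time of each job (all arrive at t=0)."""
    t, done = 0, []
    for burst in jobs:
        t += burst
        done.append(t)
    return done

def round_robin(jobs, quantum):
    """Round robin with a fixed quantum; all jobs arrive at t=0."""
    queue = deque((i, burst) for i, burst in enumerate(jobs))
    t, done = 0, [0] * len(jobs)
    while queue:
        i, remaining = queue.popleft()
        run = min(quantum, remaining)      # run for one quantum or to completion
        t += run
        if remaining > run:
            queue.append((i, remaining - run))  # preempted: back of the queue
        else:
            done[i] = t
    return done

jobs = [24, 3, 3]                              # one long job, two short ones
print(sum(fcfs(jobs)) / len(jobs))             # FCFS mean completion time: 27.0
print(sum(round_robin(jobs, 4)) / len(jobs))   # RR mean: short jobs finish early
```

With one long job and two short ones, round robin lets the short jobs finish at times 7 and 10 instead of 27 and 30, cutting the mean completion time roughly in half.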
Interactive systems face the challenge of providing reasonable response and high processor utilization without unduly delaying long jobs. The class of multilevel feedback schedulers (usually with discrete quanta and preemption) was proposed to meet this challenge [Corbato 1962]. Many operating systems, including Multics and IBM's MVS for the 370, use a multilevel feedback scheduler. Research on CPU scheduling has resulted in identifiable changes in operating systems over the years.
Multilevel feedback schedulers have been criticized because they are difficult to analyze. The complexity of analysis [Coffman 1973] does not diminish the importance of these schedulers. Though difficult, the analyses confirm the simple intuition on which these schedulers are based: they approximate minimal response time for short jobs, without prior knowledge of service time requirements.
Analyses of scheduling problems in early computer systems were based on classical queueing theory: a single server with a queue, Poisson arrivals, and arbitrary service times (the M/G/1 queueing system). Analyses for round-robin, multilevel, and priority schedulers are extensions of this classical model [Coffman 1973; Kleinrock 1975]. Many significant extensions to scheduling theory for discrete quantum-based and preemptive schedulers were developed.
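The classical M/G/1 result underlying these analyses is the Pollaczek-Khinchine formula, which gives the mean waiting time from the arrival rate and the first two moments of the service distribution. The sketch below uses invented numerical values to show why hyperexponential (high-variance) service times hurt response even at the same utilization:

```python
def mg1_mean_wait(lam, mean_s, second_moment_s):
    """Pollaczek-Khinchine mean waiting time for an M/G/1 queue:
    W = lambda * E[S^2] / (2 * (1 - rho)), where rho = lambda * E[S]."""
    rho = lam * mean_s
    assert rho < 1, "queue is unstable at this load"
    return lam * second_moment_s / (2 * (1 - rho))

# Exponential service with mean 1: E[S^2] = 2
print(mg1_mean_wait(0.5, 1.0, 2.0))    # 1.0
# Hyperexponential service, same mean but E[S^2] = 10 (high variance)
print(mg1_mean_wait(0.5, 1.0, 10.0))   # 5.0
```

At 50% utilization, raising the second moment of the service time from 2 to 10 while holding the mean fixed multiplies the mean wait by five, which is why the highly hyperexponential distributions noted above made preemptive policies attractive.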
Even as the analysis techniques for single-resource queueing systems were being perfected, they were becoming obsolete: computer systems were evolving into networks of resources. Scherr's study of CTSS, based on a simple network (the machine repairman), showed that the model structure is far more important in leading to accurate results than the details of the CPU service distribution or the scheduler. Moore reached a similar conclusion in his analysis of MTS. Buzen introduced the "central server" queueing model, which has become an important analytic tool [Buzen 1973; Baskett 1975; Buzen 1975; Gelenbe 1975; Muntz 1975; Gelenbe 1976]. A recent development is the reformulation of the models under operational assumptions, so that the mathematics no longer depend on Poisson arrivals or exponential service distributions; the latter assumptions reduce the credibility of classical queueing analyses [Buzen 1976; Denning 1977]. The application of queueing network models to computer systems is covered at greater length in the section "Performance Measurement: The Connection to Reality", elsewhere in this volume.
Because memory management techniques rely heavily on memory addressing hardware, memory management research and computer architecture have interacted strongly. The requirement in early multiprogramming systems that each job be loaded entirely in a contiguous region of main memory was eventually relaxed by paging and segmentation schemes. Research results have quantified the overhead inherent in these schemes and have shown how to partition memory to minimize waste. They have discovered efficient procedures for compacting programs in memory and for swapping programs in and out of main memory. Significant gains in efficiency have resulted in the cases where this research has been applied.
The most important class of results in memory management has dealt with the concept of virtual memory, implemented either by page or segment based addressing (or both) [Denning 1970]. Paging has been popular because it is simple to implement; however, segmentation is of increasing interest because it allows the addressing mechanism to be used for protecting small, logical program units.
Paged virtual memory originated in the late 1950s with the ATLAS project at Manchester University [Kilburn 1962]; segmented virtual memory originated in the Burroughs B5000 series computers. A combination of segmentation and paging is used in Multics [Organick 1972]. Segmentation has been viewed both as a natural extension of overlaying techniques and as a powerful means of controlling access and sharing. Papers by Dennis  and by Arden, et al.  became standard references. These ideas influenced such commercial products as the IBM 360/67 with CP/67 (now VM/370), MTS (Michigan Terminal System), and the GE 645 (now Honeywell 6180) hardware for the Multics operating system.
Early virtual memory systems performed poorly; many still do. The reason is "thrashing", that is, serious degradation of CPU efficiency caused by excessive paging. Thrashing occurs when too many programs are forced into a fixed main memory. There has been extensive experimental work aimed at understanding which memory management policies are best. A famous early study by Belady focussed on fixed partition policies and included results for an (unimplementable) optimal paging algorithm, while more recent studies have included variable partition policies [Denning 1975; Hoare 1972]. Mattson, et al. developed a "stack model" for memory policies enabling efficient data collection and analysis for common paging policies (see also [Coffman 1973]).
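The stack method can be illustrated for the LRU policy: one pass over a reference trace yields the stack distance of every reference, from which hit counts for all memory sizes follow at once. This is a minimal sketch with an invented trace, not a reconstruction of the published implementation:

```python
def lru_stack_distances(trace):
    """One pass over a page-reference trace; for each reference, record its
    depth in the LRU stack (the 'stack distance' of Mattson et al.)."""
    stack, distances = [], []
    for page in trace:
        if page in stack:
            d = stack.index(page)           # depth: 0 = most recently used
            stack.pop(d)
            distances.append(d + 1)         # a memory of >= d+1 pages hits
        else:
            distances.append(float('inf'))  # cold miss at every memory size
        stack.insert(0, page)               # page becomes most recently used
    return distances

def hits_for_size(distances, m):
    """Number of hits LRU would score with a memory of m page frames."""
    return sum(1 for d in distances if d <= m)

trace = ['a', 'b', 'c', 'a', 'b', 'd', 'a']
dist = lru_stack_distances(trace)
print([hits_for_size(dist, m) for m in (1, 2, 3, 4)])   # [0, 0, 3, 3]
```

A single data-collection pass thus evaluates the policy for every memory size, which is what made the stack model so useful for paging studies.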
The working set model of program behavior [Denning 1968] defined a method for determining the active part of the address space of an executing program, thereby laying the foundation for controlling thrashing through a combination of CPU scheduling and memory management [Denning 1976]. Only recently, when queueing models allowed analyzing the effects of memory policies and load controllers, have we discovered effective scheduling methods for avoiding thrashing. Bard installed such a method in CP-67. Rodriguez-Rosell and Dupuy showed that by preventing the level of multiprogramming from getting too high, a working set dispatcher could effectively control thrashing. Denning, et al. demonstrated that optimal load controls are not difficult to define and implement, and that although load control has a greater effect than the memory policy, the working set dispatcher gives a robust control which is nearly optimal [Denning 1975; Denning 1978].
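The working set itself is simple to state: W(t, T) is the set of distinct pages referenced in the last T references up to and including time t. A minimal sketch, with an invented reference trace:

```python
def working_set(trace, t, window):
    """Denning's working set W(t, T): the distinct pages referenced in the
    last `window` references up to (and including) reference number t."""
    start = max(0, t - window + 1)
    return set(trace[start:t + 1])

trace = ['a', 'b', 'a', 'c', 'a', 'a', 'd']
print(working_set(trace, 4, 3))   # {'a', 'c'}
```

A working set dispatcher keeps a program's working set resident and admits a new program to the multiprogramming mix only if all resident working sets still fit in memory, which is the load control described above.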
Another significant outgrowth of memory management research has been the modeling and quantification of "locality" in program behavior [Spirn 1977]. Demand paging works only because programs localize their references in address spaces for significant intervals of time [Denning 1968]. Only recently have reliable procedures been developed to measure [Madison 1976] and analyze locality [Courtois 1976; Courtois 1977; Denning 1978]. Procedures for assigning small program segments among large pages can produce significant improvements in virtual memory performance by maximizing locality of reference to pages [Ferrari 1975].
The impact of virtual memory is obvious. The dominant medium and large scale commercial systems use segmented or paged virtual memories. Many minicomputers and microcomputers use virtual memory architectures, though frequently only to solve the relocation problem for small segments placed in a large physical memory. Nearly all paging systems employ a variable partition memory allocation policy. The most successful of these policies derive from the working set model.
Auxiliary memory devices, whose access times are significantly slower than main memory access times, tend to be congestion points in a system. The policies of managing these devices are therefore concerned with maximizing throughput by eliminating avoidable idle time.
The paging drum (and fixed head disk), whose use in paging systems is widening, has been studied extensively. Requests to use the drum are sorted into queues, one for each drum sector; the drum services the request at the head of a given queue as the corresponding sector rotates beneath the read-write heads. This drum will behave according to a shortest-access-time-first (SATF) policy. Since an SATF policy minimizes total accumulated latency time, it maximizes possible throughput for a fixed load on the drum [Coffman 1969; Denning 1972; Coffman 1973; Fuller 1974].
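The sector-queueing discipline can be sketched as follows: requests are sorted into per-sector FIFO queues, and the request at the head of each queue is served as its sector rotates under the heads. The request names, sector count, and starting position below are invented for illustration:

```python
from collections import deque

def drum_service_order(requests, sectors):
    """Sector queueing for a paging drum: one FIFO queue per sector; as each
    sector passes the heads, the head of its queue is served. The resulting
    order realizes shortest-access-time-first (SATF) behavior."""
    queues = [deque() for _ in range(sectors)]
    for req_id, sector in requests:
        queues[sector].append(req_id)
    order, position = [], 0          # heads start over sector 0
    while any(queues):
        if queues[position]:
            order.append(queues[position].popleft())
        position = (position + 1) % sectors   # drum rotates one sector
    return order

# requests (id, sector) on a 4-sector drum
reqs = [('r1', 2), ('r2', 0), ('r3', 2), ('r4', 1)]
print(drum_service_order(reqs, 4))   # ['r2', 'r4', 'r1', 'r3']
```

Each revolution serves one request per nonempty sector queue, so no rotational latency is wasted while any queue holds work.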
Analyses of moving-arm disks have been less conclusive. The problem is that an SATF policy -- which in practice is approximated by a shortest-seek-time-first policy -- tends under heavy loads to discriminate against requests for the inner and outer cylinders of a disk. Although the throughput is maximum, response time may suffer because certain requests may wait excessively. A comprehensive study by Teorey and Pinkerton showed that no single policy is uniformly best over the entire range of load conditions. Several model studies have analyzed the effect of overlapping seek and transmission operations for multiple disks connected to a single controller; but the results depend strongly on the workload.
The performance of policies for reducing mechanical motion at a disk or drum is usually determined by considering the device off-line -- i.e., under specific loads. It is still open whether motion-reducing scheduling produces observable effects on overall performance when the device is on-line. Certain load controllers, for example, will set the level of multiprogramming to keep the average paging-drum queue length at 1 request [Denning 1976] -- under such a light load, the SATF policy may have no discernible effect. The standard practice of grouping the records of the same file on the same disk cylinder results in a majority of disk requests without seeks [Lynch 1972]; even under heavy disk loads, there is some question about the overall importance of arm-movement optimization.
Third generation computer systems introduced a new aspect of program design: concurrent or parallel programming. CPU scheduling and memory management mechanisms allow for creating and managing multiple virtual machines; but they do not address the logical control operations by which programmers may coordinate concurrently executing processes. A considerable body of theory has evolved in this area.
The term "process" has been used since the early 1960's to denote an abstraction of the activity of a processor. It gives the concept "program in execution" meaning irrespective of whether a processor happens to be executing the instructions of the given program at a given time. The development of this concept was a major advance; it provided a framework for concurrent programming.
To be reliable, systems supporting concurrent programs must satisfy four properties: determinacy, freedom from deadlock, indivisible operations, and synchronization.
Three types of approaches to the deadlock problem have been studied: (a) prevention, which constrains the ways processes may request resources so that a circular wait can never arise; (b) avoidance, which grants a request only if the resulting state is safe; and (c) detection and recovery. Type (c) approaches may be expensive in execution and are unsuitable where fast response is desired. Type (b) procedures tend to have high overhead when the system is near saturation [Sherman 1974]. All the procedures for approaches (b) and (c) rely on knowing how many units of each resource type are available in the system. When there is no bound, as with semaphore counts, the procedures cannot be used. For this reason, approaches of type (a), or approaches based on preemption, are the most practical.
The most common approach of type (a) is called ordered resource usage. The system resources are partitioned into classes Ci, which are arranged in a hierarchy (such as a linear chain or a tree). A process which already holds resources from Ci may request resources from Cj only if Cj is below Ci in the hierarchy. This procedure prevents circular wait and operates irrespective of whether there are bounds on resource quantities. Since the partitioning restricts the ability to share, resource utilization may be lower than with type (b) or (c) methods.
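Ordered resource usage can be sketched as a simple run-time check, assuming numeric ranks for the classes (the class names and ranks below are invented): a request is granted only if the requested class lies strictly below every class the process already holds.

```python
class OrderedResources:
    """Deadlock prevention by ordered resource usage: resource classes are
    ranked, and a process holding resources of some rank may request only
    strictly lower ranks, so no circular wait can form."""
    def __init__(self, rank):
        self.rank = rank    # resource class -> position in the hierarchy
        self.held = {}      # process -> set of classes currently held

    def request(self, process, cls):
        held = self.held.setdefault(process, set())
        if held and self.rank[cls] >= min(self.rank[c] for c in held):
            raise RuntimeError(f"{process} may not request {cls}: violates ordering")
        held.add(cls)
        return True

mgr = OrderedResources({'files': 2, 'buffers': 1, 'printer': 0})
mgr.request('p1', 'files')
mgr.request('p1', 'buffers')    # allowed: 'buffers' is below 'files'
try:
    mgr.request('p1', 'files')  # above the lowest class held -> refused
except RuntimeError as e:
    print(e)
```

Because every process acquires classes in strictly descending rank order, no cycle of processes waiting on one another can arise, regardless of how many units each class contains.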
The major results of the research on process management have been accepted into practice because they are straightforward, they are effective in keeping concurrency simple, and they can be rigorously proved correct. The Venus operating system [Liskov 1972], for example, implements synchronization primitives in microcode. Habermann showed that with each semaphore we can associate three auxiliary variables: s(t), the number of sends; w(t), the number of attempted waits; and r(t), the number of received signals, as of time t. The relation r(t) = min[s(t), w(t)] is invariant and can be used to establish the correctness of solutions to many synchronization problems. Brinch Hansen has refined this method, and Howard has recently shown how to generalize it with history variables for use with arbitrary monitors.
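Habermann's invariant can be checked mechanically by replaying a trace of semaphore operations. The sketch below (invented trace, initial semaphore count zero) tracks s(t), w(t), and r(t) and asserts r(t) = min[s(t), w(t)] after every event:

```python
def replay(events):
    """Replay a trace of 'send'/'wait' events on a semaphore with initial
    count 0, tracking Habermann's auxiliary variables s(t), w(t), r(t)."""
    count = blocked = 0
    s = w = r = 0
    for ev in events:
        if ev == 'send':
            s += 1
            if blocked:
                blocked -= 1
                r += 1              # a blocked waiter receives this signal
            else:
                count += 1          # signal is banked in the count
        else:                       # 'wait'
            w += 1
            if count:
                count -= 1
                r += 1              # the wait is satisfied immediately
            else:
                blocked += 1        # the process blocks
        assert r == min(s, w)       # Habermann's invariant holds throughout
    return s, w, r

print(replay(['wait', 'send', 'send', 'wait', 'wait']))   # (2, 3, 2)
```

The invariant holds at every step of every interleaving, which is what makes it a usable assertion in correctness proofs.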
Current operating systems research is building on the significant concepts of the past. Now that CPU scheduling and memory management are well understood, most investigators assume that the operating system comprises multiple concurrent processes executing in virtual memory. Attention is shifting back to the user interface, concentrating on such problems as information security and system structure.
Protection and sharing of information are essential but conflicting goals. It is extremely difficult to guarantee that, under all conditions, a process may acquire all and only that information to which it is entitled. All implemented protection and sharing mechanisms have been found to contain some form of a "loophole", though some systems are certainly more reliable and secure than others.
An important source of security violations is the system's access control mechanism or access policy specification. One attractive solution is based on a security kernel -- a minimal set of operating system programs which, if correct, will guarantee the correctness of all access-controlling operations throughout the system. Several projects to construct such a kernel are underway [Walter 1974; Robinson 1975; Millen 1976].
The concept of a "capability machine" has received considerable attention as a basis for a highly reliable and secure computer system. Dennis and Van Horn proposed a basic design, in which processes had lists of capabilities specifying accessible objects (processes, segments, procedures, capability lists) and allowable access modes to them; there was a small set of commands for accessing objects and manipulating capability lists. The Hydra operating system is an experimental capability machine emphasizing the implementation of "abstract data types" [Wulf 1974; Cohen 1975]. The Plessey 250 [England 1972] has demonstrated the feasibility of a capability architecture, and the concept may appear in future computer architectures.
It is often useful to verify, during compilation, a program's authorization to access objects [Conway 1972; Brinch Hansen 1973]. This is because assurances are needed that a program is correct and reliable before it is put into operation. However, compiler checking cannot prevent hardware malfunctions or data entry errors; thus, run-time checking is also needed for a highly secure and reliable system. The required run-time checks are simple extensions of virtual memory address translations [Denning 1976; Linden 1976].
Formal treatments of protection problems began with the access matrix concept [Lampson 1971; Denning 1971; Graham 1972]. All existing implementations of access protection can be deduced from this concept. By showing that the states of an access matrix can simulate the configurations of a Turing machine, Harrison, et al. proved that the safety question -- can a given user ever acquire a given access right to a given object? -- is computationally intractable. Even if we can prove that a system's access control is safe, there are other ways to violate security. "Leakage channels", which pass encoded messages along legitimate access paths, are virtually impossible to seal [Lampson 1973]. Though these results do not prove that the safety question is intractable in all practical systems, they do help explain why these questions are persistently difficult to answer.
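A minimal sketch of the access matrix concept, with invented subjects and objects: entries are sets of rights, and every access is mediated by a lookup in the matrix.

```python
class AccessMatrix:
    """Lampson's access matrix: rows are subjects, columns are objects,
    and each entry is the set of rights the subject holds on the object."""
    def __init__(self):
        self.rights = {}    # (subject, object) -> set of rights

    def grant(self, subject, obj, right):
        self.rights.setdefault((subject, obj), set()).add(right)

    def check(self, subject, obj, right):
        """Mediate an access: allowed only if the right is in the entry."""
        return right in self.rights.get((subject, obj), set())

m = AccessMatrix()
m.grant('alice', 'payroll.dat', 'read')
print(m.check('alice', 'payroll.dat', 'read'))    # True
print(m.check('alice', 'payroll.dat', 'write'))   # False
```

The intractability result cited above concerns not this check, which is trivial, but the safety question: whether any sequence of grant and transfer commands can ever place a given right in a given entry.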
To handle information security problems which are not conveniently represented within the access matrix model, various research projects have been established. These projects have identified information security problems at three successively more difficult levels:
The impact of security research on major commercial systems has been slight. Only two major vendor systems offer appreciable protection capabilities: IBM's VM/CMS/370 and Honeywell's Multics. VM/CMS achieves protection at the expense of sharing by establishing disjoint address spaces for each copy of its individual user operating systems. Multics attaches access rights to each object in its file system and also employs special "rings of protection" hardware [Saltzer 1975]. The HYDRA [Wulf 1974; Cohen 1975; Wulf 1975] operating system for the C.mmp multiprocessor is an experimental capability machine.
To be useful, the abstractions implemented by an operating system must correspond to the semantic concepts people use to solve problems. This requires narrowing the gap between languages and operating systems. There has been much recent investigation of semantic concepts and the corresponding structures in operating systems [Jones 1977]. Besides enhancing problem solving ability, this research is helping to simplify systems, documentation, and maintenance.
Errors are committed often in designing operating systems. Accordingly, a considerable effort has been directed toward program structures that are amenable to being proved correct. Concurrency makes operating system structures more difficult to prove correct than the sequential processes embodied in most programming languages. A major thrust of the research has, therefore, emphasized correctness of computation with concurrent operations.
Despite the prominence of the top-down design approach in the structured programming literature [Zurcher 1969; Dijkstra 1971; Parnas 1972], operating system designers seem to have had much success with techniques strongly resembling the bottom-up approach [Dijkstra 1968; Liskov 1972; Robinson 1975] -- especially level-structured systems and monitors. Habermann, Flon, and Cooprider proposed a method of integrating top-down design with bottom-up implementation.
In his study of concurrent processes, Dijkstra showed that the major timing and sequencing problems could be solved with the primitive P and V operations on semaphores. Habermann exhibited an invariant relation among sent and received signal counts for P and V operations, demonstrating that formal proof techniques can be applied to concurrent programs. Howard showed that invariant relations among history variables can be used to prove the correctness of very general concurrent structures. Owicki and Gries have studied axiomatic systems for concurrent programs.
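The mutual exclusion problem solved by P and V can be sketched with a binary semaphore; here Python's threading.Semaphore stands in, with acquire and release corresponding to P and V (the counter workload is invented for illustration):

```python
import threading

counter = 0
mutex = threading.Semaphore(1)   # binary semaphore: acquire = P, release = V

def increment(n):
    global counter
    for _ in range(n):
        mutex.acquire()          # P: enter the critical section
        counter += 1             # read-modify-write, now indivisible
        mutex.release()          # V: leave the critical section

threads = [threading.Thread(target=increment, args=(10000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter)   # 40000: no updates are lost under any interleaving
```

Without the P/V bracketing, two processes could read the same old value of the counter and one update would be lost; the semaphore makes the read-modify-write sequence indivisible.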
A collection of parallel programs can be regarded as implementing a network of servers that accomplishes some given task. Rather than proving termination, as must be done for ordinary sequential programs, one must prove that all work submitted is eventually completed. Dijkstra suggested that a hierarchical structure of processes, with downward calls and upward returns, could easily be proved to complete all work by showing that each process returns to a "homing position". Brinch Hansen has refined this argument for processes that communicate by message buffers. The "correctness" of arbitrary networks of processes can be established so long as each task in the network progresses toward completion at each step through the network [Lauesen 1975].
The concept of monitors has attracted considerable attention as a method of organizing system functions to avoid synchronization problems. A monitor is a set of procedures managing a given set of hardware or software resources. It includes a locking mechanism that guarantees indivisibility of monitor operations by allowing at most one process to execute inside the monitor at a time. A language notation for monitors based on SIMULA classes was proposed by Brinch Hansen. Hoare has clarified its properties and shown axiomatic approaches for proving a monitor's correctness; Howard extended this by showing how to use history variables as a basis for the invariant assertions needed in the proofs. A small operating system of very limited capability, SOLO, was written in 1300 statements of Concurrent Pascal, a language implementing processes and monitors [Brinch Hansen 1976; Brinch Hansen 1977].
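The monitor discipline can be sketched with a lock and condition variables, here for a bounded buffer (an invented example in Python rather than Concurrent Pascal): the lock admits one process at a time, and deposit and remove wait on conditions when the buffer is full or empty.

```python
import threading

class BoundedBufferMonitor:
    """A monitor in the Hoare / Brinch Hansen style: one lock guards all
    procedures, so at most one process executes inside the monitor at a
    time; condition variables suspend callers that cannot yet proceed."""
    def __init__(self, capacity):
        self.buf, self.capacity = [], capacity
        self.lock = threading.Lock()
        self.not_full = threading.Condition(self.lock)
        self.not_empty = threading.Condition(self.lock)

    def deposit(self, item):
        with self.lock:                        # enter the monitor
            while len(self.buf) >= self.capacity:
                self.not_full.wait()           # buffer full: wait outside
            self.buf.append(item)
            self.not_empty.notify()            # wake a waiting consumer

    def remove(self):
        with self.lock:                        # enter the monitor
            while not self.buf:
                self.not_empty.wait()          # buffer empty: wait outside
            item = self.buf.pop(0)
            self.not_full.notify()             # wake a waiting producer
            return item

buf = BoundedBufferMonitor(2)
out = []
consumer = threading.Thread(target=lambda: out.extend(buf.remove() for _ in range(3)))
consumer.start()
for x in (1, 2, 3):
    buf.deposit(x)
consumer.join()
print(out)   # [1, 2, 3]
```

All shared state is touched only while the monitor lock is held, so the invariants on the buffer need to be verified only at the monitor's procedure boundaries, which is the source of the proof simplifications noted above.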
Despite the simplifications which monitors may allow in a system and its proof methods, there is a danger: the requirement of mutual exclusion of all operations within a given monitor may introduce excessive queueing at the monitor entries. Early versions of Multics, for example, could not support three processors very well because contention on the ready-list lock blocked the third processor most of the time. Present versions of Multics employ a large number of separate locks, each for a small portion of the ready list; but even this scheme loses 7% of all available time among 4 processors in waiting for locks to be released. Another possible problem with the centralization inherent in monitor-based scheduling may be incompatibility with distributed architectures. The resolution of these questions awaits performance studies of multiprocessor systems based on monitors.
As important as the correct operation of an operating system is its continuous and reliable operation. An operating system must provide reliable service despite hardware malfunctions, program flaws, and data entry errors. The need for fault tolerance may place severe constraints on the data structures and algorithms used to provide services to the users. Current approaches to fault tolerance are based on extensive checkpointing to allow recovery in case of a detected error, or on recording operations so that the system may recover from an error by reversing the effect of all actions back to a point prior to the error. These techniques are expensive and used only in special cases. Randell has been studying the possibility of more efficient checkpointing based on "recovery blocks" declared in the programming language. There is a strong case that capability machines have a high degree of fault tolerance, but the users of these machines must be willing to accept the severe constraint of small protection domains [Denning 1976; Linden 1976].
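The recovery-block scheme can be paraphrased in a few lines: checkpoint the state, run the primary alternate, and if its acceptance test fails (or it raises an error), restore the checkpoint and try the next alternate. This is a minimal sketch of the idea, not Randell's notation; the function names are invented:

```python
import copy

def recovery_block(state, alternates, acceptance_test):
    """Try each alternate in turn against a checkpointed copy of the
    state; commit the first result that passes the acceptance test."""
    for alternate in alternates:
        checkpoint = copy.deepcopy(state)    # establish a recovery point
        try:
            result = alternate(checkpoint)
            if acceptance_test(result):
                return result                # acceptance test passed
        except Exception:
            pass                             # discard damaged state, retry
    raise RuntimeError("all alternates failed the acceptance test")

# Example: a flawed fast routine backed up by a slow reliable one.
def fast_but_wrong(xs):
    return -1                                # simulated program flaw

def slow_but_safe(xs):
    return sum(xs)

total = recovery_block([1, 2, 3],
                       [fast_but_wrong, slow_but_safe],
                       acceptance_test=lambda r: r >= 0)
print(total)  # -> 6
```

The efficiency question the text raises is visible even here: the `deepcopy` checkpoint is the expensive step, which is why language-level recovery blocks try to checkpoint only what an alternate can actually modify.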
This research is influencing, and is influenced by, emerging architectural concepts such as distributed computing and multiprocessor/multimemory computer systems. The HYDRA operating system was designed for the C.mmp architecture (16 PDP-11 processors and 16 memory units) [Wulf 1974; Wulf 1975]. The CM* project [Swan 1977], among others, is developing an operating system for architectures involving large numbers of processors. The RSEXEC operating system developed by BBN for the DEC-10 computers on the ARPAnet [Thomas 1973] illustrates an operating system for a computer communication network.
There are currently at least three models for the organization of distributed systems.
Operating systems is a maturing field. Research has produced a body of knowledge for building useful, powerful operating systems for conventional machine architectures. New operating systems are being worked on every day. New operating systems are brought on-line at military installations monthly. Microcomputer and minicomputer manufacturers are developing operating systems for their new products. Mainframe manufacturers are producing new operating systems while enhancing existing systems.
This high level of activity reflects implementation efforts rather than new research. This activity emphasizes gradual change to existing systems; it avoids the revolutionary. Minicomputer vendors are modifying their systems to compete better with major products. They intend to develop operating systems well grounded in the existing research results. However, radical new concept bases are not in evidence.
There are relatively few operating systems design or implementation projects at present in the research community. The major reason for this is the lack of facilities to support research-oriented implementation projects in universities and research laboratories. There is presently little motivation for universities to undertake large research projects on conventional architectures. However, future research on reliability could quickly renew the motivation for experimental implementations.
Another reason for the few operating systems research projects is the changing set of research topics. Classical operating systems research has concentrated on mechanisms to solve problems such as memory management and resource scheduling. Such mechanisms have been developed and installed in conventional architectures. The important current problems for conventional systems deal with policy: how best to use the existing mechanisms. An illustration of this is the problem of functionality, which asks: what kinds of abstract machines should be created for users? We have yet to see serious efforts to characterize the functions which will allow the most efficient execution of a given workload.
Despite such constraints, the research community will be active. Among its concerns will be the long-neglected user interface. Standard operating system facilities will be accessible by language notations rather than as arcane supervisor calls, and more semantic concepts will be supported efficiently by the operating system. Severe structural constraints and greater type-checking will produce more understandable programs and permit better documentation and maintenance.
Research will continue on the problems of constructing reliable, correct, fault tolerant systems, including methods for constructing programs which are convincingly correct before being put into operation. These methods will stress integration of language facilities and the operating system. Users will need to accept constraints on their freedom to produce unreliable software, but will find that reliable systems are also secure systems.
Research in secure computing will also continue. It will focus on auditing and detecting suspicious patterns of use; on practical methods of shielding small data bases from compromise under statistical queries; and on encryption of data in detachable storage or in transmission on communication networks.
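One of the practical shields mentioned above, protecting small data bases from compromise under statistical queries, can be illustrated by the query-set-size restriction: refuse any statistical query whose query set is too small, or too close to the whole data base, since such answers can be combined to reveal individual records. The threshold and data below are illustrative:

```python
K = 3   # illustrative minimum query-set size

def safe_count(records, predicate, k=K):
    """Answer a statistical COUNT query only if the query set is
    neither too small nor too close to the whole data base."""
    n = sum(1 for r in records if predicate(r))
    if n < k or n > len(records) - k:
        raise PermissionError("query set too small or too large")
    return n

salaries = [("ann", 40), ("bob", 55), ("cho", 62), ("dee", 48),
            ("eli", 71), ("fay", 59), ("gus", 44), ("hal", 66)]

print(safe_count(salaries, lambda r: r[1] > 50))   # 5 matches -> allowed
```

A query that singles out one individual (e.g., matching only "ann") is refused, though research of the period showed that size restriction alone does not defeat sequences of overlapping queries, which is why this remains an active problem.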
New architectures will be respectable subjects of operating systems research. The decreasing cost of hardware allows serious consideration of approaches which previously were discarded as being too expensive. Cheaper hardware will enable dedicating processors and memory units to specific functions, thereby mitigating control problems created by sharing limited hardware resources. Sharing will continue to be important, but the focus will be on sharing information rather than hardware.
Microprocessors will open new applications involving computer controlled systems and distributed information. In such dedicated systems, operating system software will tend to merge with programming language and application software ("vertical integration"), introducing new implementation and structuring problems.
The possibility of dedicating hardware resources to specific functions will mean that functional modules of the system will interact only via the communication links among them. Predicting throughput and response time will be much more reliable than it is today, and queueing models will play a stronger role in designing systems. We will thus see significant progress toward "guaranteed performance systems".
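The role foreseen for queueing models can be made concrete with the simplest case: a dedicated functional module approximated as an M/M/1 server, for which mean response time has the closed form R = 1/(mu - lam), with arrival rate lam and service rate mu (lam < mu). The numbers below are purely illustrative:

```python
def mm1_response_time(lam, mu):
    """Mean response time of an M/M/1 server: R = 1 / (mu - lam)."""
    if lam >= mu:
        raise ValueError("unstable queue: arrival rate >= service rate")
    return 1.0 / (mu - lam)

# A module serving 8 requests/s with 6 requests/s arriving:
print(mm1_response_time(6.0, 8.0))   # -> 0.5 seconds
```

Because modules interacting only over communication links have well-defined arrival and service streams, such formulas give the kind of throughput and response-time predictions that "guaranteed performance systems" would require.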
Decentralization -- distributed processing -- will be a major theme of future operating systems. The greater need for communication among computers will inspire new approaches to sharing information and synchronizing processes. Decentralization will also direct more attention toward data flow concepts -- languages and hardware. Multiprocess systems with many processors and networks of computer systems will create new problems for the research community to solve.