**Prerequisites** Students should feel comfortable using computers. A rudimentary knowledge of programming language concepts and electrical fundamentals (8.02) is assumed. Each student must have an Athena account to access the software used to complete the lab assignments. Lectures TR 1 - 2 in 32-123 ## Recitation Sections | # | Time | Room | Instructor | |----|-------|--------|------------| | 1 | WF 10 | 26-322 | Brad | | 2 | WF 11 | 26-322 | Brad | | 3 | WF 11 | 34-303 | Silvina | | 4 | WF 12 | 34-303 | Silvina | | 5 | WF 12 | 34-304 | Li-Shiuan | | 6 | WF 1 | 34-304 | Li-Shiuan | | 7 | WF 1 | 34-303 | Chris | | 8 | WF 2 | 34-303 | Chris | | 9 | WF 2 | 36-155 | David | | 10 | WF 3 | 36-155 | David | | | | | | (Final assignments will be made after the first lecture) Section assignments are made by the registrar; please plan to attend your assigned section. If extraordinary circumstances make it impossible to attend the section you have been assigned, you may request a section reassignment by emailing 6004-headta@csail.mit.edu a request to be reassigned to a new recitation time. ### Staff | Duties | Name | Email<br>(@mit.edu) | Office | Phone | |---------------------------------------|---------------------------|-----------------------|------------|---------| | Lectures | Steve Ward | ward | 32-G786 | x3-6036 | | Head TA | Eben Kunz | ekunz@mit.edu | | - | | Admin<br>Asst | Cree Bruins | cbruins@csail.mit.edu | 32-G784A | x3-2629 | | + + + + + + + + + + + + + + + + + + + | Chris Terman | cjt | 32-G790 | x3-6038 | | R R | Li-Shiuan Peh | peh | za dala zd | | | Recitations | Silvina Hanono<br>Wachman | silvina | - 1010 | | | yk yyla | David Crowell | dcrowell | | | | 1116.2 | Brad Gaffney | bgaffney | | | | LAs | Becky Bianco | renminbi | <br> | |-----|--------------|----------|------| | | Daniel Gray | haiiro | <br> | The Head TA is in charge of most 6.004 operational issues, including section assignments. You can email the 6.004 Head TA at 6004-headta@csail.mit.edu. **Text** There is no required text for the course this semester. Readings for some of the course material will be available on-line. Handouts On-line versions of the handouts (in PDF format) can be found at this website. Problem sets There are no weekly graded problem sets. Instead there are on-line tutorial problems with answers you can use to test your understanding of the material. The WF recitations give you a chance to work on these problems with the help of the course staff and to ask any questions that you may have. **Collaboration** The assignments are intended to help you understand the material and should be done individually. You are welcome to get help from others but the work you hand in must be your own. Copying another person's work or allowing your work to be copied by others is a serious academic offense and will be treated as such. We do spot-check submissions to the on-line checkoff system for infractions of the collaboration policy. So please don't tempt fate by submitting someone else's work as your own; it will save us all a lot of grief. Labs There are eight lab assignments due at various times during the term and an optional design project at the end of the term. Completing each part of a lab earns points that count toward your final grade. Points are determined during a short interview about each with a member of the course staff. Note that you can submit your work for a lab more than once, for example, as you complete each part. After completing the work on some of the labs, you'll be presented with some on-line lab questions to answer (these are different than the tutorial questions mentioned above). And you'll need to schedule a short lab checkoff meeting for each lab with a member of the course staff. This meeting can happen after the lab's due date but to receive full credit it must be completed within one week of the due date. You must have a non-zero score for each required lab and all on-line lab questions must be checked-off as a prerequisite for passing the course. A missing required lab (i.e., a lab with a score of 0) will result in a failing grade; incompletes will not be given for unfinished laboratory work. The lab gets crowded just before an assignment is due so plan accordingly. The lab will be staffed by the course staff during the late afternoon and evening M through R, and during the afternoon on F. The 6.004 lab is located in 32-083 and is open 24 hours a day, 7 days a week. An access code is required for entry; it will be given out during lecture. The lab offers Linux-Athena workstations that can be used to complete the homework assignments. It is also possible to complete the assignments using your own computer: the lab software is written in Java and runs in Sun's Java 2 Standard Edition (J2SE) environment (see <u>Courseware</u> for details). The lab serves as a meeting room for 6.004 students and staff; staff members keep their office hours in the lab, whose schedule is posted elsewhere on the 6.004 web site. Late policy for labs: The on-line system will give you 50% of any points earned for submissions after the due date. So if your first submittal is late, you get 50% of the points. But if you submitted on-time for 15 points, and then late for 25 points, you'll get 20 points total for the lab. Note that points reported by JSim/BSim at check-in are for on-time submittals; you can check your on-line status page to see how many points count toward your total. This will be reported as "0" until you complete your checkoff meeting. Late policy for checkoffs: The on-line system will allow only 50% of any points earned by your lab (including any late penalties you incurred) if you don't complete your checkoff before the checkoff deadline. So if you miss both the lab deadline and the checkoff deadline, you'll only get 25% of the total points. Quizzes There are 4 fifty-minute, closed-book quizzes. The questions will be similar (perhaps identical!) to the <u>tutorial problems</u> and will ask you to provide short, written answers and/or explanations. The quizzes are scheduled roughly every three weeks during recitation: | Quiz | Date given | Deadline for grade corrections | |--------|------------|--------------------------------| | Quiz 1 | Fri, 9/23 | Wed, 10/5 | | Quiz 2 | Fri, 10/14 | Wed, 10/26 | | Quiz 3 | Fri, 11/4 | Wed, 11/16 | | Quiz 4 | Fri, 12/2 | Wed, 12/14 | To ensure everyone has a seat, please attend your assigned section on quiz days. If <u>exceptional</u> circumstances make it impossible to take a quiz at your assigned time, please contact your recitation instructor *before* the quiz to see if other arrangements can be made. Requests for make-ups after the quiz has been given are unlikely to be successful. There is no final exam. Grading The final grade is determined by performance on the quizzes (30 points/quiz, 120 points total), the labs and the design project (90 points total). In addition, you must have a non-zero score for each of the required labs and all the on-line lab questions must be checked off as a prerequisite for passing the course. A missing required lab will result in a failing grade; incompletes will not be given for unfinished laboratory work. Once your combined score has been computed as explained above, here's how grades will be assigned: | Grade | Requirement | |----------------------------------------|--------------------------| | A | 175 ≤ total points | | В | 155 ≤ total points < 175 | | С | 135 ≤ total points < 155 | | D | 115 ≤ total points < 135 | | F total points < 115, missing required | | ## 6.004 at a glance: Fall 2011 | Tue | Wed | Thu | Fri | | |--------------------------------------------------------------------------------------|---------------------------|-----------------------------------------------------------------------------------------|--------------------------|--| | Registration Day | NO CLASS | L1 Sep 08 Course overiew & mechanics. Basics of information. | R1 Sep 09 | | | L2 Sep 13 Digital abstraction, combinational logic, voltage-based encoding. | R2 Sep 14 | L3 Sep 15 CMOS technology; gate design; timing | R3 Sep 16 | | | L4 Sep 20<br>Canonical forms; synthesis, simplification | Holiday | L5 Sequential logic. Lab 1 (CMOS) due | R4 Sep 23<br>QUIZ 1 | | | Sep 27 Storage elements, finite state machines. | R5 Sep 28 | L7 Scp 29 Synchronization, metastability. Lab 2 (Adder) due | <b>R6</b> Sep 30 | | | L8 Oct 04 Pipelining; throughput and latency. | R7 Oct 05 | Case study: multipliers. Lab 3 (ALU) due | R8 Oct 07 | | | Columbus Day | R9 Oct 12 | L10 Oct 13 Models of computation, programmable architectures. | R10 Oct 14 QUIZ 2 | | | L11 Oct 18 Beta instruction set architecture, compilation. | R11 Oct 19 | L12 Oct 20 Machine language programming issues. Lab 4 (TM) due | R12 Oct 21 | | | L13 Oct 25 Stacks and procedures. | R13 Oct 26 | L14 Oct 27 Beta implementation. Lab 5 (Assy Lang) due | R14 Oct 28 | | | Multilevel memories; locality, performance, caches | R15 Nov 02 | L16 Nov 03 Cache design issues | R16 Nov 04 QUIZ 3 | | | L17 Nov 08 Virtual memory: mapping, protection, contexts | R17 Nov 09 | L18 Nov 10 Virtual machines: timesharing, OS kernels, supervisor calls Lab 6 (Beta) due | Veteran's Day | | | L19 Nov 15 Devices and interrupt handlers, preemptive interrupts, real-time issues | R18 Nov 16 | L20 Nov 17 Communication issues: busses, networks, protocols Lab 7 (Trap Handler) due | R19 Nov 18 | | | L21 Nov 22 Communicating processes: semaphores, synchronization, atomicity, deadlock | R20 Nov 23 | Thanksgiving | | | | L22 Nov 29 Pipelined Beta implementation, bypassing | R21 Nov 30 | L23 Dec 01 Pipeline issues: delay slots, annulment, exceptions Lab 8 (Tiny OS) due | R22 Dec 02 | | | Parallel processing, shared memory, cache coherence, consistency criteria | R23 Dec 07 | L25 Dec 08 Wrapup Lecture! | R24 Dec 09 <b>QUIZ 4</b> | | | No lecture | No recitation PROJECT due | Finals - 6.004 is over! | | | generated by glancegen.py 8/30/2011 SAW 9/8 ## Welcome to 6.004! Handouts: Lecture Slides, Calendar 6.004 - Fall 2011 9/8/11 modified 9/5/11 12:01 LO1 - Basics of Information 1 ## Course Mechanics ## Unlike other big courses, you'll have NO evening quizzes NO final exam NO weekly graded problem sets ## Instead, you'll face Repository of tutorial problems (with answers) FIVE quizzes, based on these problems (in Friday sections) Lextra ciedit EIGHT labs + on-line lab questions + Design Contest (all labs and olqs must be completed to pass!) ${\sf ALGORITHMIC}\ assignment\ of\ your\ grade!$ 6.004 - Fall 2011 9/8/11 LO1 - Basics of Information 2 ## 6.004: Course Clickables ## What do we see? - Structure - hierarchical design: - limited complexity at each level - reusable building blocks - · Interfaces - Key elements of system engineering; typically outlive the technologies they interface - Isolate technologies, allow evolution - Major abstraction mechanism What makes a good system design? - "Bang for the buck": minimal mechanism, maximal function - reliable in a wide range of environments - accommodates future technical improvements I see a bug in 6.004 - Fall 2011 LO1 - Basics of Information 5 Whirlwind, MIT Lincoln Labs ## Our plan of attack... - Understand how things work, bottom-up - \* Encapsulate our understanding using appropriate abstractions - Study organizational principles: abstractions, interfaces, APIs. - \* Roll up our sleeves and design at each level of hierarchy - Learn engineering tricks - history - systematic approaches - algorithms - diagnose, fix, and avoid bugs 6.004 - Fall 2011 LO1 - Basics of Information 6 First up: INFORMATION If we want to design devices to manipulate, communicate and store information then we need to quantify information so we can get a handle on the engineering issues. Goal: good implementations - ·Easy-to-use - ·Efficient - ·Reliable - ·Secure ٠... - ·Low-level physical representations - · High-level symbols and sequences of symbols had teu+ oce memor LO1 - Basics of Information 7 What is "Information"? Claude Shannan Formalized defin information, n. Knowledge communicated or received concerning a particular fact or circumstance. Pats WON their 18th game Tell me something new. "Really, NO FINAL!" Information resolves uncertainty. Information is simply that which cannot be predicted. The less predictable a message is, the more information it conveys! LO1 - Basics of Information & 6004 - Fall 2011 ## Quantifying Information (Claude Shannon, 1948) Suppose you're faced with N equally probable choices, and I give you a fact that narrows it down to M choices. Then I've given you Information is measured in bits (binary digits) = number of O/1's required to encode choice(s) log<sub>2</sub>(N/M) <u>bits</u> of information Examples: - information in one coin flip: $log_2(2/1) = 1$ bit - roll of 2 dice: log2(36/1) = 5.2 bits - outcome of a Red Sox game: 1 bit (well, actually, are both outcomes equally probable?) (Thy-still ned to know who actually won) 5.004 - Fall 2011 Loh vailable/huffman tree Need milliple games for Fixed-length encodings If all choices are equally likely (or we have no reason to expect otherwise), then a fixed-length code is often used. Such a code will use at least enough bits to represent the information content. ex. Decimal digits $10 = \{0,1,2,3,4,5,6,7,8,9\}$ 4-bit BCD (binary coded decimal) $\log_2(10) = 3.322 < 4bits$ ex. ~86 English characters = $\{\text{A-Z}\,(26),\,\text{a-z}\,(26),\,\text{O-9}\,(10),\,\,\text{punctuation}\,(11),\,\text{math}\,(9),\,\text{financial}\,(4)\}$ 7-bit ASCII (American Standard Code for Information Interchange) $\log_2(86) = 6.426 < 7bits$ 6.004 - Fall 2011 9/8/11 LO1 - Basics of Information 11 ## Encoding - Encoding describes the process of assigning representations to information - Choosing an appropriate and efficient encoding is a real engineering challenge - Impacts design at many levels - Mechanism (devices, # of components used) - Efficiency (bits used) - Reliability (noise) - Security (encryption) Next lecture: encoding a bit. What about longer messages? 04-Fall 2011 Loday LO1 - Basics of Information 10 ## Encoding numbers It is straightforward to encode positive integers as a sequence of bits. Each bit is assigned a weight. Ordered from right to left, these weights are increasing powers of 2. The value of an n-bit number encoded in this fashion is given by the following formula: # Haw to represent (-) # Signed integers: 2's complement 8-bit 2's complement example: $$11010110 = -2^7 + 2^6 + 2^4 + 2^2 + 2^1 = -128 + 64 + 16 + 4 + 2 = -42$$ If we use a two's complement representation for signed integers, the same binary addition mod 2" procedure will work for adding positive and negative numbers (don't need separate subtraction rules). The same procedure will also handle unsigned numbers! By moving the implicit location of "decimal" point, we can represent fractions too: $1101.0110 = -2^3 + 2^2 + 2^0 + 2^{-2} + 2^{-3} = -8 + 4 + 1 + 0.25 + 0.125 = -2.625$ 6.004 - Fall 2011 9/8/11 LO1 - Basics of Information 13 ## When choices aren't equally probable When the choices have different probabilities ( $p_i$ ), you get more information when learning of a unlikely choice than when learning of a likely choice Information from choice<sub>i</sub> = $log_2(1/p_i)$ bits Average information from a choice = $\sum p_i log_2(1/p_i)$ best (ase #### Example | choice; | Pi | log2(1/p) | |---------|------|-----------| | "A" | 1/3 | 1.58 bits | | "B" | 1/2 | 1 bit | | "c" | 1/12 | 3.58 bits | | "D" | 1/12 | 3.58 bits | Average information - = (.333)(1.58) + (.5)(1) - + (2)(.083)(3.58) - = 1.626 bits Can we find an encoding where transmitting 1000 choices is close to 1626 bits on the average? Using two bits for each choice = 2000 bits 6.004 - Fall 2011 9/8/11 LO1 - Basics of Information 14 ## Variable-length encodings (David Huffman, MIT 1951) Use shorter bit sequences for high probability choices, longer sequences for less probable choices | choice; | Pi | encoding | |---------|------|----------| | "A" | 1/3 | 11 | | "B" | 1/2 | 0 | | "c" | 1/12 | 100 | | "D" | 1/12 | 101 | | В | С | Α | В | A | D | |----|-----|----------|-----|----|----| | 0. | 100 | 11 | 01 | 1- | 10 | | | 0 | ٨ | 1 | | | | | | | X | | | | | В | 9 | / \ | 1 | | | | 0/ | $\wedge$ | 1 | A | | | | Ċ | | ľ | , | • | | | | | | | | Average information = (.333)(2)+(.5)(1)+(2)(.083)(3)= 1.666 bits Transmitting 1000 choices takes an average of 1666 bits... better but not optimal Huffman Decoding Tree To get a more efficient encoding (closer to information content) we need to encode sequences of choices, not just each choice individually. This is the approach taken by most file compression algorithms... Over the block - like ZIP LOT-Basics of Information 15 ## Data Compression Key: re-encoding to remove redundant information: match data rate to actual information content. "Outside of a dog, a book is man's best friend. Inside of a dog, its too dark to read..." -Groucho Marx Ideal: No redundant info - Only 4 MB unpredictable bits transmitted. Result appears *random!* LOSSLESS: can 'uncompress', get back original. A84b!\*m9@+M(p 6.004 - Fall 2011 9/8/1 LO1 - Basics of Information 16 ## "Able was I ere I saw Elba." \*1024 Compressed: 138 bytes 6.004 - Fall 2011 9/8/11 ## Is redundancy always bad? Simplifes tech Encoding schemes that attempt to match the information content of a data stream are minimizing redundancy. They are data compression techniques. "around on which the human hand of man has never before set foot" However, sometimes the goal of encoding information is to increase redundancy, rather than remove it. Why? - · Make the information easy to manipulate (fixed-sized encodings) - · Make the data stream resilient to noise (error detecting and correcting codes) Does recompression work? If ZIP compression of a 40MB Bible yields a 4MB ZIP file, what happens if we compress that? 6.004 - Fall 2011 ·Do we get a 400 KB file? · Can we compress that, to get a 40K file?? · Can we compress the whole Bible down to a single bit??? · IsitaOora 1???? redundancy! perfectly, the result has no Calmagolive Complexity Size of smallest Program to generate tree ## Error detection and correction Suppose we wanted to reliably transmit the result of a single coin flip: Heads: "O" Tails: "1" This is a prototype of the "bit" coin for Further suppose that during transmission a single-bit error occurs, i.e., a single "O" is turned into a "1" or a "1" is turned into a "O". LO1 - Basics of Information 20 6.004 - Fall 2011 101 - Basics of Information 19 The Hamming distance between a valid binary code word and the same code word with single-bit error is 1. The problem with our simple encoding is that the two valid code words ("O" and "1") also have a Hamming distance of 1. So a single-bit error changes a valid code word into another valid code word... 6.004 - Fall 2011 LO1 - Basics of Information 21 - bit errors # Increase hamming Error Correction J'estance to 101 "tails" J'e (10) 100 110 "tails" If D is the minimum Hammir distance between code words, we can correct up to By increasing the Hamming distance between valid code words to 3, we quarantee that the sets of words produced by single-bit errors don't overlap. So if we detect an error, we can perform error correction since we can tell what the valid code was before the error happened. - Can we safely detect double-bit errors while correcting 1-bit errors? - · Do we always need to triple the number of bits? LO1 - Basics of Information 23 ### Error Detection What we need is an encoding where a single-bit error doesn't produce another valid code word. If D is the minimum Hamming distance between code words, we can detect up to (D-1)-bit errors We can add single-bit error detection to any length code word by adding a parity bit chosen to quarantee the Hamming distance between any two valid code words is at least 2. In the diagram above, we're using "even parity" where the added bit is chosen to make the total number of 1's in the code word even. Can we correct detected errors? Not yet... 6.004 - Fall 2011 101 - Basics of Information 22 ## The right choice of codes can solve hard problems Reed-Solomon (1960) First construct a polynomial from the data symbols to be transmitted and then send an over-sampled plot of the polynomial instead of the original symbols themselves spread the information out so it can be recovered from a subset of the transmitted symbols. Particularly good at correcting bursts of erasures (symbols known to be incorrect) Used by CD, DVD, DAT, satellite broadcasts, etc. Viterbi (1967) A dynamic programming algorithm for finding the most likely sequence of hidden states that result in a sequence of observed events, especially in the context of hidden Markov models. Good choice when soft-decision information is available from the demodulator. Used by QAM modulation schemes (eg, CDMA, GSM, cable modems), disk drive electronics (PRML) 6.004 - Fall 2011 9/8/11 LO1 - Basics of Information 24 6004 - Fall 2011 ## Summary - · Information resolves uncertainty - · Choices equally probable: - · N choices down to M⇒ log<sub>2</sub>(N/M) bits of information - · use fixed-length encodings - · encoding numbers: 2's complement signed integers - · Choices not equally probable: - · choice, with probability $p_i \Rightarrow \log_2(1/p_i)$ bits of information - · average number of bits = $\sum p_i \log_2(1/p_i)$ - · use variable-length encodings - · To detect D-bit errors: Hamming distance > D - · To correct D-bit errors: Hamming distance > 2D #### Next time: - · encoding information electrically - · the digital abstraction - · combinational devices 6.004 - Fall 2011 9/8/11 LO1 - Basics of Information 25 Chris Terman lots of options 4 fidays - quittes on friday the much into ? Measure Roya (8) reasure rean nation to an nfolentropy in bits log ( Prob) infolentropy k bits > 2k different choices 4.1 bits & 5 bits if one bon it have multiple, can use variable-length encoding X = 8 bits 2° Choices for value = 256 Y 10 1 1 1 1 1 1 Hamming distance = 1 50 now narrow to 8 choices The into is lope $\log_2\left(\frac{1}{256}\right)$ = log 2 (25) = 5 6ils abstract measure Sometimes had to cationalize 6.004 is his far class Mulfman Code Want to encode Bad Dad Voing 1st tree 100 0 110 110 0 0 110 B A D D A DO Now deading 0 100 0 111 A B A E Short sequence = high probability Like P(A) = 5 P(B) = p(C) = P(D) = P(E) = 125 A M. 15 E 14 D .15 Brill Fæe | 9 | | | | |---|--------------------|-------------|----| | | Build From bottom | Up | | | | AI | Mex | | | | Then remains table | | | | | AII | 3 | | | | E, ( | 1 | | | | | 2 | | | | | 15<br>15 | | | | Êtc | 1. | ٠ | | | | AI.3<br>E.4 | | | | | 00.3 | | | | Etc | AIOU | 6 | | | AI OU | E | .6 | | | Finish | Tiee | l | | | E T O U | TIEE | · | | | ,, 100 | | | Optimal log2 (15) = 2.74 So our code for I letter is worse than dig (an Code words to get better What is any length of 600 char nessage 14.6.3 = $\frac{40 + 180}{2 \cdot 12} = 2 \cdot 2 \cdot \frac{bits/char}{char}$ 7 Oh no / 100 since length of whole nessage 220 bits Fixed Length Code 36it A 000 E 010 100 100 **Preface** 6.004 Compatational Structures This text approaches the field of digital systems architecture with a particular bias, one kindled during years of observation of M.T.T. students and speculation about the factors that distinguish outstanding engineers from the merely very competent majority. We bring to the project a conviction that the distinction reflects attitudes and perspectives that are at most subtle by-products of the usual technical education: subliminal lessons occasionally administered to students by virtue of accidental discovery rather than deliberate pedagogy. The organization and approach of *Computation Structures* is geared to the explicit cultivation of these characteristics in every student and reader. #### Scope and Mission As computer systems become increasingly complex, their study inevitably relies upon more specific areas of specialization. We compartmentalize specific technologies—logic elements, processors, compilers, operating systems—as subdisciplines to be treated by separate courses, texts, and intellectual tools. Interfaces between the subdisciplines are powerful engineering abstractions that allow, for example, the designer of a processor to deal entirely in the domain of digital logic, naively exploiting the electronic circuitry in the abstraction below and providing interpretive services to software in the abstraction above. The abstractions work sufficiently well that our curricula (and, for that matter, our job descriptions) accommodate specialists whose understanding of computer systems is cripplingly incomplete. We confer computer science degrees on theorists who may be helpless when programming is required, on hardware designers to whom the high-level structure of the system to which they contribute is mystery, and on programmers who lack a clear understanding of how or why their programs make real things happen. Often what passes for technical breadth takes the form of multiple specialties: Students exhibit pockets of local facility, unable to connect them into an effective understanding of the system as a whole. Such people can and do perform useful functions, but their potential for creativity is limited by their acceptance of the boundaries of their specialties. They shy from the challenge to venture beyond familiar approaches, to reshape problems, to develop new interfaces and revise existing ones; they accept the mysterious rather than unveiling it, oversensitized to their own limitations. They are, in effect, imprisoned by the abstractions they have been taught. Outstanding students, in contrast, somehow develop a perspective that illuminates the interfaces between technologies, rather than the technologies themselves, as the 9/10 most important structural elements of their discipline. They view the interfaces with respect but not reverence; they examine both sides of an abstraction, treating it as an object of study rather than a boundary between the known and the intractably mystical. They have discovered the power of their own universality; typically they have built complete systems, often with meager resources, and made them work. They have the sophistication, and most importantly the self-assurance, to explore new abstractions and to appreciate the role and costs of existing ones. Computation Structures is intended to produce renaissance engineers of this latter category. To this end, it deliberately telescopes much of computer science into a bottom-up progression that builds simple electronics into representative computer systems. The student is catapulted through one layer after another of implementation technology, an experience that accentuates technological interfaces and the structure induced by them. The breadth of coverage of the book reflects our compulsion to demystify its subject matter. We are unwilling to ask students to accept a technology on faith, or to postpone its penetration to some more advanced course. The spectrum of implementation technology is laid before them as a challenge to sample, to uncover, to discover their capacity to master its entirety. The ultimate mission of *Computation Structures* is not so much to convey facts as to catalyze personal discoveries by each student: the ability to master a wide range of technical details, the self-assurance to dive through a technological boundary, the power to grab a wire-wrap gun or CAD tool or body of code and create new structures. The considerable volume of technical detail is present largely to serve this goal; it is not the subject of the book, but rather a tool by which we convey its more important lesson. #### The Text Computation Structures does not presume to replace the many excellent texts that focus on established subtopics such as logic design or operating systems; nor is it intended for an elite audience capable of digesting four semesters of course work in a single term. It covers topics that every engineer—certainly every computer scientist—should be aware of, in sufficient detail to establish a concrete connection between each topic and practical reality. In many cases that connection is provided by the MAYBE computer, a simple microarchitecture that we use as a running example throughout the text. The MAYBE is presented in sufficient detail to allow its actual construction and operation, an activity required of M.I.T. students; even in the absence of such experience, it provides a subject for close scrutiny and a context for concrete discussions and questions. The text is introductory, presuming no formal background in digital systems; however, it generally assumes a level of technical sophistication consistent with that of an undergraduate engineering student. The text makes repeated connections to related technical areas. These should enrich the presentation for students with appropriate backgrounds, but are inessential to the sequel; for example, early chapters include a few circuit diagrams whose appreciation requires an Ohm's-law level of circuit sophistication. Programming ideas and constructs are introduced with a lack of fanfare that presumes some previous exposure to computers and programming on the part of the student. A subset of the C language, used for sample programs, is presented in a terse appendix; we have found that students easily attain a level of C fluency that allows them to read and understand the examples. The text does not make extensive use of real-world case studies, relying instead on the more coherent framework designed into our real but parochial example machines. Connections to the technical literature and to industrial practice are identified primarily in sections entitled Context; these provide pointers for deeper investigation of technical issues, links to adjacent fields, and occasional historical perspective. #### Role at M.I.T. Computation Structures is used at M.I.T. as the text for 6.004, a one-term, Shour-perweek sophomore "core" course required of all electrical engineering and computer science undergraduates. Three of the fifteen student hours are allocated to laboratory activities; the remainder are spent in classes and homework. Typical students have previously taken 6.001, a LISP-based programming course that uses Abelson and Sussman [1985] as a text, and 6.002, an introductory circuits course. The role of 6.002 is primarily to prepare the student for the 6.004 laboratory component, in which (for example) familiarity with an oscilloscope is assumed. 6.001 provides a first exposure to many ideas that recur in 6.004—programs, stacks, memory locations—albeit in a quite different context and often following a different idiom. 6.004 is a fast-moving course, stretching to accommodate the subject in its tightly packed single-term syllabus. Major topics are dealt with in about 25 lectures, which generally follow the order of the text and average one or two lectures per chapter. Substantial pruning is required to provide coherent if somewhat superficial coverage in lectures: While the major topics of the text are each dealt with in some depth, many of their ramifications and issues raised in the text are not addressed explicitly. Many of the optional (starred) sections are omitted, although the mix varies somewhat from one term to the next. The big picture emerging from the lectures is embellished by smaller section meetings held twice weekly, by homework, and by laboratory assignments. These components provide a vehicle for sampling underlying technical detail, stimulating each student to relate the lecture topic with at least one example of practical reality. The cultivation of this connection is a key element of the course and this text. Rather than presenting a stack frame in abstract terms, for example, we encourage each student to come to grips with the entire set of nested implementation technologies that make it real—machine language, microcode, circuit diagrams, logic gates. Frequent probes to concrete reality reinforce the two major lessons of the course: first, the value of abstractions as a tool for structuring complex systems; and second, the capacity of the individual student to master *any* aspect of a system he or she cares to focus on. #### Laboratory Work These key lessons of *Computation Structures* are reinforced by the laboratory component of the course. Each student is required to construct, debug, and program a working MAYBE computer from simple components (at the level of ROMs and registers). Several machine architectures—stack and general-register machines—are implemented and programmed using a common microarchitecture. oh ... 9 Preface xvii Students' computers are constructed from reusable take-home kits. The kits have a selection of logic components as well as integral prototyping board, power supply, and very primitive input/output provisions. Workstations (with a variety of support programs) and oscilloscopes are available in the laboratory. Although the wiring and much of the debugging can be performed by students at home, final checkout and subsequent program development require the use of facilities in the lab. The laboratory is structured as about eight assignments, each directed at a fairly tightly constrained set of goals. The directedness of the assignments differentiates the activity from more advanced "project" laboratories, which emphasize initiative and creativity; however, each assignment contains some nugget of creative challenge, such as the design of a machine instruction. A major creative outlet takes the form of an optional design contest, held at the end of each term, in which students are given a relatively free hand to modify their machines to improve performance on a benchmark program that is not revealed until the day of the contest. Each entrant is constrained to use only parts from the lab kit, although we allow n-student teams to combine n lab kits, normalizing the performance of such entries by a factor of n. (The best performance by this metric has come from two-kit entries.) In order to make the student computer construction tractable and reasonably failsafe, we have developed a set of printed-circuit modules (compatible with protoboard construction techniques) and a staged approach to construction whose early phases involve the use of a kernel subset of the machine to debug its incremental improvements. This methodology, involving the use of bootstrapping and scaffolding to develop complex systems, is among the major lessons of the course; it is one we have found no way to teach without hands-on experience. The integration of a "complete" laboratory with a broad-spectrum architecture course makes an ambitious package, but one that produces students whose technical maturity and self-assurance is based on that ultimate educator: They have each built a computer and made it work. We enthusiastically recommend the approach. While we value the *omniware* breadth of the laboratory, much of its pedagogy can be captured by somewhat less extreme programs. The majority of the laboratory activity involves coding at some level, and can be carried out effectively using simulation software rather than hardware implementations. Several simulators and related system software (available both for UNIX workstations and for PCs) provide a convenient development test bed for microcode and software, even where the alternative of a hardware MAYBE machine exists. Since the architectural personality of our machine is largely dictated by the contents of a control ROM, the range of architectural options that can be explored using reprogramming and simulation techniques is relatively unconstrained. An effective laboratory can thus be fashioned using only commonly available computing resources. In the absence of such resources, the complete implementations presented here can still be explored in a kind of ersatz laboratory whose medium is paper rather than machinery, that is, in assignments that probe, modify, and analyze the given structures. This approach was used at M.I.T. in years prior to the 6.004 laboratory; while the latter has clear advantages, a paper laboratory devoted to all levels of a single coherent example accomplishes many of the demystification and perspective- building goals of our current course. #### Alternative Paths through the Text The subject of the text lends itself well to subsetting by suppression of detail, and we expect individual courses and readers to deemphasize or omit selected subtopics to suit their needs and backgrounds. The (widely varying) dependencies among topics are mapped crudely by the following graph: Relatively inessential dependencies are shown as dotted edges. The level of detail shown in the diagram ignores certain subtleties, often bundling essential and optional issues together as single nodes. Some additional guidance to topic selection is provided by observing sections marked with an asterisk, indicating digressions that can be skipped without compromising subsequent (nonasterisked) coverage. Topic selection and path through the text may be adjusted to accommodate a number of different course needs. A two-term undergraduate sequence based on the text would allow careful and relaxed treatment of its entire coverage and, given $\Lambda$ $\Lambda$ sufficient student time, is an attractive alternative to the single-term firehose-style course offered at M.I.T. An introductory graduate course might augment the text by selected readings from the literature; candidates appear in the Context sections and in the bibliography. Certain of the exercises appearing at the end of each chapter are marked with one or two asterisks to indicate that they may be challenging for an introductory undergraduate course. A single asterisk suggests that the marked problem requires more than routine application of the principles of the chapter; double asterisks identify problems that should challenge graduate students. #### Acknowledgments Computation Structures reflects the ideas, influence, and efforts of literally dozens of people who have contributed to 6.004 and its ancestors over the years. This lineage begins with the seminal course 6.032, which was developed in the late 1960s by Jack Dennis and subsequently evolved under the auspices of Jon Allen and Bob Gallager. The book owes its name and much of its sense of direction to these early roots; it Preface conveys, to a considerable extent, lessons that have been taught its authors by their predecessors. The current course began to take shape during the spring of 1980, when the authors collaborated with Dave Reed in an effort to modernize 6.032 and its course notes. Subsequent years saw a succession of significant revisions to the course, each accompanied by a rewrite of the notes, exploring a variety of pedagogic formulae. The birth of the MAYBE in 1984 marks the point at which the book's current content and focus began to jell; the efforts of Dan Nussbaum and Chris Terman played a significant role during this formative period. Subsequent maturation of the laboratory work included substantial contributions to the MAYBE by Ziggy Blair, Ken Mackenzie, Joe Morgan, and Henry Houh. During this long history, the course and its notes were scrutinized by dozens of course staff and thousands of students, whose contributions to the surviving text are real but generally so diffused as to be unidentifiable. Exceptions, whose impact is more conspicuous, include Tom Anderson, Andy Ayers, Dave Goddeau, Mark Johnson, Rob Kassel, Ken Mackenzie, Rishiyur Nikhil, Dan Nussbaum, Jon Powell, Gill Pratt, Jerry Saltzer, Chris Terman, and John Wolfe. Valuable discussions, critique, and feedback on the text have been provided by Anant Agarwal, Bill Dally, Jacob Katzenelson, Jim Kirtley, Al Mok, Bruce Musicus, Steve Seda, Gerry Sussman, Rich Zippel, and many others. Particular credit is due John Wolfe for the massive effort he devoted to conversion of the source manuscript to TeX, to Eva Tervo and Sharon Thomas for the help they lent and the aggravation they sustained in preparing the manuscript, to Larry Cohen for his skilled and tireless editing, and to the Harris Corporation, whose HCX-9 was used to format the book with blazing speed. And to Don Knuth, that premier computer scientist whose contributions have been varied, prolific, and influential, we are indebted for TeX, which deserves simultaneous recognition as the text formatter of choice and the most idiosyncratic programming language known to us. It is impossible to account for the prenatal debt incurred by the book to the authors' wives, Debbie and Louise, whose loving encouragement nursed it through its long gestation. The authors are grateful to M.I.T. for providing an environment that led to *Computation Structures*, to the Department of Electrical Engineering and Computer Science for support of the course development surrounding the text, and to Hewlett-Packard and Tektronix for their donations to the 6.004 laboratory. Finally, a decade of 6.004 and 6.032 students deserve sympathy and thanks for their role as subjects of a long succession of educational experiments as well as credit for the result. They, after all, are more than an influence; they are the reason. The key to the orderly design of complex systems, digital or otherwise, lies in the decomposition of their function into modules whose behavior can be independently and concisely specified. To encourage this functional modularity and simplify the specifications for each module, we adopt a variety of engineering disciplines and agree to abide by the constraints they impose; in programming, for example, conventions for the representation of data and for passing parameters are typical self-imposed constraints. Each such discipline involves a set of primitive elements, each performing some prescribed basic function, together with rules for the construction of new elements by the composition of existing ones. The power of an engineering discipline derives from its simplification of the functional specifications of each module by the abstraction of essential function from nonessential detail; the description of a square-root procedure, for example, must confront the arithmetic relationship between its input and output but need not deal with the patterns of bits used to represent them. We can observe two broad classes of activities in the structuring of complex systems. The first and more common of these involves working within a single discipline, enriching its repertoire by combining existing functions to define new ones. The appropriate software analogy is the construction, within the framework of a particular programming language, of a library of procedures to perform various useful functions. The second and more radical structuring activity involves the use of the modules of one discipline to define and support a new, higher-level abstraction with its own primitive elements and composition rules. In software design, such radical structuring typically involves the use of a lower-level language (together with its library of support procedures) to implement a higher-level language whose primitives and composition rules are in some (perhaps application-dependent) sense more powerful. The major characteristic of the new level is that it isolates its user from the underlying implementation technology, as opposed to simply adding new options to it. Typically the structure of complex systems shows an alternation between these two activities. Functional extensions are made to some discipline until its complexity becomes unwieldy, at which point it is used to support a new abstraction with a manageable number of higher-level primitives and composition rules. Such abstractions and the disciplines that support them are the central theme of computer science. Every significant digital system involves many levels of discipline, each implementing a more powerful set of primitive elements and composition rules than the ones that underlie it. This chapter presents the fundamental abstraction that bridges the gap between the domain of the circuit designer (whose primitive elements are electronic components) and that of the computer architect. #### 1.1 Information and the Digital Abstraction The primary use of electronic devices is the processing of information. Major appliances such as televisions and computers serve to translate information from one format to another—from an encoded radio frequency input to an output picture, or from a string of input symbols to a corresponding string of output symbols. Each such complex information processor is synthesized from components that transform, combine, store, and manipulate representations of information. Thus we can dissect a television and identify the information-processing functions of its major subsystems, specifying the way in which the input and output information of each is encoded into electronic signals. We might further scrutinize each subsystem, documenting the function performed by each component in information-handling terms, down to the level of fundamental circuit elements. A capacitor, for example, is an energy storage device; in typical applications, its stored energy may be viewed as the electronic representation of stored information (such as elapsed time, in an RC timer circuit). While the electronic representations of information in a television typically involve continuous variables (such as voltages or frequencies), the symbolic information processed by a computer consists of discrete units such as binary digits. Digital systems, in general, are based on technology for the electronic representation of discrete information units; they offer the important advantage that the information content of each electronic signal is easy to quantify. They illustrate a powerful and important engineering principle: the pursuit of simplicity through constraint. Digital engineering involves a self-imposed design discipline that allows systems to be analyzed in the simple, abstract domain of logic rather than the vastly more complicated domain of underlying electronic principles. The advantages and mechanism of this abstraction are explored briefly in the following paragraphs. The amount of information carried in a discrete-valued signal s may be defined as $\log_2 N_{\rm v}$ , where $N_{\rm v}$ is the number of distinct values of s that can be reliably set and measured. It is conventional to take the logarithm in base 2 and express the result in bits, or binary digits, of information. We might propose, for example, to communicate each decimal digit d by means of a d-volt signal on a particular wire, so that $N_{\rm v}$ is 10. If our measurement technology is accurate enough to distinguish reliably between the ten values, the information conveyed by each signal is $\log_2 10 = 3.322$ bits. If our measurement tells us only whether or not the signal is below 5 V, however, $N_{\rm v}$ is 2 and the amount of information conveyed is $\log_2 2$ or 1 bit. A single bit is the minimum amount of information necessary to distinguish reliably between two values; it is the smallest convenient quantum for discrete information log 2 to get bits <sup>&</sup>lt;sup>1</sup> There are situations in which this view is inappropriate, such as applications in which a capacitor is used as a storage medium for energy to power other devices. dh ... representations and serves as the basic unit of information storage and communication in most digital systems. It is noteworthy that 1 bit is exactly the information content of a single digit in a binary number system, which makes binary (rather than, say, decimal) an attractive and popular choice for the representation of numbers in digital systems. Electronic parameters such as voltages are, to a good first approximation, continuous variables; we quantify them by real numbers rather than by integers. Each such parameter can in theory assume infinitely many distinct values, even over a bounded range; a voltage between 0 and 1 V may be 0.23 V or 0.6713 V or 0.6713456278 V. If we were able to set and measure such a voltage exactly, it would carry an infinite amount of information (since $N_v$ would be infinite). Of course, noise and other physical realities limit our ability to constrain and measure physical parameters. A measurement of 0.23 V indicates that 0.23 V is a more likely value than, say, 0.3 V, which in turn is more likely than 0.4 V. The actual amount of information conveyed by our measurement is awkward to quantify exactly; it depends on the accuracy and reliability characteristics of the measuring device as well as electrical noise and other detriments to the validity of the signal itself. A typical signal level might carry 10 or 12 bits of useful information, on the average, with moderate reliability. Such representations are ideally suited to applications (such as television) involving large volumes of information flow and in which occasional errors are not catastrophic. The synthesis and analysis techniques used in these situations typically assume that each signal represents the corresponding real number to an acceptable accuracy, thus avoiding the need for precise quantification of its information content. ### 1.2 Representation of Discrete Variables In many applications, it is convenient to use components whose behavior can be simply and precisely characterized in informational terms. A printer device connected to a computer can take as input the representation of a character to be printed, from an alphabet of, say, 128 possible characters. We would like the communication from the computer to the printer to be highly reliable: The printer must be able to determine precisely the character to be printed from the signals it receives. The amount of information conveyed to the printer is thus $\log_2 128 = 7$ bits for each character printed. The communication technique used will of course involve wires carrying continuously variable signals (such as voltages), but to reach our reliability goals we may be willing to sacrifice much of the information-carrying potential of each signal. A typical parallel printer interface, for example, involves seven independent wires, each carrying 1 bit of information encoded as a voltage. Each wire selects one of two alternative values (say, a 1 or 0 as the value of a corresponding binary digit), and the seven-wire aggregate thus identifies one of $2^7 = 128$ possible characters. The reliable translation between a discrete variable such as a binary digit and its representation as the approximate value of a continuous variable such as a voltage provides the key to what we call the *digital abstraction*. It allows us to use conventional circuit elements to build a family of *digital devices* whose information-processing characteristics can be specified as a simple logical function involving discrete inputs Oh that is what that mean! The Digital Abstraction as long as reliable -don't care about it ASCIT 3 #### 1.3 Combinational Devices The simplest and most fundamental abstraction in the repertoire of the digital engineer is the *combinational device*, which we formalize as follows: A combinational device is a circuit element having the following properties: - one or more discrete-valued input terminals; - · one or more discrete-valued output terminals; - a functional specification, detailing the value of each output for each of the possible combinations of input values; and - a timing specification, consisting (at minimum) of an upper bound $t_{\rm pd}$ on the time required for the device to compute the specified output values from an arbitrary set of input values. The usual interpretation of the *propagation delay* $t_{\rm pd}$ is that whenever a particular combination of input values is applied to the device and maintained for at least $t_{\rm pd}$ seconds, a corresponding set of output values will appear. Moreover, these output values will remain at the output terminals (at least) until the inputs change. Thus a set of input values applied for t seconds results in corresponding output values for a period of at least $t-t_{\rm pd}$ seconds. Note that $t_{\rm pd}$ is a *maximum* time required for new input values to be reflected at the output terminals; in general, the *minimum* such time is assumed to be zero.<sup>2</sup> One result of the latter assumption is that output values are contaminated immediately by any change at the inputs, regardless of $t_{\rm pd}$ . An important feature of combinational devices is the simplicity with which they may be interconnected to synthesize new combinational functions. In particular, new combinational devices may be created by combining combinational elements in acyclic circuits, so long as care is taken not to connect outputs together. More precisely, we can construct a combinational device by exploiting the basic rule of composition: **Combinational composition** A circuit is a combinational device if it consists of interconnected circuit elements such that · each circuit element is itself combinational, more hardwarey Speck. 2 <sup>&</sup>lt;sup>2</sup> A refinement of this assumption, in which a nonzero contamination delay becomes part of the device specification, is occasionally used in the design of performance-critical circuitry. - every node of the circuit is either designated as an input to the circuit or connects to exactly one output terminal of a circuit element, and - the circuit contains no directed cycles (that is, every path through the circuit that traverses elements only in the input-to-output direction visits each circuit node at most once). (Special memory -diff process for formula to actually memorize) Any of the circuit nodes may be designated as outputs of the new device. It is easy to verify that this construction yields devices that conform to our combinational criteria. The acyclic constraint allows us to proceed systematically from the circuit's input terminals through successive circuit elements, assigning to each node both a functional specification (as a function of the input values) and a time bound. The functional specification of each noninput node is derived from the specification of the element driving that node together with the functional specification assigned to that element's inputs; similarly, the propagation delay associated with such a node is computed by adding the element's $t_{\rm pd}$ to the maximum of the delays associated with its input nodes. In general, $t_{\rm pd}$ for the constructed device becomes the maximum cumulative propagation delay encountered on a path from any input to any output. #### 1.4 The Static Discipline: Logic Levels Kinda & The fact that we can synthesize arbitrarily complicated combinational devices using acyclic circuits of combinational elements gives us a bootstrapping technique for extending the usefulness of a primitive initial set of such devices; this is vaguely analogous to the induction step of an inductive proof. A remaining task is the development of a basis—an initial set of combinational elements from which we can synthesize more ambitious ones. Clearly we must begin with at least one combinational device that is defined in terms of technologies other than combinational devices. It is in the design of these basic elements that fundamental decisions regarding the representation of the discrete values of our abstraction are confronted. 6.02 Suppose, for example, that we are to represent a binary (or logical) variable d using a voltage v that can be varied between 0 and 5 V. We might choose to represent d=0 by a voltage of 0 and d=1 by 5 V. The efficacy of this representation then depends on our ability to set and measure v with sufficient reliability that the representation of d=0 is never mistaken for the representation of d=1 and vice versa, a reasonably easy demand to meet. We can afford to be fairly sloppy about setting and measuring v and still deduce the correct value of d. If we guarantee that v will be set to a value below 1 V when d=0 and above 4 V when d=1, and if we ensure that our measurement of v will be accurate to within 0.5 V, we shall always measure a voltage greater than 3.5 V when d=1 and below 1.5 V when d=0. We might distinguish between the two cases by comparing the measured voltage with a 2.5-V threshold voltage $V_t$ and assuming a logical 1 as the value of d if the measured value is above the threshold and a logical 0 if it is below. Figure 1.1 shows the mapping of the continuous variable v onto the discrete (binary digit) d. If the measured voltage is near or at the 2.5-V threshold, we will be unable Figure 1.1 Mapping of logic levels to voltage. to distinguish reliably between the two values of d. We avoid this embarrassing circumstance by outlawing it; we designate the region between 1.5 V and 3.5 V as a forbidden zone that does not correspond to the valid representation of a binary variable. Since we can reliably discriminate between voltages below 1.5 V and those above 3.5 V, adhering to the discipline that v must be set above 4 V or below 1 V ensures that forbidden values are avoided. In fact, we have allowed an extra half-volt margin for error on each side of the forbidden zone; these noise margins allow our scheme to function in the presence of a certain amount of noise or other unanticipated error. In general, the reliable representation of discrete variables using continuously variable physical quantities requires such ranges of excluded values between valid representations. This gives rise to the possibility of *invalid signals* that correspond to none of the logical values; a wire carrying $V_{\rm t}$ represents neither a logical 1 nor a logical 0 by the convention outlined above. Such invalid signals are foreign to the digital abstraction and are unaccounted for in the analysis and synthesis methodologies commonly used by the digital engineer. In order to protect digital designers from unanticipated invalid signals in their circuits, logic elements are commonly designed to conform to a *static discipline*: how to ensure i A static discipline is a guarantee that, given logically valid inputs, every circuit element will (after an appropriate propagation delay) deliver logically valid outputs. The interpretation of "logically valid" is relative to a particular representation convention, which in turn varies with the implementation technology. Typically, however, a single representation convention applies throughout an entire digital system. A static discipline offers a measure of assurance that invalid signals will not arise spontaneously in a well-designed combinational circuit: If the inputs to each device are well behaved, each device's outputs will also be well behaved after some bounded delay. Noise margins amount to imposing a more stringent validity requirement on output signals from a device than apply to its inputs, ensuring that marginally valid input signals are restored to unquestionable signals at the output. The threshold $V_{\rm il}$ represents the highest input voltage that must be interpreted as a logical 0; it is higher than $V_{\rm ol}$ , the highest output voltage that can be asserted to indicate a 0. This ensures that every valid output 0 will be interpreted as a 0 at connected inputs, even after some degradation. A similar relationship holds between $V_{\rm ih}$ (the lowest input representing a valid 1) and $V_{\rm oh}$ (the lowest output asserted to indicate a 1). #### 1.5 Rate of Information Flow The representation scheme we have developed provides for the *static* transmission of a fixed amount of information—a single bit—as an electric signal over a wire. In general, we expect this value to change from time to time; if it *never* changes, as would occur if the value communicated is a constant, the need for the wire should be questioned! Thus, while at any instant a signal might convey only a single bit of information, it typically carries a *sequence* of bits over a period of time. If we make provisions for changing a signal to represent a new bit of information every second on the second, that signal may be viewed as carrying a *flow* of information at the rate of 1 bit/s; it could be used to transmit a 100-bit binary number in 100 s. Any scheme we choose for communicating discrete variables places practical limits on the frequency with which we can change their values and thereby constrains the rate of information flow. Rates of information flow in digital systems are often specified in *baud*, or value transitions per second, a parameter that is constrained by the communication *bandwidth* of the underlying (analog) communication medium. A fundamental theorem by Nyquist places an upper bound on a communication channel's useful baud rate of twice its bandwidth, even in the absence of noise. In the case of binary representations, where each value is a binary digit, 1 baud is equivalent to 1 bit/s. Thus the theoretical maximum rate of information flow over a 3000-Hz voice-grade line carrying binary values as two different voltage levels is 6000 bits/s. This limit can be exceeded, in theory, by the use of more than two discrete values at a baud rate of less than 3000 changes per second; using four voltages, for example, raises the theoretical limit of our voice-grade line to 12,000 bits/s. Of course, the effective rate of information flow can always be increased by using multiple signals; a 100-bit number can be transmitted in 1 s serially using a single 100-baud binary signal or in parallel by means of a hundred 1-baud binary signals, each encoding a single digit. Such choices are faced frequently by designers of digital systems, and a judgment typically depends strongly on underlying technological issues. For example, digital communication between geographically distant subsystems, such as terminals of an airline's flight reservation system, might rely on leased telephone lines whose bandwidth characteristics limit transmitted signals to audio frequencies. Reliable digital communication over such a line is limited to a few thousand baud, but increasing the communication rate by using multiple lines band = transx per sec is expensive. Since communication cost is likely to be a dominant factor in the design of such systems, it is worthwhile devoting considerable effort to minimizing the required communication rates. Even in localized systems, communication costs are an important design consideration. As logic elements become cheaper and perform more complex functions, the cost of running wires (or other media) to interconnect them becomes an increasingly important element of system costs. Although nearby modules of a system can communicate at rates measured in hundreds of megabaud by using many parallel wires, such high bandwidth is expensive in device terminals and interconnections and is not to be squandered. Even within a single integrated-circuit chip, where the economies are based largely on space (chip area), interconnection costs often dominate: More of the chip area is devoted to lines transmitting signals from one logic element to another than to the logic elements themselves. #### 1.6 Transitions and Validity In a dynamic system, where logical variables change in value from time to time, we cannot in practice avoid brief excursions through the forbidden zone. Because of stray capacitance and other such physical constraints, the voltage representing a logical variable v cannot be changed instantaneously from one valid logic representation to the other, although the transition may be quite fast—a few nanoseconds is typical. We therefore enforce on ourselves an additional discipline: We avoid asking whether v represents a logical 1 or 0 at about the time when it may be making a transition between values. This in turn requires that the beholders of v—those devices to which v is an input—each have available information about when v might change. Most commonly this requirement is met by constructing $synchronous\ systems$ , in which logic values are constrained to make transitions only at prescribed times (for example, every microsecond on the microsecond) keyed to a globally available clock signal. For truly asynchronous systems, it is impossible to guarantee that no logic level will be sampled at an inauspicious time (for example, during a transition); however, such events can be $\overline{\text{made quite}}$ improbable in practice. This general topic is visited again in section . ### 1.7 Logic Families Some of the parameters in figure 1.1 seem arbitrary. The association of low voltage with logical 0 and high voltage with logical 1 is a convention termed positive logic, the dual convention, negative logic, represents logical 0 by the higher of the two voltages. This choice may be made arbitrarily to suit one's taste; a digital system can be analyzed in terms of positive logic, negative logic, or a mixture of the two. Other parameters, such as the choice of logically valid voltage ranges and threshold voltages, typically reflect a combination of design goals and characteristics of the implementation technology. Several distinct "families" of digital devices have been developed over the years, each with its own set of characteristics and parameters for mapping continuous electronic variables (usually voltage) onto logic (nearly always Figure 1.2 Resistor-transistor logic (a) RTL inverter; (b) NPN transistor. binary) values. Each such family includes a set of rules for the valid representation of logical values; each device in the family has the property that, so long as its inputs obey the rules, its outputs will also. They provide in each case a set of Tinkertoy-like modules that can be simply plugged together to build arbitrarily complex logical systems, with relative disregard for the underlying electronics. The resulting simplicity is an enormous advantage to the designer, who can deal with the concise logical characterization of devices whose specification in electrical terms would be nearly intractable. # 6.4 #### 1.8 Digital-Circuit Implementation Figure 1.2(a) depicts a simple logic device from an early implementation technology called resistor-transistor logic, or RTL. The device is an inverter; its logical output at C is 1 if its input (at A) is 0, and 0 if its input is 1. The transistor in the circuit of figure 1.2(b) may be viewed roughly as a switch controlled by its base (input) current, which in turn depends on the voltage at A. If the voltage at input A is high (representing, using positive logic, a logical 1 input), then the switch is closed (the collector and emitter are short-circuited) and the output C is effectively connected to ground (yielding a logical 0 output). If the input voltage is close to 0 (logical 0 input), the switch is open, leaving C connected to the supply voltage $V_{CC}$ through a small resistor; this results in a logical 1 output. #### 1.8.1 Logic Levels and Noise A slightly more sophisticated model of the transistor yields the voltage-transfer characteristics shown in figure 1.3 for our simple inverter. Over a portion of its input-voltage range, the device behaves like an amplifier with a negative gain (proportional to the slope of the center segment of the plot). This is termed the *active* or *linear* region of operation. When the input voltage is sufficiently low, the transistor is said to be a *cutoff* and effectively presents an open circuit between its emitter and collector. When the input voltage is sufficiently high, the Wollage The Digital Abstraction Figure 1.3 Voltage-transfer characteristics of an RTL inverter. transistor becomes <u>saturated</u> and effectively short-circuits its emitter to its collector. Details of the sizes of each region of operation (cutoff, active, and saturation) and the exact placement of the "knees" of the transfer characteristic depend on parameters of the transistor (such as its gain) as well as on values of the resistors in the device. The ranges of voltages specified for valid logic levels of 0 and 1, in turn, reflect these details and hence constrain their choice. It is important, for example, that the design parameters be such that a valid logic level at the input places the transistor either in the cutoff or saturation region of operation; this avoidance of the active region guarantees reliable performance in the presence of noise. In its active region, the circuit behaves like an amplifier with high gain; if, for example, a valid logical 1 at the input to the device biases the transistor into its linear region, it will amplify small perturbations of the input. Thus a 10-mV noise spike at the input might (assuming a gain of 10) become an 0.1-V noise spike at the output, as sketched in figure 1.4. The output signal presumably connects to the input of other devices, where its noise component may be amplified further, until the resulting voltages become invalid representations of logic values. Thus we constrain input voltages representing valid logic values to fall well within the cutoff and saturation regions, where the output voltage will be relatively insensitive to small perturbations of the input. In general, we leave a gap between each range of valid logic levels and the active region of the transistor in order to provide noise margins. In the RTL logic family, the standard supply voltage $V_{\rm CC}$ is 3.6 V. Valid logical 0 must be below 1 V, and logical 1 must be above 2.5 V; the resulting noise margins are each about 0.5 V. #### 1.8.2 Fanout Restrictions It is necessary to define the valid logic levels so as to include the device's output values when it has logically valid inputs. Having insisted that valid inputs will Figure 1.4 Amplification of a noise spike. result in either saturated or cutoff transistor behavior, we find that this reduces to a requirement that the cutoff output voltage $V_{\rm C}$ and the saturation output voltage $V_{\rm S}$ be included as valid logical 1 and 0 levels, respectively. Unfortunately, the characteristic plotted in figure 1.3 oversimplifies the behavior of our device in several ways. One of these is the assumption, made for the purposes of the preceding discussion, that the load on the output of the device imposed by external devices connected to terminal C is negligible. In fact, the voltage drops across the collector resistor and the transistor itself are functions of the output current; the effect of a heavy load on the output of the inverter is, roughly, to move the cutoff voltage $V_{\rm C}$ and the saturation voltage $V_{\rm S}$ toward each other and hence toward the forbidden zone between valid logic representations. As a result of this effect, each family of logic implementations imposes a restriction on the number of device inputs to which each device output can be connected; this is called the *fanout* limitation. In the case of RTL, device fanout is only about 5. This is primarily due to the voltage drop across the collector resistor while the transistor is at cutoff (in the logical 1 state), which lowers the output voltage as the fanout (and hence the output current) is increased. #### 1.8.3 Nonlinearities and Gain We have seen that the continuous range of voltages between minimum (zero or ground, in our example) and maximum (the supply voltage $V_{\rm CC}$ ) is divided into several regions: The valid logic levels are at the extremes, a forbidden zone occupies the center portion, and the remaining two gaps constitute the noise margins. It is generally desirable to center the forbidden zone between the valid logic levels, yielding comparable noise margins for logical 0 and 1 values; it is the *minimum* of the noise margins that dictates the maximum tolerable electrical noise level for a logic family. iden / Figure 1.5 Ultra-high-gain switching device. The reader may have noticed that the active region of the transistor, which constitutes its useful range in linear applications (such as in a high-fidelity amplifier) is a positive annoyance in digital applications. The fact that it must be avoided in the assignment of valid logic levels motivates us to minimize its size, thereby allowing more freedom to assign logic levels and noise margins. From this standpoint, an ideal transistor for digital purposes has an infinitesimal linear region, as depicted in figure 1.5. In this hypothetical device, the linear range is reduced to a vertical line, corresponding to a (negative) infinite gain at a single threshold voltage $V_{\rm T}$ . The gain of active elements such as transistors plays a crucial role in digital logic, in that it allows a device with marginally valid input levels to produce output levels that are well within the valid ranges. It is possible to build logic devices using only passive devices such as diodes or even resistors, but the outputs of such devices will in general be closer to the forbidden zone than the inputs. Hence a logic signal passing through several such devices will gradually deteriorate until it is no longer valid. A passive device fails to guarantee that every valid input produces a valid output; one can always find a marginally valid input value for which the output value is forbidden. Such a device fails to meet the static discipline unless the validity standards applied to its output signals are less rigorous than those applied to its inputs. In order to apply a set of validity constraints such as those depicted in figure 1.1 uniformly to both inputs and outputs of a combinational device, we need an active device exhibiting both gain and nonlinearity. Consider figure 1.6, which shows the static voltage-transfer curve of an inverter. The shaded regions of the diagram represent valid inputs that generate invalid outputs and are excluded by the static discipline. Consequently, the transfer curve of the inverter must enter the center rectangle at the top and leave it at the bottom, in order to constrain invalid outputs to occur only on invalid inputs. Since the width $V_{\rm ih}-V_{\rm il}$ of this rectangle is less than its height $V_{\rm oh}-V_{\rm ol}$ by the sum of the noise margins, book written too tormal to be clea Correct Figure 1.6 Forbidden V<sub>in</sub>-V<sub>out</sub> regions. the slope of the transfer characteristic—hence the gain—must be greater than 1 (in magnitude) over this center section of the curve. Moreover, since the difference between the maximum and minimum input values is at least as great as the difference between the maximum and minimum output values, the *average* gain over the entire transfer curve is at most 1; this implies that the gain outside of the central forbidden region must be less than 1. Thus differing slopes are required along various regions of the transfer characteristic, dictating a nonlinear input/output relationship. These considerations lead us to the inescapable conclusion that only *nonlinear* devices are suitable for the implementation of logic families. Linear devices, whose DC gain is constant over their entire operating range, will result in logic elements that fail to meet the static discipline for certain values. Passive and linear devices are occasionally used to perform logic functions, although this practice requires that active ("restoring") logic elements be interspersed to restore marginal signals to valid logic values. This technique amounts to a change of logic-level representations between the inputs to a device and its outputs, choosing in the latter case an ad hoc representation that accommodates the signal deterioration imposed by the device (such as lower signal levels due to low gain). #### 1.8.4 Input Gating A logic family must provide devices that effect logical functions of several input variables. Devices that combine logical values in elementary ways are often called *gates*, a name that stems from the early days of switching theory. As the name Clinear too shap Why? Figure 1.7 Inferior RTL NOR gate. Figure 1.8 NOR gate voltage-transfer characteristic. suggests, gating can be viewed as a mechanism by which one logical signal can be blocked or passed, depending on the value of another signal. In the RTL family, the simplest such device to implement is the two-input *NOR* gate. This device has the property that its output is a logical 0 (using positive logic) if either or both of its inputs are logical 1s; its output is 1 if and only if both inputs are 0s. An early implementation strategy for such devices used resistors to take a linear combination of the two input voltages and directed the result to a transistor with a strategically placed active region. Figure 1.7 shows such a circuit, and figure 1.8 shows a plausible voltage-transfer characteristic. Note that the input circuitry (resistors) produces a weighted sum of the input voltages; the weights are made equal because of the symmetry of the operands to the logical NOR function. The effect is to select the transistor's input from one of three equally spaced values, corresponding to input combinations containing zero, one, or two logical 1s, respectively. This implementation strategy is an example of threshold logic, in which a logic function is computed by comparison of a weighted sum of binary inputs against a fixed threshold value. The inherent weakness of this approach stems from the way that *fanin*, or number of inputs, compromises its noise-immunity characteristics. Referring to figure 1.8, we note that the transistor now has three valid input ranges rather than two; there is consequently less room between any two of them to squeeze both the forbidden zone (including the active transistor region) and noise margins. As the number of inputs is increased, the problem clearly gets worse, until the fundamental limit is reached in which pairs of distinct linear combinations of valid inputs are separated by less than the active range of the transistor. Primarily as a result of these considerations, the NOR gate implementation shown in figure 1.7 was abandoned early in the history of RTL. The problem with this circuit stems from its use of linear circuit elements—namely resistors—to combine logic values. Although it was resolved in the case of RTL by moving from *input gating* (in which transistor inputs are combined) to the *output gating* strategy described in the next section, modern logic families use input gating extensively. The difference is that they use nonlinear circuit elements to combine the inputs. Diode-transistor logic (DTL) uses input diodes to effect logic functions, and the popular transistor-transistor logic (TTL) families use multiemitter transistors to perform input gating. It should be noted that input gating typically involves passive devices such as diodes or resistors and hence must be followed by an active "restoring" device to ensure unambiguous output values despite possibly marginal inputs. #### 1.8.5 Output Gating: Wired-OR An alternative implementation of multi-input RTL gates is shown in figure 1.9. This circuit has noise characteristics similar to those of the inverter of figure 1.2, since the inputs are combined by the nonlinear output transistors rather than by the linear input resistors. A reasonable understanding of its operation may be obtained by viewing the transistors as switches connected in parallel, so that either or both switches being closed effectively connects the output C to ground. While there are second-order effects that limit potential fanin with this approach, they are much less serious than those inherent in threshold logic techniques. The NOR circuit of figure 1.9 differs only slightly from the result of connecting the outputs of two RTL inverters together; the difference is simply a reduction of the collector resistance in the latter case. In fact, the interconnection of outputs is common practice when using RTL and certain other amenable logic families. The technique has come to be known as wired-OR, although in the present case (RTL positive logic) wired-AND would be a more appropriate term. Thus two RTL logic levels A and B, each the output of some RTL device, can be simply wired together to yield their logical conjunction A AND B (or AB, using the usual Boolean algebraic notation). There are, of course, limits to the number of outputs that can be wired together. If n RTL 1 levels are connected to a single RTL 0 level, for example, then one saturated transistor must carry the sum of the currents flowing through the n+1 collector resistors. This increased current not only degrades the output voltage (moves it Figure 1.9 Improved RTL NOR gate. Figure 1.10 Signal timing. closer to the fo<u>rbidden zone</u>) but may, for sufficiently large *n*, approach the current limits of the output transistor. #### 1.8.6 Timing Considerations If we use a rectangular voltage waveform such as the $V_{\rm in}$ in figure 1.10 as input to an inverter, we are likely to observe an output waveform similar to that plotted as $V_{\rm out}$ . We notice first that, in contrast to the "ideal" rectangular shape of the $V_{\rm in}$ curve, the corners of the output waveform have become rounded and the rising and falling edges are not vertical. These effects reflect fundamental physical properties, such as capacitive effects, of the device. In order to make a transition between the valid logic levels, electrical charge must be moved—for example, to and from transistor junctions. This motion of charge constitutes work (in the sense of physics), and since we are constrained to apply finite power to the device, the transitions require finite amounts of time. While very fast logic families such as emitter-coupled logic (ECL) attempt to minimize the amounts of stored charge to be moved and maximize the amount of power used to move it, in practice transitions between valid logic levels are not instantaneous. The time $t_{\rm r}$ taken for a transition from low to high logic level is called *rise time*, and the time $t_{\rm f}$ for the reverse transition is called *fall time*. Typical rise and fall times are measured in very small numbers of nanoseconds. They are dependent not only on the technology of particular logic families, but on the load placed on particular signals; higher load tends to add capacitance and hence prolong transitions. A pleasant characteristic of rise and fall times is their resistance to accumulation as a signal passes through a number of cascaded logic devices. Thus the waveform at the end of a long string of series-connected inverters will have rise and fall times similar to that of a single inverter. This is attributable to the high gains of the transistors, each of which tends to increase the slope of its input transitions. The output waveform is delayed from the input by an approximately constant interval, corresponding to the propagation delay $t_{\rm pd}$ of the device. This delay also reflects the time necessary for the motion of charge that accompanies transitions and is thus closely related to the rise and fall times. However, propagation delay <u>does</u> accumulate through cascaded devices. Thus, if the propagation delay of an RTL inverter is 12 ns, the output of n series-connected inverters will be delayed (with respect to its input) by 12n ns. Of course, the output will be inverted if n is odd and will be a delayed replica of the input if n is even. Propagation delay is an important characteristic of digital devices, leading ultimately to performance limitations (in the sense of the number of computations that can be executed per unit of time) in complex digital systems. #### 1.8.7 Propagation and Contamination Delays The delay involved in a complex digital computation is typically determined by the *longest* time necessary for signals to propagate from the inputs of a system to its outputs. As a consequence, digital designers often compute the total propagation delay along each signal path (adding the delays of each device it includes) to determine the limiting *critical path* of information flow. The usefulness of $t_{\rm pd}$ specifications for component devices, then, stems from their validity as an *upper bound* on the delay of data through each device: System designers need a guarantee that the time taken by signals to propagate along each path is no longer than the value they compute from device specifications. An overly conservative $t_{\rm pd}$ specification may cause a system to be designed for suboptimal performance, but it will operate as its designer predicts; an optimistic $t_{\rm pd}$ specification is likely to cause faulty operation. To optimize performance, it is valuable to have the tightest possible bounds on propagation delays in order to know precisely when signals are valid; however, we must insist that these bounds be conservative. Very aggressive designs sometimes require us to pin down signal validity periods even more precisely. Glancing back at figure 1.10, we note that the output voltage remains in a logically valid range for a brief period after the input changes; this reflects our acceptance of a range of valid voltages as well as the physical propagation time of the leading edge of the input signal. This delay represents an additional period of output signal validity that can be exploited in carefully engineered designs: A designer might assume that the output of a gate remains valid for a nanosecond or two after its input changes, in contrast to the more conservative assumption that output validity is immediately corrupted by an input change. To facilitate such highly engineered designs, the specification of a combinational device might include, in addition to the usual propagation delay $t_{\rm pd}$ , a contamination delay $t_{\rm cd}$ . The contamination delay of a combinational device is the minimum interval following an input change during which the validity of previous outputs is guaranteed to remain intact. While $t_{\rm pd}$ is an *upper* bound on a period of output *invalidity*, $t_{\rm cd}$ is a *lower* bound on a period of output *validity*. Like propagation delay, the contamination delay specifications must be conservative; however, a conservative $t_{\rm cd}$ is *smaller* than necessary, whereas a conservative $t_{\rm pd}$ is *larger* than necessary. Like propagation delays, contamination delays can be specified independently for each input-to-output path of a device. The contamination delay between an input-output pair reflects the *fastest* path between that input and output; the propagation delay reflects the *slowest* path. The contamination delay is always less than the corresponding propagation delay; typically, it is much less. Designs that substantively exploit nonzero contamination delays are rare, and for good reason. Squeezing the last nanosecond out of a path requires painstaking timing analysis and effectively returns the engineer to the analog domain of continuous variables and physical effects. It also tends to compromise system manufacturability by depending on subtle characteristics that are quite variable from device to device. Unless otherwise noted, we shall assume zero contamination delays throughout the remainder of this text. 1.8.8 Speed and Power A major weakness of RTL stems from its use of a collector resistor (called a *pullup resistor*) to establish its high-voltage output level. First, this resistor limits collector current and hence prolongs the time necessary to charge stray capacitances associated with a low-to-high transition, yielding poor rise times and consequently high propagation delays. Second, it provides a substantially poorer path for current flow to $V_{\rm CC}$ than a saturated transistor provides to ground; hence it introduces an asymmetry between the rise and fall times of the device. Adjustments in the value of the pullup resistor affect both the switching speed and the power dissipated by the gate. For a resistance R and total stray capacitance C, the switching times are on the order of RC seconds; hence halving the value of the pullup resistor speeds up the device. However, the power dissipated by the resistor not Justal = <sup>&</sup>lt;sup>3</sup> Contamination delays are occasionally referred to as minimum propagation delays, which is misleading. Propagation and contamination generally happen at different times, contamination being earlier. Figure 1.11 Simplified TTL inverter. on a low-voltage output is roughly $$P = \frac{(V_{\rm CC})^2}{R}.$$ Thus the product of the switching time and the power dissipation, often cited as the speed-power product, is relatively independent of the value of R and is approximately constant for a particular logic family or implementation strategy. Speed and power dissipation are important parameters in digital-system design decisions, and the speed-power product is sometimes used as a measure of efficiency by which alternative families are compared. The asymmetry between the resistive pullup and saturated transistor pulldown tends to make opposite-going transitions propagate at differing rates through RTL devices, yielding in effect separate propagation delays for rising and falling edges. Aside from its other disadvantages, this disparity complicates enormously the conceptual model of the devices necessary to use them effectively; in particular, it casts suspicious light on our lumping of propagation delay into a simple single parameter. Figure 1.11 illustrates the more symmetrical output circuitry used in TTL logic. The two output transistors are stacked in a *totem-pole output* configuration, arranged so that valid output levels are asserted by one transistor being saturated while the other is cut off. When the A input is low, the input transistor is cut off. As a result, the base of the upper (pullup) output transistor is pulled high and that transistor saturates, while the base of the lower (pulldown) output transistor is pulled low and that transistor is cut off. Consequently, the voltage at the output terminal C rises close to the collector supply voltage $V_{\rm CC}$ . Conversely, when the A input is high, the input transistor saturates. The bases of the two output transistors are effectively wired together and pulled to a low voltage by current going into the base of the pulldown transistor, which saturates. The diode in series with the pullup transistor's emitter causes the voltage at that emitter to be higher than the voltage at C by the voltage drop Review 8.02 material - Capictanea - capacitor - RLL Circuit Figure 1.12 Three-state driver. across a conducting silicon diode (typically 0.7 V). As a result, the pullup transistor's emitter is at a high enough potential, relative to its base, that the transistor cuts off, and the voltage at C drops to nearly zero. Viewing the transistors as switches, we find that the circuit can provide a valid high output voltage if it connects C to $V_{CC}$ about as definitively as it can short C to ground to force it low. The passive resistive pullup of RTL logic has effectively been replaced by an active switch (transistor), resulting in a faster device with more similar rise and fall times. A disadvantage of this scheme is that it rules out wired-OR interconnection of outputs, since the interconnection of a high to a low output would result in a low impedance path (through two saturated transistors) between $V_{\rm CC}$ and ground. This constraint is partially alleviated by the availability of three-state drivers (often called tristate drivers), which feature a logic-level enable input. When enable carries a logical 1, the driver behaves as a normal TTL driver; when *enable* carries a logical 0, both of the totem-pole output transistors are cut off, leaving the output terminal floating. The symbol for a three-state driver is shown in figure 1.12. Several threestate outputs may be connected, so long as the logic that drives their enable inputs is designed to guarantee that at most one connected driver is enabled at any time. Of course, a line connected only to disabled three-state drivers is left floating and should generally be considered to carry an invalid logic level. The goal of fast transitions dictates that the pair of totem-pole output transistors change between cutoff and saturation as nearly simultaneously as possible; thus there tends to be a brief period during each transition when both transistors are conducting. This in turn presents (momentarily) a nearly short-circuit path between $V_{\rm CC}$ and ground, tending to generate annoying supply voltage spikes that can interfere with other logic devices sharing the same power supply. For this reason, it is common practice to proliferate small filtering capacitors in TTL designs to smooth out supplyvoltage variations. #### 1.8.9 Saturation Charge It has already been noted that electric charge stored in semiconductor junctions and stray capacitances requires both time and power to move, and hence limits the performance of digital circuits. This effect is due in large part to charge (and hence delay) associated with switching a transistor in and out of saturation. Several techniques have been developed to avoid delays stemming from saturation charge; in each case, output transistors remain in their active region rather than becoming saturated. This leads to faster switching times, generally at the cost of increased power consumption. Schottky TTL is similar in design to (and compatible with) standard TTL, except for the incorporation of a very fast diode (a schottky diode) between the base and collector of each transistor. This diode prevents the transistor from saturating and effectively provides nonlinearity at the low-voltage end of its voltage-transfer characteristic, but with greatly reduced charge storage. Emitter-coupled logic (ECL) avoids nonlinearities at the saturation end of transistor voltage-transfer curves altogether. Instead, it combines input transistors with output transistors in such a way that either input or output transistors are cut off, thus using the relatively fast cutoff nonlinearities to stabilize both logical output values. This results in very fast switching times (on the order of a nanosecond) and ensures that each gate will provide some current path to ground at all times, thus increasing power dissipation. #### 1.8.10 Power and Density The density at which logic gates can be packed onto a silicon chip is an important factor in both cost and performance. Integrated circuits that can be reliably manufactured involve a marginal production cost that is relatively independent of the complexity of the circuit they contain. Thus, to a first approximation, a small-scale integrated circuit containing a few dozen gates might have a price similar to that of a complete computer chip in high-volume production. Moreover, for a variety of reasons, electric signals take longer to propagate between chips than to propagate between gates on the same chip. Thus a complete single-chip computer is likely to outperform one that distributes the same technology among several chips, because of the increased interchip communication time. These and other factors have stimulated a strong push to increase the densities of integrated circuits; these have improved from the level of a few gates per chip in the mid-1960s to numbers in the hundreds of thousands by the late 1980s. Density, however, is limited by several factors. Among these is the optical technology used to inscribe circuits on silicon, which limits (currently to a micron or so) the width of the on-chip lines that make up the circuit. Clearly the complexity of the basic gate design further limits the number of gates on a chip. Finally, the power consumed by each gate, coupled with the physical constraints on the amount of heat an integrated-circuit package can dissipate, is a fundamental limitation and provides a principal motivation for our concern with power dissipation as a parameter of gate technology. Currently the highest densities are obtainable using metal-oxide semiconductor (MOS) technologies. The basic switching element in an MOS gate is a field-effect transistor, or FET, whose principal characteristic is its very high input impedance: An input to an FET draws virtually no current and hence consumes virtually no power. The combination of very low power consumption and a very simple gate design allows high densities and hence very complex functions on a single integrated circuit. Nearly all of the single-chip computers, for example, use MOS technologies. Figure 1.13 n-channel FET. Figure 1.14 NMOS inverter. MOS transistors, like bipolar ones, come in two polarities, called *n-channel* and *p-channel*, respectively. Figure 1.13 shows an n-channel FET. Note that it is a three-terminal device, which for our purposes can be viewed as a voltage-controlled switch. So long as the *drain* has a positive voltage with respect to the *source*, current flows from the drain to the source when the *gate* voltage is high (approaching the drain voltage) but not when the gate voltage is low (near the source voltage). A common digital MOS technology—NMOS—uses n-channel FETs as the basic switching element. The basic NMOS gate (figure 1.14) is similar to the RTL gate, using a resistive pullup<sup>4</sup> and an active pulldown; the big difference between NMOS and RTL gates is the much greater currents (and lower resistances) of the latter. An MOS technology using active pullups, called *CMOS* (for complementary MOS), combines n- and p-channel FETs to form a gate with very low power dissipation. A CMOS gate may be viewed as two series-connected switches (similar to the totem-pole outputs of TTL), arranged so that one switch or the other is always open. Thus a quiescent CMOS gate provides no path for current to flow between $V_{\rm DD}$ <sup>&</sup>lt;sup>4</sup> In fact, since resistors are very awkward to implement using NMOS technology, a fixed-bias *depletion-mode FET* is used as the pullup. To a good first approximation, the depletion-mode pullup may be viewed as a resistor. Figure 1.15 CMOS inverter. and ground; furthermore, because of the high impedances of FET inputs, negligible amounts of current are drawn by other CMOS devices connected to its output. The result is that CMOS logic consumes very little power except during transitions, leading to a speed-power product several orders of magnitude better than those of other logic families. CMOS offers high noise immunity, allows a wide range of power-supply voltages, and is generally easy to design with; for these reasons, it has become the technology of choice for large-scale integrated circuits. Each of the MOS technologies requires relatively complex buffers to interface the very-high-impedance MOS gates to external (off-chip) logic. As a result, they are generally used only for complex functions (such as calculator chips and microprocessors) where the number of external connections is small compared with the number of MOS gates involved. #### 1.9 Summary Wait for class for end port The technology discussed in this chapter provides the basis for the <u>fundamental</u> abstraction of digital systems. It allows systems to be constructed by designers who are largely ignorant of the underlying electronics, who deal with the primitive elements (such as gates) strictly in the digital domain, and who agree to abide by the simple rules (such as fanout and synchrony) upon which we insist. It should be emphasized that our goal has not been to deal seriously with the topic of logic gate design. Although the examples given were based on electronic implementations, the reader should recognize that the fundamental issues are not electronic; they apply equally to the realization of digital values as mechanical tension, fluid pressures, or any other continuously variable physical parameter. In each case nonlinearities, gain, and the "forbidden zone" will play essential roles. Each will involve speed-power trade-offs, noise, and propagation delays. An appreciation of the electronic details of the preceding examples is not essential to the understanding or even the effective design of digital systems; indeed, the degree of naivete they allow their users is precisely their value as an abstraction. It is important, however, to recognize the limits to the abstraction and the physical constraints from which they derive. #### 1.10 Context The RTL and NMOS logic families were the progenitors of logic ICs (early 1960s) and VLSI (early 1970s), respectively, but each is now obsolete. Surviving families of off-the-shelf logic chips include ECL and the still ubiquitous TTL; the venerated TTL Data Book [TI 1984] will provide the uninitiated reader with a glimpse of the Tinkertoy-set world that has been available to the digital designer for a quarter century. Many contemporary textbooks on digital design, such as Hill and Peterson [1987], treat the subject of this and the following several chapters at a more detailed but still introductory level. Logic implementation issues are treated in the context of NMOS technology in the classic Mead and Conway [1980], which precipitated something of a revolution by popularizing VLSI design beyond the engineering elite. Weste and Eshraghian [1985] perform a similar service for CMOS, the current vogue among custom chip design technologies. #### 1.11 Problems Problem 1.1 Logic Functions: A. Using suitable values for valid logic levels and noise margins, what logic function does the device with the above transfer characteristic compute? Assume the convention of positive logic. B. What function is computed by the above device if we assume negative logic? #### Problem 1.2 Base 3 Numbering: Ternary is a term referring to the number system in base 3. Consider a convention in which a ternary digit is represented as an electric voltage between 0 and 10 V. Let 0–1 V represent a valid "0" output, 4–6 V a valid "1" output, and 9–10 V a valid "2" output. - A. Assuming noise margins 1 V wide, show the mapping of logic levels to voltages for this ternary system. Include valid logic-level outputs, noise margins, and forbidden zones. Your result should resemble figure 1.1. - B. Graph the transfer characteristic for a device capable of acting as a ternary logic *buffer*, that is, a device that produces at its output the same logic level present at its input, as shown below: | IN | OUT | | |----|-----|--| | 0 | 0 | | | 1 | 1 | | | 2 | 2 | | C. Can a device with the following transfer characteristic be used as a ternary logic buffer? Why or why not? - D. How many bits of information are carried in a ternary signal on a single wire? - E. How many different combinations of valid logic levels can be encoded on three ternary wires? How many bits of information does this represent? How many wires would be needed to carry this same amount of information in binary? - F. What is the information flow in bits per second for three ternary wires if a new set of values is sent every 10 ms? G. What is the information flow in bits per second for three *binary* wires if a new set of values is sent every 10 ms? #### Problem 1.3 RTL Logic: An RTL inverter circuit has the transfer characteristic diagrammed below: For suitable values of the threshold voltages $V_1$ , $V_2$ , $V_3$ , and $V_4$ in the figure below, this RTL inverter obeys the static discipline. | Valid<br>0<br>out | Noise margin | Forbidden zone | Noise margin | Valid<br>1<br>out | |-------------------|--------------|----------------|------------------|-------------------| | ı | '1 V | <u> </u> | V <sub>2</sub> J | /4 | A. What is the smallest possible width for the forbidden zone (that is, what is the smallest possible value of $V_3 - V_2$ ) if the noise margins are each at least 0.5 V wide? Give values of $V_1$ , $V_2$ , $V_3$ , and $V_4$ corresponding to this minimum width. Why can't the forbidden zone be made any smaller than this? The above transfer characteristic is for an *unloaded* RTL inverter, that is, an inverter with no other logic circuits connected to its output. If other logic circuits are connected to this output, they will draw current from it and affect the shape of the transfer characteristic. Some of these altered transfer characteristics are shown below. - B. Pick threshold voltages $V_1$ , $V_2$ , $V_3$ , and $V_4$ , as before, so that the static discipline is obeyed where individual inverters have a fanout of no more than 5. In other words, the transfer curve of any individual inverter may be that of an unloaded inverter, or an inverter loaded with five logic inputs, or anywhere in between. Continue to allow noise margins of at least 0.5 V. Briefly explain the constraints on your selection (such as " $V_1$ cannot be greater than 2 V because..."). - C. If the fanout of some inverters is increased to 10, is it still possible to pick $V_1$ , $V_2$ , $V_3$ , and $V_4$ to allow noise margins of 0.5 V? Why or why not? #### Problem 1.4 Transfer Characteristics: Is it possible to assign valid logic level and noise margin boundaries so that a device with the following transfer characteristic would serve as an inverter? Briefly explain your reasoning. Problem 1.5 Transfer Characteristics II: Is it possible to assign valid logic levels and noise margin boundaries so that a device with the following transfer characteristic would serve as an inverter? Briefly explain your reasoning. #### Problem 1.6 Barracks Logic: Barracks logic is built out of sleeping soldiers covered by electric blankets. Each blanket has a control switch with discrete control settings ranging in 5-degree (Fahrenheit) intervals from 0 to 50 degrees. The temperature of a soldier covered by one or more electric blankets will be the sum of the ambient temperature in the barracks plus the setting on the controller for each blanket. Each soldier has a preferred sleeping temperature, which varies from individual to individual but is always within the range of 60 to 80 degrees, inclusive. If a soldier's temperature departs from his preferred temperature, he will wake up once every minute and adjust the control by one 5-degree increment in the direction he would like his temperature to be modified (if he is cold, he will increase the setting on his control, and vice versa). He will continue these adjustments by 5-degree increments until he once again reaches his preferred temperature (and goes to sleep) or runs out of settings (in which case he grumbles angrily in bed). If every soldier is allowed to control his own blanket, each will soon reach his preferred temperature and slide into nocturnal bliss (assuming a suitable ambient temperature). The interesting aspects of barracks logic result from switching the controls of the various blankets to different soldiers. Inputs to the system are accomplished by placing a few controls in the hands of outsiders, and outputs are read from the control settings of certain soldiers designated by the logic designer. - A. Draw the graph of output control setting vs. input control setting for a typical soldier in steady state. Assume an ambient temperature of 40 degrees. Mark your graph with good choices of the valid regions for the two logical values, the forbidden zone, and the noise margins. Let logical 0 be when a control is completely off and logical 1 be when the control is completely on (or at the highest setting). - B. List some sources of noise that justify the need for noise margins. - C. Even though it is the middle of February, a sudden warm spell raises the ambient temperature in our barracks logic system to 55 degrees. Sketch a new graph of output control setting vs. input control setting in the warmer barracks. - D. Over what range of ambient temperatures will barracks logic function reliably? - E. Does the following arrangement perform a useful function? What is it? - F. A model HOT-1 electric blanket control can power 1200 W of blankets. A model HOT-BED electric blanket requires 275 W. What is the fanout of a HOT-1? What might happen if this rating is exceeded? Is there more danger of exceeding the rating at an output value of logical 0 or logical 1? - G. To create a system with multiple inputs, we allow several blankets to be placed over a single soldier. What is the maximum fanin possible in barracks logic if 170 degrees is the highest temperature a soldier can tolerate without his characteristics being permanently altered? - H. Show how to build a NOR gate and an AND gate in barracks logic. - I. Explain what causes rise time, fall time, and propagation delay in barracks logic. Give worst-case numerical values for each of these parameters for the barracks logic inverter. (Let *rise time* be the time from when an output leaves the valid logical 0 range until it enters the valid logical 1 range, and vice versa for fall time. For propagation delay, use the time from when the input switches to a valid logic level to the time when the output switches to a valid logic level.) #### Problem 1.7 An inverter has propagation delay $t_d$ and rise time $t_r$ . What is the propagation delay and rise time of three inverters connected in series? #### Problem 1.8 Consider a family of logic in which $V_{\rm ol} = 0.6 \, \mathrm{V},$ $V_{\rm oh} = 2.8 \, \rm V$ $V_{\rm il} = 1.0 \, \rm V$ $V_{\rm ih} = 2.2 \, \rm V.$ - A. How wide are the 0 and the 1 noise margins (in volts)? - B. What is the smallest average voltage gain that a buffer in this family must exhibit over the range $V_{\rm il} < V_{\rm in} < V_{\rm ih}$ ? (Recall that a buffer is a combinational device that asserts on its output the same logical value that it senses at its input.) #### Problem 1.9 Consider a logic buffer (output logic value = input logic value) in a family of logic parameterized by the following mapping of voltages to logic levels: - A. What is the minimum gain a *buffer* must exhibit over the forbidden zone in order to obey the static discipline with this convention? - B. Sketch the voltage-transfer characteristic of a buffer in this logic family. - C. Let $V_{\rm f}$ be a voltage that, when applied to the inverter's input terminal, yields $V_{\rm f}$ at the buffer's output. Give a range of possible values for $V_{\rm f}$ . #### Problem 1.10 Propagation Delay and Rise Time: Assume that all gates have a propagation delay $t_{\rm pd}$ and a rise time $t_{\rm r}$ . What is the maximum propagation delay and rise time of the following circuit? #### Problem 1.11 Suppose we have an inverter with the following transfer characteristic, where $V_{\rm oh}$ , $V_{\rm ih}$ , $V_{\rm ol}$ , and $V_{\rm il}$ are defined as in figure 1.1. - A. What are the sizes of the noise margins for this inverter? - B. Consider a single inverter having the above transfer characteristic, conventionally diagrammed as shown below. To what value could IN drop before OUT would cease to be a valid low output? C. Now consider three cascaded inverters, each having the transfer characteristic shown above: To what value could IN drop before OUT would cease to be a valid low output? - D. Suppose we put a box around these three cascaded inverters and called the box a NEW INVERTER. If $V_{\rm oh}$ = 4 V and $V_{\rm ol}$ = 1 V, what are the noise margins for this NEW INVERTER? - E. Give two reasons why this type of cascading is not a good way to increase noise margins. WP transistor that I cansistor that I cansistor that I cansistor that I cansistor that I cansistor that tansister amplifies Iswitches electric signal (idid we do in 6.01?) Vin Base Prollector Emitter Small s'unal to control larger signal -> gain Output in proportion to input signal -> amphitier Diff types/channels Physical peron mon Concrete encoding of information To this point we've discussed encoding information using bits. But where do bits come from? If we're going to design a machine that manipulates information, how should that information be physically encoded? 00101101 One if by land, and two if by sea: And I on the opposite share will be. Ready to ride and spread the alarm Through every Middleeex village and farm. For the country folk to be up and to arm." What makes a good bit? - cheap (we want a lot of them) - stable (reliable, repeatable) - ease of manipulation (access, transform, combine, transmit, store) 6.004 - Fall 2011 LO2 - Digital Abstraction 2 Substrates for computation by bit adder We can build upon almost any physical phenomenon... Those last ones lanterns dominos not pratial engraved stone tablets Billiard balls E. Coli polarization of a photon 9/13/11 WaitI But, since we're EE's... Stick with things we know about: voltages phase currents frequency This semester we'll use yoltages to encode information. But the best choice depends on the intended application... Voltage pros: easy generation, detection lots of engineering knowledge potentially low power in steady state Voltage cons: easily affected by environment DC connectivity required? R&C effects slow things down 6.004 - Fall 2011 ## Patrie Representing information with voltage Representation of each point (x, y) on a B&W Picture: O volts: BLACK 1 volt: WHITE 0.37 volts: 37% Gray etc. Representation of a picture: Scan points in some prescribed raster order... generate voltage waveform 6.004 - Fall 2011 Paster scan ## Information Processing = Computation First let's introduce some processing blocks: 6.004 - Fall 2011 LO2 - Digital Abstraction 6 ## Why have processing blocks? The goal of modular design: Abstraction What does that mean, anyway? Rules simple enough for a 6-3 to follow... Understanding BEHAVIOR without knowing IMPLEMENTATION Predictable composition of functions Tinker-toy assembly Guaranteed behavior, under REAL WORLD circumstances Let's build a system! 6.004 - Fall 2011 6.004 - Fall 2011 ### Why did our system fail? Why doesn't reality match theory? 1. COPY Operator doesn't work right lunrealistic to 2. INVERSION Operator doesn't work right 3. Theory is imperfect 4. Reality is imperfect 5. Our system architecture stinks ANSWER: all of the above! Noise and inaccuracy are inevitable; we can't reliably reproduce infinite information-- we must design our system to tolerate some amount of error if it is to process information reliably. 6.004 - Fall 2011 9/13/11 LO2 - Digital Abstraction 9 ## The Digital Panacea... Why digital? ... because it keeps the contracts simple! The price we pay for this robustness: (out) transit anything y wire 0 or 1 but reshirat All the information that we transfer between modules is only 1 crummy bit! But, we get a guarantee of reliable processing. #### The Key to System Design A system is a structure that is quaranteed to exhibit a specified behavior, assuming all of its components obey their specified behaviors. How is this achieved? Contracts! Every system component will have clear obligations and responsibilities. If these are maintained we have every right to expect the system to behave as planned. If contracts are violated all bets are off. 9/13/11 LO2 - Dialtal Abstraction 10 ## The Digital Abstraction Keep in mind that the world is not digital, we would simply like to engineer it to behave that way. Furthermore, we must use real physical phenomena to implement digital designs! 6.004 - Fall 2011 ## Using Voltages "Digitally" - Key idea: don't allow "O" to be mistaken for a "1" or vice versa - Use the same "uniform representation convention" for every component and wire in our digital system - To implement devices with high reliability, we outlaw "close calls" via a representation convention which forbids a range of voltages between "O" and "1". Fundemental abstraction ## A Digital Processing Element A combinational device is a circuit element that has ## A Combinational Digital System A set of interconnected elements is a we (an hook them we combinational decimal decimal) combinational device if - each circuit element is combinational - every input is connected to exactly one output or to some vast supply of constant O's and 1's - the circuit contains no directed cycles LO2 - Digital Abstraction 13 #### Why is this true? 6004 - Fall 2011 Given an acyclic circuit meeting the above constraints, we can derive functional and timing specs for the input/output behavior from the specs of its components! We'll see lots of examples soon. But first, we need to build some combinational devices to work with... 9/13/11 Wires: theory vs. practice Does a wire obey the static discipline? Noise: changes voltage... Vout (voltage close to boundary (voltage in forbidden zone: with forbidden zone) Oops, not a valid voltage!) Questions to ask ourselves In digital systems, where does noise come from? How big an effect are we talking about? 6.004 - Fall 2011 LO2 - Digital Abstraction 16 6.004 - Fall 2011 ## Power Supply Noise L's from chip leads R's and C's from Aluminum wiring layers Current loads from onchip devices AV from: · IR drop (between gates: 30mV, within module: 50mV, across chip: 350mV) · L(dl/dt) drop (use extra pins and bypass caps to keep within 250mV) · LC ringing triggered by current "steps" 6.004 - Fall 2011 LO2 - Digital Abstraction 17 ## different value Sequential Interference $\Delta V$ from energy storage left over from earlier signaling on the wire: · transmission line discontinuities (reflections off of impedance mismatches and terminations) [Dally]Fig. 6-20 (narrow pulses are lost due to incomplete transitions) 6.004 - Fall 2011 Fix: slower operation, limiting voltage swings and slew rates Performere Card Digital Abstraction 19 #### Crosstalk This situation frequently happens on integrated circuits where there are many overlapping wiring layers. In a modern integrated circuit $\Delta V_A$ might be 2.5V, $C_0 = 20$ fF and $C_c = 10$ fF $\rightarrow \Delta V_B = 0.83$ V! Designers often try to avoid these really bad cases by careful routing of signals. but some crosstalk is unavoidable. 6.004 - Fall 2011 9/13/11 LO2 - Digital Abstraction 18 Contract ## Needed: Noise Margins! Does a wire obey the static discipline? No! A combinational device must restore marginally valid signals. It must accept marginal inputs and provide unquestionable outputs ## Summary - · Use voltages to encode information - · "Digital" encoding - · valid voltage levels for representing "O" and "1" - · forbidden zone avoids mistaking "O" for "1" and vice versa - · gives rise to notion of signal VALIDITY. - · Noise - Want to tolerate real-world conditions: NOISE. - · Key: tougher standards for output than for input - · devices must have gain and have a non-linear VTC - · Combinational devices - · Each logic family has Tinkertoy-set simplicity, modularity - predictable composition: "parts work → whole thing works" - · static discipline - · digital inputs, outputs; restore marginal input voltages - · complete functional spec - · valid inputs lead to valid outputs in bounded time # Can this be a combinational device? Problem Suppose that you measured the voltage transfer curve of the device shown below. Could we build a logic family using it as a single-input combinational device? ## Next time: Building Logic w/ Transistors 6.004 - Fall 2011 LO2 - Digital Abstraction 23 6.004 - Fall 2011 9/13/11 ## (.004 Recitation 2 binary signaling scheme 0 or 1 - voltage to encode base 0:00 1: 11 In ceal world, things not perfect -noise on wire - imperfect doorpoperties components - envionmental - temo Have Seperate spees for in tout in out $\delta \leq V_{01} \leq V_{IL}$ 1 = VOH VIH input high noise \_\_\_margins O Val VIN I Vat Could be pich Vol = 00 ? Man Vol Z 15V e So first we are gathering constraints 1 if Voc = ,5V VIH 23 I then we can see Von: 3.5 VII = [V don't need to be symmetric Can do some optimization VOL VIL VIH VOHI SV V 3.51 Not valid Since may gain = 1 So O noise margin (9,1) 2 3 4 5 Must be 71 (3.5,6) Cold we draw noise margins? Yes - and now what are they $\sqrt{\alpha} < 1$ etc Whats the goal? depends on problem? - max noise margin - is it legal? -or just a legal noise morgin $\frac{d}{d} = \begin{cases} V_{A} \cdot V_{B} & \text{prod } \neq 2 \\ 2 & \text{prod } \neq 2 \end{cases}$ Voi VEC 1/ 1/1 VOL VIC KHAM VOH 1.5, 16 1.7 1.8 So is this device a legal before in out Mo Va VIL VIH VOH 17 19 1.1 1.3 Plug in a valid low voltage 0->.9 - will I get something 2,7? 50 19.19 = 181 Which is 7.7 so & No VOL VIL VIH VOH 11 13 1,7 1,9 13.13 = 104 which is 21 & small noise margin all 12 Volts ## CMOS Technology NEXT WEEK: ·THU: Lab 1 due! · FRI: QUIZ 1!!! LO3 - CMOS Technology 1 everything on the wish list MOSFETS: Gain & non-linearity MOSFETs (metal-oxide-semiconductor field-effect transistors) are fourterminal voltage-controlled switches. Current flows between the diffusion terminals if the voltage on the gate terminal is large enough to some diffusion create a conducting "channel", otherwise the mosfet is off and the diffusion terminals are not connected. 6.004 - Fall 2011 LO3 - CMOS Technology 3 ## Representing Combinational Device Wish List ✓ Design our system to tolerate some amount of error ⇒ Add positive noise margins ⇒ VTC: gain>1 & nonlinearity Mule up to V Lots of gain ⇒ big noise margin 100/4 Schon ✓ Cheap, small √ Changing voltages will require us to dissipate power, but if no voltages are changing, we'd like zero power dissipation ✓ Want to build devices with useful functionality (what sort of operations do we want to perform?) Shaded - has valid ins but FETs as switches The four terminals of a Field Effect Transistor (gate, source, drain and bulk) connect to conductors that generate a complicated set of electric fields in the channel region which depend on the relative voltages of each terminal. type channel between the source and drain. The gate voltage when the channel first forms is called the threshold voltage -- the mosfet switch goes from "off" to "on". cause a drift current from the drain to the source. ## Same manut process #### FETs come in two flavors NFET: n-type source/drain diffusions in a p-type substrate. Positive threshold voltage: inversion forms n-type channel PFET: p-type source/drain diffusions in a n-type substrate. Negative threshold voltage; inversion forms p-type channel. The use of both NFETs and PFETs - complimentary transistor types - is a key to CMOS (complementary MOS) logic families. LO3 - CMO5 Technology 5 ## Can completty control voltage on wire blu high + gnd CMOS Inverter VTC When VIN is low, the nfet is off and the pfet is on, so current flows into the output node and Vour eventually reaches $V_{DD}$ (= $V_{OH}$ ) at which point no more current will flow. ' pfet "on' nfet "off" Steady state reached when Vout reaches value where $I_{nu} = I_{nd}$ . > When V<sub>IN</sub> is high, the pfet is off and the nfet is on, so current flows out of the output node and Vour eventually reaches GND (= Vol) at which point no more current will flow. > > pfet "off" nfet "on' When VIN is in the middle, both the pfet and nfet are on and the shape of the VTC depends on the details of the devices' characteristics. CMOS gates have very high gain in this region (small changes in $V_{IN}$ produce large changes in $V_{OUT}$ ) and the VTC is almost a 10step function. 6.004 - Fall 2011 ## CMOS Recipe If we follow two rules when constructing CMOS circuits then we can model the behavior of the mosfets as simple switches: Rule #1: only use NFETs in pulldown circuits (paths from output node to GND) Rule #2: only use PFETs in pullup circuits (paths from output node to Vpp) NFET Operating regions: PFET Operating regions: "off": $$V_{G} > V_{DD} + V_{TH,PFET} \qquad S \longrightarrow D$$ "on": $$V_{G} < V_{DD} + V_{TH,PFET} \qquad S \longrightarrow U_{M}U \longrightarrow D$$ $$\sim -V_{TH,NFET}$$ LO3 - CMOS Technology 6 ## Beyond Inverters: Complementary pullups and pulldowns Now you know what the "C" in CMOS stands for! We want complementary pullup and pulldown logic, i.e., the pulldown should be "on" when the pullup is "off" and vice versa. | | 1 11 pullup | pulldown | $F(A_1,,An)$ | |-------|-------------|----------|---------------| | (ar k | on pullup | off | driven "1" | | all C | ombos off | on | driven "O" | | 00. | on | on | driven "X" | | | off | off | no connection | Since there's plenty of capacitance on the output node, when the output becomes disconnected it "remembers" its previous voltage -at least for a while. The "memory" is the load capacitor's charge. Leakage currents will cause eventual decay of the charge (that's why DRAMs need to be refreshed!). The tor charge ## Here's another... 9/15/11 6.004 - Fall 2011 LO3 - CMOS Technology 11 9/15/11 LO3 - CMO5 Technology 12 ## 2 Issues' timing, heat A Quick Review - A combinational device is a circuit element that has - one or more digital inputs - one or more digital outputs - a functional specification that details the value of each output for every possible combination of valid input values - a timing specification consisting (at minimum) of an upper bound t<sub>PD</sub> on the required time for the device to compute the specified output values from an arbitrary set of stable, valid input values 6004 - Fall 2011 Static discipline 9/15/11 LO3 - CMO5 Technology 13 ## Due to unavoidable delays... Propagation delay (t<sub>PD</sub>): An IPPER BOUND on the An UPPER BOUND on the delay from valid inputs to valid outputs. #### Big Issue 1: Time Wire delays: $\tau_{RC} \approx 50 \text{ps/mm}$ Implies > 1 ns to traverse a 20mm x 20mm chip This is a long time in a 2GHz processor 6.004 - Fall 2011 Stary re distances, capationie LO3 - CMOS Technology 14 ### Contamination Delay an optional, additional timing spec INVALID inputs take time to propagate, too... Do we really need to? Usually not... it'll be important when we design circuits with registers (coming soon!) If t<sub>CD</sub> is not specified, safe to assume it 0. CONTAMINATION DELAY, $t_{CD}$ A LOWER BOUND on the delay from any invalid input to an invalid output 6004 - Fall 2011 fine till output econes inali LO3 - CMOS Technology 16 ## The Combinational Contract t<sub>PD</sub> propagation delay t<sub>CD</sub> contamination delay Note: - 1. No Promises during XXXXX - 2. Default (conservative) spec: $t_{CD} = 0$ 9/15/11 LO3 - CMOS Technology 17 6.004 - Fall 2011 6004 - Fall 2011 ## Acyclic Combinational Circuits If NAND gates have a $t_{PD} = 4nS$ and $t_{CD} = 1nS$ $t_{PD} = 12 nS$ $t_{CD}$ is the minimum cumulative contamination delay over all paths from inputs to outputs t<sub>PD</sub> is the *maximum* cumulative propagation delay over all paths from inputs to outputs # Oh yeah... one last timing issue NOR: produce any Thing Recall the rules for combinational devices: Output guaranteed to be valid when <u>all</u> inputs have been 2. The prop valid for at least $t_{PD}$ , and, outputs may become invalid no earlier than top after an input changes! Many gate implementations--e.g., CMOSadhere to even tighter restrictions. # What happens in this case? Input A alone is sufficient to #### LENIENT Combinational Device: Output guaranteed to be valid when any combination of inputs sufficient to determine output value has been valid for at least $t_{PD}$ . Tolerates transitions -- and invalid levels -- on irrelevant inputs! LO3 - CMOS Technology 20 LO3 - CMOS Technology 19 ## Big Issue 2: Power Energy dissipated = CVDD2 per cycle CVC1 fire CXC Power consumed = $fnCV_{DD}^2$ per chip where f = frequency of charge/discharge n = number of gates /chip 6.004 - Fall 2011 LO3 - CMO5 Technology 21 # 32 Amps (@220v) MIT Computation Center and Pizzeria I've got the solution! ### Unfortunately... Modern chips (UltraSparc III, Power4, Itanium 2) dissipate from 80W to 150W with a Vdd = 1.2V(Power supply current is = 100 Amps) Cooling challenge is like making the filament of a 100W incandescent lamp cool to the touch! Worse yet... - Little room left to reduce Vdd - nC and f continue to arow Hey: could we Somehow recycle the charge? LO3 - CMO5 Technology 22 # MUST computation consume energy? (a tiny digression...) How energy-efficient can we make a gate? It seems that switching the input to a NAND gate will always dissipate some energy... / \ http://www.research.ibm.com/journal/rd/44 1/landauerii.pdf Landauer's Principle (1961): discarding C AB 00 NAND GATE: 2 bits → 1 bit 0 1 (information 10 Loss!) Bennett (1973): Use rev bound to energy use! information is what costs energy! Landauer: 1 bit = K·T·log(2) Einstein: E = M·C2 $\Rightarrow$ 1.45 \* 10<sup>37</sup> bits/pound? AB PQ **FEYNMAN** 00 0 0 GATE: 2 bits → 2 bits (information 0 Preservina!) 1 0 Bennett, Fredkin, Feynman, others: Computer systems constructed from infopreserving elements. Theory: NO lower bound on energy use! Practice: Research frontier (qubits, etc.) 9/15/11 LO3 - CMO5 Technology 23 ## Summary 9/15/11 #### · CMOS 6.004 - Fall 2011 - Only use NFETs in pulldowns, PFETs in pullups → mosfets behave as voltage-controlled switches - Series/parallel Pullup and pulldown switch circuits are complementary - CMOS gates are naturally inverting (rising input transition can only cause falling output transition, and vice versa). - "Perfect" VTC (high gain, $V_{OH} = V_{DD}$ , $V_{OL} = GND$ ) means large noise margins and no static power dissipation. #### · Timing specs - tpp: upper bound on time from valid inputs to valid outputs - t<sub>CD</sub>: lower bound on time from invalid inputs to invalid outputs - If not specified, assume t<sub>CD</sub> = O - Lenient gates: output unaffected by some input transitions - · Next time: logic simplification, other canonical forms 6.004 - Fall 2011 9/15/11 LO3 - CMO5 Technology 24 #### CMOS technology <u>Problem 1.</u> The following diagram shows a schematic for the pulldown circuitry for a particular CMOS gate: - A. \* What is the correct schematic for the pullup circuitry? - B. \* Assuming the pullup circuitry is designed correctly, what is the logic function implemented this gate? - C. ★ Assuming the pullup circuitry is designed correctly, when the output of the CMOS gate above is a logic "0", in the steady state what would we expect the voltage of the output terminal to be? What would be the voltage if the output were a logic "1"? <u>Problem 2.</u> The following diagram shows a schematic for the pullup circuitry for a particular CMOS gate: - A. \* Draw a schematic for the pulldown circuitry for this CMOS gate. - B. ★ Assuming the pulldown circuitry is designed correctly, give an expression for the logic function implemented by this gate. Problem 3. Consider the following circuit built from nfets and pfets: - A. ★ Can this circuit be used as a CMOS gate? If not, explain why. If so, what function does it compute? - B. \* If we wanted the output voltage to change more quickly when going from a logic "0" to a logic "1", what changes would we make to the fets? <u>Problem 4.</u> Consider the 4-input Boolean function Y = (A\*B) + (C\*D) where "\*" is AND and "+" is OR. A. \* Implement the function with a single 4-input CMOS gate and an inverter. <u>Problem 5.</u> Anna Logue, a circuit designer who missed several early 6.004 lectures, is struggling to design her first CMOS logic gate. She has implemented the following circuit: Anna has fabricated 100 test chips containing this circuit, and has a simple testing circuit which allows her to try out her proposed gate statically for various combinations of the A and B inputs. She has burned out 97 of her chips, and needs your help before destroying the remaining three. She is certain she is applying only valid input voltages, and expects to find a valid output at terminal C. Anna also keeps noticing a very faint smell of smoke. A. What is burning out Anna's test chips? Give a specific scenario, including input 2 of 4 - values together with a description of the failure scenario. For what input combinations will this failure occur? - B. Are there input combinations for which Anna can expect a valid output at C? Explain. - C. One of Anna's test chips has failed by burning out the pullup connected to A as well as the pulldown connected to B. Each of the burned out FETs appears as an open circuit, but the rest of the circuit remains functional. Can the resulting circuit be used as a combinational device whose two inputs are A and B? Explain its behavior for each combination of valid inputs. - D. In order to salvage her remaining three chips, Anna connects the A and B inputs of each and tries to use it as a single-input gate. Can the result be used as a single-input combinational device? Explain. <u>Problem 6.</u> Occasionally you will come across a CMOS circuit where the complementary nature of the n-channel pull-downs and p-channel pull-ups are not obvious, as in the circuit shown below: A. Construct a table that gives the on-off status of each transistor in the circuit above for all combinations of inputs A and B. 3 of 4 9/16/2011 10:14 AM 4 of 4 B. Compute the output, Y, for each input combination and describe the function of the above circuit. Problem 7. In lecture there was a brief overview of the CMOS fabrication process. - A. What keeps the source/drain diffusions of a MOSFET from shorting out to the substrate or to each other? - B. Why does reducing the thickness of the thin oxide layer improve the performance of the mosfets? - C. Why is silicon dioxide (SiO<sub>2</sub>) deposited right before a new wiring layer is added to the surface of the wafer? - D. How are connections between the wiring layers made? - E. If one wanted to increase $I_{DS}$ for a NFET, how should it's dimensions be changed? - F. Suppose there are two mosfets of width W and length L connected in parallel, i.e., all their terminal connections are identical. Given that I<sub>DS</sub> of a mosfet is proportional to W/L, what would be the appropriate dimensions for a *single* mosfet that would have the same I<sub>DS</sub> as the pair connected in parallel? Eben not Chis today Todan'i last material for Quiz 1 — last senester more material — look at total question V=== gand n-type MOS d'ode = conduts current l'direction Gate / Bill p Source if hook up wrong war get fire For now no difference blu drain + source If bulk Voltage > Sarce Voltage - a lot of current flows - the transactor on flee 1015 amps | 1 | | |---|-------------------------------------------------------------------------------------------------------------------------------------------------| | ) | Control where bulk is connected | | | You usally correct bulk to source | | | B | | | Connect everything to grand so no current can now | | | Bilk is the most of transistor physically | | | No current goes into gate insolated -but it all high voltage on gate -drows regitive electron to it -so conducts current from Lain to source | | | Act like a capacitor | Act like a capacitor is capacitance -but don't need to worry about (Males a tot more sense now) So has on roff condition don't think of it as lor 0 (an swap Drain + Source - it always goes from high to low potential - Standard is high potential/drain at top "Low potential" is gate to source (which is ground for is) V-Mos tie plk to high So no conduction on diveds often bulk + same connected people draw it pside down > dioeds are inherent > side effect (review p-n type side side) In this one want to attract @ charge So conrect gate to low voltage Cullent Flows from high + low Source drain De Morgans Lan #1 Pullabur network South En-mos retworks The Can knock output to grant I tonect to grand Capitator Will ducharge to 0 | (So contined!) | |-------------------------------------------------------------------------| | AM Also need to charge up | | (I was trinking that -must not be as confisel -need to renember (6.01) | | When whole thing conducts - its "on" not 1 or 0 | | ABCD PD | | O O O O Off = no gates open<br>Since no voltage differential<br>Over it | | 1110n | | If named to do this v/ P-Mos<br>world need voltage below ground | | OA (Arb) | | D. C (A+B) The logic formla and | | - negation | PM up network world be apposite on loft -anything serial, make parallel 1 parallel, " Serial Want to describe network as $\overline{A}$ $\overline{B}$ $\overline{C}$ $\overline{D}$ $\overline{D}$ s $\overline{C}$ + $(\overline{A} \circ \overline{B})) = "on"$ To is blow, (note) (so many diff wars to think about things! (all output F Week to prove are the same via Mr. De Morgan Examples whole section of Ano F= D (C+ A+B) $F = \overline{D} \cdot (\overline{C} + (\overline{A} \cdot \overline{B})) \in matches what we got$ · w/ graphical cheater retwork 1 Jone - confirmed Weed both to to full logic operation - need to be complementy - revese it visually - Contin u/ DeMorgans F=AHB Tit Aor B is high It wall O it output at static high is anothing the happening So output high when not A and not B F= A . B A+B are "extenally" controlled - one wires coming to it - Controlled path by other parts of circuit Note A rot conrectled Conrectled Quiz is again next Finday ## Let's build a MOSFET Start with a $500\mu$ slice of a silicon ingot that has been doped with an acceptor (typically boron) to increase the concentration of holes to $10^{14}$ /cm³ - $10^{16}$ /cm³. At room temperature, all the dopants in this p-type material are ionized, turning the silicon into a semiconductor. We'll build many copies of the same circuit onto a single wafer. Only a certain percentage of the chips will work; those that work will run at different speeds. The yield decreases as the size of the chips increases and the feature size decreases. Wafers are processed by automated fabrication lines. To minimize the chance of contaminants ruining a process step, great care is taken to maintain a meticulously clean environment. So put on your bunny suits and let's begin... CMOS Fabrication 1 ## Creating patterns on the wafer Images from: http://www.intel.com/education/chips/index.htm A "thick" (0.4u) layer of $\mathrm{SiO}_2$ is formed by oxidizing the surface of the wafer with wet oxygen (we rust it!). The $\mathrm{SiO}_2$ will serve as insulation between the conductive substrate and subsequent conductive layers we'll build on top of the oxide. Now we'll form a pattern in the $\mathrm{SiO}_2$ using a mask & etch process. First the wafer is coated with a layer of photoresist. Photoresist becomes soluble when exposed to ultraviolet light... like making +-dirts Using a mask to protect parts of the wafer, we'll expose those portions of the wafer where we want to remove the photoresist. We'll use different masks when creating each of the different structures on the wafer. CMOS Fabrication 2 6.004 6.004 ## The etching process The exposed photoresist is removed with a <u>solvent</u>. The unexposed photoresist remains, masking portions of the underlying $SiO_2$ layer. A chemical etch is then used to remove the revealed silicon dioxide. Finally, the remaining photoresist is removed with a different solvent and we're left with pattern of insulating SiO<sub>2</sub> on top of exposed p-type substrate. ## Gate oxide & polysilicon Now a "thin" ( $20 \, \text{Å}$ ) layer of SiO $_2$ , called gate oxide, is grown on the surface. The gate oxide needs to be of high quality: uniform thickness, no defects! The thinner the oxide, the more oomph the FET will have (we'll see why soon) but the harder it is to make it defect-free. Coming soon to a fab near you: $12 \, \text{Å}$ gate oxide... On top of the thin oxide a 0.7u thick layer of polycrystalline silicon, called polysilicon or poly for short, is deposited by CVD. The poly layer is patterned and plasma etched (thin ox not covered by poly is etched away too!) exposing the surface where the source and drain junctions will be formed. Poly has a high sheet resistance of $20~\Omega/\text{sq}$ which can be reduced by adding a layer of a silicided refractory metal such titanium (TiSi<sub>2</sub>), tantalum (TaSi<sub>2</sub>) or molybdenum (MoSi<sub>2</sub>) => 1, 3 or $5~\Omega/\text{sq}$ . CVD? #### Source/drain diffusions Donor implants are used to create self-aligned MOSFET source/drain diffusions and substrate contacts. Usually As is preferred to obtain shallow N-type diffusions and minimal lateral diffusion. High doses are needed to make low resistance (25 $\Omega/\text{sq}$ ) diffusion wires. Afterwards a short thermal annealing step is performed to repair surface damage caused by the implantation. This completes the construction of the MOSFET itself. Now we'll add the metal wiring layers... Wires: metal interconnect Deposit SiO<sub>2</sub> insulation Etch openings for vias/contacts Deposit Al/Cu conductor Etch away Al/Cu leaving wires After a layer of $\mathrm{SiO}_2$ insulation has been deposited, aluminum or copper is deposited, patterned, then etched to form low-resistance (.07 $\Omega/\mathrm{sq}$ ) interconnect. With planarization of the $\mathrm{SiO}_2$ (a mechanical polishing step that creates a flat surface), multiple levels of metal interconnect are possible -- 6 to 8 layers are common in today's processes. 6.004 CMOS Fabrication 5 6.0C4 CMOS Fabrication 6 ## Multiple interconnect layers canera tons of engineering (MOS - the System? MOSFET - actual transistor? Looked it up and that is right (MOS - has = # of p and n MOSFETS way of designing this circles FET -transistor TOS adds includator so uses less current Field effect transistor - relies on electric field to control Shape and this conductivity of channel in a Semicondutor material Sauce - Carriers enter channel Diain - 11 leave Gate - controls VDS 1 Current is I6 (I need to relearn all my electricity terms) (I he was of willipedia extra into) (2) Remember Voltage = pressure Current - Flow Voltage d'offcience in electical potential WPIL to do againts static lectic field to move charge Electric (harge (covlomb) property of mater that causes mother to Experience a force when near other electrically Charged matter Current-flow of electic charge throug nedium (ampère/amp) Apprixondidat Idensister - Semiconductor to amplify + Swith electric signals ( Have I ever saggeted learned ??) at least 3 terminals Vo Hage or current to one puir of terminals changes Cullent flowing through ofer terminals If alpot 7 input can amplify E Pin Thinking of why needed - not just no (mal chieft) Making connections - FRA 'Coad let sentance closer) make complicated making connections - FRA 'Coad let sentance closer) make complicated mos-style systems So lot transistor has FET so that w/ field is most busic/simplest way not just most efficient Needs Semi and ctors ~1945 What before that? killed off low cost mass computers Replaced · electronechanical devices propose Mico controller easer than wilding basic cinic TOh so now I hinda condoctand how with several our pose pe important (epidithny) [This weekent I um learning a lot -but not getting P-Sets done) Vaccine tibes Compuison smaller, easer bo pril louvoi voltage Ne warm up higher reliably insetive for Shalay py no 2 1000 A high power, high freq broudcasting worse Vorerable to recideur explosions Vacuum tibes device that relies on flow of electrons current though a vacuum - amplification or switching - thermonic emission of electrons from a hot filament (cathod) to annode (plate) - add. electrodo can amplify & such - before that mechnical computers Semicondutor - material ul electic conductivity due to electron flow intermediate b/w that of a conductor and a insularor - Current carried by flow of electrons in wires here flow of electrons or flow of (1) charged holes -Silicen most common - add dopanb floles electron absence at as charge carrier - Elections need to fall into holes -" the lack of an electron" ANDA Farstoner Vrupsistar MOSFET N-type extinsic semicondutor praide extra conduction elatriphs to host material So excess of negitive (n-type) Charge Calliels (Renember 3.091) TBut abstractions there were very different P-type M (P= positive) in crease # free charge carriers takes away wealth bound other electrons Lleares behind holes Charge Carriers - à Free (mobile, enbound) partiele carriging Tlots of trem an electic charge - elections, ions Transistor #ability to send use small signal applied by one pair of terrinels to control much larger signal at under pair of teminals it gain - can be amplifer or swith - b'polar base Collector emitter gate source dain « (i so ist diff terms ?) VCC - power supply U+ bipda/ Pritter = goard (9) CMOS Wor Logic that uses p., n. Mosfers as building blocks - No Current & Flows (ideally) Go no power consunt - except when inpt switched - Connecting both n-, p- Dean gates /diains togeter - So one always connects, while the other does not - otherwise fire as then better zone conducts dots of current would flow Mogica Diode - two terminal electronic component / non linear current - voltage charistic - allows current to pass in I directlon (formerd) - while blocking 10 (athor) Gate - identized or physical device implemently a Boolean function - ideal i Zero (ise-time (time to go from low to Unlimited for out (10) fan at the ability of a logic gate out to dive a # of logic gate inputs of Same type = min ( [ Tout high ] [ Int low ]) - perfeti os in put impedience - "Seen" O output impaliance - exhibited impedance - meg sure of opposition to AC extends context of resistance to AC in DC impediance = cesistance logic gale - implemented w/ diodes or transistors areting as electic switches - can also do mech leally Liade logic -simplest AND, OR only -but no NOT/inveter RTL - loye - cesistor, transitor TTL - Francistor, Francitor Now MOSFETS APPOI NANO gate not and Out easy to construct of mes MOR LNAAVO Logic gates All AND OR NOT XOR (an le brilt from NAND XNOR - AND A.B A+B -NOT Vac Vac # NANO physically A, B=poly Notes Th do Connect base N connect B to God to keep reversely Biased Pv = Vn, keeps D + 5 is clated From B P connect B to VDP to keep PN reversioned But is corly dawn assumed N-pll down node to 6ND P-Pull up node to Von Inverter Stoady state Tour = Ipd Vin low - nfet off P fet flows "on" So Vat (earlies Vop # MASSACHUSETTS INSTITUTE OF TECHNOLOGY DEPARTMENT OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE #### 6.004 Computation Structures Lab #1 De 9/22 #### **General Information** Lab assignments are due on Thursdays; check the on-line course calendar for the actual due date for each lab. Look at the on-line questions before you start to see what information you should collect while working-on each lab. The on-line questions can be loaded from the "On-line assignments" page accessible from the navigation sidebar on the course website (<a href="http://6004.csail.mit.edu">http://6004.csail.mit.edu</a>). If you have difficulties, questions or suggestions with the on-line system please send email to 6004-labs@csail.mit.edu. You can visit each on-line question page as many times as necessary to complete the assignment – you do not have to answer all the questions in a single session. Click on the "Save" button at the bottom of the question page to save your answers. You can then come back to the page later on and pick up where you left off. You can also check your answers at any time by clicking on the "Check" button. When the system detects that all your answers are correct (either because of a "Check" or "Save"), it will give you credit for completing the assignment. To receive credit for a lab, you'll need to have a short lab checkoff meeting with a member of the course staff and answer some questions about your work. Just come by the lab after you've completed your check-in and talk with one of the on-duty staff. The meeting can happen after the due date of the lab but to receive full credit you must complete the meeting within one week of the lab due date. To avoid long waits choose a time other than Wednesday or Thursday evenings. The lab gets crowded just before an assignment is due and some of the problems are too long to be done the night before the due date, so plan accordingly. There will be course staff in the lab during the late afternoon and evening; check the course website for this semester's schedule. 7755331 The 6.004 lab is located in 32-083 and is open 24 hours-a-day, 7 days-a-week. An access code is required for entry; it was given out in the email that included your section assignment. The lab has Linux-Athena workstations that can be used to complete the homework assignments. The lab software is written in Java and should run on any Java Virtual Machine supporting JDK 1.3 or higher. The courseware can be run on any Linux-Athena workstation. You can also download the courseware for your Linux, Windows, or Macintosh computer – see the "Courseware" page on the 6.004 website. In this lab, we'll be using a *simulation program* (JSim) to make some measurements of an N-channel mosfet (or "nfet" for short). JSim uses mathematical models of circuit elements to make predictions of how a circuit will behave both statically (DC analysis) and dynamically (transient analysis). The model for each circuit element is parameterized, e.g., the mosfet model includes parameters for the length and width of the mosfet as well as many parameters characterizing the physical aspects of the manufacturing process. For the models we are using, the manufacturing parameters have been derived from measurements taken at the integrated circuit fabrication facility and so the resulting predictions are quite accurate. The (increasingly) complete JSim documentation can be found at the course website. But we'll try to include pertinent JSim info in each lab writeup. To run JSim, login to an Athena console. We recommend using the computers in the 6.004 lab (32-083) since JSim has been tested and is known to run with satisfactory performance in that environment. Another benefit of using the 6.004 lab is that there's plenty of help around, both from your fellow students and the course staff. After signing onto the Athena station, add the 6.004 locker to gain access to the design tools and model files (you only have to do this once each session): athena% add 6.004 Start JSim running in a separate window by typing athena% jsim & It can take a few moments for the Java runtime system to start up, please be patient! JSim takes as input a *netlist* that describes the circuit to be simulated. The initial JSim window is a very simple editor that lets you enter and modify your netlist. You may find the editor unsatisfactory for large tasks—it's based on the JTextArea widget of the Java Swing toolkit that in some implementations has only rudimentary editing capabilities. If you use a separate editor to create your netlists, you can have JSim load your netlist files when it starts: athena% jsim filename ... filename & Two limitations of the JSim editor are that Ctrl-S does not save and there is no undo. Say what! There are various handy buttons on the JSim toolbar: Ехіт Exit. Asks if you want to save any modified file buffers and then exits JSim. New file. Create a new edit buffer called "untitled". Any attempts to save this buffer will prompt the user for a filename. Open file. Prompts the user for a filename and then opens that file in its own edit buffer. If the file has already been read into a buffer, the buffer will be reloaded from the file (after asking permission if the buffer has been modified). Close file. Closes the current edit buffer after asking permission if the buffer has been modified. Reload file. Reload the current buffer from its source file after asking permission if the buffer has been modified. This button is useful if you are using an external editor to modify the netlist and simply want to reload a new version for simulation. Save file. If any changes have been made, write the current buffer back to its source file (prompting for a file name if this is an untitled buffer created with the "new file" command). If the save was successful, the old version of the file is saved with a ".bak" extension. Save file, specifying new file name. Like "Save file" but prompts for a new file name to use. Save all files. Like "save file" but applied to all edit buffers. Stop simulation. Clicking this control will stop a running simulation and display whatever waveform information is available. Device-level simulation. Use a Spice-like circuit analysis algorithm to predict the behavior of the circuit described by the current netlist. After checking the netlist for errors, JSim will create a simulation network and then perform the requested analysis (i.e., the analysis you asked for with a ".dc" or ".tran" control statement). When the simulation is complete the waveform window is brought to the front so that the user can browse any results plotted by any ".plot" control statements. Fast transient analysis. This simulation algorithm uses more approximate device models and solution techniques than the device-level simulator but should be much faster for large designs. For digital logic, the estimated logic delays are usually within 10% of the predictions of device-level simulation. This simulator only performs transient analysis. Gate-level simulation. This simulation algorithm only knows about gates and logic values (instead of devices and voltages). We'll use this feature later in the term when trying to simulate designs that contain too many mosfets to be simulated at the device level. Switch to waveform window. In the waveform window this button switches to the editor window. Of course, you can accomplish the same thing by clicking on the border of the window you want in front, but sometimes using this button is less work. Using information supplied in the checkoff file, check for specified node values at given times. If all the checks are successful, submit the circuit to the on-line assignment system. The waveform window shows various waveforms in one or more "channels." Initially one channel is displayed for each ".plot" control statement in your netlist. If more than one waveform is assigned to a channel, the plots are overlaid on top of each using a different drawing color for each waveform. If you want to add a waveform to a channel simply type the appropriate signal name in the list appearing to the left of the waveform display (the name of each signal should be on a separate line). You can also add the name of the signal you would like displayed to the appropriate ".plot" statement in your netlist and rerun the simulation. If you simply name a node in your circuit, its voltage is plotted. You can also ask for the current through a voltage source by entering "I(Vid)". The waveform window has several other buttons on its toolbar: Select the number of displayed channels; choices range between 1 and 16. Print. Prints the contents of the waveform window (in color if you have a color printer!). If you are using Athena, you have to print to a file and then send the file to the printer: select "file" in the print dialog, supply the name you'd like to use for the plot file, then click "print". You can send the file to one of the printers in the lab using "lpr", e.g., "lpr -Pcs foo.plot". You can zoom and pan over the traces in the waveform window using the control found along the bottom edge of the waveform display: What is The difference What there? zoom in. Increases the magnification of the waveform display. You can zoom in around a particular point in a waveform by placing the cursor at the point on the trace where you want to zoom in and typing upper-case "X". zoom out. Decreases the magnification of the waveform display. You can zoom out around a particular point in a waveform by placing the cursor at the point on the trace where you want to zoom out and typing lower-case "x". surround. Sets the magnification so that the entire waveform will be visible in the waveform window. The scrollbar at the bottom of the waveform window can be used to scroll through the waveforms. The scrollbar will be disabled if the entire waveform is visible in the window. You can recenter the waveform display about a particular point by placing the cursor at the point which you want to be at the center of the display and typing "c". The JSim netlist format is quite similar to that used by Spice, a well-known circuit simulator. Each line of the netlist is one of the following: A comment line, indicated by an "\*" (asterisk) as the first character. Comment lines (and also blank lines) are ignored when JSim processes your netlist. You can also add comments at the end of a line by preceding the comment with the characters "//" (C++- or Java-style comments). All characters starting with "//" to the end of the line are ignored. Any portion of a line or lines can be turned into a comment by enclosing the text in "/\*" and "\*/" (C-style comments). A continuation line, indicated by a "+" (plus) as the first character. Continuation lines are treated as if they had been typed at the end of the previous line (without the "+" of course). There's no limitation on the length of an input line but sometimes it's easier to edit long lines if you use continuation lines. Note that "+" also continues "\*" comment lines! A control statement, indicated by a "." (period) as the first character. Control statements provide information about how the circuit is to be simulated. We'll describe the syntax of the different control statements as we use them below. A circuit element, indicated by a letter as the first character. The first letter indicates the type of circuit element, e.g., "r" for resistor, "c" for capacitor, "m" for mosfet, "v" for voltage source. The remainder of the line specifies which circuit nodes connect to which device terminals and any parameters needed by that circuit element. For example the following line describes a $1000\Omega$ resistor called "R1" that connects to nodes A and B. R1 A B 1k Note that numbers can be entered using engineering suffixes for readability. Common suffixes are "k"=1000, "u"=1E-6, "n"=1E-9 and "p"=1E-12. 12. (So give is some still Lab#I to try 6.004 Computation Structures ### 2: Characterizing MOSFETs Let's make some measurements of an nfet by hooking it up to a couple of voltage sources to generate different values for $V_{GS}$ and $V_{DS}$ : We've included an ammeter (built from a 0v voltage source) so we can measure <u>IDS</u>, the current flowing through the mosfet from its drain terminal to its source terminal. Here's the translation of the schematic into our netlist format: ``` * plot Ids vs. Vds for 5 different Vgs values .include "/mit/6.004/jsim/nominal.jsim" Vmeter vds drain 0v Vds vds 0 0v Vgs gate 0 0v * N-channel mosfet used for our test M1 drain gate 0 0 NENH W=1.2u L=600n .dc Vds 0 5 .1 Vgs 0 5 1 .plot I(Vmeter) ``` ME The first line is a comment. The second line is a control statement that directs JSim to include a netlist file containing the mosfet model parameters for the manufacturing process we'll be targeting this semester. The pathname that's shown will work when running on Athena; if you're running at home you'll need to specify the directory where you downloaded the 6.004 tools. To make your life easier, you may want to copy nominal.jsim and other ".include" files into your Athena directory and use a relative pathname so that you don't have to modify your files when moving them back and forth. The next three lines specify three voltage sources; each voltage source specifies the two terminal nodes and the voltage we want between them. Note that the reference node for the circuit (marked with a ground symbol in the schematic) is always called "0". The "v" following the voltage specification isn't a legal scale factor and will be ignored by JSim – it's included just remind ourselves that last number is the voltage of the voltage source. All three sources are initially set to 0 volts but the voltage for the Vds and Vgs sources will be changed later when JSim processes the ".dc" control statement. We can ask JSim to plot the current through voltage sources which is how we'll see what $I_{DS}$ is for different values of $V_{GS}$ and $V_{DS}$ . We could just ask for the current of the VDS voltage source, but the sign would be wrong since JSim uses the convention that positive current flows from the 2: Characterizing MOSFETs what is a moster anywar three hat leaned three that the tend tlave hat leaned really tend this staff /=IR high to low - as normal positive to negative terminal of a voltage source. So we introduce a 0-volt source with its terminals oriented to produce the current sign we're looking for. The sixth line is the mosfet itself, where we've specified (in order) the names of the drain, gate, source and substrate nodes of the mosfet. The next item names the set of model parameters JSim should use when simulating this device; specify "NENH" to create an nfet and "PENH" for a P-channel mosfet ("pfet"). The final two entries specify the width and length of the mosfet. Note that the dimensions are in microns (1E-6 meters) since we've specified the "u" scale factor as a suffix. Don't forget the "u" or your mosfets will be meters long! You can always use scientific notation (e.g., 1.2E-6) if suffixes are confusing. The seventh line is a control statement requesting a DC analysis of the circuit made with different settings for the Vds and Vgs voltage sources: the voltage of Vds is swept from 0V to 5V in .1V steps, and the voltage of Vgs is swept from 0V to 5V in 1V steps. Altogether 51 \* 6 separate measurements will be made. The eighth and final line requests that JSim plot the current through the voltage source named "Vmeter". JSim knows how to plot the results from the dual voltage sweep requested on the previous line: it will plot I(Vmeter) vs. the voltage of source Vds for each value of voltage of the source Vgs—there will be 6 plots in all, each consisting of 51 connected data points. After you enter the netlist above, you might want to save your efforts for later use by using the "save file" button. To run the simulation, click the "device-level simulation" button on the tool bar. After a pause, a waveform window will pop up where we can take some measurements. As you move the mouse over the waveform window, a moving cursor will be displayed on the first waveform above the mouse's position and a readout giving the cursor coordinates will appear in the upper left hand corner of the window. To measure the delta between two points, position the mouse so the cursor is on top of the first point. Now click left and drag the mouse (i.e., move the mouse while holding its left button down) to bring up a second cursor that you can then position over the second point. The readout in the upper left corner will show the coordinates for both cursors and the delta between the two coordinates. You can return to one cursor by releasing the left button. We're now ready to make some measurements: On-line question 1A: To get a sense of how well the channel of a turned-on mosfet conducts, let's estimate the effective resistance of the channel while the mosfet is in the linear conduction region. We'll use the Vgs = 5V curve (the upper-most plot in the window). The actual effective resistance is given by $\partial V_{DS}/\partial I_{DS}$ and clearly depends on which $V_{DS}$ we choose. Let's use $V_{DS} = 1.2V$ . We could determine the resistance graphically from the slope of a line since tangent to the $I_{DS}$ curve at $V_{DS} = 1.2V$ . But we can get a rough idea of the channel resistance by determining the slope of a line passing through the origin and the point we resistance by determining the slope of a line passing through the origin and the point we chose on the $I_{DS}$ curve, i.e., 1.2V/ $I_{DS}$ . 0.5 - 0.7 0.0 0.0 0.0 Of course, the channel resistance depends on the dimensions of the mosfet we used to make the measurement. For mosfets, their I<sub>DS</sub> is proportional to W/L where W is the 6.004 Computation Structures -7- Lab#1 IDS ~ Width Lenght L tor each Vgs AL 1-1- width of the mosfet (1.2 microns in this example) and L is the length (0.6 microns in this example). When reporting the effective channel resistance, it's useful to report the *sheet sheet resistance*, i.e., the resistance when W/L = 1. That way you can easily estimate the effective channel resistance for size device by scaling the sheet resistance appropriately. Since W/L = 2 for the device you measured, it conducted twice as much current and has half the channel resistance as a device with W/L = 1, so you need to double the channel resistance you computed above in order to estimate the effective channel sheet resistance. Use the on-line questions page for this lab to report the value for $I_{DS}$ that you measured and the effective channel sheet resistance you calculated from that measurement. # On-line question 1B: Now let's see how well the mosfet turns "off." Take some measurements of $I_{DS}$ at various points along the $V_{GS}$ =0V curve (the bottom-most plot in the window). Notice that they aren't zero! Mosfets do conduct minute amounts of current even when officially "off", a phenomenon called "subthreshold conduction." While negligible for most purposes, this current is significant if we are trying to store charge on a capacitor for long periods of time (this is what DRAMs try to do). Make a measurement of $I_{DS}$ when $V_{GS}$ =0V and $V_{DS}$ =2.5V. Based on this measurement report how long it would take for a .05pF capacitor to discharge from 5V to 2.5V, i.e., to change from a valid logic "1" to a voltage in the forbidden zone. Recall from 6.002 that Q = CV, so we can estimate the discharge time as $\Delta t = C(\Delta V / I_{OFF})$ . So if our mosfet switch controls access to the storage capacitor, you can see we'll need to refresh the capacitor's charge at fairly frequent intervals. So Ves is power Source Vos is where measure at a year year W for annueter Armmeter Armmeter Ref C Toky ## 3: Gate-level timing The following JSim netlist shows how to define your own circuit elements using the ".subckt" statement: \* circuit for Lab#1, parts C thru F .include "/mit/6.004/jsim/nominal.jsim" \* 2-input NAND: inputs are A and B, output is Z .subckt nand2 a b z MPD1 z a 1 0 NENH sw=8 sl=1 MPD2 1 b 0 0 NENH sw=8 sl=1 MPU1 z a vdd vdd PENH sw=8 sl=1 MPU2 z b vdd vdd PENH sw=8 sl=1 \* INVERTER: input is A, output is Z .subckt inv a z MPD1 z a 0 0 NENH sw=16 sl=1 MPU1 z a vdd vdd PENH sw=16 sl=1 .ends B NAND 0 2 The give name like a tention The ".subckt" statement introduces a new level of netlist. All lines following the ".subckt" up to the matching ".ends" statement will be treated as a self-contained subcircuit. This includes model definitions, nested subcircuit definitions, electrical nodes and circuit elements. The only parts of the subcircuit visible to the outside world are its terminal nodes which are listed following the name of the subcircuit in the ".subckt" statement: .subckt name terminals... \* internal circuit elements are listed here .ends In the example netlist, two subcircuits are defined: "nand2" which has 3 terminals (named "a", "b" and "z" inside the nand2 subcircuit) and "inv" which has 2 terminals (named "a" and "z"). Once the definitions are complete, you can create an instance of a subcircuit using the "X" circuit element: Xid nodes... name ( Who - just because ? where *name* is the name of the circuit definition to be used, <u>id</u> is a unique name for this instance of the subcircuit and *nodes*... are the names of electrical nodes that will be hooked up to the terminals of the subcircuit instance. There should be the same number of nodes listed in the "X" statement as there were terminals in the ".subckt" statement that defined *name*. For example, here's a short netlist that instantiates 3 NAND gates (called "g0", "g1" and "g2"): Xg0 d0 ctl z0 nand2 Xg1 d1 ctl z1 nand2 Xg2 d2 ctl z2 nand2 The node "ctl" connects to all three gates; all the other terminals are connected to different nodes. Note that any nodes that are *private* to the subcircuit definition (i.e., nodes used in the deh subcircuit that don't appear on the terminal list) will be unique for each instantiation of the subcircuit. For example, there is a private node named "1" used inside the nand2 definition. When JSim processes the three "X" statements above, it will make three independent nodes called "xg0.1", "xg1.1" and "xg2.1", one for each of the three instances of nand2. There is no sharing of internal elements or nodes between multiple instances of the same subcircuit. So we can see that sometime It is sometimes convenient to define nodes that are shared by the entire circuit, including subcircuits; for example, power supply nodes. The ground node "0" is such a node; all references to "0" anywhere in the netlist refer to the same electrical node. The included netlist file nominal.jsim defines another shared node called "vdd" using the following statements: The example netlist above uses "vdd" whenever a connection to the power supply is required. The other new twist introduced in the example netlist is the use of symbolic dimensions for the mosfets ("SW=" and "SL=") instead of physical dimensions ("W=" and "L="). Symbolic dimensions specify multiples of a parameter called SCALE, which is also defined in nominal.jsim: .option SCALE=0.6u So with this scale factor, specifying "SW=8" is equivalent to specifying "W=4.8u." Using symbolic dimensions is encouraged since it makes it easier to determine the W/L ratio for a mosfet (the current through a mosfet is proportional to W/L) and it makes it easy to move the design to a new manufacturing process that uses different dimensions for its mosfets. Note that in almost all instances "SL=1" since increasing the channel length of a mosfet reduces its current carrying capacity, not something we're usually looking to do. W save effects We'll need to keep the PN junctions in the source and drain diffusions reverse biased to ensure that the mosfets stay electrically isolated, so the substrate terminal of nfet (those specifying the "NENH" model) should always be hooked to ground (node "0"). Similarly the substrate terminal of pfet (those specifying the "PENH" model) should always be hooked to the power supply (node "vdd"). What does Changing width do? Because ? With the preliminaries out of the way, we can tackle some design issues: # On-line question 1C: To maximize noise margins we want to have the transition in the voltage transfer characteristic (VTC) of the nand2 gate centered halfway between ground and the power supply voltage (3.3V). To determine the VTC for nand2, we'll perform a dc analysis to plot the gate's output voltage as a function of the input voltage using the following additional netlist statements: isinila to what ist did? Lab #1 6.004 Computation Structures \* dc analysis to create VTC Xtest vin vin vout nand2 Vin vin 0 0v where is Ward 2 1 4/1.65 // make measurements easier! Vol vol 0 0.3v // see part (D) Voh voh 0 3v .dc Vin 0 3.3 .005 .plot vin vout voh vol gate of part (C) be a legal member of the logic family? N=pall down P=pall up TCI Blace Combine this netlist fragment with the one given at the start of this section and run the simulation. To center the VTC transition, keep the size of the nfet in the nand2 definition as "SW=8 SL=1" and adjust the width of both pfets until the plots for vin and vout intersect at about 1.65 volts. Just try different integral widths (i.e. 9, 10, 11, ...). Report the integral width that comes closest to having the curves intersect at 1.65V. **On-line question 1D:** The noise immunity of a gate is the smaller of the low noise margin $(V_{IL} - V_{OL})$ and the high noise margin $(V_{OH} - V_{IH})$ . If we specify $V_{OL} = 0.3 \text{ V}$ and $V_{OH} = 3.0 \text{ V}$ , what is the largest possible noise immunity we could specify and still have the "improved" NAND (from () Hint: to measure the low noise margin, use the VTC to determine what $V_{IN}$ has to be in $\gamma$ order for V<sub>OUT</sub> to be 3V, and then subtract V<sub>OL</sub> (0.3V) from that number. To measure the high noise margin, use the VTC to determine what V<sub>IN</sub> has to be in order for V<sub>OUT</sub> to be 0.3V, and then subtract that number from V<sub>OH</sub> (3.0V). We've added some voltage sources corresponding to $V_{OL}$ and $V_{OH}$ to make it easier to make the measurements on the AS WW... VTC plot. NOTE: make these measurements using your "improved" nand2 gate that has the centered VTC, i.e., with the updated widths for the PFETS. Now that we have the mosfets ratioed properly to maximize noise immunity, let's measure the contamination time (t<sub>C</sub>) and propagation time (t<sub>P</sub>) of the nand2 gate. The contamination delay, t<sub>CD</sub>, for the nand2 gate will be a lower bound for all the t<sub>C</sub> measurements we make. Similarly, the propagation delay, t<sub>PD</sub>, for the nand2 gate will be an upper bound for all the t<sub>P</sub> measurements. Recall that the contamination time is the period of output validity after the inputs have become invalid. So for nand2: $t_{C-FALL}$ = time elapsed from when input > $V_{IL}$ to when output < $V_{OH}$ $t_{C-RISE}$ = time elapsed from when input $< V_{IH}$ to when output $> V_{OL}$ $t_C = min(t_{C-RISE}, t_{C-FALL})$ Similarly the propagation time is the period of output invalidity after the inputs have become valid. So for nand2: Lab #1 ### 3: Gate-level timing $t_{P\text{-RISE}}$ = time elapsed from when input $\leq V_{IL}$ to when output $\geq V_{OH}$ $t_{P\text{-FALL}}$ = time elapsed from when input $\geq V_{IH}$ to when output $\leq V_{OL}$ $t_{P}$ = max( $t_{P\text{-RISE}}$ , $t_{P\text{-FALL}}$ ) Following standard practice, we'll choose the logic thresholds as follows: $V_{OL} = 10\%$ of power supply voltage = .3V $V_{IL} = 20\%$ of power supply voltage = .6V $V_{IH} = 80\%$ of power supply voltage = 2.6V $V_{OH} = 90\%$ of power supply voltage = 3V You can use a voltage source with either a pulse or piece-wise linear waveform to generate test signals for your circuit. Here's how to enter them in your nedist: This statement produces a periodic waveform with the following shape: Don't forget to specify your times in nanoseconds (use an "n" suffix)! **Do not specify zero rise** and fall times since the simulation will probably fail to converge. To construct a piece-wise linear waveform, you need to supply a list of time, voltage pairs: The voltage will be v1 for times before t1, and vn for times after tn. # On-line question 1E: Replace the netlist fragment from (C) with the following test circuit that will let us measure various delays: \* test jig for measuring tcd and tpd Xdriver vin nin inv Xtest vdd nin nout nand2 Cload nout 0 .02pf Vin vin 0 pulse(3.3,0,5ns,.1ns,.1ns,4.8ns) ### 3: Gate-level timing ``` Vol vol 0 0.3v // make measurements easier! Vil vil 0 0.6v Vih vih 0 2.6v Voh voh 0 3.0v .tran 15ns .plot vin .plot nin nout vol vil vih voh ``` werent we trying > **NOTE:** make these measurements using your "improved" nand2 gate that has the centered VTC, i.e., with the updated widths for the pfets. We use an inverter to drive the nand2 input since we would normally expect the test gate to be driven by the output of another gate (there are some subtle timing effects that we'll miss if we drive the input directly with a voltage source). Run the simulation and measure the contamination and propagation delays for both the rising and falling output transitions. (You'll need to zoom in on the transitions in order to make an accurate measurement.) Combine as described above to produce estimates for $t_C$ and $t_P$ . ## On-line question 1F: We mentioned several times in lecture the desire to have our circuits operate reliably over a wide range of environmental conditions. We can have JSim simulate our test circuit at different temperature by adding a ".temp" control statement to the netlist. Normally JSim simulates the circuit at room temperature (25°C), but we can simulate the circuit at, say, 100°C by adding the following to our netlist: For many consumer products, designs are tested in the range of $0^{\circ}$ C to $100^{\circ}$ C. Repeat your measurements of part (E) at $100^{\circ}$ C and report your findings. Recompute your estimates for $t_C$ and $t_P$ indicating which measurement(s) determined your final choice for the two delays. Based on your experiment, if a Pentium 4 processor is rated to run correctly at 3Ghz at $100^{\circ}$ C, how fast can you clock it and still have it run correctly at room temperature (assuming $t_{PD}$ is the parameter that determines "correct" behavior)? This is why you can usually get away with overclocking your CPU—it's been rated for operation under much more severe environmental conditions than you're probably running it at! ## 4: CMOS logic-gate design As the final part of this lab, your mission is to design and test a CMOS circuit that implements the function $F(A,B,C) = C + A \cdot B$ using nfets and pfets. The truth table for F is shown below: | | Α | B. | C | F(A,B,C) | |----|---|----|---|-------------| | bi | 0 | 0 | 0 | 0 | | | 0 | 0 | 1 | the loss of | | | 0 | 1 | 0 | 0 | | | 0 | 1 | 1 | 1 | | | 1 | 0 | 0 | 0 | | | 1 | 0 | 1 | 1 1 1 W | | | 1 | 1 | 0 | 1 | | | 1 | 1 | 1 | al a lanc | Your circuit must contain no more than 8 mosfets. Remember that only nfets should be used in pulldown circuits and only pfets should be in pullup circuits. Hint: using six mosfets, implement the complement of F as one large CMOS gate and then use the remaining two mosfets to invert the output of your large gate. Enter the netlist for F as a subcircuit that can be tested by the test-jig built into lab1checkoff.jsim (a file that we supply and which can be found in the course locker). Note that the checkoff circuitry expects your F subcircuit to have exactly the terminals shown below – the inside circuitry is up to you, but the ".subckt F..." line in your netlist should match exactly the one shown below. ``` .include "/mit/6.004/jsim/nominal.jsim" .include "/mit/6.004/jsim/lab1checkoff.jsim" ``` ... you can define other subcircuits (eg, INV or NAND gates) here ... ``` .subckt F A B C Z ... your circuit netlist here .ends ``` lab1checkoff.jsim contains the necessary circuitry to generate the appropriate input waveforms to test your circuit. It includes a .tran statement to run the simulation for the appropriate length of time and a few .plot statements which will display the input and output waveforms for your circuit. For faster simulation, use the (fast transient analysis) button! ### 5: Checkoff When you are satisfied your circuit works, you can start the checkoff process by making sure your top-level netlist is visible in the edit window and that you've complete a successful simulation run. Then click on the green checkmark in the toolbar. JSim proceeds with the following steps: - 1. JSim verifies that a .checkoff statement was found when your netlist was read in. - 2. JSim processes each of the .verify statements in turn by retrieving the results of the most recent simulation and comparing the computed node values against the supplied expected values. It will report any discrepancies it finds, listing the names of the nodes it was checking, the simulated time at which it was checking the value, the expected value and the actual value. - 3. When the verification process is successful, Jsim asks for your 6.004 user name and password (the same ones you use to login to the on-line assignment system) so it can send the results to the on-line assignment server. - 4. JSim sends your circuits to the on-line assignment server, which responds with a status message that will be displayed for you. If you've misentered your username or password you can simply click on the green checkmark to try again. Note that the server will check if your circuit has at most 8 mosfets if it contains more, you'll see a message to that effect and your check-in will not complete. If you have any difficulties with checkoff, talk to a TA, or send email to 6004-labs@csail.mit.edu. Remember to schedule a lab checkoff meeting with a member of the course staff after you complete the on-checkoff and on-line questions. This meeting can happen after the due date of the lab but must be completed within one week of the lab's due date in order to receive full credit. ### 6.004 On-line: Questions for Lab 1 When you're done remember to save your work by clicking on the Save'button at the bottom of the page. You can check if your answers are correct by clicking on the Check'button. When entering numeric values in the answer fields, you can use integers (1000), floating-point numbers (1000.0), scientific notation (1e3), or JSim numeric scale factors (1K). #### Problem 1. A. Report the mosfet Ids you measured from the device curves for Vgs = 5V and Vds = 1.2V. Ids (in amps): Compute the effective channel sheet resistance using (Vds)/(Ids) as an estimate for the channel resistance of the test mosfet. Don't forget to correct for the W/L of the test device! Sheet resistance (in ohms): B. Report the mosfet Ids you measured from the device curves for Vgs = 0V and Vds = 2.5V. Ids (in amps): Calculate the time it would take to discharge a 0.05pF capacitor from 5V to 2.5V Discharge time (in seconds): C. Determine a scaled width (SW) for the two pullup mosfets so that the Vin and Vout curves for the NAND gate intersect at VDD/2 (1.65V). #### Scaled width: D. What is the largest noise immunity we could specify and still have the NAND gate qualify as a legal device? Please fill in your answer with a precision of .01 volts. Hint: to measure the low noise margin, use the VTC to determine what Vin has to be in order for Vout to be 3V, then subtract Vol (0.3V) from that number. To measure the high noise margin, use the VTC to determine what Vin has to be in order for Vout to be 0.3V, then subtract that number from Voh (3.0V). ### Maximum noise immunity (in volts): E. Following standard practice, choose the logic thresholds as follows: Vol = 10% of power supply voltage = 0.3V Vil = 20% of power supply voltage = 0.6V Vih = 80% of power supply voltage = 2.6V Voh = 90% of power supply voltage = 3.0V Measure the contamination and propagation times for your NAND gate using both the falling and rising output transitions. tC for falling output (in seconds): tp for falling output (in seconds): | | t <sub>C</sub> for rising output (in seconds): | |---------------------------------------------------|-------------------------------------------------------------------------| | | tp for rising output (in seconds): | | Use these measurements to com | pute estimates for an upper bound for tCD and a lower bound for tpD. | | | upper bound for tCD (in seconds): | | . Syr | lower bound for tpp (in seconds): | | | and tpD this time including the measurements you made at 100 degrees C. | | | Recomputed upper bound for tCD (in seconds): | | | Measurement used:select answer | | | | | | Recomputed lower bound for tpp (in seconds): | | | Measurement used:select answer | | Check Save | | | course on line questions on lab laugestions veloc | | Oh wiki w/ help - not much Sheet Resistance - Resistance of films unitorm in thickness, $W|_{L=2}$ is twice convert, but resistance Rs = P L - R L W Tscaling factor (A) Is Sheet resistance 814 MA = ,000 814 Vds W Ids L . 1.2 1.2 M = 2948 12 O 16 ) VGS = 0 V V05 = 2.51 I = 5.155 pA (1) Can't enter in 6ci notation How long for 185 pt (ap to discharge 5V+2.5V Q=(V) A f = ( AV) = .05p (5-2.5)5.155p= ,624 Sec (V) 10 This is a VTC Vat Oh the # ore weird to Woldt; 1.5 V' 3 V low noise margin VII - VOL Votl - Vru high Voti = 3V Vo1 = 13V VIL = we not fand = 1.54 1.456 VIH = 1,872 So low 1.456 1.156 Smaller 50 1.200 high 3-1,872: 1,128 Psing Smaller now O carding ara 10 vall = 1 er 13 = since worst case - oh they tell you no VIL since in 16 -td we nant i no have some time to see - gress 5 nc. And pulse length i Now plot a howill emailed in This has me stimpled Ship for now So pulse and pund does not alterate anything. F) Oh this was previous qui? - There was no qu Than what was that text before i. Where we used a pulse And never specified an atat!!! for palse - know the syntax So They did give specific Siredians Non measure What is diff n in and vin' Oh nin is output of an inverter + inpat nand ? Vin ITAN MANDE Moch Measure here - obsorb want for hand 2 N'in starts chaning 5.0392ns Nort starts 11 5.0856 ms 2 .0464 n = 11 tc No -not what supposed to do When be input 7 VII to when output I Voy fall VIY VOL Cise do both + take min Input 7 VIL= 5.0848 output L Vote = 5.112 ns Gall .0272 c smallest Inpt L VIH = 9,9464 oupt 7 Voi = 9,9896 rise = ,0434 = not somethical Think mixed rising talling 18272n = (ising offit () twheat is talling inpit) That this are to falling atput! talling output / living input 5.0824 5.112 .0296 0 masure error So did not mix iving / falling - Stumple 1 on answer to rise time inpt & In output 2 VOH max C> fall propagation time Tive time input $\leq IIL = 1.9837$ Gotput 7 VOH = 10.08 9 .0968n 0tall tine in ZIIH = 5.1254 Oct 06 L VOL = 5.1868 5.0552n0 If to fall input 7 VIL = 5.088 1.0312 emin 0 But correct weasurent used = to Rise (a) 250 Why? ? ? ? Oh - I think qu was put hest top for both temp ranges 10272n - tc rise 25° and it accepted it as close enough rise input & VIL 9.984) 1136 n Emax of all 0 out put 7 VoH fall input ZVIH inpt ZVIH 5,136 1,072 n (V) Telette complete: Online Qu (I shald lah at totaial problems) -MOSFET combo (cros) naty B AND a but how to do AND - can do everything w/ NANO, All do we reed some Ale Morgan's Algarasa cleverness' how had biase they made it for us'. So you can either do everything u/ lots of NANDS or a few more specific by hebi And is NANO + inverter I drew it really wrong This has 6 a design from scratch -no never leaved I don't think $$F = (A+B)$$ NAND NAND Note invert Idoes class have combosi hotes has $F = A \cdot (B+C)$ $F = A + (B \cdot C)$ Note that More basic AND/or palls. online —I see half of it So A.B A.I. A+B AIL JIB That is actually ceally obvious... But is only I side Man = Gnd Oxput So that is pell down and soutput u/ n P-type Vad Pullup so here to the d margan F = C. (A+B) at 1 so the not nears P-fefi So we want F = F? Or F=F? (150 just make up letters to connect? Evari (ant find M 2 What is that! Output?' - but what is F then? So cenamed F77 Z Runs now (X) Node veitication error Look closly at apput | A | B | ( | 1 F | | | | |------------|------------|-----|------|-------------------|---------------------------------------|-----| | $\Diamond$ | Ŏ | 0 | 0 | | ) | | | Q | 0 | l | 40 | $(\hat{\chi})$ | | | | Ø, | 1 | 0 1 | ,5 | (X) | twhy | ,5 | | 6 | ( \ | ( 4 | 0 | 8 | · · · · · · · · · · · · · · · · · · · | . 0 | | ) | 0 | 0 | 15 | (X) | | | | 1 | $\bigcirc$ | ( | | $\langle \rangle$ | | | | 1 | 1 | λ | 1 | | | | | 1 | 1 | Ì | \ \ | | | | | l | ι | | ( () | (X) | | | So what is wrong? May need to ask TA Well think about it -it & copens + Floods w/O PM day - conduts w/ Vgate is high at then plls Journ But from example d'id -but example out is F So add an inverter to out ext. F / We (xt (pre)) ilat ten I would have I Fets But at least get working on when V place high on when V brigh low Now Give the second of the complement of F is $F = \overline{(A + B)}$ before our new invalue. Ad [b] World A.B & A.B So actually F = (+ (A·B) Inverter 1 out Try that That's the exact sure! - oh go in to lab for help mon TA Help C A 0 40 () 0 HJ Actually had Vob this flict by I had the connection Tank his uses l just connect together + no inverter don't weed extra - Where dik I get that idea? Xgate (nputs) (outputs) name tvo 7 av subcet (name) (inputs) (outputs) have oppt Cruit $$F = \pi \left( + (A + B) \right)$$ $$AF = \left( , (A + B) \right)$$ - b4 | ) | | | | | | | | | |------------------|------------|------------|-------------|------|-------------------|----------|---------------|------------| | Try | A | B | | F | | | | | | | $\Diamond$ | $\Diamond$ | 9 | Ŏ | 0 | | | | | • | 1 | Ó | $\Diamond$ | kind | a 0 | e al | 1 tha | t matte | | | () | 1 | $\Diamond$ | 0 | 8 | | | rs close e | | | (5 | | 0 | | 0 | | | | | | ) | () | l | | $\bigotimes$ | \tp | erhups<br>+ v | I redd | | | G | | ι<br> | ' | $\langle \rangle$ | , | · W/0 | rig', | | | l | l | l | 1 | $\bigcirc$ | | | | | 11.0 | | | | | | | | | | Salve Dea | ein i | s da | wa ls | DJ | | | | | | Dia<br>Sa | MCC | • | ll | | | | | | | $\triangleright$ | | | | | | | | | | | am | to | wards<br>li | | 0 | <i>t</i> | | | | S | ovile | | K | | | | | | | That | did | not | help | cha | ng Ox | perany t | hing | | Changing to width 10 did hot help either I shall have used UDD Since it was like 3.3V che ched NAND or NOR -general purpose ANP Can't to Man AND in I staye tell him why By why can do a more complex c So why can't you not do it? No possible combos ( (an't get power to flow that may w/ r fire Can only do O logice blc it always switches 2 ls > 0 2000 > 1 I Prop delay ? width or I length Static power d'apations - very small V Checloff 1.004 Quit | Rever Lecture, 1-3 l. Basics of digital into - intro - Information - encoding into mution - Shanon limit logn (N) Néhocies M= man narrowed choices bits - Fixed length (= probable) - Octal 10,3720 - Hex, 0x, 7d0 - Into -2N-1 to 2N-1 -1 - When choices not = propable logz (Pi) lib - avg into from choice $\geq p$ ; $\log_2\left(\frac{1}{p_1}\right)$ - Compression - Huffman code Need to add redundary back in Hamming distance defect Dabit errors HD7D correct D-bit 11 HD 7 2D Lecture 2 Digital Abstraction encode data/ state Sonohow Using Voltage OP # of points in analog \* The analog degrades built on abstractions + contracts 1 Fobidden 1 Mg Combonational device - I or more lands - I or more atputy - Functional spec - timing spec upper land tod - time to get at Lots of broblen of voltage - (ross take, Pover Supply noise, sequential interfeasing Moise margins - can input, but not input So that moves our Vox Vol= lex legal at regions but not in Lecture 3 Mos - need gain 71 So non hirecity for noise marging - MUSFETS Source Loan Idrain L'ienient - Output valid when any combo of inputs sufficient to determain output has been valid to topp -so tolerates transitions/invalid heres 6.004 Problems Basus at into 1. Pog 2 (8/3) / 2. logn (52) rest (ard logn (51) last logn (1) 5 Alth condigation 3. X 011 \_ \_\_\_\_\_TNO N bits log 2 (N/3) log 2 (2N-3) cold not see this $= loy_2(2^3)$ = 3 bits (duh) 4. X is unknown 8-bit 86it HD =1 known 7/8 lights of other 44 but Lan't know which $log_2\left(\frac{8}{7}\right)$ 78 choices giver to 8 possible choices log 2 ( 256) = 5 A lot more trialy than looks. Top = all possible choices Bottom- ner possible choices Hoffman trees lic more lats to more into ie less likely to occur (These classes are all about tichs...) 2. Deade ABAEC 3. Tree A ( Did here in recitation Enor detection + correction Vid we do this in 6.004? Not ceal1 Did Single bit connection MD must be 3 Back to 2's complenent Negate by Ellip each bit + add 1 MSB -> LSMB They or not More to lecture 2 Combo devices Vol of 0 - No - device count do 0 15 V () M Did in Recitation Want largest pessible Vol 15 Complex math I did not try now Max noise marging Vol lon as posible Vot high " " Hod up derices so No largest noise margin 2 lool at VTC 4-2(N+1.5-1) a 5-2N So 5-3N Z 3 IF W = 23 Still don't know why we went though all that trable! Never leaned this method - well that is what studing is for No! Gain is not 71 Man did I was that up. I should have checked gain But then I should not have been ablet to daw boxes - ? does it have to be w/ connecting to constre VIH must be high enough to produce Vol - can produce Vol or less My So what obes that look like on graph? must be in this region below So my valve VIL= 1 Vout 3.25 (8) Not five All not positive on this Look carefully on test \bH = \IH +W Vol = Voc + N \* VIH Obreates Vol or less at & VIL of 1 VOH or more of Find N Think would see better it do example in recultion 1 go to 2 in a con fili Like for A just do tanillous # Always check gain VIH Thow did They get that T (i When gain LL? Weed to move on Complicated static displine au, trich A valid output does not imply a valid input Slope = rise >1 Du Ti KAR At 101 What is En So look at I and manually do A B Out nothing Closed But then hen to sungize ? About They want and in form Of combo AND DRS -(0+((A+B)) I didn't get what they were asking Pullup given BD -2 ors togetor (13) No l'in matters AL BIG It was 2 ANDORS in AND Selves I was thinking that -but lid not tollow thoug This is (A.O) + (Anna Brc) = F (Btc) · (A+D) = Fe they say this is F Well F $(B \cdot C) \cdot (\overline{A \cdot D}) = \overline{F}$ for pollday (B.() · (A.D) = F 3. (an circuit be used as (MSS gage? Yes - pull down + pull up is complerety I forgot what the rule was Nake it charge more qually -less items/levels But can it be done here? ? width of PEet Ttorgot was possible ans Wh Bit this also P capacitance But speed gain from I pullip more than Offsets slowdown P Capquiarue Can also I W of 2 NFET pulldowns Will I capacitance + speed up Inx The lecture slides say nothing about width .... So to speedup ? vieth P 124C Tend to think N > P Spedly I T wilth . (5) Out of time and cerien Implement gate Smale From Yothengen Wirent Having on both put Have 3 left make inverter Really Complex circuits -simplify lst (Mus Fab -optional # Qviz / Debreat After talking VTC - thinks did pretty well - figured it not - good straying - good straying - O.AMD & = 0! Hamming Dictance - Very bad and eall things leading on clearity - meed to review hot towns on Bits of into - still tricky, did oh But reliability / prior ECC bad Since stipped in raview Old never study 25 complement | 1 | 2 | /4 | |---|----|-----| | 2 | 10 | /13 | | 3 | 2 | /6 | | 4 | 7 | 17 | #### MASSACHUSETTS INSTITUTE OF TECHNOLOGY DEPARTMENT OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE #### **6.004 Computation Structures** Fall 2011 Quiz #1: September 23 2011 | Name MI | DL. | | Athena login name | Score | |------------------------------------|-----------------------------------------------------------------------------------------------------|-------------------------------------------------------|--------------------------------------------------|------------------------------------------| | Michael | Plasmoier | | Theplaz | | | Brad, 26-322<br>□ WF 10<br>□ WF 11 | Silvina, 34-303<br>☐ WF 11<br>☐ WF 12 | <b>Li-Shiuan</b> , 34-304<br>→ WF 12<br><b>X</b> WF 1 | Chris, 34-303 | David, 36-155 ☐ WF 2 ☐ WF 3 | | Notes: | | | | | | | IN YOUR NAME,<br>g on the problems o | | nd MARK YOUR SE | CTION ABOVE | | quiz pages. Fe | reference materials<br>el free to use page b<br>es provided for each | oacks as scratch space | of diagrams are provi<br>ce, but be sure to mark | ded on the backs of<br>a your answers on | | Problem 1 (4 point | s): Quickies and Tr | ickies | | * | | | | | s complement binary n | umber 11100? | | alid no | Decimal e | equivalent of two's co | omplement 11100: | X | | | | | delay that is greater tha | n its propagation | | not hat m | Can ted be | greater than t <sub>pd</sub> ? cir | rele one: YES Ca | AN'T TELL NO | | combinatio<br>to be a valid | nal device) without l<br>d combinational dev | being a <i>lenient</i> combinice without being leni | | ossible for an inverter | | any inpot > To | Apd valid<br>Aight a valid invert | er not be lenient? cir | rcle one: YES Ca | AN'T TELL NO | | (D) A properly-<br>well as som | designed CMOS gat<br>ne NFETs). What co | nclusion can you dray | ne output, and is built us v? | sing N-1 PFETs (as | | Von. | : It uses NFETs in its<br>: Its output is always<br>: Its output is indeper<br>: It is not lenient. | s pullup circuit. | of the N inputs. dependence in the Notice: C1 C2 | 65 - not always | | | . T.one of the above. | Ward Au | 1 (5 | <b>3</b> C4 <b>C</b> 5 | | 6.004 Fall 2011 | | -1 of 5 - | don I think so | Quiz #1 | # Problem 2. (13 points): UVWXYZ Coding A secretive government agency has a large amount of data that has been encrypted as strings of letters from the alphabet UVWXYZ. They need to transmit this data over a binary channel (i.e., one that communicates a sequence of 0's and 1's), and have asked for your help. | communicates a sequence of 0's and 1's), and have asked for your help. | |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | They ask you to design a fixed-length code that transmits each of the U, V, W, X, Y, or Z letters as a message codeword (binary string). | | (A) (1 point) Using the shortest possible fixed-length code, how many bits are transmitted for each message codeword (i.e., for each letter from the UVWXYZ alphabet)? | | Bits transmitted for each UVWXYZ message: bits | | You design a fixed-length coding that translates each of the six letters to a minimally-sized codeword, and present it to to the agency. Your contact there asks, perhaps to impress you with his technical acumen, what the minimal Hamming distance is between valid codewords in the fixed-length code you've supplied. | | (B) (1 point) What is the minimal Hamming distance between valid code words in your fixed- | | length code? | | Minimal Hamming distance between valid codewords: | | forgot up - was in B.DI | | Max Noyes, an agency spokesperson, is concerned about single-bit errors that may be occasionally be introduced during codeword transmission. He probes you about error-detecting characteristics of your simple fixed-length code. | | (C) (1 point) Can your code detect all single-bit errors? | | Exact details ( Collect L2D Since may be other Character | | In order to improve its error-handling characteristics, you add a single parity bit to each codeword. You choose to use odd parity, i.e. you set the parity bits to make the total number of 1's in each codeword odd. | | (D) (1 point) What is the minimal Hamming distance between valid code words in your fixed-length code with the added parity bit? | | Minimal Hamming distance between valid codewords: | | (E) (1 point) Can your revised code detect all single-bit errors? Can it correct them? | | Single-bit errors can be (Circle all that apply): DETECTED CORRECTED | Exploring options, Max asks what the minimum Hamming distance between valid codewords would have to be to allow for correction of both 2-bit and 1-bit errors. | to be to allow for corr | ection of both 2-bit a | nd 1-bit errors. | | | |------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------| | <b>(F)</b> (1 point) | What is your answer | ? What is the necess | sary Hamming distanc | e? | | | | | e to correct 2-bit erro | ors: | | A | B (but | tur ? | in better | XI | | Les Bitz is suddenly p | | | | nsists on abandoning | | | | | ocus on minimizing th | | | must be transmitted. each of the six letters | | | | | | Les provides you with | | | icy appear in strings to | o oc transmitted). | | | ontrest de transcritoro de monte de la constitución | | TOTAL STATE OF THE | , | | | Message | Probability | Info (bits) | | | | Ü | 1/2 | An(2) | $l_{\Lambda}\left(\frac{1}{p}\right)$ | | e : | V | 1/4 | X (4) | V | | | W, X, Y, Z | 1/16 | to (16) | | | The table gives the prof bits of information (G) (3 points | | ch message, has bee | n left blank. | terrecting the number | | | | | (C | omplete above table) | | ( <b>H</b> ) (1 point) | What is the average | amount of informati | on conveyed by each | message? | | 12 /2 | (2) + ty ln(4) | 4. la (16) Aver | age info per message | e: bits | | Responding to pressur<br>coding scheme you le | re from Les to reduce | the number of bits t | ransmitted, you reme | mber the Huffman | | (I) (2 points<br>For a W | 111 | | nany bits are transmit | | | | 0 0 | Codeword size | for U:; for V: | 2; for W: Y | | | V 0/1 | did not wild pr | operly | , | | | In your Huffman coossage? | ding scheme, what is | s the average number | of bits transmitted for | | | WXYZ | Average bits trai | nsmitted per messag | e: 125 bits | | | 1 | S | • | 1-1 | | | 2,1+4,2 | 1 / 4 | | • | | | COLUMN TO WAR I AND IN THE TAXABLE PARTY. | 100 1 | | | ## Scratch copies of diagram from problem 3: #### Problem 3. (6 points) Logical Negativism Logical Negativism, Inc is an MIT spinoff whose aim is to market a new logic family whose only component type is an inverter. Their motto, *Logic is NOT everything*, has convinced a venture group that this business plan has merit, and they are now well funded. Their founder, Ms. Anna Logge, has developed a device to be used as their inverter; its static voltage transfer curve is shown to the right. Anna is considering the choice of parameters by which LNI's logic family will represent logic values, and needs your help. The voltage transfer curve of a proposed inverter for a new logic family is shown to the right (spare copies of this diagram are on the back of page 3 of this quiz). Several possible schemes for mapping logic values to voltages are being considered, as summarized in the incomplete table below. Recall that Noise Immunity (last row) is defined as the lesser of the two noise margins. Complete LNI's table, by filling in missing entries. Choose each value you enter so as to maximize the noise margins of the corresponding scheme. If the numbers in a scheme can't be completed such that the LNI device functions as an inverter with positive noise margins, put an X in the entries for that column. LNI's Possible Logic Mappings: | 19 | Scheme<br>A | Scheme<br>B | Scheme<br>C | |-------------------|-------------|-------------|-------------| | $V_{OL}$ | .999 | 1- | 1 | | $V_{IL}$ | 18 | 0.7 | 1.5 | | $V_{IH}$ | 3.5 | 3.5 | 319 | | V <sub>OH</sub> | 3 | 18 | 4.5 | | Noise<br>Immunity | 5 | V | 15 | -2 -7 (complete table – 10 entries) Scratch copies of diagram from problem 4: C A B When both are Closed what GFOSEL DANDO = OG #### Problem 3. (7 points): Saga of Helena Handbasket Helena Handbasket, who barely passed 6.004, has been hired to design CMOS gates for Hapless Logic, Inc. Remembering something about PFETs in pullups and NFETs in pulldowns, her first design was a 3-input device whose circuit is shown to the right. Helena's intent was that these devices compute some useful 3-input Boolean function D=F(A,B,C); unfortunately, the devices don't seem to work as planned. To make matters worse, she had 1,000,000,000 of the devices fabricated (thinking that the order to the fab must, of course, be in binary rather than decimal). The defective devices, known within HLI as the Gates of Helena, have become a subject of ridicule. Helena has brought you in as a consultant. Your first task is to figure out how badly Helena blew the design of her 3-input logic device – in particular, whether it drives the output D to a valid logic level for every combination of the inputs A, B, and C. You may use the scratch diagrams on the back of the previous page for your work if you like. (A) (2 points) Are there logical (0/1) values of A, B, and C for which D is not driven at all? If so, give values for A, B, and C that leave D undriven; else write NONE. A, B, C values for undriven D, or "NONE": (B) (2 points) Are there logical values of A, B, and C for which D is pulled down and up simultaneously? If so, values for A, B, and C that cause such a conflict; else write NONE. A, B, C values for output conflict, or "NONE": Nora Nanda, Helena's assistant, suggests that the devices might be salvaged by using them to compute useful functions of fewer than 3 inputs. She proposes that a two-input function of X and Y, for example, might be computed by connecting each of the three inputs A, B, and C to either X, Y, or the logical constants 0 (ground) or 1 (vdd), and reading the output on D. (C) (1 point) Can Nora's approach be used to compute NAND of X and Y? If so, choose values (X, Y, 0, or 1) for each of A, B, and C such that D is NAND(X,Y); else circle NO. Choose A, B, C values or circle NO: A=\_\_\_; B=\_\_; C=\_\_; or NO (D) (1 point) Can Nora's approach be used to compute **NOR** of X and Y? If so, choose values (X, Y, 0, or 1) for each of A, B, and C such that D is NOR(X,Y); else circle NO. Choose A, B, C values or circle NO: A= ; B= ; C= ; or NO (E) (1 point) Can Nora's approach be used to compute **OR** of X and Y? If so, choose values (X, Y, 0, or 1) for each of A, B, and C such that D is OR(X,Y); else circle NO. Choose A, B, C values or circle NO: A=\_\_\_; B=\_\_\_; C=\_\_\_; or NO END OF QUIZ! by some inverters | 1 | / 4 | |---|-----| | 2 | /13 | | 3 | / 6 | | 4 | 17 | | | /30 | # MASSACHUSETTS INSTITUTE OF TECHNOLOGY DEPARTMENT OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE #### 6.004 Computation Structures Fall 2011 Quiz #1: September 23 2011 | Name | Solutions | An interest of a | Athena login nan | ne Score 30 | |------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------|---------------------------------|-----------------------------| | Brad, 26-322<br>□ WF 10<br>□ WF 11 | Silvina, 34-303<br>□ WF 11<br>□ WF 12 | Li-Shiuan, 34-304<br>☐ WF 12<br>☐ WF 1 | Chris, 34-303 ☐ WF 1 ☐ WF 2 | David, 36-155 ☐ WF 2 ☐ WF 3 | | Notes: | | garangan<br>Jaharangan dan | er er aufregli<br>eral oan dear | | | | FILL IN YOUR NAME, rking on the problems on | | nd MARK YOUR S | SECTION ABOVE | | quiz pages | ally, reference materials and the second sec | acks as scratch spac | | | | Problem 1 (4 | points): Quickies and Tri | ckies | | | | (A) What | decimal integer is represen | nted by the 5-bit two' | s complement binary | number 11100? | | | Decimal e | quivalent of two's co | omplement 11100: | -4 | | (B) Is it po<br>delay? | ossible for an inverter to h | ave a contamination of | delay that is greater t | han its propagation | | | Can t <sub>cd</sub> be | greater than t <sub>pd</sub> ? cir | rcle one: YES | CAN'T TELL NO | | combi | nure we saw that a 2-input national device) without be valid combinational devi | eing a lenient combin | national device. Is it | | | | Might a valid inverte | er not be lenient? cir | cle one: YES | CAN'T TELL(NO | | (D) A prop<br>well a | perly-designed CMOS gates some NFETs). What con | e has N inputs and on<br>nclusion can you dray | ne output, and is built | using N-1 PFETs (as | | | C1: It uses NFETs in its C2: Its output is always C3: Its output is indeper C4: It is not lenient. C5: None of the above. | 0. adent of at least one of | · | C3 C4 C5 | | | | Chele best ch | oice: C1 C2 | | ## Problem 2. (13 points): UVWXYZ Coding A secretive government agency has a large amount of data that has been encrypted as strings of letters from the alphabet UVWXYZ. They need to transmit this data over a binary channel (i.e., one that communicates a sequence of 0's and 1's), and have asked for your help. | They ask you to design a fixed-length code that transmits each of the U, V, W, X, Y, or Z letters message codeword (binary string). | s as a | |-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------| | (A) (1 point) Using the shortest possible fixed-length code, how many bits are transm each message codeword (i.e., for each letter from the UVWXYZ alphabet)? | itted for | | Bits transmitted for each UVWXYZ message: 3 | bits | | You design a fixed-length coding that translates each of the six letters to a minimally-sized cod present it to to the agency. Your contact there asks, perhaps to impress you with his technical a what the minimal Hamming distance is between valid codewords in the fixed-length code you's supplied. | cumen, | | <b>(B)</b> (1 point) What is the minimal Hamming distance between valid code words in yo length code? | ur fixed- | | Minimal Hamming distance between valid codewords: | 1 | | Max Noyes, an agency spokesperson, is concerned about single-bit errors that may be occasion introduced during codeword transmission. He probes you about error-detecting characteristics simple fixed-length code. | | | (C) (1 point) Can your code detect all single-bit errors? | | | Detect single-bit errors? Circle one: | YESNO | | In order to improve its error-handling characteristics, you add a single parity bit to each codew choose to use odd parity, i.e. you set the parity bits to make the total number of 1's in each code | | | (D) (1 point) What is the minimal Hamming distance between valid code words in yo<br>length code with the added parity bit? | ur fixed- | | Minimal Hamming distance between valid codewords: | 2 | | (E) (1 point) Can your revised code <i>detect</i> all single-bit errors? Can it <i>correct</i> them? | | | Single-bit errors can be (Circle all that apply): DETECTED CO | RRECTED | | | fax asks what the min rection of both 2-bit a | imum Hamming dista<br>and 1-bit errors. | nce between valid co | odewords would have | |----------------------------------------------------------------------------|---------------------------------------------------|-----------------------------------------------------------------------------------------------------------------|-------------------------------------------|-----------------------------------------------| | <b>(F)</b> (1 point | ) What is your answe | r? What is the necessa | ry Hamming distanc | ee? <b>5</b> | | | Minima | l Hamming distance | to correct 2-bit erro | ors: | | | | | | | | all worries about error<br>must be transmitted.<br>each of the six letters | or detection and correct<br>Les provides you with | roject and is determine ction, and wants to footh some key additional frequency at which the ally-filled table: | cus on minimizing the information: he kno | ne number of bits that bws the probability of | | | Message | Probability | Info (bits) | er arman er er jennale | | | U | 1/2 | 1 | | | | V | 1/4 | 2 | | | | W, X, Y, Z | 1/16 | 4 | | | of bits of information | n communicated by ea | the six messages. The ach message, has been by filling in the missi | left blank. | reflecting the number | | | | | (Co | omplete above table) | | <b>(H)</b> (1 point | t) What is the average | amount of informatio | on conveyed by each | message? | | | | Avera | nge info per messago | e: bits | | | | e the number of bits tr<br>se it to design an appr | | | | (I) (2 point<br>For a W | | oding scheme, how m | any bits are transmit | ted for a U? For a V? | | | | Codeword size f | for U:; for V: | ; for W:4 | | (J) (1 point each me | | ding scheme, what is | the average number | of bits transmitted for | | | | Average bits trans | smitted per message | e: bits | #### Problem 3. (6 points) Logical Negativism Logical Negativism, Inc is an MIT spinoff whose aim is to market a new logic family whose only component type is an inverter. Their motto, *Logic is NOT everything*, has convinced a venture group that this business plan has merit, and they are now well funded. Their founder, Ms. Anna Logge, has developed a device to be used as their inverter; its static voltage transfer curve is shown to the right. Anna is considering the choice of parameters by which LNI's logic family will represent logic values, and needs your help. The voltage transfer curve of a proposed inverter for a new logic family is shown to the right (spare copies of this diagram are on the back of page 3 of this quiz). Several possible schemes for mapping logic values to voltages are being considered, as summarized in the incomplete table below. Recall that Noise Immunity (last row) is defined as the lesser of the two noise margins. Complete LNI's table, by filling in missing entries. Choose each value you enter so as to maximize the noise margins of the corresponding scheme. If the numbers in a scheme can't be completed such that the LNI device functions as an inverter with positive noise margins, put an X in the entries for that column. LNI's Possible Logic Mappings: | ov Postintane | Scheme<br>A | Scheme<br>B | Scheme<br>C | |-------------------|-------------|-------------|-------------| | $V_{OL}$ | Х | Х | 1 | | $V_{IL}$ | Х | 0.7 | 1.5 | | $V_{IH}$ | 3.5 | Х | 3.5 | | V <sub>OH</sub> | 3 | Х | 4.5 | | Noise<br>Immunity | Χ | X | 0.5 | (complete table - 10 entries) #### Problem 3. (7 points): Saga of Helena Handbasket Helena Handbasket, who barely passed 6.004, has been hired to design CMOS gates for Hapless Logic, Inc. Remembering something about PFETs in pullups and NFETs in pulldowns, her first design was a 3-input device whose circuit is shown to the right. Helena's intent was that these devices compute some useful 3-input Boolean function D=F(A,B,C); unfortunately, the devices don't seem to work as planned. To make matters worse, she had 1,000,000,000 of the devices fabricated (thinking that the order to the fab must, of course, be in binary rather than decimal). The defective devices, known within HLI as the Gates of Helena, have become a subject of ridicule. Helena has brought you in as a consultant. Your first task is to figure out how badly Helena blew the design of her 3-input logic device – in particular, whether it drives the output D to a valid logic level for every combination of the inputs A, B, and C. You may use the scratch diagrams on the back of the previous page for your work if you like. (A) (2 points) Are there logical (0/1) values of A, B, and C for which D is not driven at all? If so, give values for A, B, and C that leave D undriven; else write NONE. A, B, C values for undriven D, or "NONE": B=0, C=1 (A=1 or 0) (B) (2 points) Are there logical values of A, B, and C for which D is pulled down and up simultaneously? If so, values for A, B, and C that cause such a conflict; else write NONE. A, B, C values for output conflict, or "NONE": Nora Nanda, Helena's assistant, suggests that the devices might be salvaged by using them to compute useful functions of fewer than 3 inputs. She proposes that a two-input function of X and Y, for example, might be computed by connecting each of the three inputs A, B, and C to either X, Y, or the logical constants 0 (ground) or 1 (vdd), and reading the output on D. (C) (1 point) Can Nora's approach be used to compute **NAND** of X and Y? If so, choose values (X, Y, 0, or 1) for each of A, B, and C such that D is NAND(X,Y); else circle NO. Choose A, B, C values or circle NO: A= X; B= Y; C= 0; or NO (D) (1 point) Can Nora's approach be used to compute **NOR** of X and Y? If so, choose values (X, Y, 0, or 1) for each of A, B, and C such that D is NOR(X,Y); else circle NO. Choose A, B, C values or circle NO: A = X; B = 1; C = Y; or NO (E) (1 point) Can Nora's approach be used to compute **OR** of X and Y? If so, choose values (X, Y, 0, or 1) for each of A, B, and C such that D is OR(X,Y); else circle NO. Choose A, B, C values or circle NO: A=\_\_\_; B=\_\_\_; C=\_\_\_; or NO END OF QUIZ!