Reprinted
by permission from VLSI
Research
The
Cook's Tour: AMD's only microprocessor fab is in Dresden, Germany. It
is called Fab 30. The cleanroom is sized at 14K meters square and has
a capacity of 20K WSM (Wafer Starts per Month). It was named fab of the
year in 2001 by Semiconductor International. Today, many would say that
it is just another 200mm fab. But dig deeper and you will find it's quite
special. AMD has systematically matched and even exceeded the best with
as little as a tenth the resources. Others have spent far more and most
didn't come close. The results are pretty impressive. AMD is the only
company to field a competitive commercial microprocessor. AMD was the
first to have copper in production with yields above aluminum (something
that many were still struggling to do four years later). It was the first
to bring SOI to high volume microprocessor production. It has also kept
up with the unmatched scaling needs of microprocessor transistors – on
average, introducing a new transistor node every two to three months.
The factory they've built here holds many of the secrets to AMD's competitiveness.
So I'm going to delve deeper than the normal fab tour.
How do you compete head on with the world's largest microprocessor company?
It starts with people. At AMD it's all about the people. It isn't just
some fluffy mission statement on conference room posters; it's real to
the core here. You can sense a lot of positive energy among the people
at AMD when you're in Dresden. One way to tell is the degree to which
quality discipline pervades the organization. A good reflection of this
that I found inside AMD's fab was that tool boxes and PM spares kitting
were a model of orderliness. It was easy to find poke yoke* techniques
applied throughout. Instructions and procedures were well documented and
readily available inside the fab. People have to understand and want to
do this for it to be successful.
Morale is essential and it is running very high at AMD Saxony. One reason
is that the facility has created 2000 direct jobs in a region that was
very poor after the fall of the Berlin Wall. 40% of the people working
at AMD Dresden have university degrees. More importantly, 30% of them
were previously unemployed. Average blue collar worker wages for Dresden
have risen 16.2% since 1997 and white collar salaries are up 23.6%. So
there has been a significant benefit to the region as semiconductor production
was introduced by Infineon first and then AMD. But don't think this is
just something happening in Dresden. AMD has a long history of having
highly charged-people.
Nevertheless, people are not everything. Thinking that your people are
more motivated, or better, than your leading competitors can be a fatal
mistake. It is what you do differently with them that counts. It's the
break-away strategies they develop and the degree to which they are empowered
to turn vision into execution.
AMD's strategy has always centered on being agile at the leading edge.
There is a corporate culture here that centers on speed. This had been
strengthened with Hector Ruiz coming to the helm because he is more hands-on
when it comes to operations. It's not just time-to-market, it's speed
everywhere. Within the factory, the focus is substantially different than
the classical Management By Objective focus (MBO). One way that AMD gains
its speed is in what it calls Management By Exception (MBE). They architect
the objectives into the system so that they don't have to worry about
these and then they focus on the exceptions. System is not just a word
here. It is a real computer network that executes objectives automatically.
At AMD, computers are used to automatically make the no-brainer decisions
– that leaves time for the humans to focus on the exceptions. Another
tool AMD has developed is something they call value stream mapping, which
focuses on the value of the product as it progresses down the line. These
tools are designed to quickly contextualize what's happening in the market
with a manufacturing response. The result is much faster decision cycle
times (DCTs) throughout the company.
One thing about having one fab is that you must do development in the
same fab you're doing production. That means adding new processes and
materials right in the middle of everything. A factory will be brought
to its knees without superb controls. Just to give you an idea of how
massive this job is, last year AMD routinely ran 2.4M MES transactions
per day; 120K dispatch queries per day, and 15K carrier moves per day.
And these numbers are growing constantly. What's amazing is that the number
of masks has grown by over 50% and the fab is still chunking out the same
20K WSM it was designed to do almost ten years ago. So there has been
a huge productivity benefit extracted. These benefits are the key reasons
why AMD chose to develop its own intelligent factory automation. It is
a system that far exceeds that of anything commercially available.
AMD's Automated Precision Manufacturing (APM) is one of the cornerstones
of its ability to compete against more modern 300mm fabs. It does this
not only by enabling greater efficiency, it also enables higher baseline
yields, better binning, and faster yield learning, all of which drives
product performance. APM is essentially a massive factory control software
suite. Its IP portfolio is rich, with AMD having more than 250 granted
and pending patents. To understand why it is so powerful, consider that
it was architected with the following goals:
- Enable flexibility and modularity of WIP, tool, and recipe to deal
will many different products, rapid technology change, and aggressive
process development in a production fab.
- Enable fast lot movement and intelligent scheduling that shortens
learning cycles in spite of growing masks by being able to set priorities,
schedules, and tool loading dynamically.
- Enable tight parameter control despite quick changes, process mix,
and fast yield learning for new processes by building in control solutions
that have a sturdy framework architecture.
- Enable rapid yield learning and excursion suppression with fully integrated
engineering data collection and analysis.
- Automate decision making to a highly integrated level.
- Architected to be easily modified and improved upon continuously.
These goals are still in place today – lighting the path for what are
continuous improvements to APM. Change is constant at AMD and they average
65 software changes per week. It's not just the software that is changing,
as they also make an average of 25 process flow changes per week and 1000
recipe changes per week. Being able to cope with this level of change
proves the robustness of APM as well as highlighting the agility it brings
to a company proud of being fleet of foot.
AMD's APM spans across and integrates product performance targeting,
process control, production scheduling, yield management systems, and
equipment performance optimization into one common IT system. What I found
particularly impressive in its APC module (Advanced Process Control) is
that it automates wafer level recipe control, fault detection, and e-diagnostics;
along with integrating a next generation SPC (Statistical Process Control)
system and fab-wide control system. 50% of the tools have fully automated
APC control loops; 10% have wafer level recipe control; and 60% have controls
to ensure that the equipment configuration is constant.
The depth and robustness of APM's SPC is superb as they have about 22K
active SPC charts. These are rolled over on a continuous basis, as 100%
of the tools have automated SPC data setup and collection. AMD adds about
30 new SPC charts every day. These are not just charts for a human to
look at, systems watch and act on them as well. About half of the SPC
charts can automatically put a lot on hold and a third can bring the tool
off line (AMD calls this "automated lot-on-hold" and automated tool down”).
Roughly 10% can do both. One of the more subtle benefits of this is that
AMD has been able to extend out times between preventative maintenance
(PM) and also been able to better integrate PM schedules with production
schedules for greater efficiency.
There is also a YMS module (Yield Management Systems), which is an analysis
tool that integrates data mining and allows chaining of engineering data
from different tools. In other words, it allows engineers to bring together
disparate data sets automatically into one common analysis platform. What
I saw is a process engineer's dream. AMD's YMS has evolved from wafer
level analysis to the die level and is now moving to a level of capability
that is sub-die. While similar, YMS is fundamentally different from SPC
in that SPC is used to control equipment, while YMS is used to diagnose
problems and improve yield. YMS is much more of a machine-to-human interface.
It serves the needs of three different groups: product line engineering,
fab product engineering, and fab-wide contamination management. There
is also a tiered structure to deal with yield issues at the module level,
fab-wide, and at the product level. There are also two key monitoring
points: process and equipment. Dealing with these widely disparate needs
in a single system is very difficult, but the benefits are less duplication
of effort and wider availability of data. AMD's YMS eliminates the data
silos that form, too often, between groups.
The IPS module (Integrated Production Scheduling) has real time dispatching
of wafers and active scheduling. In the latest version, the scheduling
will be predictive and agent based. Each lot will be its own agent that
carries with it all the information needed to finish, including multiple
levels of priorities. Agent Based Scheduling (ABS) is formed around scheduling
agents and processing agents. These agents are independent entities that
serve and look out for their owner in the system. The agents can represent
lots, machines, PMs (Preventive Maintenance schedules), resources, etc.
The power comes from collective behavior of these agents. So if sales
or engineering have a hot lot – even the ones that just have to go in
today that didn't exist yesterday – it can simply be entered in and the
system dynamically makes all changes needed to schedules and tool loading.
Think of ABS like an anti-lock braking system for factories that keeps
the company from skidding off the road when the market slams on the brakes.
Better yet, in this case the four wheels act autonomously, while working
towards a common goal. It's all real time, so managers don't have to wait
to see what the impact on other lots will be. More importantly, because
it is ABS it won't allow the new hot lot to override ones with higher
priority. The agents for the latter will over-rule the agents for the
hot lot automatically. These will still meet their deadlines with no human
intervention. Now that's automated decision making and it's really cool.
Put all of this together and it brings even more power to AMD's competitive
position. APM has enabled them to add more Internal Device Versions (IDVs),
which in turn opened the door to massive differentiation. With APM, AMD
has created an operating environment requiring that every single lot finish
different from the device it started out as. They can change part numbers
mid-process, anywhere in line. AMD has linked process recipe contexting
with production planning so that they have modular routes that break up
into reusable chunks. Modular routes allow for rapid implementation of
new technology flows that are variations of prior flows. That is a fundamental
change because it allows much better responsiveness to customer needs.
The old formula was that when markets changed, a chip maker started more
wafers and scrapped the ones in-process that weren't needed or finished
them and sold them at a loss at fire sale prices. A wafer typically started
and finished as the same part number. There were gate arrays, which allowed
the last few interconnect mask levels to change, but these proved extremely
wasteful of silicon. What AMD has done is drive the change to the process
module level, as well as the functional block level. The important point
is that they can change a wafer to a different part number mid-stream
in the process. This makes them far more responsive to the market as well
as being far more efficient, far less wasteful, and ultimately far more
profitable.
How this is done goes back to AMD's value stream mapping focus and how
they quickly contextualize what's happening in the market with a manufacturing
response. As market opportunities emerge, the key is rapid alignment of
AMD's production plan to the evolving needs. They first must decide what
is needed, which is done with direction from Sales & Marketing via inputs
from the OEM. They then prioritize, which is done with a combination of
risk assessment, evaluation of market factors, and management input. This
is then overlaid on their manufacturing position, which is an evaluation
of what they have using manufacturing data. From this comes the decision
point of what to build, which creates a new build plan that is inputted
and out of which comes a new set of data that describes what can be built
given the new plan, the resources at hand, and the Work In Progress (WIP).
This is then turned over to a profit & loss planning process, out of which
the final decision as to what they will commit is made. This entire decision
cycle can go though a full revolution in 24 hours. To me, when I consider
the companies that can't change the coffee brand in the cafeteria this
fast, this is amazing. Any CEO will understand what this means to their
business competitively and profit-wise.
As for the future, AMD's upcoming 300mm fab will integrate automated
material movement with all these modules. The level of change going on
here shows that APM is not something that will ever be an off-the-shelf
suite of software. I've watched APM develop quietly from a vision I first
heard about almost twenty years ago. A lot of credit has to be given to
the development team, which executed on a vision established so long ago.
At the same time, a lot of credit has to go to management for sticking
with the vision for so long. I'm sure it hasn't been easy. But make no
doubt about it: APM is a core strategic strength of AMD.
* Poke Yoke is a quality technique developed by Toyota to make the subtle
sublime. With it, simple human error is virtually eliminated. The classic
example is bolting wheels on. Toyota put a staging tray and a robot between
the assembler and the open box of nuts. Every time a car rolled up, the
robot placed five nuts on the staging tray next to the assembler. That
way, if the operator was interrupted and forgot that some nuts had not
been installed, it would be patently obvious because some leftover nuts
would still be in the staging tray. Toyota did this across its assembly
stations and quality soared.
|
|
This e-mail report is confidential and for the use of VLSI Research Inc
and its Clients only. Feel free to distribute it within your company.
We only ask that you not send it outside your organization without VLSI
Research Inc's permission. This report contains VLSI Research's analysis
of information which in some cases has been furnished to VLSI Research
by responsible persons on the understanding that it will be released by
VLSI Research only to a limited audience and not be made widely available
or that its release will be restricted due to its confidential nature.
We receive letters and e-mails on current topics that are of interest
to our subscribers and comments on our newsletter articles. We value that
subscriber input and like to use it. By submitting such material to us,
you authorize us to publish and republish it in any form or medium, to
edit it for style and length, and to comment upon or criticize it and
to publish others' comments or criticisms, as the case may be. Publication
of this report is not intended to constitute a disclosure to the public
of the information contained in this report. Although data was obtained
from sources considered reliable, it cannot be guaranteed. No independent
steps have been taken to confirm its accuracy, truthfulness, or completeness.
Printed in the United States of America. All rights reserved. No part
of this publication may be used in any legal proceedings nor may any information
contained herein be disclosed to any third party, or reproduced, or transmitted
to any third party, in any form or by any means -- mechanical, electronic,
photocopying, duplicating, microfilming, videotape, verbally or otherwise
-- without prior written permission of VLSI Research.
Copyright © 2004 by VLSI Research Inc
|