We’ll Have to Reduce Test-and-Punish. Talking about Social Emotional Learning Isn’t Enough

Silly me!  I didn’t realize until a couple of weeks ago that SEL is a thing.  SEL is a new term in educational circles: Social Emotional Learning.  I heard Linda Darling-Hammond—Stanford University emeritus professor, CEO of the Learning Policy Institute, and chair of an Aspen Institute National Commission on Social, Emotional, and Academic Development—present the work of the commission, and then I started reading more about Social Emotional Learning (SEL).

It would appear that many of the educational academics promoting SEL are doing so in an effort to shift our schools’ focus away from the incessant drilling on basic language arts and math that has been driven by the high-stakes testing embedded in the 2002 No Child Left Behind Act (NCLB).  NCLB and Race to the Top, which compounded NCLB’s punitive grasp on our public schools, have created fear-driven pressure to raise scores at any cost. The stakes are high: Schools have been closed or charterized, teachers fired or their salaries cut, and school districts trapped in state takeover.  And worse—in terms of the social and emotional health of children—students whose reading scores are too low at the end of third grade have been retained in grade for an extra remedial year.

The Learning Policy Institute has been intent on helping state education departments take advantage of the way the 2015 Every Student Succeeds Act (ESSA) tweaks accountability.  ESSA eliminates direct federal punishments for low test scores by turning accountability over to states, but it says states must have their own plans to hold public schools accountable.  Beyond the required reporting of test scores and graduation rates, states can now add new factors, as long as the new factors are research-based. For example, the Learning Policy Institute has been explaining how research backs up the establishment of wraparound Community Schools.  Its publications have shown states how to demonstrate through research that Community Schools are worthy of inclusion in states’ dashboards of factors by which schools can be judged and held accountable.

Now, it would appear that Darling-Hammond’s support of Social Emotional Learning, through her leadership on the Aspen SEL Commission, is an attempt to help states position SEL as a factor in their Every Student Succeeds dashboards by which schools can be held accountable.  In Education Week a year ago, after Aspen released coverage of its new SEL Commission, Evie Blad reported: “The new federal education law requires schools to report new factors, like chronic absenteeism rates, in their public report cards, and it requires states to broaden how they measure school success.  No state decided to include direct measures of social-emotional learning in its accountability system.  Most cited cautions from researchers who’ve said existing measures are not sophisticated enough to be used for high-stakes purposes.  But mindfulness of students’ emotions, relationships, and development can help schools show improvement in other areas covered by the law, like attendance and achievement, commissioners said.”  The Aspen Commission, we should assume, hopes its new report will beef up the research base on SEL.

I suppose it’s worth establishing a research base to support education of the whole child if in some way measuring SEL will help states be more humane in evaluating what is being accomplished at school.  However, it is also essential to remember that the Every Student Succeeds Act makes two other factors primary in the states’ ESSA accountability reports: standardized test scores and high school graduation rates.  I wonder if inserting Social Emotional Learning right on top of test-and-punish doesn’t merely represent a contradiction in strategies. And figuring out metrics by which a state can judge how a district is doing at SEL and then holding schools accountable for SEL in the state’s accountability system seems bizarre.

Some of the puzzling language in the Aspen Institute Commission’s report is about showing states and school districts how to measure SEL so that it will count for school accountability: “Develop and use measures to track progress across school and out-of-school settings, with a focus on continuous improvement rather than rewards and sanctions.”  So far the advice seems pretty positive compared to what we’re doing now, which is focusing on rewards and sanctions. But the report later vaguely suggests some kind of measurable outcomes: “Use a broader range of assessments and other demonstrations of learning that capture the full gamut of young people’s knowledge and skills… Use data to identify and address gaps in students’ access to the full range of learning opportunities in and out of school.”

Recently in his personal blog, the writer and UCLA education professor Mike Rose raised concerns about Social Emotional Learning: “(D)o we need all these studies to demonstrate what any good teacher knows: that the nature and quality of the relationship between teachers and students matter?… More broadly I worry that as we pay needed attention to the full scope of a child’s being, we will inadvertently reinforce the false dichotomy between thought and emotion.”

Rose harks back to a piece he wrote in 2013 in which he worried that, “Under No Child Left Behind and Race to the Top, cognition in education policy has increasingly come to mean the skills measured by standardized tests of reading and mathematics.  And as economists have gotten more involved in education, they’ve needed quantitative measures of cognitive ability and academic achievement for their analytical models….”  Rose worries about dividing education into a “cognitive/non-cognitive binary.”  “The problem is exacerbated by the aforementioned way economists carve up and define mental activity.  If cognition is represented by scores on ability or achievement tests, then anything not captured in those scores—like the desired qualities of character—is, de facto, non-cognitive.  We’re now left with a pinched notion of cognition and a reductive dichotomy to boot.”

For Rose, social and emotional work must be an essential part of every teacher’s daily practice—and something children learn in their experience of schooling. In an excellent 2014 article published by The American Scholar, Rose describes the characteristics of the best classrooms he visited on a journey across the United States to research his fine book, Possible Lives: “For all the variation… the classrooms shared certain qualities… The classrooms were safe. They provided physical safety, which in some neighborhoods is a real consideration.  But there was also safety from insult and diminishment… And there was safety to take intellectual risks… Intimately related to safety is respect, a word I heard frequently during my travels.  It meant many things: politeness, fair treatment, and beyond individual civility, a respect for the language and culture of the local population… Respect also has a cognitive dimension.  As a New York principal put it, ‘It’s not just about being polite—even the curriculum has to be challenging enough that it’s respectful.’  Talking about safety and respect leads to a consideration of authority… A teacher’s authority came not just with age or with the role, but from multiple sources—knowing the subject, appreciating students’ backgrounds, and providing a safe and respectful space.  And even in traditionally run classrooms, authority was distributed.  Students contributed to the flow of events, shaped the direction of discussion, became authorities on the work they were doing.  These classrooms, then, were places of expectation and responsibility… (O)verall the students I talked to, from primary-grade children to graduating seniors, had the sense that their teachers had their best interests at heart and their classrooms were good places to be.”

The people who are trying to make Social Emotional Learning part of states’ Every Student Succeeds accountability dashboards undoubtedly have good intentions. They are trying, once again, to make normal child development and attention to the needs of the whole child primary goals in America’s public school classrooms.  Unfortunately, however, because standardized test scores and high school graduation rates—both highly measurable data sets—remain at the very center of ESSA’s federal demand for school accountability, Social Emotional Learning will always be on the side.

To improve the social and emotional climate in our schools today, we’ll need to go after what is really the problem—what Harvard’s Daniel Koretz calls “the testing charade.”

Test-and-Punish Just Hangs on as Failed Education Strategy

ESSA, the Every Student Succeeds Act of 2015, is like an old, altered jacket, now frayed at the cuffs. The fabric was never really good in the first place and, when the jacket was made over, the alterations didn’t do much to improve the design. Not much noticed at the back of the closet, the jacket sags there. But it would take too much energy to throw it away.

Pretty much everybody agrees these days that the 2001 school “reform” law, No Child Left Behind, was a failure. The Secretary of Education, Betsy DeVos, went to the American Enterprise Institute the other day and criticized the education policies of George W. Bush and Barack Obama.

And at the other end of the political spectrum, on January 8, 2018, the 16th anniversary of the day President George W. Bush signed No Child Left Behind, Diane Ravitch declared, “NCLB, as it was known, is the worst federal education legislation ever passed by Congress.  It was punitive, harsh, stupid, ignorant about pedagogy and motivation, and ultimately a dismal failure… The theory was simple, simplistic, and stupid: test, then punish or reward.”

In December, 2015, Congress made over No Child Left Behind by passing the Every Student Succeeds Act.  While the law reduces the reach of the Secretary of Education and requires that the states instead of the federal government develop plans for punishing the so-called “failing” schools, ESSA, as the new version is called, keeps annual standardized testing and perpetuates the philosophy that the way to make educators raise test scores faster is to keep on with the sanctions.  ESSA remains a test-and-punish law.

But now it seems ESSA is going out of use like that old, remade jacket. The states, as required, have churned out their ESSA school improvement plans and submitted them to the U.S. Department of Education, and Betsy DeVos’s staff people have been busy approving them—in batches.  This week the Department approved a batch of eleven such plans—from Arkansas, Maryland, Missouri, New York, Ohio, Pennsylvania, Puerto Rico, South Dakota, Washington, Wisconsin and Wyoming.  Education Week’s federal education reporter, Alyson Klein, describes the eleven plans that were approved this week.

Ohio’s was one of the plans approved, and Patrick O’Donnell at the Plain Dealer perfectly captures the irony of the now pretty meaningless process in Ohio’s ESSA Plan Wins Federal Approval—and Few Care: “Though many observers nationally and here in Ohio had hoped states would present grand new visions for schools through the new plans mandated by 2015’s Every Student Succeeds Act (ESSA), that hasn’t happened… State Superintendent Paolo DeMaria’s plan made few changes to the state’s testing and report card system, promising little more than making sure the state follows federal law. A new vision and approach?  That’s all being handled separately, just not in the plan. Critics wanted the plan to make big cuts in state tests. It doesn’t but DeMaria and the state school board later asked the legislature for those cuts.  Others wanted the plan to reduce the use of tests in teacher evaluations.  DeMaria and a panel of educators are seeking those changes apart from the submitted plan. And some wanted the state to show a vision for schools that was less reliant on test scores in academic subjects. School board members and several panels of educators have been meeting the last few months to build new goals that are far more focused on the ‘whole child’ than before.”

There is even some talk in Columbus about the problems of the state’s “A”-“F” letter grades to rate and rank schools and school districts, even though Ohio’s school report cards with letter grades are a feature of the ESSA plan that Ohio submitted and that was approved this week.

The 2015 Every Student Succeeds Act is merely a made-over version of No Child Left Behind—made over because Congress wasn’t really ready to accept that the law’s overall strategy of high-stakes testing and a succession of punishments has accomplished neither of NCLB’s overall goals: helping the children who have been left behind and closing achievement gaps.

But consensus about No Child Left Behind’s overall failure and the failure of its punitive strategy keeps on growing.  Harvard University’s Daniel Koretz put several more nails in its coffin in his excellent new book The Testing Charade: Pretending to Make Schools Better. Please read this book. In it Koretz shows exactly why the scheme of testing all students and punishing the teachers and the schools where scores do not rise quickly cannot work—why the scheme is merely a charade:  “One aspect of the great inequity of the American educational system is that disadvantaged kids tend to be clustered in the same schools. The causes are complex, but the result is simple: some schools have far lower average scores—and, particularly important in this system, more kids who aren’t ‘proficient’—than others. Therefore, if one requires that all students must hit the proficient target by a certain date, these low-scoring schools will face far more demanding targets for gains than other schools do. This was not an accidental byproduct of the notion that ‘all children can learn to a high level.’ It was a deliberate and prominent part of many of the test-based accountability reforms… Unfortunately… it seems that no one asked for evidence that these ambitious targets for gains were realistic. The specific targets were often an automatic consequence of where the Proficient standard was placed and the length of time schools were given to bring all students to that standard, which are both arbitrary.” (pp. 129-130) “The result was, in many cases, unrealistic expectations that teachers simply couldn’t meet by any legitimate means.” (p. 134)

If our society were intent on helping the children who have been left behind, we would invest in ameliorating poverty and in supporting the hardworking teachers in the schools in our poorest communities. Things like reauthorizing the Children’s Health Insurance Program would help!  The ESSA plans being submitted to the Department of Education aren’t having much impact at all.  The old, made-over NCLB jacket is slowly slipping to the back of the closet.

EXTRA: A Teacher’s Summary of What Proficiency Might Mean

Here, in The Insufficiency of Proficiency, Oklahoma teacher Rob Miller speculates on what proficiency might mean. His essay specifically explores any of a number of kinds of complex (or simple) understanding that we might be aiming for—but really cannot measure—when we test “reading proficiency.”

His post is far deeper, however—about the meaning for all of us of more than 15 years of nationally mandated standardized testing. This is a fascinating essay about making educational policy based on a reductive theory of human learning.

Miller begins with a seasonal theme:

Twas the week before Christmas, when all thro’ the state
All the children were stirring, eager to learn their fate;
Their test scores from April would soon be delivered,
I hope I’m proficient the children all quivered;
The wait’s been soooo long…my hands are all sweaty
I need to know now … am I college and career ready?

His piece is also a seasonal reflection for the new year. How many more years will it take us to recognize the limitations of test scores for measuring what we really want children to know?

Sorting Out the Debate About Educational Accountability

The watchword for the last quarter century’s school reform has been accountability: holding schools and school teachers accountable for quickly raising students’ scores on standardized tests. Sanctioning schools and teachers who can’t quickly raise scores was supposed to be an effective strategy for overcoming educational injustice. Test-and-punish has enabled us at least to say we’ve been doing something to hold schools accountable.

The politics of this conversation are pretty confusing—all going back to the federal education law, the 2001 No Child Left Behind Act (NCLB), and the debate about its replacement, the 2015 Every Student Succeeds Act (ESSA).  There was bipartisan agreement in 2001-2002 when NCLB was debated, passed, and signed into law that our society could close racial and economic achievement gaps by testing all students and then demanding that schools quickly raise the scores of underachieving students. In 2015 when Congress debated the law’s reauthorization, accountability-hawk Democrats stood by test-and-punish accountability; many Republicans, led by Senator Lamar Alexander, instead pushed to expand states’ rights by lifting the heavy hand of the federal government and allowing states to design their own plans to improve so-called failing schools. Worrying that removal of universal testing would let schools off the hook, the Civil Rights Community has stood by NCLB’s testing plan. Many have continued to assume that universal testing exposes achievement gaps and that the exposure will motivate politicians and educators to address racial and economic disparities.

Test-and-punish school reform has been at the center of a conversation between Republican Senator Lamar Alexander, the chair of the Senate Health, Education, Labor and Pensions Committee, and Republican Education Secretary Betsy DeVos.  An article by Caitlin Emma published over the weekend by POLITICO examines the history of No Child Left Behind vs. the Every Student Succeeds Act as a background for looking at how policy around school accountability has been evolving in the Trump administration. Emma describes the new ESSA, passed by a Republican Congress in 2015 and designed to return at least some authority for accountability to the states. But Democrats, prodded by Civil Rights leaders, and some Republicans have stood by federally imposed accountability: “Critics… worry whether states will adequately track and provide equal opportunities for at-risk kids…. (Even) former Republican Rep. John Kline… an architect of the measure, has said he’s worried states are now getting away with testing plans that violate a key requirement of the law—that states administer the same test to all students annually.  The provision is critical (Kline believes) so that states are forced to report the performance of all students and the results for poor and minority students are not hidden from view, as they were for decades before federal testing requirements were enacted.”

Emma explains: “The Every Student Succeeds Act, which passed in 2015, was widely viewed by Republicans as a corrective to the federal overreach that followed… No Child Left Behind.”  Emma reports that last summer, when Jason Botel, an official in Betsy DeVos’s Department of Education, began reviewing the states’ applications for federal funds under the ESSA, Botel demanded that before he would approve some states’ plans, they must toughen their standards and demand more.  Powerful Republican Senator Lamar Alexander, who had—during the 2015 reauthorization—supported a return of control to the states, formally complained to Betsy DeVos—“furious that a top DeVos aide was circumventing a new law aimed at reducing the federal government’s role in K-12 education. He contended that the agency was out of bounds by challenging state officials, for instance, about whether they were setting sufficiently ambitious goals for their students.”

For many of us who have, for fifteen years, closely followed educational accountability as mandated under No Child Left Behind and the Every Student Succeeds Act, the entire debate seems wrong-headed and bizarre.  I am writing about those of us who care deeply about expanding opportunity for children segregated in schools where poverty is highly concentrated—schools where intense segregation by poverty is overlaid on segregation by ethnicity and race. The schools these children attend have, under federal policy, been derided by accountability hawks as “failing” schools.  Widespread blaming—of schools and school teachers—now dominates discussions of school reform even as sociologists increasingly document that family and neighborhood poverty pose overwhelming challenges for these children and their schools.

Much of the confusion and rancor arises because the public debate about school accountability conflates two very different questions:

  • Should the federal government be involved at all in telling states what to do about education?
  • Is test-and-punish accountability an effective strategy for improving public schools and closing opportunity gaps?

The original federal education law, the 1965 Elementary and Secondary Education Act, addressed the first question as a response to the needs of children in primarily southern states, where schools serving black children had been underfunded and inadequate for generations. There are similar problems of inequity today across cities and forgotten rural areas. Poor children and children of color segregated in particular areas remain underserved. The debate about this first question involves states’ rights vs. what has come to be accepted (by many of us) as the federal government’s responsibility to protect the rights of all children and ensure they are all well served. It is a heated question that remains underneath much of the debate about school reform.

The second question involves the strategy Congress chose for reforming schools in the 2001 No Child Left Behind Act. Congress blamed teachers and schools and devised a law that was supposed to force schools and teachers to work harder and faster to improve test scores in schools where achievement lagged when all children in each state were tested on a single standardized test.  It is becoming clearer all the time that when Congress jumped behind test-and-punish accountability, it chose the wrong strategy.  A long and growing body of research demonstrates that test scores are far more aligned with a school’s aggregate economic level than with the work of the teachers or the curriculum being offered to students. Economists like Bruce Baker at Rutgers University also document enormous opportunity gaps as these same public schools in our nation’s poorest communities receive far less public investment than the schools in wealthy suburbs, schools serving children whose families also invest heavily in enrichments at home.

Here is just some of the prominent research from the past ten years that tries to answer the second question.

In 2010, Anthony Bryk and educational sociologists from the Consortium on Chicago School Research at the University of Chicago described the challenges for a particular subset of schools in Chicago, Illinois that exist in a city where many schools serve low income children. The Consortium focused on 46 schools whose students live in neighborhoods where poverty is extremely concentrated.  These “truly disadvantaged” schools are far poorer than the norm. They serve families and neighborhoods where the median family income is $9,480. They are racially segregated, each serving 99 percent African American children, and they serve on average 96 percent poor children, with virtually no middle class children present. The researchers report that in the truly disadvantaged schools, 25 percent of the children have been substantiated by the Department of Children and Family Services as being abused or neglected, either currently or during some earlier point in their elementary career. “This means that in a typical classroom of 30… a teacher might be expected to engage 7 or 8 such students every year.”  “(T)he job of school improvement appears especially demanding in truly disadvantaged urban communities where collective efficacy and church participation may be relatively low, residents have few social contacts outside their neighborhood, and crime rates are high.  It can be equally demanding in schools with relatively high proportions of students living under exceptional circumstances, where the collective human need can easily overwhelm even the strongest of spirits and the best of intentions. Under these extreme conditions, sustaining the necessary efforts to push a school forward on a positive trajectory of change may prove daunting indeed.” (Organizing Schools for Improvement, pp. 172-187)

Then in 2011, Sean Reardon of Stanford University released a massive data analysis confirming the connection of school achievement gaps to growing economic inequality and residential patterns becoming rapidly more segregated by income. Reardon documented that across America’s metropolitan areas the proportion of families living in either very poor or very affluent neighborhoods increased from 15 percent in 1970 to 33 percent by 2009, and the proportion of families living in middle income neighborhoods declined from 65 percent in 1970 to 42 percent in 2009.  Reardon also demonstrated that along with growing residential inequality is a simultaneous jump in an income-inequality school achievement gap among children and adolescents.  The achievement gap between students with income in the top ten percent and students with income in the bottom ten percent is 30-40 percent wider among children born in 2001 than those born in 1975.

In The Testing Charade, a book published just last month, Daniel Koretz of Harvard University blames test-and-punish accountability for enabling our society to pretend that we have been overcoming educational inequity at the same time we avoid making the public investment necessary even to begin addressing the problem: “One aspect of the great inequity of the American educational system is that disadvantaged kids tend to be clustered in the same schools. The causes are complex, but the result is simple: some schools have far lower average scores…. Therefore, if one requires that all students must hit the proficient target by a certain date, these low-scoring schools will face far more demanding targets for gains than other schools do. This was not an accidental byproduct of the notion that ‘all children can learn to a high level.’ It was a deliberate and prominent part of many of the test-based accountability reforms…. Unfortunately… it seems that no one asked for evidence that these ambitious targets for gains were realistic. The specific targets were often an automatic consequence of where the Proficient standard was placed and the length of time schools were given to bring all students to that standard, which are both arbitrary.” (pp. 129-130)  “If we are going to make real headway, we are going to have to confront the simple fact that many teachers will need substantial supports if they are going to markedly improve the performance of their students… And the range of services needed is broad. One can’t expect students’ performance in schools to be unaffected by inadequate nutrition, insufficient health care, home environments that have prepared them poorly for school, or violence on the way to school.” (p. 201)

The second question involves the overall direction of education policy, and it is important because we desperately need a better strategy. Blaming and punishing the schools with the lowest scores—by closing “failing” schools or privatizing them or firing their teachers and principals—has only further undermined the public schools in the poorest neighborhoods of our big cities without addressing the opportunity gaps the tests identify.

Today’s Republican tax-slashing agenda will only further reduce public investment in education.  And we are likely to keep on blaming the victims.

Federally Mandated Test-and-Punish Didn’t Go Away with NCLB

As you very likely remember, No Child Left Behind, the much-hated 2002 version of the federal education law—the one Jonathan Kozol once called “the federal testing law”—was reauthorized last December. Now instead we have the Every Student Succeeds Act (ESSA).  There is widespread agreement that nearly fifteen years of test-based accountability have failed to raise overall student achievement; flat and declining scores on the National Assessment of Educational Progress confirm that failure. Neither has the annual testing and disaggregation of scores resulted in the diminishing of achievement gaps. But the federal government doesn’t shift direction so easily.  Here is a quick update on what is happening as the rules that will implement the new law are being developed.

There is one bright spot: In the new law, Congress eliminated any federal mandate to tie teacher evaluation to students’ standardized test scores.  The U.S. Department of Education had made it a requirement that states applying for federal waivers from the worst punishments of NCLB could qualify for waivers only if they agreed to pass state laws to tie teacher evaluation to what have been called Value Added Measures—VAM algorithms that try to calculate the amount of learning each teacher “adds” to the overall education of each student.  The American Statistical Association, the American Educational Research Association and a number of academic researchers have demonstrated that VAM scores not only fail to measure many qualities of excellent teachers, but also are inaccurate and unstable from year to year.  It is possible that Congress listened to the experts—more likely that it listened on this one issue at least to the National Education Association and the American Federation of Teachers and many others who pointed to obviously flawed low VAM ratings for many award-winning teachers and to the collapse of morale among teachers across the United States.

While Congress eliminated the federal push to evaluate teachers by students’ scores, it could not undo the teacher-evaluation laws passed in recent years across the states to qualify for federal waivers. Hawaii, at least, has now begun to undo the damage, according to a mid-May report from the Hawaii Tribune-Herald: “Educators in Hawaii just became a little more powerful.  The State Board of Education unanimously approved recommendations Tuesday effectively removing standardized test scores as a requirement in the measurement of teacher performance…. The recommendations… will offer more flexibility to incorporate and weigh different components of teacher performance evaluation, although the option to use test scores in performance evaluations remains.”

Apart from teacher evaluation, however, not much about test-and-punish has really changed. Last week, the U.S. Department of Education released proposed rules for the implementation of ESSA and there has been considerable argument from Republican leaders in Congress who want to turn more authority over to states, while the Obama administration wants to keep the federal government strongly involved.

Here is the explanation of Emma Brown of the Washington Post: “The law requires states to continue administering standardized math and reading tests to students in Grades 3 through 8 and once in high school.  But it also gave states a new opportunity to include other non-test measures, such as access to advanced coursework and rates of chronic absenteeism, in judging schools.  Under the regulations released Thursday, states would be required to wrap all of those various indicators into one simple rating, such as a letter grade, to provide parents with clear, easy-to-understand information about school performance… The previous education law, No Child Left Behind, prescribed sanctions for schools that failed to meet test score targets.  The Every Student Succeeds Act takes a different approach, allowing states to decide how to intervene in struggling schools as long as those interventions are ‘evidence based.’”

One thing is clear from press reports: the conversation remains centered pretty much in the weeds of the details of outcomes-based accountability—measuring schools’ success in meeting demands for higher test scores.  Here is how Valerie Strauss of the Washington Post describes the proposed rules to implement ESSA: “The proposed regulations, among other things, would require states to ensure that school districts are implementing ‘accountability’ systems based on multiple measures.  The states have a lot of discretion on how those systems should be constructed but not total, with the federal government requiring that states ‘assign a comprehensive, summative rating for each school to provide a clear picture of its overall standing’….”

As the new law was debated in Congress last fall, the National Education Association lobbied hard for at least the inclusion of “input” measures as part of school evaluation to make it possible to consider each school’s real capacity to meet the demand for higher scores. This would have shifted the measure of accountability toward the consideration of a school’s resources.  The goal was to find a way to let districts expose inequity in things like class size, number of counselors and support staff, and financial resources available per-child from district to district.  States can still include such measures as part of their multiple-measure-accountability ratings, but it is unlikely to happen unless Congress pushes harder.  After all, that would shift the blame—and test-based accountability is a blame game—to the states that refuse to distribute funding equitably and persist in shorting the school districts that serve the poorest children.  And while states are now federally required to intervene in low-scoring schools, there is no evidence that the focus will shift from punitive interventions like closing or charterizing schools and firing educators, and no evidence that states will feel pressed to invest in the poorest schools.

One thing is clear.  In its proposed rules, the Obama Department of Education strongly discourages opt-outs by parents protesting the testing regime.  Strauss explains: “With a testing ‘opt out’ movement that has been growing in recent years, the department spells out a series of punitive options states should take in an attempt to get schools to ensure a 95 percent student participation rate on federally required state-selected standardized tests.” It remains unclear what the consequences would be for higher rates of opting out.  Strauss continues: “Under NCLB and now under ESSA, at least 95 percent of eligible students are required to take the state-chosen standardized test used to hold states and school districts ‘accountable.’  Last year, some states did drop below 95 percent, and in recent months the Education Department has been sending letters to states with ‘suggestions’ of how to handle schools that can’t drum up 95 percent support.  It also said federal funds could be withheld from states that did not deal effectively with opt outs.”  Although it is clear that the Department of Education discourages opting out, what the federal government will do about it remains unknown.

Public comment will be accepted on the draft rules until August 1, 2016.

Congress Is Likely to Reauthorize Education Law. How Will We Undo Arne Duncan’s Damage?

Seven years ago today—on November 30, 2008—I picked up my Sunday Cleveland Plain Dealer to see a story above the fold on the front page, a story whose headline screamed: Good Teachers Are Key to Student Achievement, but Bad Ones Are Hard to Fire.  The story itself purported to be a news analysis, part of a series, “a Plain Dealer project reporting on the state of teaching.”  But then there was the photo, of a truck parked in front of the National Education Association’s building in Washington, D.C.  It was one of those trucks that pulls nothing but a sign, and this one—with a picture of a wormy apple—said: “Vote for the Worst Unionized Teachers Who Can’t Be Fired.”  Whatever the content of the article, the message that Sunday morning came from the sign the truck was pulling along—“worst unionized teachers who can’t be fired.”

Then a few days later came David Brooks’ NY Times column about newly elected President Barack Obama’s pending decision about a Secretary of Education.  The new president had appointed Linda Darling-Hammond, a Stanford University professor of education, to head his education transition team, but there was enormous pressure from New York’s mayor Michael Bloomberg for Obama to choose Joel Klein, who was at that time serving as Bloomberg’s appointed chancellor of the NYC public schools.

On December 5, 2008, Brooks, a school “reformer” through and through, framed what had already become a polarized battle—“reformers” vs. teachers’ unions: “On the one hand, there are the reformers like Joel Klein and Michelle Rhee, who support merit pay for good teachers, charter schools and tough accountability standards.  On the other hand, there are the teachers’ unions and the members of the Ed School establishment, who emphasize greater funding, smaller class sizes and superficial reforms.  During the presidential race, Barack Obama straddled the two camps.  One campaign adviser, John Schnur, represented the reform view in the internal discussions.  Another, Linda Darling-Hammond, was more likely to represent the establishment view… Each camp was secretly convinced that at the end of the day, Obama would come down on their side… Obama never had to pick a side.  That is, until now.  There is only one education secretary, and if you hang around these circles, the air is thick with speculation…   (O)ne morning a few weeks ago, I got a flurry of phone calls from reform leaders nervous that Obama was about to side against them…  (T)he union lobbying efforts are relentless and in the past week prospects for a reforming education secretary are thought to have dimmed… The candidates before Obama apparently include: Joel Klein, the highly successful New York chancellor who has, nonetheless, been blackballed by the unions; Arne Duncan, the reforming Chicago head who is less controversial; Darling-Hammond herself; and some former governor to be named later, with Darling-Hammond as the deputy secretary.  In some sense the final option would be the biggest setback for reform.  Education is one of those areas where implementation and the details are more important than grand pronouncements.  If the deputies and assistants in the secretary’s office are not true reformers, nothing will get done.  The stakes are huge.  For the first time in decades, there is real momentum for reform.”

The wave of articles that surfaced that week was noticeable, and in my office in the United Church of Christ’s justice ministries, I felt compelled to trade turns with someone else in the rota of staff who wrote the little Witness for Justice columns each week.  On December 15 that year, I described my fear: “(A)s I write, there is an attack on public school teachers by advocates who seek a Secretary who would base pay on test scores, deny tenure, intensify the test-and-punish mechanisms of No Child Left Behind, and rely far more on charter schools.  These critics deride public school improvement as mere ‘weak, status-quo’ reform.”

Fast-forward seven years, and here we are at the end of Secretary of Education Arne Duncan’s tenure. Duncan’s policies have been so widely disliked in their implementation that there seems to be bipartisan Congressional consensus, unheard of these days, to undo the damage everybody has come to believe happened due to Bush’s No Child Left Behind and Obama and Duncan’s Race to the Top, School Improvement Grants and No Child Left Behind waivers.  Duncan has resigned as of the end of 2015, and will be replaced by John King, an acting secretary as a placeholder for the last year of President Obama’s term.

And if Congress acts this week finally (after several previous tries) to reauthorize the federal education law called the Elementary and Secondary Education Act, we may find ourselves without the version we’ve been living with now for 14 years—No Child Left Behind.

While some of the “punishments” promoted by Duncan for so-called failing schools were originally outlined in 2002 in the original test-and-punish No Child Left Behind Act, Duncan and his Department were the ones who worked out how the “turnaround” plans that fired teachers and principals and closed or charterized schools would be imposed on our nation’s poorest schools.  While the problems in the economy were evident by December of 2008, nobody could have imagined the competitive grant programs that Duncan’s Department of Education created as part of the 2009 federal stimulus package.  These were the programs by which states applied for federal grants and eventually waivers from No Child Left Behind’s “Adequate Yearly Progress” system that had begun to attach the label of “failing” to far too many public schools in every state.  Duncan’s Department developed hoops states had to jump through even to apply for these federal grants: remove any state statutory caps on the number of charter schools that can be launched in any one year; intensify Value Added, econometric evaluations of school teachers based on their students’ test scores; and embrace college-and-career-ready standards for all students.  Because the Department of Education cannot by federal law prescribe curricula, the Department merely incentivized states to adopt “more rigorous” standards, which in practical terms meant joining one of the two Common Core testing consortia, PARCC or Smarter Balanced.

David Brooks’ words from December 2008 were prophetic: “Education is one of those areas where implementation and the details are more important than grand pronouncements.  If the deputies and assistants in the secretary’s office are not true reformers, nothing will get done.”  As Secretary of Education, Arne Duncan was never one for grand pronouncements, but his Department’s actions have utterly transformed education policy across the fifty states.  State legislatures changed state laws to try for Race to the Top grants and to secure their waivers.  States removed caps on the launch of new charters, and then vastly expanded the number of charters with help from billions of dollars in federal Charter School Program grants. But nobody in the federal government imposed any oversight and only in the most careful states has there been regulation to protect children and taxpayers from unscrupulous profiteers.  Schools have been closed as a “turnaround” policy in many cities as children have been forced to relocate via public transportation in many cases, with some crossing dangerous gang boundaries. The American Statistical Association and now the American Educational Research Association have condemned the use of Value Added Measure algorithms for evaluating teachers because the formulas are unstable and fail to measure many of the qualities of a good teacher.

There is wide agreement that the bipartisan Congressional consensus that seems to have been reached on a plan to reauthorize No Child Left Behind is primarily a repudiation of Arne Duncan’s tenure and policies.

In a profound article, School Reform Fails the Test: How Can Our Schools Get Better When We’ve Made Our Teachers the Problem and Not the Solution?, Mike Rose, the writer and UCLA professor of education, wonders: “What if reform had begun with the assumption that at least some of the answers for improvement were in the public schools themselves, that significant unrealized capacity exists in the teaching force, that even poorly performing schools employ teachers who work to the point of exhaustion to benefit their students?  Imagine, then, what could happen if the astronomical amount of money and human resources that went into the past decade’s vast machinery of high stakes testing… had gone into a high-quality, widely distributed program of professional development.  I don’t mean the quick-hit, half-day events that teachers endure, but serious, extended engagement of the kind offered by the National Science Foundation and the National Writing Project…. Imagine as well that school reform acknowledged poverty as a formidable barrier to academic success.  All low-income schools would be staffed with a nurse and a social worker and have a direct link to local health and service agencies… Extra tutoring would be provided… Schools would be funded to stay open late, providing academic and recreational activities for their students.”

Assuming that in the next week or so both houses of Congress affirm the agreement, passed the week before Thanksgiving by a Senate/House conference committee, to reauthorize the federal education law, the question will be: Where do we go from here?  Mike Rose’s vision describes where many of us would like education policy to go.  But Arne Duncan has ensured that Congress cannot just undo the explosive growth of charters or quickly take back the unworkable schemes for evaluating teachers that state legislatures have passed to qualify for federal No Child Left Behind waivers (even though the waivers themselves will be rendered unnecessary once No Child Left Behind is gone), or help states improve their curricula and avoid the ugly politics around the Common Core for which they have already signed up and invested millions of dollars. These policies have now been enacted into the laws of the fifty states. Amending or eliminating these policies will have to be accomplished one state legislature at a time and will require concerted state-by-state advocacy.

Cliches come to mind. The cats are out of the bag. Pandora’s box has been opened. It’s hard to put the toothpaste back in the tube.

NAEP Scores Stagnate; Test-and-Punish Flops; But Duncan’s New Plan Fails to Change Course

The biennial NAEP scores were released yesterday.  Diane Ravitch knows a lot about the National Assessment of Educational Progress, the NAEP.  Appointed by President Bill Clinton, she served on the National Assessment Governing Board for seven years. She describes what this test is: “NAEP is an audit test. It is given every other year to samples of students in every state and in about 20 urban districts. No one can prepare for it, and no one gets a grade. NAEP measures the rise or fall of average scores for states in fourth grade and eighth grade in reading and math and reports them by race, gender, disability status, English language ability, economic status, and a variety of other measures. The 2015 NAEP scores showed no gains nationally in either grade and in either subject… The best single word to describe NAEP 2015 is stagnation.”

Ravitch describes what she believes is the meaning of this year’s scores, and I agree with her: “For nearly 15 years, Presidents Bush and Obama and the Congress have bet billions of dollars—both federal and state—on a strategy of testing, accountability, and choice.  They believed that if every student was tested in reading and mathematics every year from grades 3 to 8, test scores would go up and up. In those schools where test scores did not go up, the principals and teachers would be fired and replaced. Where scores didn’t go up for five years in a row, the schools would be closed. Thousands of educators were fired, and thousands of public schools were closed, based on the theory that sticks and carrots, rewards and punishments, would improve education.”  But it hasn’t worked.

Carol Burris, the retired NY high school principal and now executive director of the Network for Public Education, interprets the 2015 NAEP scores: “NAEP is a truth teller. There is no NAEP test prep industry, or high-stakes consequence that promotes teaching to the test. NAEP is what it was intended to be—a national report card by which we can gauge our national progress in educating our youth.  During the 1970s and ’80s, at the height of school desegregation efforts, the gap in scores between our nation’s white and black students dramatically narrowed. You could see the effects of good, national policy reflected in NAEP gains. The gaps have remained, however, and this year, the ever so slight narrowing of gaps between white and black students is due to drops in the scores of white students—hardly a civil rights victory.”

Last weekend, U.S. Secretary of Education Arne Duncan announced what some people have seen as a significant pivot in education policy—a turning away from reliance on so much testing and a limit of 2 percent on the amount of time students are spending taking standardized tests at school. Here is what the U.S. Department of Education’s press release says about the anticipated change: “In too many schools, there is unnecessary testing and not enough clarity of purpose applied to the task of assessing students, consuming too much instructional time and creating undue stress for educators and students. The Administration bears some of the responsibility for this and we are committed to being part of the solution.”

The Department’s new Testing Action Plan describes seven principles that will govern a new testing policy whose formal guidance document will be released in 2016: Tests should be “worth taking,” “high quality,” “time-limited,” “fair—and supportive of fairness,” “fully transparent to students and parents,” “just one of multiple measures,” and “tied to improved learning.”  Duncan proposes to make funds available to help states and school districts “develop less-burdensome assessments,” to review their tests and make them more innovative, to pay for experts to guide states on how to reduce time on testing, and to provide technical assistance and even technical assistance centers and labs to “provide targeted assessment audit support.”

Duncan also indicates that the Department of Education will be more flexible in its demands relating to testing and its uses: “The Administration will invite states that wish to request waivers of federal rules that stand in the way of innovative approaches to testing to work with the Department to promote high-quality, comparable, statewide measures.”  Duncan continues: “The Department will work with external assessment experts to implement a more transparent assessment peer review process of state assessments… To avoid double-testing of students, the Department will offer states flexibility from No Child Left Behind’s requirement that all 8th graders be tested on the same, statewide 8th grade math and reading tests, when such students are taking advanced high-school level coursework in 8th grade.”  He adds: “The Administration has adjusted its policies to provide greater flexibility to states in determining how much weight to ascribe to statewide standardized test results in educator evaluation systems required under the Administration’s ESEA flexibility policy.”

The Department of Education’s proposal to adjust its testing policies follows a major report released last week by the Council of the Great City Schools that deplores the amount of standardized testing being required by the federal government and states.  The report explains that, “401 unique tests were administered across subjects in the 66 Great City School systems.  Students… were required to take an average of 112.3 tests between pre-K and grade 12… The average student in these districts will typically take about eight standardized tests per year… Some of these tests are administered to fulfill federal requirements under No Child Left Behind, NCLB waivers, or Race to the Top, while many others originate at the state and local levels… Testing pursuant to NCLB in grade three through eight and once in high school in reading and mathematics is universal across all cities.  Science testing is also universal according to the grade bands specified in NCLB.  Testing in grades PK-2 is less prevalent than in other grades, but survey results indicate that testing in these grades is common as well… Urban school districts have more tests designed for diagnostic purposes than any other use, while having the fewest tests in place for purposes of international comparisons… Some 39 percent of districts reported having to wait between two and four months before final state test results were available at the school level, thereby minimizing their utility for instructional purposes… There is some redundancy in the exams districts give… The findings suggest that some tests are not well aligned to each other, are not specifically aligned with college-or career-ready standards, and often do not assess student mastery of any specific content.”

I don’t believe the Department’s change of plans will significantly address the challenges described so thoroughly by the Council of the Great City Schools.  America’s educational philosophy, formalized in 2002 in the federal No Child Left Behind Act (NCLB) and perpetuated for more than a dozen years now in federal policies like Race to the Top and the NCLB Waivers, is a two-pronged strategy: first test and then punish the districts and schools that cannot quickly raise scores on the tests.  It is the PUNISH strand of this policy that has caused the explosion of testing.  NCLB’s mandate of an annual test was the mere beginning.  Fear is the motivator in a system driven by sanctions and punishments.  In the school districts where scores are lowest, school officials, driven by fear, have added practice tests and benchmark tests and more practice tests and hours of test-prep—anything that might raise test scores. If students are tested again and again, says the logic of a test-and-punish plan, maybe students will get better at taking tests and their scores will rise.

Here are the problems I see in the new testing ideas announced last weekend by the Department of Education.

  • In the first place, as Diane Ravitch points out so clearly, limiting testing to 2 percent of the school year is not really much of a change: “Actually that wasn’t a true reduction, because 2% translates into between 18-24 hours of testing, which is a staggering amount of annual testing for children in grades 3-8 and not different from the status quo in most states.”
  • Secretary Duncan’s new plan does not displace testing as the centerpiece of the current proposals in Congress for the reauthorization of the Elementary and Secondary Education Act (the law we currently call NCLB). Duncan proposes neither to move from annual testing to grade-span testing (once in elementary, middle and high school) nor to eliminate the high stakes for school districts and schools (and their teachers) unable quickly to raise students’ scores.
  • High stakes will continue to frighten school district officials into narrowly focusing on the tested subjects of reading and math and to fill too much time with test prep lessons.
  • Somehow, says Duncan, there is to be more flexibility about basing teachers’ evaluations on students’ scores, particularly for teachers in subjects that are not tested. This presumably means we won’t be adding more tests in previously non-tested subjects just to be able to judge teachers by the tests.
  • It seems to me that—quite typically for this Administration—there will be more money made available for consultants to evaluate and re-calibrate the testing system.

Duncan’s concept is to retain a test-and-punish system.  Writing for Education Week’s federal policy blog, Andrew Ujifusa quotes Secretary Duncan’s own description of the new plans at a press conference: “The goal is to have good assessment that drives instruction, and if you reduce testing to 1 percent, and it isn’t relevant… it is not guiding instruction, that is a loss, that’s a failure, not a win.” Ujifusa paraphrases Duncan’s next comment: “However, if students spend slightly more than 2 percent of instructional time on testing and the assessments are helping teachers, parents understand them, and students are part of the solution, that’s a good outcome.”  This sounds to me as though we are going to continue with a test-based system.

Considering the stagnant NAEP test scores released yesterday, Kevin Welner, Director of the National Education Policy Center, comments on Duncan’s recently announced reassessment of the role of standardized testing: “It’s long past time to recognize that any benefits of test-based accountability policies are at best very small, and any meager benefits teased out are more than counterbalanced by negative unintended consequences.”  He continues: “This (the focus on testing) is the tragedy. It has distracted policymakers’ attention away from the extensive research showing that, in a very meaningful way, achievement is caused by opportunities to learn. It has diverted them from the truth that the achievement gap is caused by the opportunity gap. Those advocating for today’s policies have pushed policymakers to disregard the reality that the opportunity gap arises more from out-of-school factors than inside-of-school factors… So schools with low test scores were labeled ‘failing’ and were shut down or reconstituted or turned over to private operators of charter schools… Teachers whose students’ test scores didn’t meet targets were publicly shamed or denied pay or even dismissed.  Our entire public schooling structure became intensively focused on increasing test scores. But once we admit that those test scores are driven overwhelmingly by students’ poverty- and racism-related experiences outside of school, then ‘failing’ schools are little more than schools enrolling the children in the communities that we as a society have failed.”