Henry VIII and The Two Noble Kinsmen
from First Principles

A comprehensive authorship attribution study for Shakespeare scholars
March 2026 · Codex research workspace · Database: 527 early modern plays · Target: Henry VIII (First Folio, play ID 502)

This report presents the full findings of a first-principles computational authorship study of Henry VIII. It expands substantially on the accompanying PDF, explaining every methodological decision in plain prose, comparing results against recent scholarship, and providing the complete chunk-level data. The central finding: the broad traditional Fletcher scene map is too coarse. The play has a strong Shakespearean backbone with a smaller number of locally persistent Fletcherward pockets — narrower and more fragile than scene-level tradition implies.

1. The Problem

Henry VIII — or All Is True, as the play was known in its own time — is one of the longest-running authorship debates in Shakespeare studies. The play was printed in the First Folio of 1623 without attribution to any collaborator. It was accepted as entirely Shakespeare's until 1850, when James Spedding argued from internal verse evidence that substantial portions must be by a different hand. That hand, he proposed, was John Fletcher.

Spedding's argument has proven remarkably durable. It was reinforced by Cyrus Hoy's linguistic-marker work in the early 1960s, extended by Jonathan Hope's sociolinguistic analysis in the 1990s, and forcefully advocated by Brian Vickers in 2002. More recent computational work — especially Petr Plecháč's rolling attribution study published in Digital Scholarship in the Humanities in 2021 — broadly supports collaboration while introducing some refinements to the scene-level map.

Yet the question has also never been fully settled. A determined minority of scholars has maintained Shakespeare's sole authorship. Editors have generally been cautious about exact internal boundaries. And the methods used to argue for collaboration have shifted substantially over time: from verse metrics, to linguistic markers, to function-word profiles, to rolling computational attribution.

This project does not try to settle the full history of the question in one stroke. It does something more focused: it begins from the text itself rather than from a previously published map, it validates every method before interpreting results, and it asks what survives when suspicious scenes are divided into their local dramatic sub-units.

Why "from first principles"? The key methodological commitment of this project is that no previously published Henry VIII scene map is allowed to function as the answer key. Scholarship is used as a comparison layer and benchmark — not as a training label. This means that the traditional Spedding/Hoy divisions and the more recent Plecháč rolling map are treated as hypotheses to test, not ground truth to recover.

Why this is a hard problem

Four features of Henry VIII make the authorship question genuinely difficult, independent of any particular method:

  1. No external documentary evidence. There is no contemporary document directly naming Fletcher as co-author. The collaboration hypothesis rests entirely on internal evidence — style, vocabulary, verse patterns, and linguistic habit.
  2. The play contains multiple dramatic registers. Ceremonial pageantry, legal accusation, comic crowd scenes, intimate prophecy, and political counsel all appear in the same play. Different dramatic registers can themselves generate different stylometric profiles, independent of authorship.
  3. The First Folio text may have been mediated. Compositorial practice, scribal transcription, and editorial normalization can affect the very features — spelling, contraction, function-word choice — that authorship methods depend on.
  4. Methods disagree on units of attribution. Some scholars work at the scene level. Others work at the rolling-window level. Others work scene-by-scene. The scale at which you measure can determine what you find.

2. The History of Scholarship

Understanding the current project requires understanding what the existing scholarship has and has not established. The following is an expanded review of the major positions, with particular attention to their methodologies and their limitations.

James Spedding
1850 · "Who Wrote Shakespeare's Henry VIII?"

The historical starting point. Spedding argued from verse endings — particularly feminine endings — that large portions of the play bore a different stylistic character from Shakespeare's late verse. His proposed Fletcher territory included scenes 1.3, 1.4, 2.1, 2.2, 3.1, 4.1, 4.2, 5.3, and 5.4. This became the "canonical" map that most subsequent scholars have either defended or modified. Importantly, Spedding's method was already lineation-sensitive: he noticed that verse structure (not just vocabulary) differed. The current project's verse-and-rhyme analysis is a direct methodological descendant of this instinct.

Cyrus Hoy
1962 · Studies in Bibliography

Hoy's contribution was to shift the evidence from verse metrics to linguistic markers. He documented Fletcher's distinctive preferences — ye for you, 'em for them, has for hath, does for doth — across a large corpus of Fletcher's secure work. He found that these markers clustered in the Spedding-attributed scenes of Henry VIII, broadly confirming the scene map. The current project tests these markers carefully: several validate strongly on the secure reference corpus, but some (ye especially) are complicated by transmission and dramatic mode.

Jonathan Hope
1994 · The Authorship of Shakespeare's Plays

Hope introduced a sociolinguistic approach, focusing on second-person pronoun usage (the T/V distinction: thou/thee vs. you/your) as a marker of social register and authorial habit. His results supported collaboration in a general sense, though with important caveats about the relationship between sociolinguistic habit and dramatic mode. The current project tests the T/V layer independently; it validates at a moderate level (0.727 overall accuracy) but is not strong enough to adjudicate the full map alone.

Brian Vickers
2002 · Shakespeare, Co-Author

Vickers's monograph offered the most sustained advocacy for the collaboration hypothesis across multiple plays, including Henry VIII. He treated several scenes as securely Fletcherian and challenged critics who resisted the attribution. The current project is more cautious about the same scenes: 5.2 now looks largely Shakespearean under chunk-level analysis, and 3.1 appears unresolved rather than decisively Fletcherian.

Petr Plecháč
2021 · Digital Scholarship in the Humanities

The most important recent computational study. Plecháč combined most-frequent-word analysis with most-frequent-rhythmic-pattern analysis, using rolling attribution to assign authorship probabilities across the play without being locked to scene boundaries. His results broadly support Spedding while adding modifications that partially align with earlier Merriam-style analyses. He found that authorship shifts do not always respect scene boundaries — a key finding that this project's manual chunking approach also supports. However, his rolling trace is considerably more Fletcherward than the current project's local reproduction with a validated reference corpus.

Craig & Kinney
2009 · Shakespeare, Computers, and the Mystery of Authorship

Craig and Kinney's function-word tests of six Henry VIII scenes found that Fletcher wrote 3.1 and 5.2, while Shakespeare wrote the other four tested scenes. Their results partially diverge from the full Spedding map, and subsequent scholars have criticized their Zeta methodology. The current project independently recovers a conservative reading: 3.1 appears mixed rather than decisively Fletcherian, and 5.2 appears largely Shakespearean in three of four sub-chunks.

Will Sharpe
2016 (NOS edition) · 2023 (Shakespeare & Collaborative Writing)

Sharpe edited Henry VIII for The New Oxford Shakespeare and has continued to work on collaborative writing methodology. His 2023 Oxford book frames collaborative authorship in terms of theatre history, book history, and attribution evidence together. The NOS edition takes a cautious position on exact boundaries while affirming collaboration. This editorial caution is strongly aligned with the main finding of the present project.

Folger / Cambridge Editors
Various · Editorial tradition

Both the Folger and Cambridge editorial traditions treat collaboration as highly plausible but refrain from drawing firm scene-by-scene boundaries. The Folger appendix notes that the collaboration hypothesis rests on internal evidence without documentary confirmation. This position — collaboration is likely, exact boundaries remain unstettled — is precisely where the present project also lands.

Recent Literature (2020–2026)

Beyond the core texts above, the following recent scholarship is relevant to the present project:

The key pattern in recent scholarship The recent literature confirms a consistent pattern: (1) most computational studies support collaboration; (2) they disagree significantly on the extent and exact location of Fletcher material; (3) the disagreement often traces back to different reference corpora, feature families, and units of attribution. The present project addresses all three sources of variation explicitly.

3. Data, Texts, and the Reference Corpus

The database

All analysis rests on a local SQLite database of 527 early modern English plays. The database stores spoken-token sequences, lemmas (via MorphAdorner), speaker information, act/scene structure, and bibliographic metadata. Lemmatization allows comparison of lexical habits independently of surface spelling variation, which matters significantly for a play like Henry VIII where compositor practice may have standardized or altered original spelling.

The target play is Henry VIII (database play ID 502, 36th play in the First Folio subset). A diplomatic Folger transcription was separately consulted for the original-spelling investigation.

The secure reference corpus

Author-comparison analysis uses a deliberately narrow set of secure reference plays. These are plays whose single authorship is not seriously disputed and which provide enough text for reliable window-level comparison. The corpus is not claimed to represent the entirety of either author's output — it represents plays secure enough to use as a calibration standard.

AuthorPlayTokens
FletcherBonduca20,226
FletcherThe Faithful Shepherdess19,962
FletcherThe Loyal Subject25,579
FletcherThe Mad Lover18,378
FletcherThe Woman's Prize22,858
FletcherValentinian24,552
ShakespeareAntony and Cleopatra23,855
ShakespeareAs You Like It21,442
ShakespeareCoriolanus26,675
ShakespeareHamlet29,897
ShakespeareHenry IV, Part 124,091
ShakespeareHenry IV, Part 225,899
ShakespeareHenry V25,702
ShakespeareJulius Caesar19,131
ShakespeareKing Lear25,376
ShakespeareMacbeth16,547
ShakespeareMeasure for Measure21,346
ShakespeareOthello25,967
ShakespeareRichard II21,864
ShakespeareThe Tempest16,159
ShakespeareTwelfth Night19,548
ShakespeareMuch Ado About Nothing21,442

The corpus has 16 Shakespeare plays and 6 Fletcher plays. The asymmetry matters: the method will be more comfortable recognizing Shakespeare than Fletcher, and this is confirmed in validation. All conclusions are read against this known asymmetry.

Six governing assumptions

  1. No previously published Henry VIII section map functions as the answer key.
  2. Large scenes are not presumed homogeneous. If a scene changes local dramatic function, it may need manual subdivision.
  3. No method is interpreted before being validated on secure single-author material.
  4. No single feature family decides the case alone.
  5. TNK can be used as a definitely mixed Shakespeare–Fletcher play, but not as a secure internal partition map.
  6. Any chunk that weakens substantially under finer subdivision is treated cautiously, even if it looked persuasive at coarser scale.

4. How Henry VIII Was Manually Chunked

Why scene level was not enough

The project did not begin with the 46-chunk map. It began with scene-level exploration. That first pass immediately revealed a problem: many of Henry VIII's scenes are too long and too internally varied to be treated as single authorial units. A single scene in this play may contain a ceremonial opening, a legal accusation, a private emotional reaction, a packet exchange, and a formal conclusion — five dramatically distinct phases within one scene boundary. Attributing that kind of scene to one author is not just potentially wrong; it is not even a well-formed question.

The methodological response was manual chunking: inserting boundaries only where the dramatic action itself makes a local break intelligible, in the way a literary critic would notice a change even without any stylometric agenda.

Principles of chunking

A new chunk boundary was drawn when one or more of the following occurred:

This means the chunk is best understood as a local dramatic unit, not a mechanical token slice. The result was a 46-chunk map across the play's 16 scenes.

Representative chunking examples

ChunkSceneTokensLabelBoundary rationale
1.3a1.3575Court satire and French fashionsCompact comic-expository scene; kept whole because internal mode stays stable
3.2a3.2607Noble coalition and anti-Wolsey exposureOpening noble conference through Wolsey-Cromwell encounter before private alarm
3.2b3.2679Packet exchange and Wolsey's inward alarmPrivate alarm and preparation for downfall — distinct from the public confrontation
3.2c3.2123Wolsey confronted and strippedBrief formal article-recital block; highly compressed public humiliation
3.2d3.2935Wolsey after the fallReflective post-fall aftermath — a dramatically distinct emotional register
5.4a5.467Procession naming and opening blessingCeremonial prelude before prophecy; mode shifts sharply from 5.3
5.4b5.4359Cranmer's first prophecy movementFirst prophetic arc; internally stable
5.4c5.4182Final prophecy and king's answerSecond arc plus royal response; distinct closure movement
5.4d5.4123EpilogueNon-dramatic; separate unit for all analyses
46-chunk map of Henry VIII
Figure 1. The 46-chunk map of Henry VIII. Colors indicate the working judgment that emerged from the first-principles ledger: Shakespeare-like (teal), mixed/transitional (amber), Fletcher-like (red), or too short to assess (grey). The strong Shakespearean backbone running through Acts 1–3 is immediately visible, as are the narrow Fletcher-like pockets in 1.3 and 5.3.
28
Shakespeare-like or leaning chunks
2
Fletcher-like chunks in final map
13
Mixed / transitional chunks

5. How The Two Noble Kinsmen Was (and Was Not) Used

Earlier drafts of this project were tempted to use The Two Noble Kinsmen (TNK) as a validation case: if a method can recover the familiar internal Shakespeare/Fletcher partition of TNK, then it can be trusted on Henry VIII. This is a tempting shortcut, but it is methodologically circular.

The standard TNK partition (Acts 1, 2, and 5 mostly Shakespeare; Acts 3 and 4 mostly Fletcher) is itself disputed and internally derived. Using it as a validation target means assuming precisely what you are supposed to test. If your method recovers the disputed TNK map, all you have shown is that your method agrees with a prior disputed conclusion — not that either is correct.

The evidentiary rule adopted here TNK may be treated as a definitely mixed Shakespeare/Fletcher play. It may not be treated as if its internal act-by-act or scene-by-scene partition were secure ground truth. This means TNK enters the analysis only through coarse whole-play texture metrics — not through scene-labeled attribution comparisons.

This is a narrower use of TNK, but a cleaner one. Under this rule, TNK is useful for the following question: Do Henry VIII and TNK both show mixed internal texture clearly above the single-author baseline? That question can be answered without assuming we know which TNK scenes belong to which author.

6. The Methods, Explained

Method 1: The Refined Stack

The main local attribution method is the "refined stack," combining three feature families. Each Henry VIII chunk is represented in three different ways, and compared against rolling windows of the same length drawn from secure Shakespeare and Fletcher reference plays.

Character trigrams (char): overlapping three-character sequences (e.g., hon, ono, nou from honour). These capture recurrent spelling shapes and some lexical habits without becoming a simple bag of full words. They are particularly useful because they are somewhat insensitive to morphological inflection.

Function-word profile (fw): high-frequency words like and, of, with, for. In authorship work, function words are prized because they are less topic-dependent than content words — a playwright uses the and that habitually, regardless of whether the scene is about love or politics.

Marker profile (marker): a small targeted set of authorship markers with a known prior in Shakespeare/Fletcher attribution — especially pronoun preferences, auxiliary verb choices, and contraction habits (see Method 4 below for detail).

For each feature family, the chunk is compared to reference windows using cosine distance. The best-matching windows on each side are identified. The method then asks: on average, is this chunk nearer to Shakespeare windows or Fletcher windows? If two of three feature families agree on one author, that is the chunk's reading. If they split or the margin is small, the chunk is Mixed.

Why these three families, and not others? Early validation runs tested five families: function words, lemmas, character trigrams, markers, and dialogue geometry. Fletcher recovery was weak for lemmas (0.389) and very weak for geometry (0.250). The refined stack drops those two weak families and focuses on the three that actually discriminate Fletcher in the secure corpus.

Method 2: Rolling Attribution

The rolling-attribution pass asks the same authorship question continuously rather than chunk-by-chunk. A fixed-size window (400 or 700 tokens) is moved through the entire play one step at a time. Each window is compared to the secure reference corpus. The result is a continuous Fletcher-proxy trace — a number between 0 and 1 at each position, where higher values indicate closer proximity to the Fletcher reference plays.

This method is the direct local equivalent of Plecháč's rolling attribution. The key difference is the feature stack (trigrams + markers + function words here, versus most-frequent words + most-frequent rhythmic types in Plecháč) and the reference corpus (16 Shakespeare / 6 Fletcher here vs. a different selection in Plecháč).

The rolling method has two advantages over chunk-based analysis: it does not depend on chunk boundary placement, and it can reveal whether apparent Fletcher pressure forms a genuine continuous block or a brief local spike.

Method 3: Verse and Rhyme

The verse-and-rhyme layer reconstructs line-by-line structure from the token sequence and measures several lineation-sensitive features: feminine-ending tendencies, strong end-stopping, shared lines between speakers, and adjacent rhyme density. This layer matters because Spedding's original argument was already lineation-sensitive, and because several scholars (including Plecháč) have found verse patterns diagnostic.

However, in this project's validation, verse features perform only moderately on the secure reference corpus. Feminine endings achieve about 0.727 accuracy for the Shakespeare/Fletcher distinction. Shared-line and end-stop claims are weaker still. The verse layer is therefore used as a secondary check rather than a primary decision criterion.

Method 4: Classic Marker Audit

Rather than simply assuming that the traditional Fletcher markers work, the project tested them against the secure corpus. The question was: which traditional markers actually separate secure Shakespeare plays from secure Fletcher plays in the local reference data?

MarkerFavorsAccuracyStrengthUsable?
feminine ending rateFletcher0.727moderateYes (secondary)
ye frequencyFletcher0.909strongCautiously (transmission-sensitive)
'em rather than themFletcher0.955strongYes
has rather than hathFletcher0.955strongYes
does rather than dothFletcher0.773moderateYes
colloquial contraction densityFletcher0.909strongYes

The strongest markers are 'em/them and has/hath, each reaching 0.955 accuracy. The ye marker is strong in principle but complicated by its sensitivity to database normalization — some lemmatization pipelines collapse ye and you, which would suppress the signal. Results using ye are read with extra caution.

Crucially, even strong markers behave unevenly across Henry VIII: they are often local within a scene rather than scene-wide. This is one of the most important empirical results in the entire project — it undermines the traditional assumption that a scene with several Fletcher marker hits is a uniformly Fletcherian scene.

Method 5: Original-Spelling Check

Because spelling arguments recur in the Henry VIII tradition (Hoy's work relied partly on ye and 'em in original spelling), the project returned to a diplomatic Folger transcription of the play. This layer was handled with caution. Ordinary Folio spellings — heere, beleeue, honour — are as likely to reflect compositor practice as authorial habit. More local colloquial forms — ye, 'em, ha's — are more informative but still potentially affected by scribe or compositor.

The main value of this layer is negative: spelling evidence can intensify a pre-existing suspicion, but it does not by itself settle a chunk.

Method 6: Content-Word Frequency Profile (CWFP)

The CWFP method asks a fundamentally different question from the refined stack. It ignores function words and linguistic markers entirely, and compares texts only through recurring content lemmas — words like honour, grace, blood, power, lord, love. The vocabulary is built from ~2,567 content lemmas that appear in at least 50% of reference plays.

CWFP is useful because it provides an independent signal. If a chunk shows Fletcherward pressure in both the refined stack and CWFP, that cross-method convergence is stronger evidence than either alone. CWFP is more vulnerable to subject matter (a politically intense scene will use different content words than a comic one), but it is less vulnerable to the marker-transmission problems that affect function-word approaches.

CWFP and authorship attribution CWFP was originally developed and validated for this project on the full First Folio corpus, achieving top-1 accuracy of 97.2% in identifying which First Folio play a text window comes from. Its validation on the Shakespeare/Fletcher binary classification task achieves 0.879 overall at 700-token windows — respectable but more conservative than the refined stack.

Method 7: Genre and Mode Contamination Test

The most important methodological question in the entire project is this: could the Fletcherward pressure in suspect Henry VIII chunks simply reflect their dramatic mode rather than their authorship? Court satire might look Fletcherward. Comic crowd scenes might look Fletcherward. Processional narration might look Fletcherward — regardless of who wrote the passage.

The contamination test directly addresses this objection. For each surviving pressure chunk, the project found exact-length windows from secure Shakespeare plays that matched the Henry VIII chunk as closely as possible in dramatic geometry: same approximate speaker count, same turn structure, same concentration of speech among a small number of speakers. These matched Shakespeare windows were then put through the refined stack and CWFP.

If the matched Shakespeare controls also drift Fletcherward, the contamination explanation is supported and the Henry VIII pressure is inconclusive. If the Henry VIII chunks are consistently more Fletcherward than their matched Shakespeare controls, the contamination defense weakens.

7. Validation Before Interpretation

The asymmetry in method performance is the most important single fact about the project's evidentiary standard. It must be understood before any Henry VIII result is interpreted.

Refined stack validation

Leave-one-play-out validation on the secure corpus gives the following results:

Window sizeOverall accuracyShakespeare recoveryFletcher recovery
200 tokens0.8791.0000.556
400 tokens0.9091.0000.667
700 tokens0.9551.0000.833

CWFP validation

Window sizeOverall accuracyShakespeare recoveryFletcher recovery
200 tokens0.7880.9380.389
400 tokens0.8791.0000.556
700 tokens0.8790.9790.611
Validation comparison chart
Figure 2. Validation accuracy on the secure reference corpus for both main attribution methods. Both methods recover Shakespeare more reliably than Fletcher. This asymmetry means that a conservative Henry VIII result — one where relatively few chunks appear Fletcherian — cannot be taken at face value as evidence against collaboration. The method is easier on Shakespeare. Any positive Fletcher finding therefore carries more weight than an absence of Fletcher findings.
The asymmetry matters for interpretation Fletcher recovery at 700-token windows is 0.833 for the refined stack and only 0.611 for CWFP. At smaller chunks (many Henry VIII chunks are 200–500 tokens), Fletcher recovery falls further. This means: when the method says "Shakespeare," we can be quite confident. When it says "Mixed" for a small chunk, some of those could be Fletcher — the method simply cannot tell.

8. The Main Henry VIII Findings

The first-principles analysis produces a result that is neither the broad traditional Fletcher map nor an all-Shakespeare reading. The play has a genuine Shakespearean backbone — visible across multiple methods and across the manual chunking, the rolling attribution trace, and the content-word profile. But a handful of locally persistent Fletcherward pockets survive the full battery of tests.

The Shakespearean backbone

The Wolsey material — the political rise and fall across Acts 1–3 — remains consistently and strongly Shakespearean across the refined stack, rolling attribution, and content-word profile. Scenes like 1.1, 1.2, 2.4, and the entire 3.2 complex (Wolsey's fall and its aftermath) are among the most securely Shakespearean regions in any attribution analysis.

The legal-ceremonial scenes — particularly the trial of Katherine in 2.4 and the Cranmer vindication in Act 5 — are also strongly Shakespearean. The play's political-historical architecture appears to be Shakespeare's in its most fundamental design.

What looked Fletcherian and then weakened

Before local subdivision, several parent chunks looked suspicious. What happened to each under finer resolution is the key story of the project:

1.3a Court satire and French fashions — Initially the best local Fletcher island under the refined stack. But under manual subdivision into three local units, only the opening satire subunit retained any Fletcher signal. The central proclamation and closing Wolsey movement turned Shakespearean. Working judgment: cautiously F-like in the outer frame, mixed internally.
4.1b Coronation procession and immediate response — Strengthened briefly under the verse-and-rhyme layer. Under local resolution it did not sustain a stable Fletcher reading. The core ceremonial description turned Shakespeare-adjacent. Working judgment: Mixed.
5.2b Council attack on Cranmer — Earlier had attracted some lineation-based pressure. Under local resolution, three of four sub-units resolved Shakespearean. Only the brief Cromwell-Gardiner quarrel retained a mild Fletcherward lineation pull. Working judgment: Sh-like.
3.2c Wolsey confronted and stripped — Showed article-recital pressure. Under local resolution, all four sub-units resolved Shakespearean. Working judgment: Sh-like.
5.3a Porter comic crowd-control — The strongest and most persistent Fletcher pressure point across all methods. Remains F-like in refined stack, rolling attribution (at both 400 and 700 tokens), and shows Fletcherward CWFP margin. Working judgment: F-like.
Subchunk resolution results
Figure 3. What happens when the main parent pressure chunks are divided into local sub-units. In most cases, the local sub-units are predominantly Shakespearean even when the parent chunk looked suspicious. This does not mean the pressure was imaginary — it means the old scene-scale labels were too coarse. The most important residue after subdivision is in 1.3a (partly) and 5.3a (strongly).
Traditional scenes vs chunk-level map
Figure 4. The classic Spedding/Hoy Fletcher scenes shown alongside the chunk-level working map from this study. The key observation: traditional scene-level assignments often cover regions that the chunk-level analysis shows to be internally divided or largely Shakespearean. The traditional map is not simply wrong — its pressure points are often real — but it is too coarse about the internal composition of those scenes.
Method matrix comparison
Figure 5. The whole-play method matrix: for each chunk, the working ledger judgment, the refined stack result, the CWFP result, and the rolling attribution result at 400 and 700 tokens. The most important pattern: the four methods broadly agree on the Shakespeare-like backbone. Where they disagree most is in the middle of the play (Acts 2–3) and in the problem zones of 1.3 and 5.3.

Rolling Attribution: The Continuous View

The rolling attribution pass approaches Henry VIII from the opposite direction from manual chunking: it makes no assumptions about where dramatic units begin and end, and simply asks, window by window, whether each 400- or 700-token stretch is closer to the Shakespeare or Fletcher reference corpus.

The result is striking. The continuous trace is overwhelmingly Mixed, with only a very small number of decisively Fletcher-classified windows:

Window sizeTotal windowsShakespeareFletcherMixed
400 tokens184333148
700 tokens182321149

The rolling trace does not produce a broad alternating Shakespeare/Fletcher map. The play is not neatly divisible into large Shakespeare blocks and large Fletcher blocks. Instead, most windows sit in the mixed zone, with a small number of strong Shakespeare windows in the Wolsey and trial material, and a handful of Fletcher windows concentrated in 5.3a and, more weakly, 1.3a.

This is the most direct local answer to the Plecháč-style rolling question. The current result is considerably more conservative than Plecháč's. The key methodological differences: the current project's reference corpus is more carefully validated, and the feature stack (trigrams + markers + function words) is different from Plecháč's most-frequent-word plus verse-rhythm combination. Whether this difference reflects a genuine methodological improvement, or reflects the different feature families responding to different aspects of style, is itself an open research question.

Rolling attribution trace
Figure 6. The rolling attribution trace across Henry VIII. Each point represents one overlapping window; higher values indicate closer proximity to the Fletcher reference plays. The strong Shakespearean windows in Acts 1–3 (the Wolsey material) are clearly visible. The Fletcher spikes are narrow and localized — not the broad continuous Fletcher blocks that scene-level tradition implies.

9. The TNK Coarse Benchmark

Under the stricter evidentiary rule, TNK is used only to ask: do Henry VIII and TNK both show coarse mixed internal texture, clearly above the single-author baseline? The answer is yes for both plays — but in different ways.

PlayWindowWindowsMixed shareSwitch rateEntropy
Henry VIII400580.8100.2630.701
Henry VIII700570.7720.2320.775
The Two Noble Kinsmen400580.9310.1230.362
The Two Noble Kinsmen700580.9480.1050.294

TNK has a higher mixed share — more of its windows sit in the middle zone — but a lower switch rate and lower entropy. This means TNK looks steadily mixed, while Henry VIII looks more volatile: it swings more sharply between Shakespeareward and Fletcherward readings. Both plays are well above the single-author baseline in internal texture complexity.

This difference between the two plays is itself informative. If the method simply flattened all plays into undifferentiated ambiguity, Henry VIII and TNK would look identical. They do not. Henry VIII is more volatile; TNK is more uniformly mixed. That distinction is consistent with Henry VIII having a strong Shakespearean backbone with a smaller number of concentrated Fletcherward pockets, while TNK has more evenly interleaved authorial contributions.

Henry VIII and TNK rolling traces
Figure 7. Rolling Fletcher-proxy traces for Henry VIII and TNK at 700-token windows. TNK (right) is more steadily mixed; Henry VIII (left) fluctuates more sharply. The narrow Fletcherward spikes in Henry VIII contrast with TNK's more uniform mixed texture. Note that the current project does not use this comparison to locate Fletcher in TNK — it uses it only to ask whether both plays are recognizably mixed relative to single-author plays.
Mixed texture comparison
Figure 8. Whole-play mixed-texture metrics for Henry VIII, TNK, and the secure single-author reference plays. Both Henry VIII and TNK sit clearly above the single-author plays in mixed-share and several variance metrics. The difference between them illustrates that the method is sensitive to the character of the mixing, not just its presence.

10. CWFP Results

The CWFP pass provides a new kind of evidence because it is based on common content words rather than function words or markers. A chunk that shows Fletcherward pressure in CWFP cannot be dismissed by arguing that the refined stack merely reflects marker frequency or function-word habit.

Summary of CWFP classifications

11
Shakespeare-leaning chunks
1
Fletcher chunk (5.4b)
34
Mixed chunks

CWFP does not restore the traditional large Fletcher share. Only one chunk receives a clear Fletcher classification — 5.4b (Cranmer's first prophecy movement), whose nearest CWFP neighbor is Fletcher's Valentinian. But the margin pattern is more informative than the classification count: many chunks that CWFP classifies as Mixed are slightly closer to the Fletcher reference than to Shakespeare, and these subtle margins converge with other methods in consistent ways.

Most Fletcher-leaning by CWFP

ChunkLabelCWFP Fletcher marginNearest play
5.4bCranmer's first prophecy−0.025Fletcher / Valentinian
3.2dWolsey after the fall−0.017Fletcher / Valentinian
3.1bCardinals with Katherine−0.016Fletcher / Valentinian
4.2bGriffith's defense and Katherine's vision−0.010Fletcher / Valentinian
4.1bCoronation procession−0.010Fletcher / The Mad Lover
2.2bWolsey-Campeius divorce management−0.009Fletcher / Valentinian
5.3aPorter comic crowd-control−0.009Fletcher / The Woman's Prize

Most Shakespeare-leaning by CWFP

ChunkLabelCWFP Shakespeare marginNearest play
1.1cBuckingham arrest and fatal exit+0.041Shakespeare / Measure for Measure
2.4dKing's conscience narrative+0.039Shakespeare / King Lear
1.2cBuckingham matter and surveyor+0.037Shakespeare / Measure for Measure
2.4cKing's vindication of Wolsey+0.036Shakespeare / Antony and Cleopatra
3.2aNoble coalition and exposure+0.033Shakespeare / Measure for Measure
1.1bWolsey grievance and foreign-policy+0.030Shakespeare / Henry V
Pressure summary chart
Figure 9. The surviving pressure chunks under the two main attribution methods. Negative margin means the chunk is more Fletcherward. The overlaid points show the number of validated Fletcherward markers in the same chunk. The convergence of negative CWFP margin, negative refined margin, and high marker count in the same chunks is the strongest available evidence for locally persistent Fletcherward pressure.

CWFP–Marker Crosswalk

Cross-method convergence is the strongest form of evidence available. The following chunks show both high validated marker hits (≥3) and a Fletcherward CWFP margin:

ChunkLabelMarker hitsCWFP nearestRefined result
1.3aCourt satire and French fashions3Fletcher / The Mad LoverF-like
2.2bWolsey-Campeius divorce management3Fletcher / ValentinianSh-like
3.1bCardinals with Katherine3Fletcher / ValentinianMixed
3.2dWolsey after the fall3Fletcher / ValentinianSh-like
4.1aGentlemanly prelude to coronation3Fletcher / The Mad LoverSh-like
4.1cThird Gentleman narrates coronation3Fletcher / The Mad LoverSh-like
5.4cFinal prophecy and king's answer3Fletcher / ValentinianMixed

Note that several of these chunks — 2.2b, 3.2d, 4.1a, 4.1c — are classified as Shakespeare-like by the refined stack despite having high marker counts and Fletcherward CWFP neighbors. This is an important interpretive point: it suggests that the marker and content-word signals in these chunks may be picking up dramatic mode (ceremonial narration, counselor banter, public accusation) rather than authorial identity. The genre/mode contamination test was designed precisely to investigate this possibility.

11. The Genre/Mode Contamination Test: Results

This is the methodological crux of the project. For each surviving pressure chunk, the test found 20 exact-length windows from secure Shakespeare plays with the closest available dramatic geometry (speaker count, turn structure, speaker concentration). Those confirmed-Shakespeare windows were then run through the refined stack and CWFP. If they also drift Fletcherward, the Henry VIII pressure could be dismissed as a mode effect. If they do not, the contamination defense weakens.

Key result: the contamination defense largely fails Under the refined stack, matched Shakespeare controls produced zero Fletcher classifications across all tested pressure chunks. Under CWFP, matched Shakespeare controls also produced zero Fletcher classifications. Several Henry VIII chunks — especially 1.3a, 5.3a, 3.1b, and 3.2d — are more Fletcherward than nearly all of their matched Shakespeare controls.
ChunkLabel Refined targetMatched Sh controls (F/M/S)Refined percentile CWFP targetMatched Sh controls (F/M/S)CWFP percentile
1.3aCourt satire F-like0 F / 1 M / 19 S0.000 Mixed0 F / 9 M / 11 S0.100
2.2bWolsey-Campeius Sh-like0 F / 0 M / 20 S0.000 Mixed0 F / 10 M / 10 S0.000
3.1bCardinals with Katherine Mixed0 F / 0 M / 20 S0.000 Mixed0 F / 5 M / 15 S0.000
3.2dWolsey after the fall Sh-like0 F / 0 M / 20 S0.000 Mixed0 F / 6 M / 14 S0.000
4.1aGentlemanly prelude Sh-like0 F / 1 M / 19 S0.050 Mixed0 F / 11 M / 9 S0.250
4.1cThird Gentleman narrates Sh-like0 F / 0 M / 20 S0.000 Mixed0 F / 8 M / 12 S0.000
5.3aPorter crowd-control F-like0 F / 0 M / 20 S0.000 Mixed0 F / 12 M / 8 S0.000
5.4cFinal prophecy Mixed0 F / 3 M / 17 S0.000 Mixed0 F / 12 M / 8 S0.100

The percentile column is the key number. A percentile of 0.000 means the Henry VIII chunk is more Fletcherward than every single one of the 20 matched Shakespeare controls. Most of the live pressure chunks achieve this. This is a strong result: it means the contamination objection cannot explain away the pressure in these specific locations.

What the contamination test does not prove The test shows that comparable Shakespeare scenes in the secure corpus do not drift Fletcherward in the same way the Henry VIII pressure chunks do. This changes the burden of proof — you can no longer dismiss the pressure as a normal mode effect. But it does not prove that Fletcher wrote these chunks. Other explanations remain possible: Shakespeare's style at this point in his career was unusually close to Fletcher's; the passages involved unusual dramatic modes that Shakespeare rarely used elsewhere; or transmission effects affected exactly these passages. The test narrows the field of explanations, but does not eliminate all alternatives.
Contamination test percentile chart
Figure 10. Percentile of each live Henry VIII pressure chunk among its matched Shakespeare controls. Lower percentile means the Henry VIII chunk is more Fletcherward than most of the Shakespeare controls matched to it in dramatic geometry. The near-zero percentiles for 1.3a, 5.3a, 3.1b, 3.2d, 4.1c, and 2.2b are the most important numbers in this chart.

12. Comparing the Current Map to the Traditional Scene Partition

The following table shows each of the sixteen scenes in Henry VIII, indicating whether traditional scholarship assigned it to Fletcher, how many validated marker hits the current analysis found, and how the current chunk-level analysis distributes the scene's local units.

SceneTraditional Fletcher scene?Validated marker hits Current Sh-like chunksCurrent Mixed chunksCurrent F-like chunks
1.1No13
1.2No13
1.3Yes (Spedding)31
1.4Yes (Spedding)221
2.1Yes (Spedding)321
2.2Yes (Spedding)22
2.3No12
2.4No131
3.1Yes (Spedding)32
3.2No (partly in some traditions)4
4.1No (Spedding had it Sh)321
4.2Yes (Spedding)121
5.1No31
5.2Yes (some traditions)112
5.3Yes (Spedding)21
5.4Yes (Spedding)111

The most striking pattern: among the nine scenes that Spedding assigned to Fletcher, the current chunk-level analysis finds only two scenes that contain any F-like chunks (1.3 and 5.3). The other seven scenes that Spedding called Fletcher territory contain chunks that resolve as Shakespearean or Mixed under the current analysis. This does not mean Spedding was entirely wrong — his pressure instincts were often correct at the scene level. It means the Fletcher signal within those scenes is often local rather than scene-wide.

13. Conclusions

The findings can be stated in a way that is both cautious and affirmative. They do not require choosing between "Spedding was right" and "the play is all Shakespeare." A more specific and better-supported picture is available.

Finding 1: A Shakespearean backbone is real The Wolsey material (Acts 1–3), the trial of Katherine (2.4), the Cranmer vindication (Act 5), and most of the play's legal-historical architecture are consistently Shakespearean across the refined stack, rolling attribution, CWFP, and the subchunk resolution studies. This result is robust across multiple methods and multiple units of analysis.
Finding 2: A few locally persistent Fletcherward pockets survive After all validation, subchunk resolution, and contamination testing, a small set of chunks remain consistently more Fletcherward than matched Shakespeare controls: most strongly 5.3a (Porter crowd-control), and with lesser confidence 1.3a (court satire), 3.1b (Cardinals with Katherine), and 5.4c (final prophecy). These pockets are real: they are not easily explained by dramatic mode, they survive content-word profiling, and they sit well outside the distribution of matched Shakespeare controls.
Finding 3: The traditional broad scene map is too coarse The Spedding/Hoy scene-level partition overstates both the continuity and the extent of the Fletcher signal. Many scenes traditionally assigned to Fletcher — including large parts of 2.1, 2.2, 3.1, 4.2, 5.2, and 5.4 — contain substantial Shakespearean material under chunk-level analysis. The Fletcher signal within these scenes is often local (a sub-block, a particular mode, a brief exchange) rather than scene-wide.
Finding 4: TNK is a definitely mixed play but not a scene-labeled answer key The coarse benchmark confirms that both Henry VIII and TNK show mixed internal texture clearly above the single-author baseline. Henry VIII is more volatile in its mixing; TNK is more uniformly mixed. Using TNK as a labeled validation case — as some earlier workflows were tempted to do — would have introduced circular reasoning, and the present project's decision to reject that move is methodologically important.
Finding 5: The best picture is intermediate Henry VIII is neither cleanly divisible into large Fletcher scenes nor reducible to an all-Shakespeare reading. It is better described as a largely Shakespearean play with a small number of locally persistent Fletcherward pockets — narrower, more fragile, and more concentrated than the classic scene tradition implies. The most important revision to received wisdom is not "Fletcher disappears." It is: "the broad old map overstates how continuous and how scene-wide the Fletcher signal is."

What would change the picture

The following findings would substantially revise the current conclusions:

Appendix: Complete Chunk-by-Chunk Method Ledger

The following table gives all 46 chunks with their scene assignment, token count, short label, and judgments from the four main analysis layers: the refined stack, CWFP, rolling attribution at 400 and 700 tokens, validated marker hit count, and the working judgment that synthesizes across methods.

ChunkSceneTokensLabel RefinedCWFPRoll 400Roll 700MarkersJudgment
1.1a1.1681Field-of-Cloth spectacleShMixMixMix1Sh-like
1.1b1.11159Wolsey grievance and corruptionShShMixMix3Sh-like
1.1c1.1237Buckingham arrestShShShMix2Sh-like
1.2a1.2423Royal opening and grievanceShShMixMix1Sh-like
1.2b1.2437Taxation debate and revocationShMixMixMix2Sh-like
1.2c1.2871Buckingham matter and surveyorShShMixMix2Sh-like
1.3a1.3575Court satire, French fashionsFlMixFlMix3F-like
1.4a1.4293Banquet wit and erotic banterShMixMixMix3Sh-like
1.4b1.4228Wolsey's feast under interruptionMixMixMixMix2Mixed
1.4c1.4373Masque recognition, Anne singled outShMixMixMix3Sh-like
2.1a2.1441Gentlemen on Buckingham's trialShMixMixMix3Sh-like
2.1b2.1708Buckingham's public valedictionMixMixMixMix2Mixed
2.1c2.1262Aftermath and divorce rumorShMixMixMix2Sh-like
2.2a2.2674Anti-Wolsey counselShMixMixMix3Sh-like
2.2b2.2488Wolsey-Campeius divorce managementShMixMixMix3Sh-like
2.3a2.3412Anne's pity, anti-eminence talkMixMixMixMix2Mixed
2.3b2.3475Elevation of AnneMixMixMixMix2Mixed
2.4a2.4438Ceremonial opening, Katherine's pleaShShMixMix1Sh-like
2.4b2.4601Wolsey-Campeius answer, Katherine's refusalMixShMixMixMixed
2.4c2.428King's vindication of KatherineShShMixMix2Sh-like
2.4d2.4595King's conscience narrativeShShMixMix2Sh-like
3.1a3.1165Katherine's melancholy and receptionMixMixMixMix2Mixed
3.1b3.11347Cardinals with KatherineMixMixMixMix3Mixed
3.2a3.2607Noble coalition, anti-Wolsey exposureShShMixMixSh-like
3.2b3.2679Packet exchange, Wolsey's alarmShShMixMix2Sh-like
3.2c3.2123Wolsey confronted and strippedShMixMixMix3Sh-like
3.2d3.2935Wolsey after the fallShMixMixMix3Sh-like
4.1a4.1295Gentlemanly prelude to coronationShMixMixMix3Sh-like
4.1b4.1157Coronation processionMixMixMixMix2Mixed
4.1c4.153Third Gentleman narrates coronationShMixMixMix3Sh-like
4.2a4.2358Katherine judges WolseyShMixMixMix2Sh-like
4.2b4.2277Griffith's defense, Katherine's visionMixMixMixMix1Mixed
4.2c4.2745Capuchius, Katherine's final petitionsShMixMixMix3Sh-like
5.1a5.1472Gardiner and Lovell on CranmerShShMixMix2Sh-like
5.1b5.1223King, Suffolk, queen's laborShMixMixMixSh-like
5.1c5.1584King and Cranmer in privateMixMixMixMix2Mixed
5.1d5.116Birth report, Old Lady comic closeShMixMixMix2Sh-like
5.2a5.2289Cranmer waiting, king observesMixMixMixMix2Mixed
5.2b5.2787Council attack on CranmerShMixMixMix3Sh-like
5.2c5.2735Ring and royal restorationMixMixMixMix2Mixed
5.3a5.3592Porter comic crowd-controlFlMixFlFl2F-like
5.3b5.3138Chamberlain restores orderMixMixMix1
5.4a5.467Procession naming, opening blessingMixMixMix
5.4b5.4359Cranmer's first prophecyShFlMixMix1Sh-like
5.4c5.4182Final prophecy and king's answerMixMixMixMix3Mixed
5.4d5.4123EpilogueMixMixMix2

Works Cited and Comparison Points


Henry VIII Authorship Project · March 2026 · Codex research workspace
Database: early_modern_plays.db · 527 plays · Target: play ID 502 · Chunks: 46