Falsifiable Hypotheses for Research in Bowen Theory

This page serves as a collection for a list of example falsifiable hypotheses necessary to progress research under Bowen Theory. This page and its list of additional hypotheses will be constantly refined as new ideas are generated.

Falsifiable hypotheses are the universal requirement for accepted scientific experiments. The more subjective a set of ideas, the more difficult it is to generate falsifiable hypotheses from them. Ideas about human behavior are often so subjective that they cannot be falsified. These ideas are aptly called “not even wrong” (Woit, 2007). Love/hate, good/bad, soul/spirit, are all subjective ideas that are impossible to measure or test without the development of precise definitions.

So far as formal research goes, Bowen Theory remains no different. A review of available literature (Stinson, 2020) showed that there have currently been no falsifiable experiments that are 1) actually congruent with Bowen Theory, and 2) published in peer-review academic journals outside of the Bowen Network. No matter how small or how simple, any falsifiable experiment represents a contribution to the knowledge base behind Bowen Theory and the broader field of psychology as the study of human behavior.

Markers of a Falsifiable Hypothesis

  • There is a clearly defined positive and negative result in the prediction.
  • The hypothesis uses definitions that no one on the street would debate, i.e. the definitions are “factual.”
    • OR: The hypothesis uses terms agreed upon by trained experts if no one on the street debates the outcome of the prediction is defined in terms no person would debate.
  • Technology can be built around the reliability of the prediction (Technology repeatedly validates every hypothesis and every theory it depends on.)

The presence of technology that actually works probably is a slam-dunk validation of a hypothesis.

Two population-level waves of COVID infections where virtually all hospitalizations were unvaccinated people validates the theory and hypothesis of the RNA vaccine. The vaccine itself, the infrastructure to administer it, and any other tools built upon the assumption that the infrastructure works are examples of technology repeatedly validating the experimental hypothesis from the clinical trials. This population-level replication is the best evidence that can be found for a theory.

The fact that the entire civilized world, which is now most of the world, depends on mobile phones. The Every time a person picks up a mobile phone, they are replicating the original experiments run by the R&D departments of the phone developers. This is a population-level study on not just the hypotheses predicting the function of the software and hardware components, but all of the theories of electromagnetism, physical laws dictating the transmission and reception of audio signals, light, electricity, etc, and all laws underlying the technologies driving the infrastructure upon which that device is connected. This is as good as it gets for the validation of falsifiable prediction, where a hypothesis is tested a trillion-trillion times per second all across the world.

Theories of human behavior are nowhere near this level of precision. That is because we do not have the knowledge required to produce precise definitions which predict human behavior. That is understandable, as Homo sapiens studying himself is perhaps the ultimate scientific challenge. But that does not change the ultimate standard to which all science aspires – to produce explicit predictive models which by definition rely on falsifiable predictions.

List of Example Hypotheses

Inter-rater Reliability of Any Number of Specialized Terms

There have been no studies that establish inter-rater reliability for specialized terms (i.e. jargon) from Bowen Theory. Inter-rater reliability is required to demonstrate that a term has a definition precise enough for scientific research. Until a term has inter-rater reliability, it cannot be used to track data for a data model.

Examples of terms that are widely used but do not have demonstrated inter-rater reliability:

  • Anxiety, Chronic
  • Anxiety, Acute
  • Functioning
  • Open Contact
  • Governed by self
  • Self
  • Governed by self
  • Governed by system
  • Inside of Triangle
  • Outside of triangle
  • Preferred position of triangle
  • Separate more of a self
  • Emotional fusion
  • Stuck-togetherness
  • and more…

An experiment to establish inter-rater reliability could be conducted for any one of these terms. Such studies are simple to organize and administer. Any difficulty will arise from the vagueness of the ideas themselves as opposed to a scientific methodology.

An experiment to test the inter-rater reliability of a term is easy to organize. You must compile a list of example uses of the term. The list must also include a number of example uses which appear similar to, but are not the term. You then give the list to a number of rater subjects who must identify which examples are and are not accurate representations of the term. Then then check and see whether the rater’s responses agree.

This kind of research is extremely important for Bowen theory as a science. It is not possibly to track data using a technical term until inter-rater reliability has been established for it. Until that occurs, formal scientific claims cannot be made referring to observations made using the term.

Definition of Anxiety (Inter-Rater Reliability)

The idea of anxiety is widely used in Bowen Theory. There currently exists no reliable way to identify increases and decreases in anxiety. This is the case both for acute shifts in anxiety or shifts that occur over longer periods of time.

An experiment would propose a definition for anxiety prior to the experiment, give some set of example shifts in anxiety up or down, and predict that there will be no notable agreement among a set of subjects asked to identify whether any shift occurred in each example, and if so when and whether the shift was up or down.

Some suggestions for definitions of anxiety would be an autonomic response that results in the subject rapidly searching for solutions to an urgent problem. This assumes that anxiety has something to do with a combination of increased anxiety and increased urgency. Finding a definition for acute anxiety may be easier than a definition for baseline (i.e. chronic) anxiety, because chronic anxiety is literally “normal” for a person and so is more out of view.

Definition of Triangle (Inter-Rater Reliability)

On first glance, the idea of the “triangle” appears more objective than concepts from most clinical theories. A triangle has two positions: two on the inside, and one on the outside.

An experiment would define when a person was on the inside and outside of a triangle. The hypothesis would predict that there would be no notable agreement among independent subjects, trained or otherwise, in identifying the inside and outside positions of a triangle from various examples. Examples could be written narrative, audio, or video recordings.

Positions in a Triangle (Inter-Rater Reliability)

Once inter-rater reliability was established for the triangle concept, subsequent experiments could test the inter-rater reliability of shifts in the triangle configuration from a set of examples. The definition of a shift would be self-evident given the definitions of inside and outside positions. The hypothesis would predict that there would be no notable agreement between subjects, trained or otherwise, in identifying shifts in triangles from the examples. Examples could be written narrative, audio, or video recordings.

The Triangle Phenomenon

Once Inter-rater reliability was established for shifts in positions in a triangle, an experiment could test whether the triangle phenomenon itself actually exists as hypothesized. The existence of triangles are taken as so obvious that there as still been no formal research demonstrating that they actually exist. If triangles are as obvious as they are assumed to be then such research should be low-hanging fruit for a seminal contribution. If triangles are demonstrated to exist, such research could be cited as evidence for the part of an individual in a system that impacts that individual’s functioning.

At the most basic level, a shift to a preferred position in the triangle should result in a decrease in acute anxiety while a shift to an unpreferred position should result in an increase in acute anxiety. A hypothesis would predict that given a “sufficient” increase in tension in a dyad, one or the other person will take a desired an “inside” position either with the other person or with a third. If tension increases “sufficiently” further, one or the other person will seek a preferred “outside” position leaving the other two in an unpreferred inside position.

Examples would be demonstrated easily in the Family Diagram app with one Inside or Outside symbol per dyad in the triangle along with necessary shifts in a “Δ Anxiety” variable. Each shift in inside/outside position would include a start and end date-time of the position to visualize the change. The hypothesis would predict that each shift resulted in a shift in anxiety. A minimum sufficient sample of shifts in a single vignette for a single triangle might be 3. A sufficient sample size might be five vignettes, each from a different triad.

Sample vignettes from triads that did not show triangling must be included in the finding to avoid cherry picking data and provide a more accurate view of what necessitates “sufficient” anxiety for shifts in triangles.

Note that this hypothesis does not include the concept of differentiation of self. Ultimately, the concepts of the theory are interlocking/interdependent and cannot be separated from each other. However, in experimental research initial concessions can be made in order to move more systematically toward hard evidence for an increasing amount of the theory.

It is unclear as of this writing how one might design a falsifiable hypothesis for the triangle phenomenon while including differentiation of self. However, if the most basic aspect of the triangle is demonstrated to exist then subsequent speculation about how to account for variation in triangling would have higher quality data about triangles upon which to experiment.

Attachment Style and the Triangle

Attachment theory is the go-to theory in clinical psychology. It describes the development of a child’s relational pattern in terms of the two-person system of that child and their primary caregiver. It predicts the manner in which that child will respond to a particular anxiety-producing situation.

Bowen Theory describes a person’s response to anxiety in terms of that person’s functional position in their family system. The person is impacted most by their position in the primary triangle between mother and father, less in the triangle between sibling and parent, still less by triangles between extended family members, and still less by friends and professional contacts.

The triangle concept and hypothesis would predict that a person’s “attachment style” as defined by attachment theory would not predict all responses to anxiety when a third person is involved with the subject and primary caregiver. For example, an anxious-avoidant child may “present” with a disorganized attachment style when there is tension between the parents (Dallos, Lakus, Cahart, and McKenzie, 2016; Ross, A. S., Hinshaw, A. B., & Murdock, N. L., 2016).

If correct, this would support the notion that the family system plays as much a part in the response to anxiety as the nature of bond in the primary two-person system.

Conditions for the Selection of a Focused-On Child

The literature suggests that this is a kind of default anxiety binding mechanism for when the other mechanisms are insufficient to bind the anxiety/tension between the parents. Bowen’s own description of the selection process is quite open-ended. It lists some broad ideas for sufficient conditions but nothing testable. Examples are: sex of the child in relation to the family plan, sibling position, emotional climate and nuclear family configuration just prior to the birth of the child, and others.

There is no testable definition of the necessary conditions for two parents to select some child to be the focused-upon one. This means there is no proof that the phenomenon of child-focus exists. Further, there is no clear definition for the conditions sufficient to select one particular child.

A testable hypothesis would provide a definition for the conditions sufficient A) for the selection of some child and B) for which child would be selected. A null-hypothesis would predict that this definition yields no observable reliability across a sample of family cases where some child is assumed to be focused on.

In lieu of a testable definition of conditions for the selection of a focused-upon child, an open-ended exploratory experiment would define a limited set of family cases where there was assumed to be a focused upon child. It would gather observations about each case assumed to contribute to the selection of some child and more specifically the selection of one child in particular. Those patterns would then be incrementally tested and refined with other families until a degree of blind predictive power was achieved.

Baseline of Functioning

TODO: Definition and inter-rater reliability. Essential measurable for any clinical application.

Openness / Connectedness with Extended Family

TODO: Definition and Inter-rater reliability. Essential for Bowen Theory in particular.

Number of facts + certainty can name WRT family members: name, birth dates, birth locations, # of siblings. *potentially* use of subjective/objective points; depends on ability to distinguish between opinion & fact, i.e. “parsing”

Distinguishing Thinking VS Feeling

TODO: Definition and Inter-rater reliability. Essential for concept of differentiation of self. Also Related to Parsing

Systems Thinking

TODO: Definition and Inter-rater reliability.

Ability to “Parse” In Real Time

TODO: Definition and Inter-rater reliability. Related to separating thinking from feeling but more operationalized. Has to do with constructing a timeline, essential for understanding individual functioning let alone system functioning.

Submissions and/or Critique

Good science depends on good-faith critique and contribution. If you have any criticism or ideas for additional hypotheses, please feel free to submit them using the form below.


Dallos, R., Lakus, K., Cahart, M.S., & McKenzie, R. (2016). Becoming invisible: The effect of triangulation on children’s well-being. Clinical Child Psychology and Psychiatry, 21(3), 461–476.

Ross, A. S., Hinshaw, A. B., & Murdock, N. L. (2016). Integrating the relational matrix: Attachment style, differentiation of self, triangulation, and experiential avoidance. Contemporary Family Therapy, 38, 400–411.

Stinson, P. (2020). To What Extent did the Buddha Define a Natural System? Insights from Bowen’s Natural System Theory of the Family. California Institute of Integral Studies, San Francisco, CA.

Woit, P. (2007). Not Even Wrong: The Failure of String Theory and the Search for Unity in Physical Law for Unity in Physical Law. New York, NY: Basic Books.

Share This

New From Alaska Family Systems

Get The Latest Updates

Subscribe For Notifications on New Posts

No spam, notifications only about new products, updates.

Sign Up For News

Stay current.