The Authority, Provenance and Semantic Governance Research Series
White Paper No. 4

A Federated Architecture for Verified and Governed Knowledge Systems

Introducing the Verified Source Protocol and the SHAMIL™ Structured Entity Framework

Younis Group
Search Sciences™ Research Programme

Published under the leadership of Mohammed Younis, Chief Scientist

Version 1.0
April 2026

Publication Note

This paper forms part of the Authority, Provenance and Semantic Governance Research Series produced by Younis Group under the Search Sciences™ Research Programme. The series examines the structural conditions governing authority, provenance, and semantic integrity in AI-mediated information systems.

White Paper No. 4 builds upon the admissibility framework established in White Paper No. 1, the verification-first architectural principles articulated in White Paper No. 2, and the semantic governance model introduced in White Paper No. 3. It formally introduces two complementary but distinct architectural components: the Verified Source Protocol (VSP) as a verification layer governing representational legitimacy, and SHAMIL™ as a deterministic semantic model governing knowledge structure.

The two components are interoperable but not interdependent. VSP defines constraints on authority and provenance without prescribing semantic structure. SHAMIL™ defines structured semantic representation without determining verification mechanisms. Their integration establishes a federated architectural model whilst preserving clear functional separation.

The intellectual foundations of both components are rooted in the Islamic Golden Age tradition of information science. VSP instantiates, in computational form, the provenance-chain methodology formalised by Imam Al-Bukhari in the ninth century. SHAMIL™ instantiates, as a structured entity framework, the hierarchical classification of knowledge developed by Al-Farabi in the tenth century. The name SHAMIL™ itself is derived from the Arabic شامل, meaning comprehensive, universal, all-encompassing, and complete — a name that reflects both the scope of the framework and its intellectual origins. The systematic procedural principles of Al-Khwarizmi and the empirical auditability conditions of Ibn al-Haytham underpin both components throughout.

This document contributes to an ongoing and cumulative research programme and is published to support scholarly discussion and infrastructural clarity. The principles described herein are conceptual and architecture-oriented. They do not constitute operational deployment or ratified standards.

Companion Publications

White Paper No. 1: The Admissibility Problem in AI-Mediated Information Systems. Authority, Provenance and Semantic Governance Research Series. Search Sciences™ Research Programme. Younis Group, 2026.

White Paper No. 2: Verification-First Architecture: Designing Pre-Interpretive Constraints for Authoritative Digital Systems. Authority, Provenance and Semantic Governance Research Series. Search Sciences™ Research Programme. Younis Group, 2026.

White Paper No. 3: Semantic Governance in Admissible Knowledge Systems: Deterministic Entity Architecture for AI-Compatible Environments. Authority, Provenance and Semantic Governance Research Series. Search Sciences™ Research Programme. Younis Group, 2026.

Suggested citation:

Younis, M. (2026) ‘A Federated Architecture for Verified and Governed Knowledge Systems: Introducing the Verified Source Protocol and the SHAMIL™ Structured Entity Framework’. White Paper No. 4. Authority, Provenance and Semantic Governance Research Series. Search Sciences™ Research Programme. Younis Group.

Abstract

The preceding papers in this series defined the admissibility problem in AI-mediated systems, articulated the principles of verification-first architecture, and established the necessity of semantic governance within admissible knowledge environments. This paper formally introduces the integrated architectural response: the Verified Source Protocol (VSP) as a cryptographic verification layer, and SHAMIL™ as a deterministic structured entity framework.

VSP instantiates, in computational form, the provenance-chain governance methodology that Imam Al-Bukhari formalised in the ninth century: a claim is not admissible unless its chain of attribution is declared, intact, and independently verifiable. SHAMIL™ instantiates the hierarchical classification of knowledge that Al-Farabi developed in the tenth century: meaning is not admissible unless it is structurally placed, domain-scoped, and governed. The name SHAMIL™ derives from the Arabic شامل, meaning comprehensive, universal, all-encompassing, and complete — a name that carries the scope of the framework and the intellectual tradition from which it emerges.

Together, these two components establish a federated infrastructure for governing authority, provenance, and meaning prior to computational interpretation. The paper outlines their structural design, governance separation principles, and interoperability characteristics, arguing that verified and semantically constrained representations constitute a foundational layer for AI-compatible information systems. The systematic procedural principles of Al-Khwarizmi and the empirical auditability conditions of Ibn al-Haytham are structurally present throughout both components.

1. Introduction

The restoration of legitimacy within digital information systems requires more than incremental refinement of ranking algorithms or model architectures. The preceding papers in this series have established, through successive stages of argument, that the structural failures of AI-mediated information environments are not technical defects. They are governance failures. Authority is inferred rather than declared. Provenance is opaque rather than traceable. Meaning is probabilistically approximated rather than structurally constrained.

The intellectual tradition that most precisely diagnosed these conditions, and developed the methodological responses to them, is the Islamic Golden Age tradition of information science. Imam Al-Bukhari established that a claim must carry a verified chain of attribution before it is admitted as authoritative. Al-Farabi established that knowledge must be hierarchically classified and domain-scoped before its claims can be reliably governed. Al-Khwarizmi established that unknowns must be resolved through defined systematic procedure rather than approximation. Ibn al-Haytham established that the validity of a representation depends on the structural conditions of its production and the auditability of its states over time.

These four scholars, working between the ninth and eleventh centuries, collectively articulated the intellectual architecture that the two frameworks introduced in this paper implement. VSP implements Al-Bukhari’s provenance-chain governance in computational form. SHAMIL™ implements Al-Farabi’s hierarchical classification of knowledge as a structured entity framework. Both are underpinned by Al-Khwarizmi’s procedural determinism and Ibn al-Haytham’s auditability conditions.

VSP is the isnad made computational. SHAMIL™ is Al-Farabi’s classification of the sciences made structural. The architecture is new. The intellectual foundations are over a millennium old.

This paper formally introduces both components, describes their structural design and functional separation, and argues that their integration establishes the foundational layer for verified and semantically governed AI-compatible knowledge systems.

2. The Verified Source Protocol

2.1 Purpose and Intellectual Foundation

The Verified Source Protocol (VSP) defines a set of non-optional constraints governing how authority and provenance are represented within digital systems. Its function is limited and precise: to determine the legitimacy of a representation prior to interpretation.

The intellectual antecedent of VSP is Imam Al-Bukhari’s isnad methodology, formalised in the ninth century as the governing framework for the authentication of transmitted knowledge. The isnad required that every claim entering the authoritative corpus carry a complete, named, and independently verifiable chain of transmission from its origin to its recorder. A claim without an intact isnad was not evaluated on its content — it was not admitted at all. Admissibility preceded evaluation. The structural principle is unchanged in VSP: a digital representation that cannot demonstrate declared, cryptographically bound, and temporally auditable provenance is not eligible for computational processing.

VSP does not rank information. It does not evaluate popularity, determine truth, or influence engagement metrics. It governs admissibility and nothing else. This functional limitation is not a design constraint. It is the condition of its structural neutrality.

VSP and the Isnad: A Structural Parallel

Imam Al-Bukhari (810–870 CE) formalised the isnad as a pre-interpretive governance mechanism. Before a hadith could enter the authoritative corpus, its chain of transmission — naming every transmitter from the original source to the recorder — had to be independently verified. The integrity of the chain was not a quality indicator. It was the condition of admissibility.

The Verified Source Protocol implements the same principle in computational form. A representation must carry a declared entity identifier, a cryptographic signature binding the representation to that entity, and a time-addressable record enabling historical inspection. The chain is digital rather than biographical. The governance logic is identical: provenance is declared, bound, and verifiable before interpretation begins.

Al-Bukhari produced the most rigorous verification system in the pre-modern scholarly world. VSP is its computational heir.

2.2 Core Properties

The protocol establishes the following structural properties for entities asserting authority within a VSP-compliant environment:

• Decentralised identifiers — stable, globally unique identifiers that are not controlled by any single registry or platform.

• Cryptographic key pairs — enabling signed representations that are tamper-detectable and non-repudiable.

• Federated registries — that attest to identity without assuming central ownership or gatekeeping authority.

• Time-addressable representations — enabling historical states to be inspected and silent modification to be detected.

Entities retain cryptographic ownership of their authoritative information. Registries attest. They do not control. This distinction is structurally fundamental: the moment a registry assumes control rather than attestation, it replicates the centralisation failure that VSP is designed to prevent.

Al-Khwarizmi’s methodological principle is operative here. The verification procedure is defined, open, and domain-independent. Any entity, in any jurisdiction, applying the same procedure to the same representation will arrive at the same verification result. The integrity of the system does not depend on the authority of any single institution. It depends on the structural soundness of the procedure.

2.3 Trust Anchors and Federation

Trust anchors establish roots of verification within defined domains or jurisdictions. Federation allows multiple registries to operate concurrently, provides authority portability across systems, and eliminates single-point gatekeeping. Verification is therefore decentralised whilst remaining structurally governed.

Ibn al-Haytham’s auditability conditions apply directly to the federation model. A federated verification system is only as trustworthy as the inspectability of its trust anchors. Each anchor must publish its compliance criteria openly, maintain inspectable attestation records, and be subject to independent audit. A trust anchor that cannot be audited is not a trust anchor. It is an unverified authority claim — precisely the failure condition VSP exists to prevent.

3. The SHAMIL™ Structured Entity Framework

3.1 Name and Intellectual Origin

The name SHAMIL™ derives from the Arabic شامل, meaning comprehensive, universal, all-encompassing, and complete. It is not an acronym. It is a name chosen to reflect both the scope of the framework and the intellectual tradition from which it emerges. A framework designed to provide complete structural governance of knowledge — encompassing identity, classification, attributes, capabilities, relationships, and external references — carries a name from the scholarly tradition that first articulated why such completeness is a governance requirement rather than an aspiration.

Al-Farabi’s Ihṣaʼ al-ʿulūm — the Enumeration of the Sciences — was itself an act of comprehensive structural governance: a systematic attempt to classify all knowledge into hierarchically organised domains, each with defined scope, permissible attributes, and boundaries of valid inference. SHAMIL™ is the contemporary structural implementation of that principle. Its six dimensions operationalise what Al-Farabi’s classification established theoretically: that the organisation of knowledge is a governance act, not a cataloguing exercise.

SHAMIL™ derives from the Arabic for all-encompassing and complete. It carries in its name the intellectual tradition that established why completeness in knowledge organisation is a structural requirement, not a preference.

3.2 Rationale

Verification alone constrains legitimacy. It does not organise meaning. A cryptographically verified representation may still be semantically ambiguous: its type may be undeclared, its attributes ungoverned, its relationships undefined. AI systems operating on admissible but semantically unstructured inputs will resolve these ambiguities probabilistically — which is to say, according to the statistical distributions of their training corpus rather than the structural intentions of the asserting authority.

To prevent semantic drift within admissible systems, a deterministic entity architecture is required. SHAMIL™ provides this architecture. It does not replace verification. It operates alongside it, addressing the layer of structural governance that VSP does not cover.

3.3 The Six Dimensions

SHAMIL™ defines six core representational dimensions. Together they form a minimal but comprehensive schema for governed knowledge graphs. They do not restrict domain diversity. They constrain representational ambiguity.

Subject	The canonical entity — stable, unambiguous, and persistently identifiable across systems and jurisdictions.
Hierarchy	Explicit ontological placement within a defined knowledge domain. Governs permissible attributes and relational scope.
Attribute	Verifiable properties with declared semantic scope, data type constraints, provenance, and temporal versioning.

Method	Structured capabilities or permissible actions, explicitly bounded by ontological type and governing domain.
Interaction	Declared relationships between entities — typed, directionally specified, and scope-limited.
Link	Traceable external references and identifiers connecting the entity to authoritative corroborating registries.

These six dimensions map directly onto the structural requirements that Al-Farabi identified as necessary for unambiguous knowledge organisation. Subject and Hierarchy operationalise his ontological placement requirement. Attribute operationalises his domain-scoped property governance. Method operationalises his requirement that capabilities be defined within domains rather than inferred across them. Interaction operationalises his relational discipline. Link operationalises his requirement for traceable external corroboration.

SHAMIL™ is Al-Farabi’s classification of the sciences expressed as a structured entity schema. The intellectual lineage is direct and deliberate.

3.4 Determinism and Constraint

The framework is domain-agnostic yet structurally constrained. It enables consistent classification across systems, controlled attribute expansion, explicit relational discipline, and actionable semantics within bounded scope. SHAMIL™ does not replace graph technologies. It governs their structure, imposing the conditions under which semantic meaning is defined rather than inferred.

Al-Khwarizmi’s methodological contribution is structurally present throughout. Each dimension of SHAMIL™ is governed by defined rules that produce deterministic outcomes. An attribute is either within declared scope or it is not. A relationship is either of a permissible type or it is not. A capability is either bounded by the entity’s ontological type or it is not. The framework does not approximate. It resolves.

SHAMIL™ does not prescribe a universal ontology or domain vocabulary. It defines the structural conditions under which any domain-specific ontology must operate.

Al-Farabi and the Six Dimensions of SHAMIL™

Al-Farabi (872–950 CE) argued that the classification of the sciences was a structural requirement for the integrity of knowledge. Each domain must have defined scope, permissible properties, and explicit boundaries of valid inference. A claim situated in the wrong domain is not merely misclassified — it is governed by the wrong rules.

The six dimensions of SHAMIL™ implement this principle as a structured entity schema: • Subject — canonical identity, directly corresponding to Al-Farabi’s requirement for unambiguous entity identification.

• Hierarchy — ontological placement, directly corresponding to his domain classification requirement.

• Attribute — governed properties, directly corresponding to his domain-scoped attribute constraints.

• Method — bounded capabilities, directly corresponding to his requirement that actions be defined within domains.

• Interaction — typed relationships, directly corresponding to his relational discipline requirement.

• Link — external corroboration, directly corresponding to his requirement for traceable inter-domain references.

SHAMIL™ is Al-Farabi’s Enumeration of the Sciences operationalised as a contemporary entity framework.

4. Integrated Architecture

4.1 Layered Separation

The architecture operates as two distinct but complementary layers. The verification layer, established through VSP, governs identity, provenance, and temporal integrity. The semantic layer, established through SHAMIL™, governs entity structure and meaning. The two layers are interoperable but not interdependent: VSP defines constraints on representational legitimacy without prescribing semantic structure; SHAMIL™ defines semantic structure without determining authority or verification mechanisms. Each may operate independently within compliant environments.

This functional separation is structurally essential. It preserves verification neutrality — VSP does not favour any particular semantic model. It preserves interpretive independence — SHAMIL™ does not prescribe verification pathways. And it preserves governance clarity — each layer can be specified, audited, and evolved independently of the other.

Al-Farabi’s classification of the sciences was premised on precisely this principle of domain autonomy. The criteria governing one branch of knowledge could not be imported from another without introducing categorical error. The verification layer and the semantic layer are different domains of governance. Their separation is not an architectural preference. It is a structural requirement.

4.2 Pre-Interpretive Governance

When the two layers operate in combination, the full conditions of pre-interpretive governance are satisfied. An entity is cryptographically verified through VSP: its identity is declared, its representation is signed, its provenance is traceable. Its representation is structurally constrained through SHAMIL™: its type is classified, its attributes are governed, its relationships are explicitly defined. Only then is it eligible for interpretive or generative processing.

This sequence — verify, then structure, then interpret — is the computational implementation of the methodological sequence that the Islamic Golden Age scholarly tradition established. Al-Bukhari verified provenance before admitting a claim. Al-Farabi classified knowledge before evaluating its implications. Al-Khwarizmi derived unknowns through defined procedure before acting on the result. Ibn al-Haytham audited observations before drawing conclusions. The sequence is the same. The medium is different.

This architecture does not compete with existing web standards or knowledge graph technologies. It operates as a governance layer above them, defining the conditions under which their outputs can be considered admissible and semantically valid.

Verification without structure leaves ambiguity. Structure without verification leaves vulnerability. Integrated, they provide the conditions for knowledge that can be trusted.

5. AI Compatibility

AI systems operating within this architecture benefit from machine-verifiable admissibility checks, deterministic semantic structures, reduced ambiguity in relational reasoning, and traceable provenance for outputs. The architecture does not eliminate probabilistic reasoning. It constrains the input space within which reasoning occurs.

This distinction is structurally significant. A probabilistic system operating on VSP-verified and SHAMIL™-governed inputs will produce outputs that are bounded by the structural

constraints of both layers. A probabilistic system operating on ungoverned inputs will produce outputs bounded only by the statistical distributions of its training corpus — which may include unverified, semantically invalid, adversarially optimised, or historically erroneous content.

The architecture enhances structural reliability without limiting computational capability. It does not require AI systems to be rebuilt. It requires the information environment in which they operate to be governed. This is the structural intervention that the admissibility problem demands, and which the preceding papers in this series have progressively developed the case for.

6. Governance and Stewardship

To prevent consolidation of authority, governance of the verification protocol must remain institutionally independent from implementation infrastructure. Standards stewardship must define compliance criteria, publish specifications openly, and maintain separation from commercial registries whose revenue models depend on the current ungoverned state of the information environment.

Semantic framework evolution must similarly occur through transparent specification rather than proprietary opacity. Federation requires neutrality. A governance model that is captured by commercial interests is not a governance model. It is an optimisation mechanism for those interests.

The Islamic waqf — the endowment model that sustained the libraries, universities, and hospitals of the Islamic Golden Age — provides the structural precedent. The waqf committed resources to the perpetuation of knowledge for the common good on a non-extractive basis. It did not monetise the knowledge it sustained. It governed the conditions under which knowledge could be produced, preserved, and shared. Independent stewardship of VSP and SHAMIL™ reflects the same structural principle: the infrastructure serves the integrity of the knowledge environment. It does not extract value from it.

7. Applied Implications

The integrated architecture is applicable across any domain in which authoritative information carries material consequence. This includes local business ecosystems, professional credentials, regulated industries, scientific data publication, and enterprise AI knowledge systems. In each case the objective is identical: to ensure that representations are both admissible and semantically governed before computational interpretation.

The implications extend beyond individual domains. At the level of national information infrastructure, the architecture provides the structural basis for a verifiable entity registry in which organisations, professionals, and institutions can assert authority that is declared, cryptographically bound, and independently auditable. At the level of international knowledge exchange, it provides the structural basis for federated interoperability in which semantic portability is assured by shared ontological governance rather than assumed through technical compatibility alone.

In each of these contexts, the architecture does not impose a particular content model or domain vocabulary. It imposes structural governance on whatever content model is adopted. The governance is the contribution. The content remains the domain’s own.

8. Conclusion

This paper has formally introduced two complementary architectural components whose intellectual foundations were laid in the Islamic Golden Age tradition of information science and whose structural design has been developed through the preceding papers in this series.

The Verified Source Protocol implements, in computational form, the provenance-chain governance that Imam Al-Bukhari formalised in the ninth century. A representation is admissible only if its chain of attribution is declared, cryptographically bound, and independently verifiable. The Verified Source Protocol is a computational instantiation of the governance logic underlying the isnad.

SHAMIL™ implements, as a structured entity framework, the hierarchical classification of knowledge that Al-Farabi developed in the tenth century. A representation is semantically governed only if its entity is classified, its attributes are scoped, and its relationships are explicitly typed. SHAMIL™ is the Enumeration of the Sciences made structural. Its name — from the Arabic شامل, meaning comprehensive, universal, all-encompassing, and complete — carries both the scope of the framework and the intellectual tradition from which it emerges.

Both components are underpinned by Al-Khwarizmi’s systematic procedural determinism and Ibn al-Haytham’s empirical auditability conditions. Together, the four scholars whose work governs this architecture were writing between the ninth and eleventh centuries. Their intellectual contributions were not incorporated into the design of the modern web. The compounding structural failures of AI-mediated information systems are, in part, the consequence of that omission.

Verification without structure leaves ambiguity. Structure without verification leaves vulnerability. Integrated through a federated architecture that preserves functional separation and independent governance, VSP and SHAMIL™ provide the foundational layer for knowledge systems in which authority is declared, meaning is governed, and AI-mediated reasoning operates on a structurally sound input space.

Future research will examine implementation pathways, registry compliance models, and sector-specific case studies within this architecture.

References

Al-Bukhari, M. (846 CE) Al-Jāmiʿ al-Ṣaḥīḥ (Ṣaḥīḥ al-Bukhārī). Compiled 846 CE. The foundational collection of authenticated hadith, incorporating the isnad methodology as a formal system of provenance verification and chain-of-transmission governance. The direct intellectual antecedent of VSP.

Al-Farabi, A.N. (c. 952 CE) Ihṣaʼ al-ʿulūm (The Enumeration of the Sciences). Translated by Palencia, A.G. (1953). Madrid. The foundational systematic classification of the sciences establishing hierarchical knowledge organisation and domain-scoped authority as structural governance requirements. The direct intellectual antecedent of SHAMIL™.

Al-Khwarizmi, M. (c. 830 CE) Kitāb al-mukhṣar fī ḥisāb al-jabr waʻl-muqābala (The Compendious Book on Calculation by Completion and Balancing). Translated by Rosen, F. (1831). London: Oriental Translation Fund. The foundational text of systematic algorithmic procedure and the lawful resolution of unknowns through defined structural methods.

Ibn al-Haytham, H. (c. 1011 CE) Kitāb al-Manāẓir (Book of Optics). Latin translation: De Aspectibus (c. 1200). The systematic application of empirical auditability, controlled observation, and repeatable verification as structural conditions of valid knowledge.

Berners-Lee, T., Hendler, J. and Lassila, O. (2001) ‘The Semantic Web’, Scientific American, 284(5), pp. 34–43.

Diffie, W. and Hellman, M. (1976) ‘New directions in cryptography’, IEEE Transactions on Information Theory, 22(6), pp. 644–654.

Floridi, L. (2014) The Fourth Revolution: How the Infosphere is Reshaping Human Reality. Oxford: Oxford University Press.

Gruber, T.R. (1993) ‘A translation approach to portable ontology specifications’, Knowledge Acquisition, 5(2), pp. 199–220.

Lessig, L. (2006) Code: Version 2.0. New York: Basic Books.

Version History

Version 1.0

Initial publication. April 2026. Islamic Golden Age intellectual genealogy formally integrated throughout. VSP established as the computational implementation of Imam Al-Bukhari’s isnad methodology. SHAMIL™ established as the structural implementation of Al-Farabi’s classification of the sciences. Etymology and intellectual origin of the SHAMIL™ name formally declared.

How to Cite the Series

The papers are published as part of an ongoing working paper series. Individual papers should be cited using their respective titles and publication details.

Example citation:

Younis, M. (2026) ‘Verification-First Architecture: Designing Pre-Interpretive Constraints for Authoritative Digital Systems’. White Paper No. 2. Authority, Provenance and Semantic Governance Research Series. Search Sciences™ Research Programme. Younis Group.

Closing Note

This series is published to contribute to scholarly discussion on authority, provenance and governance in digital systems and is intended as an evolving research record.

The Authority, Provenance and Semantic Governance Research SeriesWhite Paper No. 4