500 0

--- Chunk 1 ---
Metadata: {"id": "74","index": "0"}
<!DOCTYPE html>
<html>
<head>
    <title>HTML with Mixed Tags</title>
</head>
<body>
    <h1>Document with Mixed Tags</h1>

    <div class="intro">
        <p>This document contains a mixture of header tags, div tags, and other HTML elements to test chunker behavior with complex structures.</p>
    </div>

    <h2>Section with Headers</h2>

--- Chunk 2 ---
Metadata: {"header1": "Document with Mixed Tags","header2": "Section with Headers","id": "77","index": "1"}

    <p>Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.</p>

    <div class="content-block">
        <h3>Nested Header in Div</h3>

--- Chunk 3 ---
Metadata: {"header1": "Document with Mixed Tags","header2": "Section with Headers","header3": "Nested Header in Div","id": "80","index": "2"}

        <p>Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.</p>

        <div class="sub-block">
            <p>Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.</p>
            <span>This is a span element with some text content that should be included in chunking.</span>
        </div>

        <section class="article-section">
            <h4>Header in Section Tag</h4>

--- Chunk 4 ---
Metadata: {"header1": "Document with Mixed Tags","id": "87","index": "3"}

            <p>Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.</p>
        </section>
    </div>

    <h2>Another Top-Level Section</h2>

    <article>
        <header>
            <p>This is inside an article header tag, not an h1-h6 header.</p>
        </header>

        <main>
            <p>Main content inside article: Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium.</p>

--- Chunk 5 ---
Metadata: {"header": "Another Top-Level Section","header1": "Document with Mixed Tags","header2": "Another Top-Level Section","id": "92","index": "4"}

        </main>

        <aside>
            <p>Aside content: Totam rem aperiam, eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo.</p>
        </aside>
    </article>

    <footer>
        <div class="footer-content">
            <p>Footer with div: Nemo enim ipsam voluptatem quia voluptas sit aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos qui ratione voluptatem sequi nesciunt.</p>

--- Chunk 6 ---
Metadata: {"header": "Another Top-Level Section","header1": "Document with Mixed Tags","header2": "Another Top-Level Section","id": "94","index": "5"}

        </div>
    </footer>
</body>
</html>
