section

Docs to Corpus Pipeline

Nexus Docs (CopyParty) - File Upload & Pipeline

Overview

CopyParty provides the Nexus Docs web interface and file storage system. It serves as the entry point for user file uploads that can then be processed by AI and stored in Corpus.


1. File Upload Entry Points

Web UI

  • URL: docs.corlera.com
  • Authentication: Username/password
  • Features:
  • Drag-and-drop upload
  • Folder creation
  • File sharing with expiring links
  • Media previews (images, video, audio)

MCP Tools

  • docs.upload: Upload files from local paths or TEMP staging
  • docs.create_folder: Create directories
  • docs.move: Organize files between folders

2. Storage Locations

Visibility Path Access
Private /data/cdn/users/{user}/documents/ User only
Public /data/cdn/users/{user}/public/ Anyone with link
Shared /data/cdn/users/{user}/shared/ Configurable

Base Path: /data/cdn/users/


3. AI Processing Pipeline

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                    DOCS β†’ CORPUS PIPELINE                       β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                                 β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”‚
β”‚  β”‚   UPLOAD     β”‚    β”‚   PROCESS    β”‚    β”‚    STORE     β”‚      β”‚
β”‚  β”‚              β”‚    β”‚              β”‚    β”‚              β”‚      β”‚
β”‚  β”‚ docs.corlera β”‚ β†’  β”‚  LLMSherpa   β”‚ β†’  β”‚   Corpus     β”‚      β”‚
β”‚  β”‚   .com       β”‚    β”‚  (Port 6650) β”‚    β”‚   Server     β”‚      β”‚
β”‚  β”‚              β”‚    β”‚              β”‚    β”‚              β”‚      β”‚
β”‚  β”‚ /data/cdn/   β”‚    β”‚  Extract:    β”‚    β”‚  Quadfecta:  β”‚      β”‚
β”‚  β”‚ users/{user} β”‚    β”‚  - Text      β”‚    β”‚  - FTS       β”‚      β”‚
β”‚  β”‚              β”‚    β”‚  - Structure β”‚    β”‚  - Vector    β”‚      β”‚
β”‚  β”‚              β”‚    β”‚  - Tables    β”‚    β”‚  - Graph     β”‚      β”‚
β”‚  β”‚              β”‚    β”‚  - Images    β”‚    β”‚  - Relationalβ”‚      β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β”‚
β”‚                                                                 β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Pipeline Steps

  1. Monitor: Watch /data/cdn/users/ for new uploads
  2. Extract: Send to LLMSherpa (port 6650) for intelligent parsing
  3. Process: LLMSherpa extracts text, tables, structure
  4. Ingest: Store extracted content in Corpus with metadata
  5. Index: Quadfecta indexes for multi-modal search

4. URL Mapping

Web URL Local Path
docs.corlera.com/home/chris/ /data/cdn/users/chris/
docs.corlera.com/home/chris/documents/ /data/cdn/users/chris/documents/
docs.corlera.com/home/chris/public/ /data/cdn/users/chris/public/
docs.corlera.com/shr/{share_id} /data/cdn/shr/{share_id}

5. Infrastructure

Component Details
Container nexus-cdn
Image copyparty/ac
Port 3923
Config /data/cdn/system/config/copyparty.conf
Domain docs.corlera.com

6. Integration Points

  • TEMP Server: Upload from staging via temp_id parameter
  • Corpus Server: Destination for extracted content
  • LLMSherpa: Document extraction service (port 6650)
  • Search Server: Query indexed content via Quadfecta
ID: 1678bf21
Path: Nexus 3.0 Complete Environment Reference > Archive - Deprecated Systems > CopyParty Web UI (docs.corlera.com) > Docs to Corpus Pipeline
Updated: 2026-01-07T10:25:19