tosin2013/mcp-codebase-insight # codebase.md

This is page 1 of 8. Use http://codebase.md/tosin2013/mcp-codebase-insight?lines=true&page={x} to view the full context.

# Directory Structure

```
├── .bumpversion.cfg
├── .codecov.yml
├── .compile-venv-py3.11
│   ├── bin
│   │   ├── activate
│   │   ├── activate.csh
│   │   ├── activate.fish
│   │   ├── Activate.ps1
│   │   ├── coverage
│   │   ├── coverage-3.11
│   │   ├── coverage3
│   │   ├── pip
│   │   ├── pip-compile
│   │   ├── pip-sync
│   │   ├── pip3
│   │   ├── pip3.11
│   │   ├── py.test
│   │   ├── pyproject-build
│   │   ├── pytest
│   │   ├── python
│   │   ├── python3
│   │   ├── python3.11
│   │   └── wheel
│   └── pyvenv.cfg
├── .env.example
├── .github
│   ├── agents
│   │   ├── DebugAgent.agent.md
│   │   ├── DocAgent.agent.md
│   │   ├── README.md
│   │   ├── TestAgent.agent.md
│   │   └── VectorStoreAgent.agent.md
│   ├── copilot-instructions.md
│   └── workflows
│       ├── build-verification.yml
│       ├── publish.yml
│       └── tdd-verification.yml
├── .gitignore
├── async_fixture_wrapper.py
├── CHANGELOG.md
├── CLAUDE.md
├── codebase_structure.txt
├── component_test_runner.py
├── CONTRIBUTING.md
├── core_workflows.txt
├── create_release_issues.sh
├── debug_tests.md
├── Dockerfile
├── docs
│   ├── adrs
│   │   └── 001_use_docker_for_qdrant.md
│   ├── api.md
│   ├── components
│   │   └── README.md
│   ├── cookbook.md
│   ├── development
│   │   ├── CODE_OF_CONDUCT.md
│   │   ├── CONTRIBUTING.md
│   │   └── README.md
│   ├── documentation_map.md
│   ├── documentation_summary.md
│   ├── features
│   │   ├── adr-management.md
│   │   ├── code-analysis.md
│   │   └── documentation.md
│   ├── getting-started
│   │   ├── configuration.md
│   │   ├── docker-setup.md
│   │   ├── installation.md
│   │   ├── qdrant_setup.md
│   │   └── quickstart.md
│   ├── qdrant_setup.md
│   ├── README.md
│   ├── SSE_INTEGRATION.md
│   ├── system_architecture
│   │   └── README.md
│   ├── templates
│   │   └── adr.md
│   ├── testing_guide.md
│   ├── troubleshooting
│   │   ├── common-issues.md
│   │   └── faq.md
│   ├── vector_store_best_practices.md
│   └── workflows
│       └── README.md
├── error_logs.txt
├── examples
│   └── use_with_claude.py
├── github-actions-documentation.md
├── Makefile
├── module_summaries
│   ├── backend_summary.txt
│   ├── database_summary.txt
│   └── frontend_summary.txt
├── output.txt
├── package-lock.json
├── package.json
├── PLAN.md
├── prepare_codebase.sh
├── PULL_REQUEST.md
├── pyproject.toml
├── pytest.ini
├── README.md
├── requirements-3.11.txt
├── requirements-3.11.txt.backup
├── requirements-dev.txt
├── requirements.in
├── requirements.txt
├── run_build_verification.sh
├── run_fixed_tests.sh
├── run_test_with_path_fix.sh
├── run_tests.py
├── scripts
│   ├── check_qdrant_health.sh
│   ├── compile_requirements.sh
│   ├── load_example_patterns.py
│   ├── macos_install.sh
│   ├── README.md
│   ├── setup_qdrant.sh
│   ├── start_mcp_server.sh
│   ├── store_code_relationships.py
│   ├── store_report_in_mcp.py
│   ├── validate_knowledge_base.py
│   ├── validate_poc.py
│   ├── validate_vector_store.py
│   └── verify_build.py
├── server.py
├── setup_qdrant_collection.py
├── setup.py
├── src
│   └── mcp_codebase_insight
│       ├── __init__.py
│       ├── __main__.py
│       ├── asgi.py
│       ├── core
│       │   ├── __init__.py
│       │   ├── adr.py
│       │   ├── cache.py
│       │   ├── component_status.py
│       │   ├── config.py
│       │   ├── debug.py
│       │   ├── di.py
│       │   ├── documentation.py
│       │   ├── embeddings.py
│       │   ├── errors.py
│       │   ├── health.py
│       │   ├── knowledge.py
│       │   ├── metrics.py
│       │   ├── prompts.py
│       │   ├── sse.py
│       │   ├── state.py
│       │   ├── task_tracker.py
│       │   ├── tasks.py
│       │   └── vector_store.py
│       ├── models.py
│       ├── server_test_isolation.py
│       ├── server.py
│       ├── utils
│       │   ├── __init__.py
│       │   └── logger.py
│       └── version.py
├── start-mcpserver.sh
├── summary_document.txt
├── system-architecture.md
├── system-card.yml
├── test_fix_helper.py
├── test_fixes.md
├── test_function.txt
├── test_imports.py
├── tests
│   ├── components
│   │   ├── conftest.py
│   │   ├── test_core_components.py
│   │   ├── test_embeddings.py
│   │   ├── test_knowledge_base.py
│   │   ├── test_sse_components.py
│   │   ├── test_stdio_components.py
│   │   ├── test_task_manager.py
│   │   └── test_vector_store.py
│   ├── config
│   │   └── test_config_and_env.py
│   ├── conftest.py
│   ├── integration
│   │   ├── fixed_test2.py
│   │   ├── test_api_endpoints.py
│   │   ├── test_api_endpoints.py-e
│   │   ├── test_communication_integration.py
│   │   └── test_server.py
│   ├── README.md
│   ├── README.test.md
│   ├── test_build_verifier.py
│   └── test_file_relationships.py
└── trajectories
    └── tosinakinosho
        ├── anthropic_filemap__claude-3-sonnet-20240229__t-0.00__p-1.00__c-3.00___db62b9
        │   └── db62b9
        │       └── config.yaml
        ├── default__claude-3-5-sonnet-20240620__t-0.00__p-1.00__c-3.00___03565e
        │   └── 03565e
        │       ├── 03565e.traj
        │       └── config.yaml
        └── default__openrouter
            └── anthropic
                └── claude-3.5-sonnet-20240620:beta__t-0.00__p-1.00__c-3.00___03565e
                    └── 03565e
                        ├── 03565e.pred
                        ├── 03565e.traj
                        └── config.yaml
```

# Files

--------------------------------------------------------------------------------
/.codecov.yml:
--------------------------------------------------------------------------------

```yaml
 1 | codecov:
 2 |   require_ci_to_pass: yes
 3 |   notify:
 4 |     wait_for_ci: yes
 5 | 
 6 | coverage:
 7 |   precision: 2
 8 |   round: down
 9 |   range: "70...100"
10 |   status:
11 |     project:
12 |       default:
13 |         target: 80%
14 |         threshold: 2%
15 |         base: auto
16 |         if_ci_failed: error
17 |         informational: false
18 |         only_pulls: false
19 |     patch:
20 |       default:
21 |         target: 80%
22 |         threshold: 2%
23 |         base: auto
24 |         if_ci_failed: error
25 |         informational: false
26 |         only_pulls: false
27 | 
28 | parsers:
29 |   gcov:
30 |     branch_detection:
31 |       conditional: yes
32 |       loop: yes
33 |       method: no
34 |       macro: no
35 | 
36 | comment:
37 |   layout: "reach,diff,flags,files,footer"
38 |   behavior: default
39 |   require_changes: false
40 |   require_base: no
41 |   require_head: yes
42 |   branches:
43 |     - main
44 | 
45 | ignore:
46 |   - "tests/**/*"
47 |   - "setup.py"
48 |   - "docs/**/*"
49 |   - "examples/**/*"
50 |   - "scripts/**/*"
51 |   - "**/version.py"
52 |   - "**/__init__.py"
53 | 
```

--------------------------------------------------------------------------------
/.bumpversion.cfg:
--------------------------------------------------------------------------------

```
 1 | [bumpversion]
 2 | current_version = 0.1.0
 3 | commit = True
 4 | tag = True
 5 | parse = (?P<major>\d+)\.(?P<minor>\d+)\.(?P<patch>\d+)((?P<release>[a-z]+)(?P<build>\d+))?
 6 | serialize = 
 7 | 	{major}.{minor}.{patch}{release}{build}
 8 | 	{major}.{minor}.{patch}
 9 | 
10 | [bumpversion:part:release]
11 | optional_value = prod
12 | first_value = dev
13 | values = 
14 | 	dev
15 | 	prod
16 | 
17 | [bumpversion:part:build]
18 | first_value = 1
19 | 
20 | [bumpversion:file:pyproject.toml]
21 | search = version = "{current_version}"
22 | replace = version = "{new_version}"
23 | 
24 | [bumpversion:file:src/mcp_codebase_insight/version.py]
25 | search = __version__ = "{current_version}"
26 | replace = __version__ = "{new_version}"
27 | 
28 | [bumpversion:file:src/mcp_codebase_insight/version.py]
29 | search = VERSION_MAJOR = {current_version.split(".")[0]}
30 | replace = VERSION_MAJOR = {new_version.split(".")[0]}
31 | 
32 | [bumpversion:file:src/mcp_codebase_insight/version.py]
33 | search = VERSION_MINOR = {current_version.split(".")[1]}
34 | replace = VERSION_MINOR = {new_version.split(".")[1]}
35 | 
36 | [bumpversion:file:src/mcp_codebase_insight/version.py]
37 | search = VERSION_PATCH = {current_version.split(".")[2]}
38 | replace = VERSION_PATCH = {new_version.split(".")[2]}
39 | 
```

--------------------------------------------------------------------------------
/.env.example:
--------------------------------------------------------------------------------

```
 1 | # Server configuration
 2 | MCP_HOST=127.0.0.1
 3 | MCP_PORT=3000
 4 | MCP_LOG_LEVEL=INFO
 5 | MCP_DEBUG=false
 6 | 
 7 | # Qdrant configuration
 8 | QDRANT_URL=http://localhost:6333
 9 | QDRANT_API_KEY=your-qdrant-api-key-here
10 | 
11 | # Directory configuration
12 | MCP_DOCS_CACHE_DIR=docs
13 | MCP_ADR_DIR=docs/adrs
14 | MCP_KB_STORAGE_DIR=knowledge
15 | MCP_DISK_CACHE_DIR=cache
16 | 
17 | # Model configuration
18 | MCP_EMBEDDING_MODEL=all-MiniLM-L6-v2
19 | MCP_COLLECTION_NAME=codebase_patterns
20 | 
21 | # Feature flags
22 | MCP_METRICS_ENABLED=true
23 | MCP_CACHE_ENABLED=true
24 | MCP_MEMORY_CACHE_SIZE=1000
25 | 
26 | # Optional: Authentication (if needed)
27 | # MCP_AUTH_ENABLED=false
28 | # MCP_AUTH_SECRET_KEY=your-secret-key
29 | # MCP_AUTH_TOKEN_EXPIRY=3600
30 | 
31 | # Optional: Rate limiting (if needed)
32 | # MCP_RATE_LIMIT_ENABLED=false
33 | # MCP_RATE_LIMIT_REQUESTS=100
34 | # MCP_RATE_LIMIT_WINDOW=60
35 | 
36 | # Optional: SSL/TLS configuration (if needed)
37 | # MCP_SSL_ENABLED=false
38 | # MCP_SSL_CERT_FILE=path/to/cert.pem
39 | # MCP_SSL_KEY_FILE=path/to/key.pem
40 | 
41 | # Optional: Proxy configuration (if needed)
42 | # MCP_PROXY_URL=http://proxy:8080
43 | # MCP_NO_PROXY=localhost,127.0.0.1
44 | 
45 | # Optional: External services (if needed)
46 | # MCP_GITHUB_TOKEN=your-github-token
47 | # MCP_JIRA_URL=https://your-jira-instance
48 | # MCP_JIRA_TOKEN=your-jira-token
49 | 
50 | # Optional: Monitoring (if needed)
51 | # MCP_SENTRY_DSN=your-sentry-dsn
52 | # MCP_DATADOG_API_KEY=your-datadog-api-key
53 | # MCP_PROMETHEUS_ENABLED=false
54 | 
55 | # Test Configuration
56 | # These variables are used when running tests
57 | MCP_TEST_MODE=1
58 | MCP_TEST_QDRANT_URL=http://localhost:6333
59 | MCP_TEST_COLLECTION_NAME=test_collection
60 | MCP_TEST_EMBEDDING_MODEL=all-MiniLM-L6-v2
61 | 
62 | # Event Loop Debug Mode
63 | # Uncomment to enable asyncio debug mode for testing
64 | # PYTHONASYNCIODEBUG=1
65 | 
```

--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------

```
  1 | # Python
  2 | __pycache__/
  3 | *.py[cod]
  4 | *$py.class
  5 | *.so
  6 | .Python
  7 | build/
  8 | develop-eggs/
  9 | dist/
 10 | downloads/
 11 | eggs/
 12 | .eggs/
 13 | lib/
 14 | lib64/
 15 | parts/
 16 | sdist/
 17 | var/
 18 | wheels/
 19 | *.egg-info/
 20 | .installed.cfg
 21 | *.egg
 22 | MANIFEST
 23 | 
 24 | # Virtual Environment
 25 | .env
 26 | .venv
 27 | env/
 28 | venv/
 29 | ENV/
 30 | env.bak/
 31 | venv.bak/
 32 | 
 33 | # IDE
 34 | .idea/
 35 | .vscode/
 36 | *.swp
 37 | *.swo
 38 | *~
 39 | .project
 40 | .pydevproject
 41 | .settings/
 42 | 
 43 | # Testing
 44 | .tox/
 45 | .coverage
 46 | .coverage.*
 47 | .cache
 48 | nosetests.xml
 49 | coverage.xml
 50 | *.cover
 51 | .hypothesis/
 52 | .pytest_cache/
 53 | htmlcov/
 54 | 
 55 | # Documentation
 56 | docs/_build/
 57 | docs/api/
 58 | 
 59 | # Project specific
 60 | docs/adrs/*
 61 | !docs/adrs/001_use_docker_for_qdrant.md
 62 | !docs/adrs/README.md
 63 | knowledge/*
 64 | !knowledge/README.md
 65 | cache/*
 66 | !cache/README.md
 67 | logs/*
 68 | !logs/README.md
 69 | .test_cache/
 70 | test_knowledge/
 71 | build_output.txt
 72 | testreport.txt
 73 | test_env/
 74 | codebase_stats.txt
 75 | dependency_map.txt
 76 | vector_relationship_graph.*
 77 | verification-config.json
 78 | *.dot
 79 | *.json.tmp
 80 | 
 81 | # Jupyter Notebook
 82 | .ipynb_checkpoints
 83 | 
 84 | # Distribution / packaging
 85 | .Python
 86 | env/
 87 | build/
 88 | develop-eggs/
 89 | dist/
 90 | downloads/
 91 | eggs/
 92 | .eggs/
 93 | lib/
 94 | lib64/
 95 | parts/
 96 | sdist/
 97 | var/
 98 | wheels/
 99 | *.egg-info/
100 | .installed.cfg
101 | *.egg
102 | 
103 | # Installer logs
104 | pip-log.txt
105 | pip-delete-this-directory.txt
106 | 
107 | # Unit test / coverage reports
108 | htmlcov/
109 | .tox/
110 | .coverage
111 | .coverage.*
112 | .cache
113 | nosetests.xml
114 | coverage.xml
115 | *.cover
116 | .hypothesis/
117 | .pytest_cache/
118 | 
119 | # Translations
120 | *.mo
121 | *.pot
122 | 
123 | # Django stuff:
124 | *.log
125 | local_settings.py
126 | db.sqlite3
127 | db.sqlite3-journal
128 | 
129 | # Flask stuff:
130 | instance/
131 | .webassets-cache
132 | 
133 | # Scrapy stuff:
134 | .scrapy
135 | 
136 | # Sphinx documentation
137 | docs/_build/
138 | 
139 | # PyBuilder
140 | target/
141 | 
142 | # Jupyter Notebook
143 | .ipynb_checkpoints
144 | 
145 | # pyenv
146 | .python-version
147 | 
148 | # celery beat schedule file
149 | celerybeat-schedule
150 | 
151 | # SageMath parsed files
152 | *.sage.py
153 | 
154 | # Environments
155 | .env
156 | .venv
157 | env/
158 | venv/
159 | ENV/
160 | env.bak/
161 | venv.bak/
162 | 
163 | # Spyder project settings
164 | .spyderproject
165 | .spyproject
166 | 
167 | # Rope project settings
168 | .ropeproject
169 | 
170 | # mkdocs documentation
171 | /site
172 | 
173 | # mypy
174 | .mypy_cache/
175 | .dmypy.json
176 | dmypy.json
177 | 
178 | # Pyre type checker
179 | .pyre/
180 | 
181 | # pytype static type analyzer
182 | .pytype/
183 | 
184 | # Cython debug symbols
185 | cython_debug/
186 | 
187 | # macOS
188 | .DS_Store
189 | .AppleDouble
190 | .LSOverride
191 | Icon
192 | ._*
193 | .DocumentRevisions-V100
194 | .fseventsd
195 | .Spotlight-V100
196 | .TemporaryItems
197 | .Trashes
198 | .VolumeIcon.icns
199 | .com.apple.timemachine.donotpresent
200 | 
201 | # Windows
202 | Thumbs.db
203 | ehthumbs.db
204 | Desktop.ini
205 | $RECYCLE.BIN/
206 | *.cab
207 | *.msi
208 | *.msm
209 | *.msp
210 | *.lnk
211 | 
212 | # Linux
213 | *~
214 | .fuse_hidden*
215 | .directory
216 | .Trash-*
217 | .nfs*
218 | 
219 | # Project specific
220 | .env
221 | .env.*
222 | !.env.example
223 | *.log
224 | logs/
225 | cache/
226 | knowledge/
227 | docs/adrs/*
228 | !docs/adrs/001_use_docker_for_qdrant.md
229 | 
230 | # Documentation and ADRs (temporary private)
231 | docs/adrs/
232 | docs/private/
233 | docs/internal/
234 | 
235 | # Cache and Temporary Files
236 | cache/
237 | .cache/
238 | tmp/
239 | temp/
240 | *.tmp
241 | *.bak
242 | *.log
243 | 
244 | # Sensitive Configuration
245 | .env*
246 | !.env.example
247 | *.key
248 | *.pem
249 | *.crt
250 | secrets/
251 | private/
252 | 
253 | # Vector Database
254 | qdrant_storage/
255 | 
256 | # Knowledge Base (private for now)
257 | knowledge/patterns/
258 | knowledge/tasks/
259 | knowledge/private/
260 | 
261 | # Build and Distribution
262 | dist/
263 | build/
264 | *.pyc
265 | *.pyo
266 | *.pyd
267 | .Python
268 | *.so
269 | 
270 | # Misc
271 | .DS_Store
272 | Thumbs.db
273 | *.swp
274 | *.swo
275 | *~
276 | 
277 | # Project Specific
278 | mcp.json
279 | .cursor/rules/
280 | module_summaries/
281 | logs/
282 | references/private/
283 | prompts/
284 | 
285 | # Ignore Qdrant data storage directory
286 | qdrant_data/
287 | .aider*
288 | 
```

--------------------------------------------------------------------------------
/tests/README.test.md:
--------------------------------------------------------------------------------

```markdown
 1 | import pytest
 2 | from pathlib import Path
 3 | 
 4 | @pytest.fixture
 5 | def readme_content():
 6 |     readme_path = Path(__file__).parent / "README.md"
 7 |     with open(readme_path, "r") as f:
 8 |         return f.read()
 9 | 
10 | 
```

--------------------------------------------------------------------------------
/docs/components/README.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Core Components
 2 | 
 3 | > 🚧 **Documentation In Progress**
 4 | > 
 5 | > This documentation is being actively developed. More details will be added soon.
 6 | 
 7 | ## Overview
 8 | 
 9 | This document details the core components of the MCP Codebase Insight system. For workflow information, please see the [Workflows Documentation](../workflows/README.md).
10 | 
11 | ## Components
12 | 
13 | ### Server Framework
14 | - API endpoint management
15 | - Request validation
16 | - Response formatting
17 | - Server lifecycle management
18 | 
19 | ### Testing Framework
20 | - Test environment management
21 | - Component-level testing
22 | - Integration test support
23 | - Performance testing tools
24 | 
25 | ### Documentation Tools
26 | - Documentation generation
27 | - Relationship analysis
28 | - Validation tools
29 | - Integration with code analysis
30 | 
31 | ## Implementation Details
32 | 
33 | See the [System Architecture](../system_architecture/README.md) for more details on how these components interact 
```

--------------------------------------------------------------------------------
/scripts/README.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Utility Scripts
 2 | 
 3 | This directory contains utility scripts for the MCP Codebase Insight project.
 4 | 
 5 | ## Available Scripts
 6 | 
 7 | ### check_qdrant_health.sh
 8 | 
 9 | **Purpose**: Checks if the Qdrant vector database service is available and healthy.
10 | 
11 | **Usage**:
12 | ```bash
13 | ./check_qdrant_health.sh [qdrant_url] [max_retries] [sleep_seconds]
14 | ```
15 | 
16 | **Parameters**:
17 | - `qdrant_url` - URL of the Qdrant service (default: "http://localhost:6333")
18 | - `max_retries` - Maximum number of retry attempts (default: 20)
19 | - `sleep_seconds` - Seconds to wait between retries (default: 5)
20 | 
21 | **Example**:
22 | ```bash
23 | ./check_qdrant_health.sh "http://localhost:6333" 30 2
24 | ```
25 | 
26 | > Note: This script uses `apt-get` and may require `sudo` privileges on Linux systems. Ensure `curl` and `jq` are pre-installed or run with proper permissions.
27 | 
28 | **Exit Codes**:
29 | - 0: Qdrant service is accessible and healthy
30 | - 1: Qdrant service is not accessible or not healthy
31 | 
32 | ### compile_requirements.sh
33 | 
34 | **Purpose**: Compiles and generates version-specific requirements files for different Python versions.
35 | 
36 | **Usage**:
37 | ```bash
38 | ./compile_requirements.sh <python-version>
39 | ```
40 | 
41 | **Example**:
42 | ```bash
43 | ./compile_requirements.sh 3.11
44 | ```
45 | 
46 | ### load_example_patterns.py
47 | 
48 | **Purpose**: Loads example patterns and ADRs into the knowledge base for demonstration or testing.
49 | 
50 | **Usage**:
51 | ```bash
52 | python load_example_patterns.py [--help]
53 | ```
54 | 
55 | ### verify_build.py
56 | 
57 | **Purpose**: Verifies the build status and generates a build verification report.
58 | 
59 | **Usage**:
60 | ```bash
61 | python verify_build.py [--config <file>] [--output <report-file>]
62 | ```
63 | 
64 | ## Usage in GitHub Actions
65 | 
66 | These scripts are used in our GitHub Actions workflows to automate and standardize common tasks. For example, `check_qdrant_health.sh` is used in both the build verification and TDD verification workflows to ensure the Qdrant service is available before running tests.
67 | 
68 | ## Adding New Scripts
69 | 
70 | When adding new scripts to this directory:
71 | 
72 | 1. Make them executable: `chmod +x scripts/your_script.sh`
73 | 2. Include a header comment explaining the purpose and usage
74 | 3. Add error handling and sensible defaults
75 | 4. Update this README with information about the script
76 | 5. Use parameter validation and help text when appropriate 
```

--------------------------------------------------------------------------------
/docs/development/README.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Development Guide
  2 | 
  3 | > 🚧 **Documentation In Progress**
  4 | > 
  5 | > This documentation is being actively developed. More details will be added soon.
  6 | 
  7 | ## Overview
  8 | 
  9 | This guide covers development setup, contribution guidelines, and best practices for the MCP Codebase Insight project.
 10 | 
 11 | ## Development Setup
 12 | 
 13 | 1. **Clone Repository**
 14 |    ```bash
 15 |    git clone https://github.com/modelcontextprotocol/mcp-codebase-insight
 16 |    cd mcp-codebase-insight
 17 |    ```
 18 | 
 19 | 2. **Create Virtual Environment**
 20 |    ```bash
 21 |    python -m venv venv
 22 |    source venv/bin/activate  # On Windows: venv\Scripts\activate
 23 |    ```
 24 | 
 25 | 3. **Install Development Dependencies**
 26 |    ```bash
 27 |    pip install -e ".[dev]"
 28 |    ```
 29 | 
 30 | 4. **Setup Pre-commit Hooks**
 31 |    ```bash
 32 |    pre-commit install
 33 |    ```
 34 | 
 35 | ## Project Structure
 36 | 
 37 | ```
 38 | mcp-codebase-insight/
 39 | ├── src/
 40 | │   └── mcp_codebase_insight/
 41 | │       ├── analysis/       # Code analysis modules
 42 | │       ├── documentation/  # Documentation management
 43 | │       ├── kb/            # Knowledge base operations
 44 | │       └── server/        # FastAPI server
 45 | ├── tests/
 46 | │   ├── integration/       # Integration tests
 47 | │   └── unit/             # Unit tests
 48 | ├── docs/                 # Documentation
 49 | └── examples/            # Example usage
 50 | ```
 51 | 
 52 | ## Testing
 53 | 
 54 | ```bash
 55 | # Run unit tests
 56 | pytest tests/unit
 57 | 
 58 | # Run integration tests
 59 | pytest tests/integration
 60 | 
 61 | # Run with coverage
 62 | pytest --cov=src tests/
 63 | ```
 64 | 
 65 | ## Code Style
 66 | 
 67 | - Follow PEP 8
 68 | - Use type hints
 69 | - Document functions and classes
 70 | - Keep functions focused and small
 71 | - Write tests for new features
 72 | 
 73 | ## Git Workflow
 74 | 
 75 | 1. Create feature branch
 76 | 2. Make changes
 77 | 3. Run tests
 78 | 4. Submit pull request
 79 | 
 80 | ## Documentation
 81 | 
 82 | - Update docs for new features
 83 | - Include docstrings
 84 | - Add examples when relevant
 85 | 
 86 | ## Debugging
 87 | 
 88 | ### Server Debugging
 89 | ```python
 90 | import debugpy
 91 | 
 92 | debugpy.listen(("0.0.0.0", 5678))
 93 | debugpy.wait_for_client()
 94 | ```
 95 | 
 96 | ### VSCode Launch Configuration
 97 | ```json
 98 | {
 99 |   "version": "0.2.0",
100 |   "configurations": [
101 |     {
102 |       "name": "Python: Remote Attach",
103 |       "type": "python",
104 |       "request": "attach",
105 |       "port": 5678,
106 |       "host": "localhost"
107 |     }
108 |   ]
109 | }
110 | ```
111 | 
112 | ## Performance Profiling
113 | 
114 | ```bash
115 | python -m cProfile -o profile.stats your_script.py
116 | python -m snakeviz profile.stats
117 | ```
118 | 
119 | ## Next Steps
120 | 
121 | - [Contributing Guidelines](CONTRIBUTING.md)
122 | - [Code of Conduct](CODE_OF_CONDUCT.md)
123 | - [API Reference](../api/rest-api.md) 
```

--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------

```markdown
  1 | # MCP Codebase Insight - WIP
  2 | 
  3 | > 🚧 **Development in Progress** 
  4 | > 
  5 | > This project is actively under development. Features and documentation are being continuously updated.
  6 | 
  7 | ## Overview
  8 | 
  9 | MCP Codebase Insight is a system for analyzing and understanding codebases through semantic analysis, pattern detection, and documentation management.
 10 | 
 11 | ## Current Development Status
 12 | 
 13 | ### Completed Features
 14 | - ✅ Core Vector Store System
 15 | - ✅ Basic Knowledge Base
 16 | - ✅ SSE Integration
 17 | - ✅ Testing Framework
 18 | - ✅ TDD and Debugging Framework (rules_template integration)
 19 | 
 20 | ### In Progress
 21 | - 🔄 Documentation Management System
 22 | - 🔄 Advanced Pattern Detection
 23 | - 🔄 Performance Optimization
 24 | - 🔄 Integration Testing
 25 | - 🔄 Debugging Utilities Enhancement
 26 | 
 27 | ### Planned
 28 | - 📋 Extended API Documentation
 29 | - 📋 Custom Pattern Plugins
 30 | - 📋 Advanced Caching Strategies
 31 | - 📋 Deployment Guides
 32 | - 📋 Comprehensive Error Tracking System
 33 | 
 34 | ## Quick Start
 35 | 
 36 | 1. **Installation**
 37 |    ```bash
 38 |    pip install mcp-codebase-insight
 39 |    ```
 40 | 
 41 | 2. **Basic Usage**
 42 |    ```python
 43 |    from mcp_codebase_insight import CodebaseAnalyzer
 44 |    
 45 |    analyzer = CodebaseAnalyzer()
 46 |    results = analyzer.analyze_code("path/to/code")
 47 |    ```
 48 | 
 49 | 3. **Running Tests**
 50 |    ```bash
 51 |    # Run all tests
 52 |    pytest tests/
 53 |    
 54 |    # Run unit tests
 55 |    pytest tests/unit/
 56 |    
 57 |    # Run component tests
 58 |    pytest tests/components/
 59 |    
 60 |    # Run tests with coverage
 61 |    pytest tests/ --cov=src --cov-report=term-missing
 62 |    ```
 63 | 
 64 | 4. **Debugging Utilities**
 65 |    ```python
 66 |    from mcp_codebase_insight.utils.debug_utils import debug_trace, DebugContext, get_error_tracker
 67 |    
 68 |    # Use debug trace decorator
 69 |    @debug_trace
 70 |    def my_function():
 71 |        # Implementation
 72 |    
 73 |    # Use debug context
 74 |    with DebugContext("operation_name"):
 75 |        # Code to debug
 76 |    
 77 |    # Track errors
 78 |    try:
 79 |        # Risky operation
 80 |    except Exception as e:
 81 |        error_id = get_error_tracker().record_error(e, context={"operation": "description"})
 82 |        print(f"Error recorded with ID: {error_id}")
 83 |    ```
 84 | 
 85 | ## Testing and Debugging
 86 | 
 87 | ### Test-Driven Development
 88 | 
 89 | This project follows Test-Driven Development (TDD) principles:
 90 | 
 91 | 1. Write a failing test first (Red)
 92 | 2. Write minimal code to make the test pass (Green)
 93 | 3. Refactor for clean code while keeping tests passing (Refactor)
 94 | 
 95 | Our TDD documentation can be found in [docs/tdd/workflow.md](docs/tdd/workflow.md).
 96 | 
 97 | ### Debugging Framework
 98 | 
 99 | We use Agans' 9 Rules of Debugging:
100 | 
101 | 1. Understand the System
102 | 2. Make It Fail
103 | 3. Quit Thinking and Look
104 | 4. Divide and Conquer
105 | 5. Change One Thing at a Time
106 | 6. Keep an Audit Trail
107 | 7. Check the Plug
108 | 8. Get a Fresh View
109 | 9. If You Didn't Fix It, It Isn't Fixed
110 | 
111 | Learn more about our debugging approach in [docs/debuggers/agans_9_rules.md](docs/debuggers/agans_9_rules.md).
112 | 
113 | ## Documentation
114 | 
115 | - [System Architecture](docs/system_architecture/README.md)
116 | - [Core Components](docs/components/README.md)
117 | - [API Reference](docs/api/README.md)
118 | - [Development Guide](docs/development/README.md)
119 | - [Workflows](docs/workflows/README.md)
120 | - [TDD Workflow](docs/tdd/workflow.md)
121 | - [Debugging Practices](docs/debuggers/best_practices.md)
122 | 
123 | ## Contributing
124 | 
125 | We welcome contributions! Please see our [Contributing Guide](CONTRIBUTING.md) for details.
126 | 
127 | ## License
128 | 
129 | This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
130 | 
131 | ## Support
132 | 
133 | - [Issue Tracker](https://github.com/modelcontextprotocol/mcp-codebase-insight/issues)
134 | - [Discussions](https://github.com/modelcontextprotocol/mcp-codebase-insight/discussions)
135 | 
```

--------------------------------------------------------------------------------
/tests/README.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Test Structure
  2 | 
  3 | This directory contains the test suite for the MCP Codebase Insight project. The tests are organized into the following structure:
  4 | 
  5 | ## Directory Structure
  6 | 
  7 | ```
  8 | tests/
  9 | ├── components/           # Component-level tests
 10 | │   ├── test_vector_store.py
 11 | │   ├── test_knowledge_base.py
 12 | │   ├── test_task_manager.py
 13 | │   └── ...
 14 | ├── integration/         # Integration and API tests
 15 | │   ├── test_api_endpoints.py
 16 | │   └── test_server.py
 17 | ├── config/             # Configuration tests
 18 | │   └── test_config_and_env.py
 19 | ├── conftest.py         # Shared test fixtures
 20 | └── README.md           # This file
 21 | ```
 22 | 
 23 | ## Test Categories
 24 | 
 25 | 1. **Component Tests** (`components/`)
 26 |    - Unit tests for individual components
 27 |    - Tests component initialization, methods, and cleanup
 28 |    - Isolated from other components where possible
 29 | 
 30 | 2. **Integration Tests** (`integration/`)
 31 |    - Tests for API endpoints
 32 |    - Server lifecycle tests
 33 |    - Component interaction tests
 34 | 
 35 | 3. **Configuration Tests** (`config/`)
 36 |    - Environment variable handling
 37 |    - Configuration file parsing
 38 |    - Directory setup and permissions
 39 | 
 40 | ## API Test Coverage
 41 | 
 42 | The following API endpoints are tested in the integration tests:
 43 | 
 44 | | Endpoint | Test Status | Test File |
 45 | |----------|-------------|-----------|
 46 | | `/health` | ✅ Tested | `test_api_endpoints.py` |
 47 | | `/api/vector-store/search` | ✅ Tested | `test_api_endpoints.py` |
 48 | | `/api/docs/adrs` | ✅ Tested | `test_api_endpoints.py` |
 49 | | `/api/docs/adrs/{adr_id}` | ✅ Tested | `test_api_endpoints.py` |
 50 | | `/api/docs/patterns` | ✅ Tested | `test_api_endpoints.py` |
 51 | | `/api/docs/patterns/{pattern_id}` | ✅ Tested | `test_api_endpoints.py` |
 52 | | `/api/analyze` | ✅ Tested | `test_api_endpoints.py` |
 53 | | `/api/tasks/create` | ✅ Tested | `test_api_endpoints.py` |
 54 | | `/api/tasks` | ✅ Tested | `test_api_endpoints.py` |
 55 | | `/api/tasks/{task_id}` | ✅ Tested | `test_api_endpoints.py` |
 56 | | `/api/debug/issues` | ✅ Tested | `test_api_endpoints.py` |
 57 | | `/api/debug/issues/{issue_id}` | ✅ Tested | `test_api_endpoints.py` |
 58 | | `/api/debug/issues/{issue_id}/analyze` | ✅ Tested | `test_api_endpoints.py` |
 59 | | `/tools/*` | ✅ Tested | `test_api_endpoints.py` |
 60 | 
 61 | Each test verifies:
 62 | - Successful responses with valid input
 63 | - Error handling with invalid input
 64 | - Response structure and content validation
 65 | - Edge cases where applicable
 66 | 
 67 | ## Running Tests
 68 | 
 69 | To run all tests:
 70 | ```bash
 71 | python -m pytest tests/
 72 | ```
 73 | 
 74 | To run specific test categories:
 75 | ```bash
 76 | # Run component tests
 77 | python -m pytest tests/components/
 78 | 
 79 | # Run integration tests
 80 | python -m pytest tests/integration/
 81 | 
 82 | # Run config tests
 83 | python -m pytest tests/config/
 84 | 
 85 | # Run API endpoint tests only
 86 | python -m pytest tests/integration/test_api_endpoints.py
 87 | 
 88 | # Run tests for a specific API endpoint
 89 | python -m pytest tests/integration/test_api_endpoints.py::test_health_check
 90 | ```
 91 | 
 92 | ## Test Fixtures
 93 | 
 94 | Shared test fixtures are defined in `conftest.py` and include:
 95 | 
 96 | - `temp_dir`: Temporary directory for test files
 97 | - `test_config`: Server configuration for testing
 98 | - `embedder`: Sentence transformer embedder
 99 | - `vector_store`: Vector store instance
100 | - `test_server`: Server instance for testing
101 | - `test_client`: FastAPI test client
102 | - `test_code`: Sample code for testing
103 | - `test_adr`: Sample ADR data
104 | - `env_vars`: Environment variables for testing
105 | 
106 | ## Writing New Tests
107 | 
108 | 1. Place new tests in the appropriate directory based on what they're testing
109 | 2. Use the shared fixtures from `conftest.py`
110 | 3. Follow the existing patterns for async tests and cleanup
111 | 4. Add proper docstrings and comments
112 | 5. Ensure proper cleanup in fixtures that create resources
113 | 
114 | ## Test Dependencies
115 | 
116 | The test suite has the following dependencies:
117 | - pytest
118 | - pytest-asyncio
119 | - httpx
120 | - fastapi
121 | - sentence-transformers
122 | 
123 | Make sure these are installed before running tests. 
```

--------------------------------------------------------------------------------
/docs/README.md:
--------------------------------------------------------------------------------

```markdown
 1 | # MCP Codebase Insight Documentation
 2 | 
 3 | Welcome to the MCP Codebase Insight documentation. This directory contains detailed information about installation, configuration, usage, and development of the MCP Codebase Insight tool.
 4 | 
 5 | ## Documentation Structure
 6 | 
 7 | ### Getting Started
 8 | - [Installation Guide](getting-started/installation.md) - Complete installation instructions
 9 | - [Configuration Guide](getting-started/configuration.md) - Configuration options and environment setup
10 | - [Quick Start Tutorial](getting-started/quickstart.md) - Get up and running quickly
11 | - [Qdrant Setup](getting-started/qdrant_setup.md) - Vector database setup and configuration
12 | 
13 | ### Core Features
14 | - [Code Analysis](features/code-analysis.md) - Understanding code patterns and insights
15 | - [ADR Management](features/adr-management.md) - Managing architectural decisions
16 | - [Documentation Management](features/documentation.md) - Auto-generation and maintenance
17 | - [Knowledge Base](features/knowledge-base.md) - Pattern storage and retrieval
18 | - [Debug System](features/debug-system.md) - Intelligent debugging assistance
19 | - [Build Verification](features/build-verification.md) - Automated build checks
20 | 
21 | ### API Reference
22 | - [REST API](api/rest-api.md) - Complete API endpoint documentation
23 | - [SSE Integration](SSE_INTEGRATION.md) - Server-Sent Events integration guide
24 | - [Vector Store API](api/vector-store-api.md) - Vector database interaction
25 | - [Client Libraries](api/client-libraries.md) - Available client SDKs
26 | 
27 | ### Development
28 | - [Contributing Guide](development/contributing.md) - How to contribute to the project
29 | - [Architecture Overview](development/architecture.md) - System architecture and design
30 | - [Testing Guide](testing_guide.md) - Writing and running tests
31 | - [Best Practices](development/best-practices.md) - Coding standards and guidelines
32 | 
33 | ### Deployment
34 | - [Production Deployment](deployment/production.md) - Production setup guide
35 | - [Docker Deployment](deployment/docker.md) - Container-based deployment
36 | - [Scaling Guide](deployment/scaling.md) - Handling increased load
37 | - [Monitoring](deployment/monitoring.md) - System monitoring and alerts
38 | 
39 | ### Troubleshooting
40 | - [Common Issues](troubleshooting/common-issues.md) - Frequently encountered problems
41 | - [FAQ](troubleshooting/faq.md) - Frequently asked questions
42 | - [Debug Guide](troubleshooting/debug-guide.md) - Advanced debugging techniques
43 | - [Support](troubleshooting/support.md) - Getting help and support
44 | 
45 | ## Quick Links
46 | 
47 | - [GitHub Repository](https://github.com/modelcontextprotocol/mcp-codebase-insight)
48 | - [Issue Tracker](https://github.com/modelcontextprotocol/mcp-codebase-insight/issues)
49 | - [Discussions](https://github.com/modelcontextprotocol/mcp-codebase-insight/discussions)
50 | - [Release Notes](CHANGELOG.md)
51 | - [License](../LICENSE)
52 | 
53 | ## Contributing to Documentation
54 | 
55 | We welcome contributions to improve this documentation. Please see our [Contributing Guide](development/contributing.md) for details on:
56 | 
57 | - Documentation style guide
58 | - How to submit documentation changes
59 | - Documentation testing
60 | - Building documentation locally
61 | 
62 | ## Documentation Versions
63 | 
64 | This documentation corresponds to the latest stable release of MCP Codebase Insight. For other versions:
65 | 
66 | - [Latest Development](https://github.com/modelcontextprotocol/mcp-codebase-insight/tree/main/docs)
67 | - [Version History](https://github.com/modelcontextprotocol/mcp-codebase-insight/releases)
68 | 
69 | ## Support
70 | 
71 | If you need help or have questions:
72 | 
73 | 1. Check the [FAQ](troubleshooting/faq.md) and [Common Issues](troubleshooting/common-issues.md)
74 | 2. Search existing [GitHub Issues](https://github.com/modelcontextprotocol/mcp-codebase-insight/issues)
75 | 3. Join our [Discussion Forum](https://github.com/modelcontextprotocol/mcp-codebase-insight/discussions)
76 | 4. Open a new issue if needed
77 | 
```

--------------------------------------------------------------------------------
/docs/system_architecture/README.md:
--------------------------------------------------------------------------------

```markdown
  1 | # System Architecture
  2 | 
  3 | > 🚧 **Documentation In Progress**
  4 | > 
  5 | > This documentation is being actively developed. More details will be added soon.
  6 | 
  7 | ## Overview
  8 | 
  9 | This document provides a comprehensive overview of the MCP Codebase Insight system architecture. For detailed workflow information, please see the [Workflows Documentation](../workflows/README.md).
 10 | 
 11 | ## Architecture Components
 12 | 
 13 | ### Core Systems
 14 | - Vector Store System
 15 | - Knowledge Base
 16 | - Task Management
 17 | - Health Monitoring
 18 | - Error Handling
 19 | - Metrics Collection
 20 | - Cache Management
 21 | 
 22 | ### Documentation
 23 | - ADR Management
 24 | - Documentation Tools
 25 | - API Documentation
 26 | 
 27 | ### Testing
 28 | - Test Framework
 29 | - SSE Testing
 30 | - Integration Testing
 31 | 
 32 | ## Detailed Documentation
 33 | 
 34 | - [Core Components](../components/README.md)
 35 | - [API Reference](../api/README.md)
 36 | - [Development Guide](../development/README.md)
 37 | 
 38 | ## System Overview
 39 | 
 40 | This document provides a comprehensive overview of the MCP Codebase Insight system architecture, focusing on system interactions, dependencies, and design considerations.
 41 | 
 42 | ## Core Systems
 43 | 
 44 | ### 1. Vector Store System (`src/mcp_codebase_insight/core/vector_store.py`)
 45 | - **Purpose**: Manages code embeddings and semantic search capabilities
 46 | - **Key Components**:
 47 |   - Qdrant integration for vector storage
 48 |   - Embedding generation and management
 49 |   - Search optimization and caching
 50 | - **Integration Points**:
 51 |   - Knowledge Base for semantic understanding
 52 |   - Cache Management for performance optimization
 53 |   - Health Monitoring for system status
 54 | 
 55 | ### 2. Knowledge Base (`src/mcp_codebase_insight/core/knowledge.py`)
 56 | - **Purpose**: Central repository for code insights and relationships
 57 | - **Key Components**:
 58 |   - Pattern detection and storage
 59 |   - Relationship mapping
 60 |   - Semantic analysis
 61 | - **Feedback Loops**:
 62 |   - Updates vector store with new patterns
 63 |   - Receives feedback from code analysis
 64 |   - Improves pattern detection over time
 65 | 
 66 | ### 3. Task Management (`src/mcp_codebase_insight/core/tasks.py`)
 67 | - **Purpose**: Handles async operations and job scheduling
 68 | - **Key Components**:
 69 |   - Task scheduling and prioritization
 70 |   - Progress tracking
 71 |   - Resource management
 72 | - **Bottleneck Mitigation**:
 73 |   - Task queuing strategies
 74 |   - Resource allocation
 75 |   - Error recovery
 76 | 
 77 | ### 4. Health Monitoring (`src/mcp_codebase_insight/core/health.py`)
 78 | - **Purpose**: System health and performance monitoring
 79 | - **Key Components**:
 80 |   - Component status tracking
 81 |   - Performance metrics
 82 |   - Alert system
 83 | - **Feedback Mechanisms**:
 84 |   - Real-time status updates
 85 |   - Performance optimization triggers
 86 |   - System recovery procedures
 87 | 
 88 | ### 5. Error Handling (`src/mcp_codebase_insight/core/errors.py`)
 89 | - **Purpose**: Centralized error management
 90 | - **Key Components**:
 91 |   - Error classification
 92 |   - Recovery strategies
 93 |   - Logging and reporting
 94 | - **Resilience Features**:
 95 |   - Graceful degradation
 96 |   - Circuit breakers
 97 |   - Error propagation control
 98 | 
 99 | ## System Interactions
100 | 
101 | ### Critical Paths
102 | 1. **Code Analysis Flow**:
103 |    ```mermaid
104 |    sequenceDiagram
105 |        participant CA as Code Analysis
106 |        participant KB as Knowledge Base
107 |        participant VS as Vector Store
108 |        participant CM as Cache
109 |        
110 |        CA->>VS: Request embeddings
111 |        VS->>CM: Check cache
112 |        CM-->>VS: Return cached/null
113 |        VS->>KB: Get patterns
114 |        KB-->>VS: Return patterns
115 |        VS-->>CA: Return analysis
116 |    ```
117 | 
118 | 2. **Health Monitoring Flow**:
119 |    ```mermaid
120 |    sequenceDiagram
121 |        participant HM as Health Monitor
122 |        participant CS as Component State
123 |        participant TM as Task Manager
124 |        participant EH as Error Handler
125 |        
126 |        HM->>CS: Check states
127 |        CS->>TM: Verify tasks
128 |        TM-->>CS: Task status
129 |        CS-->>HM: System status
130 |        HM->>EH: Report issues
131 |    ```
132 | 
133 | ## Performance Considerations
134 | 
135 | ### Caching Strategy
136 | - Multi-level caching (memory and disk)
137 | - Cache invalidation triggers
138 | - Cache size management
139 | 
140 | ### Scalability Points
141 | 1. Vector Store:
142 |    - Horizontal scaling capabilities
143 |    - Batch processing optimization
144 |    - Search performance tuning
145 | 
146 | 2. Task Management:
147 |    - Worker pool management
148 |    - Task prioritization
149 |    - Resource allocation
150 | 
151 | ## Error Recovery
152 | 
153 | ### Failure Scenarios
154 | 1. Vector Store Unavailable:
155 |    - Fallback to cached results
156 |    - Graceful degradation of search
157 |    - Automatic reconnection
158 | 
159 | 2. Task Overload:
160 |    - Dynamic task throttling
161 |    - Priority-based scheduling
162 |    - Resource reallocation
163 | 
164 | ## System Evolution
165 | 
166 | ### Extension Points
167 | 1. Knowledge Base:
168 |    - Plugin system for new patterns
169 |    - Custom analyzers
170 |    - External integrations
171 | 
172 | 2. Monitoring:
173 |    - Custom metrics
174 |    - Alert integrations
175 |    - Performance profiling
176 | 
177 | ## Next Steps
178 | 
179 | 1. **Documentation Needs**:
180 |    - Detailed component interaction guides
181 |    - Performance tuning documentation
182 |    - Deployment architecture guides
183 | 
184 | 2. **System Improvements**:
185 |    - Enhanced caching strategies
186 |    - More robust error recovery
187 |    - Better performance monitoring 
```

--------------------------------------------------------------------------------
/.github/agents/README.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Custom Agents for MCP Codebase Insight
  2 | 
  3 | This directory contains specialized AI agent instructions tailored for the MCP Codebase Insight project. Each agent has deep knowledge of specific aspects of the codebase and can help you work more effectively.
  4 | 
  5 | ## Available Agents
  6 | 
  7 | ### 🧪 [TestAgent](./TestAgent.agent.md)
  8 | **Expertise**: Testing, test runner, async test patterns, debugging test failures
  9 | 
 10 | **Use when:**
 11 | - Writing new tests for features or bug fixes
 12 | - Running tests with the custom test runner
 13 | - Debugging test failures, especially async/event loop issues
 14 | - Improving test coverage
 15 | 
 16 | **Key Knowledge:**
 17 | - Custom `./run_tests.py` test runner usage
 18 | - Test isolation and event loop management
 19 | - pytest-asyncio patterns and fixtures
 20 | - Component and integration test structures
 21 | 
 22 | ---
 23 | 
 24 | ### 🔍 [VectorStoreAgent](./VectorStoreAgent.agent.md)
 25 | **Expertise**: Qdrant vector store, embeddings, semantic search, performance optimization
 26 | 
 27 | **Use when:**
 28 | - Working with the vector store (add, search, update, delete)
 29 | - Managing embeddings and collections
 30 | - Optimizing vector search performance
 31 | - Debugging Qdrant connection issues
 32 | 
 33 | **Key Knowledge:**
 34 | - VectorStore and EmbeddingProvider APIs
 35 | - Qdrant version compatibility
 36 | - Batch operations and filters
 37 | - Performance best practices
 38 | 
 39 | ---
 40 | 
 41 | ### 📝 [DocAgent](./DocAgent.agent.md)
 42 | **Expertise**: Documentation, ADRs, API docs, code comments, architecture diagrams
 43 | 
 44 | **Use when:**
 45 | - Creating or updating documentation
 46 | - Writing Architecture Decision Records (ADRs)
 47 | - Documenting APIs and code examples
 48 | - Creating architecture diagrams
 49 | 
 50 | **Key Knowledge:**
 51 | - ADR management system
 52 | - Documentation structure and templates
 53 | - Docstring format (Google style)
 54 | - Mermaid diagram syntax
 55 | 
 56 | ---
 57 | 
 58 | ### 🐛 [DebugAgent](./DebugAgent.agent.md)
 59 | **Expertise**: Debugging, issue diagnosis, error handling, Agans' 9 Rules
 60 | 
 61 | **Use when:**
 62 | - Debugging complex issues systematically
 63 | - Handling async/event loop errors
 64 | - Diagnosing Qdrant connection problems
 65 | - Investigating memory leaks or resource issues
 66 | 
 67 | **Key Knowledge:**
 68 | - Agans' 9 Rules of Debugging
 69 | - Common async/event loop issues
 70 | - Configuration and environment problems
 71 | - Systematic debugging workflows
 72 | 
 73 | ---
 74 | 
 75 | ## How to Use These Agents
 76 | 
 77 | ### In VS Code with GitHub Copilot
 78 | 
 79 | 1. **Open the agent file** you need (e.g., `TestAgent.agent.md`)
 80 | 2. **Reference it in Copilot Chat**: "Using @TestAgent, help me write tests for the new feature"
 81 | 3. **Ask specific questions**: "How do I run integration tests in isolation?"
 82 | 
 83 | ### In Claude or Other AI Tools
 84 | 
 85 | 1. **Copy the agent content** into your conversation
 86 | 2. **Provide context**: "I'm the TestAgent for this project..."
 87 | 3. **Ask your question** in the same conversation
 88 | 
 89 | ### General Workflow
 90 | 
 91 | ```mermaid
 92 | graph LR
 93 |     A[Need Help] --> B{What Type?}
 94 |     B -->|Testing| C[TestAgent]
 95 |     B -->|Vector Store| D[VectorStoreAgent]
 96 |     B -->|Documentation| E[DocAgent]
 97 |     B -->|Debugging| F[DebugAgent]
 98 |     
 99 |     C --> G[Get Specialized Help]
100 |     D --> G
101 |     E --> G
102 |     F --> G
103 | ```
104 | 
105 | ## Agent Selection Guide
106 | 
107 | | Task | Recommended Agent | Why |
108 | |------|------------------|-----|
109 | | Write unit tests | TestAgent | Knows test patterns and runner |
110 | | Fix failing tests | TestAgent + DebugAgent | Testing expertise + debugging |
111 | | Add vector search | VectorStoreAgent | Deep Qdrant knowledge |
112 | | Optimize queries | VectorStoreAgent | Performance expertise |
113 | | Create ADR | DocAgent | ADR system expert |
114 | | Update API docs | DocAgent | Documentation specialist |
115 | | Debug async error | DebugAgent | Async troubleshooting expert |
116 | | Qdrant won't connect | VectorStoreAgent + DebugAgent | Both have relevant knowledge |
117 | | Memory leak | DebugAgent | Resource debugging specialist |
118 | 
119 | ## Multi-Agent Collaboration
120 | 
121 | For complex tasks, you can use multiple agents:
122 | 
123 | **Example: Adding a New Feature**
124 | 
125 | 1. **VectorStoreAgent**: Implement vector store operations
126 | 2. **TestAgent**: Write comprehensive tests
127 | 3. **DocAgent**: Document the feature and create ADR
128 | 4. **DebugAgent**: Help if issues arise during development
129 | 
130 | **Example Workflow:**
131 | 
132 | ```bash
133 | # 1. Implement feature with VectorStoreAgent
134 | # "Help me add batch delete operation to VectorStore"
135 | 
136 | # 2. Write tests with TestAgent  
137 | # "Create tests for the batch delete operation"
138 | 
139 | # 3. Debug issues with DebugAgent
140 | # "Tests failing with event loop errors, help debug"
141 | 
142 | # 4. Document with DocAgent
143 | # "Document the new batch delete feature and create an ADR"
144 | ```
145 | 
146 | ## Creating Your Own Agent
147 | 
148 | If you need a specialized agent for a specific domain:
149 | 
150 | ```markdown
151 | # [YourAgent] Agent
152 | 
153 | You are a specialized [domain] agent for MCP Codebase Insight.
154 | 
155 | ## Your Responsibilities
156 | 1. [Primary responsibility]
157 | 2. [Secondary responsibility]
158 | 
159 | ## Critical Knowledge
160 | - [Key concept 1]
161 | - [Key concept 2]
162 | 
163 | ## Common Operations
164 | [Examples and patterns]
165 | 
166 | ## When to Escalate
167 | [Limitations and handoff criteria]
168 | ```
169 | 
170 | ## Agent Updates
171 | 
172 | These agents are living documents. Update them when:
173 | - New patterns emerge in the codebase
174 | - Common issues are discovered and solved
175 | - APIs change significantly
176 | - New best practices are established
177 | 
178 | ## Feedback
179 | 
180 | If an agent:
181 | - Gives incorrect information → Update the agent file
182 | - Is missing important context → Add it to the agent
183 | - Doesn't cover your use case → Create a new agent or extend existing one
184 | 
185 | ## Related
186 | 
187 | - [Main Copilot Instructions](../copilot-instructions.md) - General project guidance
188 | - [Contributing Guide](../../CONTRIBUTING.md) - How to contribute
189 | - [Testing Guide](../../docs/testing_guide.md) - Detailed testing information
190 | - [Architecture Docs](../../docs/system_architecture/) - System design
191 | 
```

--------------------------------------------------------------------------------
/docs/workflows/README.md:
--------------------------------------------------------------------------------

```markdown
  1 | # MCP Codebase Insight Workflows
  2 | 
  3 | ## Overview
  4 | 
  5 | This document details the various workflows supported by MCP Codebase Insight, including both user-facing and system-level processes. These workflows are designed to help developers effectively use and interact with the system's features.
  6 | 
  7 | ## Quick Navigation
  8 | 
  9 | - [User Workflows](#user-workflows)
 10 |   - [Code Analysis](#1-code-analysis-workflow)
 11 |   - [Documentation Management](#2-documentation-management-workflow)
 12 |   - [Testing](#3-testing-workflow)
 13 | - [System Workflows](#system-workflows)
 14 |   - [Vector Store Operations](#1-vector-store-operations)
 15 |   - [Health Monitoring](#2-health-monitoring)
 16 | - [Integration Points](#integration-points)
 17 | - [Best Practices](#best-practices)
 18 | - [Troubleshooting](#troubleshooting)
 19 | - [Next Steps](#next-steps)
 20 | 
 21 | ## User Workflows
 22 | 
 23 | ### 1. Code Analysis Workflow
 24 | 
 25 | #### Process Flow
 26 | ```mermaid
 27 | graph TD
 28 |     A[Developer] -->|Submit Code| B[Analysis Request]
 29 |     B --> C{Analysis Type}
 30 |     C -->|Pattern Detection| D[Pattern Analysis]
 31 |     C -->|Semantic Search| E[Vector Search]
 32 |     C -->|Documentation| F[Doc Analysis]
 33 |     D --> G[Results]
 34 |     E --> G
 35 |     F --> G
 36 |     G -->|Display| A
 37 | ```
 38 | 
 39 | #### Steps
 40 | 1. **Submit Code**
 41 |    - Upload code files or provide repository URL
 42 |    - Specify analysis parameters
 43 |    - Set analysis scope
 44 | 
 45 | 2. **Analysis Processing**
 46 |    - Pattern detection runs against known patterns
 47 |    - Semantic search finds similar code
 48 |    - Documentation analysis checks coverage
 49 | 
 50 | 3. **Results Review**
 51 |    - View detected patterns
 52 |    - Review suggestions
 53 |    - Access related documentation
 54 | 
 55 | ### 2. Documentation Management Workflow
 56 | 
 57 | #### Process Flow
 58 | ```mermaid
 59 | graph TD
 60 |     A[Developer] -->|Create/Update| B[Documentation]
 61 |     B --> C{Doc Type}
 62 |     C -->|ADR| D[ADR Processing]
 63 |     C -->|API| E[API Docs]
 64 |     C -->|Guide| F[User Guide]
 65 |     D --> G[Link Analysis]
 66 |     E --> G
 67 |     F --> G
 68 |     G -->|Update| H[Doc Map]
 69 |     H -->|Validate| A
 70 | ```
 71 | 
 72 | #### Steps
 73 | 1. **Create/Update Documentation**
 74 |    - Choose document type
 75 |    - Write content
 76 |    - Add metadata
 77 | 
 78 | 2. **Processing**
 79 |    - Analyze document relationships
 80 |    - Update documentation map
 81 |    - Validate links
 82 | 
 83 | 3. **Validation**
 84 |    - Check for broken links
 85 |    - Verify consistency
 86 |    - Update references
 87 | 
 88 | ### 3. Testing Workflow
 89 | 
 90 | #### Process Flow
 91 | ```mermaid
 92 | graph TD
 93 |     A[Developer] -->|Run Tests| B[Test Suite]
 94 |     B --> C{Test Type}
 95 |     C -->|Unit| D[Unit Tests]
 96 |     C -->|Integration| E[Integration Tests]
 97 |     C -->|SSE| F[SSE Tests]
 98 |     D --> G[Results]
 99 |     E --> G
100 |     F --> G
101 |     G -->|Report| A
102 | ```
103 | 
104 | #### Steps
105 | 1. **Test Initialization**
106 |    - Set up test environment
107 |    - Configure test parameters
108 |    - Prepare test data
109 | 
110 | 2. **Test Execution**
111 |    - Run selected test types
112 |    - Monitor progress
113 |    - Collect results
114 | 
115 | 3. **Results Analysis**
116 |    - Review test reports
117 |    - Analyze failures
118 |    - Generate coverage reports
119 | 
120 | ## System Workflows
121 | 
122 | ### 1. Vector Store Operations
123 | 
124 | #### Process Flow
125 | ```mermaid
126 | sequenceDiagram
127 |     participant User
128 |     participant Server
129 |     participant Cache
130 |     participant VectorStore
131 |     participant Knowledge
132 |     
133 |     User->>Server: Request Analysis
134 |     Server->>Cache: Check Cache
135 |     Cache-->>Server: Cache Hit/Miss
136 |     
137 |     alt Cache Miss
138 |         Server->>VectorStore: Generate Embeddings
139 |         VectorStore->>Knowledge: Get Patterns
140 |         Knowledge-->>VectorStore: Return Patterns
141 |         VectorStore-->>Server: Return Results
142 |         Server->>Cache: Update Cache
143 |     end
144 |     
145 |     Server-->>User: Return Analysis
146 | ```
147 | 
148 | #### Components
149 | 1. **Cache Layer**
150 |    - In-memory cache for frequent requests
151 |    - Disk cache for larger datasets
152 |    - Cache invalidation strategy
153 | 
154 | 2. **Vector Store**
155 |    - Embedding generation
156 |    - Vector search
157 |    - Pattern matching
158 | 
159 | 3. **Knowledge Base**
160 |    - Pattern storage
161 |    - Relationship tracking
162 |    - Context management
163 | 
164 | ### 2. Health Monitoring
165 | 
166 | #### Process Flow
167 | ```mermaid
168 | sequenceDiagram
169 |     participant Monitor
170 |     participant Components
171 |     participant Tasks
172 |     participant Alerts
173 |     
174 |     loop Every 30s
175 |         Monitor->>Components: Check Status
176 |         Components->>Tasks: Verify Tasks
177 |         Tasks-->>Components: Task Status
178 |         
179 |         alt Issues Detected
180 |             Components->>Alerts: Raise Alert
181 |             Alerts->>Monitor: Alert Status
182 |         end
183 |         
184 |         Components-->>Monitor: System Status
185 |     end
186 | ```
187 | 
188 | #### Components
189 | 1. **Monitor**
190 |    - Regular health checks
191 |    - Performance monitoring
192 |    - Resource tracking
193 | 
194 | 2. **Components**
195 |    - Service status
196 |    - Resource usage
197 |    - Error rates
198 | 
199 | 3. **Tasks**
200 |    - Task queue status
201 |    - Processing rates
202 |    - Error handling
203 | 
204 | 4. **Alerts**
205 |    - Alert generation
206 |    - Notification routing
207 |    - Alert history
208 | 
209 | ## Integration Points
210 | 
211 | ### 1. External Systems
212 | - Version Control Systems
213 | - CI/CD Pipelines
214 | - Issue Tracking Systems
215 | - Documentation Platforms
216 | 
217 | ### 2. APIs
218 | - REST API for main operations
219 | - SSE for real-time updates
220 | - WebSocket for bi-directional communication
221 | 
222 | ### 3. Storage
223 | - Vector Database (Qdrant)
224 | - Cache Storage
225 | - Document Storage
226 | 
227 | ## Best Practices
228 | 
229 | ### 1. Code Analysis
230 | - Regular analysis scheduling
231 | - Incremental analysis for large codebases
232 | - Pattern customization
233 | 
234 | ### 2. Documentation
235 | - Consistent formatting
236 | - Regular updates
237 | - Link validation
238 | 
239 | ### 3. Testing
240 | - Comprehensive test coverage
241 | - Regular test runs
242 | - Performance benchmarking
243 | 
244 | ## Troubleshooting
245 | 
246 | ### Common Issues
247 | 1. **Analysis Failures**
248 |    - Check input validation
249 |    - Verify system resources
250 |    - Review error logs
251 | 
252 | 2. **Performance Issues**
253 |    - Monitor cache hit rates
254 |    - Check vector store performance
255 |    - Review resource usage
256 | 
257 | 3. **Integration Issues**
258 |    - Verify API endpoints
259 |    - Check authentication
260 |    - Review connection settings
261 | 
262 | ## Next Steps
263 | 
264 | 1. **Workflow Optimization**
265 |    - Performance improvements
266 |    - Enhanced error handling
267 |    - Better user feedback
268 | 
269 | 2. **New Features**
270 |    - Custom workflow creation
271 |    - Advanced analysis options
272 |    - Extended integration options
273 | 
274 | 3. **Documentation**
275 |    - Workflow examples
276 |    - Integration guides
277 |    - Troubleshooting guides 
```

--------------------------------------------------------------------------------
/CONTRIBUTING.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Contributing to MCP Codebase Insight
 2 | 
 3 | > 🚧 **Documentation In Progress**
 4 | > 
 5 | > This documentation is being actively developed. More details will be added soon.
 6 | 
 7 | ## Getting Started
 8 | 
 9 | 1. Fork the repository
10 | 2. Clone your fork
11 | 3. Create a new branch
12 | 4. Make your changes
13 | 5. Submit a pull request
14 | 
15 | ## Development Setup
16 | 
17 | See the [Development Guide](docs/development/README.md) for detailed setup instructions.
18 | 
19 | ## Code Style
20 | 
21 | - Follow PEP 8 guidelines
22 | - Use type hints
23 | - Write docstrings for all public functions and classes
24 | - Keep functions focused and small
25 | - Write clear commit messages
26 | 
27 | ## Testing
28 | 
29 | - Write tests for new features
30 | - Ensure all tests pass before submitting PR
31 | - Include both unit and integration tests
32 | - Document test cases
33 | 
34 | ## Documentation
35 | 
36 | - Update documentation for new features
37 | - Follow the documentation style guide
38 | - Include examples where appropriate
39 | - Keep documentation up to date with code
40 | 
41 | ## Pull Request Process
42 | 
43 | 1. Update documentation
44 | 2. Add tests
45 | 3. Update CHANGELOG.md
46 | 4. Submit PR with clear description
47 | 5. Address review comments
48 | 
49 | ## Code of Conduct
50 | 
51 | Please note that this project is released with a [Code of Conduct](CODE_OF_CONDUCT.md). By participating in this project you agree to abide by its terms.
52 | 
```

--------------------------------------------------------------------------------
/CLAUDE.md:
--------------------------------------------------------------------------------

```markdown
 1 | # TechPath Project Guidelines
 2 | 
 3 | ## Build & Test Commands
 4 | - **Python**: `make install-dev` (setup), `make start` (run server), `make check` (all checks)
 5 | - **Python Tests**: `make test` or `pytest tests/test_file.py::test_function_name` (single test)
 6 | - **Frontend**: `cd project && npm run dev` (development), `npm run build` (production)
 7 | - **Frontend Tests**: `cd project && npm test` or `npm test -- -t "test name pattern"` (single test)
 8 | - **Linting**: `make lint` (Python), `cd project && npm run lint` (TypeScript/React)
 9 | - **Formatting**: `make format` (Python), `prettier --write src/` (Frontend)
10 | 
11 | ## Code Style Guidelines
12 | - **Python**: Black (88 chars), isort for imports, type hints required
13 | - **TypeScript**: 2-space indent, semicolons, strong typing with interfaces
14 | - **Imports**: Group by external then internal, alphabetize
15 | - **React**: Functional components with hooks, avoid class components
16 | - **Types**: Define interfaces in separate files when reused
17 | - **Naming**: camelCase for JS/TS variables, PascalCase for components/types, snake_case for Python
18 | - **Error Handling**: Try/catch in async functions, propagate errors with descriptive messages
19 | - **Comments**: Document complex logic, interfaces, and function parameters/returns
20 | - **Testing**: Unit test coverage required, mock external dependencies
```

--------------------------------------------------------------------------------
/docs/development/CONTRIBUTING.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Contributing Guidelines
  2 | 
  3 | > 🚧 **Documentation In Progress**
  4 | > 
  5 | > This documentation is being actively developed. More details will be added soon.
  6 | 
  7 | ## Welcome!
  8 | 
  9 | Thank you for considering contributing to MCP Codebase Insight! This document provides guidelines and workflows for contributing.
 10 | 
 11 | ## Code of Conduct
 12 | 
 13 | Please read and follow our [Code of Conduct](CODE_OF_CONDUCT.md).
 14 | 
 15 | ## How Can I Contribute?
 16 | 
 17 | ### Reporting Bugs
 18 | 
 19 | 1. Check if the bug is already reported in [Issues](https://github.com/modelcontextprotocol/mcp-codebase-insight/issues)
 20 | 2. If not, create a new issue with:
 21 |    - Clear title
 22 |    - Detailed description
 23 |    - Steps to reproduce
 24 |    - Expected vs actual behavior
 25 |    - Environment details
 26 | 
 27 | ### Suggesting Enhancements
 28 | 
 29 | 1. Check existing issues and discussions
 30 | 2. Create a new issue with:
 31 |    - Clear title
 32 |    - Detailed description
 33 |    - Use cases
 34 |    - Implementation ideas (optional)
 35 | 
 36 | ### Pull Requests
 37 | 
 38 | 1. Fork the repository
 39 | 2. Create a feature branch
 40 | 3. Make your changes
 41 | 4. Run tests and linting
 42 | 5. Submit PR with:
 43 |    - Clear title
 44 |    - Description of changes
 45 |    - Reference to related issues
 46 |    - Updated documentation
 47 | 
 48 | ## Development Process
 49 | 
 50 | ### 1. Setup Development Environment
 51 | 
 52 | Follow the [Development Guide](README.md) for setup instructions.
 53 | 
 54 | ### 2. Make Changes
 55 | 
 56 | 1. Create a branch:
 57 |    ```bash
 58 |    git checkout -b feature/your-feature
 59 |    ```
 60 | 
 61 | 2. Make changes following our style guide
 62 | 3. Add tests for new functionality
 63 | 4. Update documentation
 64 | 
 65 | ### 3. Test Your Changes
 66 | 
 67 | ```bash
 68 | # Run all tests
 69 | pytest
 70 | 
 71 | # Run specific test file
 72 | pytest tests/path/to/test_file.py
 73 | 
 74 | # Run with coverage
 75 | pytest --cov=src tests/
 76 | ```
 77 | 
 78 | ### 4. Submit Changes
 79 | 
 80 | 1. Push to your fork
 81 | 2. Create pull request
 82 | 3. Wait for review
 83 | 4. Address feedback
 84 | 
 85 | ## Style Guide
 86 | 
 87 | ### Python Code Style
 88 | 
 89 | - Follow PEP 8
 90 | - Use type hints
 91 | - Maximum line length: 88 characters
 92 | - Use docstrings (Google style)
 93 | 
 94 | ### Commit Messages
 95 | 
 96 | ```
 97 | type(scope): description
 98 | 
 99 | [optional body]
100 | 
101 | [optional footer]
102 | ```
103 | 
104 | Types:
105 | - feat: New feature
106 | - fix: Bug fix
107 | - docs: Documentation
108 | - style: Formatting
109 | - refactor: Code restructuring
110 | - test: Adding tests
111 | - chore: Maintenance
112 | 
113 | ### Documentation
114 | 
115 | - Keep README.md updated
116 | - Add docstrings to all public APIs
117 | - Update relevant documentation files
118 | - Include examples for new features
119 | 
120 | ## Review Process
121 | 
122 | 1. Automated checks must pass
123 | 2. At least one maintainer review
124 | 3. All feedback addressed
125 | 4. Documentation updated
126 | 5. Tests added/updated
127 | 
128 | ## Getting Help
129 | 
130 | - Join our [Discord](https://discord.gg/mcp-codebase-insight)
131 | - Ask in GitHub Discussions
132 | - Contact maintainers
133 | 
134 | ## Recognition
135 | 
136 | Contributors will be:
137 | - Listed in CONTRIBUTORS.md
138 | - Mentioned in release notes
139 | - Credited in documentation
140 | 
141 | Thank you for contributing! 
```

--------------------------------------------------------------------------------
/docs/development/CODE_OF_CONDUCT.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Code of Conduct
  2 | 
  3 | > 🚧 **Documentation In Progress**
  4 | > 
  5 | > This documentation is being actively developed. More details will be added soon.
  6 | 
  7 | ## Our Pledge
  8 | 
  9 | We as members, contributors, and leaders pledge to make participation in our
 10 | community a harassment-free experience for everyone, regardless of age, body
 11 | size, visible or invisible disability, ethnicity, sex characteristics, gender
 12 | identity and expression, level of experience, education, socio-economic status,
 13 | nationality, personal appearance, race, religion, or sexual identity
 14 | and orientation.
 15 | 
 16 | We pledge to act and interact in ways that contribute to an open, welcoming,
 17 | diverse, inclusive, and healthy community.
 18 | 
 19 | ## Our Standards
 20 | 
 21 | Examples of behavior that contributes to a positive environment for our
 22 | community include:
 23 | 
 24 | * Demonstrating empathy and kindness toward other people
 25 | * Being respectful of differing opinions, viewpoints, and experiences
 26 | * Giving and gracefully accepting constructive feedback
 27 | * Accepting responsibility and apologizing to those affected by our mistakes,
 28 |   and learning from the experience
 29 | * Focusing on what is best not just for us as individuals, but for the
 30 |   overall community
 31 | 
 32 | Examples of unacceptable behavior include:
 33 | 
 34 | * The use of sexualized language or imagery, and sexual attention or
 35 |   advances of any kind
 36 | * Trolling, insulting or derogatory comments, and personal or political attacks
 37 | * Public or private harassment
 38 | * Publishing others' private information, such as a physical or email
 39 |   address, without their explicit permission
 40 | * Other conduct which could reasonably be considered inappropriate in a
 41 |   professional setting
 42 | 
 43 | ## Enforcement Responsibilities
 44 | 
 45 | Project maintainers are responsible for clarifying and enforcing our standards of
 46 | acceptable behavior and will take appropriate and fair corrective action in
 47 | response to any behavior that they deem inappropriate, threatening, offensive,
 48 | or harmful.
 49 | 
 50 | ## Scope
 51 | 
 52 | This Code of Conduct applies within all community spaces, and also applies when
 53 | an individual is officially representing the community in public spaces.
 54 | 
 55 | ## Enforcement
 56 | 
 57 | Instances of abusive, harassing, or otherwise unacceptable behavior may be
 58 | reported to the project maintainers responsible for enforcement at
 59 | [INSERT CONTACT METHOD].
 60 | 
 61 | All complaints will be reviewed and investigated promptly and fairly.
 62 | 
 63 | ## Enforcement Guidelines
 64 | 
 65 | Project maintainers will follow these Community Impact Guidelines in determining
 66 | the consequences for any action they deem in violation of this Code of Conduct:
 67 | 
 68 | ### 1. Correction
 69 | 
 70 | **Community Impact**: Use of inappropriate language or other behavior deemed
 71 | unprofessional or unwelcome in the community.
 72 | 
 73 | **Consequence**: A private, written warning from project maintainers, providing
 74 | clarity around the nature of the violation and an explanation of why the
 75 | behavior was inappropriate.
 76 | 
 77 | ### 2. Warning
 78 | 
 79 | **Community Impact**: A violation through a single incident or series
 80 | of actions.
 81 | 
 82 | **Consequence**: A warning with consequences for continued behavior. No
 83 | interaction with the people involved, including unsolicited interaction with
 84 | those enforcing the Code of Conduct, for a specified period of time.
 85 | 
 86 | ### 3. Temporary Ban
 87 | 
 88 | **Community Impact**: A serious violation of community standards, including
 89 | sustained inappropriate behavior.
 90 | 
 91 | **Consequence**: A temporary ban from any sort of interaction or public
 92 | communication with the community for a specified period of time.
 93 | 
 94 | ### 4. Permanent Ban
 95 | 
 96 | **Community Impact**: Demonstrating a pattern of violation of community
 97 | standards, including sustained inappropriate behavior, harassment of an
 98 | individual, or aggression toward or disparagement of classes of individuals.
 99 | 
100 | **Consequence**: A permanent ban from any sort of public interaction within
101 | the community.
102 | 
103 | ## Attribution
104 | 
105 | This Code of Conduct is adapted from the [Contributor Covenant][homepage],
106 | version 2.0, available at
107 | https://www.contributor-covenant.org/version/2/0/code_of_conduct.html.
108 | 
109 | [homepage]: https://www.contributor-covenant.org 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/core/__init__.py:
--------------------------------------------------------------------------------

```python
1 | """Core package initialization."""
2 | 
3 | from .config import ServerConfig
4 | 
5 | __all__ = ["ServerConfig"]
6 | 
```

--------------------------------------------------------------------------------
/requirements-dev.txt:
--------------------------------------------------------------------------------

```
1 | pytest>=8.0
2 | pytest-asyncio>=0.26.0
3 | anyio>=3.0.0
4 | httpx>=0.24.0
5 | fastapi[all]>=0.100.0
6 | qdrant-client>=1.2.0
7 | 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/__init__.py:
--------------------------------------------------------------------------------

```python
1 | """MCP Codebase Insight package."""
2 | 
3 | from .core.config import ServerConfig
4 | 
5 | __version__ = "0.2.2"
6 | __all__ = ["ServerConfig"]
7 | 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/utils/__init__.py:
--------------------------------------------------------------------------------

```python
1 | """Utils package initialization."""
2 | 
3 | from .logger import Logger, get_logger, logger
4 | 
5 | __all__ = ["Logger", "get_logger", "logger"]
6 | 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/asgi.py:
--------------------------------------------------------------------------------

```python
 1 | """ASGI application entry point."""
 2 | 
 3 | from .core.config import ServerConfig
 4 | from .server import CodebaseAnalysisServer
 5 | 
 6 | # Create server instance with default config
 7 | config = ServerConfig()
 8 | server = CodebaseAnalysisServer(config)
 9 | 
10 | # Export the FastAPI app instance
11 | app = server.app 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/core/component_status.py:
--------------------------------------------------------------------------------

```python
 1 | """Component status enumeration."""
 2 | 
 3 | from enum import Enum
 4 | 
 5 | class ComponentStatus(str, Enum):
 6 |     """Component status enumeration."""
 7 |     
 8 |     UNINITIALIZED = "uninitialized"
 9 |     INITIALIZING = "initializing"
10 |     INITIALIZED = "initialized"
11 |     FAILED = "failed"
12 |     CLEANING = "cleaning"
13 |     CLEANED = "cleaned" 
```

--------------------------------------------------------------------------------
/module_summaries/database_summary.txt:
--------------------------------------------------------------------------------

```
1 | # Database Module Summary
2 | - **Purpose**: Describe the database's role in the application.
3 | - **Key Components**: List database types, schema designs, and any ORM tools used.
4 | - **Dependencies**: Mention the relationships with the backend and data sources.
5 | - **Largest Files**: Identify the largest database-related files and their purposes.
6 | 
```

--------------------------------------------------------------------------------
/module_summaries/backend_summary.txt:
--------------------------------------------------------------------------------

```
1 | # Backend Module Summary
2 | - **Purpose**: Describe the backend's role in the application.
3 | - **Key Components**: List key components such as main frameworks, APIs, and data handling.
4 | - **Dependencies**: Mention any database connections and external services it relies on.
5 | - **Largest Files**: Identify the largest backend files and their purposes.
6 | 
```

--------------------------------------------------------------------------------
/module_summaries/frontend_summary.txt:
--------------------------------------------------------------------------------

```
1 | # Frontend Module Summary
2 | - **Purpose**: Describe the frontend's role in the application.
3 | - **Key Components**: List key components such as main frameworks, libraries, and UI components.
4 | - **Dependencies**: Mention any dependencies on backend services or external APIs.
5 | - **Largest Files**: Identify the largest frontend files and their purposes.
6 | 
```

--------------------------------------------------------------------------------
/pytest.ini:
--------------------------------------------------------------------------------

```
 1 | [pytest]
 2 | asyncio_mode = strict
 3 | asyncio_default_fixture_loop_scope = session
 4 | testpaths = tests
 5 | python_files = test_*.py
 6 | python_classes = Test*
 7 | python_functions = test_*
 8 | addopts = -v --cov=src/mcp_codebase_insight --cov-report=term-missing 
 9 | filterwarnings =
10 |     ignore::DeprecationWarning:pkg_resources.*
11 |     ignore::DeprecationWarning:importlib.*
12 |     ignore::DeprecationWarning:pytest_asyncio.*
13 |     ignore::DeprecationWarning:pydantic.*
14 |     ignore::pydantic.PydanticDeprecatedSince20 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/version.py:
--------------------------------------------------------------------------------

```python
 1 | """Version information."""
 2 | 
 3 | __version__ = "0.1.0"
 4 | __author__ = "MCP Team"
 5 | __author_email__ = "[email protected]"
 6 | __description__ = "MCP Codebase Insight Server"
 7 | __url__ = "https://github.com/modelcontextprotocol/mcp-codebase-insight"
 8 | __license__ = "MIT"
 9 | 
10 | # Version components
11 | VERSION_MAJOR = 0
12 | VERSION_MINOR = 1
13 | VERSION_PATCH = 0
14 | VERSION_SUFFIX = ""
15 | 
16 | # Build version tuple
17 | VERSION_INFO = (VERSION_MAJOR, VERSION_MINOR, VERSION_PATCH)
18 | 
19 | # Build version string
20 | VERSION = ".".join(map(str, VERSION_INFO))
21 | if VERSION_SUFFIX:
22 |     VERSION += VERSION_SUFFIX
23 | 
```

--------------------------------------------------------------------------------
/test_function.txt:
--------------------------------------------------------------------------------

```
 1 | async def test_health_check(client: httpx.AsyncClient):
 2 |     """Test the health check endpoint."""
 3 |     response = await client.get("/health")
 4 |     
 5 |     assert response.status_code == status.HTTP_200_OK
 6 |     data = response.json()
 7 |     
 8 |     # In test environment, we expect partially initialized state
 9 |     assert "status" in data
10 |     assert "initialized" in data
11 |     
12 |     # We don't assert on components field since it might be missing
13 |     
14 |     # Accept 'ok' status in test environment
15 |     assert data["status"] in ["healthy", "initializing", "ok"], f"Unexpected status: {data["status"]}"
16 |     
17 |     # Print status for debugging
18 |     print(f"Health status: {data}")
```

--------------------------------------------------------------------------------
/tests/integration/fixed_test2.py:
--------------------------------------------------------------------------------

```python
 1 | 
 2 | async def test_health_check(client: httpx.AsyncClient):
 3 |     """Test the health check endpoint."""
 4 |     response = await client.get("/health")
 5 |     
 6 |     assert response.status_code == status.HTTP_200_OK
 7 |     data = response.json()
 8 |     
 9 |     # In test environment, we expect partially initialized state
10 |     assert "status" in data
11 |     assert "initialized" in data
12 |     
13 |     # We don't assert on components field since it might be missing
14 |     
15 |     # Accept 'ok' status in test environment
16 |     assert data["status"] in ["healthy", "initializing", "ok"], f"Unexpected status: {data['status']}"
17 |     
18 |     # Print status for debugging
19 |     print(f"Health status: {data}")
20 | 
```

--------------------------------------------------------------------------------
/run_fixed_tests.sh:
--------------------------------------------------------------------------------

```bash
 1 | #!/bin/bash
 2 | # This script runs tests with proper path and environment setup
 3 | 
 4 | set -e
 5 | 
 6 | # Activate the virtual environment
 7 | source .venv/bin/activate
 8 | 
 9 | # Install the package in development mode
10 | pip install -e .
11 | 
12 | # Set environment variables
13 | export MCP_TEST_MODE=1
14 | export QDRANT_URL="http://localhost:6333"
15 | export MCP_COLLECTION_NAME="test_collection_$(date +%s)"
16 | export PYTHONPATH="$PYTHONPATH:$(pwd)"
17 | 
18 | # Check if we should run a specific test or all tests
19 | if [ $# -eq 0 ]; then
20 |   echo "Running specific vector store tests..."
21 |   python component_test_runner.py tests/components/test_vector_store.py
22 | else
23 |   echo "Running specified tests: $*"
24 |   python component_test_runner.py "$@"
25 | fi
26 | 
```

--------------------------------------------------------------------------------
/debug_tests.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Debug MCP Codebase Insight Tests
 2 | 
 3 | ## Problem Statement
 4 | Debug and fix the test execution issues in the MCP Codebase Insight project. The main test script `run_tests.py` is encountering issues with module imports and test execution.
 5 | 
 6 | ## Current Issues
 7 | 1. Module import errors for `mcp_codebase_insight` package
 8 | 2. Test execution failures
 9 | 3. Coverage reporting issues
10 | 
11 | ## Expected Behavior
12 | - All tests should run successfully
13 | - Coverage reports should be generated
14 | - No import errors should occur
15 | 
16 | ## Additional Context
17 | - The project uses pytest for testing
18 | - Coverage reporting is handled through pytest-cov
19 | - The project is set up with a virtual environment
20 | - Environment variables are set in .env file 
```

--------------------------------------------------------------------------------
/docs/templates/adr.md:
--------------------------------------------------------------------------------

```markdown
 1 | # {title}
 2 | 
 3 | ## Status
 4 | 
 5 | {status}
 6 | 
 7 | ## Context
 8 | 
 9 | {context}
10 | 
11 | ## Decision Drivers
12 | 
13 | <!-- What forces influenced this decision? -->
14 | 
15 | * Technical constraints
16 | * Business requirements
17 | * Resource constraints
18 | * Time constraints
19 | 
20 | ## Considered Options
21 | 
22 | {options}
23 | 
24 | ## Decision
25 | 
26 | {decision}
27 | 
28 | ## Expected Consequences
29 | 
30 | ### Positive Consequences
31 | 
32 | {positive_consequences}
33 | 
34 | ### Negative Consequences
35 | 
36 | {negative_consequences}
37 | 
38 | ## Pros and Cons of the Options
39 | 
40 | {options_details}
41 | 
42 | ## Links
43 | 
44 | <!-- Optional section for links to other decisions, patterns, or resources -->
45 | 
46 | ## Notes
47 | 
48 | {notes}
49 | 
50 | ## Metadata
51 | 
52 | * Created: {created_at}
53 | * Last Modified: {updated_at}
54 | * Author: {author}
55 | * Approvers: {approvers}
56 | * Status: {status}
57 | * Tags: {tags}
58 | {metadata}
59 | 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/models.py:
--------------------------------------------------------------------------------

```python
 1 | """API request and response models."""
 2 | 
 3 | from typing import List, Dict, Any, Optional
 4 | from pydantic import BaseModel
 5 | 
 6 | class ToolRequest(BaseModel):
 7 |     """Base request model for tool endpoints."""
 8 |     name: str
 9 |     arguments: Dict[str, Any]
10 | 
11 | class CrawlDocsRequest(BaseModel):
12 |     """Request model for crawl-docs endpoint."""
13 |     urls: List[str]
14 |     source_type: str
15 | 
16 | class AnalyzeCodeRequest(BaseModel):
17 |     """Request model for analyze-code endpoint."""
18 |     code: str
19 |     context: Dict[str, Any]
20 | 
21 | class SearchKnowledgeRequest(BaseModel):
22 |     """Request model for search-knowledge endpoint."""
23 |     query: str
24 |     pattern_type: str
25 |     limit: int = 5
26 | 
27 | class CodeAnalysisRequest(BaseModel):
28 |     """Code analysis request model."""
29 |     
30 |     code: str
31 |     context: Optional[Dict[str, Any]] = None 
```

--------------------------------------------------------------------------------
/core_workflows.txt:
--------------------------------------------------------------------------------

```
 1 | # Core Workflows
 2 | 
 3 | ## User Journeys
 4 | 1. **Product Browsing**:
 5 |    - Relevant code files: [list of files responsible for navigation, product listing]
 6 |    - File sizes: [line counts for each key file]
 7 |   
 8 | 2. **Checkout Process**:
 9 |    - Relevant code files: [list of files responsible for cart management, payment handling]
10 |    - File sizes: [line counts for each key file]
11 |   
12 | 3. **User Authentication**:
13 |    - Relevant code files: [list of files responsible for login, logout, user session management]
14 |    - File sizes: [line counts for each key file]
15 | 
16 | ### Note:
17 | - The workflows and summaries provided are examples. Please modify them to fit the specific use case and structure of your application repository.
18 | - Pay special attention to large files, as they may represent core functionality or potential refactoring opportunities.
19 | 
```

--------------------------------------------------------------------------------
/summary_document.txt:
--------------------------------------------------------------------------------

```
 1 | # Application Summary
 2 | 
 3 | ## Architecture
 4 | This document provides a summary of the application's architecture, key modules, and their relationships.
 5 | 
 6 | ## Key Modules
 7 | - Placeholder for module descriptions.
 8 | - Include information about the functionality, dependencies, and interaction with other modules.
 9 | 
10 | ## Key Files by Size
11 | - See codebase_stats.txt for a complete listing of files by line count
12 | - The largest files often represent core functionality or areas that might need refactoring
13 | 
14 | ## High-Level Next Steps for LLM
15 | 1. Identify and generate module summaries for frontend, backend, and database.
16 | 2. Document core workflows and user journeys within the application.
17 | 3. Use the LLM relationship prompt (llm_relationship_prompt.txt) to generate a comprehensive relationship analysis.
18 | 4. Pay special attention to the largest files and their relationships to other components.
19 | 
20 | 
```

--------------------------------------------------------------------------------
/.github/workflows/publish.yml:
--------------------------------------------------------------------------------

```yaml
 1 | name: Publish to PyPI
 2 | 
 3 | on:
 4 |   push:
 5 |     tags:
 6 |       - 'v*'
 7 | 
 8 | jobs:
 9 |   deploy:
10 |     runs-on: ubuntu-latest
11 |     environment:
12 |       name: pypi
13 |       url: https://pypi.org/p/mcp-codebase-insight
14 |     permissions:
15 |       id-token: write
16 |       contents: read
17 | 
18 |     steps:
19 |     - uses: actions/checkout@v4
20 |       with:
21 |         fetch-depth: 0
22 |     
23 |     - name: Set up Python
24 |       uses: actions/[email protected]
25 |       with:
26 |         python-version: '3.x'
27 |     
28 |     - name: Install dependencies
29 |       run: |
30 |         python -m pip install --upgrade pip
31 |         pip install build twine
32 |     
33 |     - name: Build package
34 |       run: python -m build
35 |     
36 |     - name: Check distribution
37 |       run: |
38 |         python -m twine check dist/*
39 |         ls -l dist/
40 |     
41 |     - name: Publish to PyPI
42 |       env:
43 |         TWINE_USERNAME: __token__
44 |         TWINE_PASSWORD: ${{ secrets.PYPI_API_TOKEN }}
45 |       run: python -m twine upload dist/* 
```

--------------------------------------------------------------------------------
/package.json:
--------------------------------------------------------------------------------

```json
 1 | {
 2 |   "name": "vite-react-typescript-starter",
 3 |   "private": true,
 4 |   "version": "0.0.0",
 5 |   "type": "module",
 6 |   "scripts": {
 7 |     "dev": "vite",
 8 |     "build": "tsc && vite build",
 9 |     "lint": "eslint .",
10 |     "preview": "vite preview"
11 |   },
12 |   "dependencies": {
13 |     "@supabase/supabase-js": "^2.39.7",
14 |     "lucide-react": "^0.344.0",
15 |     "react": "^18.3.1",
16 |     "react-dom": "^18.3.1",
17 |     "react-router-dom": "^6.22.0",
18 |     "recharts": "^2.12.1"
19 |   },
20 |   "devDependencies": {
21 |     "@eslint/js": "^9.9.1",
22 |     "@tsconfig/recommended": "^1.0.3",
23 |     "@types/node": "^20.11.24",
24 |     "@types/react": "^18.3.5",
25 |     "@types/react-dom": "^18.3.0",
26 |     "@vitejs/plugin-react": "^4.3.1",
27 |     "autoprefixer": "^10.4.18",
28 |     "eslint": "^9.9.1",
29 |     "eslint-plugin-react-hooks": "^5.1.0-rc.0",
30 |     "eslint-plugin-react-refresh": "^0.4.11",
31 |     "globals": "^15.9.0",
32 |     "postcss": "^8.4.35",
33 |     "tailwindcss": "^3.4.1",
34 |     "typescript": "^5.5.3",
35 |     "typescript-eslint": "^8.3.0",
36 |     "vite": "^5.4.2"
37 |   }
38 | }
```

--------------------------------------------------------------------------------
/Dockerfile:
--------------------------------------------------------------------------------

```dockerfile
 1 | # Use Python 3.11 slim image
 2 | FROM python:3.11-slim
 3 | 
 4 | # Set working directory
 5 | WORKDIR /app
 6 | 
 7 | # Set environment variables
 8 | ENV PYTHONUNBUFFERED=1 \
 9 |     PYTHONDONTWRITEBYTECODE=1 \
10 |     PIP_NO_CACHE_DIR=1 \
11 |     PIP_DISABLE_PIP_VERSION_CHECK=1
12 | 
13 | # Install system dependencies
14 | RUN apt-get update \
15 |     && apt-get install -y --no-install-recommends \
16 |         build-essential \
17 |         curl \
18 |         git \
19 |     && rm -rf /var/lib/apt/lists/*
20 | 
21 | # Install Rust (needed for pydantic)
22 | RUN curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y
23 | ENV PATH="/root/.cargo/bin:${PATH}"
24 | 
25 | # Copy requirements file
26 | COPY requirements.txt .
27 | 
28 | # Install Python dependencies
29 | RUN pip install --no-cache-dir -r requirements.txt
30 | 
31 | # Copy source code
32 | COPY src/ src/
33 | COPY scripts/ scripts/
34 | 
35 | # Copy configuration files
36 | COPY .env.example .env
37 | 
38 | # Create necessary directories
39 | RUN mkdir -p \
40 |     docs/adrs \
41 |     knowledge \
42 |     cache \
43 |     logs
44 | 
45 | # Set permissions
46 | RUN chmod +x scripts/start_mcp_server.sh
47 | 
48 | # Expose port
49 | EXPOSE 3000
50 | 
51 | # Set entrypoint
52 | ENTRYPOINT ["scripts/start_mcp_server.sh"]
53 | 
54 | # Set default command
55 | CMD ["--host", "0.0.0.0", "--port", "3000"]
56 | 
```

--------------------------------------------------------------------------------
/docs/getting-started/qdrant_setup.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Qdrant Setup Guide
 2 | 
 3 | > 🚧 **Documentation In Progress**
 4 | > 
 5 | > This documentation is being actively developed. More details will be added soon.
 6 | 
 7 | ## Overview
 8 | 
 9 | This guide covers setting up Qdrant vector database for MCP Codebase Insight.
10 | 
11 | ## Installation Methods
12 | 
13 | ### 1. Using Docker (Recommended)
14 | 
15 | ```bash
16 | # Pull the Qdrant image
17 | docker pull qdrant/qdrant
18 | 
19 | # Start Qdrant container
20 | docker run -p 6333:6333 -v $(pwd)/qdrant_storage:/qdrant/storage qdrant/qdrant
21 | ```
22 | 
23 | ### 2. From Binary
24 | 
25 | Download from [Qdrant Releases](https://github.com/qdrant/qdrant/releases)
26 | 
27 | ### 3. From Source
28 | 
29 | ```bash
30 | git clone https://github.com/qdrant/qdrant
31 | cd qdrant
32 | cargo build --release
33 | ```
34 | 
35 | ## Configuration
36 | 
37 | 1. **Create Collection**
38 |    ```python
39 |    from qdrant_client import QdrantClient
40 |    
41 |    client = QdrantClient("localhost", port=6333)
42 |    client.create_collection(
43 |        collection_name="code_vectors",
44 |        vectors_config={"size": 384, "distance": "Cosine"}
45 |    )
46 |    ```
47 | 
48 | 2. **Verify Setup**
49 |    ```bash
50 |    curl http://localhost:6333/collections/code_vectors
51 |    ```
52 | 
53 | ## Next Steps
54 | 
55 | - [Configuration Guide](configuration.md)
56 | - [Quick Start Guide](quickstart.md)
57 | - [API Reference](../api/rest-api.md) 
```

--------------------------------------------------------------------------------
/tests/components/test_embeddings.py:
--------------------------------------------------------------------------------

```python
 1 | 
 2 | import sys
 3 | import os
 4 | 
 5 | # Ensure the src directory is in the Python path
 6 | sys.path.insert(0, os.path.abspath(os.path.join(os.path.dirname(__file__), '../../')))
 7 | 
 8 | import pytest
 9 | import asyncio
10 | from src.mcp_codebase_insight.core.embeddings import SentenceTransformerEmbedding
11 | 
12 | @pytest.mark.asyncio
13 | async def test_embedder_initialization():
14 |     """Test that embedder initializes correctly."""
15 |     embedder = SentenceTransformerEmbedding()
16 |     try:
17 |         await asyncio.wait_for(embedder.initialize(), timeout=60.0)
18 |         assert embedder.model is not None
19 |         assert embedder.vector_size == 384  # Default size for all-MiniLM-L6-v2
20 |     except asyncio.TimeoutError:
21 |         pytest.fail("Embedder initialization timed out")
22 |     except Exception as e:
23 |         pytest.fail(f"Embedder initialization failed: {str(e)}")
24 | 
25 | @pytest.mark.asyncio
26 | async def test_embedder_embedding():
27 |     """Test that embedder can generate embeddings."""
28 |     embedder = SentenceTransformerEmbedding()
29 |     await embedder.initialize()
30 |     
31 |     # Test single text embedding
32 |     text = "Test text"
33 |     embedding = await embedder.embed(text)
34 |     assert len(embedding) == embedder.vector_size
35 |     
36 |     # Test batch embedding
37 |     texts = ["Test text 1", "Test text 2"]
38 |     embeddings = await embedder.embed_batch(texts)
39 |     assert len(embeddings) == 2
40 |     assert all(len(emb) == embedder.vector_size for emb in embeddings) 
```

--------------------------------------------------------------------------------
/async_fixture_wrapper.py:
--------------------------------------------------------------------------------

```python
 1 | """
 2 | Async Fixture Wrapper for Component Tests
 3 | 
 4 | This script serves as a wrapper for running component tests with complex async fixtures
 5 | to ensure they are properly awaited in isolated test mode.
 6 | """
 7 | import os
 8 | import sys
 9 | import asyncio
10 | import pytest
11 | import importlib
12 | from pathlib import Path
13 | 
14 | def run_with_async_fixture_support():
15 |     """Run pytest with proper async fixture support."""
16 |     # Get the module path and test name from command line arguments
17 |     if len(sys.argv) < 3:
18 |         print("Usage: python async_fixture_wrapper.py <module_path> <test_name>")
19 |         sys.exit(1)
20 |     
21 |     module_path = sys.argv[1]
22 |     test_name = sys.argv[2]
23 |     
24 |     # Configure event loop policy for macOS if needed
25 |     if sys.platform == 'darwin':
26 |         import platform
27 |         if int(platform.mac_ver()[0].split('.')[0]) >= 10:
28 |             # macOS 10+ - use the right event loop policy
29 |             asyncio.set_event_loop_policy(asyncio.DefaultEventLoopPolicy())
30 |     
31 |     # Ensure PYTHONPATH is set correctly
32 |     base_dir = str(Path(module_path).parent.parent)
33 |     sys.path.insert(0, base_dir)
34 |     
35 |     # Build pytest args
36 |     pytest_args = [module_path, f"-k={test_name}", "--asyncio-mode=strict"]
37 |     
38 |     # Add any additional args
39 |     if len(sys.argv) > 3:
40 |         pytest_args.extend(sys.argv[3:])
41 |     
42 |     # Run the test
43 |     exit_code = pytest.main(pytest_args)
44 |     
45 |     sys.exit(exit_code)
46 | 
47 | if __name__ == "__main__":
48 |     run_with_async_fixture_support()
49 | 
```

--------------------------------------------------------------------------------
/PULL_REQUEST.md:
--------------------------------------------------------------------------------

```markdown
 1 | # GitHub Actions Workflow Improvements
 2 | 
 3 | @coderabbit I'd like to request your detailed review of our GitHub Actions workflows.
 4 | 
 5 | ## Overview
 6 | 
 7 | This PR aims to improve the GitHub Actions workflows in our repository by:
 8 | 
 9 | 1. **Documenting** all existing workflows
10 | 2. **Addressing** the test pattern issue in build-verification.yml
11 | 3. **Extracting** common functionality into reusable scripts
12 | 4. **Standardizing** practices across different workflows
13 | 
14 | ## Changes
15 | 
16 | - Added comprehensive documentation of all GitHub Actions workflows
17 | - Fixed the wildcard pattern issue (`test_*`) in build-verification.yml
18 | - Extracted Qdrant health check logic into a reusable script
19 | - Added README for the scripts directory
20 | 
21 | ## Benefits
22 | 
23 | - **Maintainability**: Common logic is now in a single location
24 | - **Readability**: Workflows are cleaner and better documented
25 | - **Reliability**: Fixed test pattern ensures more consistent test execution
26 | - **Extensibility**: Easier to add new workflows or modify existing ones
27 | 
28 | ## Request for Review
29 | 
30 | @coderabbit, I'm particularly interested in your feedback on:
31 | 
32 | 1. Workflow structure and organization
33 | 2. Any redundancies or inefficiencies you notice
34 | 3. Any missing best practices
35 | 4. Suggestions for further improvements
36 | 
37 | ## Future Improvements
38 | 
39 | We're planning to implement additional enhancements based on your feedback:
40 | 
41 | - Extract more common functionality into reusable actions
42 | - Standardize environment variables across workflows
43 | - Improve caching strategies
44 | - Add workflow dependencies to avoid redundant work
45 | 
46 | Thank you for your time and expertise! 
```

--------------------------------------------------------------------------------
/run_test_with_path_fix.sh:
--------------------------------------------------------------------------------

```bash
 1 | #!/bin/bash
 2 | # This script runs tests with a fix for the Python path issue
 3 | 
 4 | set -e
 5 | 
 6 | # Activate the virtual environment
 7 | source .venv/bin/activate
 8 | 
 9 | # Setup environment for Qdrant
10 | export MCP_TEST_MODE=1
11 | export QDRANT_URL="http://localhost:6333"
12 | export MCP_COLLECTION_NAME="test_collection_$(date +%s)"
13 | export PYTHONPATH="$PYTHONPATH:$(pwd)"
14 | 
15 | # Initialize Qdrant collection for testing
16 | echo "Creating Qdrant collection for testing..."
17 | python - << EOF
18 | import os
19 | from qdrant_client import QdrantClient
20 | from qdrant_client.http import models
21 | 
22 | # Connect to Qdrant
23 | client = QdrantClient(url="http://localhost:6333")
24 | collection_name = os.environ.get("MCP_COLLECTION_NAME")
25 | 
26 | # Check if collection exists
27 | collections = client.get_collections().collections
28 | collection_names = [c.name for c in collections]
29 | 
30 | if collection_name in collection_names:
31 |     print(f"Collection {collection_name} already exists, recreating it...")
32 |     client.delete_collection(collection_name=collection_name)
33 | 
34 | # Create collection with vector size 384 (for all-MiniLM-L6-v2)
35 | client.create_collection(
36 |     collection_name=collection_name,
37 |     vectors_config=models.VectorParams(
38 |         size=384,  # Dimension for all-MiniLM-L6-v2
39 |         distance=models.Distance.COSINE,
40 |     ),
41 | )
42 | 
43 | # Create test directory that might be needed
44 | os.makedirs("qdrant_storage", exist_ok=True)
45 | 
46 | print(f"Successfully created collection {collection_name}")
47 | EOF
48 | 
49 | # Run all component tests in vector_store
50 | echo "Running all vector store tests with component_test_runner.py..."
51 | python component_test_runner.py tests/components/test_vector_store.py
52 | 
```

--------------------------------------------------------------------------------
/test_imports.py:
--------------------------------------------------------------------------------

```python
 1 | #!/usr/bin/env python3
 2 | """
 3 | Test script to verify imports work correctly
 4 | """
 5 | 
 6 | import sys
 7 | import importlib
 8 | import os
 9 | 
10 | def test_import(module_name):
11 |     try:
12 |         module = importlib.import_module(module_name)
13 |         print(f"✅ Successfully imported {module_name}")
14 |         return True
15 |     except ImportError as e:
16 |         print(f"❌ Failed to import {module_name}: {e}")
17 |         return False
18 |     
19 | def print_path():
20 |     print("\nPython Path:")
21 |     for i, path in enumerate(sys.path):
22 |         print(f"{i}: {path}")
23 | 
24 | def main():
25 |     print("=== Testing Package Imports ===")
26 |     
27 |     print("\nEnvironment:")
28 |     print(f"Python version: {sys.version}")
29 |     print(f"Working directory: {os.getcwd()}")
30 |     
31 |     print("\nTesting core package imports:")
32 |     
33 |     # First ensure the parent directory is in the path
34 |     sys.path.insert(0, os.getcwd())
35 |     print_path()
36 |     
37 |     print("\nTesting imports:")
38 |     
39 |     # Test basic Python imports
40 |     test_import("os")
41 |     test_import("sys")
42 |     
43 |     # Test ML/NLP packages
44 |     test_import("torch")
45 |     test_import("numpy")
46 |     test_import("transformers")
47 |     test_import("sentence_transformers")
48 |     
49 |     # Test FastAPI and web packages
50 |     test_import("fastapi")
51 |     test_import("starlette")
52 |     test_import("pydantic")
53 |     
54 |     # Test database packages
55 |     test_import("qdrant_client")
56 |     
57 |     # Test project specific modules
58 |     test_import("src.mcp_codebase_insight.core.config")
59 |     test_import("src.mcp_codebase_insight.core.embeddings")
60 |     test_import("src.mcp_codebase_insight.core.vector_store")
61 |     
62 |     print("\n=== Testing Complete ===")
63 | 
64 | if __name__ == "__main__":
65 |     main()
66 | 
```

--------------------------------------------------------------------------------
/scripts/setup_qdrant.sh:
--------------------------------------------------------------------------------

```bash
 1 | #!/bin/bash
 2 | 
 3 | # Script to set up Qdrant for MCP Codebase Insight
 4 | set -e
 5 | 
 6 | # Colors for output
 7 | GREEN='\033[0;32m'
 8 | RED='\033[0;31m'
 9 | NC='\033[0m' # No Color
10 | 
11 | echo "Setting up Qdrant for MCP Codebase Insight..."
12 | 
13 | # Check if Docker is running
14 | if ! docker info > /dev/null 2>&1; then
15 |     echo -e "${RED}Error: Docker is not running${NC}"
16 |     exit 1
17 | fi
18 | 
19 | # Check if port 6333 is available
20 | if lsof -Pi :6333 -sTCP:LISTEN -t >/dev/null ; then
21 |     echo -e "${RED}Warning: Port 6333 is already in use${NC}"
22 |     echo "Checking if it's a Qdrant instance..."
23 |     if curl -s http://localhost:6333/health > /dev/null; then
24 |         echo -e "${GREEN}Existing Qdrant instance detected and healthy${NC}"
25 |         exit 0
26 |     else
27 |         echo -e "${RED}Port 6333 is in use by another service${NC}"
28 |         exit 1
29 |     fi
30 | fi
31 | 
32 | # Create data directory if it doesn't exist
33 | mkdir -p ./qdrant_data
34 | 
35 | # Stop and remove existing container if it exists
36 | if docker ps -a | grep -q mcp-qdrant; then
37 |     echo "Removing existing mcp-qdrant container..."
38 |     docker stop mcp-qdrant || true
39 |     docker rm mcp-qdrant || true
40 | fi
41 | 
42 | # Pull latest Qdrant image
43 | echo "Pulling latest Qdrant image..."
44 | docker pull qdrant/qdrant:latest
45 | 
46 | # Start Qdrant container
47 | echo "Starting Qdrant container..."
48 | docker run -d \
49 |     --name mcp-qdrant \
50 |     -p 6333:6333 \
51 |     -v "$(pwd)/qdrant_data:/qdrant/storage" \
52 |     qdrant/qdrant
53 | 
54 | # Wait for Qdrant to be ready
55 | echo "Waiting for Qdrant to be ready..."
56 | for i in {1..30}; do
57 |     if curl -s http://localhost:6333/health > /dev/null; then
58 |         echo -e "${GREEN}Qdrant is ready!${NC}"
59 |         exit 0
60 |     fi
61 |     echo "Waiting... ($i/30)"
62 |     sleep 1
63 | done
64 | 
65 | echo -e "${RED}Error: Qdrant failed to start within 30 seconds${NC}"
66 | exit 1
67 | 
```

--------------------------------------------------------------------------------
/scripts/check_qdrant_health.sh:
--------------------------------------------------------------------------------

```bash
 1 | #!/bin/bash
 2 | set -euo pipefail
 3 | # Script to check if Qdrant service is available and healthy
 4 | # Usage: ./check_qdrant_health.sh [qdrant_url] [max_retries] [sleep_seconds]
 5 | 
 6 | # Default values
 7 | QDRANT_URL=${1:-"http://localhost:6333"}
 8 | MAX_RETRIES=${2:-20}
 9 | SLEEP_SECONDS=${3:-5}
10 | 
11 | echo "Checking Qdrant health at $QDRANT_URL (max $MAX_RETRIES attempts with $SLEEP_SECONDS seconds delay)"
12 | 
13 | # Install dependencies if not present
14 | if ! command -v curl &> /dev/null || ! command -v jq &> /dev/null; then
15 |     echo "Installing required dependencies..."
16 |     apt-get update &> /dev/null && apt-get install -y curl jq &> /dev/null || true
17 | fi
18 | 
19 | # Check if dependencies are available
20 | if ! command -v curl &> /dev/null; then
21 |     echo "Error: curl command not found and could not be installed"
22 |     exit 1
23 | fi
24 | 
25 | if ! command -v jq &> /dev/null; then
26 |     echo "Warning: jq command not found and could not be installed. JSON validation will be skipped."
27 |     JQ_AVAILABLE=false
28 | else
29 |     JQ_AVAILABLE=true
30 | fi
31 | 
32 | # Wait for Qdrant to be available
33 | retry_count=0
34 | until [ "$(curl -s -o /dev/null -w "%{http_code}" "$QDRANT_URL/collections")" -eq 200 ] || [ "$retry_count" -eq "$MAX_RETRIES" ]
35 | do
36 |     echo "Waiting for Qdrant... (attempt $retry_count of $MAX_RETRIES)"
37 |     sleep "$SLEEP_SECONDS"
38 |     retry_count=$((retry_count+1))
39 | done
40 | 
41 | if [ "$retry_count" -eq "$MAX_RETRIES" ]; then
42 |     echo "Qdrant service failed to become available after $((MAX_RETRIES * SLEEP_SECONDS)) seconds"
43 |     exit 1
44 | fi
45 | 
46 | # Check for valid JSON response if jq is available
47 | if [ "$JQ_AVAILABLE" = true ]; then
48 |     if ! curl -s "$QDRANT_URL/collections" | jq . > /dev/null; then
49 |         echo "Qdrant did not return valid JSON."
50 |         exit 1
51 |     fi
52 | fi
53 | 
54 | echo "Qdrant service is accessible and healthy."
55 | exit 0 
```

--------------------------------------------------------------------------------
/docs/qdrant_setup.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Qdrant Setup Guide
 2 | 
 3 | ## Overview
 4 | This document outlines the setup and maintenance procedures for the Qdrant vector database instance required for running tests and development.
 5 | 
 6 | ## Prerequisites
 7 | - Docker installed and running
 8 | - Port 6333 available on localhost
 9 | - Python 3.8+ with pip
10 | 
11 | ## Setup Options
12 | 
13 | ### Option 1: Docker Container (Recommended for Development)
14 | ```bash
15 | # Pull the latest Qdrant image
16 | docker pull qdrant/qdrant:latest
17 | 
18 | # Run Qdrant container
19 | docker run -d \
20 |   --name mcp-qdrant \
21 |   -p 6333:6333 \
22 |   -v $(pwd)/qdrant_data:/qdrant/storage \
23 |   qdrant/qdrant
24 | 
25 | # Verify the instance is running
26 | curl http://localhost:6333/health
27 | ```
28 | 
29 | ### Option 2: Pre-existing Instance
30 | If using a pre-existing Qdrant instance:
31 | 1. Ensure it's accessible at `localhost:6333`
32 | 2. Verify health status
33 | 3. Configure environment variables if needed:
34 | ```bash
35 | export QDRANT_HOST=localhost
36 | export QDRANT_PORT=6333
37 | ```
38 | 
39 | ## Health Check
40 | ```python
41 | from qdrant_client import QdrantClient
42 | 
43 | client = QdrantClient(host="localhost", port=6333)
44 | health = client.health()
45 | print(f"Qdrant health status: {health}")
46 | ```
47 | 
48 | ## Maintenance
49 | - Regular health checks are automated in CI/CD pipeline
50 | - Database backups are stored in `./qdrant_data`
51 | - Version updates should be coordinated with the team
52 | 
53 | ## Troubleshooting
54 | 1. If container fails to start:
55 |    ```bash
56 |    # Check logs
57 |    docker logs mcp-qdrant
58 |    
59 |    # Verify port availability
60 |    lsof -i :6333
61 |    ```
62 | 
63 | 2. If connection fails:
64 |    ```bash
65 |    # Restart container
66 |    docker restart mcp-qdrant
67 |    
68 |    # Check container status
69 |    docker ps -a | grep mcp-qdrant
70 |    ```
71 | 
72 | ## Responsible Parties
73 | - Primary maintainer: DevOps Team
74 | - Documentation updates: Development Team Lead
75 | - Testing coordination: QA Team Lead
76 | 
77 | ## Version Control
78 | - Document version: 1.0
79 | - Last updated: 2025-03-24
80 | - Next review: 2025-06-24
81 | 
```

--------------------------------------------------------------------------------
/setup_qdrant_collection.py:
--------------------------------------------------------------------------------

```python
 1 | from qdrant_client import QdrantClient
 2 | from qdrant_client.http import models
 3 | from qdrant_client.http.models import Distance, VectorParams
 4 | 
 5 | def setup_collection():
 6 |     # Connect to Qdrant
 7 |     client = QdrantClient(
 8 |         url='https://e67ee53a-6e03-4526-9e41-3fde622323a9.us-east4-0.gcp.cloud.qdrant.io:6333',
 9 |         api_key='eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJhY2Nlc3MiOiJtIiwiZXhwIjoxNzQ1MTAyNzQ3fQ.3gvK8M7dJxZkSpyzpJtTGVUhjyjgbYEhEvl2aG7JodM'
10 |     )
11 |     
12 |     collection_name = "mcp-codebase-insight"
13 |     
14 |     try:
15 |         # Check if collection exists
16 |         collections = client.get_collections().collections
17 |         exists = any(c.name == collection_name for c in collections)
18 |         
19 |         # If collection exists, recreate it
20 |         if exists:
21 |             print(f"\nRemoving existing collection '{collection_name}'")
22 |             client.delete_collection(collection_name=collection_name)
23 |         
24 |         # Create a new collection with named vector configurations
25 |         print(f"\nCreating collection '{collection_name}' with named vectors")
26 |         
27 |         # Create named vectors configuration
28 |         vectors_config = {
29 |             # For the default MCP server embedding model (all-MiniLM-L6-v2)
30 |             "fast-all-minilm-l6-v2": VectorParams(
31 |                 size=384,  # all-MiniLM-L6-v2 produces 384-dimensional vectors
32 |                 distance=Distance.COSINE
33 |             )
34 |         }
35 |         
36 |         client.create_collection(
37 |             collection_name=collection_name,
38 |             vectors_config=vectors_config
39 |         )
40 |         
41 |         # Verify the collection was created properly
42 |         collection_info = client.get_collection(collection_name=collection_name)
43 |         print(f"\nCollection '{collection_name}' created successfully")
44 |         print(f"Vector configuration: {collection_info.config.params.vectors}")
45 |         
46 |         print("\nCollection is ready for the MCP server")
47 |         
48 |     except Exception as e:
49 |         print(f"\nError setting up collection: {e}")
50 | 
51 | if __name__ == '__main__':
52 |     setup_collection() 
```

--------------------------------------------------------------------------------
/docs/vector_store_best_practices.md:
--------------------------------------------------------------------------------

```markdown
 1 | # VectorStore Best Practices
 2 | 
 3 | This document outlines best practices for working with the VectorStore component in the MCP Codebase Insight project.
 4 | 
 5 | ## Metadata Structure
 6 | 
 7 | To ensure consistency and prevent `KeyError` exceptions, always follow these metadata structure guidelines:
 8 | 
 9 | ### Required Fields
10 | 
11 | Always include these fields in your metadata when adding vectors:
12 | 
13 | - `type`: The type of content (e.g., "code", "documentation", "pattern")
14 | - `language`: Programming language if applicable (e.g., "python", "javascript")
15 | - `title`: Short descriptive title
16 | - `description`: Longer description of the content
17 | 
18 | ### Accessing Metadata
19 | 
20 | Always use the `.get()` method with a default value when accessing metadata fields:
21 | 
22 | ```python
23 | # Good - safe access pattern
24 | result.metadata.get("type", "code")
25 | 
26 | # Bad - can cause KeyError
27 | result.metadata["type"]
28 | ```
29 | 
30 | ## Initialization and Cleanup
31 | 
32 | Follow these best practices for proper initialization and cleanup:
33 | 
34 | 1. Always `await vector_store.initialize()` before using a VectorStore
35 | 2. Always `await vector_store.cleanup()` in test teardown/finally blocks
36 | 3. Use unique collection names in tests to prevent conflicts
37 | 4. Check `vector_store.initialized` status before operations
38 | 
39 | Example:
40 | 
41 | ```python
42 | try:
43 |     store = VectorStore(url, embedder, collection_name=unique_name)
44 |     await store.initialize()
45 |     # Use the store...
46 | finally:
47 |     await store.cleanup()
48 |     await store.close()
49 | ```
50 | 
51 | ## Vector Names and Dimensions
52 | 
53 | - Use consistent vector dimensions (384 for all-MiniLM-L6-v2)
54 | - Be careful when overriding the vector_name parameter
55 | - Ensure embedder and vector store are compatible
56 | 
57 | ## Error Handling
58 | 
59 | - Check for component availability before use
60 | - Handle initialization errors gracefully
61 | - Log failures with meaningful messages
62 | 
63 | ## Testing Guidelines
64 | 
65 | 1. Use isolated test collections with unique names
66 | 2. Clean up all test data after tests
67 | 3. Verify metadata structure in tests
68 | 4. Use standardized test data fixtures
69 | 5. Test both positive and negative paths
70 | 
71 | By following these guidelines, you can avoid common issues like the "KeyError: 'type'" problem that was occurring in the codebase. 
```

--------------------------------------------------------------------------------
/scripts/macos_install.sh:
--------------------------------------------------------------------------------

```bash
 1 | #!/bin/bash
 2 | 
 3 | # Exit on error
 4 | set -e
 5 | 
 6 | echo "Installing MCP Codebase Insight development environment..."
 7 | 
 8 | # Check for Homebrew
 9 | if ! command -v brew &> /dev/null; then
10 |     echo "Installing Homebrew..."
11 |     /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
12 | else
13 |     echo "Homebrew already installed, updating..."
14 |     brew update
15 | fi
16 | 
17 | # Check for Python
18 | if ! command -v python3 &> /dev/null; then
19 |     echo "Installing Python..."
20 |     brew install [email protected]
21 | else
22 |     echo "Python already installed"
23 | fi
24 | 
25 | # Check for Docker
26 | if ! command -v docker &> /dev/null; then
27 |     echo "Installing Docker..."
28 |     brew install --cask docker
29 |     
30 |     echo "Starting Docker..."
31 |     open -a Docker
32 |     
33 |     # Wait for Docker to start
34 |     echo "Waiting for Docker to start..."
35 |     while ! docker info &> /dev/null; do
36 |         sleep 1
37 |     done
38 | else
39 |     echo "Docker already installed"
40 | fi
41 | 
42 | # Create virtual environment
43 | echo "Creating virtual environment..."
44 | python3.11 -m venv .venv
45 | 
46 | # Activate virtual environment
47 | echo "Activating virtual environment..."
48 | source .venv/bin/activate
49 | 
50 | # Install dependencies
51 | echo "Installing Python dependencies..."
52 | pip install --upgrade pip
53 | pip install -r requirements.txt
54 | 
55 | # Start Qdrant
56 | echo "Starting Qdrant container..."
57 | if ! docker ps | grep -q qdrant; then
58 |     docker run -d -p 6333:6333 -p 6334:6334 \
59 |         -v $(pwd)/qdrant_storage:/qdrant/storage \
60 |         qdrant/qdrant
61 |     echo "Qdrant container started"
62 | else
63 |     echo "Qdrant container already running"
64 | fi
65 | 
66 | # Create required directories
67 | echo "Creating project directories..."
68 | mkdir -p docs/adrs
69 | mkdir -p docs/templates
70 | mkdir -p knowledge/patterns
71 | mkdir -p references
72 | mkdir -p logs/debug
73 | 
74 | # Copy environment file if it doesn't exist
75 | if [ ! -f .env ]; then
76 |     echo "Creating .env file..."
77 |     cp .env.example .env
78 |     echo "Please update .env with your settings"
79 | fi
80 | 
81 | # Load example patterns
82 | echo "Loading example patterns..."
83 | python scripts/load_example_patterns.py
84 | 
85 | echo "
86 | Installation complete! 🎉
87 | 
88 | To start development:
89 | 1. Update .env with your settings
90 | 2. Activate the virtual environment:
91 |    source .venv/bin/activate
92 | 3. Start the server:
93 |    make run
94 | 
95 | For more information, see the README.md file.
96 | "
97 | 
```

--------------------------------------------------------------------------------
/start-mcpserver.sh:
--------------------------------------------------------------------------------

```bash
 1 |               #!/bin/bash
 2 | # This script starts the MCP Qdrant server with SSE transport
 3 | set -x
 4 | source .venv/bin/activate
 5 | # Set the PATH to include the local bin directory
 6 | export PATH="$HOME/.local/bin:$PATH"
 7 | 
 8 | # Define environment variables
 9 | export COLLECTION_NAME="mcp-codebase-insight"
10 | export EMBEDDING_MODEL="sentence-transformers/all-MiniLM-L6-v2"
11 | export QDRANT_URL="${QDRANT_URL:-http://localhost:6333}"
12 | export QDRANT_API_KEY="eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJhY2Nlc3MiOiJtIiwiZXhwIjoxNzQ1MTAyNzQ3fQ.3gvK8M7dJxZkSpyzpJtTGVUhjyjgbYEhEvl2aG7JodM"
13 | 
14 | # Define tool descriptions
15 | TOOL_STORE_DESCRIPTION="Store reusable code snippets and test results. 'information' contains a description. 'metadata' is a dictionary with a 'type' key: 'code' for code snippets, 'test_result' for test results. For 'code', 'metadata' includes a 'code' key with the code. For 'test_result', 'metadata' includes 'test_name', 'status' (pass/fail), and 'error_message'."
16 | 
17 | TOOL_FIND_DESCRIPTION="Search for code snippets and test results. The 'query' parameter describes what you're looking for. Returned results will have a 'metadata' field with a 'type' key indicating 'code' or 'test_result'. Use this to find code or analyze test failures."
18 | 
19 | # Default port for the SSE transport (can be overridden with PORT env var)
20 | PORT="${PORT:-8000}"
21 | 
22 | # Determine transport type (default to sse if not specified)
23 | TRANSPORT="${TRANSPORT:-sse}"
24 | 
25 | # Check if uvx and mcp-server-qdrant are installed
26 | if ! command -v uvx &> /dev/null; then
27 |     echo "Error: uvx is not installed. Please install it with: pip install uvx"
28 |     exit 1
29 | fi
30 | 
31 | if ! python -c "import importlib.util; print(importlib.util.find_spec('mcp_server_qdrant') is not None)" | grep -q "True"; then
32 |     echo "Error: mcp-server-qdrant is not installed. Please install it with: pip install mcp-server-qdrant"
33 |     exit 1
34 | fi
35 | 
36 | echo "Starting MCP Qdrant server with $TRANSPORT transport on port $PORT..."
37 | 
38 | # Run the MCP Qdrant server with the specified transport
39 | if [ "$TRANSPORT" = "sse" ]; then
40 |     # For SSE transport, we need to specify the port
41 |     uvx mcp-server-qdrant --transport sse --port $PORT
42 | else
43 |     # For other transports (e.g., stdio which is the default)
44 |     uvx mcp-server-qdrant
45 | fi
46 | 
```

--------------------------------------------------------------------------------
/docs/testing_guide.md:
--------------------------------------------------------------------------------

```markdown
 1 | # Testing Guide for MCP Codebase Insight
 2 | 
 3 | ## Asynchronous Testing
 4 | 
 5 | The MCP Codebase Insight project uses asynchronous APIs and should be tested using proper async test clients. Here are guidelines for testing:
 6 | 
 7 | ### Async vs Sync Testing Clients
 8 | 
 9 | The project provides two test client fixtures:
10 | 
11 | 1. **`test_client`** - Use for asynchronous tests
12 |    - Returns an `AsyncClient` from httpx
13 |    - Must be used with `await` for requests
14 |    - Must be used with `@pytest.mark.asyncio` decorator
15 | 
16 | 2. **`sync_test_client`** - Use for synchronous tests
17 |    - Returns a `TestClient` from FastAPI
18 |    - Used for simpler tests where async is not needed
19 |    - No need for await or asyncio decorators
20 | 
21 | ### Example: Async Test
22 | 
23 | ```python
24 | import pytest
25 | 
26 | @pytest.mark.asyncio
27 | async def test_my_endpoint(test_client):
28 |     """Test an endpoint asynchronously."""
29 |     response = await test_client.get("/my-endpoint")
30 |     assert response.status_code == 200
31 |     data = response.json()
32 |     assert "result" in data
33 | ```
34 | 
35 | ### Example: Sync Test
36 | 
37 | ```python
38 | def test_simple_endpoint(sync_test_client):
39 |     """Test an endpoint synchronously."""
40 |     response = sync_test_client.get("/simple-endpoint")
41 |     assert response.status_code == 200
42 | ```
43 | 
44 | ### Common Issues
45 | 
46 | 1. **Using TestClient with async:** The error `'TestClient' object does not support the asynchronous context manager protocol` occurs when trying to use TestClient in an async context. Always use the `test_client` fixture for async tests.
47 | 
48 | 2. **Mixing async/sync:** Don't mix async and sync patterns in the same test.
49 | 
50 | 3. **Missing asyncio mark:** Always add `@pytest.mark.asyncio` to async test functions.
51 | 
52 | ## Test Isolation
53 | 
54 | Tests should be isolated to prevent state interference between tests:
55 | 
56 | 1. Each test gets its own server instance with isolated state
57 | 2. Vector store tests use unique collection names
58 | 3. Cleanup is performed automatically after tests
59 | 
60 | ## Running Tests
61 | 
62 | Run tests using pytest:
63 | 
64 | ```bash
65 | # Run all tests
66 | pytest
67 | 
68 | # Run specific test file
69 | pytest tests/test_file_relationships.py
70 | 
71 | # Run specific test function
72 | pytest tests/test_file_relationships.py::test_create_file_relationship
73 | ```
74 | 
75 | For more advanced test running options, use the `run_tests.py` script in the project root. 
```

--------------------------------------------------------------------------------
/.compile-venv-py3.11/bin/activate.fish:
--------------------------------------------------------------------------------

```
 1 | # This file must be used with "source <venv>/bin/activate.fish" *from fish*
 2 | # (https://fishshell.com/); you cannot run it directly.
 3 | 
 4 | function deactivate  -d "Exit virtual environment and return to normal shell environment"
 5 |     # reset old environment variables
 6 |     if test -n "$_OLD_VIRTUAL_PATH"
 7 |         set -gx PATH $_OLD_VIRTUAL_PATH
 8 |         set -e _OLD_VIRTUAL_PATH
 9 |     end
10 |     if test -n "$_OLD_VIRTUAL_PYTHONHOME"
11 |         set -gx PYTHONHOME $_OLD_VIRTUAL_PYTHONHOME
12 |         set -e _OLD_VIRTUAL_PYTHONHOME
13 |     end
14 | 
15 |     if test -n "$_OLD_FISH_PROMPT_OVERRIDE"
16 |         set -e _OLD_FISH_PROMPT_OVERRIDE
17 |         # prevents error when using nested fish instances (Issue #93858)
18 |         if functions -q _old_fish_prompt
19 |             functions -e fish_prompt
20 |             functions -c _old_fish_prompt fish_prompt
21 |             functions -e _old_fish_prompt
22 |         end
23 |     end
24 | 
25 |     set -e VIRTUAL_ENV
26 |     set -e VIRTUAL_ENV_PROMPT
27 |     if test "$argv[1]" != "nondestructive"
28 |         # Self-destruct!
29 |         functions -e deactivate
30 |     end
31 | end
32 | 
33 | # Unset irrelevant variables.
34 | deactivate nondestructive
35 | 
36 | set -gx VIRTUAL_ENV /Users/tosinakinosho/workspaces/mcp-codebase-insight/.compile-venv-py3.11
37 | 
38 | set -gx _OLD_VIRTUAL_PATH $PATH
39 | set -gx PATH "$VIRTUAL_ENV/"bin $PATH
40 | 
41 | # Unset PYTHONHOME if set.
42 | if set -q PYTHONHOME
43 |     set -gx _OLD_VIRTUAL_PYTHONHOME $PYTHONHOME
44 |     set -e PYTHONHOME
45 | end
46 | 
47 | if test -z "$VIRTUAL_ENV_DISABLE_PROMPT"
48 |     # fish uses a function instead of an env var to generate the prompt.
49 | 
50 |     # Save the current fish_prompt function as the function _old_fish_prompt.
51 |     functions -c fish_prompt _old_fish_prompt
52 | 
53 |     # With the original prompt function renamed, we can override with our own.
54 |     function fish_prompt
55 |         # Save the return status of the last command.
56 |         set -l old_status $status
57 | 
58 |         # Output the venv prompt; color taken from the blue of the Python logo.
59 |         printf "%s%s%s" (set_color 4B8BBE) '(.compile-venv-py3.11) ' (set_color normal)
60 | 
61 |         # Restore the return status of the previous command.
62 |         echo "exit $old_status" | .
63 |         # Output the original/"old" prompt.
64 |         _old_fish_prompt
65 |     end
66 | 
67 |     set -gx _OLD_FISH_PROMPT_OVERRIDE "$VIRTUAL_ENV"
68 |     set -gx VIRTUAL_ENV_PROMPT '(.compile-venv-py3.11) '
69 | end
70 | 
```

--------------------------------------------------------------------------------
/setup.py:
--------------------------------------------------------------------------------

```python
 1 | from setuptools import setup, find_packages
 2 | import re
 3 | import os
 4 | 
 5 | # Read version from __init__.py
 6 | with open(os.path.join("src", "mcp_codebase_insight", "__init__.py"), "r") as f:
 7 |     version_match = re.search(r"^__version__ = ['\"]([^'\"]*)['\"]", f.read(), re.M)
 8 |     if version_match:
 9 |         version = version_match.group(1)
10 |     else:
11 |         raise RuntimeError("Unable to find version string")
12 | 
13 | setup(
14 |     name="mcp-codebase-insight",
15 |     version=version,
16 |     description="Model Context Protocol (MCP) server for codebase analysis and insights",
17 |     long_description=open("README.md").read(),
18 |     long_description_content_type="text/markdown",
19 |     author="Model Context Protocol",
20 |     author_email="[email protected]",
21 |     url="https://github.com/modelcontextprotocol/mcp-codebase-insight",
22 |     packages=find_packages(where="src"),
23 |     package_dir={"": "src"},
24 |     install_requires=[
25 |         "fastapi>=0.103.2,<0.104.0",
26 |         "uvicorn>=0.23.2,<0.24.0",
27 |         "pydantic>=2.4.2,<3.0.0",
28 |         "starlette>=0.27.0,<0.28.0",
29 |         "asyncio>=3.4.3",
30 |         "aiohttp>=3.9.0,<4.0.0",
31 |         "qdrant-client>=1.13.3",
32 |         "sentence-transformers>=2.2.2",
33 |         "torch>=2.0.0",
34 |         "transformers>=4.34.0,<5.0.0",
35 |         "python-frontmatter>=1.0.0",
36 |         "markdown>=3.4.4",
37 |         "PyYAML>=6.0.1",
38 |         "structlog>=23.1.0",
39 |         "psutil>=5.9.5",
40 |         "python-dotenv>=1.0.0",
41 |         "requests>=2.31.0",
42 |         "beautifulsoup4>=4.12.0",
43 |         "scipy>=1.11.0",
44 |         "numpy>=1.24.0",
45 |         "python-slugify>=8.0.0",
46 |         "slugify>=0.0.1",
47 |         # Temporarily commented out for development installation
48 |         # "uvx>=0.4.0",
49 |         "mcp-server-qdrant>=0.2.0",
50 |         "mcp==1.5.0",
51 |     ],
52 |     python_requires=">=3.9",
53 |     classifiers=[
54 |         "Development Status :: 3 - Alpha",
55 |         "Intended Audience :: Developers",
56 |         "License :: OSI Approved :: MIT License",
57 |         "Programming Language :: Python :: 3.9",
58 |         "Programming Language :: Python :: 3.10",
59 |         "Programming Language :: Python :: 3.11",
60 |         "Topic :: Software Development :: Libraries :: Python Modules",
61 |     ],
62 |     entry_points={
63 |         "console_scripts": [
64 |             "mcp-codebase-insight=mcp_codebase_insight.server:run",
65 |         ],
66 |     },
67 | )
```

--------------------------------------------------------------------------------
/scripts/start_mcp_server.sh:
--------------------------------------------------------------------------------

```bash
  1 | #!/bin/bash
  2 | set -e
  3 | 
  4 | # Function to log messages
  5 | log() {
  6 |     echo "[$(date +'%Y-%m-%d %H:%M:%S')] $1"
  7 | }
  8 | 
  9 | # Function to check if Qdrant is available
 10 | check_qdrant() {
 11 |     local url="${QDRANT_URL:-http://localhost:6333}"
 12 |     local max_attempts=30
 13 |     local attempt=1
 14 | 
 15 |     log "Checking Qdrant connection at $url"
 16 |     
 17 |     while [ $attempt -le $max_attempts ]; do
 18 |         if curl -s -f "$url/health" > /dev/null 2>&1; then
 19 |             log "Qdrant is available"
 20 |             return 0
 21 |         fi
 22 |         
 23 |         log "Waiting for Qdrant (attempt $attempt/$max_attempts)..."
 24 |         sleep 2
 25 |         attempt=$((attempt + 1))
 26 |     done
 27 |     
 28 |     log "Error: Could not connect to Qdrant"
 29 |     return 1
 30 | }
 31 | 
 32 | # Function to check Python environment
 33 | check_python() {
 34 |     if ! command -v python3 &> /dev/null; then
 35 |         log "Error: Python 3 is not installed"
 36 |         exit 1
 37 |     fi
 38 |     
 39 |     if ! python3 -c "import pkg_resources; pkg_resources.require('fastapi>=0.103.2')" &> /dev/null; then
 40 |         log "Error: Required Python packages are not installed"
 41 |         exit 1
 42 |     fi
 43 | }
 44 | 
 45 | # Function to setup environment
 46 | setup_env() {
 47 |     # Create required directories if they don't exist
 48 |     mkdir -p docs/adrs knowledge cache logs
 49 |     
 50 |     # Copy example env file if .env doesn't exist
 51 |     if [ ! -f .env ] && [ -f .env.example ]; then
 52 |         cp .env.example .env
 53 |         log "Created .env from example"
 54 |     fi
 55 |     
 56 |     # Set default environment variables if not set
 57 |     export MCP_HOST=${MCP_HOST:-0.0.0.0}
 58 |     export MCP_PORT=${MCP_PORT:-3000}
 59 |     export MCP_LOG_LEVEL=${MCP_LOG_LEVEL:-INFO}
 60 |     
 61 |     log "Environment setup complete"
 62 | }
 63 | 
 64 | # Main startup sequence
 65 | main() {
 66 |     log "Starting MCP Codebase Insight Server"
 67 |     
 68 |     # Perform checks
 69 |     check_python
 70 |     setup_env
 71 |     check_qdrant
 72 |     
 73 |     # Parse command line arguments
 74 |     local host="0.0.0.0"
 75 |     local port="3000"
 76 |     
 77 |     while [[ $# -gt 0 ]]; do
 78 |         case $1 in
 79 |             --host)
 80 |                 host="$2"
 81 |                 shift 2
 82 |                 ;;
 83 |             --port)
 84 |                 port="$2"
 85 |                 shift 2
 86 |                 ;;
 87 |             *)
 88 |                 log "Unknown option: $1"
 89 |                 exit 1
 90 |                 ;;
 91 |         esac
 92 |     done
 93 |     
 94 |     # Start server
 95 |     log "Starting server on $host:$port"
 96 |     exec python3 -m mcp_codebase_insight
 97 | }
 98 | 
 99 | # Run main function with all arguments
100 | main "$@"
101 | 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/__main__.py:
--------------------------------------------------------------------------------

```python
 1 | """Main entry point for MCP server."""
 2 | 
 3 | import os
 4 | from pathlib import Path
 5 | import sys
 6 | import logging
 7 | 
 8 | import uvicorn
 9 | from dotenv import load_dotenv
10 | 
11 | from .core.config import ServerConfig
12 | from .server import create_app
13 | from .utils.logger import get_logger
14 | 
15 | # Configure logging
16 | logger = get_logger(__name__)
17 | 
18 | def get_config() -> ServerConfig:
19 |     """Get server configuration."""
20 |     try:
21 |         # Load environment variables
22 |         load_dotenv()
23 |         
24 |         config = ServerConfig(
25 |             host=os.getenv("MCP_HOST", "127.0.0.1"),
26 |             port=int(os.getenv("MCP_PORT", "3000")),
27 |             log_level=os.getenv("MCP_LOG_LEVEL", "INFO"),
28 |             qdrant_url=os.getenv("QDRANT_URL", "http://localhost:6333"),
29 |             docs_cache_dir=Path(os.getenv("MCP_DOCS_CACHE_DIR", "docs")),
30 |             adr_dir=Path(os.getenv("MCP_ADR_DIR", "docs/adrs")),
31 |             kb_storage_dir=Path(os.getenv("MCP_KB_STORAGE_DIR", "knowledge")),
32 |             embedding_model=os.getenv("MCP_EMBEDDING_MODEL", "all-MiniLM-L6-v2"),
33 |             collection_name=os.getenv("MCP_COLLECTION_NAME", "codebase_patterns"),
34 |             debug_mode=os.getenv("MCP_DEBUG", "false").lower() == "true",
35 |             metrics_enabled=os.getenv("MCP_METRICS_ENABLED", "true").lower() == "true",
36 |             cache_enabled=os.getenv("MCP_CACHE_ENABLED", "true").lower() == "true",
37 |             memory_cache_size=int(os.getenv("MCP_MEMORY_CACHE_SIZE", "1000")),
38 |             disk_cache_dir=Path(os.getenv("MCP_DISK_CACHE_DIR", "cache")) if os.getenv("MCP_DISK_CACHE_DIR") else None
39 |         )
40 |         
41 |         logger.info("Configuration loaded successfully")
42 |         return config
43 |         
44 |     except Exception as e:
45 |         logger.error(f"Failed to load configuration: {e}", exc_info=True)
46 |         raise
47 | 
48 | def main():
49 |     """Run the server."""
50 |     try:
51 |         # Get configuration
52 |         config = get_config()
53 |         
54 |         # Create FastAPI app
55 |         app = create_app(config)
56 |         
57 |         # Log startup message
58 |         logger.info(
59 |             f"Starting MCP Codebase Insight Server on {config.host}:{config.port} "
60 |             f"(log level: {config.log_level}, debug mode: {config.debug_mode})"
61 |         )
62 |         
63 |         # Run using Uvicorn directly
64 |         uvicorn.run(
65 |             app=app,
66 |             host=config.host,
67 |             port=config.port,
68 |             log_level=config.log_level.lower(),
69 |             loop="auto",
70 |             lifespan="on",
71 |             workers=1
72 |         )
73 |         
74 |     except Exception as e:
75 |         logger.error(f"Server error: {e}", exc_info=True)
76 |         sys.exit(1)
77 | 
78 | if __name__ == "__main__":
79 |     # Run main directly without asyncio.run()
80 |     main()
81 | 
```

--------------------------------------------------------------------------------
/scripts/validate_knowledge_base.py:
--------------------------------------------------------------------------------

```python
 1 | #!/usr/bin/env python3
 2 | """
 3 | Knowledge Base Validation Script
 4 | Tests knowledge base operations using Firecrawl MCP.
 5 | """
 6 | 
 7 | import asyncio
 8 | import logging
 9 | from mcp_firecrawl import (
10 |     test_knowledge_operations,
11 |     validate_entity_relations,
12 |     verify_query_results
13 | )
14 | 
15 | logging.basicConfig(level=logging.INFO)
16 | logger = logging.getLogger(__name__)
17 | 
18 | async def validate_knowledge_base(config: dict) -> bool:
19 |     """Validate knowledge base operations."""
20 |     logger.info("Testing knowledge base operations...")
21 |     
22 |     # Test basic knowledge operations
23 |     ops_result = await test_knowledge_operations({
24 |         "url": "http://localhost:8001",
25 |         "auth_token": config["API_KEY"],
26 |         "test_entities": [
27 |             {"name": "TestClass", "type": "class"},
28 |             {"name": "test_method", "type": "method"},
29 |             {"name": "test_variable", "type": "variable"}
30 |         ],
31 |         "verify_persistence": True
32 |     })
33 |     
34 |     # Validate entity relations
35 |     relations_result = await validate_entity_relations({
36 |         "url": "http://localhost:8001",
37 |         "auth_token": config["API_KEY"],
38 |         "test_relations": [
39 |             {"from": "TestClass", "to": "test_method", "type": "contains"},
40 |             {"from": "test_method", "to": "test_variable", "type": "uses"}
41 |         ],
42 |         "verify_bidirectional": True
43 |     })
44 |     
45 |     # Verify query functionality
46 |     query_result = await verify_query_results({
47 |         "url": "http://localhost:8001",
48 |         "auth_token": config["API_KEY"],
49 |         "test_queries": [
50 |             "find classes that use test_variable",
51 |             "find methods in TestClass",
52 |             "find variables used by test_method"
53 |         ],
54 |         "expected_matches": {
55 |             "classes": ["TestClass"],
56 |             "methods": ["test_method"],
57 |             "variables": ["test_variable"]
58 |         }
59 |     })
60 |     
61 |     all_passed = all([
62 |         ops_result.success,
63 |         relations_result.success,
64 |         query_result.success
65 |     ])
66 |     
67 |     if all_passed:
68 |         logger.info("Knowledge base validation successful")
69 |     else:
70 |         logger.error("Knowledge base validation failed")
71 |         if not ops_result.success:
72 |             logger.error("Knowledge operations failed")
73 |         if not relations_result.success:
74 |             logger.error("Entity relations validation failed")
75 |         if not query_result.success:
76 |             logger.error("Query validation failed")
77 |     
78 |     return all_passed
79 | 
80 | if __name__ == "__main__":
81 |     import sys
82 |     from pathlib import Path
83 |     sys.path.append(str(Path(__file__).parent.parent))
84 |     
85 |     from scripts.config import load_config
86 |     config = load_config()
87 |     
88 |     success = asyncio.run(validate_knowledge_base(config))
89 |     sys.exit(0 if success else 1) 
```

--------------------------------------------------------------------------------
/test_fixes.md:
--------------------------------------------------------------------------------

```markdown
 1 | # MCP Codebase Insight Test Fixes
 2 | 
 3 | ## Identified Issues
 4 | 
 5 | 1. **Package Import Problems**
 6 |    - The tests were trying to import from `mcp_codebase_insight` directly, but the package needed to be imported from `src.mcp_codebase_insight`
 7 |    - The Python path wasn't correctly set up to include the project root directory
 8 | 
 9 | 2. **Missing Dependencies**
10 |    - The `sentence-transformers` package was installed in the wrong Python environment (Python 3.13 instead of 3.11)
11 |    - Had to explicitly install it in the correct environment
12 | 
13 | 3. **Test Isolation Problems**
14 |    - Tests were failing due to not being properly isolated
15 |    - The `component_test_runner.py` script needed fixes to properly load test modules
16 | 
17 | 4. **Qdrant Server Issue**
18 |    - The `test_vector_store_cleanup` test failed due to permission issues in the Qdrant server
19 |    - The server couldn't create a collection directory for the test
20 | 
21 | ## Applied Fixes
22 | 
23 | 1. **Fixed Import Paths**
24 |    - Modified test files to use `from src.mcp_codebase_insight...` instead of `from mcp_codebase_insight...`
25 |    - Added code to explicitly set `sys.path` to include the project root directory
26 | 
27 | 2. **Fixed Dependency Issues**
28 |    - Ran `python3.11 -m pip install sentence-transformers` to install the package in the correct environment
29 |    - Verified all dependencies were properly installed
30 | 
31 | 3. **Created a Test Runner Script**
32 |    - Created `run_test_with_path_fix.sh` to set up the proper environment variables and paths
33 |    - Modified `component_test_runner.py` to better handle module loading
34 | 
35 | 4. **Fixed Test Module Loading**
36 |    - Added a `load_test_module` function to properly handle import paths
37 |    - Ensured the correct Python path is set before importing test modules
38 | 
39 | ## Results
40 | 
41 | - Successfully ran 2 out of 3 vector store tests:
42 |   - ✅ `test_vector_store_initialization`
43 |   - ✅ `test_vector_store_add_and_search`
44 |   - ❌ `test_vector_store_cleanup` (still failing due to Qdrant server issue)
45 | 
46 | ## Recommendations for Remaining Issue
47 | 
48 | The `test_vector_store_cleanup` test is failing due to the Qdrant server not being able to create a directory for the collection. This could be fixed by:
49 | 
50 | 1. Checking the Qdrant server configuration to ensure it has proper permissions to create directories
51 | 2. Creating the necessary directories beforehand
52 | 3. Modifying the test to use a collection name that already exists or mock the collection creation
53 | 
54 | The error message suggests a file system permission issue:
55 | ```
56 | "Can't create directory for collection cleanup_test_db679546. Error: No such file or directory (os error 2)"
57 | ```
58 | 
59 | A simpler fix for testing purposes might be to modify the Qdrant Docker run command to include a volume mount with proper permissions:
60 | 
61 | ```bash
62 | docker run -d -p 6333:6333 -p 6334:6334 -v $(pwd)/qdrant_data:/qdrant/storage qdrant/qdrant
63 | ```
64 | 
65 | This would ensure the storage directory exists and has the right permissions.
66 | 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/utils/logger.py:
--------------------------------------------------------------------------------

```python
  1 | """Structured logging module."""
  2 | 
  3 | import logging
  4 | import sys
  5 | from typing import Any, Dict, Optional
  6 | 
  7 | import structlog
  8 | 
  9 | # Configure structlog
 10 | structlog.configure(
 11 |     processors=[
 12 |         structlog.stdlib.filter_by_level,
 13 |         structlog.stdlib.add_logger_name,
 14 |         structlog.stdlib.add_log_level,
 15 |         structlog.stdlib.PositionalArgumentsFormatter(),
 16 |         structlog.processors.TimeStamper(fmt="iso"),
 17 |         structlog.processors.StackInfoRenderer(),
 18 |         structlog.processors.format_exc_info,
 19 |         structlog.processors.UnicodeDecoder(),
 20 |         structlog.processors.JSONRenderer()
 21 |     ],
 22 |     context_class=dict,
 23 |     logger_factory=structlog.stdlib.LoggerFactory(),
 24 |     wrapper_class=structlog.stdlib.BoundLogger,
 25 |     cache_logger_on_first_use=True,
 26 | )
 27 | 
 28 | class Logger:
 29 |     """Structured logger."""
 30 |     
 31 |     def __init__(
 32 |         self,
 33 |         name: str,
 34 |         level: str = "INFO",
 35 |         extra: Optional[Dict[str, Any]] = None
 36 |     ):
 37 |         """Initialize logger."""
 38 |         # Set log level
 39 |         log_level = getattr(logging, level.upper())
 40 |         logging.basicConfig(
 41 |             format="%(message)s",
 42 |             stream=sys.stdout,
 43 |             level=log_level,
 44 |         )
 45 |         
 46 |         # Create logger
 47 |         self.logger = structlog.get_logger(name)
 48 |         self.extra = extra or {}
 49 |     
 50 |     def bind(self, **kwargs) -> "Logger":
 51 |         """Create new logger with additional context."""
 52 |         extra = {**self.extra, **kwargs}
 53 |         return Logger(
 54 |             name=self.logger.name,
 55 |             level=logging.getLevelName(self.logger.level),
 56 |             extra=extra
 57 |         )
 58 |     
 59 |     def debug(self, event: str, **kwargs):
 60 |         """Log debug message."""
 61 |         self.logger.debug(
 62 |             event,
 63 |             **{**self.extra, **kwargs}
 64 |         )
 65 |     
 66 |     def info(self, event: str, **kwargs):
 67 |         """Log info message."""
 68 |         self.logger.info(
 69 |             event,
 70 |             **{**self.extra, **kwargs}
 71 |         )
 72 |     
 73 |     def warning(self, event: str, **kwargs):
 74 |         """Log warning message."""
 75 |         self.logger.warning(
 76 |             event,
 77 |             **{**self.extra, **kwargs}
 78 |         )
 79 |     
 80 |     def error(self, event: str, **kwargs):
 81 |         """Log error message."""
 82 |         self.logger.error(
 83 |             event,
 84 |             **{**self.extra, **kwargs}
 85 |         )
 86 |     
 87 |     def exception(self, event: str, exc_info: bool = True, **kwargs):
 88 |         """Log exception message."""
 89 |         self.logger.exception(
 90 |             event,
 91 |             exc_info=exc_info,
 92 |             **{**self.extra, **kwargs}
 93 |         )
 94 |     
 95 |     def critical(self, event: str, **kwargs):
 96 |         """Log critical message."""
 97 |         self.logger.critical(
 98 |             event,
 99 |             **{**self.extra, **kwargs}
100 |         )
101 | 
102 | def get_logger(
103 |     name: str,
104 |     level: str = "INFO",
105 |     extra: Optional[Dict[str, Any]] = None
106 | ) -> Logger:
107 |     """Get logger instance."""
108 |     return Logger(name, level, extra)
109 | 
110 | # Default logger
111 | logger = get_logger("mcp_codebase_insight")
112 | 
```

--------------------------------------------------------------------------------
/scripts/validate_vector_store.py:
--------------------------------------------------------------------------------

```python
 1 | #!/usr/bin/env python3
 2 | """
 3 | Vector Store Validation Script
 4 | Tests vector store operations using local codebase.
 5 | """
 6 | 
 7 | import asyncio
 8 | import logging
 9 | from pathlib import Path
10 | import sys
11 | 
12 | # Add the src directory to the Python path
13 | sys.path.append(str(Path(__file__).parent.parent / "src"))
14 | 
15 | from mcp_codebase_insight.core.vector_store import VectorStore
16 | from mcp_codebase_insight.core.embeddings import SentenceTransformerEmbedding
17 | 
18 | logging.basicConfig(level=logging.INFO)
19 | logger = logging.getLogger(__name__)
20 | 
21 | async def validate_vector_store(config: dict) -> bool:
22 |     """Validate vector store operations."""
23 |     logger.info("Testing vector store operations...")
24 |     
25 |     try:
26 |         # Initialize embedder
27 |         embedder = SentenceTransformerEmbedding(
28 |             model_name="sentence-transformers/all-MiniLM-L6-v2"
29 |         )
30 |         await embedder.initialize()
31 |         logger.info("Embedder initialized successfully")
32 |         
33 |         # Initialize vector store
34 |         vector_store = VectorStore(
35 |             url=config.get("QDRANT_URL", "http://localhost:6333"),
36 |             embedder=embedder,
37 |             collection_name=config.get("COLLECTION_NAME", "mcp-codebase-insight"),
38 |             api_key=config.get("QDRANT_API_KEY", ""),
39 |             vector_name="default"
40 |         )
41 |         await vector_store.initialize()
42 |         logger.info("Vector store initialized successfully")
43 |         
44 |         # Test vector operations
45 |         test_text = "def test_function():\n    pass"
46 |         embedding = await embedder.embed(test_text)
47 |         
48 |         # Store vector
49 |         await vector_store.add_vector(
50 |             text=test_text,
51 |             metadata={"type": "code", "content": test_text}
52 |         )
53 |         logger.info("Vector storage test passed")
54 |         
55 |         # Search for similar vectors
56 |         logger.info("Searching for similar vectors")
57 |         results = await vector_store.search_similar(
58 |             query=test_text,
59 |             limit=1
60 |         )
61 |         
62 |         if not results or len(results) == 0:
63 |             logger.error("Vector search test failed: No results found")
64 |             return False
65 |             
66 |         logger.info("Vector search test passed")
67 |         
68 |         # Verify result metadata
69 |         result = results[0]
70 |         if not result.metadata or result.metadata.get("type") != "code":
71 |             logger.error("Vector metadata test failed: Invalid metadata")
72 |             return False
73 |             
74 |         logger.info("Vector metadata test passed")
75 |         return True
76 |         
77 |     except Exception as e:
78 |         logger.error(f"Vector store validation failed: {e}")
79 |         return False
80 | 
81 | if __name__ == "__main__":
82 |     # Load config from environment or .env file
83 |     from dotenv import load_dotenv
84 |     load_dotenv()
85 |     
86 |     import os
87 |     config = {
88 |         "QDRANT_URL": os.getenv("QDRANT_URL", "http://localhost:6333"),
89 |         "COLLECTION_NAME": os.getenv("COLLECTION_NAME", "mcp-codebase-insight"),
90 |         "QDRANT_API_KEY": os.getenv("QDRANT_API_KEY", "")
91 |     }
92 |     
93 |     success = asyncio.run(validate_vector_store(config))
94 |     sys.exit(0 if success else 1) 
```

--------------------------------------------------------------------------------
/tests/components/conftest.py:
--------------------------------------------------------------------------------

```python
  1 | """
  2 | Component Test Fixture Configuration.
  3 | 
  4 | This file defines fixtures specifically for component tests that might have different
  5 | scope requirements than the main test fixtures.
  6 | """
  7 | import pytest
  8 | import pytest_asyncio
  9 | import sys
 10 | import os
 11 | from pathlib import Path
 12 | import uuid
 13 | from typing import Dict
 14 | 
 15 | # Import required components
 16 | from src.mcp_codebase_insight.core.config import ServerConfig
 17 | from src.mcp_codebase_insight.core.vector_store import VectorStore
 18 | from src.mcp_codebase_insight.core.embeddings import SentenceTransformerEmbedding
 19 | from src.mcp_codebase_insight.core.knowledge import KnowledgeBase
 20 | from src.mcp_codebase_insight.core.tasks import TaskManager
 21 | # Ensure the src directory is in the Python path
 22 | sys.path.insert(0, os.path.abspath(os.path.join(os.path.dirname(__file__), '../')))
 23 | 
 24 | @pytest.fixture
 25 | def test_config():
 26 |     """Create a server configuration for tests.
 27 |     
 28 |     This is an alias for test_server_config that allows component tests to use
 29 |     their expected fixture name.
 30 |     """
 31 |     config = ServerConfig(
 32 |         host="localhost",
 33 |         port=8000,
 34 |         log_level="DEBUG",
 35 |         qdrant_url="http://localhost:6333",
 36 |         docs_cache_dir=Path(".test_cache") / "docs",
 37 |         adr_dir=Path(".test_cache") / "docs/adrs",
 38 |         kb_storage_dir=Path(".test_cache") / "knowledge",
 39 |         embedding_model="all-MiniLM-L6-v2",
 40 |         collection_name=f"test_collection_{uuid.uuid4().hex[:8]}",
 41 |         debug_mode=True,
 42 |         metrics_enabled=False,
 43 |         cache_enabled=True,
 44 |         memory_cache_size=1000,
 45 |         disk_cache_dir=Path(".test_cache") / "cache"
 46 |     )
 47 |     return config
 48 | 
 49 | @pytest.fixture
 50 | def test_metadata() -> Dict:
 51 |     """Standard test metadata for consistency across tests."""
 52 |     return {
 53 |         "type": "code",
 54 |         "language": "python",
 55 |         "title": "Test Code",
 56 |         "description": "Test code snippet for vector store testing",
 57 |         "tags": ["test", "vector"]
 58 |     }
 59 | 
 60 | @pytest_asyncio.fixture
 61 | async def embedder():
 62 |     """Create an embedder for tests."""
 63 |     return SentenceTransformerEmbedding()
 64 | 
 65 | @pytest_asyncio.fixture
 66 | async def vector_store(test_config, embedder):
 67 |     """Create a vector store for tests."""
 68 |     store = VectorStore(test_config.qdrant_url, embedder)
 69 |     await store.initialize()
 70 |     yield store
 71 |     await store.cleanup()
 72 | 
 73 | @pytest_asyncio.fixture
 74 | async def task_manager(test_config):
 75 |     """Create a task manager for tests."""
 76 |     manager = TaskManager(test_config)
 77 |     await manager.initialize()
 78 |     yield manager
 79 |     await manager.cleanup()
 80 | 
 81 | @pytest.fixture
 82 | def test_code():
 83 |     """Provide sample code for testing task-related functionality."""
 84 |     return """
 85 | def example_function():
 86 |     \"\"\"This is a test function for task manager tests.\"\"\"
 87 |     return "Hello, world!"
 88 | 
 89 | class TestClass:
 90 |     def __init__(self):
 91 |         self.value = 42
 92 |         
 93 |     def method(self):
 94 |         return self.value
 95 | """
 96 | 
 97 | @pytest_asyncio.fixture
 98 | async def knowledge_base(test_config, vector_store):
 99 |     """Create a knowledge base for tests."""
100 |     kb = KnowledgeBase(test_config, vector_store)
101 |     await kb.initialize()
102 |     yield kb
103 |     await kb.cleanup()
104 | 
```

--------------------------------------------------------------------------------
/tests/test_file_relationships.py:
--------------------------------------------------------------------------------

```python
 1 | import pytest
 2 | 
 3 | @pytest.mark.asyncio
 4 | async def test_create_file_relationship(client):
 5 |     """Test creating a file relationship."""
 6 |     relationship_data = {
 7 |         "source_file": "src/main.py",
 8 |         "target_file": "src/utils.py",
 9 |         "relationship_type": "imports",
10 |         "description": "Main imports utility functions",
11 |         "metadata": {"importance": "high"}
12 |     }
13 | 
14 |     response = await client.post("/relationships", json=relationship_data)
15 |     assert response.status_code == 200
16 |     data = response.json()
17 |     assert data["source_file"] == relationship_data["source_file"]
18 |     assert data["target_file"] == relationship_data["target_file"]
19 |     assert data["relationship_type"] == relationship_data["relationship_type"]
20 | 
21 | @pytest.mark.asyncio
22 | async def test_get_file_relationships(client):
23 |     """Test getting file relationships."""
24 |     # Create a test relationship first
25 |     relationship_data = {
26 |         "source_file": "src/test.py",
27 |         "target_file": "src/helper.py",
28 |         "relationship_type": "depends_on"
29 |     }
30 |     await client.post("/relationships", json=relationship_data)
31 | 
32 |     # Test getting all relationships
33 |     response = await client.get("/relationships")
34 |     assert response.status_code == 200
35 |     data = response.json()
36 |     assert len(data) > 0
37 |     assert isinstance(data, list)
38 | 
39 |     # Test filtering by source file
40 |     response = await client.get("/relationships", params={"source_file": "src/test.py"})
41 |     assert response.status_code == 200
42 |     data = response.json()
43 |     assert all(r["source_file"] == "src/test.py" for r in data)
44 | 
45 | @pytest.mark.asyncio
46 | async def test_create_web_source(client):
47 |     """Test creating a web source."""
48 |     source_data = {
49 |         "url": "https://example.com/docs",
50 |         "title": "API Documentation",
51 |         "content_type": "documentation",
52 |         "description": "External API documentation",
53 |         "tags": ["api", "docs"],
54 |         "metadata": {"version": "1.0"}
55 |     }
56 | 
57 |     response = await client.post("/web-sources", json=source_data)
58 |     assert response.status_code == 200
59 |     data = response.json()
60 |     assert data["url"] == source_data["url"]
61 |     assert data["title"] == source_data["title"]
62 |     assert data["content_type"] == source_data["content_type"]
63 | 
64 | @pytest.mark.asyncio
65 | async def test_get_web_sources(client):
66 |     """Test getting web sources."""
67 |     # Create a test web source first
68 |     source_data = {
69 |         "url": "https://example.com/tutorial",
70 |         "title": "Tutorial",
71 |         "content_type": "tutorial",
72 |         "tags": ["guide", "tutorial"]
73 |     }
74 |     await client.post("/web-sources", json=source_data)
75 | 
76 |     # Test getting all web sources
77 |     response = await client.get("/web-sources")
78 |     assert response.status_code == 200
79 |     data = response.json()
80 |     assert len(data) > 0
81 |     assert isinstance(data, list)
82 | 
83 |     # Test filtering by content type
84 |     response = await client.get("/web-sources", params={"content_type": "tutorial"})
85 |     assert response.status_code == 200
86 |     data = response.json()
87 |     assert all(s["content_type"] == "tutorial" for s in data)
88 | 
89 |     # Test filtering by tags
90 |     response = await client.get("/web-sources", params={"tags": ["guide"]})
91 |     assert response.status_code == 200
92 |     data = response.json()
93 |     assert any("guide" in s["tags"] for s in data)
```

--------------------------------------------------------------------------------
/pyproject.toml:
--------------------------------------------------------------------------------

```toml
  1 | [build-system]
  2 | requires = ["setuptools>=61.0", "wheel"]
  3 | build-backend = "setuptools.build_meta"
  4 | 
  5 | [project]
  6 | name = "mcp-codebase-insight"
  7 | dynamic = ["version"]
  8 | description = "MCP Codebase Insight Server"
  9 | readme = "README.md"
 10 | requires-python = ">=3.10"
 11 | license = {text = "MIT"}
 12 | authors = [
 13 |     {name = "Tosin Akinosho"}
 14 | ]
 15 | classifiers = [
 16 |     "Development Status :: 3 - Alpha",
 17 |     "Intended Audience :: Developers",
 18 |     "License :: OSI Approved :: MIT License",
 19 |     "Programming Language :: Python :: 3",
 20 |     "Programming Language :: Python :: 3.10",
 21 |     "Programming Language :: Python :: 3.11",
 22 |     "Programming Language :: Python :: 3.12",
 23 |     "Programming Language :: Python :: 3.13",
 24 |     "Topic :: Software Development :: Libraries :: Python Modules",
 25 | ]
 26 | dependencies = [
 27 |     "fastapi>=0.109.0",
 28 |     "uvicorn>=0.23.2",
 29 |     "pydantic>=2.4.2",
 30 |     "starlette>=0.35.0",
 31 |     "asyncio>=3.4.3",
 32 |     "aiohttp>=3.9.0",
 33 |     "qdrant-client>=1.13.3",
 34 |     "sentence-transformers>=2.2.2",
 35 |     "torch>=2.0.0",
 36 |     "transformers>=4.34.0",
 37 |     "python-frontmatter>=1.0.0",
 38 |     "markdown>=3.4.4",
 39 |     "PyYAML>=6.0.1",
 40 |     "structlog>=23.1.0",
 41 |     "psutil>=5.9.5",
 42 |     "python-dotenv>=1.0.0",
 43 |     "requests>=2.31.0",
 44 |     "beautifulsoup4>=4.12.0",
 45 |     "scipy>=1.11.0",
 46 |     "python-slugify>=8.0.0",
 47 |     "slugify>=0.0.1",
 48 |     "numpy>=1.24.0",
 49 |     # "uvx>=0.4.0",  # Temporarily commented out for development installation
 50 |     "mcp-server-qdrant>=0.2.0",
 51 |     "mcp>=1.5.0,<1.6.0",  # Pin to MCP 1.5.0 for API compatibility
 52 | ]
 53 | 
 54 | [project.optional-dependencies]
 55 | test = [
 56 |     "pytest>=7.4.2",
 57 |     "pytest-asyncio>=0.21.1",
 58 |     "pytest-cov>=4.1.0",
 59 |     "httpx>=0.25.0",
 60 | ]
 61 | dev = [
 62 |     "black>=23.9.1",
 63 |     "isort>=5.12.0",
 64 |     "mypy>=1.5.1",
 65 |     "flake8>=6.1.0",
 66 |     "bump2version>=1.0.1",
 67 |     "pre-commit>=3.5.0",
 68 |     "pdoc>=14.1.0",
 69 | ]
 70 | 
 71 | [project.urls]
 72 | Homepage = "https://github.com/tosin2013/mcp-codebase-insight"
 73 | Documentation = "https://github.com/tosin2013/mcp-codebase-insight/docs"
 74 | Repository = "https://github.com/tosin2013/mcp-codebase-insight.git"
 75 | Issues = "https://github.com/tosin2013/mcp-codebase-insight/issues"
 76 | 
 77 | [project.scripts]
 78 | mcp-codebase-insight = "mcp_codebase_insight.server:run"
 79 | 
 80 | [tool.setuptools]
 81 | package-dir = {"" = "src"}
 82 | 
 83 | [tool.setuptools.packages.find]
 84 | where = ["src"]
 85 | include = ["mcp_codebase_insight*"]
 86 | 
 87 | [tool.black]
 88 | line-length = 88
 89 | target-version = ['py311']
 90 | include = '\.pyi?$'
 91 | 
 92 | [tool.isort]
 93 | profile = "black"
 94 | multi_line_output = 3
 95 | include_trailing_comma = true
 96 | force_grid_wrap = 0
 97 | use_parentheses = true
 98 | ensure_newline_before_comments = true
 99 | line_length = 88
100 | 
101 | [tool.mypy]
102 | python_version = "3.11"
103 | warn_return_any = true
104 | warn_unused_configs = true
105 | disallow_untyped_defs = true
106 | check_untyped_defs = true
107 | disallow_untyped_decorators = true
108 | no_implicit_optional = true
109 | warn_redundant_casts = true
110 | warn_unused_ignores = true
111 | warn_no_return = true
112 | warn_unreachable = true
113 | 
114 | [tool.pytest.ini_options]
115 | minversion = "6.0"
116 | addopts = "-ra -q --cov=src --cov-report=term-missing"
117 | testpaths = ["tests"]
118 | asyncio_mode = "auto"
119 | 
120 | [tool.coverage.run]
121 | source = ["src"]
122 | branch = true
123 | 
124 | [tool.coverage.report]
125 | exclude_lines = [
126 |     "pragma: no cover",
127 |     "def __repr__",
128 |     "if self.debug:",
129 |     "raise NotImplementedError",
130 |     "if __name__ == .__main__.:",
131 |     "pass",
132 |     "raise ImportError",
133 | ]
134 | ignore_errors = true
135 | omit = ["tests/*", "setup.py"]
136 | 
```

--------------------------------------------------------------------------------
/tests/components/test_task_manager.py:
--------------------------------------------------------------------------------

```python
  1 | import sys
  2 | import os
  3 | import pytest
  4 | import pytest_asyncio
  5 | from pathlib import Path
  6 | from typing import AsyncGenerator
  7 | from src.mcp_codebase_insight.core.tasks import TaskManager, TaskType, TaskStatus
  8 | from src.mcp_codebase_insight.core.config import ServerConfig
  9 | 
 10 | @pytest_asyncio.fixture
 11 | async def task_manager(test_config: ServerConfig):
 12 |     manager = TaskManager(test_config)
 13 |     await manager.initialize()
 14 |     yield manager
 15 |     await manager.cleanup()
 16 | 
 17 | @pytest.mark.asyncio
 18 | async def test_task_manager_initialization(task_manager: TaskManager):
 19 |     """Test that task manager initializes correctly."""
 20 |     assert task_manager is not None
 21 |     assert task_manager.config is not None
 22 | 
 23 | @pytest.mark.asyncio
 24 | async def test_create_and_get_task(task_manager: TaskManager, test_code: str):
 25 |     """Test creating and retrieving tasks."""
 26 |     # Create task
 27 |     task = await task_manager.create_task(
 28 |         type="code_analysis",
 29 |         title="Test task",
 30 |         description="Test task description",
 31 |         context={"code": test_code}
 32 |     )
 33 |     assert task is not None
 34 |     
 35 |     # Get task
 36 |     retrieved_task = await task_manager.get_task(task.id)
 37 |     assert retrieved_task.context["code"] == test_code
 38 |     assert retrieved_task.type == TaskType.CODE_ANALYSIS
 39 |     assert retrieved_task.description == "Test task description"
 40 | 
 41 | @pytest.mark.asyncio
 42 | async def test_task_status_updates(task_manager: TaskManager, test_code: str):
 43 |     """Test task status updates."""
 44 |     # Create task
 45 |     task = await task_manager.create_task(
 46 |         type="code_analysis",
 47 |         title="Status Test",
 48 |         description="Test task status updates",
 49 |         context={"code": test_code}
 50 |     )
 51 |     
 52 |     # Update status
 53 |     await task_manager.update_task(task.id, status=TaskStatus.IN_PROGRESS)
 54 |     updated_task = await task_manager.get_task(task.id)
 55 |     assert updated_task.status == TaskStatus.IN_PROGRESS
 56 |     
 57 |     await task_manager.update_task(task.id, status=TaskStatus.COMPLETED)
 58 |     completed_task = await task_manager.get_task(task.id)
 59 |     assert completed_task.status == TaskStatus.COMPLETED
 60 | 
 61 | @pytest.mark.asyncio
 62 | async def test_task_result_updates(task_manager: TaskManager, test_code: str):
 63 |     """Test updating task results."""
 64 |     # Create task
 65 |     task = await task_manager.create_task(
 66 |         type="code_analysis",
 67 |         title="Result Test",
 68 |         description="Test task result updates",
 69 |         context={"code": test_code}
 70 |     )
 71 |     
 72 |     # Update result
 73 |     result = {"analysis": "Test analysis result"}
 74 |     await task_manager.update_task(task.id, result=result)
 75 |     
 76 |     # Verify result
 77 |     updated_task = await task_manager.get_task(task.id)
 78 |     assert updated_task.result == result
 79 | 
 80 | @pytest.mark.asyncio
 81 | async def test_list_tasks(task_manager: TaskManager, test_code: str):
 82 |     """Test listing tasks."""
 83 |     # Create multiple tasks
 84 |     tasks = []
 85 |     for i in range(3):
 86 |         task = await task_manager.create_task(
 87 |             type="code_analysis",
 88 |             title=f"List Test {i}",
 89 |             description=f"Test task {i}",
 90 |             context={"code": test_code}
 91 |         )
 92 |         tasks.append(task)
 93 |     
 94 |     # List tasks
 95 |     task_list = await task_manager.list_tasks()
 96 |     assert len(task_list) >= 3
 97 |     
 98 |     # Verify task descriptions
 99 |     descriptions = [task.description for task in task_list]
100 |     for i in range(3):
101 |         assert f"Test task {i}" in descriptions 
```

--------------------------------------------------------------------------------
/CHANGELOG.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Changelog
  2 | 
  3 | All notable changes to this project will be documented in this file.
  4 | 
  5 | The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
  6 | and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
  7 | 
  8 | ## [Unreleased]
  9 | 
 10 | ### Added
 11 | - Initial project setup
 12 | - Core server implementation
 13 | - ADR management system
 14 | - Documentation management
 15 | - Knowledge base with vector search
 16 | - Debug system
 17 | - Task management
 18 | - Metrics and health monitoring
 19 | - Caching system
 20 | - Structured logging
 21 | - Docker support
 22 | - CI/CD pipeline
 23 | - Test suite
 24 | 
 25 | ### Changed
 26 | - None
 27 | 
 28 | ### Deprecated
 29 | - None
 30 | 
 31 | ### Removed
 32 | - None
 33 | 
 34 | ### Fixed
 35 | - None
 36 | 
 37 | ### Security
 38 | - None
 39 | 
 40 | ## [0.2.2] - 2025-03-25
 41 | 
 42 | ### Added
 43 | - Implemented single source of truth for versioning
 44 | 
 45 | ### Changed
 46 | - Moved version to the package's __init__.py file as the canonical source
 47 | - Updated setup.py to dynamically read version from __init__.py
 48 | - Updated pyproject.toml to use dynamic versioning
 49 | - Synchronized dependencies between setup.py, pyproject.toml and requirements.in
 50 | 
 51 | ### Fixed
 52 | - Missing dependencies in setup.py and pyproject.toml
 53 | 
 54 | ## [0.2.1] - 2025-03-25
 55 | 
 56 | ### Added
 57 | - Integrated Qdrant Docker container in CI/CD workflow for more realistic testing
 58 | - Added collection initialization step for proper Qdrant setup in CI/CD
 59 | - Created shared Qdrant client fixture for improved test reliability
 60 | 
 61 | ### Changed
 62 | - Updated Python version requirement from >=3.11 to >=3.9 for broader compatibility
 63 | - Enhanced test fixture scoping to resolve event_loop fixture scope mismatches
 64 | - Improved connection verification for Qdrant in GitHub Actions workflow
 65 | 
 66 | ### Fixed
 67 | - Resolved fixture scope mismatches in async tests
 68 | - Fixed environment variable handling in test configuration
 69 | 
 70 | ### Removed
 71 | - None
 72 | 
 73 | ### Security
 74 | - None
 75 | 
 76 | ## [0.2.0] - 2025-03-24
 77 | 
 78 | ### Added
 79 | - None
 80 | 
 81 | ### Changed
 82 | - Improved async test fixture handling in component tests
 83 | - Enhanced test discovery to properly distinguish between test functions and fixtures
 84 | - Updated component test runner for better isolation and resource management
 85 | 
 86 | ### Fixed
 87 | - Resolved fixture scope mismatches in async tests
 88 | - Fixed async event loop handling in component tests
 89 | - Corrected test_metadata fixture identification in test_vector_store.py
 90 | 
 91 | ### Removed
 92 | - None
 93 | 
 94 | ### Security
 95 | - None
 96 | 
 97 | ## [0.1.0] - 2025-03-19
 98 | 
 99 | ### Added
100 | - Initial release
101 | - Basic server functionality
102 | - Core components:
103 |   - ADR management
104 |   - Documentation handling
105 |   - Knowledge base
106 |   - Vector search
107 |   - Task management
108 |   - Health monitoring
109 |   - Metrics collection
110 |   - Caching
111 |   - Logging
112 | - Docker support
113 | - CI/CD pipeline with GitHub Actions
114 | - Test coverage with pytest
115 | - Code quality tools:
116 |   - Black
117 |   - isort
118 |   - flake8
119 |   - mypy
120 | - Documentation:
121 |   - README
122 |   - API documentation
123 |   - Contributing guidelines
124 |   - ADR templates
125 | - Development tools:
126 |   - Makefile
127 |   - Docker compose
128 |   - Environment configuration
129 |   - Version management
130 | 
131 | [Unreleased]: https://github.com/modelcontextprotocol/mcp-codebase-insight/compare/v0.2.2...HEAD
132 | [0.2.2]: https://github.com/modelcontextprotocol/mcp-codebase-insight/compare/v0.2.1...v0.2.2
133 | [0.2.1]: https://github.com/modelcontextprotocol/mcp-codebase-insight/releases/tag/v0.2.1
134 | [0.2.0]: https://github.com/modelcontextprotocol/mcp-codebase-insight/releases/tag/v0.2.0
135 | [0.1.0]: https://github.com/modelcontextprotocol/mcp-codebase-insight/releases/tag/v0.1.0
136 | 
```

--------------------------------------------------------------------------------
/docs/documentation_map.md:
--------------------------------------------------------------------------------

```markdown
  1 | # Documentation Relationship Map
  2 | 
  3 | ```mermaid
  4 | graph TD
  5 |     %% ADRs
  6 |     ADR1[ADR-0001: Testing Strategy]
  7 |     ADR2[ADR-0002: SSE Testing]
  8 |     ADR3[ADR-0003: Comprehensive Testing]
  9 |     ADR4[ADR-0004: Documentation Linking]
 10 | 
 11 |     %% Core Systems
 12 |     CS1[Vector Store System]
 13 |     CS2[Knowledge Base]
 14 |     CS3[Task Management]
 15 |     CS4[Health Monitoring]
 16 |     CS5[Error Handling]
 17 |     CS6[Metrics Collection]
 18 |     CS7[Cache Management]
 19 | 
 20 |     %% Features
 21 |     FA[Code Analysis]
 22 |     FB[ADR Management]
 23 |     FC[Documentation Management]
 24 | 
 25 |     %% Testing
 26 |     TA[Server Testing]
 27 |     TB[SSE Testing]
 28 | 
 29 |     %% Components
 30 |     C1[Server Framework]
 31 |     C2[Testing Framework]
 32 |     C3[Documentation Tools]
 33 | 
 34 |     %% Implementation Files
 35 |     I1[test_server_instance.py]
 36 |     I2[SSETestManager.py]
 37 |     I3[ServerTestFramework.py]
 38 |     I4[DocNode.py]
 39 |     I5[DocumentationMap.py]
 40 | 
 41 |     %% Core Classes
 42 |     CC1[ServerConfig]
 43 |     CC2[ErrorCode]
 44 |     CC3[ComponentState]
 45 |     CC4[TaskTracker]
 46 |     CC5[DocumentationType]
 47 | 
 48 |     %% Relationships - Core Systems
 49 |     CS1 --> CC1
 50 |     CS2 --> CS1
 51 |     CS2 --> CS7
 52 |     CS3 --> CC4
 53 |     CS4 --> CC3
 54 |     CS5 --> CC2
 55 | 
 56 |     %% Relationships - ADRs
 57 |     ADR1 --> I1
 58 |     ADR1 --> C1
 59 |     ADR2 --> I2
 60 |     ADR2 --> TB
 61 |     ADR3 --> I3
 62 |     ADR3 --> C2
 63 |     ADR4 --> I4
 64 |     ADR4 --> I5
 65 |     ADR4 --> C3
 66 | 
 67 |     %% Relationships - Features
 68 |     FA --> CS2
 69 |     FA --> CS1
 70 |     FB --> ADR1
 71 |     FB --> ADR2
 72 |     FB --> ADR3
 73 |     FB --> ADR4
 74 |     FC --> C3
 75 |     FC --> CC5
 76 | 
 77 |     %% Relationships - Testing
 78 |     TA --> I1
 79 |     TA --> I3
 80 |     TB --> I2
 81 |     TB --> ADR2
 82 | 
 83 |     %% Component Relationships
 84 |     C1 --> CC1
 85 |     C1 --> CS4
 86 |     C2 --> I2
 87 |     C2 --> I3
 88 |     C3 --> I4
 89 |     C3 --> I5
 90 | 
 91 |     %% Error Handling
 92 |     CS5 --> FA
 93 |     CS5 --> FB
 94 |     CS5 --> FC
 95 |     CS5 --> CS1
 96 |     CS5 --> CS2
 97 |     CS5 --> CS3
 98 | 
 99 |     %% Styling
100 |     classDef adr fill:#f9f,stroke:#333,stroke-width:2px
101 |     classDef feature fill:#bbf,stroke:#333,stroke-width:2px
102 |     classDef testing fill:#bfb,stroke:#333,stroke-width:2px
103 |     classDef component fill:#fbb,stroke:#333,stroke-width:2px
104 |     classDef implementation fill:#ddd,stroke:#333,stroke-width:1px
105 |     classDef core fill:#ffd,stroke:#333,stroke-width:2px
106 |     classDef class fill:#dff,stroke:#333,stroke-width:1px
107 | 
108 |     class ADR1,ADR2,ADR3,ADR4 adr
109 |     class FA,FB,FC feature
110 |     class TA,TB testing
111 |     class C1,C2,C3 component
112 |     class I1,I2,I3,I4,I5 implementation
113 |     class CS1,CS2,CS3,CS4,CS5,CS6,CS7 core
114 |     class CC1,CC2,CC3,CC4,CC5 class
115 | ```
116 | 
117 | ## Documentation Map Legend
118 | 
119 | ### Node Types
120 | - **Purple**: Architecture Decision Records (ADRs)
121 | - **Blue**: Feature Documentation
122 | - **Green**: Testing Documentation
123 | - **Red**: Key Components
124 | - **Gray**: Implementation Files
125 | - **Yellow**: Core Systems
126 | - **Light Blue**: Core Classes
127 | 
128 | ### Relationship Types
129 | - Arrows indicate dependencies or references between documents
130 | - Direct connections show implementation relationships
131 | - Indirect connections show conceptual relationships
132 | 
133 | ### Key Areas
134 | 1. **Core Systems**
135 |    - Vector Store and Knowledge Base
136 |    - Task Management and Health Monitoring
137 |    - Error Handling and Metrics Collection
138 |    - Cache Management
139 | 
140 | 2. **Testing Infrastructure**
141 |    - Centered around ADR-0001 and ADR-0002
142 |    - Connected to Server and SSE testing implementations
143 | 
144 | 3. **Documentation Management**
145 |    - Focused on ADR-0004
146 |    - Links to Documentation Tools and models
147 | 
148 | 4. **Feature Implementation**
149 |    - Shows how features connect to components
150 |    - Demonstrates implementation dependencies
151 | 
152 | 5. **Error Handling**
153 |    - Centralized error management
154 |    - Connected to all major systems
155 |    - Standardized error codes and types 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/server_test_isolation.py:
--------------------------------------------------------------------------------

```python
 1 | """Test isolation for ServerState.
 2 | 
 3 | This module provides utilities to create isolated ServerState instances for testing,
 4 | preventing state conflicts between parallel test runs.
 5 | """
 6 | 
 7 | from typing import Dict, Optional
 8 | import asyncio
 9 | import uuid
10 | import logging
11 | 
12 | from .core.state import ServerState
13 | from .utils.logger import get_logger
14 | 
15 | logger = get_logger(__name__)
16 | 
17 | # Store of server states keyed by instance ID
18 | _server_states: Dict[str, ServerState] = {}
19 | 
20 | def get_isolated_server_state(instance_id: Optional[str] = None) -> ServerState:
21 |     """Get or create an isolated ServerState instance for tests.
22 |     
23 |     Args:
24 |         instance_id: Optional unique ID for the server state
25 |                    
26 |     Returns:
27 |         An isolated ServerState instance
28 |     """
29 |     global _server_states
30 |     
31 |     if instance_id is None:
32 |         # Create a new ServerState without storing it
33 |         instance_id = f"temp_{uuid.uuid4().hex}"
34 |         
35 |     if instance_id not in _server_states:
36 |         logger.debug(f"Creating new isolated ServerState with ID: {instance_id}")
37 |         _server_states[instance_id] = ServerState()
38 |     
39 |     return _server_states[instance_id]
40 | 
41 | async def cleanup_all_server_states():
42 |     """Clean up all tracked server states."""
43 |     global _server_states
44 |     logger.debug(f"Cleaning up {len(_server_states)} isolated server states")
45 |     
46 |     # Make a copy of the states to avoid modification during iteration
47 |     states_to_clean = list(_server_states.items())
48 |     cleanup_tasks = []
49 |     
50 |     for instance_id, state in states_to_clean:
51 |         try:
52 |             logger.debug(f"Cleaning up ServerState: {instance_id}")
53 |             if state.initialized:
54 |                 # Get active tasks before cleanup
55 |                 active_tasks = state.get_active_tasks()
56 |                 if active_tasks:
57 |                     logger.debug(
58 |                         f"Found {len(active_tasks)} active tasks for {instance_id}"
59 |                     )
60 |                 
61 |                 # Schedule state cleanup with increased timeout
62 |                 cleanup_task = asyncio.create_task(
63 |                     asyncio.wait_for(state.cleanup(), timeout=5.0)
64 |                 )
65 |                 cleanup_tasks.append((instance_id, cleanup_task))
66 |             else:
67 |                 logger.debug(f"Skipping uninitialized ServerState: {instance_id}")
68 |         except Exception as e:
69 |             logger.error(
70 |                 f"Error preparing cleanup for ServerState {instance_id}: {e}",
71 |                 exc_info=True
72 |             )
73 |     
74 |     # Wait for all cleanup tasks to complete
75 |     if cleanup_tasks:
76 |         for instance_id, task in cleanup_tasks:
77 |             try:
78 |                 await task
79 |                 logger.debug(f"State {instance_id} cleaned up successfully")
80 |                 
81 |                 # Verify no tasks remain
82 |                 state = _server_states.get(instance_id)
83 |                 if state and state.get_task_count() > 0:
84 |                     logger.warning(
85 |                         f"State {instance_id} still has {state.get_task_count()} "
86 |                         "active tasks after cleanup"
87 |                     )
88 |             except asyncio.TimeoutError:
89 |                 logger.warning(f"State cleanup timed out for {instance_id}")
90 |                 # Force cleanup
91 |                 state = _server_states.get(instance_id)
92 |                 if state:
93 |                     state.initialized = False
94 |             except Exception as e:
95 |                 logger.error(f"Error during state cleanup for {instance_id}: {e}")
96 |     
97 |     # Clear all states from global store
98 |     _server_states.clear()
99 |     logger.debug("All server states cleaned up") 
```

--------------------------------------------------------------------------------
/src/mcp_codebase_insight/core/task_tracker.py:
--------------------------------------------------------------------------------

```python
  1 | """Task tracking and management for async operations."""
  2 | 
  3 | import asyncio
  4 | import logging
  5 | from typing import Set, Optional
  6 | from datetime import datetime
  7 | 
  8 | from ..utils.logger import get_logger
  9 | 
 10 | logger = get_logger(__name__)
 11 | 
 12 | class TaskTracker:
 13 |     """Tracks and manages async tasks with improved error handling and logging."""
 14 |     
 15 |     def __init__(self):
 16 |         """Initialize the task tracker."""
 17 |         self._tasks: Set[asyncio.Task] = set()
 18 |         self._loop = asyncio.get_event_loop()
 19 |         self._loop_id = id(self._loop)
 20 |         self._start_time = datetime.utcnow()
 21 |         logger.debug(f"TaskTracker initialized with loop ID: {self._loop_id}")
 22 |     
 23 |     def track_task(self, task: asyncio.Task) -> None:
 24 |         """Track a new task and set up completion handling.
 25 |         
 26 |         Args:
 27 |             task: The asyncio.Task to track
 28 |         """
 29 |         if id(asyncio.get_event_loop()) != self._loop_id:
 30 |             logger.warning(
 31 |                 f"Task created in different event loop context. "
 32 |                 f"Expected: {self._loop_id}, Got: {id(asyncio.get_event_loop())}"
 33 |             )
 34 |         
 35 |         self._tasks.add(task)
 36 |         task.add_done_callback(self._handle_task_completion)
 37 |         logger.debug(f"Tracking new task: {task.get_name()}")
 38 |     
 39 |     def _handle_task_completion(self, task: asyncio.Task) -> None:
 40 |         """Handle task completion and cleanup.
 41 |         
 42 |         Args:
 43 |             task: The completed task
 44 |         """
 45 |         self._tasks.discard(task)
 46 |         if task.exception():
 47 |             logger.error(
 48 |                 f"Task {task.get_name()} failed with error: {task.exception()}",
 49 |                 exc_info=True
 50 |             )
 51 |         else:
 52 |             logger.debug(f"Task {task.get_name()} completed successfully")
 53 |     
 54 |     async def cancel_all_tasks(self, timeout: float = 5.0) -> None:
 55 |         """Cancel all tracked tasks and wait for completion.
 56 |         
 57 |         Args:
 58 |             timeout: Maximum time to wait for tasks to cancel
 59 |         """
 60 |         if not self._tasks:
 61 |             logger.debug("No tasks to cancel")
 62 |             return
 63 |         
 64 |         logger.debug(f"Cancelling {len(self._tasks)} tasks")
 65 |         for task in self._tasks:
 66 |             if not task.done() and not task.cancelled():
 67 |                 task.cancel()
 68 |         
 69 |         try:
 70 |             await asyncio.wait_for(
 71 |                 asyncio.gather(*self._tasks, return_exceptions=True),
 72 |                 timeout=timeout
 73 |             )
 74 |             logger.debug("All tasks cancelled successfully")
 75 |         except asyncio.TimeoutError:
 76 |             logger.warning(f"Task cancellation timed out after {timeout} seconds")
 77 |         except Exception as e:
 78 |             logger.error(f"Error during task cancellation: {e}", exc_info=True)
 79 |     
 80 |     def get_active_tasks(self) -> Set[asyncio.Task]:
 81 |         """Get all currently active tasks.
 82 |         
 83 |         Returns:
 84 |             Set of active asyncio.Task objects
 85 |         """
 86 |         return self._tasks.copy()
 87 |     
 88 |     def get_task_count(self) -> int:
 89 |         """Get the number of currently tracked tasks.
 90 |         
 91 |         Returns:
 92 |             Number of active tasks
 93 |         """
 94 |         return len(self._tasks)
 95 |     
 96 |     def get_uptime(self) -> float:
 97 |         """Get the uptime of the task tracker in seconds.
 98 |         
 99 |         Returns:
100 |             Uptime in seconds
101 |         """
102 |         return (datetime.utcnow() - self._start_time).total_seconds()
103 |     
104 |     def __del__(self):
105 |         """Cleanup when the tracker is destroyed."""
106 |         if self._tasks:
107 |             logger.warning(
108 |                 f"TaskTracker destroyed with {len(self._tasks)} "
109 |                 "unfinished tasks"
110 |             ) 
```