puter/devlog.md at c53107db67dae3125c9ad6f36fd5a78133052dc9

mirrors/puter

Fork 0

mirror of https://github.com/HeyPuter/puter synced 2024-11-14 22:06:00 +00:00

KernelDeimos 9535100c85 doc: add brainstorming for mountpoints

2024-10-17 03:00:35 -04:00

4.5 KiB

Raw Blame History

2024-10-16

Considerations for Mountpoints Feature

_storage_upload takes paramter uuid instead of path
- S3 bucket strategy needs the UUID
  - If we do hashes, 10MB chunks should be fine
    - we're already able to smooth out bursty traffic using the EWA algorithm
Use of systemFSEntryService
- Is that normalized? Does everything go through this interface?
Storage interface has methods like post_insert
- as far as I can tell this doesn't pose any issue

Brainstorming Migration Strategies

Interface boundary at HL<->LL filesystem methods

-- tags: brainstorming

From the perspectice of a trait-oriented implementation, which is not how LL/HL filesystem operations are currently implemented, the LL-class operations are implemented in separate traits.

The composite trait containing all of these traits would be the trait that represents a filesystem implementation itself.

Other filesystem interfaces that I've seen, such as FUSE and 9p, all usually have a monolithic interface - that is to say, an interface which includes all of the filesystem operations, rather than several interfaces each implementing a single filesystem operaiton.

Something about the fact that the LL-class operations are in separate classes makes it difficult to reason about how to move. Is it simply that multiple files in a directory is just more annoying to think about? Maybe, but there must be something more.

Perhaps it's that there are several references. Each implementation (that is, implemenation of a single filesystem operation) could have any number of different references across any number of different files. This would not be the case with a monolithic interface.

I think the best of both worlds would be to have an interface representing the entire filesystem and, in one place, link of of the individual operation implementations to compose a filesystem implementation

Filesystem Brainstorming

Puter's backend uses a service architecture. Each service is an instance of a class extending "Service". A service can listen to events of the backend's lifecycle, interact with other services, and interact with external interfaces such as APIs and databases.

Puter's current filesystem, let's call it PuterFSv1, exists as the result of multiple services working together. We have LocalDiskStorageService which mimics an S3 bucket on a local system, and we have DatabaseFSEntryService which manages information about files, directories, and their relationships within the database, and therefore depends on DatabaseAccessService.

It is now time to introduce a MountpointService. This will allow another service or a user's configuration to assign an instance of a filesystem implementation (such as PuterFSv1) to a specific path.

The trouble here is that PuterFSv1 is composed of services, and the nature of a service is such that it exists for the lifecycle of the application. The class for a particular service can be re-used and registered with multiple names (creating multiple services with the same implementation but perhaps different configuration), but that's only a clean scenario when there is just one service. PuterFSv1, on the other hand, is like an imaginary service composed of other services.

The following possibilities then should be discussed:

CompositeService base class for a service that is composed of more than one service.
Refactor filesystem to not use service architecture.
Each filesystem service can manage state and configuration for multiple mountpoints (I don't like this idea; it feels messy. I wonder what software principles this violates)

We can take advantage of traits/interfaces here. PuterFSv1 depends on two interfaces:

An S3-like data storage implementation
An fsentry storage implementation

Counterintuitively from what I first thought, "Refactor the filesystem" actually looks like the best solution, and it doens't even look like it will be that difficult. In fact, it'll likely make the filesystem easier to maintain and more robust as a result.

Additionally, we can introduce PuterFSv2, which will introduce storing data in chunks identified by their hashes, and associated hashes with fsentries.

PuterFSService will be a new service which registers 'PuterFSv1' with FilesystemService.

An instance of a filesystem needs to be separate from a mountpoint. For example, PuterFSv1 will usually have only one instance but it may be mounted several different times. /some-user on Puter's VFS could be a mountpoint for /some-user in the instance of PuterFSv1.

4.5 KiB Raw Blame History