When he died there was no way of transcribing them automatically (there still isn't really). The boxes stood in my mothers already cramped attic for 13 years, then she got cancer, and she felt a need to finish up things, so she got a scanner and started just scanning.
When my mother died she had scanned about a thousand pages, not transcribed, not anything.
The text in the diaries were fun at times, sometimes depressing, seeing how little he cared about my mother and his family was crushing.
My brother wanted to continue the scanning but I told him that I wanted to throw the diaries away. He kept half a year of writing around his birth (there's at least a sentence) and my uncle did the same, then we just watched it all burn (not literally, we threw it away at the recycling centre).
Not everything needs to be preserved. I'm happy some parts is preserved. I'm happy that those diaries are ash.
And I'm sorry your mom experienced that weight towards the end of her life. That sounds like a significant thing to grapple with, especially considering some of the not so pleasant content mentioned.
I feel like i don't know how to emotionally react to the AI part of this story. To begin with, it is fundamentally cool we have technology like that. At the same time it felt bittersweet, like an artisan being put out of business by the factory. The first part of the story felt like much of the love was in constructing everything by hand, it seems almost sad to lose that. There is also an element of dystopia in how the AI was able to cross reference everything, bank statements, ticketmaster recipts, shazam, etc. It is kind of unsettling the power of it all.
Not sure where i'm going with this comment. Its a super cool project, thanks for sharing.
Steve Jobs saw the computer as a bicycle for the mind, a way to enable us to do more and be more. This is the metaphor against which I measure all technology.
I think that in this case, it helped someone make something deeply human by abstracting the tedium away. It did what a computer should do: aid a human with their task.
Technology has been feeling like a devil's bargain for a while now. This was a rare glimpse of how I used to see tech, and of why I was so excited about it.
If you build this encyclopedia as a purely robotic collector of facts that nobody reads, it’s probably more dystopian horror.
If you build it as a fun inner loop that reconnects you with people and memories and makes you more human, then it’s great.
We should endeavor to craft experiences that do the latter; right now we are back in the hacker days when small teams can build big new ideas, and big tech hasn’t taken over.
It'd be better if we all turned this tech off and went to be with other people.
This is basically how social media was when you needed a computer to go online. You’d sit and sift through your feeds and there’d be message chains you respond to. You’re not really doing anything else while you’re doing that and you’re putting it out of mind once you step away. When Twitter first started getting big it was sort of a joke that people are talking to you while on the toilet. The idea that you were only ever half engaged with anything you’re doing was remarkable enough to be worth pointing out instead of taken for granted.
It’s just a lot more focused and intentional when you’re dedicating time and headspace to the task instead of “microdosing” on connection via a dopamine lottery. Even if you took away the ads and the interpolation of creator-content crowding out the connections with people you actually know, I think designing for an infinite scroll just inherently makes the thing less human-centered. It sets it up so you’re interacting with these atomized bits of ‘content’ rather than people.
There's a comment by bonoboTP in a sibling thread about the emotional complexity of a project like this. There are many ways to narrate a life story: many traumatic episodes and feuds better left forgotten, different framings, and all that emotional labor of trying to choose what and how you want to remember.
The use of LLMs for creating a shared view for some information isn't inherently morally dubious-processing and storing data is what computers have been doing for generations-except for the privacy implications, but letting this projection of a mega-corporation usurp the role of narrator for such a deeply personal story feels wrong on an instinctual level.
It is a nice idea, and I can imagine how it may serve to strengthen the family's social cohesion, in a time where everyone is busy doing the rat race. Though I'd not use it as "encyclopedia", a cold-hearted fact recorder, more like more a social-focused "Our Family Diaries" and would be much better served by family members writing down their own experiences.
Because without AI it probably wouldn’t exist.
I understand the bittersweet feeling because I did all the editorial work for the wedding page and the first few others and I did feel like a historian trying to connect the dots after stumbling into some primary/secondary materials and spending a couple months doing all the editorial work
after I began experimenting with agents, it sped up my process that otherwise would've taken many more months for every page given that the kinds of data sources also increased over time
I did still spend significant amounts of time like a wikipedia contributor would deciding on what to keep, enhance or delete from the page based on my own personal preferences and what I was comfortable with seeing on the page
the dystopic feeling is also fair and unsettling, I think this ironically also made me realize how important safeguarding my personal data is, we leave digital trails of ourselves everywhere so a powerful agent can string them together to create a story of who you are
That's the use-case I enjoy with AI. Let it do the heavy-lifting, I'll enjoy the rest.
AI here is not a tool, it's the author, or at the very least a co-author that greatly influences the human author. It selects what's important and then writes the narrative. It has its own biases. The narrative isn't based on what's personally important to the human creator, but rather the availability of data, those sources that are digitized. And then in turn the output shapes the human author's own perspective, changing even what the human will write on their own.
If instead, the OP had collected this information into a physical book, when they get bored or sick or dies, the book gets pushed into a closet or garage, waiting for some grandchild, nephew or niece to pull it out and rediscover the family history. And if anyone has even a slight interest in continuing the legacy, they don't have to know how to use a computer, just some basic scrapbooking skills, which we all learned in kindergarten.
What a lovely resource, especially if it reflects stories and recollections given by the subjects themselves.
The idea of having AI do it all is really off-putting IMO. For a number of reasons:
1) You lose the curation. You'll inevitably see a bias towards documenting based on the quality and availability of the sources as opposed to the significance of the event. E.g. you might not have much info about some really special childhood event you or someone else remembers, but does that mean it shouldn't be documented? Conversely, I don't want a 10,000 word essay on (to quote one of the titles from the post) "The 3D printing saga" -- just because I happen to have hundreds of WhatsApp messages on the subject.
2) I don't want to fact check every detail. Personally, I think if grandad (RIP) would have told me he one surfed a 20ft wave of the coast of Filey, Yorkshire. I don't need a correction that it was unlikely to be that high. If these things are partly being done "in memoriam" then I think it's really important to preserve the experiences, stories and recollections if the people we're trying to remember. Dates etc are fine to validate and correct. But there's an element of subjectivity to memories that is really special IMO. What even is reality at the end of the day? We're all just one big collective story we tell ourselves.
3) It feels soulless. Enough said on this one, I think people know what I mean
Similar with this, when you're hand curating old photographs and personally interviewing relatives, you're learning something. You're deepening relationships and your own personal understanding of these people you love, spending time reflecting on your own life. But when you send an LLM at it and it produces the volume of real Wikipedia, now an automated process is producing more text than you can ever possibly read if all you did for the rest of your life is read.
Throughout the year we keep writing in it, things we learnt, discords we had and how we resolved them, recipes I experimented with and we loved, random thoughts; basically anything and everything. And that little diary becomes an embodiment of that year.
I would also like to point out the manual labor and writing into it and not using an obsidian++-AI-auto-categorizer-3000 is simply because it feels like it's worth something, it's a nice little routine we have at the start of every year, and it's really fun reading these from 2-3 years back. Also the kids will have some really interesting reading a few years down the line.
I imagine a future where this becomes a family tradition that transcends time, knowledge from different generations, living different lives all nicely recorded in these codices. Something about this whole thing feels really beautiful to me.
I was thinking the other day I need to go back to a physical recipe book too. I don't cook that many different things that I need to reference it for, but there was a charm in my old one of remembering the best recipes were the ones covered in spilled ingredients and filled with marginalia.
Thanks for the resource!
Ideally square books that can go on a coffee table. At least when I am dead there will be some part of my existence in physical form, unlike all the digital things we spend decades creating.
I might put a SD card taped in the front of each one with a video too, so someone can watch it in the future.
As a separate aside, I also found old Canon photo printers (Selphy models) on ebay for about £5! Some need the little white gear inside glueing back on (there's a video on YouTube about it), and they DO NOT work with Windows anymore, but gutenprint supports them fine on Linux, so I have been printing photos (postcard size) at home. The colour isn't going to win awards and the saturation needs boosting slightly in the printer options compared to default, but it's a wonderful way to finally get some photos from trips on the walls.
I’ve also done some light-fast testing. Laser prints (both B&W and color) survive a long while in direct harsh sunlight left in the window of my Utah home. All types of pen I tried were faded within a couple years but Pencil survives.
A 360 degree stapler is a fantastic tool for quickly binding them.
I do something similar but with email and more pro-active [1]. I have created my son an email address when he was born and I'm sending him things from our lives and ask family members to to the same. Just to write them about themselves and send photos of their current homes and gardens and partners.
I imagining him looking through his email when he's 18 and reading personalize messages sent by family members who might no longer be with us then.
[1] https://blog.haschek.at/2024/leaving-a-digital-legacy.html
As I store everything in a local Vikunja instance for notes and WIP, here's the list of links I assembled relative to this (hopefully useful; it includes calendar templates so that I can make them for my mother-in-law):
https://github.com/berteh/ScribusGenerator
https://wiki.scribus.net/canvas/Useful_Free_Resources
https://www.opendesktop.org/p/1106678
https://www.opendesktop.org/browse?cat=196&page=1&ord=latest
https://www.pling.com/s/Artwork/browse?cat=196&ord=latest
https://wiki.scribus.net/canvas/CalendarWizard
https://github.com/RaffertyR/Year-Calendar-Script-for-Scribu...
https://wiki.scribus.net/canvas/Category:Scripts
https://wiki.scribus.net/canvas/Making_a_photobook_from_a_di...
https://wiki.rjcalow.co.uk/photography/make/designaphotobook...
https://github.com/PPSchL/scribus-photobook-scripts
https://github.com/RaffertyR/PhotoBookTools-for-Scribus
https://forums.scribus.net/index.php?topic=4081.0
https://wiki.scribus.net/canvas/Automatic_import_of_images_f...
https://wiki.scribus.net/canvas/Photo_Albums
https://github.com/hawbox/scribus-book-templates
https://forums.scribus.net/index.php?topic=3735.0
http://johnosterhout.com/basic-book-template-for-scribus/
When you find a print shop, they'll talk about margins and bleeds, so it might be worth finding a print shop first to know what bleed zones you want on the pages and whether they expect left page first, or right page first.
Once you know that, you can set up Scribus appropriately.
We have started asking old family members to send us whatsapp audios with tales and things they remember from long-passed away family members; and what was life like in the 1930-40-50s. I want to start organizing all the info and data we have, my father has built a couple family trees, but this wiki format is indeed very promising. I'll keep an eye on this and see if we can use it.
My partner's ancestors came from Sitges (in fact, one of them was the mayor of the town), back in 1820s or 1830s - to Argentina, and from there to Montevideo, Uruguay. Among the various marriages in the generations, there's a Scottish clan, and English ancestry intermixed with Spaniards. She can trace her roots back to some of the founding members and prominent political families there.
The last time we were in Scotland, we found the clan she's from - but couldn't ascertain the ship they took to Argentina :-/ That's left as an exercise for some future trips.
I wouldn't give a LLM run by a US corporation access to my private photographs.
From my perspective, the American President has threatened to annex my country, American businesses have repeatedly violated my trust, spyed on me and leaked my data, and American big tech is meddling in my country's politics. No other country has demonstrated such an ability and willingness to collect information about me and use it against me.
Given the US' NSA's long-standing violation of human rights at massive scale, and the proclivity of American society to be reasonable about kidnapping people, deemed unsavory, off the streets by jackboot thugs - and the fact that China builds roads, hospitals, ports, and communities around the world in nations considered 'inferior' by America's military junta/oligarch ruling class, while America bombs them into oblivion - I'm fine with the idea of eschewing American AI.
Its kind of necessary, I think, to resist this at the moment - at scale too, I might add.
If Americans want to fix this they still can - time is running out, however.
What I don't want to do is give it to services with an agenda to abuse the data, particularly those profiling individuals for profit. Frankly, I'd trust a Chinese service more than I would an Adtech based one, but that's still not much.
https://i.kym-cdn.com/photos/images/original/001/259/257/342...
The MediaWiki server died and I had backups, but... literally no one in the family would've tried to resurrect it.
They knew I'd worked on genealogy for a while but I don't think anyone would've thought to rebuild a linux box covered in dust and somehow find an old MediaWiki install on it.
I should've made simple markdown files with images in an image directory and printed out copies. That's a legacy. A consolidated, easy to drag from grandpa's house and throw on a shelf and flip through, even in 2097.
Is anyone else feeling uncomfortable with that? It is a great project and I don't want to bash it with general concerns, but sharing all my financial and location details with any service seems like opening the floodgates to my house.
My concern is not even strictly related to AI, but about sharing all my most private data with any service. There is always a significant chance all of it is leaked sooner or later.
As an adversarial/worst-case model, it can be useful to think of every service as potentially storing forever all the data that you ever give it access to. As a practical matter, services have terms of service that they follow. If your Claude Code terms say that your data will not be used for training, you can be reasonably confident that they will not be, and storing the raw inputs forever (as suggested by “significant chance all of it is leaked sooner or later”) would be even more unlikely. (For example, Google has entire teams dedicated to compliance with users' “wipeout” settings. You can take a look at https://myactivity.google.com and https://myadcenter.google.com to see some of what Google knows and thinks about you, and if you've chosen "Auto-Delete after 3 months" or whatever, you can be very sure it will be gone after that time. Every single team that stores user data is required to comply with this.)
I do think the services make it harder than it should be, to find out what the terms are — for a given usage of their services whether and for how long the details will be stored by them. Just saying that you can find this out and generally rely on it at least at the time (at a reasonable threat model, e.g. not treating the service as a malicious adversary having a giant law-breaking conspiracy that has never been exposed).
Unlike some of the comments herein, I find this as a perfect use of technology in service of users. (Yes, with some limits). I liken this to Maggie Appleton's Home-cooked Software model [1], wherein barefoot developers use technology (AI-driven or not) for writing apps for their own purposes, nominally for a user base of 1 (or very few), with possibilities of expanding to a few dozen.
In that vein, I'm a barefoot developer, and much of the software I have written in the past few months (with help from Claude, ChatGPT) is very much for that tiny user base of a few dozen (=mostly me, if I'm honest). And that is perfectly fine by moi.
I wrote a utility to organize roughly 100K+ photographs (and videos) neatly into dates/location, both for backup, as well as to maintain the memories in an organized fashion. Asked Claude to lookup location by EXIF; haven't yet asked it to "guess the location by photo" when no GPS info existed in the EXIFs. But I think I might do that.
(no, I haven't asked Claude to go thru my Uber trips or bank statements! I draw a line there!)
That is why the OP's personal wiki made me so excited - because the whole output resonates with me.
Like a few commenters mentioned their journaling experiences. I've started doing that with some of our trips (mostly post pandemic), both to remember our experiences better, and to come back to them as needed. The simple act of writing down places visited, experiences had (mostly hikes, mountains climbed, meals consumed in distant foreign places, weird/quirky experiences) causes them to be fresh in one's memories.
Thanks, this was a great project, and a great reminder as well.
It is stalker-ish to write up biographies like this about your relatives. It's one thing to write up the weddings and upbeat things like this, but not all families lives are just sunshine and rainbows.
How about that relative of the family who spent time in prison? Grandpa in war? Many old people don't naturally talk about some parts of their lives either because they suffered some injustice like (what as an Eastern European I can think of) their properties taken away by Nazis and Soviets, or they did something they aren't proud of. Are you going to oral history interview/interrogate them to fill in all the gaps? Do you tell them you're going to upload all they say to some servers where who knows who will have access to it?
There are also longlasting family feuds between sides of families, like how one son was tricked out of the inheritance maybe wrongly, maybe he was an ass to his parents. People holding grudges and explaining their life failure and derailment by wrongly or rightly blaming others.
Maybe your aunt is presenting a story that doesn't quite add up when you triangulate it from all OSINT and private sources. Maybe your cousin isn't the daughter of who you think she is. Is it your business?
Even if no such big thing factor in, a biography of a person will be very subjective. You can narrate the same life in many ways so they appear more or less successful or an asshole.
Its fine to keep these things as oral history and memory that fades.
I don't really care about what the regular people who were my great grandparents and their cousins did. Maybe if I could read all the drama, I'd end up hating a bunch of relatives. These things have a natural life cycle of forgetting. That's fine.
Again, it's all well if you live in a family where everyone is nice and everyone was successful and helpful. Otherwise it's a can of worms. Nerds can be a bit blind to this as they just want to play with the toys and treat it like some logic puzzle.
Obviously not everyone has same needs or wants to retain stories and memories but lack of social structures and solutions seems like weird mishap.
Though from the title I didn't expect family history, I thought it was going to be more of a project like this: https://shii.bibanon.org/shii.org/knows/Everything_Shii_Know...
I had started something similar with my mom over Christmas in '24. About half way through the collection she asked to stop. We would do the rest on her next visit.
Well. It never came to that as she passed away completely unexpected in March last year.
I’ll never get the chance to record the other stories. The stories from the second half of the photo collection.
I cheer for projects like this.
Secondly, the home page seems like I am reading a family history page more than talking about the software. It is confusing to me.
Thanks for sharing.
I would probably have ended well before "I exported my Google Maps location history, Uber trips, bank transactions, and Shazam history."
Aside: I've started seeing lot of AI projects in this category say some variation of:
> it runs on your machine, your data stays with you, and any model can read it
I don’t think people fully appreciate the tension in those claims, especially when the model most area reaching for is Claude or GPT or Gemini. I think these things need more precise language about where data actually goes and what tradeoffs users are implicitly accepting.
I do like the idea of building up this history of people, and maybe when my parents pass I'll make theirs public and so on. Great work, dude! I love it.
https://wiki.roshangeorge.dev/w/Blog/2025-02-07/A_Confused_K...
Sure, the wiki is private. However, in the process your data is being uploaded straight to an AI company. Of course local LLMs exist but that’s seemingly not supported here and I think the statement on privacy could be clearer.
The family has a TON of videos and photos, but no resource to guide us through what is what.
The bank transaction + location cross referencing to figure out which restaurants you went to is pretty cool. Would be great if this could pull in social media exports too. Point it at your X, IG, FB archives, let it draft pages/content from that.
Any plan for a timeline view? Wiki format works well for depth but sometimes you just want to scroll through a year.
Right now, my wife and I are sticking to annual photo albums. They're already fun to flip through and we're not even that old yet.
Nice work!
I'll look into this more: Most appreciated, thank you.
Each year I have the wife take curated photos from our shared accounts with an overview of the event photographed.
This is then bound into a 1/2 inch book with 50 pages. We now have a dozen years of annualized memories that we can pass around with physical access.
She has done this for others with great success. The personal touches make it well worth what she charges.
I started running an private MediaWiki instance during the pandemic as I wanted something with a nice editing experience rather than editing markdown documents. I almost went with a self-hosted Confluence instance :P
Mediawiki is very very nice and it has a lot of cool features i've been loving over the years.
One of the things i like the most is the ability to embed a PDF document so that it's both downloadable and browsable from the wiki page itself (it embeds the browser pdf engine).
This means that i can, say, have a page for my microwave oven and have its user manual easily available.
Lately I've been thinking how to connect that with some LLM, most likely there's a chance to do some interesting things :)
The product naming is becoming harassment. When it's in the title, at least we know. When it's in the intro, we know what we are getting into.
What really pinch is that this project could have easily been done with some scripting, open sourced, and anyone could do it at zero cost, with total privacy.
It would in fact make fore a better result, a family wiki with content that's AI generated may overall look accurate, but the sloppy parts ridiculous. If I find an archive I would rather assemble the information myself.
> The model traced the arc of our friendships through the messages, pulled out the life episodes we had talked each other through, and wove them into multiple pages that read like it was written by someone who knew us both. When I shared the pages with my friends, they wanted to read every single one.
This is a stunning violation of the privacy of your friends.
If someone uploaded every single private conversation I had had with them to Anthropic, they would no longer be my friend.
And that brings me to my point. I've been thinking a lot lately about digital legacy. When I was a kid, it was neat to see photo books that showed my parents as kids, living their lives, having fun. Though those memories stand out to me, it's not something you revisit often. With digital memories, you can share them constantly, in great quantities. But what if you want them to stick around?
First, I think in early 2000s brain, and I think about how I've got domain names and web sites, and some of them include family photos and forums. The only way to keep them around is some kind of durable host, and a way for someone interested to get to that hosted data. Cloud + domain names = unmaintained software but subscription-based expense in perpetuity.
What about a box? A server you could plug in anywhere, uses dynamic DNS to "hook in" to the internet, and you just maintain a domain name. You could update it while you're alive, but eventually it would just be a "photo book" people could choose to pass around and connect if they so wished. And the domain name could be pre-paid for a while, but eventually die, many years after you.
Now whether you need/want a digital legacy is probably more a question of ego, and how much those you leave behind want a way to revisit memories of your life and the lives of those you touched. But if you do want that, it's not as easy as printing out a photo book, or printing photos and sliding them behind those plastic sleeves, and passing that from household to household.
I'm currently in the very early stages of going through several DVDs worth of digital photos my late grandmother took, and thinking of ways to organize them and share them with my family. And I'm wondering if I can make whatever I come up with "reasonably" durable.
On the technical side perhaps the shared nature of this helps - if you can have something replicated so that you and several other members are all running replicas there’s a
On the non technical side, take some photos and print them on good paper. Print out stories on paper.
That doesn’t cover video and perhaps other things but it’s simple and does actually work for lots and lots of stories and pictures. It’s also immediately doable right now without anything new.
This chap made it a labor of love.
This project I thought was a nice creative project. But then, as with all creative projects, I get the nagging question - who is going to use/read/wear the outputs of this work? But that's not really the point for a hobby is it? My conclusion: I should be less negative :D
Then I forgot about it. It’s not like the data is lost, but availability is. Bringing it back up is a pain. I could probably do it in a full day of work.
What I learned: Static HTML export on every change by default is a must. I don’t think HTML will cease to be easily readable in our lifetimes.
It's magical watching people learn about hyperlinks. Even technical people don't always seem to know the power of a string that says, "Go to this server and fetch this document". Love it
Video >photo >audio >text
The genealogy part – researching my ancestors' life – feels more useful.
I see this more as a digital artifact for future generations. I would love to read all about the events in the lives of my ancestors (no matter how detached the narration) going back generations.
Imagine if you could read in detail about your parental ancestors in 1500s, what they worked as, what they liked doing, where they spent their first holiday together…
Please don't pick the most provocative thing in an article or post to complain about in the thread. Find something interesting to respond to instead.