Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> You get a cool page like this -> https://shepherd.com/bboy/2024/f/bwb

How do I get to my page? The profile section is very spartan. I'm not even sure whether my profile picture was submitted.



Yep go here: https://shepherd.com/bboy/my-3-fav-reads

To submit your 3 favorites, and then we will email you once the page is ready.

Ya sorry, brand new and working on the profile to improve it and the image bit.

Hit me up at ben@shepherd.com if you need any help.


> then we will email you once the page is ready.

Ah, I completely missed this part. Long day.

> Ya sorry, brand new and working on the profile to improve it and the image bit.

Don't worry, so far I like the site quite a bit, especially since there is much more about ancient mesopotamia than I had thought!


Sweet thanks!

I am going to work a ton this winter to improve topic/genre accuracy, as I think we have more on Mesopotamia but the system is struggling to tag it correctly. We are using an older NLP/ML system that isn't working as well as newer ones. Update coming soon.


Great to hear. Also, a very interesting topic for me since this is close to my masters degree and current job. For a niche topic like ancient mesopotamia I was surprised. Is there a blog post or something about how the site works behind the scenes?


Very cool! What is your job?

I blog every 2 to 3 weeks here about building it: https://build.shepherd.com/

But I am not a developer, so it only goes into things lightly, like our topic system with NLP/ML and other topics. (hoping to be a dev in a few years enough to work on it as well)

Here are a few, are these interesting or too mundane? https://build.shepherd.com/p/a-big-focus-for-2024-improving-... https://build.shepherd.com/p/building-shepherd-updated-topic... https://build.shepherd.com/p/building-shepherd-topic-pages-n... https://build.shepherd.com/p/sneak-peek-at-genre-and-age-gro...

It is python / Django on the backend, nothing crazy I think.


Currently, I'm working as a Data Scientist on document retrieval/text matching, and I have a masters degree in a topic close to computer science and linguistics.

> Here are a few, are these interesting or too mundane?

Thanks, I'll have a look. It's quite interesting, even if it's not very technical, since recently I started thinking about building a book trade or selling network for close friends and their friends (invitation based).

> It is python / Django on the backend, nothing crazy I think.

A solid choice, I think.

Do you use something like openlibrary.org as well?


Nice!

That is a very cool idea! I've been thinking about something like that to help fund the website, which is a network of book trades, and you get credit for a new used book for every trade. Although I was going to charge $10 a year or $1 a book to go 80% toward authors and 20% toward the website.

> Do you use something like openlibrary.org as well?

When I last looked in 2020, the data quality on open library was really bad, so I didn't use them. We created every book recommended in our system manually, as we needed to find high-quality cover images, and it was an easy way to start.

Eventually, we licensed data from the Nielsen API. It has not been a great experience, but it works "ok." I also looked at Ingram and Bokwer. Now things are 90% automated, but we still have to source a high quality cover for each book as the book APIs have such small images.

I am hoping to expand our book DB in 2025 to all books for a lot of the new features. I am going to look at Open Library again then, as it might work better for that limited functionality. I also could have been too pedantic back in 2020 when i looked at them :)


> That is a very cool idea! I've been thinking about something like that to help fund the website, which is a network of book trades, and you get credit for a new used book for every trade. Although I was going to charge $10 a year or $1 a book to go 80% toward authors and 20% toward the website.

Sounds like a nice way to generate some funding, especially as you already have a number of users on your site to start with. I have no idea how much bureaucracy it takes to be able to take 5% of each transaction or so, so I'll probably leave mine completely free - if I ever get the time to implement it, that is.

> When I last looked in 2020, the data quality on open library was really bad, so I didn't use them.

> Now things are 90% automated, but we still have to source a high quality cover for each book as the book APIs have such small images.

Not great, since that would have been a starting point for me. ;) Cool though, that you can invest into a custom database, since you can tailor it to your task and aren't relying on potentially bad data.

> I am hoping to expand our book DB in 2025 to all books for a lot of the new features.

Is there a public roadmap somewhere for the features?

> I also could have been too pedantic back in 2020 when i looked at them :)

Maybe, maybe not. It takes huge effort to go back and improve a system relying on bad data, especially if it's already a certain size. I know from experience, as my company just did that with a large internal catalogue. Not fun.


I chatted with someone who had a platform like this in the past, and they said it worked really well. They ended up shutting it down when a server lost data or something weird. They also made a fair amount of money from links to buy the book new if there were no used copies available to "order."

>Not great, since that would have been a starting point for me. ;) Cool though, that you can invest >into a custom database, since you can tailor it to your task and aren't relying on potentially bad > data.

Legally, you can probably use the Google Books API for a project like this (whereas I couldn't due to their rules). Or, you could also use Open Library since you don't need great data quality, only the title and author to get this running. For me it was some of the other quality issues that made it not worth my time back then.

>Is there a public roadmap somewhere for the features?

Not a good one: https://forauthors.shepherd.com/roadmap

I promise I will update it this weekend. I am trying to find a better embed than this text list (behind the scenes, we have a much better system).

Basically, it is:

Roll out a better ad system for our Founding Author Members (as they are heavily funding the website).

Roll out book series pages (and test a notification system for users there)

Ship a massive update to our bookshelf collections of genres, age groups, and topics. This will visually navigate and break down the most loved books of all time, trending, new, and some other cool stuff.

Improve the accuracy of our genre / topic system (as of right now, it is not doing well). And I am working to add themes and tropes into he mix.

Big improvement to book section UX.

Building a DB of all books to power features needing that going forward (going to try to see if I can use Open Library).

Add a monthly "fav read of month" program for readers.

It is a rough list; still testing and thinking on a lot of these.

And waiting for a lot of data to come back on our personalized email list -> https://shepherd.com/my-book-dna

As I am trying to really do something cool with email and waiting to make sure engagement looks good with 1,000 subscribers before I start evolving and improving it.


Good point, I forgot about affiliate links.

> Legally, you can probably use the Google Books API for a project like this (whereas I couldn't due to their rules). Or, you could also use Open Library since you don't need great data quality, only the title and author to get this running. For me it was some of the other quality issues that made it not worth my time back then.

I think, UX wise I'd be in a similar position to you, since 1.) I'd need a short description of the book as well, and 2.) definitely in german, too. So, I'd probably have to create a lot of data myself.

> Not a good one: https://forauthors.shepherd.com/roadmap

> Improve the accuracy of our genre / topic system (as of right now, it is not doing well). And I am working to add themes and tropes into he mix.

I think, that's a good focus. Good quality structured data is quite good to have.

> Add a monthly "fav read of month" program for readers.

I'd definitely use this. Are you planning on adding a lot more interactivity/"blogging" features on the reader side?

It'll be interesting to see the development in the next year, big plans! I'll definitely keep an eye on the site.


Ah ya, I don't know about German, maybe there is a free source in Germany?

>I'd definitely use this. Are you planning on adding a lot more >interactivity/"blogging" features on the reader side?

Yep working that way in 2025 :)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: