A great use for reddit is the ability to search posts and opinions about any niche topic. Will that be possible with Lemmy as it grows? Will I be able to Google “instant rice Lemmy” and get a comprehensive tier list of each brand?
I imagine search engines will have trouble with all the different instances(?). EDIT: Especially with instances that don’t have Lemmy in their name, I don’t think search engines would return them for Lemmy searches?
So I’ve been working on a solution for this.
As I see it Google and others are going to have a hard if not impossible time to incorporate the fediverse, and the fact that the same content can exist on multiple servers.
So I’m working on a search engine specifically build, for Lemmy at least. Where it’ll take you to whatever your preferred instance is when tapping on a search result.
I hope to have a MVP up and running in a few more days.
Please pop a reminder here. Commenting for a bump.
Search their name on GitHub and you’ll find it. Star it to follow.
reminder: https://lemmy.world/post/963301
Can the reminder bots be migrated too?
Can’t emphasize enough how important this is for the growth of Lemmy. Many people I know only access Reddit through google searches.
Yep and I’m one of them. Go look me up on Reddit and I think I have maybe 20 posts over the 14+ years I was on the site. …joined Lemmy and immediately got frustrated that I couldn’t find anything. So I figured I take a crack at it. Especially since I couldn’t see how Google would ever be able to link me to my instance. Let alone make it easy to search the entire fediverse without having to write out every possible site, with new ones popping up every day.
Easier to find a Reddit post through Google than by Reddit search.
Interesting. I hadn’t even thought about how the fact that instance1.[post] and instance2.[post@instance1] is essentially the same thing and how search engines would handle it. Interested in what you come up with!
Thanks. If you do some digging you can find the project on GitHub but note that it’s a work in progress still. The UI is lacking and it’s rough around the edges but it’s “working”. And I still need to do some optimizations on the crawler itself, etc…
It’s also going to be completely self-hostable just like Lemmy, etc…
Hey, can you dm me the git link, i would like to contribute if i can : )
Search their name on GitHub
If this guy changes the internet include me in the screenshot.
I can see this being helpful
(commenting so I can bookmark)
we need a search engine for the entire fediverse, it would be a game changer
deleted by creator
That sounds awesome. Can’t wait to see it.
Shit dude, that’d be a sweet little tool.
I am surprised noone mentioned https://fedi-search.com . It’s working pretty well. Full credit to Benjamin Pryor for this
Tried it, pretty cool, though seems still depend on search engines’ indexing. The instance that I’m on seems not indexed.
Also it’s interesting it uses
intext
to identify whether the results are from fediverse.
Digg.com was the big thing with Reddit trailing. Digg began tweaking the experience toward a more profitable model. I had already come to Reddit when they went too far and there was a sudden enormous migration from Digg to Reddit. Digg went from being THE social media aggregator to being nothing in a matter of weeks.
Reddit is more deeply rooted, so I think it will stick around, I’m cool if Reddit keeps those who are happy with corporate model busy so we can do our thing here.
It’s certainly not going anywhere unless they end up selling it to someone who shuts it down and uses the posts and links as SEO boosting.
Well, Digg.com still exists. It’s just that no one cares.
when you just loaded their site to test you just doubled their monthly active users.
Most likely if it’s being sold for anything it’s for language model testing.
Maybe, but probable Google try to kill us
You can use a search query to include only results with Lemmy’s footer, which is consistent across all Lemmy instances. I made a post about it here: https://lemmy.world/post/342365
Reddit did not start out as the thing to google, it’s 15+ years old, only in the last 5y I started prefixing my google searches with reddit.
I actually found Reddit by googling things. I had seen it 5 or 6 times over a few years, and eventually I just went to the main site. I might have even used Reddit in the search before I joined. Regardless, I had recognized that all the best answers for tricky problems that I had were coming from Reddit before I even joined 11 years ago.
I realize Lemmy needs to get much bigger for that to happen. My question was more directed at how search engines would handle the fediverse. Though I see now that that wasn’t very clear.
Everyone’s experience on this will be different, but I personally started using reddit about 12 years or so ago largely because at that point a lot of my Google searches were already pointing me towards reddit. I wasn’t necessarily going to google specifically to find reddit results, but since that’s where I kept ending up i figured I might as well go straight to reddit. And since reddit’s search function is and has always been trash, i pretty much immediately started using Google to search reddit.
Basically use
<query> site:lemmy.world OR site:lemmy.ml OR site:beehaw.org OR site:kbin.social
(or whatever main instances you want to hit)You can also use this for custom browser search keys like the following https://duckduckgo.com/?q=%s+site%3Alemmy.world+OR+site%3Alemmyml+OR+site%3Abeehaw.org+OR+site%3Akbin.social
I imagine that would be quite inconvenient… Especially as Lemmy grows and has potentially many more instances.
I believe that DDG has a shorthand for site:Reddit (without the .com). If lemmy gets popular enough DDG may implement a similar shorthand that incorporates the fediverse without us having to use a massive string. Like if it gets big enough, we may not have to solve this problem because others will see the value in making it easy.
That’s my hope at least.
I think it is preferable to ask other search engines like DuckDuckGo to index Lemmy info. Google is full of garbage.
Brave Search would be better, they have a dedicated section on the results page for discussions.
Lemmy is not as unique as Reddit as a word. I get a lot of Lemmy Kilmister matches. But still hopeful
Use the exclusion keyword for your search provider. For example on google
lemmy -kilmister -motorhead
will get you only Lemmy software results by excluding pages with “kilmister” or “motorhead” in their contents.
I have seen at least one user claim they got a result from lemmy when searching a question on google. YMMV though. Lemmy is a fraction of the size of reddit, it will take time for posts to reach the level that google starts indexing them specifically.
I got one. The Google link brought me to the instance though and not the thread. I was able to find the thread though, so it kinda worked.
If Lemmy becomes a source of enough information like Reddit is, search engines will index it. SEO is a marketing thing, and a place like Lemmy doesn’t really need that. Google, DDG, etc. all put engineering effort into making sure sites with lots of information are indexed and available in their search results.
I would argue that eventually, yes, one will be able to google search Lemmy just like Reddit.
I wish there was a way to get an entire Reddit archive over here. Realistically I’m still going to have to search Reddit because it has 10+ years of answers to obscure questions.
Only if we make sure the tech giants don’t kill this platform
How would they? It’s all decentralized
Google could prevent lemmy pages from showing up in the results for example.
Or they could adapt the protocol, make their own slightly tweaked version of it and let it die, which apparently often also kills the original protocol due to newly introduced compatibility issues, etc.
Not sure about the second part, I read about it here somewhere where they mentioned an example of that happening as well but I can’t find it anymore.
Depends on Google. These tech companies don’t like new platforms, especially those competing with established ones like Reddit. You’ll see that Google often discriminates against Lemmy or Mastodon.
Correct me if I’m wrong but if individual admins allow their instances to be indexed wouldn’t the instance itself have some sort of metadata identifying it as a Lemmy branch?
Respectfully: Fuck that.
If you want to find the best instant rice recommendations on Lemmy, Lemmy should have a functional post search function, rather than me relying on a malevolent corporate entity like google to index all the content.
Search has gone to shit as the Internet has embraced social media sites, an upside of this is that wikipedia+Lemmy+key word search, mayas accurate as asking Google Bard or bing, and they can be built on entirety open tech.
Cool rage but you dismissing search indexing is kinda hilarious. It’s not going away and it’s what makes the web. Would you rather have 3 big websites instead of indexed web?
I came here to say something similar but you put it nicely.