Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> NYT needs to know how long you spent, on which articles, etc. They need data to produce the product and you can only achieve that with javascript tracking pixels (Server logs aren't good enough).

No they don't. They really don't need to know any of that. They don't even get a pass on tracking because they're providing a free whatever - I pay for a subscription to the NYT. The business, or a meaningfully substantial core of it, is viable without tracking.

It would be nice if the things I pay for didn't start stuffing their content with bullshit. What and who do I have to pay to get single second page loads? It's not a given that advertising has to be so bloated and privacy-invasive. Various podcasts and blogs (like Daring Fireball) plug the same ad to their entire audience each post/episode for set periods of time. If you're going to cry about needing advertising then take your geographic and demographic based targeting. But no war of attrition will get me to concede you need user-by-user tracking.

You want me to pay for your content? Fine, I like it well enough. You want to present ads as well? Okay sure, the writing and perspectives are worth that too I suppose. But in addition to all of this you want to track my behavior and correlate it to my online activity that has nothing to do with your content? No, that's ridiculous.



> No they don't. They really don't need to know any of that. They don't even get a pass on tracking because they're providing a free whatever - I pay for a subscription to the NYT. The business, or a meaningfully substantial core of it, is viable without tracking.

Clearly they disagree. Or maybe you should let them know that they don't need that.

To say it without sarcasm, what you feel you are entitled as a paying customer and what they feel they need/want to understand their customers are clearly at odds. Ultimately, what you think matters nothing in isolation and what they think matters nothing in isolation. What you two agree upon, is the only thing that matters. That is to say, if you think they shouldn't track you but you use their tracking product anyway, you've compromised and agreed to new terms.

I imagine you could come up with a subscription that would adequately compensate them for a truly no tracking experience. But I doubt you two would agree on a price to pay for said UX.


You're correct of course, but I don't really see how this isn't a vacuous observation. Yes clearly our perceptions are at odds, but that has nothing to do with the reality of whether or not they need to be doing that tracking. Obviously they think they need to, or they wouldn't do it. But I think I've laid out a pretty strong argument that they actually don't need to, which leads me to believe that they actually haven't considered it seriously enough to give it a shot.

Would they be as profitable? Maybe, maybe not. Would they become unprofitable? No, strictly speaking. I'm confident in that because the NYT weathered the decline of traditional news media before the rise or hyper-targeted ads, and because I've maintained a free website in the Alexa top 100,000 on my own, with well over 500,000 unique visitors per day. That doesn't come close to the online audience of a major newspaper, but it's illustrative. There is a phenomenal amount of advertising optimization you can do using basic analytics based on page requests and basic demographic data that still respects privacy and doesn't track individual users. I outlined a few methods, such as Daring Fireball's.

Maybe instead of this being a philosophical issue of perspective between a user and an organization, it's an issue of an organization that hasn't examined how else it can exist. Does the NYT need over 10,000 employees? Is there a long tail of unpopular and generally underperforming content that nevertheless sticks around, sucking up money and forcing ever more privacy-invasive targeting? If the NYT doesn't know its audience well enough to present demographic-targeted ads on particular articles and sections, what the hell is it doing tracking users individually? It's just taking the easy way out and giving advertising partners the enhanced tracking they want. But they don't need to do that, and whether or not they think they need to do it is orthogonal to the problem itself.


> You're correct of course, but I don't really see how this isn't a vacuous observation. Yes clearly our perceptions are at odds, but that has nothing to do with the reality of whether or not they need to be doing that tracking. Obviously they think they need to, or they wouldn't do it. But I think I've laid out a pretty strong argument that they actually don't need to, which leads me to believe that they actually haven't considered it seriously enough to give it a shot.

It most definitely is. But so is the word need, in this context. How would we define what they need to do, and what they don't need to do?

My argument is simply such that, of course they don't need to (by my definition), but nothing will change that unless they see a different, more lucrative offer. Ie, "oh hey, here's 2 million readers who will only read the page in plain html and will pay an extra $20/m". It just seems like a needless argument, as I don't believe there's anything that can change their behavior without us changing ours. Without the market changing.

Rather, I think the solution lies not in them, but in you. In us. To use blockers and filters to such an extreme degree that it's made clear that UX wins here, and they need to provide the UX to retain the customers.

Thus far, we've not done enough to change their "need". If a day comes that they do need to stop tracking us, well, they'll either live or die. But the problem, and solution, lies in us. My 2c.


> What you two agree upon, is the only thing that matters.

That's precisely why many of us use (and promote the use of) adblockers and filtering extensions.


Classic narrowcasting mistake that dying companies make.

Statista claims 2.3 million digital subscribers. NYT is trying to milk that 2.3M for everything they got, squeeze the last drops of blood from the stone while they still can.

That's a great way to go out of business, when 99.97% of the world population is not your customer and your squeezing labors are not going to encourage them to sign up.

If you hyperoptimize to squeeze every drop out of a small customer base, eventually you end up with something like legacy TV networks where 99% of the population won't watch a show even for free, and the tighter the target focus on an ever shrinking legacy audience, the smaller the audience gets, until the whole house of cards collapses.

Its similar to the slice of pie argument; there are many business strategies that make a pie slice "better" at the price of shrinking it, and eventually the paper-thin slice disappears from the market because the enormous number of the employees can't eat anymore, but that certainly will be the most hyperoptimized slice of pie ever made, right before it entirely disappears.

NYT is going to have a truly amazing spy product right before it closes.


Why is that doubtful? There's all kinds of examples of tiered subscriptions in the world. I think it would be doubtful because the NYT wouldn't want to explicitly admit all the tracking they are doing.


> Why is that doubtful? There's all kinds of examples of tiered subscriptions in the world. I think it would be doubtful because the NYT wouldn't want to explicitly admit all the tracking they are doing.

Many reasons, one of which you said. What would the price tag be for them to admit all they are tracking?


Currently the price is free, and comes bundled with uMatrix, and a cookie flush. I’d like to pay the NYT for their journalism, but only with money, not the ability to track me. As a result they get no money, and no tracking.


> Currently the price is free, and comes bundled with uMatrix, and a cookie flush. I’d like to pay the NYT for their journalism, but only with money, not the ability to track me. As a result they get no money, and no tracking.

You misunderstood me. I mean, what would they like you to pay them, for them to be 100% transparent about what they're doing for tracking, what their advertisers are doing and who they are, and possibly stopping all that entirely. Ie, what is it worth to them.


Interestingly if you pay them, and thus are logged in when you view an article, then they can better track you.

In contrast if you never sign up, disable JS, and periodically clear your cookies, then the entire site works fine and none of the third party trackers work. At best they can link your browser user agent and IP to a hit on the server side.


NYT needs to produce and recommend content that people find engaging to continue earning their subscription dollars.

The idea that tracking is purely or primarily there to support a business model of selling user data is a strawman invented by self-righteous HNers. You need to know what parts of your product are effective to make it competitive in today’s marketplace.


90% of that can be accomplished with server-side stats. Do you really need to track mouse movements and follow readers with super-cookies across the web to find out what articles people find engaging on your site?

> The idea that tracking is purely or primarily there to support a business model of selling user data

Purely, no. Primarily? You can bet your sweet ass.


I agree in general but there are some things which I don’t see going away any time soon that publishers need. Online advertisers want to know that their ads are being viewed by a human and not a bot, and that they were on screen for long enough and that the user didn’t just scroll past. Publishers want to know how far down you make it in their article, so they know where to put the ads in the body of the article.


I'm not accusing anyone of selling my data and I'm not trying to champion a crusade against the entire advertising industry. I'm asserting that the NYT can achieve the substantial majority of the advertising optimization and targeting it needs to do to be profitable 1) without doing user-specific tracking and 2) without making page loads extremely slow.

Like I said, serve me an ad. I'm not an idealist, I understand why advertising exists. But don't justify collecting data about which articles I read to serve to some inscrutable network of advertisers by saying that it has to be this way. We don't need this version of advertising.


> I'm asserting that the NYT can achieve the substantial majority of the advertising optimization and targeting it needs to do to be profitable

Majority, not all. Why should they leave money on the table, exactly?


Because it's disrespectful of user privacy, performance inefficient and computationally wasteful?

Most companies are not achieving the platonic maximalization of profit or shareholder value. They leave money on the table for a variety of reasons. It's not beyond the pale that this would be one of them. If you don't agree, then frankly it's probably an axiomatic disagreement and I don't think we can reason one another to a synthesis.


There's nothing axiomatic about our disagreement here, it's not like I'm unaware of the existence of inefficient businesses. Individual companies may choose to leave money on the table, but industries and markets as a whole do not (not intentionally, anyway).

You've just described the status quo, where businesses have to sacrifice their lifeblood to achieve your ideals. Those businesses tend to be beaten by more focused competitors, which results in the industry you see today, filled with winners that don't achieve your ideals.

But good luck trying to champion an efficient web industry by essentially moralizing.


I'm not trying to champion anything, I'm speaking my mind. I don't expect the NYT to change because I'm writing an HN comment. If market forces or legislation are insufficient to force companies to respect user privacy across unrelated domains, then I'll rely on my own setup: a Pi-Hole VPN for mobile devices, and uBlock Origin for desktop devices. I happily whitelist domains with non-intrusive ads and respect for Do-Not-Track.

But more to the point, you're presenting an argument which implies the NYT is a business which will be beaten by its competitors if it doesn't track users through their unrelated web history. I don't think that kind of tracking is an existential necessity for the NYT. It's not their core competency. Their core competency is journalism - if they are beaten by a competitor it won't be because the competitor has superior tracking, for several reasons:

1. Journalism is not a winner take all environment,

2. Newspapers were surviving in online media well before this tracking was around,

3. The NYT already has sufficiently many inefficiencies that if they actually cared about user privacy, they could trim the fat elsewhere so they wouldn't have to know to within 0.001% precision whether or not a user will read an entire article just to be profitable.

I really don't think this is too idealistic. It's not like I'm saying they need to abandon advertising altogether. I don't even have a problem with the majority of advertising. It's the poor quality control and data collection that I take issue with. All I'm saying is that they don't need to do what they're doing to be profitable.


> Journalism is not a winner take all environment

So? I'm not sure how this means that news orgs won't suffer from losing business to competitors with superior tracking.

> Newspapers were surviving in online media well before this tracking was around

Markets change. Advertisers have different expectations. Readers have more news to choose from. This is a silly argument.

> The NYT already has sufficiently many inefficiencies that if they actually cared about user privacy, they could trim the fat elsewhere

Sure, but why? Why would they do that? Why wouldn't they trim the fat elsewhere AND keep the tracking to make more money?

The point you make doesn't really make sense. Yeah, it's theoretically possible for news orgs to stop tracking in the same way that it's theoretically possible for me to take out a knife right now and cut off my legs. News orgs can make up their losses elsewhere and survive in the same way that I can still get around with a wheelchair.

But why on earth would I or the NYT do that?

I respond to you with these questions because it seems to me that both you and the OP speak out against these practices because you feel they are unnecessary. My point is that they are necessary. You just don't acknowledge the forces that make them so.


> Majority, not all. Why should they leave money on the table, exactly?

It may be that they are, in fact, driving users away. Tracking user behaviour can become a distraction.


NYT used to exist only in paper form which had no tracking abilities at all. They may benefit from this but I’m skeptical that they “need” it.


The marketplace for your attention was a lot less competitive then. Editors could even feed you true and nuanced reporting, out of a sense of professional obligation, and you had no choice but to sit there and take it.


That sounds like a case of acute metrics-itis personally - looking for things to measure as a yardstick while forgetting that you get what you measure for instead of the core of the business.

While it may give some insight does it give anything of meaning to know most people skim into the first few paragraphs before clicking away? Does it improve the writing quality or fact checking? Is it worth the risk of alienating customers over? To give a deliberately extreme and absurd example Victoria's Secret could hire people to stalk their customers to find out more about product failure conditions in the field but that would be a horrifying and stupid idea.

"Everybody is doing it." is a poor rationale for implementing a business practice.


Except that the content is directly related to user behavior. If they see no one reads the style section, they'll cut it and move resources to financial news. If they didn't have tracking they'd never know that, be wasting resources and having a comparatively inferior product.

They can't do UX anaylsis, nothing.


For the first, it's sufficient enough to just look at the number of page requests.

For the second, I never got explained to me how UX analysis really works for news sites. Isn't it enough to put 2 or 3 people in a room and show them a few variations of the UI? There isn't really much to publishing text, images, and a few graphs. Graphs are a very well explored field, I don't think you can learn more about them by just watching hot maps and click through rates.


I suspect there's lot of bullshitting happening around "UX analysis", with third-party "experts" offering analyses which may, or may not, show something significant. As long as everyone in the chain can convince their customer/superior that their work is useful, the money will flow, whether or not the work is actually useful.


That's one of the fundamental problems in tech today, namely:

"It is difficult to get a man to understand something, when his salary depends on his not understanding it.”


It absolutely isn't sufficient to look at the number of page requests. How do you discern like-reads vs hate-reads? How do you determine whether someone clicked on an article, read the first line, and then bailed vs read the whole thing? There are a heap of metrics used to determine engagement which factor into the material decisions referred to in the grandparent.


Why do you need to track user behavior across unrelated domains to achieve any of that?


It's pretty simple, try a few different designs with A/B testing and you will see which one has the most revenues.

However the result will usually be a lot of dark patterns. For instance, that's why you get popup to ask you to register.


Server logs just aren't sufficient, no matter how many times hn says so. You don't get enough data to make data-driven decisions. Thats like giving a data scientist 1/3rd of the available data, and saying "thats good enough".

You'd expect ux to be a small unit, but that includes everyone who works on revenue-generating ads. Moving one tiny thing has a direct impact on revenue, which affects every person employed by nyt.


>Thats like giving a data scientist 1/3rd of the available data, and saying "thats good enough".

And it could easily be enough. Having 1/3 of a quantity of something doesn't mean you barely had enough before.


They could certainly track what you mention here (which pages are being accessed) via logging requests - without any use of additional front-end assets.


You can do all of those analytics server side, there's no reason to deliver it via JS and have the client do the computation. You're already sending all the required info to track that sort of thing via the request itself.


It's amazing to me that no one out there seems to do server-local handling of ads, either... If you put ads directly into your page instead of relying on burdensome external systems, suddenly blocking isn't a thing anymore. ALL of the functionality supposedly needed for analytics and an ad-driven business model can happen server side, without the page becoming sentient and loading a billion scripts and scattered resources, with the one exception being filtering out headless browsers. If external systems need to be communicated with, most of that can happen before or after the fact of that page load. Advertising and analytics is implemented in the most lazy, user hostile way possible on the majority of sites.


I don't think it's very surprising. Advertisers won't let publishers serve ads directly because that requires trust in publishers to not misrepresent stats like impressions and real views. I don't know how you'd solve that trust problem when publishers are actually incentivized to cheat advertisers.


I think you may have identified the biggest issue, and it's a shame the pragmatic solution is an unpleasant technical solution.


Couldn’t they eg have some trusted proxy server that routes some requests to the real-content NYTimes server and some to the ad server?


That sounds like a viable solution to the trust issue. They don't need to respond to the requests, just see copies they can be sure are real requests.


For advertisers to trust this proxy server, the NYT cannot control this proxy server to preserve its integrity. So now you're asking the NYT to base their business on an advertiser-controlled server?

What happens when the proxy goes down? What happens when there are bugs? Do you think publishers can really trust advertisers to be good stewards of the publisher's business? Think for a moment about publishers that are not as big as the NYT.

Okay, maybe they do trust an advertiser-controlled proxy server. This means that both tracking scripts and NYT scripts are served from the same domain, meaning they no longer have cross origin security tampering protection. What's stopping the NYT from serving a script that tampers with an advertiser's tracking script?


Those are issues, but not insurmountable, especially when the benefit is "obviate any adblocker".

They can use a trusted third party to run the proxy and use industry standards/SLAs for site reliability/uptime. And they can still use different subdomains with no obvious pattern (web1.nytimes.com vs web2.nytimes.com -- which is the ad server?) or audit the scripts sent through the proxy for malice.


The way it's implemented has several "benefits":

- It externalizes resource usage - the waste happens on users' machines. Who cares that it adds up to meaningful electricity consumption anyway?

- It makes it easier for marketing people and developers to independently work on the site. Developers can optimize, marketers can waste all that effort by including yet another third-party JS.

- It supports ad auctions and other services in the whole adtech economy.

- You don't have to do much analytics yourself, as the third party provides cute graphs ideal for presenting to management.


There used to be an open source self-hosted (php) ad application called openx. It worked well for quite a while. In its later years, it suffered a number of high-profile security vulnerabilities, and the open source version was poorly maintained since OpenX [the company] was focused more on their hosted solution which probably had migrated to a different codebase or at least was a major version past the open source codebase.

The open source version has been renamed "Revive Adserver", and it looks maintained, but I don't think it's used nearly as much as the openx [open source version] of old.

If you use Revive Adserver or you design a server-local ad system in-house, it won't be as sophisticated as gigantic ad-providers who can do all sorts of segmentation and analysis (producing pretty reports which execs and stakeholders love even if that knowledge adds no value to the business).


Funny that you mention that --in a former life I had to develop around and maintain an openx system.


It's because they use systems that identify the client via js to deliver the most "expensive" ad possible. It's complete garbage of course, Google/Facebook should be held liable for what they advertise, not run massive automated systems full of fraud. If Google delivers malware they shouldn't be able to throw their hands up and go "well, section 230!".


> They can't do UX anaylsis, nothing

They could, but that would require paying people and firms like Nielsen to gather data. Instead they engage in the same freeloading that the industry derides users for.


Reading without cookies or JavaScript enables seems to fix every problem the NYT has.


FWIW: ars technica turns off tracking for paying customers (and provide full articles in the rss feed if you pay for it.)


I need to like this comment more than my single upvote allows.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: