See Ya at What Gets Me Hot: Twittercontent © 2021 Twitterforaspecificgeographic, Inc JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY

A taxonomy of trends that can be detected from Twitterforaspecificgeographic

communityusingpopular

widelyaccepted methods.

2.

A characterization of the data associated with eachtrend along a number of key characteristics, includ-ing social network features, time signatures, and textualfeatures.This improved understanding of emerging informationon Twitter in particular, and in SAS in general, will allowresearchers to design and create new tools to enhance the useofSASasinformation

systemsindifferentcontextsandappli-cations, including the filtering, search, and visualization ofreal-time SAS information as it pertains to local geographiccommunities.To this end, we begin with an introduction to Twitter anda review of related efforts and background to this work. Wethen formally describe our dataset of Twitter trends and theirassociated messages. Later, we describe a qualitative studyexposing the types of trends found on Twitter. Finally, in thebulk of this article we identify and analyze emerging trendsusing the unique social, temporal, and textual is a popular SAS service, with tens of millionsof registered users as of June 2010. Twitter’s core functionallows users to post short messages, ortweets, which are up to140 characters long. Twitter supports posting (and con-sump-tion) of messages in a number of different ways, including through Web services and “third party” applications. Imp or-tantly, a large fraction of the Twitter messages are posted from mobile devices and services, such as Short Message Service (SMS) messages. A user’s messages are displayed Asa “stream” on the user’s Twitter page.In terms of social connectivity, Twitter allows a user to fol-low any number of other users.The-twitter contact network directed: u Sera can follow user B without requiring approvalor a reciprocal connection from user B. Users can set theirprivacy preferences so that their updates are available onlyto each user’s followers.

By default, the posted messages are-available to anyone. In this work, we only consider messages-posted publicly on Twitter. Users consume messages mostlyby viewing a core page showing a stream of the latest mes-sages from people they follow, listed in reverse chronologicalorder
.The conversational aspects of Twitter play a role in ouranalysis of the Twitter temporal trends. Twitter allows severalways for users to directly converse and interact by referenc-ing each other in messages using the @ symbol. Aretweetis a message from one user that is “forwarded” by a seconduser to the second user’s followers, commonly using the “RT@username” text as prefix to credit the original (or previous)poster (e.g., “RT @justinbieber Tomorrow morning watchme on the today show”). Areplyis a public message fromone user that is a response to another user’s message, andis identified by the fact that it starts with the replied-to user@username (e.g., “@mashable check out our new study onTwitter trends”). Finally, amentionis a message that includessome other username in the text of the message (e.g., “attend-ing a talk by @informor”). Twitter allows users to easily seeall recent messages in which they were retweeted, replied to,or mentioned.

JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY—May 2011

... bawdy and uncommonly inappropriate writing, thus hilarious, not solely microfiche chops of synced, pious, redacted
craprolls, his Elvis, if you will, is huffed on, rubbed between legs, and insufflated like a meme kitten or cokebread rail @nytimesarts https://t.co/lAsSjwQIMy pic.twitter.com/FW1Y2umjX2

— mrjyn (@mrjyn) 22. Mai 2021

Now imagine, as happened to Google, your algorithm fits “gorilla” to the image of a black person. That’s wrong, but it’s categorically differently wrong from simply fitting “airplane” to the same person. How do you write the loss function that incorporates some penalty for racially offensive results? Ideally, you would want them to never happen, so you could imagine trying to identify all possible insults and assigning those outcomes an infinitely large loss. Which is essentially what Google did — their “workaround” was to stop classifying “gorilla” entirely because the loss incurred by misclassifying a person as a gorilla was so large.

© 2021 Twitter, Inc

Chris Wiggins associate professor applied mathematics at Columbia University explanation.

A patient goes to see a doctor. The doctor performs a test with 99 percent reliability--that is, 99 percent of people who are sick test positive and 99 percent of the healthy people test negative. The doctor knows that only 1 percent of the people in the country are sick. Now the question is: if the patient tests positive, what are the chances the patient is sick?

More generally, Bayes's theorem is used in any calculation in which a "marginal" probability is calculated (e.g., p(+), the probability of testing positive in the example) from likelihoods (e.g., p(+|s) and p(+|h), the probability of testing positive given being sick or healthy) and prior probabilities (p(s) and p(h)): p(+)=p(+|s)p(s)+p(+|h)p(h). Such a calculation is so general that almost every application of probability or statistics must invoke Bayes's theorem at some point. In that sense Bayes's theorem is at the heart of everything from genetics to Google, from health insurance to hedge funds. It is a central relationship for thinking concretely about uncertainty, and--given quantitative data, which is sadly not always a given--for using mathematics as a tool for thinking clearly about the world.

The intuitive answer is 99 percent, but the correct answer is 50 percent, and Bayes's theorem gives us the relationship between what we know and what we want to know in this problem. What we are given--what we know--is p(+|s), which a mathematician would read as "the probability of testing positive given that you are sick"; what we want to know is p(s|+), or "the probability of being sick given that you tested positive." The theorem itself reads p(s|+)=p(+|s)p(s)/p(+), although what Reverend Bayes, who lived from 1702 to 1761, actually said was something simpler. Bayes stated the defining relationship expressing the probability you test positive AND are sick as the product of the likelihood that you test positive GIVEN that you are sick and the "prior" probability that you are sick (that is, the probability the patient is sick, prior to specifying a particular patient and administering the test).

Rather than relying on Bayes's math to help us with this, let us consider another illustration. Imagine that the above story takes place in a small country, with 10,000 people. From the prior p(s)=0.01, we know that 1 percent, or 100 people, are sick, and 9,900 are healthy. If we administer the test to everyone, the most probable result is that 99 of the 100 sick people test positive. Since the test has a 1 percent error rate, however, it is also probable that 99 of the healthy people test positive. Now if the doctor sends everyone who tests positive to the national hospital, there will be an equal number of healthy and sick patients. If you meet one, even though you are armed with the information that the patient tested positive, there is only a 50 percent chance this person is sick.

Now imagine the doctor moves to another country, performing the same test, with the same likelihood (p(+|s)) and, for that matter, the same success rate for healthy people, which we might call p(-|h), "the probability of scoring negative given that one is healthy." In this country, however, we suppose that only one in every 200 people is sick. If a new patient tests positive, it is actually more probable that the patient is healthy than sick. The doctor needs to update the prior. (The correct probability is left as a homework assignment for the reader.)

The importance of accurate data in quantitative modeling is central to the subject raised in the question: using Bayes's theorem to calculate the probability of the existence of God. Scientific discussion of religion is a popular topic at present, with three new books arguing against theism and one, University of Oxford professor Richard Dawkins's book The God Delusion, arguing specifically against the use of Bayes's theorem for assigning a probability to God's existence. (A Google news search for "Dawkins" turns up 1,890 news items at the time of this writing.) Arguments employing Bayes's theorem calculate the probability of God given our experiences in the world (the existence of evil, religious experiences, etc.) and assign numbers to the likelihood of these facts given existence or nonexistence of God, as well as to the prior belief of God's existence--the probability we would assign to the existence of God if we had no data from our experiences.
Dawkins's argument is not with the veracity of Bayes's theorem itself, whose proof is direct and unassailable, but rather with the lack of data to put into this formula by those employing it to argue for the existence of God. The equation is perfectly accurate, but the numbers inserted are, to quote Dawkins, "not measured quantities but & personal judgments, turned into numbers for the sake of the exercise."

Note that although this is receiving much attention now, quantifying one's judgments for use in Bayesian calculations of the existence of God is not new. Richard Swinburne, for example, a philosopher of science turned philosopher of religion (and Dawkins's colleague at Oxford), estimated the probability of God's existence to be more than 50 percent in 1979 and, in 2003, calculated the probability of the resurrection [presumably of both Jesus and his followers] to be "something like 97 percent."

(Swinburne assigns God a prior probability of 50 percent since there are only two choices: God exists or does not. Dawkins, on the other hand, believes "there's an infinite number of things that some people at one time or another have believed in ... God, Flying Spaghetti Monster, fairies, or whatever," which would correspondingly lower each outcome's prior probability.)

In reviewing the history of Bayes's theorem and theology, one might wonder what Reverend Bayes had to say about this, and whether Bayes introduced his theorem as part of a similar argument for the existence of God. But the good reverend said nothing on the subject, and his theorem was introduced posthumously as part of his solution to predicting the probability of an event given specific conditions.

In fact, while there is plenty of material on lotteries and hyperbolic logarithms, there is no mention of God in Bayes's "Essay towards Solving a Problem in the Doctrine of Chances," presented after his death to the Royal Society of London in 1763
(and available online at www.stat.ucla.edu/history/essay.pdf ).

One primary scientific value of Bayes's theorem today is in comparing models to data and selecting the best model given those data. For example, imagine two mathematical models, A and B, from which one can calculate the likelihood of any data given the model (p(D|A) and p(D|B)). For example, model A might be one in which spacetime is 11-dimensional, and model B one in which spacetime is 26-dimensional.

Once I have performed a quantitative measurement and obtained some data D, one needs to calculate the relative probability of the two models: p(A|D)/p(B|D).

Note that just as in relating p(+|s) to p(s|+), I can equate this relative probability to p(D|A)p(A)/p(D|B)p(B). To some, this relationship is the source of deep joy; to others, maddening frustration.

The source of this frustration is the unknown priors, p(A) and p(B). What does it mean to have prior belief about the probability of a mathematical model? Answering this question opens up a bitter internecine can of worms between "the Bayesians" and "the frequentists," a mathematical gang war which is better not entered into here. To oversimplify, "Bayesian probability" is an interpretation of probability as the degree of belief in a hypothesis; "frequentist probability is an interpretation of probability as the frequency of a particular outcome in a large number of experimental trials. In the case of our original doctor, estimating the prior can mean the difference between more-than-likely and less-than-likely prognosis. In the case of model selection, particularly when two disputants have strong prior beliefs that are diametrically opposed (belief versus nonbelief), Bayes's theorem can lead to more conflict than clarity.

More than 50 million people around the world consider themselves creators, and Gen Z and millennials have flocked towards creative jobs as robotic process automation (RPA) and other factors have constrained their desire for more traditional careers.
Over 33% of children between the ages of 8 and 12 aspire to be a vlogger or YouTuber, 3x more than those that want to be an astronaut.
There has never been greater access to global communities than there is today, from more open, general social communities and platforms like Clubhouse or Twitter to closed groups on Discord, Reddit, and Slack.
Today, over 2 million creators make six figures or more on YouTube, Twitch, and Instagram, and sponsored influencers are worth over $8 billion.
This number is expected to grow to $15 billion by 2022.
It’s getting easier to make money in the creator economy.
At one point not too long ago, the only hope to escape the 9-5 through creativity was to go on American Idol or get a book published, but the development of new platforms like Twitch, Patreon, and Cameo has enabled creative people with followings to monetize like never before.
The recent NFT boom has given creators of all formats the opportunity like never before to leverage digital talent for art.
Justin Blau, or 3lau, an electronic music producer and DJ, sold an audio-visual NFT collection including songs from his new album for $11.6 million.
This initially set a record for the highest amount paid for an NFT collection, until Beeple shattered this record about a week later with a $69M sale.
Historically, piracy has plagued the proliferation of digital art.
However, as blockchain’s maturation has enabled scarcity in digital markets, demand has surged for unique pieces and collaborations, ushering in a new wave of creativity and collaboration that is quickly changing the creative landscape.
Traditionally, creators have leveraged their communities and superfans who are willing to pay above market value, ignore friction-heavy experiences, and create demand through evangelism.
NFTs and digital collectibles give superfans the opportunity to own the art of their favorite creators like never before, and the possibilities of such monetization are limited only by the creativity of individuals.
Historically, piracy has plagued the proliferation of digital art.
However, as blockchain’s maturation has enabled scarcity in digital markets, demand has surged for unique pieces and collaborations, ushering in a new wave of creativity and collaboration that is quickly changing the creative landscape.

Digital art visualizing blockchain technology and non-fungible token (NFT).

The NFT explosion hasn’t come without some bumps in the road, however.

Some digital artists have seen their art stolen, posted online as an NFT, and sold to fans.

Others have seen their NFTs hacked and ripped from their accounts.

There is even speculation that the frothy market that has seen many sales into the millions of dollars is largely due to new sophisticated money laundering, and there’s always concern over the “ecological nightmare” of NFT production.

There is, however, an even larger issue with NFTs and the creator economy as a whole.

The problem with many creator-focused platforms or monetization tools is that they reward very few.

An artist sees that an NFT sold for $10 million, and believes that this new medium is their gateway into financial freedom. The reality is that platforms like Twitch, Nifty Gateway, etc., heavily reward their biggest creators in order to incentivize the general public to participate, creating the network effects that investors love so much.

This phenomenon has bled into the main pillar of creator monetization: influencer marketing.

Influencer engagement rates have plummeted more than 10% during the pandemic, and ad revenue for creators has declined over 33% during the same time period.

96.5% of YouTube accounts fail to generate above the poverty line ($12,140), and only 3% of accounts generate more than $16,000 despite averaging over 1.4 million views per video.

Influencer fatigue and recent scandals have certainly contributed to the decline in marketing dollars, ushering in a new era of micro-influencers and a general shift in advertising strategy for many major brands.

The way that NFTs and smart contracts are being used in the creative world has only scratched the surface of what’s to come.

While the majority of the get-rich-quick schemes will likely fizzle out, there will be plenty of opportunity to capitalize on the surges as creators find new ways to monetize.

The way that NFTs and smart contracts are being used in the creative world has only scratched the surface of what’s to come.

While the majority of the get-rich-quick schemes will likely fizzle out, there will be plenty of opportunity to capitalize on the surges as creators find new ways to monetize.

Community and creator-based businesses are interesting because the main moat is the talent on the platform and the power of the fandom.

Cameo became a unicorn off the back of C and D-list stars, while Clubhouse and Genies raised at a $100m+ valuation based on buy-in of a core group of megastars.

Sterling Campbell (’21) is an MBA graduate at Columbia Business School, where he was President of the VC Fellows program.

He has worked at Lerer Hippeau, Charge Ventures, and Endeavor.

@mrjyn

SEO

@mrjyn

May 29, 2021

Twittercontent © 2021 Twitterforaspecificgeographic, Inc JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY

A taxonomy of trends that can be detected from Twitterforaspecificgeographic

communityusingpopular

widelyaccepted methods.

2.

JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY—May 2011

— mrjyn (@mrjyn) 22. Mai 2021

© 2021 Twitter, Inc

Chris Wiggins associate professor applied mathematics at Columbia University explanation.

Once I have performed a quantitative measurement and obtained some data D, one needs to calculate the relative probability of the two models: p(A|D)/p(B|D).

Note that just as in relating p(+|s) to p(s|+), I can equate this relative probability to p(D|A)p(A)/p(D|B)p(B). To some, this relationship is the source of deep joy; to others, maddening frustration.

More than 50 million people around the world consider themselves creators, and Gen Z and millennials have flocked towards creative jobs as robotic process automation (RPA) and other factors have constrained their desire for more traditional careers.

Over 33% of children between the ages of 8 and 12 aspire to be a vlogger or YouTuber, 3x more than those that want to be an astronaut.

There has never been greater access to global communities than there is today, from more open, general social communities and platforms like Clubhouse or Twitter to closed groups on Discord, Reddit, and Slack.

Today, over 2 million creators make six figures or more on YouTube, Twitch, and Instagram, and sponsored influencers are worth over $8 billion.

This number is expected to grow to $15 billion by 2022.

It’s getting easier to make money in the creator economy.

At one point not too long ago, the only hope to escape the 9-5 through creativity was to go on American Idol or get a book published, but the development of new platforms like Twitch, Patreon, and Cameo has enabled creative people with followings to monetize like never before.

The recent NFT boom has given creators of all formats the opportunity like never before to leverage digital talent for art.

Justin Blau, or 3lau, an electronic music producer and DJ, sold an audio-visual NFT collection including songs from his new album for $11.6 million.

This initially set a record for the highest amount paid for an NFT collection, until Beeple shattered this record about a week later with a $69M sale.

Historically, piracy has plagued the proliferation of digital art.

However, as blockchain’s maturation has enabled scarcity in digital markets, demand has surged for unique pieces and collaborations, ushering in a new wave of creativity and collaboration that is quickly changing the creative landscape.

Traditionally, creators have leveraged their communities and superfans who are willing to pay above market value, ignore friction-heavy experiences, and create demand through evangelism.

NFTs and digital collectibles give superfans the opportunity to own the art of their favorite creators like never before, and the possibilities of such monetization are limited only by the creativity of individuals.

Historically, piracy has plagued the proliferation of digital art.

However, as blockchain’s maturation has enabled scarcity in digital markets, demand has surged for unique pieces and collaborations, ushering in a new wave of creativity and collaboration that is quickly changing the creative landscape.

Digital art visualizing blockchain technology and non-fungible token (NFT).

The NFT explosion hasn’t come without some bumps in the road, however.

Some digital artists have seen their art stolen, posted online as an NFT, and sold to fans.

Others have seen their NFTs hacked and ripped from their accounts.

There is even speculation that the frothy market that has seen many sales into the millions of dollars is largely due to new sophisticated money laundering, and there’s always concern over the “ecological nightmare” of NFT production.

There is, however, an even larger issue with NFTs and the creator economy as a whole.

The problem with many creator-focused platforms or monetization tools is that they reward very few.

This phenomenon has bled into the main pillar of creator monetization: influencer marketing.

Influencer engagement rates have plummeted more than 10% during the pandemic, and ad revenue for creators has declined over 33% during the same time period.

96.5% of YouTube accounts fail to generate above the poverty line ($12,140), and only 3% of accounts generate more than $16,000 despite averaging over 1.4 million views per video.

Influencer fatigue and recent scandals have certainly contributed to the decline in marketing dollars, ushering in a new era of micro-influencers and a general shift in advertising strategy for many major brands.

The way that NFTs and smart contracts are being used in the creative world has only scratched the surface of what’s to come.

While the majority of the get-rich-quick schemes will likely fizzle out, there will be plenty of opportunity to capitalize on the surges as creators find new ways to monetize.

The way that NFTs and smart contracts are being used in the creative world has only scratched the surface of what’s to come.

While the majority of the get-rich-quick schemes will likely fizzle out, there will be plenty of opportunity to capitalize on the surges as creators find new ways to monetize.

Community and creator-based businesses are interesting because the main moat is the talent on the platform and the power of the fandom.

Cameo became a unicorn off the back of C and D-list stars, while Clubhouse and Genies raised at a $100m+ valuation based on buy-in of a core group of megastars.

Sterling Campbell (’21) is an MBA graduate at Columbia Business School, where he was President of the VC Fellows program.

He has worked at Lerer Hippeau, Charge Ventures, and Endeavor.